Row	SumFunc
1	
2	
3	
4	
5	This gene encodes a protein that may promote clustering and fusion of late endosomes and lysosomes. The protein may also act as an adaptor protein that modulates the transforming growth factor-beta response by coupling the transforming growth factor-beta receptor complex to the Smad pathway.
6	
7	
8	
9	
10	
11	
12	The heat-shock response is elicited by exposure of cells to thermal and chemical stress and through the activation of HSFs (heat shock factors) results in the elevated expression of heat-shock induced genes.  Heat shock factor binding protein 1 (HSBP1), is a 76-amino-acid protein that binds to heat shock factor 1(HSF1), which is a transcription factor involved in the HS response.  During HS response, HSF1 undergoes conformational transition from an inert non-DNA-binding monomer to active functional trimers. HSBP1 is nuclear-localized and interacts  with the active trimeric state of HSF1 to negatively regulate HSF1 DNA-binding activity. Overexpression of HSBP1 in mammalian cells represses the transactivation activity of HSF1. When overexpressed in C.elegans HSBP1 has severe effects on survival of the animals after thermal and chemical stress consistent with a role of HSBP1 as a negative regulator of heat shock response.
13	
14	This gene is a member of the cadherin superfamily, genes encoding calcium-dependent, membrane-associated glycoproteins. Mapped to a previously identified cluster of cadherin genes on chromosome 16q22.1, the gene localizes with superfamily members CDH1, CDH3, CDH5, CDH8 and CDH11. The protein consists of an extracellular domain containing 6 cadherin domains, a transmembrane region and a truncated cytoplasmic domain but lacks the prosequence and tripeptide HAV adhesion recognition sequence typical of most classical cadherins. Expression is exclusively in kidney, where the protein functions as the principal mediator of homotypic cellular recognition, playing a role in the morphogenic direction of tissue development.
15	The function of this gene has not yet been determined; however, it may be involved in cell growth regulation and differentiation as it contains epidermal growth factor (EGF) -like repeats.
16	
17	N-methyl-D-aspartate (NMDA) receptors are a class of ionotropic glutamate receptors.  NMDA channel has been shown to be involved in long-term potentiation, an activity-dependent increase in the efficiency of synaptic transmission thought to underlie certain kinds of memory and learning.  NMDA receptor channels are heteromers composed of the key receptor subunit NMDAR1 (GRIN1) and 1 or more of the 4 NMDAR2 subunits: NMDAR2A (GRIN2A), NMDAR2B (GRIN2B), NMDAR2C (GRIN2C), and NMDAR2D (GRIN2D).
18	
19	
20	
21	
22	The origin recognition complex (ORC) is a highly conserved six subunit protein complex essential for the initiation of the DNA replication in eukaryotic cells. Studies in yeast demonstrated that ORC binds specifically to origins of replication and serves as a platform for the assembly of additional initiation factors such as Cdc6 and Mcm proteins. The protein encoded by this gene is a subunit of the ORC complex. It has been shown that this protein and and ORC1L are loosely associated with the core complex consisting of ORC2L, -3L, -4L and -5L. Gene silencing studies with small interfering RNA demonstrated that this protein plays an essential  role in coordinating chromosome replication and segregation with cytokinesis.
23	
24	
25	This gene encodes the protein elongin B, which is a subunit of the transcription factor B (SIII) complex. The SIII complex is composed of elongins A/A2, B and C. It activates elongation by RNA polymerase II by suppressing transient pausing of the polymerase at many sites within transcription units. Elongin A functions as the transcriptionally active component of the SIII complex, whereas elongins B and C are regulatory subunits. Elongin A2 is specifically expressed in the testis, and capable of forming a stable complex with elongins B and C. The von Hippel-Lindau tumor suppressor protein binds to elongins B and C, and thereby inhibits transcription elongation. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
26	
27	
28	
29	
30	Vitamin K is essential for blood clotting but must be enzymatically activated. This enzymatically activated form of vitamin K is a reduced form required for the carboxylation of glutamic acid residues in some blood-clotting proteins. The product of this gene encodes the enzyme that is responsible for reducing vitamin K 2,3-epoxide to the enzymatically activated form. Fatal bleeding can be caused by vitamin K deficiency and by the vitamin K antagonist warfarin, and it is the product of this gene that is sensitive to warfarin. In humans, mutations in this gene can be associated with deficiencies in vitamin-K-dependent clotting factors and, in humans and rats, with warfarin resistance. Two pseudogenes have been identified on chromosome 1 and the X chromosome. Two alternatively spliced transcripts encoding different isoforms have been described.
31	
32	Heme oxygenase, an essential enzyme in heme catabolism, cleaves heme to form biliverdin, which is subsequently converted to bilirubin by biliverdin reductase, and carbon monoxide, a putative neurotransmitter. Heme oxygenase activity is induced by its substrate heme and by various nonheme substances. Heme oxygenase occurs as 2 isozymes, an inducible heme oxygenase-1 and a constitutive heme oxygenase-2. HMOX1 and HMOX2 belong to the heme oxygenase family.
33	
34	
35	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit. Pseudogenes have been identified on chromosomes 2 and 12.
36	
37	Cofactor D is one of four proteins (cofactors A, D, E, and C) involved in the pathway leading to correctly folded beta-tubulin from folding intermediates. Cofactors A and D are believed to play a role in capturing and stabilizing beta-tubulin intermediates in a quasi-native confirmation. Cofactor E binds to the cofactor D/beta-tubulin complex; interaction with cofactor C then causes the release of beta-tubulin polypeptides that are committed to the native state.
38	
39	
40	
41	
42	HELZ is a member of the superfamily I class of RNA helicases. RNA helicases alter the conformation of RNA by unwinding double-stranded regions, thereby altering the biologic activity of the RNA molecule and regulating access to other proteins (Wagner et al., 1999).[supplied by OMIM]
43	
44	This gene encodes an integral membrane protein that is a major component of myelin in the peripheral nervous system. Various mutations of this gene are causes of Charcot-Marie-Tooth disease Type IA, Dejerine-Sottas syndrome, and hereditary neuropathy with liability to pressure palsies. Alternative splicing of this gene results in three transcript variants that encode the same protein.
45	The protein encoded by this gene belongs to the src family kinases. It is a cytoplasmic protein which is preferentially expressed in T-lymphocytes where it interacts with the protein-tyrosine kinase p59fyn. The presence of a PH domain and a SH3 domain suggests that this encoded protein is capable of interacting with several other intracellular proteins.
46	
47	
48	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein and is similar in sequence to its family member CD53 antigen. It is known to complex with integrins and other transmembrane 4 superfamily proteins.
49	The protein encoded by this gene is a member of the STAT family of transcription factors. In response to cytokines and growth factors, STAT family members are phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. This protein mediates the signal transduction triggered by various cell ligands, such as IL2, IL4, CSF1, and different growth hormones. It has been shown to be involved in diverse biological processes, such as TCR signaling, apoptosis, adult mammary gland development, and sexual dimorphism of liver gene expression. This gene was found to fuse to retinoic acid receptor-alpha (RARA) gene in a small subset of acute promyelocytic leukemias (APLL). The dysregulation of the signaling pathways mediated by this protein may be the cause of the APLL.
50	
51	
52	
53	
54	This zinc finger gene has been identified at the breakpoints of a recurrent chromosomal translocation reported in endometrial stromal sarcoma. Recombination of these breakpoints results in the fusion of this gene and JAZF1. The protein encoded by this gene contains a zinc finger domain in the C terminus of the coding region. The specific function of this gene has not yet been determined.
55	
56	
57	Leber congenital amaurosis (LCA) accounts for at least 5% of all inherited retinal disease and is the most severe inherited retinopathy with the earliest age of onset. Individuals affected with LCA are diagnosed at birth or in the first few months of life with severely impaired vision or blindness, nystagmus and an abnormal or flat electroretinogram. The photoreceptor/pineal -expressed gene, AIPL1, encoding aryl-hydrocarbon interacting protein-like 1, was mapped within the LCA4 candidate region. The protein contains three tetratricopeptide motifs, consistent with nuclear transport or chaperone activity. AIPL1 mutations may cause approximately 20% of recessive LCA.
58	Leber congenital amaurosis (LCA) accounts for at least 5% of all inherited retinal disease and is the most severe inherited retinopathy with the earliest age of onset. Individuals affected with LCA are diagnosed at birth or in the first few months of life with severely impaired vision or blindness, nystagmus and an abnormal or flat electroretinogram. The photoreceptor/pineal -expressed gene, AIPL1, encoding aryl-hydrocarbon interacting protein-like 1, was mapped within the LCA4 candidate region. The protein contains three tetratricopeptide motifs, consistent with nuclear transport or chaperone activity. AIPL1 mutations may cause approximately 20% of recessive LCA.
59	
60	
61	
62	
63	
64	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
65	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the high sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
66	The protein encoded by this gene is a member of the keratin gene family. As a type I hair keratin, it is an acidic protein which heterodimerizes with type II keratins to form hair and nails. The type I hair keratins are clustered in a region of chromosome 17q12-q21 and have the same direction of transcription.
67	
68	
69	
70	This gene encodes a protein which interacts with the carboxy-terminus of transcription initiation factor TFIIF, a transcription factor which regulates elongation as well as initiation by RNA polymerase II. The protein may also represent a component of an RNA polymerase II holoenzyme complex. Alternative splicing of this gene results in two transcript variants.
71	
72	This gene encodes a member of the bombesin-like family of gastrin-releasing peptides. Its preproprotein, following cleavage of a signal peptide, is further processed to produce either the 27 aa gastrin-releasing peptide or the 10 aa neuromedin C. These smaller peptides regulate numerous functions of the gastrointestinal and central nervous systems, including release of gastrointestinal hormones, smooth muscle cell contraction, and epithelial cell proliferation. These peptides are also likely to play a role in human cancers of the lung, colon, stomach, pancreas, breast, and prostate. Alternative splicing results in multiple transcript variants encoding different isoforms.
73	The protein encoded by this gene contains a RING finger domain, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions.
74	Aminoacyl-tRNA synthetases are a class of enzymes that charge tRNAs with their cognate amino acids.  Asparaginyl-tRNA synthetase is localized to the cytoplasm and belongs to the class II family of tRNA synthetases.  The N-terminal domain represents the signature sequence for the eukaryotic asparaginyl-tRNA synthetases.
75	
76	
77	Secreted frizzled-related protein 1 (SFRP1) is a member of the SFRP family that contains a cysteine-rich domain homologous to the putative Wnt-binding site of Frizzled proteins. SFRPs act as soluble modulators of Wnt signaling. SFRP1 and SFRP5 may be involved in determining the polarity of photoreceptor cells in the retina. SFRP1 is expressed in several human tissues, with the highest levels in heart.
78	
79	
80	
81	
82	
83	Secreted frizzled-related protein 1 (SFRP1) is a member of the SFRP family that contains a cysteine-rich domain homologous to the putative Wnt-binding site of Frizzled proteins. SFRPs act as soluble modulators of Wnt signaling. SFRP1 and SFRP5 may be involved in determining the polarity of photoreceptor cells in the retina. SFRP1 is expressed in several human tissues, with the highest levels in heart.
84	
85	
86	
87	
88	
89	
90	Autophagy is the process by which endogenous proteins and damaged organelles are destroyed intracellularly. Autophagy is postulated to be essential for cell homeostasis and cell remodeling during differentiation, metamorphosis, non-apoptotic cell death, and aging. Reduced levels of autophagy have been described in some malignant tumors, and a role for autophagy in controlling the unregulated cell growth linked to cancer has been proposed. This gene encodes a member of the autophagin protein family. The encoded protein is also designated as a member of the C-54 family of cysteine proteases.
91	
92	
93	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum. In rodents, the homologous protein has been shown to metabolize certain carcinogens; however, the specific function of the human protein has not been determined.
94	
95	
96	
97	Complement component C3 plays a central role in the activation of complement system.  Its activation is required for both classical and alternative complement activation pathways.  People with C3 deficiency are susceptable to bacteria infection.
98	
99	
100	
101	
102	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. Alternate splicing of this gene results in two transcript variants encoding two different isoforms which are differentially expressed.
103	The Shaker gene family of Drosophila encodes components of voltage-gated potassium channels and is comprised of four subfamilies. Based on sequence similarity, this gene is similar to one of these subfamilies, namely the Shaw subfamily. The protein encoded by this gene belongs to the delayed rectifier class of channel proteins and is an integral membrane protein that mediates the voltage-dependent potassium ion permeability of excitable membranes.
104	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S17P family of ribosomal proteins. It is located in the cytoplasm. The gene product of the E. coli ortholog (ribosomal protein S17) is thought to be involved in the recognition of termination codons. This gene is co-transcribed with a small nucleolar RNA gene, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
105	
106	
107	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities and are involved in a variety of biological processes including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. The function of this growth factor has not yet been determined.
108	
109	
110	
111	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and a single intracytoplasmic catalytic domain, and thus represents a receptor-type PTP. The extracellular region contains eight fibronectin type III-like repeats and multiple N-glycosylation sites. The gene was shown to be expressed primarily in brain and liver, and at a lower level in heart and stomach. It was also found to be expressed in several cancer cell lines, but not in the corresponding normal tissues.
112	Alpha actinins belong to the spectrin gene superfamily which represents a diverse group of cytoskeletal proteins, including the alpha and beta spectrins and dystrophins.  Alpha actinin is an actin-binding protein with multiple roles in different cell types.  In nonmuscle cells, the cytoskeletal isoform is found along microfilament bundles and adherens-type junctions, where it is involved in binding actin to the membrane.  In contrast, skeletal, cardiac, and smooth muscle isoforms are localized to the Z-disc and analogous dense bodies, where they help anchor the myofibrillar actin filaments.  This gene encodes a nonmuscle, alpha actinin isoform which is concentrated in the cytoplasm, and thought to be involved in metastatic processes.  Mutations in this gene have been associated with focal and segmental glomerulosclerosis.
113	This gene encodes a neuron-restricted protein that contains a CRAL-TRIO motif common to proteins that bind small lipophilic molecules. Mutations in this gene are associated with cerebellar ataxia, Cayman type.
114	Lysophospholipases are enzymes that act on biological membranes to regulate the multifunctional lysophospholipids. The protein encoded by this gene is a lysophospholipase expressed in eosinophils and basophils. It hydrolyzes lysophosphatidylcholine to glycerophosphocholine and a free fatty acid. This protein may possess carbohydrate or IgE-binding activities. It is both structurally and functionally related to the galectin family of beta-galactoside binding proteins. It may be associated with inflammation and some myeloid leukemias.
115	This gene encodes a member of the hepatoma-derived growth factor (HDGF) family. Two alternatively spliced transcript variants encoding slightly different isoforms have been found for this gene.
116	Mannose 6-phophate receptors (MPRs) deliver lysosomal hydrolase from the Golgi to endosomes and then return to the Golgi complex. The protein encoded by this gene interacts with the cytoplasmic domains of both cation-independent and cation-dependent MPRs, and is required for endosome-to-Golgi transport. This protein also binds directly to the GTPase RAB9 (RAB9A), a member of the RAS oncogene family. The interaction with RAB9 has been shown to increase the affinity of this protein for its cargo.
117	This gene encodes subunit 2 of the splicing factor 3a protein complex. The splicing factor 3a heterotrimer includes subunits 1, 2 and 3 and is necessary for the in vitro conversion of 15S U2 snRNP into an active 17S particle that performs pre-mRNA splicing. Subunit 2 interacts with subunit 1 through its amino-terminus while the single zinc finger domain of subunit 2 plays a role in its binding to the 15S U2 snRNP. Subunit 2 may also function independently of its RNA splicing function as a microtubule-binding protein.
118	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein.
119	The protein encoded by this gene is a galanin-like G protein-coupled receptor that binds metastin, a peptide encoded by the metastasis suppressor gene KISS1. The tissue distribution of the expressed gene suggests that it is involved in the regulation of endocrine function, and this is supported by the finding that this gene appears to play a role in the onset of puberty.
120	The sodium-iodide symporter (NIS), a member of the sodium/solute symporter family, actively transports iodide across the basolateral plasma membrane of the thyroid follicular cells.[supplied by OMIM]
121	
122	Hepsin is a cell surface serine protease.[supplied by OMIM]
123	
124	The product of this gene belongs to the acyl-CoA oxidases family. It encodes the branched-chain acyl-CoA oxidase which is involved in the degradation of long branched fatty acids and bile acid intermediates in peroxisomes. Deficiency of this enzyme results in the accumulation of branched fatty acids and bile acid intermediates, and may lead to Zellweger syndrome, severe mental retardation and death in children.
125	This gene encodes a member of family I of the G protein-coupled receptors, as well as the EDG family of proteins. This protein functions as a lysophosphatidic acid (LPA) receptor and contributes to Ca2+ mobilization, a critical cellular response to LPA in cells, through association with Gi and Gq proteins.
126	This gene encodes a protein that belongs to the fibrillin gene family. Fibrillins are extracellular matrix molecules that assemble into microfibrils in many connective tissues. This gene is most highly expressed in fetal tissues and its protein product is localized to extracellular microfibrils of developing skeletal elements, skin, lung, kidney, and skeletal muscle. This gene is potentially involved in Weill-Marchesani syndrome. While several transcript variants may exist for this gene, their full-length natures have not been described to date.
127	
128	
129	
130	The protein encoded by this gene belongs to the ARF family of GTP-binding proteins. With its distinctive nuclear/nucleolar localization and interaction with HP1alpha, the protein is developmentally regulated and may play a role(s) in nuclear dynamics and/or signaling cascades during embryonic development. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
131	Malate dehydrogenase catalyzes the reversible oxidation of malate to oxaloacetate, utilizing the NAD/NADH cofactor system in the citric acid cycle. The protein encoded by this gene is localized to the cytoplasm and may play pivotal roles in the malate-aspartate shuttle that operates in the metabolic coordination between cytosol and mitochondria.
132	This gene is a member of the TIS11 family of early response genes. Family members are induced by various agonists such as the phorbol ester TPA and the polypeptide mitogen EGF. The encoded protein contains a distinguishing putative zinc finger domain with a repeating cys-his motif. This putative nuclear transcription factor most likely functions in regulating the response to growth factors.
133	
134	The protein encoded by this gene is a member of the class A scavenger receptor family and is part of the innate antimicrobial immune system. The protein may bind both Gram-negative and Gram-positive bacteria via an extracellular, C-terminal, scavenger receptor cysteine-rich (SRCR) domain. In addition to short cytoplasmic and transmembrane domains, there is an extracellular spacer domain and a long, extracellular collagenous domain. The protein may form a trimeric molecule by the association of the collagenous domains of three identical polypeptide chains.
135	This gene encodes a CXC chemokine receptor specific for stromal cell-derived factor-1. The protein has 7 transmembrane regions and is located on the cell surface. It acts with the CD4 protein to support HIV entry into cells and is also highly expressed in breast cancer cells. Mutations in this gene have been associated with WHIM (warts, hypogammaglobulinemia, infections, and myelokathexis) syndrome. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
136	
137	
138	This gene encodes a member of von Willebrand factor A domain containing protein family. This family of proteins are thought to be involved in the formation of filamentous networks in the extracellular matrices of various tissues. This protein contains two von Willebrand factor A domains; it is present in the cartilage extracellular matrix and has a role in the development and homeostasis of cartilage and bone. Mutations in this gene result in multiple epiphyseal dysplasia.
139	Hyaluronan or hyaluronic acid (HA) is a high molecular weight unbranched polysaccharide synthesized by a wide variety of organisms from bacteria to mammals, and is a constituent of the extracellular matrix.  It consists of alternating glucuronic acid and N-acetylglucosamine residues that are linked by beta-1-3 and beta-1-4 glycosidic bonds.  HA is synthesized by membrane-bound synthase at the inner surface of the plasma membrane, and the chains are extruded through pore-like structures into the extracellular space.  It serves a variety of functions, including space filling, lubrication of joints, and provision of a matrix through which cells can migrate.  HA is actively produced during wound healing and tissue repair to provide a framework for ingrowth of blood vessels and fibroblasts.  Changes in the serum concentration of HA are associated with inflammatory and degenerative arthropathies such as rheumatoid arthritis.  In addition, the interaction of HA with the leukocyte receptor CD44 is important in tissue-specific homing by leukocytes, and overexpression of HA receptors has been correlated with tumor metastasis.  HAS2 is a member of the newly identified vertebrate gene family encoding putative hyaluronan synthases, and its amino acid sequence shows significant homology to glycosaminoglycan synthetase (DG42) from Xenopus laevis, and human and murine hyaluronan synthase 1.
140	
141	
142	This gene belongs to the DAZ gene family required for germ cell development. It encodes an RNA-binding protein which is more similar to Drosophila Boule than to human proteins encoded by genes DAZ (deleted in azoospermia) or DAZL (deleted in azoospermia-like). Loss of this gene function results in the absence of sperm in semen (azoospermia). Histological studies demonstrated that the primary defect is at the meiotic G2/M transition. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
143	
144	
145	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It is strongly expressed in the cell lines of promyelocytic leukemia and Burkitt@#$%&s lymphoma. In addition, it and another family member, the WNT6 gene, are strongly coexpressed in colorectal cancer cell lines. The gene overexpression may play key roles in carcinogenesis through activation of the WNT-beta-catenin-TCF signaling pathway. This gene and the WNT6 gene are clustered in the chromosome 2q35 region.
146	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It is strongly expressed in the cell lines of promyelocytic leukemia and Burkitt@#$%&s lymphoma. In addition, it and another family member, the WNT6 gene, are strongly coexpressed in colorectal cancer cell lines. The gene overexpression may play key roles in carcinogenesis through activation of the WNT-beta-catenin-TCF signaling pathway. This gene and the WNT6 gene are clustered in the chromosome 2q35 region.
147	This gene encodes one of seven subunits of the human Arp2/3 protein complex. The Arp2/3 protein complex has been implicated in the control of actin polymerization in cells and has been conserved through evolution. The exact role of the protein encoded by this gene, the p34 subunit, has yet to be determined. Two alternatively spliced variants have been characterized to date. Additional alternatively spliced variants have been described but their full length nature has not been determined.
148	
149	
150	
151	The protein encoded by this gene is a member of the serine/threonine protein kinase family. This kinase has been shown to specifically activate MAPK8/JNK. The activation of MAPK8 by this kinase is found to be inhibited by the dominant-negative mutants of MAP3K7/TAK1, MAP2K4/MKK4, and MAP2K7/MKK7, which suggests that this kinase may function through the MAP3K7-MAP2K4-MAP2K7 kinase cascade, and mediate the TNF-alpha signaling pathway. Alternatively spliced transcript variants encoding different isoforms have been identified.
152	
153	
154	
155	
156	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein.
157	LHCGR encodes the receptor for both luteinizing hormone and choriogonadotropin. Mutations in this gonadotropin receptor result in disorders of male secondary sexual character development, such as familial male precocious puperty and male pseudohermaphtoditism.
158	
159	This gene encodes a member of the nucleoporin family. The protein is localized to the nuclear rim and is part of the nuclear pore complex (NPC). All molecules entering or leaving the nucleus either diffuse through or are actively transported by the NPC. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
160	
161	Cell surface carbohydrates modulate a variety of cellular functions and are typically synthesized in a stepwise manner. HNK1ST plays a role in the biosynthesis of HNK1 (CD57; MIM 151290), a neuronally expressed carbohydrate that contains a sulfoglucuronyl residue.[supplied by OMIM]
162	
163	
164	This gene encodes a molecular chaperone that is a member of the chaperonin containing TCP1 complex (CCT), also known as the TCP1 ring complex (TRiC). This complex consists of two identical stacked rings, each containing eight different proteins. Unfolded polypeptides enter the central cavity of the complex and are folded in an ATP-dependent manner. The complex folds various proteins, including actin and tubulin. Alternate transcriptional splice variants encoding different isoforms have been found for this gene, but only two of them have been characterized to date.
165	
166	
167	
168	The protein encoded by this gene is highly similar to the protein product encoded by gene INSIG1. Both INSIG1 protein and this protein are endoplasmic reticulum proteins that block the processing of sterol regulatory element binding proteins (SREBPs) by binding to SREBP cleavage-activating protein (SCAP), and thus prevent SCAP from escorting SREBPs to the Golgi.
169	
170	MSH2 was identified as a locus frequently mutated in hereditary nonpolyposis colon cancer (HNPCC). When cloned, it was discovered to be a human homolog of the E. coli mismatch repair gene mutS, consistent with the characteristic alterations in microsatellite sequences (RER+ phenotype) found in HNPCC.
171	CTLA4 is a member of the immunoglobulin superfamily and is a costimulatory molecule expressed by activated T cells. CTLA4 is similar to the T-cell costimulatory CD28 (MIM 186760), and both molecules bind to B7-1 (CD80; MIM 112203) and B7-2 (CD86; MIM 601020) on antigen-presenting cells. CTLA4 transmits an inhibitory signal to T cells, whereas CD28 transmits a stimulatory signal (Magistrelli et al., 1999).[supplied by OMIM]
172	
173	
174	
175	Spectrin is a tetrameric cytoskeletal protein essential for determination of cell shape, resilience of membranes to mechanical stress, positioning of transmembrane proteins, and organization of organelles and molecular traffic. Alpha- and beta-spectrin subunits form antiparallel dimers that self associate to give the spectrin tetramer. Beta subunits, such as SPTBN1, contain most of the spectrin binding activity (Hayes et al., 2000).[supplied by OMIM]
176	
177	The protein encoded by this gene is a nuclear transcription factor involved in the activation of the solute carrier family 2 member 4 gene. The encoded protein interacts with another transcription factor, myocyte enhancer factor 2, to activate transcription of this gene.
178	The protein encoded by this gene is a nuclear transcription factor involved in the activation of the solute carrier family 2 member 4 gene. The encoded protein interacts with another transcription factor, myocyte enhancer factor 2, to activate transcription of this gene.
179	
180	This gene encodes a member of the WNT1 inducible signaling pathway (WISP) protein subfamily, which belongs to the connective tissue growth factor (CTGF) family. WNT1 is a member of a family of cysteine-rich, glycosylated signaling proteins that mediate diverse developmental processes. The CTGF family members are characterized by four conserved cysteine-rich domains: insulin-like growth factor-binding domain, von Willebrand factor type C module, thrombospondin domain and C-terminal cystine knot-like (CT) domain. The encoded protein lacks the CT domain which is implicated in dimerization and heparin binding. It is 72% identical to the mouse protein at the amino acid level. This gene may be downstream in the WNT1 signaling pathway that is relevant to malignant transformation. Its expression in colon tumors is reduced while the other two WISP members are overexpressed in colon tumors. It is expressed at high levels in bone tissue, and may play an important role in modulating bone turnover.
181	This gene encodes a member of von Willebrand factor A domain containing protein family. The proteins of this family are thought to be involved in the formation of filamentous networks in the extracellular matrices of various tissues. The specific function of this gene product has not yet been determined. Three alternatively spliced variants have been described.
182	This gene encodes a member of the WAP-type four-disulfide core (WFDC) domain family. The WFDC domain, or WAP signature motif, contains eight cysteines forming four disulfide bonds at the core of the protein, and functions as a protease inhibitor. Most WFDC gene members are localized to chromosome 20q12-q13 in two clusters: centromeric and telomeric. This gene belongs to the telomeric cluster.
183	
184	Defensins are cysteine-rich cationic polypeptides that are important in the immunologic response to invading microorganisms. The protein encoded by this gene is secreted and is a member of the beta defensin protein family. Beta defensin genes are found in several clusters throughout the genome, with this gene mapping to a cluster at 20p13.
185	Syntaxin-1, synaptobrevin/VAMP, and SNAP25 interact to form the SNARE complex, which is required for synaptic vesicle docking and fusion. The protein encoded by this gene is membrane-associated and inhibits SNARE complex formation by binding free syntaxin-1. Expression of this gene appears to be brain-specific.
186	
187	The protein encoded by this gene is similar to an uncharacterized protein from C. elegans. Although the function of the encoded protein is unknown, it shares weak sequence similarity to a region of S. cerevisiae translation initiation factor subunit Gcd10p.
188	
189	
190	
191	
192	This gene codes for a centrosomal protein shown to be required for centriole-centriole cohesion during the cell cycle. The protein interacts with NIMA-related kinase 2 (NEK2), which is also associated with the centrosome.
193	Iron-sulfur clusters are required for the function of many cellular enzymes. The protein encoded by this gene supplies inorganic sulfur to these clusters by removing the sulfur from cysteine, creating alanine in the process. This gene uses alternate in-frame translation initiation sites to generate mitochondrial forms and cytoplasmic/nuclear forms. Selection of the alternative initiation sites is determined by the cytosolic pH. The encoded protein belongs to the class-V family of pyridoxal phosphate-dependent aminotransferases. Two transcript variants encoding two different isoforms have been found for this gene.
194	This gene encodes an enzyme that converts the pancreatic proenzyme trypsinogen to trypsin, which activates other proenzymes including chymotrypsinogen and procarboxypeptidases. The precursor protein is cleaved into two chains that form a heterodimer linked by a disulfide bond. This protein is a member of the trypsin family of peptidases. Mutations in this gene cause enterokinase deficiency, a malabsorption disorder characterized by diarrhea and failure to thrive.
195	
196	
197	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S6P family. Pseudogenes corresponding to this gene are found on chromosomes 1p and 12q.
198	Microfibrillar-associated protein 2 is a major antigen of elastin-associated microfibrils and a candidate for involvement in the etiology of inherited connective tissue diseases. This gene encodes two transcripts with two alternatively spliced 5@#$%& untranslated exons. These two transcripts contain the same 8 coding exons, and therefore, encode the same protein.
199	
200	
201	SUMO proteins, such as SUMO3, and ubiquitin (see MIM 191339) posttranslationally modify numerous cellular proteins and affect their metabolism and function. However, unlike ubiquitination, which targets proteins for degradation, sumoylation participates in a number of cellular processes, such as nuclear transport, transcriptional regulation, apoptosis, and protein stability (Su and Li, 2002).[supplied by OMIM]
202	
203	This gene is located within the minimal DGS critical region (MDGCR) thought to contain the gene(s) responsible for a group of developmental disorders. These disorders include DiGeorge syndrome, velocardiofacial syndrome, conotruncal anomaly face syndrome, and some familial or sporadic conotruncal cardiac defects which have been associated with microdeletion of 22q11.2. The function of the encoded protein has not been determined.
204	
205	
206	
207	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Beta-crystallins, the most heterogeneous, differ by the presence of the C-terminal extension (present in the basic group, none in the acidic group). Beta-crystallins form aggregates of different sizes and are able to self-associate to form dimers or to form heterodimers with other beta-crystallins. This gene, a beta acidic group member, is part of a gene cluster with beta-B1, beta-B2, and beta-B3.
208	The protein encoded by this gene catalyzes the formation of CTP from UTP with the concomitant deamination of glutamine to glutamate. This protein is the rate-limiting enzyme in the synthesis of cytosine nucleotides, which play an important role in various metabolic processes and provide the precursors necessary for the synthesis of RNA and DNA. Cancer cells that exhibit increased cell proliferation also exhibit an increased activity of this encoded protein. Thus, this protein is an attractive target for selective chemotherapy. Two alternatively spliced transcript variants encoding the same protein have been described for this gene.
209	
210	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 99% identical to the mouse, rat and bovine orthologs. This gene contains a 7 bp repeat sequence in its 5@#$%& UTR, and changes in the number of this repeat has been associated with early-onset schizophrenia.
211	The specific function of this gene has not yet been determined, yet it may involve the translocation of growth factor receptor complexes to the lysosome for degradation.  This gene is localized to 22q13.1, with HMOX1 and MCM5 distally and HMG2L1 proximally positioned.
212	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein, which shares sequence similarity with the yeast ribosomal protein YL41, belongs to the L41E family of ribosomal proteins. It is located in the cytoplasm. The protein can interact with the beta subunit of protein kinase CKII and can stimulate the phosphorylation of DNA topoisomerase II-alpha by CKII. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
213	The protein encoded by this gene is a basic leucine zipper (bZIP) transcription factor that lacks a transactivation domain. It is known to bind the US-2 DNA element in the promoter of the oxytocin receptor (OTR) gene and most likely heterodimerizes with other leucine zipper-containing proteins to enhance expression of the OTR gene during term pregnancy. The encoded protein can also form homodimers, and since it lacks a transactivation domain, the homodimer may act as a repressor of transcription. This gene may also be involved in the cellular stress response. Two transcript variants encoding the same protein have been found for this gene.
214	This gene is upregulated by interferon-gamma and encodes a protein that is a member of the AGP11/GTPBP1 family of GTP-binding proteins. A structurally similar protein has been found in mouse, where disruption of the gene for that protein had no observable phenotype.
215	
216	The protein encoded by this gene belongs to the aconitase/IPM isomerase family. It is an enzyme that catalyzes the interconversion of citrate to isocitrate via cis-aconitate in the second step of the TCA cycle. This protein is encoded in the nucleus and functions in the mitochondrion. It was found to be one of the mitochondrial matrix proteins that are preferentially degraded by the serine protease 15(PRSS15), also known as Lon protease, after oxidative modification.
217	
218	
219	
220	FAAH encodes the integral membrane enzyme fatty acid amide hydrolase. FAAH is responsible for the hydrolysis of a number of primary and secondary fatty acid amides, including the neuromodulatory compounds anandamide and oleamide.
221	
222	This gene encodes a transmembrane protein that is located in the endoplasmic reticulum and is involved in GPI-anchor biosynthesis. The glycosylphosphatidylinositol (GPI)-anchor is a glycolipid which contains three mannose molecules in its core backbone. The GPI-anchor is found on many blood cells and serves to anchor proteins to the cell surface. This gene encodes a mannosyltransferase, GPI-MT-I, that transfers the first mannose to GPI on the lumenal side of the endoplasmic reticulum.
223	
224	
225	
226	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein differs from the other S100 proteins of known structure in its lack of calcium binding ability in one EF-hand at the N-terminus. This protein is markedly over-expressed in the skin lesions of psoriatic patients, but is excluded as a candidate gene for familial psoriasis susceptibility. The exact function of this protein is not known.
227	
228	
229	
230	
231	Alzheimer@#$%&s disease (AD) patients with an inherited form of the disease carry mutations in the presenilin proteins (PSEN1; PSEN2) or the amyloid precursor protein (APP).  These disease-linked mutations result in increased production of the    longer form of amyloid-beta (main component of amyloid deposits found in AD brains).  Presenilins are postulated to regulate APP processing through their effects on gamma-secretase, an enzyme that cleaves APP.  Also, it is thought that the presenilins are involved in the cleavage of the Notch receptor, such that that they either directly regulate gamma-secretase activity or themselves are protease enzymes.  Two alternative transcripts of PSEN2 have been identified.
232	
233	
234	Regulator of G protein signaling (RGS) proteins are regulatory and structural components of G protein-coupled receptor complexes. RGS proteins are GTPase-activating proteins for Gi (see GNAI1; MIM 139310) and Gq (see GNAQ; MIM 600998) class G-alpha proteins. They accelerate transit through the cycle of GTP binding and hydrolysis and thereby accelerate signaling kinetics and termination.[supplied by OMIM]
235	
236	
237	
238	
239	
240	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and is thought to be the predominant enzyme responsible for epoxidation of endogenous arachidonic acid in cardiac tissue.
241	
242	The human MDM4 gene, which plays a role in apoptosis, encodes a 490-amino acid protein containing a RING finger domain and a putative nuclear localization signal. The MDM4 putative nuclear localization signal, which all Mdm proteins contain, is located in the C-terminal region of the protein. The mRNA is expressed at a high level in thymus and at lower levels in all other tissues tested. MDM4 protein produced by in vitro translation interacts with p53 via a binding domain located in the N-terminal region of the MDM4 protein. MDM4 shows significant structural similarity to p53-binding protein MDM2
243	
244	SELL is a cell surface component that is a member of a family of adhesion/homing receptors which play important roles in leukocyte-endothelial cell interactions. The molecule is composed of multiple domains: one homologous to lectins, one to epidermal growth factor, and two to the consensus repeat units found in C3/C4 binding proteins.
245	
246	This gene encodes disintegrin and metalloprotease (ADAM) domain 28, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. The member encoded by this gene is a lymphocyte-expressed ADAM protein. Alternative splicing of this gene generates two transcript variants. The shorter version encodes a secreted isoform, while the longer version encodes a transmembrane isoform.
247	This gene belongs to the neuron navigator family and is expressed predominantly in the nervous system. The encoded protein contains coiled-coil domains and a conserved AAA domain characteristic for ATPases associated with a variety of cellular activities. This gene is similar to unc-53, a Caenorhabditis elegans gene involved in axon guidance. The exact function of this gene is not known.
248	
249	
250	The annexins are a family of calcium-dependent phospholipid-binding proteins. Annexin 31 cDNA encodes a 338-amino acid protein with less than 40% identity to other annexins.  Members of the annexin family contain 4 internal repeat domains, each of which includes a type II calcium-binding site. The calcium-binding sites are required for annexins to aggregate and cooperatively bind anionic phospholipids and extracellular matrix proteins.  The most striking feature of annexin 31 is  a complete ablation of all four homologous type II calcium-binding sites in the conserved tetrad core. Although all 4 type II calcium-binding sites in annexin 31 contained amino acid substitutions that ablated their function, structural analysis suggests that the conserved putative ion channel formed by the tetrad core is intact.
251	
252	
253	
254	Nestin is an intermediate filament protein that was first identified with a monoclonal antibody by Hockfield and McKay (1985). It is expressed predominantly in stem cells of the central nervous system in the neural tube. Upon terminal neural differentiation, nestin is downregulated and replaced by neurofilaments.[supplied by OMIM]
255	
256	
257	
258	
259	
260	This gene encodes a large, carbohydrate-rich, epithelial glycoprotein with numerous O-glycosylation sites located within threonine, serine, and proline-rich tandem repeats. The gene is similar to members of the mucin and the glycosyl hydrolase 18 gene families. Regulation of expression may be estrogen-dependent. Gene expression and protein secretion occur during late follicular development through early cleavage-stage embryonic development. The protein is secreted from non-ciliated oviductal epithelial cells and associates with ovulated oocytes, blastomeres, and spermatozoan acrosomal regions.
261	The IgE receptor plays a central role in allergic disease, coupling allergen and mast cell to initiate the inflammatory and immediate hypersensitivity responses that are characteristic of disorders such as hay fever and asthma. The allergic response occurs when 2 or more high-affinity IgE receptors are crosslinked via IgE molecules that in turn are bound to an allergen (antigen) molecule. A perturbation occurs that brings about the release of histamine and proteases from the granules in the cytoplasm of the mast cell and leads to the synthesis of prostaglandins and leukotrienes--potent effectors of the hypersensitivity response. The IgE receptor consists of 3 subunits: alpha, beta (MIM 147138), and gamma (MIM 147139); only the alpha subunit is glycosylated.[supplied by OMIM]
262	
263	The protein encoded by this gene belongs to the Ser/Thr family of protein kinases and HIPK subfamily. It phosphorylates homeodomain transcription factors and may also function as a co-repressor for homeodomain transcription factors. Alternative splicing results in four transcript variants encoding four distinct isoforms.
264	
265	
266	
267	
268	
269	
270	
271	
272	
273	
274	This gene encodes an alpha chain for one of the low abundance fibrillar collagens. Fibrillar collagen molecules are trimers that can be composed of one or more types of alpha chains. Type V collagen is found in tissues containing type I collagen and appears to regulate the assembly of heterotypic fibers composed of both type I and type V collagen. This gene product is closely related to type XI collagen and it is possible that the collagen chains of types V and XI constitute a single collagen type with tissue-specific chain combinations. Mutations in this gene are associated with Ehlers-Danlos syndrome, types I and II. Two transcripts that differ in the length of the 3@#$%&UTR due to the use of alternative polyadenylation signals have been identified for this gene.
275	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. This gene encodes an enzyme that may be mainly involved in the synthesis of the first N-acetyllactosamine unit of poly-N-acetyllactosamine chains.
276	
277	
278	
279	Aldo-keto reductases, such as AKR7A3, are involved in the detoxification of aldehydes and ketones.[supplied by OMIM]
280	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This gene is part of a cluster of cytochrome P450 genes on chromosome 1p33.
281	
282	Glutamate-cysteine ligase, also known as gamma-glutamylcysteine synthetase, is the first rate limiting enzyme of glutathione synthesis. The enzyme consists of two subunits, a heavy catalytic subunit and a light regulatory subunit. Gamma glutamylcysteine synthetase deficiency has been implicated in some forms of hemolytic anemia.
283	The product of this gene belongs to the phosphatase 2 regulatory subunit B family. Protein phosphatase 2 is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes an alpha isoform of the regulatory subunit B72/B130 subfamily. Alternatively spliced transcript variants encoding different isoforms have been identified.
284	tRNA splicing is a fundamental process required for cell growth and division. SEN15 is a subunit of the tRNA splicing endonuclease, which catalyzes the removal of introns, the first step in tRNA splicing (Paushkin et al., 2004).[supplied by OMIM]
285	tRNA splicing is a fundamental process required for cell growth and division. SEN15 is a subunit of the tRNA splicing endonuclease, which catalyzes the removal of introns, the first step in tRNA splicing (Paushkin et al., 2004).[supplied by OMIM]
286	BPNT1, also called bisphosphate 3-prime-nucleotidase, or BPntase, is a member of a magnesium-dependent phosphomonoesterase family. Lithium, a major drug used to treat manic depression, acts as an uncompetitive inhibitor of BPntase. The predicted human protein is 92% identical to mouse BPntase. BPntase@#$%&s physiologic role in nucleotide metabolism may be regulated by inositol signaling pathways. The inhibition of human BPntase may account for lithium-induced nephrotoxicity.
287	
288	
289	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in motility, invasion, and tubulin polymerization. Chromosomal rearrangements and altered expression of this gene have been implicated in tumor metastasis.  Alternative splicing of the 5@#$%& UTR results in two gene products.
290	
291	
292	
293	
294	
295	
296	
297	
298	
299	
300	The protein encoded by this gene is the human ortholog of yeast mitochondrial AAA metalloprotease, Yme1p. It is localized in the mitochondria and can functionally complement a yme1 disruptant yeast strain. It is proposed that this gene plays a role in mitochondrial protein metabolism and could be involved in mitochondrial pathologies. Several transcript variants encoding different isoforms have been found for this gene.
301	Mammalian apolipoprotein B mRNA undergoes site-specific C to U deamination, which is mediated by a multi-component enzyme complex containing a minimal core composed of APOBEC-1 and a complementation factor encoded by this gene. The gene product has three non-identical RNA recognition motifs and belongs to the hnRNP R family of RNA-binding proteins. It has been proposed that this complementation factor functions as an RNA-binding subunit and docks APOBEC-1 to deaminate the upstream cytidine. Studies suggest that the protein may also be involved in other RNA editing or RNA processing events. Alternative splicing occurs at this locus and three full-length transcript variants, encoding three distinct isoforms, have been described. Additional splicing has been observed but the full-length nature of these variants has not been determined.
302	
303	
304	This gene is a member of the F-box/WD-40 gene family, which recruit specific target proteins through their WD-40 protein-protein binding domains for ubiquitin mediated degradation. In mouse, a highly similar protein is thought to be responsible for maintaining the apical ectodermal ridge of developing limb buds; disruption of the mouse gene results in the absence of central digits, underdeveloped or absent metacarpal/metatarsal bones and syndactyly. This phenotype is remarkably similar to split hand-split foot malformation in humans, a clinically heterogeneous condition with a variety of modes of transmission. An autosomal recessive form has been mapped to the chromosomal region where this gene is located, and complex rearrangements involving duplications of this gene and others have been associated with the condition. A pseudogene of this locus has been mapped to one of the introns of the BCR gene on chromosome 22.
305	
306	
307	
308	Phosphatidylinositol-4,5-bisphosphate, the precursor to second messengers of the phosphoinositide signal transduction pathways, is thought to be involved in the regulation of secretion, cell proliferation, differentiation, and motility. The protein encoded by this gene is one of a family of enzymes capable of catalyzing the phosphorylation of phosphatidylinositol-4-phosphate on the fifth hydroxyl of the myo-inositol ring to form phosphatidylinositol-4,5-bisphosphate. The amino acid sequence of this enzyme does not show homology to other kinases, but the recombinant protein does exhibit kinase activity. This gene is a member of the phosphatidylinositol-4-phosphate 5-kinase family.
309	
310	The protein encoded by this gene functions in the second step of the mitochondrial fatty acid beta-oxidation pathway. It catalyzes the hydration of 2-trans-enoyl-coenzyme A (CoA) intermediates to L-3-hydroxyacyl-CoAs. The gene product is a member of the hydratase/isomerase superfamily. It localizes to the mitochondrial matrix. Transcript variants utilizing alternative transcription initiation sites have been described in the literature.
311	
312	
313	
314	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Most members contain an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain, although some members contain only the sterol-binding domain. Transcript variants derived from alternative promoter usage and/or alternative splicing exist; they encode different isoforms.
315	MYPN is a component of the sarcomere that tethers nebulin (MIM 161650) in skeletal muscle and nebulette (MIM 605491) in cardiac muscle to alpha-actinin (see ACTN2; MIM 102573) at the Z lines (Bang et al., 2001).[supplied by OMIM]
316	
317	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this DEAD box protein family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box enzyme that may be involved in ribosomal RNA synthesis or processing. This gene and DDX21, also called RH-II/GuA, have similar genomic structures and are in tandem orientation on chromosome 10, suggesting that the two genes arose by gene duplication in evolution. This gene has pseudogenes on chromosomes 2, 3 and 4. Alternative splicing of this gene generates multiple transcript variants, but the full length nature of all the other variants but one has not been defined.
318	This gene encodes a member of the lipoxygenase gene family and plays a dual role in the synthesis of leukotrienes from arachidonic acid. The encoded protein, which is expressed specifically in bone marrow-derived cells, catalyzes the conversion of arachidonic acid to 5(S)-hydroperoxy-6-trans-8,11,14-cis-eicosatetraenoic acid, and further to the allylic epoxide 5(S)-trans-7,9-trans-11,14-cis-eicosatetrenoic acid (leukotriene A4). Leukotrienes are important mediators of a number of inflammatory and allergic conditions. Mutations in the promoter region of this gene lead to a diminished response to antileukotriene drugs used in the treatment of asthma and may also be associated with atherosclerosis and several cancers. Alternatively spliced transcript variants have been observed, but their full-length nature has not been determined.
319	Members of the protein-tyrosine phosphatase superfamily cooperate with protein kinases to regulate cell proliferation and  differentiation. This superfamily is separated into two families based on the substrate that is dephosphorylated. One family, the dual specificity phosphatases (DSPs) acts on both phosphotyrosine and phosphoserine/threonine residues. This gene encodes different but related DSP proteins through the use of non-overlapping open reading frames, alternate splicing, and presumed different transcription promoters. Expression of the distinct proteins from this gene has been found to be tissue specific and the proteins may be involved in postnatal development of specific tissues. A protein encoded by the upstream ORF was found in skeletal muscle, whereas the encoded protein from the downstream ORF was found only in testis. In mouse, a similar pattern of expression was found. Multiple alternatively spliced transcript variants were described, but the full-length sequence of only some were determined.
320	Sulfation is a common modification of endogenous (lipids, proteins, and carbohydrates) and exogenous (xenobiotics and drugs) compounds. In mammals, the sulfate source is 3@#$%&-phosphoadenosine 5@#$%&-phosphosulfate (PAPS), created from ATP and inorganic sulfate. Two different tissue isoforms encoded by different genes synthesize PAPS. This gene encodes one of the two PAPS synthetases. Defects in this gene cause the Pakistani type of spondyloepimetaphyseal dysplasia. Two alternatively spliced transcript variants that encode different isoforms have been described for this gene.
321	
322	
323	The mitochondrial oxidation of long-chain fatty acids is initiated by the sequential action of carnitine palmitoyltransferase I (which is located in the outer membrane and is detergent-labile) and carnitine palmitoyltransferase II (which is located in the inner membrane and is detergent-stable), together with a carnitine-acylcarnitine translocase. CPT I is the key enzyme in the carnitine-dependent transport across the mitochondrial inner membrane and its deficiency results in a decreased rate of fatty acid beta-oxidation.
324	
325	
326	
327	
328	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the ATPase subunits, a member of the triple-A family of ATPases which have a chaperone-like activity. This subunit may compete with PSMC2 for binding to the HIV tat protein to regulate the interaction between the viral protein and the transcription complex. A pseudogene has been identified on chromosome 9.
329	
330	
331	
332	
333	
334	This gene encodes subunit 1 of the splicing factor 3a protein complex. The splicing factor 3a heterotrimer includes subunits 1, 2 and 3 and is necessary for the in vitro conversion of 15S U2 snRNP into an active 17S particle that performs pre-mRNA splicing. Subunit 1 belongs to the SURP protein family; named for the SURP (also called SWAP or Suppressor-of-White-APricot) motifs that are thought to mediate RNA binding. Subunit 1 has tandemly repeated SURP motifs in its amino-terminal half while its carboxy-terminal half contains a proline-rich region and a ubiquitin-like domain. Binding studies with truncated subunit 1 derivatives demonstrated that the two SURP motifs are necessary for binding to subunit 3 while contacts with subunit 2 may occur through sequences carboxy-terminal to the SURP motifs. Alternative splicing results in multiple transcript variants encoding different isoforms.
335	NFKB1 (MIM 164011) or NFKB2 (MIM 164012) is bound to REL (MIM 164910), RELA, or RELB (MIM 604758) to form the NFKB complex. The p50 (NFKB1)/p65 (RELA) heterodimer is the most abundant form of NFKB. The NFKB complex is inhibited by I-kappa-B proteins (NFKBIA, MIM 164008 or NFKBIB, MIM 604495), which inactivate NFKB by trapping it in the cytoplasm. Phosphorylation of serine residues on the I-kappa-B proteins by kinases (IKBKA, MIM 600664, or IKBKB, MIM 603258) marks them for destruction via the ubiquitination pathway, thereby allowing activation of the NFKB complex. Activated NFKB complex translocates into the nucleus and binds DNA at kappa-B-binding motifs such as 5-prime GGGRNNYYCC 3-prime or 5-prime HGGARNYYCC 3-prime (where H is A, C, or T; R is an A or G purine; and Y is a C or T pyrimidine).[supplied by OMIM]
336	
337	
338	
339	NEPH2 is a member of the nephrin-like protein family, which includes NEPH1 (KIRREL; MIM 607428) and NEPH3 (MIM 607762). The cytoplasmic domains of these proteins interact with the C terminus of podocin (NPHS2; MIM 604766), and the genes are expressed in kidney podocytes, cells involved in ensuring size- and charge-selective ultrafiltration (Sellin et al., 2003).[supplied by OMIM]
340	This gene encodes the human ortholog of chicken YAP protein which binds to the SH3 domain of the Yes proto-oncogene product. This protein contains a WW domain that is found in various structural, regulatory and signaling molecules in yeast, nematode, and mammals, and may be involved in protein-protein interaction.
341	The protein encoded by this gene is a proinflammatory cytokine. This cytokine can induce the IFN-gamma production of T cells. The combination of this cytokine and IL12 has been shown to inhibit IL4 dependent IgE and IgG1 production, and enhance IgG2a production of B cells. IL-18 binding protein (IL18BP) can specifically interact with this cytokine, and thus negatively regulate its biological activity.
342	This gene is a member of the Krueppel C2H2-type zinc-finger protein family and encodes a zinc finger transcription factor that contains nine Kruppel-type zinc finger domains at the carboxyl terminus. This protein is located in the nucleus, is involved in cell cycle progression, and interacts with a histone deacetylase. Specific instances of aberrant gene rearrangement at this locus have been associated with acute promyelocytic leukemia (APL). Alternate transcriptional splice variants have been characterized.
343	
344	The muscarinic cholinergic receptors belong to a larger family of G protein-coupled receptors. The functional diversity of these receptors is defined by the binding of acetylcholine and includes cellular responses such as adenylate cyclase inhibition, phosphoinositide degeneration, and potassium channel mediation. Muscarinic receptors influence many effects of acetylcholine in the central and peripheral nervous system. The clinical implications of this receptor are unknown; however, mouse studies link its function to adenylyl cyclase inhibition.
345	The tectorial membrane is an extracellular matrix of the inner ear that contacts the stereocilia bundles of specialized sensory hair cells. Sound induces movement of these hair cells relative to the tectorial membrane, deflects the stereocilia, and leads to fluctuations in hair-cell membrane potential, transducing sound into electrical signals. Alpha-tectorin is one of the major noncollagenous components of the tectorial membrane.  Mutations in the TECTA gene have been shown to be responsible for autosomal dominant nonsyndromic hearing impairment and a recessive form of sensorineural pre-lingual non-syndromic deafness.
346	
347	
348	Translation initiation is mediated by specific recognition of the cap structure by eukaryotic translation initiation factor 4F (eIF4F), which is a cap binding protein complex that consists of three subunits: eIF4A, eIF4E and eIF4G. The protein encoded by this gene shares similarity with the C-terminal region of eIF4G, that contains the binding sites for eIF4A and eIF3; eIF4G in addition, contains a binding site for eIF4E at the N-terminus. Unlike eIF4G which supports cap-dependent and independent translation, this gene product functions as a general repressor of translation by forming translationally inactive complexes. In vitro and in vivo studies indicate that translation of this mRNA initiates exclusively at a non-AUG (GUG) codon.
349	
350	
351	This gene encodes a member of the membrane-spanning 4A gene family. Members of this nascent protein family are characterized by common structural features and similar intron/exon splice boundaries and display unique expression patterns among hematopoietic cells and nonlymphoid tissues. Though this member is not expressed in hematopoietic cells specifically, it may be involved in signal transduction like many of its related family members. The gene encoding this protein is localized to 11q12, among a cluster of family members.
352	
353	This gene is a member of a photoreceptor-specific gene family and encodes an integral membrane protein found in the photoreceptor disk rim of the eye. This protein can form homodimers or can heterodimerize with another photoreceptor, retinal degeneration slow (RDS). It is essential for disk morphogenesis, and may also function as an adhesion molecule involved in the stabilization and compaction of outer segment disks or in the maintenance of the curvature of the rim. Certain defects in this gene have been associated with the degenerative eye disease retinitis pigmentosa.
354	
355	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. The gene is biallelically expressed, despite its location within a region of imprinted genes on chromosome 11.
356	Prostaglandin-D synthase is a sigma class glutathione-S-transferase family member. The enzyme catalyzes the conversion of PGH2 to PGD2 and plays a role in the production of prostanoids in the immune system and mast cells. The presence of this enzyme can be used to identify the differentiation stage of human megakaryocytes.
357	Transduction of extracellular matrix signals through integrins influences intracellular and extracellular functions, and appears to require interaction of integrin cytoplasmic domains with cellular proteins. Integrin-linked kinase (ILK), interacts with the cytoplasmic domain of beta-1 integrin. This gene encodes a serine/threonine protein kinase with 4 ankyrin-like repeats, which associates with the cytoplasmic domain of beta integrins and acts as a proximal receptor kinase regulating integrin-mediated signal transduction. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene.
358	This gene is a member of the basic helix-loop-helix (BHLH) family of transcription factors. It activates transcription by binding to the E box (5@#$%&-CANNTG-3@#$%&). Dimerization with other BHLH proteins is required for efficient DNA binding. Involved in the determination of the neuronal precursors in the peripheral nervous system and the central nervous system.
359	
360	
361	
362	The protein encoded by this gene contains ARF-GAP, RHO-GAP, ankyrin repeat, RAS-associating, and pleckstrin homology domains. In vitro, this protein displays RHO-GAP and phosphatidylinositol (3,4,5) trisphosphate (PIP3)-dependent ARF-GAP activity. The encoded protein associates with the Golgi, and the ARF-GAP activity mediates changes in the Golgi and the formation of filopodia. The RHO-GAP activity may mediate cell rounding and loss of stress fibers. At least three transcript variants encoding different isoforms have been found for this gene, but the full-length nature of one variant has not been determined.
363	Calpains are calcium-dependent cysteine proteases involved in signal transduction in a variety of cellular processes. A functional calpain protein consists of an invariant small subunit and 1 of a family of large subunits. CAPN5 is one of the large subunits. Unlike some of the calpains, CAPN5 and CAPN6 lack a calmodulin-like domain IV. Because of the significant similarity to Caenorhabditis elegans sex determination gene tra-3, CAPN5 is also called as HTRA3. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
364	The protein encoded by this gene is similar to the gene product of S14, a rat gene whose expression is limited to liver and adipose tissue and is controlled by nutritional and hormonal factors. This gene has been shown to be expressed in liver and adipocytes, particularly in lipomatous modules. It is also found to be expressed in lipogenic breast cancers, which suggests a role in controlling tumor lipid metabolism.
365	Malic enzyme catalyzes the oxidative decarboxylation of malate to pyruvate using either NAD+ or NADP+ as a cofactor. Mammalian tissues contain 3 distinct isoforms of malic enzyme: a cytosolic NADP(+)-dependent isoform, a mitochondrial NADP(+)-dependent isoform, and a mitochondrial NAD(+)-dependent isoform. This gene encodes a mitochondrial NADP(+)-dependent isoform. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of some variants has not been determined.
366	This gene encodes a protein which is a member of the Kruppel family of zinc finger proteins.  The function of this gene has not been determined; however, it may play a role in normal development gene transcription.  Mouse mutation studies indicate possible involvement in human foregut malformation.
367	This gene encodes an ETS family transcription factor. The product of this gene contains two functional domains: a N-terminal pointed (PNT) domain that is involved in the protein-protein interactions with itself and other proteins, and a C-terminal DNA-binding domain. Gene knockout studies in mice suggest that it is required for hematopoiesis and maintenance of the developing vascular network. This gene is known to be involved in a large number of chromosomal rearrangements associated with leukemia and congenital fibrosarcoma.
368	
369	
370	This gene encodes a molecular chaperone that is member of the chaperonin containing TCP1 complex (CCT), also known as the TCP1 ring complex (TRiC). This complex consists of two identical stacked rings, each containing eight different proteins. Unfolded polypeptides enter the central cavity of the complex and are folded in an ATP-dependent manner. The complex folds various proteins, including actin and tubulin. Alternate transcriptional splice variants of the gene described in this record have been observed but have not been thoroughly characterized.
371	The protein encoded by this gene is a member of serine/threonine protein kinase family. This kinase contains a leucine-zipper domain, and is predominately expressed in neuronal cells. The phosphorylation state of this kinase in synaptic terminals was shown to be regulated by membrane depolarization via calcineurin. This kinase forms heterodimers with leucine zipper containing transcription factors, such as cAMP responsive element binding protein (CREB) and MYC, and thus may play a regulatory role in PKA or retinoic acid induced neuronal differentiation.
372	
373	
374	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains two tandem Src homology-2 domains, which function as phospho-tyrosine binding domains and mediate the interaction of this PTP with its substrates. This PTP is widely expressed in most tissues and plays a regulatory role in various cell signaling events that are important for a diversity of cell functions, such as mitogenic activation, metabolic control, transcription regulation, and cell migration. Mutations in this gene are a cause of Noonan syndrome as well as acute myeloid leukemia.
375	
376	K6IRS3 belongs to a family of type II keratins that are specifically expressed in the inner root sheath of hair follicles.[supplied by OMIM]
377	
378	
379	
380	
381	This gene encodes a nuclear protein, which colocalizes with mRNA splicing factors and intermediate filament-containing perinuclear networks. This protein has 95% amino acid sequence identity to the mouse Wbp11 protein. It contains two proline-rich regions that bind to the WW domain of Npw38, a nuclear protein, and thus this protein is also called Npw38-binding protein NpwBP. The Npw38-NpwBP complex may function as a component of an mRNA factory in the nucleus.
382	
383	
384	Synaptobrevins/VAMPs, syntaxins, and the 25-kD synaptosomal-associated protein SNAP25 are the main components of a protein complex involved in the docking and/or fusion of synaptic vesicles with the presynaptic membrane. VAMP1 is a member of the vesicle-associated membrane protein (VAMP)/synaptobrevin family. Multiple alternative splice variants that encode proteins with alternative carboxy ends have been described, but the full-length nature of some variants has not been defined.
385	The glycoprotein encoded by this gene functions as both an antihemophilic factor carrier and a platelet-vessel wall mediator in the blood coagulation system.  It is crucial to the hemostasis process.  Mutations in this gene or deficiencies in this protein result in von Willebrand@#$%&s disease. An unprocessed pseudogene has been found on chromosome 22.
386	
387	
388	AATK is a novel gene which is induced during apoptosis.  It encodes a protein with a tyrosine kinase domain at the N-terminal end and a proline-rich domain at the C-terminal end.  AATK expression may be a necessary pre-requisite for the induction of growth arrest and/or apoptosis of myeloid precursor cells
389	MVK encodes the peroxisomal enzyme mevalonate kinase. Mevalonate is a key intermediate, and mevalonate kinase a key early enzyme, in isoprenoid and sterol synthesis. Mevalonate kinase deficiency caused by mutation of MVK results in mevalonic aciduria.
390	
391	
392	
393	
394	
395	
396	
397	
398	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This gene displayed oncogenic transforming activity when transfected into mammalian cells. The mouse homolog of this gene exhibits a restricted expression profile predominantly in the myogenic lineage, which suggested a role in muscle regeneration or differentiation.
399	
400	
401	
402	
403	
404	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This protein was isolated as a secreted factor that exhibits a growth-stimulating effect on cultured glial cells. In nervous system, this protein is produced mainly by neurons and may be important for glial cell development. Expression of the mouse homolog of this gene was found to be dependent on Sonic hedgehog (Shh) signaling. Mice lacking the homolog gene displayed a male-to-female sex reversal phenotype, which suggested a role in testicular embryogenesis.
405	
406	
407	
408	
409	
410	
411	
412	Type III iodothyronine deiodinase, a selenoprotein, is an important enzyme for thyroid hormone inactivation. It catalyzes the inner ring deiodination of the prohormone thyroxine (T4) and of the bioactive hormone 3,3@#$%&,5-triiodothyronine (T3) to 3,3@#$%&-T2 to inactive metabolites. Selenocysteine, at aa position 144, is essential for efficient enzyme activity. The expression of this enzyme in placenta and several fetal tissues during early development, suggests that it plays a role in the regulation of thyroid hormone inactivation during embryologic development.
413	
414	Alpha actinins belong to the spectrin gene superfamily which represents a diverse group of cytoskeletal proteins, including the alpha and beta spectrins and dystrophins.  Alpha actinin is an actin-binding protein with multiple roles in different cell types.  In nonmuscle cells, the cytoskeletal isoform is found along microfilament bundles and adherens-type junctions, where it is involved in binding actin to the membrane.  In contrast, skeletal, cardiac, and smooth muscle isoforms are localized to the Z-disc and analogous dense bodies, where they help anchor the myofibrillar actin filaments.  This gene encodes a nonmuscle, cytoskeletal, alpha actinin isoform and maps to the same site as the structurally similar erythroid beta spectrin gene.
415	
416	
417	
418	NFKB1 (MIM 164011) or NFKB2 (MIM 164012) is bound to REL (MIM 164910), RELA (MIM 164014), or RELB (MIM 604758) to form the NFKB complex. The NFKB complex is inhibited by I-kappa-B proteins (NFKBIA or NFKBIB, MIM 604495), which inactivate NF-kappa-B by trapping it in the cytoplasm. Phosphorylation of serine residues on the I-kappa-B proteins by kinases (IKBKA, MIM 600664, or IKBKB, MIM 603258) marks them for destruction via the ubiquitination pathway, thereby allowing activation of the NF-kappa-B complex. Activated NFKB complex translocates into the nucleus and binds DNA at kappa-B-binding motifs such as 5-prime GGGRNNYYCC 3-prime or 5-prime HGGARNYYCC 3-prime (where H is A, C, or T; R is an A or G purine; and Y is a C or T pyrimidine).[supplied by OMIM]
419	Members of the protein kinase C (PKC) family function in many extracellular receptor-mediated signal transduction pathways. See PRKCA (MIM 176960) for further background information. The PRKCM gene encodes a cytosolic serine-threonine kinase that binds to the trans-Golgi network and regulates the fission of transport carriers specifically destined to the cell surface.[supplied by OMIM]
420	The Notch signaling pathway is an intercellular signaling mechanism that is essential for proper embryonic development. Members of the Notch gene family encode transmembrane receptors that are critical for various cell fate decisions. The protein encoded by this gene is one of several ligands that activate Notch and related receptors. Two transcript variants encoding different isoforms have been found for this gene.
421	
422	
423	DAD1, the defender against apoptotic cell death, was initially identified as a negative regulator of programmed cell death in the temperature sensitive tsBN7 cell line.  The DAD1 protein disappeared in temperature-sensitive cells following a shift to the nonpermissive temperature, suggesting that loss of the DAD1 protein triggered apoptosis.  DAD1 is believed to be a tightly associated subunit of oligosaccharyltransferase both in the intact membrane and in the purified enzyme, thus reflecting the essential nature of N-linked glycosylation in eukaryotes.
424	
425	
426	
427	
428	Various cellular organelles and vesicles are transported along the microtubules in the cytoplasm. Likewise, membrane recycling of the endoplasmic reticulum (ER), Golgi assembly at the microtubule organizing center, and alignment of lysosomes along microtubules are all related processes. The transport of organelles requires a special class of microtubule-associated proteins (MAPs). One of these is the molecular motor kinesin (see MIM 148760 and MIM 600025), an ATPase that moves vesicles unidirectionally toward the plus end of the microtubule. Another such MAP is kinectin, a large integral ER membrane protein. Antibodies directed against kinectin have been shown to inhibit its binding to kinesin.[supplied by OMIM]
429	Pyruvate dehydrogenase (PDH) is a mitochondrial multienzyme complex that catalyzes the oxidative decarboxylation of pyruvate and is one of the major enzymes responsible for the regulation of homeostasis of carbohydrate fuels in mammals. The enzymatic activity is regulated by a phosphorylation/dephosphorylation cycle. Phosphorylation of PDH by a specific pyruvate dehydrogenase kinase (PDK) results in inactivation.
430	
431	
432	
433	Alzheimer@#$%&s disease (AD) patients with an inherited form of the disease carry mutations in the presenilin proteins (PSEN1; PSEN2) or the amyloid precursor protein (APP).  These disease-linked mutations result in increased production of the longer form of amyloid-beta (main component of amyloid deposits found in AD brains).  Presenilins are postulated to regulate APP processing through their effects on gamma-secretase, an enzyme that cleaves APP.  Also, it is thought that the presenilins are involved in the cleavage of the Notch receptor, such that they either directly regulate gamma-secretase activity or themselves are protease enzymes.  Three alternative splice variants of PSEN1 have been identified.
434	The Fos gene family consists of 4 members: FOS, FOSB, FOSL1, and FOSL2.  These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1.  As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation.  In some cases, expression of the FOS gene has also been associated with apoptotic cell death.
435	This gene is a member of the glutathione S-transferase (GSTs) super-family which encodes multifunctional enzymes important in the detoxification of electrophilic molecules, including carcinogens, mutagens, and several therapeutic drugs, by conjugation with glutathione. This enzyme also plays a significant role in the catabolism of phenylalanine and tyrosine. Thus defects in this enzyme may lead to severe metabolic disorders including alkaptonuria, phenylketonuria and tyrosinaemia. Several transcript variants of this gene encode multiple protein isoforms.
436	
437	
438	TRIP11 was first identified through its ability to interact functionally with thyroid hormone receptor-beta (THRB; MIM 190160). It has also been found in association with the Golgi apparatus and microtubules.[supplied by OMIM]
439	
440	This gene encodes the GA-binding protein transcription factor, beta subunit. This protein forms a tetrameric complex with the alpha subunit, and stimulates transcription of target genes. The encoded protein may be involved in activation of cytochrome oxidase expression and nuclear control of mitochondrial function. The crystal structure of a similar protein in mouse has been resolved as a ternary protein complex. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
441	
442	CPEB1 binds the cytoplasmic polyadenylation element (CPE), a uridine-rich sequence element within the mRNA 3-prime untranslated region (UTR) involved in directing cytoplasmic polyadenylation and translational activation. CPEB1 has been implicated in mediating both polyadenylation-dependent translation activation and CPE-directed translational repression.[supplied by OMIM]
443	The aquaporins/major intrinsic protein are a family of water-selective membrane channels. Aquaporin 9 has greater sequence similarity with AQP3 and AQP7 and they may be a subfamily. Aquaporin 9 allows passage of a wide variety of noncharged solutes. AQP9 stimulates urea transport and osmotic water permeability; there are contradicting reports about its role in providing glycerol permeability. Aquaporin 9 may also have some role in specialized leukocyte functions such as immunological response and bactericidal activity.
444	
445	SEMA7A is a membrane-bound semaphorin that associates with cell surfaces via a glycosylphosphatidylinositol (GPI) linkage. SEMA7A is also known as the John-Milton-Hagen (JMH) blood group antigen, an 80-kD glycoprotein expressed on activated lymphocytes and erythrocytes.[supplied by OMIM]
446	
447	
448	This intronless gene is located in the Prader-Willi syndrome deletion region. It is an imprinted gene and is expressed exclusively from the paternal allele. Studies in mouse suggest that the protein encoded by this gene may suppress growth in postmitotic neurons.
449	
450	
451	
452	
453	Ornithine decarboxylase catalyzes the conversion of ornithine to putrescine in the first and apparently rate-limiting step in polyamine biosynthesis. The ornithine decarboxylase antizymes play a role in the regulation of polyamine synthesis by binding to and inhibiting ornithine decarboxylase. Antizyme expression is auto-regulated by polyamine-enhanced translational frameshifting. The antizyme encoded by this gene inhibits ornithine decarboxylase but does not accelerate its degradation.
454	C17 is a cytokine-like protein specifically expressed in bone marrow and cord blood mononuclear cells that bear the CD34 (MIM 142230) surface marker (Liu et al., 2000).[supplied by OMIM]
455	
456	
457	
458	The protein encoded by this intronless gene belongs to the methyltransferase superfamily and acts as a G(1)/S and G(2)/M phase checkpoint regulator. It has been hypothesized that cigarette smoke-induced oxidative stress and transforming growth factor beta 1 may inhibit cellular proliferation by modulating the expression of this protein.
459	
460	
461	
462	
463	This gene encodes N-acetylgalactosamine-6-sulfatase which is a lysosomal exohydrolase required for the degradation of the glycosaminoglycans, keratan sulfate, and chondroitin 6-sulfate. Sequence alterations including point, missense and nonsense mutations, as well as those that affect splicing, result in a deficiency of this enzyme. Deficiencies of this enzyme lead to Morquio A syndrome, a lysosomal storage disorder.
464	Glutamic-oxaloacetic transaminase is a pyridoxal phosphate-dependent enzyme which exists in cytoplasmic and inner-membrane mitochondrial forms, GOT1 and GOT2, respectively.  GOT plays a role in amino acid metabolism and the urea and tricarboxylic acid cycles. The two enzymes are homodimeric and show close homology.
465	
466	This gene encodes a member of the WAP-type four disulfide core domain family. The WAP-type four-disulfide core domain, or WAP signature motif, contains eight cysteines forming four disulfide bonds at the core of the protein, and functions as a protease inhibitor in many family members. The encoded protein shares 81% amino acid identity with the rat ps20 protein, which was originally identified as a secreted growth inhibitor. This gene is mapped to chromosome 16q24, an area of frequent loss of heterozygosity in cancers, including prostate, breast and hepatocellular cancers and Wilms@#$%& tumor. Owing to its location and a possible growth inhibitory property of its gene product, this gene is suggested to be a tumor suppressor gene.
467	
468	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure, such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of the DEAD box protein family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene is intronless. It encodes an RNA-dependent ATPase. The encoded protein is localized in the mitochondria and the nucleus, and can be transported between the mitochondria and the nucleus.
469	Carbonic anhydrases are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide. They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid. They show extensive diversity in tissue distribution and in their subcellular localization. The cytosolic protein encoded by this gene is predominantly expressed in the salivary glands. Alternative splicing in the coding region results in multiple transcript variants encoding different isoforms.
470	
471	
472	Enzymes of the phospholipase C family catalyze the hydrolysis of phospholipids to yield diacylglycerols and water-soluble phosphorylated derivatives of the lipid head groups. A number of these enzymes have specificity for phosphoinositides. Of the phosphoinositide-specific phospholipase C enzymes, C-beta is regulated by heterotrimeric G protein-coupled receptors, while the closely related C-gamma-1 (PLCG1; MIM 172420) and C-gamma-2 enzymes are controlled by receptor tyrosine kinases. The C-gamma-1 and C-gamma-2 enzymes are composed of phospholipase domains that flank regions of homology to noncatalytic domains of the SRC oncogene product, SH2 and SH3.[supplied by OMIM]
473	
474	
475	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
476	
477	Proteins of the large Rab GTPase family (see RAB1A; MIM 179508) have regulatory roles in the formation, targeting, and fusion of intracellular transport vesicles. RAB11FIP3 is one of many proteins that interact with and regulate Rab GTPases (Hales et al., 2001).[supplied by OMIM]
478	The LUC7L gene may represent a mammalian heterochromatic gene, encoding a putative RNA-binding protein similar to the yeast Luc7p subunit of the U1 snRNP splicing complex that is normally required for 5-prime splice site selection (Tufarelli et al., 2001).[supplied by OMIM]
479	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator. A pseudogene has been identified on chromosome 17.
480	
481	Sulfotransferase enzymes catalyze the sulfate conjugation of many hormones, neurotransmitters, drugs, and xenobiotic compounds. These cytosolic enzymes are different in their tissue distributions and substrate specificities. The gene structure (number and length of exons) is similar among family members. This gene encodes one of two phenol sulfotransferases with thermostable enzyme activity. Two alternatively spliced variants that encode the same protein have been described.
482	This gene encodes a protein that is a member of the seven in absentia homolog (SIAH) family. The protein is an E3 ligase and is involved in ubiquitination and proteasome-mediated degradation of specific proteins. The activity of this ubiquitin ligase has been implicated in the development of certain forms of Parkinson@#$%&s disease, the regulation of the cellular response to hypoxia and induction of apoptosis. Alternative splicing results in several additional transcript variants, some encoding different isoforms and others that have not been fully characterized.
483	Pancreasin is a pancreatic tryptic serine peptidase that cleaves peptides after an arginine residue.[supplied by OMIM]
484	
485	
486	This gene encodes a protein of unknown function. Mutations in this gene have been observed in patients with Bardet-Biedl syndrome type 2. Bardet-Biedl syndrome is an autosomal recessive disorder characterized by severe pigmentary retinopathy, obesity, polydactyly, renal malformation and mental retardation.
487	
488	
489	
490	
491	
492	
493	The protein encoded by this gene is a component of desmosomes and of the epidermal cornified envelope in keratinocytes. The N-terminal domain of this protein interacts with the plasma membrane and its C-terminus interacts with intermediate filaments. Through its rod domain, this protein forms complexes with envoplakin. This protein may serve as a link between the cornified envelope and desmosomes as well as intermediate filaments. AKT1/PKB, a protein kinase mediating a variety of cell growth and survival signaling processes, is reported to interact with this protein, suggesting a possible role for this protein as a localization signal in AKT1-mediated signaling.
494	
495	
496	
497	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members. This gene encodes a protein of unknown function.
498	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein. A pseudogene corresponding to this gene is found on chromosome 7p.
499	
500	The function of putative receptor protein is unknown.
501	
502	Sarcomere assembly is regulated by the muscle protein titin. Titin is a giant elastic protein with kinase activity that extends half the length of a sarcomere. It serves as a scaffold to which myofibrils and other muscle related proteins are attached. This gene encodes a protein found in striated and cardiac muscle that binds to the titin Z1-Z2 domains and is a substrate of titin kinase, interactions thought to be critical to sarcomere assembly. Mutations in this gene are associated with limb-girdle muscular dystrophy type 2G.
503	TNK1 is a nonreceptor tyrosine kinase. These kinases, like members of the SRC (MIM 190090) and JAK (see MIM 147795) families, mediate intracellular signaling downstream of receptor activation.[supplied by OMIM]
504	
505	
506	
507	This gene encodes a member of the RING finger protein family of transcription factors. The protein is primarily expressed in brain. The gene is located within the Smith-Magenis syndrome region on chromosome 17.
508	The protein encoded by this gene contains four calcium-binding EF-hand domains and belongs to the recoverin family of neuronal calcium sensors. Recoverin may prolong the termination of the phototransduction cascade by blocking the phosphorylation of photo-activated rhodopsin. Recoverin may be the antigen responsible for cancer-associated retinopathy; an autoimmune disease of the retina caused by a tumor in another tissue.
509	
510	
511	This gene belongs to a family of nonclustered, diverged homeobox genes. It may play a role in regulating growth and differentiation.
512	
513	The protein encoded by this gene is a member of the STAT protein family. In response to cytokines and growth factors, STAT family members are phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. This protein is activated through phosphorylation in response to various cytokines and growth factors including IFNs, EGF, IL5, IL6, HGF, LIF and BMP2. This protein mediates the expression of a variety of genes in response to cell stimuli, and thus plays a key role in many cellular processes such as cell growth and apoptosis. The small GTPase Rac1 has been shown to bind and regulate the activity of this protein. PIAS3 protein is a specific inhibitor of this protein. Three alternatively spliced transcript variants encoding distinct isoforms have been described.
514	Huntington@#$%&s disease (HD), a neurodegenerative disorder characterized by loss of striatal neurons, is caused by an expansion of a polyglutamine tract in the HD protein huntingtin. This gene encodes a protein that interacts with huntingtin, with two cytoskeletal proteins (dynactin and pericentriolar autoantigen protein 1), and with a hepatocyte growth factor-regulated tyrosine kinase substrate. The interactions with cytoskeletal proteins and a kinase substrate suggest a role for this protein in vesicular trafficking or organelle transport. Two transcripts encoding different isoforms have been completely described. Although additional isoforms have been reported, the full-length nature of the transcripts encoding them has not been defined.
515	
516	
517	
518	This gene encodes a protein that may modulate the transcriptional repression activities of death-associated protein 6 (DAXX), which interacts with histone deacetylase, core histones, and other histone-associated proteins. In mouse, the encoded protein binds to the putative leucine zipper domain of macroH2A1.2, a variant H2A histone that is enriched on inactivated X chromosomes. The BTB/POZ domain of this protein has been shown in other proteins to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes. Alternative splicing of this gene results in multiple transcript variants encoding the same protein.
519	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXB genes located in a cluster on chromosome 17. The product of the Hoxb2 mouse gene is essential for motor neuron development in the @#$%&ventral@#$%& basal plate of rhombomere 4. The exact role of the human gene has yet to be determined.
520	
521	The protein encoded by this gene is believed to be a secretory protein. It has regions of similarity to hydrophilic segments of yeast mannosyltransferases. Its expression is ubiquitous and the gene appears to be relatively conserved among mammals.
522	
523	
524	This gene encodes a member of the Ced-4 family of apoptosis proteins. Ced-family members contain a caspase recruitment domain (CARD) and are known to be key mediators of programmed cell death. The encoded protein contains a distinct N-terminal pyrin-like motif, which is possibly involved in protein-protein interactions. This protein interacts strongly with caspase 2 and weakly with caspase 9. Overexpression of this gene was demonstrated to induce apoptosis in cells. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants has not been determined.
525	
526	
527	In mouse, Hes7 expression is associated with somitogenesis and is controlled by Notch (see MIM 190198) signaling (Bessho et al., 2001).[supplied by OMIM]
528	This gene belongs to the TIMP gene family. The proteins encoded by this gene family are natural inhibitors of the matrix metalloproteinases, a group of peptidases involved in degradation of the extracellular matrix. Unlike the inducible expression of some other TIMP gene family members, the expression of this gene is largely constitutive. In addition, this gene diplays several features of a housekeeping gene and is likely to play a role significantly different from that of other family members.
529	
530	
531	Mice homozygous for the rd mutation display hereditary retinal degeneration which has been considered a model for human retinitis pigmentosa. In affected animals, the retinal rod photoreceptor cells begin degenerating at about postnatal day 8, and by 4 weeks no photoreceptors are left. Farber and Lolley (1974, 1976) showed that degeneration is preceded by accumulation of cyclic GMP in the retina and is correlated with deficient activity of the rod photoreceptor cGMP-phosphodiesterase. Bennett et al. (1996) tested the possibility of altering the course of retinal degeneration through subretinal injection of recombinant replication defective adenovirus that contained the murine cDNA for wildtype beta-PDE. Subretinal injection of rd mice was carried out 4 days after birth, before the onset of rd retinal degeneration. Following therapy, beta-PDE transcripts and enzyme activity were detected, and histologic studies revealed that photoreceptor cell death was significantly retarded.[supplied by OMIM]
532	
533	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
534	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs.  This protein is a member of the high sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
535	The protein encoded by this gene is a member of the keratin gene family. As a type I hair keratin, it is an acidic protein which heterodimerizes with type II keratins to form hair and nails. The type I hair keratins are clustered in a region of chromosome 17q12-q21 and have the same direction of transcription.
536	
537	Heterochromatin protein-1 (HP1) is localized at heterochromatin sites, where it mediates gene silencing.[supplied by OMIM]
538	
539	
540	
541	This gene encodes a member of the aquaporin family of intrinsic membrane proteins that function as water-selective channels in the plasma membranes of many cells. The encoded protein is the predominant aquaporin found in brain. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
542	NPC1 was identified as the gene that when mutated, results in Niemann-Pick C disease. NPC1 encodes a putative integral membrane protein containing motifs consistent with a role in intracellular transport of cholesterol to post-lysosomal destinations.
543	
544	
545	This gene encodes a small nuclear ribonucleoprotein that belongs to the SNRNP core protein family. The protein may act as a charged protein scaffold to promote SNRNP assembly or strengthen SNRNP-SNRNP interactions through nonspecific electrostatic contacts with RNA.
546	This gene encodes electron-transfer-flavoprotein, beta polypeptide, which shuttles electrons between primary flavoprotein dehydrogenases involved in mitochondrial fatty acid and amino acid catabolism and the membrane-bound electron transfer flavoprotein ubiquinone oxidoreductase. The gene deficiencies have been implicated in type II glutaricaciduria. Alternatively spliced transcript variants have been found for this gene.
547	
548	
549	
550	
551	
552	The B lymphocyte antigen receptor is a multimeric complex which includes the antigen-specific component, surface immunoglobulin (Ig). Surface Ig non-covalently associates with 2 other proteins, Ig-alpha and Ig-beta, which are necessary for expression and function of the B-cell antigen receptor. This gene encodes the Ig-alpha protein of the B-cell antigen component. Alternative splicing has been observed and two transcript variants have been described.
553	
554	
555	
556	
557	
558	
559	
560	
561	
562	
563	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. Its encoded protein is secreted and may play a role in suppression of tumorigenesis in breast and prostate cancers. Alternate splicing of this gene results in two transcript variants encoding the same protein.
564	
565	
566	
567	
568	
569	Sulfotransferase enzymes catalyze the sulfate conjugation of many hormones, neurotransmitters, drugs, and xenobiotic compounds. These cytosolic enzymes are different in their tissue distributions and substrate specificities. The gene structure (number and length of exons) is similar among family members. This gene sulfates dehydroepiandrosterone but not 4-nitrophenol, a typical substrate for the phenol and estrogen sulfotransferase subfamilies. Two alternatively spliced variants that encode different isoforms have been described.
570	
571	
572	
573	
574	The galectins are a family of beta-galactoside-binding proteins implicated in modulating cell-cell and cell-matrix interactions. Differential and in situ hybridizations indicate that this lectin is specifically expressed in keratinocytes. It is expressed at all stages of epidermal differentiation (i.e., in basal and suprabasal layers). It is moderately repressed by retinoic acid. The protein was found mainly in stratified squamous epithelium. The antigen localized to basal keratinocytes, although it was also found, albeit at lower levels, in the suprabasal layers where it concentrated to areas of cell-to-cell contact. The cellular localization and its striking down-regulation in cultured keratinocytes imply a role in cell-cell and/or cell-matrix interactions necessary for normal growth control.
575	
576	
577	This gene encodes a member of the hepatoma-derived growth factor (HDGF) family. Two alternatively spliced transcript variants encoding slightly different isoforms have been found for this gene.
578	
579	AP3D1 is a subunit of the AP3 adaptor-like complex, which is not associated with clathrin. The AP3D1 subunit is implicated in intracellular biogenesis and trafficking of pigment granules and possibly platelet dense granules and neurotransmitter vesicles.[supplied by OMIM]
580	The protein encoded by this gene is a methyltransferase that converts guanidoacetate to creatine, using S-adenosylmethionine as the methyl donor. Defects in this gene have been implicated in neurologic syndromes and muscular hypotonia, probably due to creatine deficiency and accumulation of guanidinoacetate in the brain of affected individuals. Two transcript variants encoding different isoforms have been described for this gene.
581	This gene encodes a member of the PUF family. The encoded protein contains a sequence-specific RNA binding domain comprised of eight repeats and N- and C-terminal flanking regions, and serves as a translational regulator of specific mRNAs by binding to their 3@#$%& untranslated regions. The evolutionarily conserved function of the encoded protein in invertebrates and lower vertebrates suggests that the human protein may be involved in translational regulation of embryogenesis, and cell development and differentiation. Alternatively spliced transcript variants encoding different isoforms have been described.
582	The protein encoded by this gene is a member of the X11 protein family. It is an adapter protein that interacts with the Alzheimer@#$%&s disease amyloid precursor protein. This gene product is believed to be involved in signal transduction processes. This gene is a candidate gene for Alzheimer@#$%&s disease.
583	
584	
585	This gene encodes a member of a family of small membrane proteins that share a 35-amino acid signature sequence domain, beginning with the sequence PFXYD and containing 7 invariant and 6 highly conserved amino acids. The approved human gene nomenclature for the family is FXYD-domain containing ion transport regulator. Mouse FXYD5 has been termed RIC (Related to Ion Channel). FXYD2, also known as the gamma subunit of the Na,K-ATPase, regulates the properties of that enzyme. FXYD1 (phospholemman), FXYD2 (gamma), FXYD3 (MAT-8), FXYD4 (CHIF), and FXYD5 (RIC) have been shown to induce channel activity in experimental expression systems. Transmembrane topology has been established for two family members (FXYD1 and FXYD2), with the N-terminus extracellular and the C-terminus on the cytoplasmic side of the membrane. The protein encoded by this gene is a plasma membrane substrate for several kinases, including protein kinase A, protein kinase C, NIMA kinase, and myotonic dystrophy kinase. It is thought to form an ion channel or regulate ion channel activity. Transcript variants with different 5@#$%& UTR sequences have been described in the literature.
586	
587	Cytochrome c oxidase (COX), the terminal enzyme of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. It is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may be involved in the regulation and assembly of the complex. This nuclear gene encodes subunit VIb. Three pseudogenes COX6BP-1, COX6BP-2 and COX6BP-3 have been found on chromosomes 7, 17 and 22q13.1-13.2, respectively.
588	
589	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein likely localizes to the endoplasmic reticulum. When expressed in yeast the enzyme is capable of oxdizing arachidonic acid; however, its physiological function has not been determined. This gene is part of a cluster of cytochrome P450 genes on chromosome 19.
590	The protein encoded by this gene is a member of the phosphatidic acid phosphatase (PAP) family. PAPs convert phosphatidic acid to diacylglycerol, and function in de novo synthesis of glycerolipids as well as in receptor-activated signal transduction mediated by phospholipase D. This protein is similar to phosphatidic acid phosphatase type 2A (PPAP2A) and type 2B (PPAP2B). All three proteins contain 6 transmembrane regions, and a consensus N-glycosylation site. This protein has been shown to possess membrane associated PAP activity. Three alternatively spliced transcript variants encoding distinct isoforms have been reported.
591	Gap junction protein, alpha 4 is a member of the connexin gene family and a component of gap junctions. Gap junctions are composed of arrays of intercellular channels and provide a route for the diffusion of materials of low molecular weight from cell to cell
592	
593	This gene belongs to the ADAMTS (A Disintegrin And Metalloproteinase domain, with ThromboSpondin type-1 modules) family of zinc-dependent proteases. ADAMTS proteases are complex secreted enzymes containing a prometalloprotease domain of the reprolysin type attached to an ancillary domain with a highly conserved structure that includes at least one thrombospondin type 1 repeat. They have been demonstrated to have important roles in connective tissue organization, coagulation, inflammation, arthritis, angiogenesis and cell migration. The product of this gene plays a major role in growth and in skin, lens, and heart development. It is also a candidate gene for autosomal recessive Weill-Marchesani syndrome.
594	
595	This gene encodes a protein which is capable of interacting with the major nonstructural protein of parvovirus H-1 and 70-kDa heat shock cognate protein; however, its function is not known. Since this transcript is expressed ubiquitously in various tissues, this protein may serve a housekeeping function.
596	
597	
598	
599	
600	CENTG2 belongs to an ADP-ribosylation factor GTPase-activating (ARF-GAP) protein family involved in membrane traffic and actin cytoskeleton dynamics (Nie et al., 2002).[supplied by OMIM]
601	Aspartyl-tRNA synthetase (DARS) is part of a multienzyme complex of aminoacyl-tRNA synthetases.  Aspartyl-tRNA synthetase charges its cognate tRNA with aspartate during protein biosynthesis.
602	
603	
604	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants has not been determined.
605	
606	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands.
607	
608	
609	
610	
611	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein of unknown function. It shows high transcription levels in 2 retinoblastoma cell lines and in tissues of neuroectodermal origin.
612	This gene is a member of the beta-1,3-galactosyltransferase (beta3GalT) gene family. This family encodes type II membrane-bound glycoproteins with diverse enzymatic functions using different donor substrates (UDP-galactose and UDP-N-acetylglucosamine) and different acceptor sugars (N-acetylglucosamine, galactose, N-acetylgalactosamine). The beta3GalT genes are distantly related to the Drosophila Brainiac gene and have the protein coding sequence contained in a single exon. The beta3GalT proteins also contain conserved sequences not found in the beta4GalT or alpha3GalT proteins. The carbohydrate chains synthesized by these enzymes are designated as type 1, whereas beta4GalT enzymes synthesize type 2 carbohydrate chains. The ratio of type 1:type 2 chains changes during embryogenesis. By sequence similarity, the beta3GalT genes fall into at least two groups: beta3GalT4 and 4 other beta3GalT genes (beta3GalT1-3, beta3GalT5). This gene is expressed exclusively in the brain. The encoded protein shows strict donor substrate specificity for UDP-galactose.
613	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role. The protein encoded by this gene is one of the PKC family members. This kinase can be activated rapidly by the agonists of G protein-coupled receptors. It resides in both cytoplasm and nucleus, and its nuclear accumulation is found to be dramatically enhanced in response to its activation. This kinase can also be activated after B-cell antigen receptor (BCR) engagement, which requires intact phopholipase C gamma and the involvement of other PKC family members.
614	
615	
616	The protein encoded by this gene is a member of the serine/threonine protein kinase family. This kinase has been shown to specifically activate MAPK8/JNK. The activation of MAPK8 by this kinase is found to be inhibited by the dominant-negative mutants of MAP3K7/TAK1, MAP2K4/MKK4, and MAP2K7/MKK7, which suggests that this kinase may function through the MAP3K7-MAP2K4-MAP2K7 kinase cascade, and mediate the TNF-alpha signaling pathway. Alternatively spliced transcript variants encoding different isoforms have been identified.
617	
618	
619	Glucosidase I is the first enzyme in the N-linked oligosaccharide processing pathway. GCS1 cleaves the distal alpha-1,2-linked glucose residue from the Glc(3)-Man(9)-GlcNAc(2) oligosaccharide precursor.  GCS1 is located in the lumen of the endoplasmic reticulum.
620	
621	The protein encoded by this gene belongs to family 1 of G-protein coupled receptors. It is the receptor for follicle stimulating hormone and functions in gonad development. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
622	
623	
624	The protein encoded by this gene is a member of the Toll-like receptor (TLR) family which plays a fundamental role in pathogen recognition and activation of innate immunity. TLRs are highly conserved from Drosophila to humans and share structural and functional similarities. They recognize pathogen-associated molecular patterns (PAMPs) that are expressed on infectious agents, and mediate the production of cytokines necessary for the development of effective immunity. The various TLRs exhibit different patterns of expression. This gene is ubiquitously expressed, and at higher levels than other TLR genes. Different length transcripts presumably resulting from use of alternative polyadenylation site, and/or from alternative splicing, have been noted for this gene.
625	
626	
627	
628	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 non-identical kinase catalytic domains and phosphorylates various substrates, including members of the mitogen-activated kinase (MAPK) signalling pathway. The activity of this protein has been implicated in controlling cell growth and differentiation. Mutations in this gene have been associated with Coffin-Lowry syndrome (CLS).
629	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Most members contain an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain. Transcript variants encoding different isoforms have been identified.
630	
631	The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily.  These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues.  They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and differentiation.  Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli.  This gene product inactivates ERK1 and ERK2, is predominantly expressed in hematopoietic tissues, and is localized in the nucleus.
632	
633	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. The product of this gene has not been shown to be a functional channel, however, it may require other non-pore-forming proteins for activity.
634	The protein encoded by this gene contains a RING finger motif and is highly similar to the ring finger proteins RNF28/MURF1 and RNF29/MURF2. In vitro studies demonstrated that this protein, RNF28, and RNF29 form heterodimers, which may be important for the regulation of titin kinase and microtubule-dependent signal pathways in striated muscles. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
635	
636	Spectrin is a tetrameric cytoskeletal protein essential for determination of cell shape, resilience of membranes to mechanical stress, positioning of transmembrane proteins, and organization of organelles and molecular traffic. Alpha- and beta-spectrin subunits form antiparallel dimers that self associate to give the spectrin tetramer. Beta subunits, such as SPTBN1, contain most of the spectrin binding activity (Hayes et al., 2000).[supplied by OMIM]
637	
638	This gene encodes a protein with src homology domain 3 (SH3) patterns. Mutations in this gene cause familial juvenile nephronophthisis.
639	
640	This gene encodes a type I integral membrane protein found only in the rough endoplasmic reticulum. The encoded protein is part of an N-oligosaccharyl transferase complex that links high mannose oligosaccharides to asparagine residues found in the Asn-X-Ser/Thr consensus motif of nascent polypeptide chains. This protein is similar in sequence to the yeast oligosaccharyl transferase subunit SWP1.
641	
642	The protein encoded by this gene is involved in mRNA splicing and may play a role in the determination of alternative splicing. The encoded nuclear protein belongs to the splicing factor SR family and has been shown to bind with and modulate another member of the family, SFRS12.
643	
644	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. The product of this gene has not been shown to be a functional channel, however, it may require other non-pore-forming proteins for activity.
645	The protein encoded by this gene is a transmembrane (type I) heparan sulfate proteoglycan that functions as a receptor in intracellular signaling. The encoded protein is found as a homodimer and is a member of the syndecan proteoglycan family. This gene is found on chromosome 20, while a pseudogene has been found on chromosome 22.
646	
647	Sal-like genes encode putative zinc finger transcription factors. For background information on SALL genes, see SALL1 (MIM 602218).[supplied by OMIM]
648	
649	
650	This gene encodes a precursor protein consisting of arginine vasopressin and two associated proteins, neurophysin II and a glycopeptide, copeptin. Arginine vasopressin is a posterior pituitary hormone which is synthesized in the supraoptic nucleus and paraventricular nucleus of the hypothalamus. Along with its carrier protein, neurophysin II, it is packaged into neurosecretory vesicles and transported axonally to the nerve endings in the neurohypophysis where it is either stored or secreted into the bloodstream. The precursor is thought to be activated while it is being transported along the axon to the posterior pituitary. Arginine vasopressin acts as a growth factor by enhancing pH regulation through acid-base transport systems. It has a direct antidiuretic action on the kidney, and also causes vasoconstriction of the peripheral vessels. This hormone can contract smooth muscle during parturition and lactation. It is also involved in cognition, tolerance, adaptation and complex sexual and maternal behaviour, as well as in the regulation of water excretion and cardiovascular functions. Mutations in this gene cause autosomal dominant neurohypophyseal diabetes insipidus (ADNDI).
651	The protein encoded by this gene is a membrane glycosylphosphatidylinositol-anchored glycoprotein that tends to aggregate into rod-like structures. The encoded protein contains a highly unstable region of five tandem octapeptide repeats. This gene is found on chromosome 20, approximately 20 kbp upstream of a gene which encodes a biochemically and structurally similar protein to the one encoded by this gene. Mutations in the repeat region as well as elsewhere in this gene have been associated with Creutzfeldt-Jakob disease, fatal familial insomnia, Gerstmann-Straussler disease, Huntington disease-like 1, and kuru. Two transcript variants encoding the same protein have been found for this gene.
652	More than 99% of the vertebrate ocular lens is comprised of terminally differentiated lens fiber cells.  Two lens-specific intermediate filament-like proteins, CP49 (also known as phakinin) and the protein product of this gene, filensin (or CP115), are expressed only after fiber cell differentiation has begun. Both proteins are found in a structurally unique cytoskeletal element that is referred to as the beaded filament (BF).
653	
654	
655	
656	
657	In infectious mononucleosis, anti-EBNA-1 antibodies are produced which cross-react with multiple normal human proteins. The cross-reactivity is due to anti-gly/ala antibodies that cross-react with host proteins containing configurations like those in the EBNA-1 repeat. One such antigen is RALY which is a member of the heterogeneous nuclear ribonucleoprotein gene family.
658	
659	Components of the extracellular matrix exert myriad effects on tissues throughout the body. In particular, the laminins, a family of heterotrimeric extracellular glycoproteins, affect tissue development and integrity in such diverse organs as the kidney, lung, skin, and nervous system. It is thought that laminins mediate the attachment, migration, and organization of cells into tissues during embryonic development by interacting with other extracellular matrix components. Laminins function as heterotrimeric complexes of alpha, beta, and gamma chains, with each chain type representing a different subfamily of proteins. The protein encoded by this gene belongs to the alpha subfamily of laminin chains and is a major component of basement membranes. Two transcript variants encoding different isoforms have been found for this gene, but the full-length nature of one of them has not been determined.
660	The protein encoded by this gene contains a structural module related to that of the seven transmembrane domain G protein-coupled receptor superfamily. This protein has sequence and structural similarities to the beta-amyloid binding protein (BBP), but, unlike BBP, it does not regulate a response to beta-amyloid peptide. This protein may have regulatory roles in cell death or proliferation signal cascades. This gene has multiple alternatively spliced transcript variants which encode two different isoforms.
661	This gene encodes one of the many ionotropic glutamate receptor (GluR) subunits that function as ligand-gate ionic channels. The specific GluR subunit encoded by this gene is of the kainate receptor subtype. Receptor assembly and intracellular trafficking of ionotropic glutamate receptors are regulated by RNA editing and alternative splicing. These receptors mediate excitatory neurotransmission and are critical for normal synaptic function. Two alternatively spliced transcript variants that encode different isoforms have been described. Exons of this gene are interspersed with exons from the C21orf41 gene, which is transcribed in the same orientation as this gene but does not seem to encode a protein.
662	
663	
664	
665	The encoded protein, which directly binds to pituitary tumor-transforming gene 1 protein (PTTG1), facilitates the nuclear translocation of PTTG1 and potentiates the transcriptional activation of basic fibroblast growth factor by PTTG1. The gene product localizes to both the cytoplasm and nucleus. Its NLS is required for its own nuclear localization, the nuclear localization of PTTG1, and its interaction with PTTG1.
666	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
667	
668	Catechol-O-methyltransferase catalyzes the transfer of a methyl group from S-adenosylmethionine to catecholamines, including the neurotransmitters dopamine, epinephrine, and norepinephrine.  This O-methylation results in one of the major degradative pathways of the catecholamine transmitters.  In addition to its role in the metabolism of endogenous substances, COMT is important in the metabolism of catechol drugs used in the treatment of hypertension, asthma, and Parkinson disease.  COMT is found in two forms in tissues, a soluble form (S-COMT) and a membrane-bound form (MB-COMT).  The differences between S-COMT and MB-COMT reside within the N-termini.  The transcript variants are formed through the use of alternative translation initiation sites and promoters.
669	
670	
671	Glutathione S-transferase (GSTs) theta 2 (GSTT2) is a member of a superfamily of proteins that catalyze the conjugation of reduced glutathione to a variety of electrophilic and hydrophobic compounds. Human GSTs can be divided into five main classes: Alpha, Mu, Pi, Theta, and Zeta. The theta class includes GSTT1 and GSTT2. The GSTT1 and GSTT2 share 55% amino acid sequence identity and both of them were claimed to have an important role in human carcinogenesis. GSTT1 gene is located approximately 50kb away from the GSTT2 gene. GSTT1 and GSTT2 genes have a similar structure, being composed of five exons with identical exon/intron boundaries.
672	
673	
674	In response to DNA damage and replication blocks, cell cycle progression is halted through the control of critical cell cycle regulators. The protein encoded by this gene is a cell cycle checkpoint regulator and putative tumor suppressor. It contains a forkhead-associated protein interaction domain essential for activation in response to DNA damage and is rapidly phosphorylated in response to replication blocks and DNA damage. When activated, the encoded protein is known to inhibit CDC25C phosphatase, preventing entry into mitosis, and has been shown to stabilize the tumor suppressor protein p53, leading to cell cycle arrest in G1. In addition, this protein interacts with and phosphorylates BRCA1, allowing BRCA1 to restore survival after DNA damage. Mutations in this gene have been linked with Li-Fraumeni syndrome, a highly penetrant familial cancer phenotype usually associated with inherited mutations in TP53. Also, mutations in this gene are thought to confer a predisposition to sarcomas, breast cancer, and brain tumors. This nuclear protein is a member of the CDS1 subfamily of serine/threonine protein kinases. Three transcript variants encoding different isoforms have been found for this gene.
675	
676	
677	
678	The protein encoded by this gene is a cytosolic regulatory component of the superoxide-producing phagocyte NADPH-oxidase, a multicomponent enzyme system important for host defense. This protein is preferentially expressed in cells of myeloid lineage. It interacts primarily with neutrophil cytosolic factor 2 (NCF2/p67-phox) to form a complex with neutrophil cytosolic factor 1 (NCF1/p47-phox), which further interacts with the small G protein RAC1 and translocates to the membrane upon cell stimulation. This complex then activates flavocytochrome b, the membrane-integrated catalytic core of the enzyme system. The PX domain of this protein can bind phospholipid products of the PI(3) kinase, which suggests its role in PI(3) kinase-mediated signaling events. The phosphorylation of this protein was found to negatively regulate the enzyme activity. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
679	
680	
681	
682	
683	
684	This gene encodes a protein with an ankyrin repeat region and two BTB/POZ domains, which are thought to be involved in protein-protein interactions. Expression of this gene is activated by the phosphatase and tensin homolog, a tumor suppressor. Alternate splicing results in three transcript variants encoding different isoforms.
685	
686	The gamma-aminobutyric acid (GABA) A receptor is a multisubunit chloride channel that mediates the fastest inhibitory synaptic transmission in the central nervous system. This gene is mapped to chromosome 15q11.2-q12 in a cluster that includes the alpha 5, and gamma 3 subunit genes of the GABA receptor. Alternative splicing generates 2 transcript variants. Deletion mutation of this gene may be involved in the pathogenesis of Angelman syndrome and Prader-Willi syndrome.
687	
688	
689	
690	The protein encoded by this gene is a highly conserved ring finger protein. It is an essential subunit of SKP1-cullin/CDC53-F box protein ubiquitin ligases, which are a part of the protein degradation machinery important for cell cycle progression and signal transduction. This protein interacts with, and is a substrate of, casein kinase II (CSNK2A1/CKII). The phosphorylation of this protein by CSNK2A1 has been shown to promote the degradation of IkappaBalpha (CHUK/IKK-alpha/IKBKA) and p27Kip1(CDKN1B). Alternatively spliced transcript variants encoding distinct isoforms have been reported.
691	
692	The gene product belongs to the nuclear receptor superfamily, members of which are transcription factors characterized by a ligand-binding domain and a DNA-binding domain. The encoded protein is a transcriptional regulator of the cytochrome P450 gene CYP3A4, binding to the response element of the CYP3A4 promoter as a heterodimer with the 9-cis retinoic acid receptor RXR. It is activated by a range of compounds that induce CYP3A4, including dexamethasone and rifampicin. The gene product contains a zinc finger domain. Three alternatively spliced transcripts that encode different isoforms have been described, one of which encodes two products through the use of alternative translation initiation codons. Additional transcript variants derived from alternative promoter usage, alternative splicing, and/or alternative polyadenylation exist, but they have not been fully described.
693	
694	This gene encodes a member of the latrophilin subfamily of G-protein coupled receptors (GPCR). Latrophilins may function in both cell adhesion and signal transduction. In experiments with non-human species, endogenous proteolytic cleavage within a cysteine-rich GPS (G-protein-coupled-receptor proteolysis site) domain resulted in two subunits (a large extracellular N-terminal cell adhesion subunit and a subunit with substantial similarity to the secretin/calcitonin family of GPCRs) being non-covalently bound at the cell membrane.
695	This gene is a member of the synapsin gene family. Synapsins encode neuronal phosphoproteins which associate with the cytoplasmic surface of synaptic vesicles. Family members are characterized by common protein domains, and they are implicated in synaptogenesis and the modulation of neurotransmitter release, suggesting a potential role in several neuropsychiatric diseases. This member of the synapsin family encodes a neuron-specific phosphoprotein that selectively binds to small synaptic vesicles in the presynaptic nerve terminal. The TIMP4 gene is located within an intron of this gene and is transcribed in the opposite direction. Mutations in this gene may be associated with abnormal presynaptic function and schizophrenia. Alternative splicing of this gene results in two transcripts.
696	Many DNA-binding transcriptional activator proteins enhance the initiation rate of RNA polymerase II-mediated gene transcription by interacting functionally with the general transcription machinery bound at the basal promoter. Adaptor proteins are usually required for this activation, possibly to acetylate and destabilize nucleosomes, thereby relieving chromatin constraints at the promoter. The protein encoded by this gene is a transcriptional activator adaptor and has been found to be part of the PCAF histone acetylase complex. In addition, it associates with the tumor suppressor protein p53 and is required for full activity of p53 and p53-mediated apoptosis. At least four alternatively spliced variants have been found for this gene, but the full-length nature of some variants has not been determined.
697	This gene encodes a member of the G-protein coupled receptor family. The encoded protein may play a role in energy homeostasis and regulation of body weight. Two identified transcript variants are expressed in several tissues and are evolutionary conserved in fish and swine. One transcript, 1a, excises an intron and encodes the functional protein; this protein is the receptor for the Ghrelin ligand and defines a neuroendocrine pathway for growth hormone release. The second transcript (1b) retains the intron and does not function as a receptor for Ghrelin; however, it may function to attenuate activity of isoform 1a.
698	
699	
700	
701	This gene is one member of a family of sulfate/anion transporter genes. Family members are well conserved in their genomic (number and size of exons) and protein (aa length among species) structures yet have markedly different tissue expression patterns. This gene is mainly expressed in the kidney and pancreas. Three splice variants that encode different isoforms have been identified.
702	
703	
704	
705	
706	This gene encodes a type I membrane protein that is a member of the immunoglobulin superfamily. This protein is expressed by antigen-presenting cells, and it is the ligand for two proteins at the cell surface of T cells, CD28 antigen and cytotoxic T-lymphocyte-associated protein 4. Binding of this protein with CD28 antigen is a costimulatory signal for activation of the T-cell. Binding of this protein with cytotoxic T-lymphocyte-associated protein 4 negatively regulates T-cell activation and diminishes the immune response. Alternative splicing results in two transcript variants encoding different isoforms. Additional transcript variants have been described, but their full-length sequences have not been determined.
707	
708	The GATA family of transcription factors, which contain zinc fingers in their DNA binding domain, have emerged as candidate regulators of gene expression in hematopoietic cells (Tsai et al., 1994 [PubMed 8078582]). GATA1 (MIM 305371) is essential for normal primitive and definitive erythropoiesis and is expressed at high levels in erythroid cells, mast cells, and megakaryocytes. GATA2 is expressed in hematopoietic progenitors, including early erythroid cells, mast cells, and megakaryocytes, and also in nonhematopoietic embryonic stem cells. In chicken erythroid progenitors, forced expression of GATA2 promotes proliferation at the expense of differentiation (Briegel et al., 1993). GATA3 (MIM 131320) expression is restricted to T-lymphoid cells and some nonhematopoietic cell types, including embryonic stem cells.[supplied by OMIM]
709	
710	The protein encoded by this gene is a member of the ADAMTS family. Members of this family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. This gene product contains 2 C-terminal TS motifs, like those found in ADAMTS5 and ADAMTS6 proteins.
711	
712	
713	
714	
715	
716	The eukaryotic cell cycle is governed by cyclin-dependent protein kinases (CDKs) whose activities are regulated by cyclins and CDK inhibitors. The 8 species of cyclins reported in mammals, cyclins A through H, share a conserved amino acid sequence of about 90 residues called the cyclin box. The amino acid sequence of cyclin G is well conserved among mammals. The nucleotide sequence of cyclin G1 and cyclin G2 are 53% identical. Unlike cyclin G1, cyclin G2 contains a C-terminal PEST protein destabilization motif, suggesting that cyclin G2 expression is tightly regulated through the cell cycle.
717	
718	The MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) family consists of six human proteins, several of which are involved in the production of leukotrienes and prostaglandin E, important mediators of inflammation. This gene encodes a protein which catalyzes the conjugation of leukotriene A4 and reduced glutathione to produce leukotriene C4.
719	
720	The protein encoded by the gene is a cysteine proteinase and a member of the papain superfamily. This proteolytic enzyme is involved in cellular protein degradation and turnover. The recombinant form of this enzyme was shown to degrade synthetic peptides typically used as substrates for cysteine proteinases and its proteolytic activity was abolished by an inhibitor of cyteine proteinase.
721	Sterol-C4-mehtyl oxidase-like protein was isolated based on its similarity to the yeast ERG25 protein. It contains a set of putative metal binding motifs with similarity to that seen in a family of membrane desaturases-hydroxylases. The protein is localized to the endoplasmic reticulum membrane and is believed to function in cholesterol biosynthesis. Alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
722	HERC5 belongs to the HERC family of ubiquitin ligases, all of which contain a HECT domain and at least 1 RCC1 (MIM 179710)-like domain (RLD). The 350-amino acid HECT domain is predicted to catalyze the formation of a thioester with ubiquitin before transferring it to a substrate, and the RLD is predicted to act as a guanine nucleotide exchange factor for small G proteins (Hochrainer et al., 2005).[supplied by OMIM]
723	Actin alpha 1 which is expressed in skeletal muscle is one of six different actin isoforms which have been identified. Actins are highly conserved proteins that are involved in cell motility, structure and integrity. Alpha actins are a major constituent of the contractile apparatus.
724	The protein encoded by this gene is an inflammatory chemokine that belongs to the CXC chemokine family. This chemokine is produced concomitantly with interleukin-8 (IL8) in response to stimulation with either interleukin-1 (IL1) or tumor necrosis factor-alpha (TNFA). This chemokine is a potent chemotaxin involved in neutrophil activation.
725	The protein encoded by this gene is an isozyme of the long-chain fatty-acid-coenzyme A ligase family. Although differing in substrate specificity, subcellular localization, and tissue distribution, all isozymes of this family convert free long-chain fatty acids into fatty acyl-CoA esters, and thereby play a key role in lipid biosynthesis and fatty acid degradation.
726	
727	ZNF185 encodes a LIM-domain zinc finger protein. These proteins are thought to be involved in protein-protein interactions.[supplied by OMIM]
728	KIT encodes the human homolog of the proto-oncogene c-kit. C-kit was first identified as the cellular homolog of the feline sarcoma viral oncogene v-kit.  KIT is a type 3 transmembrane receptor  for MGF (mast cell growth factor, also known as stem cell factor).  Mutations in KIT are associated with gastrointestinal stromal tumors, mast cell disease, acute myelogenous lukemia, and piebaldism.
729	
730	This gene belongs to a family of genes that function as receptors for tachykinins.  Receptor affinities are specified by variations in the 5@#$%&-end of the sequence.  The receptors belonging to this family are characterized by interactions with G proteins and 7 hydrophobic transmembrane regions.  This gene encodes the receptor for the tachykinin neurokinin 3, also referred to as neurokinin B.
731	
732	Organelles of the secretory and endocytic pathways are distinguished by their luminal acidity, which is generated by the activity of an electrogenic vacuolar-type hydrogen ATPase. Progressive acidification of vesicles in the endocytic pathway is essential for the redistribution and degradation of internalized membrane proteins, such as ligand receptor complexes and fluid-phase solutes. This gene is expressed predominantly in the trans-Golgi network, and mediates the influx of sodium or potassium in exchange for hydrogen. It may thus play an important role in maintaining cation homeostasis and function of the trans-Golgi network. This gene is part of a gene cluster on chromosome Xp11.23.
733	This gene encodes a member of the CHIC family of proteins. The encoded protein contains a cysteine-rich hydrophobic (CHIC) motif, and is localized to vesicular structures and the plasma membrane. This gene is associated with some cases of acute myeloid leukemia.
734	The protein encoded by this gene was identified as an interacting protein that binds specifically to MAP kinase kinase MAP2K1/MEK1 and to MAP kinase MAPK2/ERK1. This protein enhances the activation of MAPK2, and thus is thought to function as an adaptor to enhance the efficiency of the MAP kinase cascade.
735	
736	
737	
738	Slightly proximal to the Huntington disease locus, the human MSX1 gene is deleted in patients with Wolf-Hirschhorn syndrome. This gene is also called HOX7. The Msx family of vertebrate HOX genes was originally isolated by homology to the Drosophila msh (muscle segment homeo box) gene. This is a candidate gene for human cleft palate.
739	
740	
741	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
742	This gene is part of a 500 kb inverted duplication on chromosome 5q13. This duplicated region contains at least four genes and repetitive elements which make it prone to rearrangements and deletions. The repetitiveness and complexity of the sequence have also caused difficulty in determining the organization of this genomic region. This copy of the gene is full length; additional copies with truncations and internal deletions are also present in this region of chromosome 5q13. It is thought that this gene is a modifier of spinal muscular atrophy caused by mutations in a neighboring gene, SMN1. The protein encoded by this gene contains regions of homology to two baculovirus inhibitor of apoptosis proteins, and it is able to suppress apoptosis induced by various signals. A second transcript for this gene has been described, but its full length nature has not been determined.
743	
744	
745	THO2 is part of the TREX (transcription/export) complex, which includes TEX1 (MIM 606929), HPR1 (MIM 606930), ALY (MIM 604171), and UAP56 (MIM 606390).[supplied by OMIM]
746	Follistatin is a single-chain gonadal protein that specifically inhibits follicle-stimulating hormone release.  The single FST gene encodes two isoforms, FST317 and FST344 containing 317 and 344 amino acids respectively, resulting from alternative splicing of the precursor mRNA.  In a study in which 37 candidate genes were tested for linkage and association with polycystic ovary syndrome (PCOS) or hyperandrogenemia in 150 families, evidence was found for linkage between PCOS and follistatin.
747	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. Multiple alternatively spliced transcript variants that encode different protein isoforms have been described for this gene.
748	
749	
750	
751	The MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) family includes a number of human proteins, several of which are involved the production of leukotrienes. This gene encodes an enzyme that catalyzes the first step in the biosynthesis of cysteinyl leukotrienes, potent biological compounds derived from arachidonic acid. Leukotrienes have been implicated as mediators of anaphylaxis and inflammatory conditions such as human bronchial asthma. This protein localizes to the nuclear envelope and adjacent endoplasmic reticulum. Multiple protein isoforms are encoded by transcript variants of this gene.
752	
753	The protein encoded by this gene is a member of cyclin-dependent protein kinase (CDK) family. CDK family members are highly similar to the gene products of Saccharomyces cerevisiae cdc28, and Schizosaccharomyces pombe cdc2, and are known to be important regulators of cell cycle progression. This gene was identified as a gene absent in leukemic patients with chromosome 5q deletion. This loss may be an important determinant of dysmyelopoiesis. The function of this gene has not yet been determined.
754	
755	
756	This gene encodes a secreted, homodimeric glycoprotein that is expressed in a wide variety of tissues and may have autocrine or paracrine functions. The encoded protein has 10 of its 15 cysteine residues conserved among stanniocalcin family members and is phosphorylated by casein kinase 2 exclusively on its serine residues. Its C-terminus contains a cluster of histidine residues which may interact with metal ions. The protein may play a role in the regulation of renal and intestinal calcium and phosphate transport, cell metabolism, or cellular calcium/phosphate homeostasis. Constitutive overexpression of human stanniocalcin 2 in mice resulted in pre- and postnatal growth restriction, reduced bone and skeletal muscle growth, and organomegaly. Expression of this gene is induced by estrogen and altered in some breast cancers.
757	
758	
759	Pre-mRNA splicing occurs in two sequential transesterification steps. The protein encoded by this gene is a splicing factor that has been found to be essential during the second catalytic step in the pre-mRNA splicing process. It associates with the spliceosome and contains a zinc knuckle motif that is found in other splicing factors and is involved in protein-nucleic acid and protein-protein interactions.
760	
761	The expression of DUSP1 gene is induced in human skin fibroblasts by oxidative/heat stress and growth factors.  It specifies a protein with structural features similar to members of the non-receptor-type protein-tyrosine phosphatase family, and which has significant amino-acid sequence similarity to a Tyr/Ser-protein phosphatase encoded by the late gene H1 of vaccinia virus.  The bacterially expressed and purified DUSP1 protein has intrinsic phosphatase activity, and specifically inactivates mitogen-activated protein (MAP) kinase in vitro by the concomitant dephosphorylation of both its phosphothreonine and phosphotyrosine residues.  Furthermore, it suppresses the activation of MAP kinase by oncogenic ras in extracts of Xenopus oocytes.  Thus, DUSP1 may play an important role in the human cellular response to environmental stress as well as in the negative regulation of cellular proliferation.
762	Striated muscle sarcomeres are highly organized structures composed of actin (thin) and myosin (thick) filaments that slide past each other during contraction. The integrity of sarcomeres is controlled by a set of structural proteins, among which are titin (TTN; MIM 188840), a giant molecule that contains several immunoglobulin (Ig)-like domains and associates with thin and thick filaments, and alpha-actinin (ACTN1; MIM 102575), an actin cross-linking protein. Mutations in several sarcomeric and sarcolemmal proteins have been shown to result in muscular dystrophy and cardiomyopathy.[supplied by OMIM]
763	This gene encodes a secreted, 16 kDa protein that acts as a chemotactic factor to neutrophils and stimulates the growth of chondrocytes and osteoblasts. This protein has high sequence similarity to the chondromodulin repeat regions of the chicken myb-induced myeloid 1 protein. A polymorphism in this gene may be associated with rheumatoid arthritis.
764	This gene product belongs to the SCAMP family of proteins which are secretory carrier membrane proteins. They function as carriers to the cell surface in post-golgi recycling pathways. Different family members are highly related products of distinct genes, and are usually expressed together. These findings suggest that the SCAMPs may function at the same site during vesicular transport rather than in separate pathways. Multiple transcript variants encoding different isoforms have been found for this gene, but the full-length nature of some variants has not been defined.
765	The protein encoded by this gene is the receptor for colony stimulating factor 1, a cytokine which controls the production, differentiation, and function of macrophages. This receptor mediates most if not all of the biological effects of this cytokine. Ligand binding activates the receptor kinase through a process of oligomerization and transphosphorylation. The encoded protein is a tyrosine kinase transmembrane receptor and member of the CSF1/PDGF receptor family of tyrosine-protein kinases. Mutations in this gene have been associated with a predisposition to myeloid malignancy. The first intron of this gene contains a transcriptionally inactive ribosomal protein L7 processed pseudogene oriented in the opposite direction.
766	Centromere and kinetochore proteins play a critical role in centromere structure, kinetochore formation, and sister chromatid separation. The protein encoded by this gene colocalizes with inner kinetochore plate proteins CENP-A and CENP-C in both interphase and metaphase. It localizes outside of centromeric heterochromatin, where CENP-B is localized, and inside the kinetochore corona, where CENP-E is localized during prometaphase. It is thought that this protein can bind to itself, as well as to CENP-A, CENP-B or CENP-C. Multimers of the protein localize constitutively to the inner kinetochore plate and play an important role in the organization and function of the active centromere-kinetochore complex.
767	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. The encoded protein is localized to the sperm flagellum and may be involved in the regulation of sperm motility. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
768	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes subunit VIIc, which shares 87% and 85% amino acid sequence identity with mouse and bovine COX VIIc, respectively, and is found in all tissues. A pseudogene COX7CP1 has been found on chromosome 13.
769	
770	
771	
772	This gene encodes a protein involved in regulating responses to cytokines by members of the Rel/NF-kappaB transcription factor family. These factors play a central role in innate immunity in response to pathogens, inflammatory signals and stress. This gene product interacts with TRAF proteins (tumor necrosis factor receptor-associated factors) and either I-kappaB kinase or MAP kinase to activate either NF-kappaB or Jun kinase. Two alternative transcripts encoding different proteins have been identified. A third transcript, which does not encode a protein and is transcribed in the opposite orientation, has been identified. Overexpression of this transcript has been shown to reduce expression of at least one of the protein encoding transcripts, suggesting it has a regulatory role in the expression of this gene.
773	
774	
775	
776	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. These enzymes are involved in cellular defense against toxic, carcinogenic, and pharmacologically active electrophilic compounds. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-tranferase belonging to the alpha class. The alpha class genes, which are located in a cluster on chromosome 6, are highly related and encode enzymes with glutathione peroxidase activity that function in the detoxification of lipid peroxidation products. Reactive electrophiles produced by oxidative metabolism have been linked to a number of degenerative diseases including Parkinson@#$%&s disease, Alzheimer@#$%&s disease, cataract formation, and atherosclerosis.
777	FOXF2 encodes forkhead box F2, one of many human homologues of the Drosophila melanogaster transcription factor forkhead. FOXF2 is expressed in lung and placenta, and has been shown to transcriptionally activate several lung-specific genes.
778	
779	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit. This gene is located in the class II region of the MHC (major histocompatibility complex). Expression of this gene is induced by gamma interferon and this gene product replaces catalytic subunit 1 (proteasome beta 6 subunit) in the immunoproteasome. Proteolytic processing is required to generate a mature subunit. Two alternative transcripts encoding different isoforms have been identified; both isoforms are processed to yield the same mature subunit.
780	
781	This gene encodes a ubiquitous nuclear protein that stimulates both cell proliferation and differentiation. It is a member of the MADS (MCM1, Agamous, Deficiens, and SRF) box superfamily of transcription factors. This protein binds to the serum response element (SRE) in the promoter region of target genes. This protein regulates the activity of many immediate-early genes, for example c-fos, and thereby participates in cell cycle regulation, apoptosis, cell growth, and cell differentiation. This gene is the downstream target of many pathways; for example, the mitogen-activated protein kinase pathway (MAPK) that acts through the ternary complex factors (TCFs).
782	The protein encoded by this gene is a member of the bone morphogenetic protein family which is part of the transforming growth factor-beta superfamily. The transforming growth factor-beta superfamily includes large families of growth and differentiation factors. It is thought that this protein may be involved in oocyte maturation and follicular development as a homodimer or by forming heterodimers with a related protein, Gdf9.
783	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. This structure consists of approximately 146 bp of DNA wrapped around a nucleosome, an octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H3 family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6p22-p21.3.
784	
785	
786	
787	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
788	
789	MICA encodes the higly polymorphic MHC (HLA) class I chain-related gene A. The protein product is expressed on the cell surface, although unlike canonical class I molecules does not seem to associate with beta-2-microglobulin. It is thought that MICA functions as a stress-induced antigen that is broadly recognized by intestinal epithelial gamma delta T cells.
790	This intronless gene encodes a 70kDa heat shock protein which is a member of the heat shock protein 70 family. In conjuction with other heat shock proteins, this protein stabilizes existing proteins against aggregation and mediates the folding of newly translated proteins in the cytosol and in organelles. It is also involved in the ubiquitin-proteasome pathway through interaction with the AU-rich element RNA-binding protein 1. The gene is located in the major histocompatibility complex class III region, in a cluster with two closely related genes which encode similar proteins.
791	A cluster of genes, BAT1-BAT5, has been localized in the vicinity of the genes for TNF alpha and TNF beta. These genes are all within the human major histocompatibility complex class III region. The protein encoded by this gene is a nuclear protein. It has been implicated in the control of apoptosis and regulating heat shock protein. There are three alternatively spliced transcript variants described for this gene.
792	
793	This gene is a member of the beta-1,3-galactosyltransferase (beta3GalT) gene family. This family encodes type II membrane-bound glycoproteins with diverse enzymatic functions using different donor substrates (UDP-galactose and UDP-N-acetylglucosamine) and different acceptor sugars (N-acetylglucosamine, galactose, N-acetylgalactosamine). The beta3GalT genes are distantly related to the Drosophila Brainiac gene and have the protein coding sequence contained in a single exon. The beta3GalT proteins also contain conserved sequences not found in the beta4GalT or alpha3GalT proteins. The carbohydrate chains synthesized by these enzymes are designated as type 1, whereas beta4GalT enzymes synthesize type 2 carbohydrate chains. The ratio of type 1:type 2 chains changes during embryogenesis. By sequence similarity, the beta3GalT genes fall into at least two groups: beta3GalT4 and 4 other beta3GalT genes (beta3GalT1-3, beta3GalT5). This gene is oriented telomere to centromere in close proximity to the ribosomal protein S18 gene. The functionality of the encoded protein is limited to ganglioseries glycolipid biosynthesis.
794	
795	PDEF is an ETS transcription factor expressed in prostate epithelial cells. It acts as an androgen-independent transactivator of PSA (MIM 176820) expression.[supplied by OMIM]
796	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds to the immunosuppressants FK506 and rapamycin. It is thought to mediate calcineurin inhibition. It also interacts functionally with mature hetero-oligomeric progesterone receptor complexes along with the 90 kDa heat shock protein and P23 protein. This gene has been found to have multiple polyadenylation sites.
797	
798	
799	Ecto-5-prime-nucleotidase (5-prime-ribonucleotide phosphohydrolase; EC 3.1.3.5) catalyzes the conversion at neutral pH of purine 5-prime mononucleotides to nucleosides, the preferred substrate being AMP. The enzyme consists of a dimer of 2 identical 70-kD subunits bound by a glycosyl phosphatidyl inositol linkage to the external face of the plasma membrane. The enzyme is used as a marker of lymphocyte differentiation. Consequently, a deficiency of NT5 occurs in a variety of immunodeficiency diseases (e.g., see MIM 102700, MIM 300300). Other forms of 5-prime nucleotidase exist in the cytoplasm and lysosomes and can be distinguished from ecto-NT5 by their substrate affinities, requirement for divalent magnesium ion, activation by ATP, and inhibition by inorganic phosphate.[supplied by OMIM]
800	
801	
802	
803	
804	
805	
806	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. The encoded protein is expressed in endothelial cells, cultured fibroblasts, and osteosarcoma cells. It associates with protein kinases A and C and phosphatase, and serves as a scaffold protein in signal transduction. This protein and RII PKA colocalize at the cell periphery. This protein is a cell growth-related protein. Antibodies to this protein can be produced by patients with myasthenia gravis. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
807	Arginase catalyzes the hydrolysis of arginine to ornithine and urea.  At least two isoforms of mammalian arginase exist (types I and II) which differ in their tissue distribution, subcellular localization, immunologic crossreactivity and physiologic function.  The type I isoform encoded by this gene, is a cytosolic enzyme and expressed predominantly in the liver as a component of the urea cycle.  Inherited deficiency of this enzyme results in argininemia, an autosomal recessive disorder characterized by hyperammonemia.
808	
809	HSF2, as well as the related gene HSF1, encodes a protein that binds specifically to the heat-shock element and has homology to HSFs of other species. Heat shock transcription factors activate heat-shock response genes under conditions of heat or other stresses. Although the names HSF1 and HSF2 were chosen for historical reasons, these peptides should be referred to as heat-shock transcription factors.
810	
811	
812	
813	Polyspecific organic cation transporters in the liver, kidney, intestine, and other organs are critical for elimination of many endogenous small organic cations as well as a wide array of drugs and environmental toxins. This gene is one of three similar cation transporter genes located in a cluster on chromosome 6. The encoded protein contains twelve putative transmembrane domains and is a plasma integral membrane protein. It is found primarily in the kidney, where it may mediate the first step in cation reabsorption. Two transcript variants encoding different isoforms have been found for this gene.
814	This gene encodes a protein similar to thrombospondin type 1 domain-containing proteins. In addition, the protein contains a furin-like cysteine-rich region. Furin-like repeat domains have been found in a variety of eukaryotic proteins involved in the mechanism of signal transduction by receptor tyrosine kinases. The function of this protein has not been determined.
815	
816	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
817	
818	
819	
820	
821	This gene encodes an N-acetylgalactosaminyltransferase, which has 97% sequence identity to the mouse protein. This gene is deleted in Williams syndrome, a multisystem developmental disorder caused by the deletion of contiguous genes at 7q11.23.
822	MIST1 is a basic helix-loop-helix (bHLH) protein that binds to the DNA target sequence CANNTG (E box) present in the promoter or enhancer region of numerous developmentally regulated genes.[supplied by OMIM]
823	This gene encodes a protein which belongs to the C2H2-type zinc finger proteins subclass of the Gli family. They are characterized as a transcription factor that binds to DNA through zinc finger motifs. These motifs have conserved H-C links. Gli family zinc finger proteins are mediators of Sonic hedgehog (Shh) signaling. The protein encoded by this gene localizes in the cytoplasm and activates patched Drosophila homolog (PTCH) gene expression. It is also thought to play a role during embryogenesis. The encoded protein is associated with several phenotypes- Greig cephalopolysyndactyly syndrome, Pallister-Hall syndrome, preaxial polydactyly type IV, postaxial polydactyly types A1 and B.
824	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. Methylation of this gene may result in the loss of its expression and, since the encoded protein upregulates the tumor suppressor p53, this protein may play an important role in tumorigenesis.
825	The protein encoded by this gene is the fourth major glycoprotein of the platelet surface and serves as a receptor for thrombospondin in platelets and various cell lines. Since thrombospondins are widely distributed proteins involved in a variety of adhesive processes, this protein may have important functions as a cell adhesion molecule. It binds to collagen, thrombospondin, anionic phospholipids and oxidized LDL. It directly mediates cytoadherence of Plasmodium falciparum parasitized erythrocytes and it binds long chain fatty acids and may function in the transport and/or as a regulator of fatty acid transport. Mutations in this gene cause platelet glycoprotein deficiency. Three alternatively spliced transcript variants encoding the same protein isoform have been found for this gene.
826	This gene encodes a member of the PSCD (pleckstrin homology, Sec7 and coiled-coil domains) family. PSCD family members have identical structural organization that consists of an N-terminal coiled-coil motif, a central Sec7 domain, and a C-terminal pleckstrin homology (PH) domain. The coiled-coil motif is involved in homodimerization, the Sec7 domain contains guanine-nucleotide exchange protein (GEP) activity, and the PH domain interacts with phospholipids and is responsible for association of PSCDs with membranes. Members of this family appear to mediate the regulation of protein sorting and membrane trafficking. This encoded protein is involved in the control of Golgi structure and function, and it may have a physiological role in regulating ADP-ribosylation factor protein 6 (ARF) functions, in addition to acting on ARF1.
827	
828	
829	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
830	
831	This gene, which is expressed only during embryogenesis, encodes a protein that is instrumental in patterning the early embryo. It has been implicated as the key inductive signal in patterning of the ventral neural tube, the anterior-posterior limb axis, and the ventral somites. Of three human proteins showing sequence and functional similarity to the sonic hedgehog protein of Drosophila, this protein is the most similar. The protein is made as a precursor that is autocatalytically cleaved; the N-terminal portion is soluble and contains the signalling activity while the C-terminal portion is involved in precursor processing. More importantly, the C-terminal product covalently attaches a cholesterol moiety to the N-terminal product, restricting the N-terminal product to the cell surface and preventing it from freely diffusing throughout the developing embryo. Defects in this protein or in its signalling pathway are a cause of holoprosencephaly (HPE), a disorder in which the developing forebrain fails to correctly separate into right and left hemispheres. HPE is manifested by facial deformities. In addition, it is thought that mutations in this gene or in its signalling pathway may be responsible for VACTERL syndrome, which is characterized by vertebral defects, anal atresia, tracheoesophageal fistula with esophageal atresia, radial and renal dysplasia, cardiac anomalies, and limb abnormalities.
832	
833	Cancer-associated chromosomal changes often involve regions containing fragile sites. This gene maps to a commom fragile site on chromosome 7q31.2 designated FRA7G. This gene is similar to mouse Testin, a testosterone-responsive gene encoding a Sertoli cell secretory protein containing three LIM domains. LIM domains are double zinc-finger motifs that mediate protein-protein interactions between transcription factors, cytoskeletal proteins and signaling proteins. Multiple protein isoforms are encoded by transcript variants of this gene.
834	The developmentally regulated expression of the globin genes depends on upstream regulatory elements termed locus control regions (LCRs). LCRs are associated with powerful enhancer activity that is mediated by the transcription factor NFE2 (nuclear factor erythroid-2). NFE2 recognition sites are also present in the gene promoters of 2 heme biosynthetic enzymes, porphobilinogen deaminase (PBGD; MIM 176000) and ferrochelatase (FECH; MIM 177000). NFE2 DNA-binding activity consists of a heterodimer containing an 18-kD Maf protein (MafF, MafG (MIM 602020), or MafK) and p45 (MIM 601490). Both subunits are members of the activator protein-1 superfamily of basic leucine zipper (bZIP) proteins (see MIM 165160). Maf homodimers suppress transcription at NFE2 sites.[supplied by OMIM]
835	
836	This gene encodes a mitochondrial transcription termination factor. This protein participates in attenuating transcription from the mitochondrial genome; this attenuation allows higher levels of expression of 16S ribosomal RNA relative to the tRNA gene downstream. The product of this gene has three leucine zipper motifs bracketed by two basic domains that are all required for DNA binding. There is evidence that, for this protein, the zippers participate in intramolecular interactions that establish the three-dimensional structure required for DNA binding.
837	
838	The protein encoded by this gene is a member of the RAMP family of single-transmembrane-domain proteins, called receptor (calcitonin) activity modifying proteins (RAMPs). RAMPs are type I transmembrane proteins with an extracellular N terminus and a cytoplasmic C terminus. RAMPs are required to transport calcitonin-receptor-like receptor (CRLR) to the plasma membrane. CRLR, a receptor with seven transmembrane domains, can function as either a calcitonin-gene-related peptide (CGRP) receptor or an adrenomedullin receptor, depending on which members of the RAMP family are expressed. In the presence of this (RAMP3) protein, CRLR functions as an adrenomedullin receptor.
839	
840	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein that belongs to the L32 ribosomal protein family. A pseudogene corresponding to this gene is found on chromosome Xp.
841	
842	
843	
844	The protein encoded by this gene is similar to ING1, a tumor suppressor protein that can interact with TP53, inhibit cell growth, and induce apoptosis. This protein contains a PHD-finger, which is a common motif in proteins involved in chromatin remodeling. This gene can activate p53 trans-activated promoters, including promoters of p21/waf1 and bax. Overexpression of this gene has been shown to inhibit cell growth and induce apoptosis. Allelic loss and reduced expression of this gene were detected in head and neck cancers. Multiple alternatively spliced transcript variants have been observed.
845	Hyaluronidase degrades hyaluronic acid, a major structural proteoglycan found in extracellular matrices and basement membranes. Six members of the hyaluronidase family are clustered into two tightly linked groups on chromosome 3p21.3 and 7q31.3. This gene was previously referred to as HYAL1 and HYA1 and has since been assigned the official symbol SPAM1; another family member on chromosome 3p21.3 has been assigned HYAL1. This gene encodes a GPI-anchored enzyme located on the human sperm surface and inner acrosomal membrane. This multifunctional protein is a hyaluronidase that enables sperm to penetrate through the hyaluronic acid-rich cumulus cell layer surrounding the oocyte, a receptor that plays a role in hyaluronic acid induced cell signaling, and a receptor that is involved in sperm-zona pellucida adhesion. Abnormal expression of this gene in tumors has implicated this protein in degradation of basement membranes leading to tumor invasion and metastasis. Multiple protein isoforms are encoded by transcript variants of this gene.
846	Dihydrolipoyl dehydrogenase is a component of the mitochondrial pyruvate and alpha-ketoglutarate dehydrogenase complexes. Defects in the enzyme cause congenital infantile lactic acidosis
847	
848	
849	
850	
851	
852	
853	This gene is a member of the cadherin superfamily, genes encoding calcium-dependent, membrane-associated glycoproteins. The encoded protein is cadherin-like, consisting of an extracellular region, containing 7 cadherin domains, and a transmembrane region but lacking the conserved cytoplasmic domain. The protein is a component of the gastrointestinal tract and pancreatic ducts, acting as an intestinal proton-dependent peptide transporter in the first step in oral absorption of many medically important peptide-based drugs. The protein may also play a role in the morphological organization of liver and intestine.
854	
855	This gene encodes a member of the Ly-6/neurotoxin gene family, a group of lymphocyte antigens that attach to the cell surface by a glycosylphosphatidylinositol anchor and have a unique structure showing conserved 8-10 cysteine residues with a characteristic spacing pattern. Functional analysis indicates that this protein is not a ligand or neurotransmitter but has the capacity to enhance nicotinic acetylcholine receptor function in the presence of acetylcholine. This gene may also play a role in the pathogenesis of psoriasis vulgaris. Alternatively spliced variants encoding different isoforms have been identified.
856	LOXL2 is a member of the lysyl oxidase (LO; MIM 153455) gene family. LO is an extracellular, copper-dependent enzyme that initiates the cross-linking of collagens and elastin by catalyzing the oxidative deamination of peptidyl lysine to alpha-aminoadipic-delta-semialdehyde. Members of the LO family have diverse functions, including tumor suppression and cell adhesion and senescence.[supplied by OMIM]
857	
858	
859	This gene is deleted in the primary tumor of hepatocellular carcinoma. It maps to 8p22-p21.3, a region frequently deleted in solid tumors. It is suggested that this gene is a candidate tumor suppressor gene for human liver cancer, as well as for prostate, lung, colorectal, and breast cancers. Alternative splicing at this locus results in several transcript variants encoding different isoforms.
860	Gamma-interferon (IFNG; MIM 147570) has an antiproliferative effect on many tumor cells and inhibits intracellular pathogens such as Toxoplasma and Chlamydia, at least partly because of the induction of indoleamine 2,3-dioxygenase (INDO; EC 1.13.11.42). This enzyme catalyzes the degradation of the essential amino acid L-tryptophan to N-formylkynurenine.[supplied by OMIM]
861	
862	
863	The protein encoded by this gene is a member of the syntrophin family. Syntrophins are cytoplasmic peripheral membrane proteins that typically contain 2 pleckstrin homology (PH) domains, a PDZ domain that bisects the first PH domain, and a C-terminal domain that mediates dystrophin binding. This gene is specifically expressed in the brain. Transcript variants for this gene have been described, but their full-length nature has not been determined.
864	
865	
866	
867	This gene encodes a transcription factor that interacts with specific negative regulatory elements (NREs) to mediate transcriptional repression of certain NK-kappa-B-responsive genes. The protein localizes predominantly to the nucleolus with a small fraction found in the nucleoplasm and cytoplasm.
868	This gene encodes a member of the Argonaute family of proteins which play a role in RNA interference. The encoded protein is highly basic, and contains a PAZ domain and a PIWI domain. It may interact with dicer1 and play a role in short-interfering-RNA-mediated gene silencing.
869	The enzyme encoded by this gene utilizes diacylglycerol and fatty acyl CoA as substrates in order to catalyze the final stage of triacylglycerol synthesis. It is also involved in cellular as well as physiological metabolic processes.
870	
871	
872	
873	INSL4 encodes the insulin-like 4 protein, a member of the insulin superfamily. INSL4 encodes a precursor that undergoes post-translational cleavage to produce 3 polypeptide chains, A-C, that form tertiary structures composed of either all three chains, or just the A and B chains. Expression of INSL4 products occurs within the early placental cytotrophoblast and syncytiotrophoblast.
874	
875	
876	
877	
878	B-type receptors for the neurotransmitter GABA (gamma-aminobutyric acid) inhibit neuronal activity through G protein-coupled second-messenger systems, which regulate the release of neurotransmitters and the activity of ion channels and adenylyl cyclase. See GABBR1 (MIM 603540) for additional background information on GABA-B receptors.[supplied by OMIM]
879	
880	The protein encoded by this gene is an oxidoreductase belonging to the copper type II, ascorbate-dependent monooxygenase family. It is present in the synaptic vesicles of postganglionic sympathetic neurons and converts dopamine to norepinephrine. It exists in both soluble and membrane-bound forms, depending on the absence or presence, respectively, of a signal peptide.
881	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies. The protein has also been localized to the nucleus, where it interacts with the activation domain of the HIV-1 Tat protein. The Tat protein activates transcription of HIV-1 genes.
882	
883	
884	This gene encodes a member of the F-box protein family, members of which are characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into three classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene contains WD-40 domains, in addition to an F-box motif, so it belongs to the Fbw class. Alternatively spliced transcript variants encoding distinct isoforms have been identified for this gene.
885	
886	
887	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L29P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
888	
889	Alpha-1,3-galactosyltransferase is a Golgi membrane-bound enzyme involved in the biosynthesis of the carbohydrate chains of glycoproteins and glycolipids. Enzyme levels are developmentally regulated and differentiation dependent. The enzyme is present in most mammals but cannot be detected in man, apes, or Old World monkeys. The carbohydrate structure produced by the enzyme is immunogenic in man, and most normal, healthy individuals have a significant titer of a natural antibody against the enzyme. Aberrant expression of the enzyme in man has been implicated in autoimmune disorders and in the occurrence of certain germ cell tumors.[supplied by OMIM]
890	This gene is one of several CC cytokine genes clustered on the p-arm of chromosome 9. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. Similar to other chemokines the protein encoded by this gene inhibits hemopoiesis and stimulates chemotaxis. This protein is chemotactic in vitro for thymocytes and activated T cells, but not for B cells, macrophages, or neutrophils. The cytokine encoded by this gene may also play a role in mediating homing of lymphocytes to secondary lymphoid organs. It is a high affinity functional ligand for chemokine receptor 7 (CCR7) that is expressed on T and B lymphocytes and a known receptor for another member of the cytokine family (small inducible cytokine A19).
891	This gene encodes a member of the MutT family of nucleotide pyrophosphatases, a subset of the larger NUDIX hydrolase family. The gene product possesses a modification of the MutT sequence motif found in certain nucleotide pyrophosphatases. The enzyme asymmetrically hydrolyzes Ap4A to yield AMP and ATP and is responsible for maintaining the intracellular level of the dinucleotide Ap4A, the function of which has yet to be established. This gene may be a candidate tumor suppressor gene. Alternative splicing has been observed at this locus and three transcript variants, all encoding the same protein, have been identified.
892	Aquaporin 3 is a water channel protein. Aquaporins are a family of small integral membrane proteins related to the major intrinsic protein (MIP or AQP0). Aquaporin 3 is localized at the basal lateral membranes of collecting duct cells in the kidney. In addition to its water channel function, aquaporin 3 has been found to facilitate the transport of nonionic small solutes such as urea and glycerol, but to a smaller degree. It has been suggested that water channels can be functionally heterogeneous and possess water and solute permeation mechanisms.
893	Aquaporin 3 is a water channel protein. Aquaporins are a family of small integral membrane proteins related to the major intrinsic protein (MIP or AQP0). Aquaporin 3 is localized at the basal lateral membranes of collecting duct cells in the kidney. In addition to its water channel function, aquaporin 3 has been found to facilitate the transport of nonionic small solutes such as urea and glycerol, but to a smaller degree. It has been suggested that water channels can be functionally heterogeneous and possess water and solute permeation mechanisms.
894	
895	
896	This gene encodes the alpha chain of type XV collagen, a member of the FACIT collagen family (fibril-associated collagens with interrupted helices). Type XV collagen has a wide tissue distribution but the strongest expression is localized to basement membrane zones so it may function to adhere basement membranes to underlying connective tissue stroma. Mouse studies have shown that collagen XV deficiency is associated with muscle and microvessel deterioration.
897	
898	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. Four sequence-related potassium channel genes - shaker, shaw, shab, and shal - have been identified in Drosophila, and each has been shown to have human homolog(s). This gene encodes a member of the potassium channel, voltage-gated, shal-related subfamily, members of which form voltage-activated A-type potassium ion channels and are prominent in the repolarization phase of the action potential. This member mediates a rapidly inactivating, A-type outward potassium current which is not under the control of the N terminus as it is in Shaker channels.
899	This gene encodes adenosine kinase, an abundant enzyme in mammalian tissues. The enzyme catalyzes the transfer of the gamma-phosphate from ATP to adenosine, thereby serving as a regulator of concentrations of both extracellular adenosine and intracellular adenine nucleotides. Adenosine has widespread effects on the cardiovascular, nervous, respiratory, and immune systems and inhibitors of the enzyme could play an important pharmacological role in increasing intravascular adenosine concentrations and acting as anti-inflammatory agents. Alternative splicing results in two transcript variants encoding different isoforms. Both isoforms of the enzyme phosphorylate adenosine with identical kinetics and both require Mg2+ for activity.
900	Leptin (LEP; MIM 164160), an adipocyte-specific hormone that regulates adipose-tissue mass through hypothalamic effects on satiety and energy expenditure, acts through the leptin receptor (LEPR), a single-transmembrane-domain receptor of the cytokine receptor family.[supplied by OMIM]
901	
902	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator. Two transcripts encoding different isoforms have been described. A pseudogene has been identified on chromosome 3.
903	This gene is contained entirely within intron 22 of the factor VIII gene; spans less than 2 kb, and is transcribed in the direction opposite of factor VIII. A portion of intron 22 (int22h), containing F8A, is repeated twice extragenically closer to the Xq telomere. Although its function is unknown, the observation that this gene is conserved in the mouse implies it has some function. Unlike factor VIII, this gene is transcribed abundantly in a wide variety of cell types.
904	
905	
906	
907	
908	
909	
910	
911	The protein encoded by this gene is a phosphoprotein that binds to the sodium-potassium exchangers (NHEs). This protein serves as an essential cofactor which supports the physiological activity of NHE family members. It has protein sequence similarity to calcineurin B and it is also known to be an endogenous inhibitor of calcineurin activity.
912	
913	
914	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and the kininogens. The type 2 cystatin proteins are a class of cysteine proteinase inhibitors found in a variety of human fluids and secretions. The cystatin locus on chromosome 20 contains the majority of the type 2 cystatin genes and pseudogenes. This gene is located in the cystatin locus and encodes a type 2 salivary cysteine peptidase inhibitor. The protein is an S-type cystatin, based on its high level of expression in saliva, tears and seminal plasma. The specific role in these fluids is unclear but antibacterial and antiviral activity is present, consistent with a protective function.
915	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has three repeats of quasi-RRM domains that binds to RNAs. It is very similar to the family member HNRPH1. This gene is thought to be involved in Fabray disease and X-linked agammaglobulinemia phenotype.
916	Cell surface heparan sulfate proteoglycans are composed of a membrane-associated protein core substituted with a variable number of heparan sulfate chains. Members of the glypican-related integral membrane proteoglycan family (GRIPS) contain a core protein anchored to the cytoplasmic membrane via a glycosyl phosphatidylinositol linkage.  These proteins may play a role in the control of cell division and growth regulation.
917	
918	This gene encodes a member of the ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) protein family. ADAMTS family members share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The protein encoded by this gene has a high sequence similarity to the protein encoded by ADAMTS19, another family member. The function of this protein has not been determined.
919	
920	
921	This gene encodes a member of the ephrin (EPH) family. The ephrins and EPH-related receptors comprise the largest subfamily of receptor protein-tyrosine kinases and have been implicated in mediating developmental events, especially in the nervous system and in erythropoiesis. Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. This gene encodes an EFNA class ephrin which binds to the EPHA2, EPHA4, EPHA5, EPHA6, and EPHA7 receptors. Two transcript variants that encode different isoforms were identified through sequence analysis.
922	Hyperpolarization-activated cation channels of the HCN gene family, such as HCN1, contribute to spontaneous rhythmic activity in both heart and brain.[supplied by OMIM]
923	Exonucleolytic degradation of the poly(A) tail is often the first step in the decay of eukaryotic mRNAs.  The amino acid sequence of poly(A)-specific ribonuclease shows homology to the RNase D family of 3@#$%&-exonucleases. The protein appears to be localized in both the nucleus and the cytoplasm. It is not stably associated with polysomes or ribosomal subunits.
924	
925	
926	
927	
928	The androgen receptor gene is more than 90 kb long and codes for a protein that has 3 major functional domains: the N-terminal domain, DNA-binding domain, and androgen-binding domain. The protein functions as a steroid-hormone activated transcription factor. Upon binding the hormone ligand, the receptor dissociates from accessory proteins, translocates into the nucleus, dimerizes, and then stimulates transcription of androgen responsive genes. This gene contains 2 polymorphic trinucleotide repeat segments that encode polyglutamine and polyglycine tracts in the N-terminal transactivation domain of its protein. Expansion of the polyglutamine tract causes spinal bulbar muscular atrophy (Kennedy disease). Mutations in this gene are also associated with complete androgen insensitivity (CAIS). Two alternatively spliced variants encoding distinct isoforms have been described.
929	
930	
931	
932	Cytokine signals are mediated through specific receptor complexes, the components of which are mostly members of the type I cytokine receptor family. Type I cytokine receptors share conserved structural features in their extracellular domain. Receptor complexes are typically heterodimeric, consisting of alpha chains, which provide ligand specificity, and beta (or gamma) chains, which are required for the formation of high-affinity binding sites and signal transduction.[supplied by OMIM]
933	
934	RAS guanyl nucleotide-releasing protein (RASGRP) is a member of a family of genes characterized by the presence of a Ras superfamily guanine nucleotide exchange factor (GEF) domain. The corresponding rat gene rbc7, which lacks a 5-prime exon, represents a 5-prime and 3-prime truncated version of a larger normal rat transcript that encodes a predicted 90-kD protein. This shorter transcript has not been found in humans.
935	
936	This gene encodes a protein containing three zinc finger domains and a nuclear localization signal. The mRNA and the protein of this gene are upregulated by wildtype p53 and overexpression of this gene inhibits tumor cell growth, suggesting that this gene may have a role in the p53-dependent growth regulatory pathway. Alternative splicing of this gene results in two transcript variants encoding two isoforms differing in only one amino acid.
937	This gene encodes a member of the transcription elongation factor A (SII)-like (TCEAL) gene family. Members of this family contain TFA domains and may function as nuclear phosphoproteins that modulate transcription in a promoter context-dependent manner. Multiple family members are located on the X chromosome. Alternative splicing results in multiple transcript variants encoding a single isoform.
938	
939	
940	
941	
942	
943	Apelin is a neuropeptide expressed in the supraoptic and paraventricular nuclei that acts on specific receptors located on vasopressinergic neurons (De Mota et al., 2004).[supplied by OMIM]
944	
945	
946	
947	KIF21A belongs to a family of plus end-directed kinesin (see MIM 600025) motor proteins. Neurons use kinesin and dynein (see MIM 600112) microtubule-dependent motor proteins to transport essential cellular components along axonal and dendritic microtubules.[supplied by OMIM]
948	
949	
950	This gene encodes a protein which is associated with adenosine triphosphatases (ATPases). Proton-translocating ATPases have fundamental roles in energy conservation, secondary active transport, acidification of intracellular compartments, and cellular pH homeostasis. There are three classes of ATPases- F, P, and V. The vacuolar (V-type) ATPases have a transmembrane proton-conducting sector and an extramembrane catalytic sector. The encoded protein has been found associated with the transmembrane sector of the V-type ATPases.
951	
952	This gene is a member of the MAGEA gene family. The members of this family encode proteins with 50 to 80% sequence identity to each other. The promoters and first exons of the MAGEA genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. The MAGEA genes are clustered at chromosomal location Xq28. They have been implicated in some hereditary disorders, such as dyskeratosis congenita. Two transcript variants encoding different isoforms have been found for this gene.
953	
954	Lymphocytes proliferate and differentiate in response to various concentrations of different antigens. The ability of the B cell to respond in a specific, yet sensitive manner to the various antigens is achieved with the use of low-affinity antigen receptors. This gene encodes a cell surface molecule which assembles with the antigen receptor of B lymphocytes in order to decrease the threshold for antigen receptor-dependent stimulation.
955	The protein encoded by this gene is a transmembrane protein containing three extracellular CUB domains. This protein is found to be overexpressed in colon and lung cancers. Its expression level is correlated with the metastatic ability of carcinoma cells. This protein is located on the cell surface. It has been shown to be tyrosine phosphorylated in a cancer cell line. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
956	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Pseudogenes corresponding to this gene are found on chromosomes 2p and 17q.
957	The protein encoded by this gene is the beta subunit of a heterodimeric core-binding transcription factor belonging to the PEBP2/CBF transcription factor family which master-regulates a host of genes specific to hematopoiesis (e.g., RUNX1) and osteogenesis (e.g., RUNX2). The beta subunit is a non-DNA binding regulatory subunit; it allosterically enhances DNA binding by alpha subunit as the complex binds to the core site of various enhancers and promoters, including murine leukemia virus, polyomavirus enhancer, T-cell receptor enhancers and GM-CSF promoters. Alternative splicing generates two mRNA variants, each encoding a distinct carboxyl terminus. In some cases, a pericentric inversion of chromosome 16 [inv(16)(p13q22)] produces a chimeric transcript consisting of the N terminus of core-binding factor beta in a fusion with the C-terminal portion of the smooth muscle myosin heavy chain 11. This chromosomal rearrangement is associated with acute myeloid leukemia of the M4Eo subtype. Two transcript variants encoding different isoforms have been found for this gene.
958	Potassium channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. Four sequence-related potassium channel genes - shaker, shaw, shab, and shal - have been identified in Drosophila, and each has been shown to have human homolog(s). This gene encodes a member of the potassium channel, voltage-gated, shaker-related subfamily. This member includes three distinct isoforms which are encoded by three alternatively spliced transcript variants of this gene. These three isoforms are beta subunits, which form heteromultimeric complex with alpha subunits and modulate the activity of the pore-forming alpha subunits.
959	
960	
961	
962	The protein encoded by this gene is a member of the platelet-derived growth factor family. The four members of this family are mitogenic factors for cells of mesenchymal origin and are characterized by a motif of eight cysteines. This gene product can exist either as a homodimer or as a heterodimer with the platelet-derived growth factor beta polypeptide, where the dimers are connected by disulfide bonds. Studies using knockout mice have shown cellular defects in oligodendrocytes, alveolar smooth muscle cells, and Leydig cells in the testis; knockout mice die either as embryos or shortly after birth. Two splice variants have been identified for this gene.
963	HLA-B belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exon 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-B alleles have been described
964	
965	
966	
967	
968	
969	
970	
971	
972	Microtubules, polymers of alpha and beta tubulin subunits, form the mitotic spindle of a dividing cell and help to organize membranous organelles during interphase. Katanin is a heterodimer that consists of a 60 kDa ATPase (p60 subunit A 1) and an 80 kDa accessory protein (p80 subunit B 1). The p60 subunit acts to sever and disassemble microtubules, while the p80 subunit targets the enzyme to the centrosome. This gene encodes the p80 subunit. This protein is a member of the AAA family of ATPases.
973	
974	
975	
976	
977	The protein encoded by this gene is a member of a family of human VCX/Y genes. This gene family has multiple members on both X and Y chromosomes, and all are expressed exclusively in male germ cells. Members of the VCX/Y family share a high degree of sequence identity, with the exception that a 30-bp unit is tandemly repeated in X-linked members but occurs only once in Y-linked members. VCX/Y genes encode small and highly charged proteins of unknown function. This gene encodes a small, positively charged protein. The presence of a putative bipartite nuclear localization signal suggests that this gene encodes a nuclear protein. The genome has two identical copies of this gene within a palindromic region; this record represents the more centromeric copy.
978	The protein encoded by this gene is a component of the exocyst complex, a multiple protein complex essential for targeting exocytic vesicles to specific docking sites on the plasma membrane. Though best characterized in yeast, the component proteins and functions of the exocyst complex have been demonstrated to be highly conserved in higher eukaryotes. At least eight components of the exocyst complex, including this protein, are found to interact with the v-ral simian leukemia viral oncogene homolog A (RALA). RALA is a Ras-related small GTPase involved in controlling actin cytoskeletal remodeling and vesicle transport, which therefore suggests that the exocyst may act as an effector for activated RALA in directing site of exocytosis. Alternatively spliced transcript variants encoding distinct isoforms have been described.
979	
980	The protein encoded by this gene is a component of the exocyst complex, a multiple protein complex essential for targeting exocytic vesicles to specific docking sites on the plasma membrane. Though best characterized in yeast, the component proteins and functions of the exocyst complex have been demonstrated to be highly conserved in higher eukaryotes. At least eight components of the exocyst complex, including this protein, are found to interact with the v-ral simian leukemia viral oncogene homolog A (RALA). RALA is a Ras-related small GTPase involved in controlling actin cytoskeletal remodeling and vesicle transport, which therefore suggests that the exocyst may act as an effector for activated RALA in directing site of exocytosis. Alternatively spliced transcript variants encoding distinct isoforms have been described.
981	NEPH1 is a member of the nephrin-like protein family, which includes NEPH2 (MIM 607761) and NEPH3 (MIM 607762). The cytoplasmic domains of these proteins interact with the C terminus of podocin (NPHS2; MIM 604766), and the genes are expressed in kidney podocytes, cells involved in ensuring size- and charge-selective ultrafiltration (Sellin et al., 2003).[supplied by OMIM]
982	
983	
984	The product of this gene belongs to the family of transient receptor potential (TRP) channels. TRP channels are cation-selective channels important for cellular calcium signaling and homeostasis. The protein encoded by this gene mediates calcium entry, and this entry is potentiated by calcium store depletion. Alternatively spliced transcript variants encoding different isoforms have been identified.
985	This gene was identified by the reactivity of its encoded protein to a monoclonal antibody prepared against brain homogenates from patients with Alzheimer@#$%&s disease. Analysis of the original protein (fetal Alz-50 reactive clone 1, or FAC1), identified as an 810 aa protein containing a DNA-binding domain and a zinc finger motif, suggested it might play a role in the regulation of transcription. High levels of FAC1 were detected in fetal brain and in patients with neurodegenerative diseases. The protein encoded by this gene is actually much larger than originally thought, and it also contains a C-terminal bromodomain characteristic of proteins that regulate transcription during proliferation. The encoded protein is highly similar to the largest subunit of the Drosophila NURF (nucleosome remodeling factor) complex. In Drosophila, the NURF complex, which catalyzes nucleosome sliding on DNA and interacts with sequence-specific transcription factors, is necessary for the chromatin remodeling required for transcription. Two alternative transcripts encoding different isoforms have been described completely.
986	This gene is located in the nonrecombining portion of the Y chromosome, and expressed specifically in testis. It encodes a protein which has a low degree of similarity to protein tyrosine phosphatase, non-receptor type 13. Two nearly identical copies of this gene exist within a palindromic region. This record represents the more telomeric copy.
987	
988	
989	The protein encoded by this gene contains a leucine zipper-like motif and a proline-rich region that shares marked similarity with an SH3-binding domain. The protein localizes to the nucleus and is down-regulated in some cancer cell lines. It is thought to regulate the transcriptional response mediated by the nuclear factor kappa B (NF-kappaB). The gene has been proposed as a tumor suppressor gene whose protein product may have an important role in the development and/or progression of some cancers.
990	
991	The product of this gene belongs to the family of highly homologous synovial sarcoma X (SSX) breakpoint proteins. These proteins may function as transcriptional repressors. They are also capable of eliciting spontaneously humoral and cellular immune responses in cancer patients, and are potentially useful targets in cancer vaccine-based immunotherapy. SSX1, SSX2 and SSX4 genes have been involved in the t(X;18) translocation characteristically found in all synovial sarcomas. This gene appears not to be involved in this type of chromosome translocation. Two transcript variants encoding distinct isoforms have been identified for this gene.
992	
993	
994	This gene encodes an ubiquitin-like protein (ubiquilin) that shares high degree of similarity with related products in yeast, rat and frog. Ubiquilins contain a N-terminal ubiquitin-like domain and a C-terminal ubiquitin-associated domain. They physically associate with both proteasomes and ubiquitin ligases, and thus thought to functionally link the ubiquitination machinery to the proteasome to affect in vivo protein degradation. This ubiquilin has also been shown to bind the ATPase domain of the Hsp70-like Stch protein.
995	
996	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein and may have a role in the control of neurite outgrowth. It is known to complex with integrins. This gene is associated with X-linked mental retardation and neuropsychiatric diseases such as Huntington@#$%&s chorea, fragile X syndrome and myotonic dystrophy.
997	
998	This protein is a ubiquitously expressed nuclear protein and belongs to a highly conserved subfamily of WD-repeat proteins. It is found among several proteins that binds directly to retinoblastoma protein, which regulates cell proliferation. The encoded protein is found in many histone deacetylase complexes, including mSin3 co-repressor complex. It is also present in protein complexes involved in chromatin assembly. This protein can interact with BRCA1 tumor-suppressor gene and may have a role in the regulation of cell proliferation and differentiation.
999	The protein encoded by this gene is an antioxidant enzyme and belongs to the peroxiredoxin family. The protein is localized to the cytoplasm. Peroxidases of the peroxiredoxin family reduce hydrogen peroxide and alkyl hydroperoxides to water and alcohol with the use of reducing equivalents derived from thiol-containing donor molecules. This protein has been found to play a regulatory role in the activation of the transcription factor NF-kappaB.
1000	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation. Two transcript variants encoding different isoforms have been found for this gene.
1001	Progesterone binding protein is a putative steroid membrane receptor. The protein is expressed predominantly in the liver and kidney.
1002	
1003	CD183 is a G protein-coupled receptor with selectivity for three chemokines, termed IP10 (interferon-g-inducible 10 kDa protein), Mig (monokine induced by interferon-g) and I-TAC (interferon-inducible T cell a-chemoattractant). IP10, Mig and I-TAC belong to the structural subfamily of CXC chemokines, in which a single amino acid residue separates the first two of four highly conserved Cys residues. Historically, CD183 is the third CXC chemokine receptor discovered and, therefore, commonly designated as CXCR3. Binding of chemokines to CD183 induces cellular responses that are involved in leukocyte traffic, most notably integrin activation, cytoskeletal changes and chemotactic migration. Inhibition by Bordetella pertussis toxin suggests that heterotrimeric G protein of the Gi-subclass couple to CD183. Signal transduction has not been further analyzed but may include the same enzymes that were identified in the signaling cascade induced by other chemokine receptors. As a consequence of chemokine-induced cellular desensitization (phosphorylation-dependent receptor internalization), cellular responses are typically rapid and short in duration. Cellular responsiveness is restored after dephosphorylation of intracellular receptors and subsequent recycling to the cell surface. A hallmark of CD183 is its prominent expression in in vitro cultured effector/memory T cells, and in T cells present in many types of inflamed tissues. In addition, IP10, Mig and I-TAC are commonly produced by local cells in inflammatory lesion, suggesting that CD183 and its chemokines participate in the recruitment of inflammatory cells. Therefore, CD183 is a target for the development of small molecular weight antagonists, which may be used in the treatment of diverse inflammatory diseases. [PROW]
1004	
1005	
1006	This gene is a member of the XAGE subfamily, which belongs to the GAGE family. The GAGE genes are expressed in a variety of tumors and in some fetal and reproductive tissues. This gene is expressed in placenta and fetal liver/spleen, and may function in inhibiting cancer cell growth. The protein encoded by this gene shares a sequence similarity with other GAGE/PAGE proteins. Because of the expression pattern and the sequence similarity, this protein also belongs to a family of CT (cancer-testis) antigens. Alternative splicing of this gene generates 2 transcript variants differing in the 5@#$%& UTR.
1007	
1008	DNA methylation is the major modification of eukaryotic genomes and plays an essential role in mammalian development. Human proteins MECP2, MBD1, MBD2, MBD3, and MBD4 comprise a family of nuclear proteins related by the presence in each of a methyl-CpG binding domain (MBD). Each of these proteins, with the exception of MBD3, is capable of binding specifically to methylated DNA. MECP2, MBD1 and MBD2 can also repress transcription from methylated gene promoters. In contrast to other MBD family members, MECP2 is X-linked and subject to X inactivation.  MECP2 is dispensible in stem cells, but is essential for embryonic development. MECP2 gene mutations are the cause of some cases of Rett syndrome, a progressive neurologic developmental disorder and one of the most common causes of mental retardation in females.
1009	
1010	
1011	
1012	The protein encoded by this gene contains a RING finger domain, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions. Multiple alternatively spliced transcript variants have been suggested, but their full length natures are not clear.
1013	The protein encoded by this gene is a highly conserved, calcium-binding protein, that is preferentially expressed in the liver and kidney. It may have an important role in calcium homeostasis. Studies in rat indicate that this protein may also play a role in aging, as it shows age-associated down-regulation. This gene is part of a gene cluster on chromosome Xp11.3-Xp11.23. Alternative splicing results in two transcript variants having different 5@#$%& UTRs, but encoding the same protein.
1014	
1015	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation. This gene is highly similar to TBL1X gene in nucleotide sequence and protein sequence, but the TBL1X gene is located on chromosome X and this gene is on chromosome Y. This gene has three alternatively spliced transcript variants encoding the same protein.
1016	
1017	
1018	
1019	
1020	
1021	
1022	
1023	Histone acetylation and deacetylation, catalyzed by multisubunit complexes, play a key role in the regulation of eukaryotic gene expression. The protein encoded by this gene belongs to the histone deacetylase/acuc/apha family and is a component of the histone deacetylase complex. It also interacts with retinoblastoma tumor-suppressor protein and this complex is a key element in the control of cell proliferation and differentiation. Together with metastasis-associated protein-2, it deacetylates p53 and modulates its effect on cell growth and apoptosis.
1024	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. The former class is also called phylogenetically-restricted crystallins. This gene encodes a taxon-specific crystallin protein which has NADPH-dependent quinone reductase activity distinct from other known quinone reductases. It lacks alcohol dehydrogenase activity although by similarity it is considered a member of the zinc-containing alcohol dehydrogenase family. Unlike other mammalian species, in humans, lens expression is low. One pseudogene is known to exist.
1025	Although the specific function of this protein is not known yet, it is thought to be involved in protein-protein and protein-lipid interactions.
1026	
1027	There are 2 forms of glucose-6-phosphate dehydrogenase. G form is X-linked and H form, encoded by this gene, is autosomally linked. This H form shows activity with other hexose-6-phosphates, especially galactose-6-phosphate, whereas the G form is specific for glucose-6-phosphate. Both forms are present in most tissues, but H form is not found in red cells.
1028	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum. In rodents, the homologous protein has been shown to metabolize certain carcinogens; however, the specific function of the human protein has not been determined.
1029	Potassium channels are present in most mammalian cells, where they participate in a wide range of physiologic responses. The protein encoded by this gene is an integral membrane protein and inward-rectifier type potassium channel. It is activated by internal ATP and may be responsible for the potassium buffering action of glial cells in the brain. The encoded protein, which may form a heterodimer with another potassium channel protein, has a greater tendency to allow potassium to flow into a cell rather than out of a cell.
1030	
1031	PMVK (EC 2.7.4.2) is a peroxisomal enzyme that catalyzes the conversion of mevalonate 5-phosphate into mevalonate 5-diphosphate as the fifth reaction of the cholesterol biosynthetic pathway.[supplied by OMIM]
1032	This gene encodes a chromatin-associated enzyme, poly(ADP-ribosyl)transferase, which modifies various nuclear proteins by poly(ADP-ribosyl)ation. The modification is dependent on DNA and is involved in the regulation of various important cellular processes such as differentiation, proliferation, and tumor transformation and also in the regulation of the molecular events involved in the recovery of cell from DNA damage. In addition, this enzyme may be the site of mutation in Fanconi anemia, and may participate in the pathophysiology of type I diabetes.
1033	
1034	This gene encodes a member of the Rho family of GTPases. This protein can activate PAK1 and JNK1, and can induce filopodium formation and stress fiber dissolution. It may also mediate the effects of WNT1 signaling in the regulation of cell morphology, cytoskeletal organization, and cell proliferation.
1035	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S15P family. The encoded protein is more than two times the size of its E. coli counterpart, with the 12S rRNA binding sites conserved. Between human and mouse, the encoded protein is the least conserved among small subunit ribosomal proteins. Pseudogenes corresponding to this gene are found on chromosomes 15q and 19q.
1036	LIM proteins are defined by the possession of a highly conserved double zinc finger motif called the LIM domain.[supplied by OMIM]
1037	
1038	
1039	Hook proteins are cytosolic coiled-coil proteins that contain conserved N-terminal domains, which attach to microtubules, and more divergent C-terminal domains, which mediate binding to organelles. The Drosophila Hook protein is a component of the endocytic compartment.[supplied by OMIM]
1040	Fibromodulin is a member of a family of small interstitial proteoglycans, containing a central region composed of leucine-rich repeats with 4 keratan sulfate chains flanked by disulfide-bonded terminal domains. It may participate in the assembly of the extracellular matrix as it interacts with type I and type II collagen fibrils and inhibits fibrillogenesis in vitro. It may also regulate TGF-beta activities by sequestering TGF-beta into the extracellular matrix.
1041	
1042	
1043	
1044	
1045	
1046	
1047	
1048	This gene encodes a protein containing thrombospondin type 1 repeats (TSRs). TSRs are extracellular protein domains that often contain a protein binding site. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
1049	Alpha actinins belong to the spectrin gene superfamily which represents a diverse group of cytoskeletal proteins, including the alpha and beta spectrins and dystrophins.  Alpha actinin is an actin-binding protein with multiple roles in different cell types.  In nonmuscle cells, the cytoskeletal isoform is found along microfilament bundles and adherens-type junctions, where it is involved in binding actin to the membrane.  In contrast, skeletal, cardiac, and smooth muscle isoforms are localized to the Z-disc and analogous dense bodies, where they help anchor the myofibrillar actin filaments.  This gene encodes a muscle-specific, alpha actinin isoform that is expressed in both skeletal and cardiac muscles.  Transcript variants resulting from the use of multiple poly_A sites have been observed.
1050	
1051	
1052	A number of specific carrier proteins for members of the vitamin A family have been discovered.  Cellular retinoic acid binding proteins (CRABP) are low molecular weight proteins whose precise function remains unknown.  The inducibility of the CRABP2 gene suggests that this isoform is important in retinoic acid-mediated regulation of human skin growth and differentiation.  It has been postulated that the CRABP2 gene is transcriptionally regulated by a newly synthesized regulatory protein.
1053	
1054	
1055	
1056	
1057	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein localizes to the nucleus and the cytoplasm. Its function has not been identified.
1058	This gene encodes a protein that specifically complexes with the C-terminus of hepatitis B virus X protein (HBx). The function of this protein is to negatively regulate HBx activity and thus to alter the replication life cycle of the virus.
1059	
1060	
1061	
1062	
1063	
1064	
1065	
1066	
1067	
1068	The protein encoded by this gene is known to associate with HS1, a substrate of Src family tyrosine kinases. It also interacts with the product of PKD2 gene, mutations in which are associated with autosomal-dominant polycystic kidney disease, and with F-actin-binding protein, cortactin. It was earlier thought that this gene product is mainly localized in the mitochondria, however, recent studies indicate it to be localized in the cell body. Two transcript variants encoding different isoforms have been found for this gene.
1069	
1070	The protein encoded by this gene, pre-angiotensinogen or angiotensinogen precursor, is expressed in the liver and is cleaved by the enzyme renin in response to lowered blood pressure. The resulting product, angiotensin I is then cleaved by angiotensin converting enzyme (ACE) to generate the physiologically active enzyme angiotensin II. The protein is involved in maintaining blood pressure and in the pathogenesis of essential hypertension and preeclampsia.
1071	This gene was identified by its translocation in a case of mucosa-associated lymphoid tissue (MALT) lymphoma. The protein encoded by this gene contains a caspase recruitment domain (CARD), and has been shown to induce apoptosis and to activate NF-kappaB. This protein is reported to interact with other CARD domain containing proteins including CARD9, 10, 11 and 14, which are thought to function as upstream regulators in NF-kappaB signaling. This protein is found to form a complex with MALT1, a protein encoded by another gene known to be translocated in MALT lymphoma. MALT1 and this protein are thought to synergize in the activation of NF-kappaB, and the deregulation of either of them may contribute to the same pathogenetic process that leads to the malignancy.
1072	
1073	
1074	
1075	
1076	Aldo-keto reductases, such as AKR7A2, are involved in the detoxification of aldehydes and ketones.[supplied by OMIM]
1077	
1078	
1079	
1080	
1081	Fucosidosis is an autosomal recessive lysosomal storage disease caused by defective alpha-L-fucosidase with accumulation of fucose in the tissues. Different phenotypes include clinical features such as neurologic deterioration, growth retardation, visceromegaly, and seizures in a severe early form; coarse facial features, angiokeratoma corporis diffusum, spasticity and delayed psychomotor development in a longer surviving form; and an unusual spondylometaphyseoepiphyseal dysplasia in yet another form.[supplied by OMIM]
1082	The protein encoded by this gene is a cytokine that belongs to the tumor necrosis factor (TNF) ligand family. This cytokine is a ligand for receptor TNFRSF18/AITR/GITR. It has been shown to modulate T lymphocyte survival in peripheral tissues. This cytokine is also found to be expressed in endothelial cells, and is thought to be important for interaction between T lymphocytes and endothelial cells.
1083	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a cytoplasmic ribosomal protein that is a component of the 60S subunit. The protein belongs to the L22E family of ribosomal proteins. Its initiating methionine residue is post-translationally removed. The protein can bind specifically to Epstein-Barr virus-encoded RNAs (EBERs) 1 and 2. The mouse protein has been shown to be capable of binding to heparin. Transcript variants utilizing alternative polyA signals exist. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome. It was previously thought that this gene mapped to 3q26 and that it was fused to the acute myeloid leukemia 1 (AML1) gene located at 21q22 in some therapy-related myelodysplastic syndrome patients with 3;21 translocations; however, these fusions actually involve a ribosomal protein L22 pseudogene located at 3q26, and this gene actually maps to 1p36.3-p36.2.
1084	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein has a Ca2+ affinity 20- to 100-fold higher than the other S100 proteins studied under identical conditions. This protein also binds Zn2+ and Cu2+, and Cu2+ strongly which impairs the binding of Ca2+. This protein is expressed in very restricted regions of the adult brain.
1085	
1086	
1087	
1088	
1089	This gene encodes a member of the polycystin protein family. Members of this protein family may function in renal tubular development, morphology, and function, and may modulate intracellular calcium homoeostasis and other signal transduction pathways. Two transcript variants encoding distinct isoforms have been identified for this gene.
1090	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
1091	
1092	
1093	The protein encoded by this gene is predicted to be a five-pass transmembrane protein. This gene may be predominately expressed in brain.
1094	
1095	
1096	
1097	
1098	This intronless gene encodes a protein that may play a role in organelle biogenesis associated with melanosomes, platelet dense granules, and lysosomes. This protein interacts with Hermansky-Pudlak syndrome 5 protein. Mutations in this gene are associated with Hermansky-Pudlak syndrome type 6.
1099	
1100	The protein encoded by this gene contains a RING finger motif, which is most closely related to those of polycomb group (PcG) proteins RNF110/MEL-18 and BMI1. PcG proteins are known to form protein complexes and function as transcription repressors. This protein has been shown to interact with some PcG proteins and act as a transcription repressor. The activity of this protein is found to be regulated by cell cycle dependent phosphorylation. Alternatively spliced transcript variants encoding different isoforms have been identified.
1101	
1102	
1103	
1104	
1105	
1106	
1107	
1108	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is localized in nuclear bodies of the nucleus. This protein is involved in the splicing process and it also participates in early heat shock-induced splicing arrest by transiently leaving the hnRNP complexes. Multiple alternative transcript variants seem to be present for this gene and some appear to have intronic regions in the mRNA. Presently, only two transcript variants are fully described.
1109	
1110	
1111	
1112	MINPP1 hydrolyzes the abundant metabolites inositol pentakisphosphate and inositol hexakisphosphate and, like PTEN (MIM 601728), has the ability to remove 3-phosphate from inositol phosphate substrates.[supplied by OMIM]
1113	
1114	The protein encoded by this gene is a peroxisomal enzyme. It catalyzes the initial alpha-oxidation step in the degradation of phytanic acid and converts phytanoyl-CoA to 2-hydroxyphytanoyl-CoA. It interacts specifically with the immunophilin FKBP52. Refsum disease, an autosomal recessive neurologic disorder, is caused by the deficiency of this encoded protein.
1115	
1116	
1117	
1118	The protein encoded by this gene is a regulator of the pancreatic beta-cell function. It is highly similar to JIP-1, a mouse protein known to be a regulator of c-Jun amino-terminal kinase (Mapk8). This protein has been shown to prevent MAPK8 mediated activation of transcription factors, and decrease IL-1 beta and MAP kinase kinase 1 (MEKK1) induced apoptosis in pancreatic beta cells. This protein also functions as a DNA-binding transactivator of the glucose transporter GLUT2. RE1-silencing transcription factor (REST) is reported to repress the expression of this gene in insulin-secreting beta cells. This gene is found to be mutated in a type 2 diabetes family, and thus is thought to be a susceptibility gene for type 2 diabetes.
1119	The SR (serine/arginine-rich) family contains a number of phosphoproteins that function as essential and alternative splicing factors. The SR family of proteins is characterized by the presence of a ribonucleoprotein (RNP)-type RNA binding motif and a carboxyl-terminal arginine-serine-rich (RS) domain. The protein encoded by this gene is a member of the SR family and functions as an essential splicing factor in vitro. This gene is thought to be an expressed PR264/SC35 retropseudogene.
1120	The activation of gene transcription is a multistep process that is triggered by factors that recognize transcriptional enhancer sites in DNA. These factors work with co-activators to direct transcriptional initiation by the RNA polymerase II apparatus. The protein encoded by this gene is a subunit of the CRSP (cofactor required for SP1 activation) complex, which, along with TFIID, is required for efficient activation by SP1. This protein is also a component of other multisubunit complexes e.g. thyroid hormone receptor-(TR-) associated proteins which interact with TR and facilitate TR function on DNA templates in conjunction with initiation factors and cofactors.
1121	
1122	This gene encodes a protein that may function as a ribonuclease. An alternative splice variant has been described but its biological nature has not been determined. A pseudogene for this gene has been identified on chromosome Y, near the sex determining region Y (SRY) gene.
1123	The protein encoded by this gene is highly homologous to rat dipeptidyl peptidase III, which has been shown to be a zinc metallo-exopeptidase. Both enzymes contain the HELLGH motif that is involved in zinc binding and catalytic activity. Two transcript variants encoding different isoforms have been found for this gene.
1124	
1125	
1126	
1127	
1128	
1129	
1130	
1131	Voltage-sensitive sodium channel beta subunits, such as SCN3B, are auxiliary components that regulate the ion-conducting alpha subunits.[supplied by OMIM]
1132	Neurogranin (NRGN) is the human homolog of the neuron-specific rat RC3/neurogranin gene. This gene encodes a postsynaptic protein kinase substrate that binds calmodulin in the absence of calcium. The NRGN gene contains four exons and three introns. The exons 1 and 2 encode the protein and exons 3 and 4 contain untranslated sequences. It is suggested that the NRGN is a direct target for thyroid hormone in human brain, and that control of expression of this gene could underlay many of the consequences of hypothyroidism on mental states during development as well as in adult subjects.
1133	
1134	
1135	
1136	This gene encodes a member of the membrane-spanning 4A gene family. Members of this nascent protein family are characterized by common structural features and similar intron/exon splice boundaries and display unique expression patterns among hematopoietic cells and nonlymphoid tissues. This gene encodes a B-lymphocyte surface molecule which plays a role in the development and differentiation of B-cells into plasma cells. This family member is localized to 11q12, among a cluster of family members. Alternative splicing of this gene results in several transcript variants which encode the same protein.
1137	In mammalian cells, 2 broad groups of centromere-interacting proteins have been described: constitutively binding centromere proteins and @#$%&passenger,@#$%& or transiently interacting, proteins (reviewed by Choo, 1997). The constitutive proteins include CENPA (centromere protein A; MIM 117139), CENPB (MIM 117140), CENPC1 (MIM 117141), and CENPD (MIM 117142). The term @#$%&passenger proteins@#$%& encompasses a broad collection of proteins that localize to the centromere during specific stages of the cell cycle (Earnshaw and Mackay, 1994). These include CENPE (MIM 117143); MCAK (MIM 604538); KID (MIM 603213); cytoplasmic dynein (e.g., MIM 600112); CliPs (e.g., MIM 179838); and CENPF/mitosin (MIM 600236). The inner centromere proteins (INCENPs) (Earnshaw and Cooke, 1991), the initial members of the passenger protein group, display a broad localization along chromosomes in the early stages of mitosis but gradually become concentrated at centromeres as the cell cycle progresses into mid-metaphase. During telophase, the proteins are located within the midbody in the intercellular bridge, where they are discarded after cytokinesis (Cutts et al., 1999).[supplied by OMIM]
1138	
1139	
1140	
1141	
1142	
1143	
1144	This gene encodes a member of the ubiquitin-specific processing (UBP) family of proteases that is a deubiquitinating enzyme (DUB) with His and Cys domains. This protein is located in the cytoplasm and cleaves the ubiquitin moiety from ubiquitin-fused precursors and ubiquitinylated proteins. The protein specifically deubiquitinates a protein in the Fanconi anemia (FA) DNA repair pathway. Alternate transcriptional splice variants have been characterized.
1145	
1146	
1147	The galectins are a family of beta-galactoside-binding proteins implicated in modulating cell-cell and cell-matrix interactions. The protein encoded by this gene is an S-type lectin. This galectin is strongly overexpressed in Hodgkin@#$%&s disease tissue and it might participate in the interaction between the H&RS cells with their surrounding cells and might thus play a role in the pathogenesis of this disease and/or its consistently associated immunodeficiency. The protein has N- and C- terminal carbohydrate-binding domains connected by a link peptide. Two isoforms (long and short) exist.
1148	Calpains are calcium-dependent cysteine proteases involved in signal transduction in a variety of cellular processes. A functional calpain protein consists of an invariant small subunit and 1 of a family of large subunits. CAPN5 is one of the large subunits. Unlike some of the calpains, CAPN5 and CAPN6 lack a calmodulin-like domain IV. Because of the significant similarity to Caenorhabditis elegans sex determination gene tra-3, CAPN5 is also called as HTRA3. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
1149	
1150	Malic enzyme catalyzes the oxidative decarboxylation of malate to pyruvate using either NAD+ or NADP+ as a cofactor. Mammalian tissues contain 3 distinct isoforms of malic enzyme: a cytosolic NADP(+)-dependent isoform, a mitochondrial NADP(+)-dependent isoform, and a mitochondrial NAD(+)-dependent isoform. This gene encodes a mitochondrial NADP(+)-dependent isoform. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of some variants has not been determined.
1151	
1152	This gene encodes 54-kD nuclear protein that contains an arginine/serine-rich region similar to segments found in pre-mRNA splicing factors. Although the function of this protein is not yet known, structure and immunolocalization data suggest that it may play a role in pre-mRNA processing.
1153	
1154	
1155	The protein encoded by this gene is a member of the SWI/SNF family of proteins, whose members display helicase and ATPase activities and which are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI and contains a predicted leucine zipper motif typical of many transcription factors. Two transcript variants encoding different isoforms have been found for this gene.
1156	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S26E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
1157	
1158	The protein encoded by this gene is member of the G-protein coupled receptor family. This protein is a receptor for prostaglandin F2-alpha (PGF2-alpha), which is known to be a potent luteolytic agent, and may also be involved in modulating intraocular pressure and smooth muscle contraction in uterus. Knockout studies in mice suggest that the interaction of PGF2-alpha with this receptor may initiate parturition in ovarian luteal cells and thus induce luteolysis.
1159	This gene encodes a member of the PIWI subfamily of Argonaute proteins, evolutionarily conserved proteins containing both PAZ and Piwi motifs that play important roles in stem cell self-renewal, RNA silencing, and translational regulation in diverse organisms. The encoded protein may play a role as an intrinsic regulator of the self-renewal capacity of germline and hematopoietic stem cells.
1160	
1161	
1162	
1163	
1164	Cytosolic phospholipase A2 (PLA2; EC 3.1.1.4) catalyzes the release of arachidonic acid from membrane phospholipids. Arachidonic acid in turn serves as precursor for a wide spectrum of biologic effectors, collectively known as eicosanoids, that are involved in hemodynamic regulation, inflammatory responses, and other cellular processes. (Eicosanoids are lipid mediators of inflammation; they include a variety of compounds (prostaglandins, thromboxanes, leukotrienes, hydroxy- and epoxy-fatty acids, lipoxins, and isoprostanes) that are derived from the ubiquitous 20-carbon atom arachidonate (20 in Greek is @#$%&eicosa@#$%&) and a few similar polyunsaturated fatty acids.) Dennis (1994) reviewed various types of PLA2. The best characterized is the group II PLA2 originally isolated from human synovial fluid (i.e., PLA2G2A; MIM 172411). Groups I (human pancreatic, PLA2G1B (MIM 172410); also found in cobras and kraits), II (synovial; also found in rattlesnakes and vipers), and III (forms of which were described in bee and lizard) are secreted; group IV is cytosolic.[supplied by OMIM]
1165	The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is expressed largely in the upper spinous layer of epidermal keratinocytes and mutations in this gene have been associated with bullous congenital ichthyosiform erythroderma. The type II cytokeratins are clustered in a region of chromosome 12q12-q13.
1166	
1167	
1168	
1169	Alpha-2-macroglobulin is a protease inhibitor and cytokine transporter. It inhibits many proteases, including trypsin, thrombin and collagenase. A2M is implicated in Alzheimer disease (AD) due to its ability to mediate the clearance and degradation of A-beta, the major component of beta-amyloid deposits.
1170	This gene encodes a protein that contains a mono-ADP-ribosylation (ART) motif. It is a member of the ADP-ribosyltransferase gene family but enzymatic activity has not been demonstrated experimentally. Antigens of the Dombrock blood group system are located on the gene product, which is glycosylphosphatidylinosotol-anchored to the erythrocyte membrane. Allelic variants, some of which lead to adverse transfusion reactions, are known.
1171	Lymphocyte-activation protein 3 belongs to Ig superfamily and contains 4 extracellular Ig-like domains. The LAG3 gene contains 8 exons.  The sequence data, exon/intron organization, and chromosomal localization all indicate a close relationship of LAG3 to CD4.
1172	
1173	
1174	
1175	
1176	
1177	
1178	
1179	The full-length protein encoded by this gene is an intracellular tetrapyrrole-binding protein. This protein includes a natural chemoattractant peptide of 21 amino acids at the N-terminus, which is a natural ligand for formyl peptide receptor-like receptor 2 (FPRL2) and promotes calcium mobilization and chemotaxis in monocytes and dendritic cells.
1180	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities and are involved in a variety of biological processes including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. The product of this gene inhibits renal tubular phosphate transport. This gene was identified by its mutations associated with autosomal dominant hypophosphatemic rickets (ADHR), an inherited phosphate wasting disorder. Abnormally high level expression of this gene was found in oncogenic hypophosphatemic osteomalacia (OHO), a phenotypically similar disease caused by abnormal phosphate metabolism. Mutations in this gene have also been shown to cause familial tumoral calcinosis with hyperphosphatemia.
1181	Aquaporin 5 (AQP5) is a water channel protein.  Aquaporins are a family of small integral membrane proteins related to the major intrinsic protein (MIP or AQP0). Aquaporin 5 plays a role in the generation of saliva, tears and pulmonary secretions. AQP0, AQP2, AQP5, and AQP6 are closely related and all map to 12q13.
1182	
1183	The protein encoded by this gene belongs to a small class of the protein tyrosine phosphatase (PTP) family. PTPs are cell signaling molecules that play regulatory roles in a variety of cellular processes. PTPs in this class contain a protein tyrosine phosphatase catalytic domain and a characteristic C-terminal prenylation motif. This PTP has been shown to primarily associate with plasmic and endosomal membrane through its C-terminal prenylation. This PTP was found to interact with the beta-subunit of Rab geranylgeranyltransferase II (beta GGT II), and thus may function as a regulator of GGT II activity. Overexpression of this gene in mammalian cells conferred a transformed phenotype, which suggested its role in tumorigenesis. Three alternatively spliced transcript variants, which encode two distinct isoforms, have been described.
1184	The protein encoded by this gene is a transcription factor of the PR-domain protein family. It contains a PR-domain and multiple zinc finger motifs. Transcription factors of the PR-domain family are known to be involved in cell differentiation and tumorigenesis. An elevated expression level of this gene has been observed in PC12 cells treated with nerve growth factor, beta polypeptide (NGF). This gene is located in a chromosomal region that is thought to contain tumor suppressor genes.
1185	
1186	This gene encodes a protein with an N-terminal RCC1 domain and a C-terminal BTB (broad complex, tramtrack and bric-a-brac) domain. In rat, over-expression of this gene in vascular smooth muscle cells induced cellular hypertrophy. In rat, the C-terminus of RCBTB1 interacts with the angiotensin II receptor-1A. In humans, this gene maps to a region of chromosome 13q that is frequently deleted in B-cell chronic lymphocytic leukemia and other lymphoid malignancies.
1187	This gene encodes poly(ADP-ribosyl)transferase-like 1 protein, which contains a catalytic domain and is capable of catalyzing a poly(ADP-ribosyl)ation reaction. This protein has a catalytic domain which is homologous to that of poly (ADP-ribosyl) transferase, but lacks an N-terminal DNA binding domain which activates the C-terminal catalytic domain of poly (ADP-ribosyl) transferase. Since this protein is not capable of binding DNA directly, its transferase activity may be activated by other factors such as protein-protein interaction mediated by the extensive carboxyl terminus.
1188	
1189	
1190	Excision repair cross-complementing rodent repair deficiency, complementation group 5 (xeroderma pigmentosum, complementation group G) is involved in excision repair of UV-induced DNA damage. Mutations cause Cockayne syndrome, which is characterized by severe growth defects, mental retardation, and cachexia. Multiple alternatively spliced transcript variants encoding distinct isoforms have been described, but the biological validity of all variants has not been determined.
1191	This gene encodes a tumor suppressor protein that can induce cell growth arrest and apoptosis. The encoded protein is a nuclear protein that physically interacts with the tumor suppressor protein TP53 and is a component of the p53 signaling pathway. Reduced expression and rearrangement of this gene have been detected in various cancers. Multiple alternatively spliced transcript variants encoding distinct isoforms have been reported.
1192	
1193	The receptors for glycoprotein hormones such as follicle-stimulating hormone (FSH; see MIM 136530) and thyroid-stimulating hormone (TSH; see MIM 188540) are G protein-coupled, 7-transmembrane receptors (GPCRs) with large N-terminal extracellular domains. Leucine-rich repeat (LRR)-containing GPCRs (LGRs) form a subgroup of the GPCR superfamily.[supplied by OMIM]
1194	Mitochondrial intermediate peptidase performs the final step in processing a specific class of nuclear-encoded proteins targeted to the mitochondrial matrix or inner membrane. The MIPEP open reading frame encodes a 713-amino acid protein that is 92% similar to rat MIP and 54% similar to yeast MIP. Human MIPEP is primarily involved in the maturation of oxidative phosphorylation (OXPHOS)-related proteins. Human MIPEP may contribute to the functional effects of frataxin deficiency and the clinical manifestations of Friedreich ataxia.
1195	The neuronal ceroid lipofuscinoses (CLN or NCL) are a group of autosomal recessive, progressive encephalopathies in children. They are characterized by psychomotor deterioration, visual failure, and the accumulation of autofluorescent lipopigment in neurons and other cell types. The main childhood forms are the infantile type (Santavuori-Haltia disease; MIM 256730), the late infantile type (Jansky-Bielschowsky disease; MIM 204500), and the juvenile type (Batten disease; MIM 204200) based on the age of onset, clinical course, neurologic and ophthalmologic findings, and ultrastructural analysis (Carpenter et al., 1977 [PubMed 193610]).[supplied by OMIM]
1196	ZFP67 is an early growth response gene that encodes a zinc finger-containing transcription factor that binds to the promoter regions of type I collagen genes (e.g., COL1A1; MIM 120150) and has a role in development.[supplied by OMIM]
1197	This gene encodes coagulation factor VII which is a vitamin K-dependent factor essential for hemostasis. This factor circulates in the blood in a zymogen form, and is converted to an active form by either factor IXa, factor Xa, factor XIIa, or thrombin by minor proteolysis. Upon activation of the factor VII, a heavy chain containing a catalytic domain and a light chain containing 2 EGF-like domains are generated, and two chains are held together by a disulfide bond. In the presence of factor III and calcium ions, the activated factor then further activates the coagulation cascade by converting factor IX to factor IXa and/or factor X to factor Xa. Alternative splicing of this gene results in 2 transcripts. Defects in this gene can cause coagulopathy.
1198	The protein encoded by this gene is an extracellular serine protease which binds hyaluronic acid. It is involved in cell adhesion. The protein is synthesized as a single chain, but then undergoes an autoproteolytic event to form the functional heterodimer. Further autoproteolysis leads to smaller, inactive peptides. Two transcript variants utilizing alternative polyA sites exist for this gene.
1199	
1200	
1201	Corticosteroid-binding globulin, otherwise known as transcortin, is an alpha-globulin.  CBG localizes to a region containing two closely related serine protease inhibitors which may have evolved by duplication events.
1202	
1203	
1204	
1205	This gene belongs to the forkhead family of transcription factors which is characterized by a  distinct forkhead domain.  The specific function of this gene has not yet been determined; however, it may play a role in the development of the brain and the telencephalon.
1206	This gene encodes one of the three subunits of the RNA polymerase III transcription factor complex. This complex plays a central role in transcription initiation by RNA polymerase III on genes encoding tRNA, 5S rRNA, and other small structural RNAs. The gene product belongs to the TF2B family. Three alternatively spliced variants encoding three different isoforms, that function at different promoters transcribed by RNA polymerase III, have been identified. A transcript encoding a fourth isoform has not yet been completely characterized.
1207	
1208	This gene encodes a WD repeat-containing protein. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
1209	
1210	
1211	
1212	This gene encodes a protein that possesses three distinct enzymatic activities, 5,10-methylenetetrahydrofolate dehydrogenase, 5,10-methenyltetrahydrofolate cyclohydrolase and 10-formyltetrahydrofolate synthetase. Each of these activities catalyzes one of three sequential reactions in the interconversion of 1-carbon derivatives of tetrahydrofolate, which are substrates for methionine, thymidylate, and de novo purine syntheses. The trifunctional enzymatic activities are conferred by two major domains, an aminoterminal portion containing the dehydrogenase and cyclohydrolase activities and a larger synthetase domain.
1213	
1214	The protein encoded by this gene is a ubiquitously expressed nuclear protein. It binds directly, with several other proteins, to retinoblastoma protein (pRB) which regulates cell proliferation. pRB represses transcription by recruiting the encoded protein. This protein, in turn, serves as a bridging molecule to recruit HDACs and, in addition, provides a second HDAC-independent repression function. The encoded protein possesses transcriptional repression activity. Multiple alternatively spliced transcripts have been observed for this gene, although not all transcript variants have been fully described.
1215	Members of the F-box protein family, such as FBXO34, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
1216	
1217	The RAS oncogene (MIM 190020) is mutated in nearly one-third of all human cancers. Members of the RAS superfamily are plasma membrane GTP-binding proteins that modulate intracellular signal transduction pathways. A subfamily of RAS effectors, including RASSF3, share a RAS association (RA) domain.[supplied by OMIM]
1218	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds the immunosuppressants FK506 and rapamycin. It has a higher affinity for rapamycin than for FK506 and thus may be an important target molecule for immunosuppression by rapamycin.
1219	
1220	
1221	Cytolytic T lymphocytes (CTL) and natural killer (NK) cells share the remarkable ability to recognize, bind, and lyse specific target cells. They are thought to protect their host by lysing cells bearing on their surface @#$%&nonself@#$%& antigens, usually peptides or proteins resulting from infection by intracellular pathogens. The protein encoded by this gene is crucial for the rapid induction of target cell apoptosis by CTL in cell-mediated immune response.
1222	The protein encoded by this gene is involved in repairing stalled topoisomerase I-DNA complexes by catalyzing the hydrolysis of the phosphodiester bond between the tyrosine residue of topoisomerase I and the 3-prime phosphate of DNA. This protein may also remove glycolate from single-stranded DNA containing 3-prime phosphoglycolate, suggesting a role in repair of free-radical mediated DNA double-strand breaks. This gene is a member of the phospholipase D family and contains two PLD phosphodiesterase domains. Mutations in this gene are associated with the disease spinocerebellar ataxia with axonal neuropathy (SCAN1). While several transcript variants may exist for this gene, the full-length natures of only two have been described to date. These two represent the major variants of this gene and encode the same isoform.
1223	Machado-Joseph disease is an autosomal dominant neurologic disorder, and is now known to be the same as previously described spinocerebellar ataxia-3. The protein encoded by this gene contains (CAG)n repeats in the coding region, and the expansion of these repeats from the normal 13-36 to 68-79 is the cause of Machado-Joseph disease. There is a negative correlation between the age of onset and CAG repeat numbers. Transcript variants encoding different isoforms (not involving the (CAG)n repeats) and utilizing alternative polyA_sites have been described for this gene.
1224	This gene encodes a member of the Argonaute family of proteins which play a role in RNA interference. The encoded protein is highly basic, and contains a PAZ domain and a PIWI domain. It may interact with dicer1 and play a role in short-interfering-RNA-mediated gene silencing. This gene is located on chromosome 1 in a cluster of closely related family members including argonaute 3, and argonaute 4.
1225	
1226	The biogenic amine histamine is an important modulator of numerous physiologic processes, including neurotransmission, gastric acid secretion, and smooth muscle tone. The biosynthesis of histamine from histidine is catalyzed by the enzyme L-histidine decarboxylase (HDC; EC 4.1.1.22). This homodimeric enzyme is a pyridoxal phosphate (PLP)-dependent decarboxylase and is highly specific for its histidine substrate.[supplied by OMIM]
1227	
1228	
1229	
1230	
1231	
1232	
1233	
1234	
1235	
1236	
1237	The protein encoded by this gene is a type II membrane protein that is thought to catalyze the transfer of sialic acid from CMP-sialic acid to N-linked oligosaccharides and glycoproteins. The encoded protein may be found in the Golgi apparatus and may be involved in the production of polysialic acid, a modulator of the adhesive properties of neural cell adhesion molecule (NCAM1). This protein is a member of glycosyltransferase family 29.
1238	
1239	
1240	Casein kinase I is the most abundant serine/threonine kinase in eukaryotic cell extracts. Multiple isoforms of the enzyme exist. The gamma-1 isoform is involved in growth and morphogenesis of eukaryotic cells.[supplied by OMIM]
1241	This intronless gene encodes a member of the MAGE superfamily. The encoded protein is 76% identical to the mouse mage-g1 protein.
1242	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family, and is the first signaling molecule shown to influence the sex-determination cascade. It encodes a protein which shows 98% amino acid identity to the Wnt4 protein of mouse and rat. This gene and DAX1, a gene known to antagonize the testis-determining factor, play a concerted role in both the control of female development and the prevention of testes formation. This gene and another two family members, WNT2 and WNT7B, may be associated with abnormal proliferation in breast tissue.
1243	The muscarinic cholinergic receptors belong to a larger family of G protein-coupled receptors. The functional diversity of these receptors is defined by the binding of acetylcholine and includes cellular responses such as adenylate cyclase inhibition, phosphoinositide degeneration, and potassium channel mediation. Muscarinic receptors influence many effects of acetylcholine in the central and peripheral nervous system. The clinical implications of this receptor are unknown; however, stimulation of this receptor is known to increase cyclic AMP levels.
1244	A number of specific carrier proteins for members of the vitamin A family have been discovered. Cellular retinoic acid-binding protein is assumed to play an important role in retinoic acid-mediated differentiation and proliferation processes.  CRABP1 is structurally similar to the cellular retinol-binding proteins, but binds only retinoic acid. CRABP1 is constitutively expressed and is believed to have different functions in the cell than the related CRABP2.
1245	Transglutaminases (TGM; EC 2.3.2.13) are a family of structurally and functionally related enzymes that stabilize protein assemblies through the formation of gamma-glutamyl-epsilon lysine crosslinks. For additional background information on transglutaminases, see TGM1 (MIM 190195).[supplied by OMIM]
1246	
1247	
1248	The enzyme system for cleavage of glycine (glycine cleavage system; EC 2.1.2.10), which is confined to the mitochondria, is composed of 4 protein components: P protein (a pyridoxal phosphate-dependent glycine decarboxylase; MIM 238300), H protein (a lipoic acid-containing protein), T protein (a tetrahydrofolate-requiring enzyme; MIM 238310), and L protein (a lipoamide dehydrogenase; MIM 238331). Glycine encephalopathy (GCE; MIM 605899), also called nonketotic hyperglycinemia (NKH), may be due to a defect in any one of these enzymes.[supplied by OMIM]
1249	
1250	
1251	
1252	Microtubules, polymers of alpha and beta tubulin subunits, form the mitotic spindle of a dividing cell and help to organize membranous organelles during interphase. Katanin is a heterodimer that consists of a 60 kDa ATPase (p60 subunit A 1) and an 80 kDa accessory protein (p80 subunit B 1). The p60 subunit acts to sever and disassemble microtubules, while the p80 subunit targets the enzyme to the centrosome. Katanin is a member of the AAA family of ATPases.
1253	
1254	
1255	
1256	
1257	
1258	The protein encoded by this gene is a cytosolic factor that facilitates protein transport into the nucleus. It interacts with the nuclear pore complex glycoprotein p62. This encoded protein acts at a relative late stage of nuclear protein import, subsequent to the initial docking of nuclear import ligand at the nuclear envelope. It is thought to be part of a multicomponent system of cytosolic factors that assemble at the pore complex during nuclear import.
1259	
1260	This gene encodes one of the two alpha chains of type XI collagen, a minor fibrillar collagen. Type XI collagen is a heterotrimer but the third alpha chain is a post-translationally modified alpha 1 type II chain. Mutations in this gene are associated with type II Stickler syndrome and with Marshall syndrome. Three transcript variants encoding different isoforms have been identified for this gene.
1261	The product of this gene is a cytokine. Cytokines are small proteins that have an essential role in the immune and inflammatory responses. This gene is one of several chemokine-like factor genes located in a cluster on chromosome 16. The protein encoded by this gene is a potent chemoattractant for neutrophils, monocytes and lymphocytes. It also can stimulate the proliferation of skeletal muscle cells. This protein may play important roles in inflammation and in the regeneration of skeletal muscle. Alternatively spliced transcript variants encoding different isoforms have been identified.
1262	
1263	
1264	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
1265	
1266	The protein encoded by this gene belongs to the RGS (regulator of G protein signaling) family. Members of the RGS family act as GTPase-activating proteins on the alpha subunits of heterotrimeric, signal-transducing G proteins. This protein inhibits signal transduction by increasing the GTPase activity of G protein alpha subunits, thereby driving them into their inactive GDP-bound form. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
1267	
1268	Heparan sulfate biosynthetic enzymes are key components in generating a myriad of distinct heparan sulfate fine structures that carry out multiple biologic activities. The enzyme encoded by this gene is a member of the heparan sulfate biosynthetic enzyme family. It is a type II integral membrane protein and possesses heparan sulfate glucosaminyl 3-O-sulfotransferase activity. This gene is expressed predominantly in brain and may play a role in the nervous system.
1269	
1270	
1271	
1272	
1273	This gene belongs to the forkhead family of transcription factors which is characterized by a distinct forkhead domain.  The specific function of this gene has not yet been determined; however, it may play a role in the regulation of pulmonary genes as well as embryonic development.
1274	
1275	This gene is identified by the similarity of its product to the N-terminal region of BCL7A protein. The BCL7A protein is encoded by the gene known to be directly involved in a three-way gene translocation in a Burkitt lymphoma cell line. The function of this gene has not yet been determined.
1276	The enzyme encoded by this gene is classified as a thiolesterase and is responsible for the hydrolysis of S-lactoyl-glutathione to reduced glutathione and D-lactate.
1277	DNAJA3 belongs to the evolutionarily conserved DNAJ/HSP40 family of proteins. For background information on the DNAJ family, see MIM 608375.[supplied by OMIM]
1278	
1279	
1280	
1281	
1282	This gene encodes a Ras-related protein that is stimulated by dexamethasone. The exact function of this gene is unknown, but it may play a role in dexamethasone-induced alterations in cell morphology, growth and cell-extracellular matrix interactions.  In addition, studies of a similar rat protein suggest that it functions as as a novel physiologic nitric oxide (NO) effector. The gene product belongs to the Ras superfamily of small GTPases.
1283	
1284	The protein encoded by this gene belongs to the family of P-type primary ion transport ATPases characterized by the formation of an aspartyl phosphate intermediate during the reaction cycle. These enzymes remove bivalent calcium ions from eukaryotic cells against very large concentration gradients and play a critical role in intracellular calcium homeostasis. The mammalian plasma membrane calcium ATPase isoforms are encoded by at least four separate genes and the diversity of these enzymes is further increased by alternative splicing of transcripts. The expression of different isoforms and splice variants is regulated in a developmental, tissue- and cell type-specific manner, suggesting that these pumps are functionally adapted to the physiological needs of particular cells and tissues. This gene encodes the plasma membrane calcium ATPase isoform 4. Alternatively spliced transcript variants encoding different isoforms have been identified.
1285	
1286	The protein encoded by this gene is a dual specificity protein kinase that belongs to the MAP kinase kinase family. This kinase is activated by mitogenic and environmental stress, and participates in the MAP kinase-mediated signaling cascade. It phosphorylates and thus activates MAPK14/p38-MAPK. This kinase can be activated by insulin, and is necessary for the expression of glucose transporter. Expression of RAS oncogene is found to result in the accumulation of the active form of this kinase, which thus leads to the constitutive activation of MAPK14, and confers oncogenic transformation of primary cells. The inhibition of this kinase is involved in the pathogenesis of Yersina pseudotuberculosis. Three alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
1287	The protein encoded by this gene belongs to the leucine-rich repeat superfamily. This gene was found to be amplified and overexpressed in malignant gliomas. The encoded protein has homology with other proteins that function as cell-adhesion molecules or as signal transduction receptors and is a candidate for the target gene in the 1q32.1 amplicon in malignant gliomas. Two alternatively spliced transcript variants encoding the same protein have been described for this gene.
1288	This gene encodes a protein that is a member of the SUMO (small ubiquitin-like modifier) protein family. It functions in a manner similar to ubiquitin in that it is bound to target proteins as part of a post-translational modification system. However, unlike ubiquitin which targets proteins for degradation, this protein is involved in a variety of cellular processes, such as nuclear transport, transcriptional regulation, apoptosis, and protein stability. It is not active until the last two amino acids of the carboxy-terminus have been cleaved off. Numerous pseudogenes have been reported for this gene. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
1289	Cholesterol homeostasis is regulated, at least in part, by sterol regulatory element (SRE)-binding proteins (e.g., SREBP1; MIM 184756) and by liver X receptors (e.g., LXRA; MIM 602423). Upon sterol depletion, LXRs are inactive and SREBPs are cleaved, after which they bind promoter SREs and activate genes involved in cholesterol biosynthesis and uptake. Sterol transport is mediated by vesicles or by soluble protein carriers, such as steroidogenic acute regulatory protein (STAR; MIM 600617). STAR is homologous to a family of proteins containing a 200- to 210-amino acid STAR-related lipid transfer (START) domain, including STARD3 (Soccio et al., 2002).[supplied by OMIM]
1290	
1291	
1292	
1293	
1294	Clathrin is a major protein component of the cytoplasmic face of intracellular organelles, called coated vesicles and coated pits. These specialized organelles are involved in the intracellular trafficking of receptors and endocytosis of a variety of macromolecules.  The basic subunit of the clathrin coat is composed of three heavy chains and three light chains.
1295	Expression of this gene is induced by hexamethylene-bis-acetamide in vascular smooth muscle cells. The function of its protein product is not yet known.
1296	Sclerostin is a secreted glycoprotein with a C-terminal cysteine knot-like (CTCK) domain and sequence similarity to the DAN (differential screening-selected gene aberrative in neuroblastoma) family of bone morphogenetic protein (BMP) antagonists. Loss-of-function mutations in this gene are associated with an autosomal-recessive disorder, sclerosteosis, which causes progressive bone overgrowth. A deletion downstream of this gene, which causes reduced sclerostin expression, is associated with a milder form of the disorder called van Buchem disease.
1297	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and a SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation.
1298	The 17-beta-hydroxysteroid dehydrogenase enzyme (EC 1.1.1.62) oxidizes or reduces estrogens and androgens in mammals and regulates the biologic potency of these steroids.[supplied by OMIM]
1299	
1300	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. The function of this protein has not yet been determined.
1301	
1302	
1303	
1304	
1305	
1306	
1307	Mutations in the winged-helix transcription factor gene at the nude locus in mice and rats produce the pleiotropic phenotype of hairlessness and athymia, resulting in a severely compromised immune system. This gene is orthologous to the mouse and rat genes and encodes a similar DNA-binding transcription factor that is thought to regulate keratin gene expression. A mutation in this gene has been correlated with T-cell immunodeficiency, the skin disorder congenital alopecia, and nail dystrophy. Alternative splicing in the 5@#$%& UTR of this gene has been observed.
1308	The protein encoded by this gene is believed to be a secretory protein. It has regions of similarity to hydrophilic segments of yeast mannosyltransferases. Its expression is ubiquitous and the gene appears to be relatively conserved among mammals.
1309	The ADF (actin-depolymerizing factor)/cofilin family (see MIM 601442) is composed of stimulus-responsive mediators of actin dynamics. ADF/cofilin proteins are inactivated by kinases such as LIM domain kinase-1 (LIMK1; MIM 601329). The SSH family appears to play a role in actin dynamics by reactivating ADF/cofilin proteins in vivo (Niwa et al., 2002).[supplied by OMIM]
1310	This cell surface receptor binds to galactose-terminated glycoproteins. It transports these glycoproteins via a series of membrane vesicles and tubules to an acidic-sorting organelle where the receptor and ligand dissociates. Then the receptor is recycled back to the cell surface.
1311	
1312	
1313	Nucleocytoplasmic transport, a signal- and energy-dependent process, takes place through nuclear pore complexes embedded in the nuclear envelope. The import of proteins containing a nuclear localization signal (NLS) requires the NLS import receptor, a heterodimer of importin alpha and beta subunits also known as karyopherins. Importin alpha binds the NLS-containing cargo in the cytoplasm and importin beta docks the complex at the cytoplasmic side of the nuclear pore complex. In the presence of nucleoside triphosphates and the small GTP binding protein Ran, the complex moves into the nuclear pore complex and the importin subunits dissociate. Importin alpha enters the nucleoplasm with its passenger protein and importin beta remains at the pore. Interactions between importin beta and the FG repeats of nucleoporins are essential in translocation through the pore complex. The protein encoded by this gene is a member of the importin beta family.
1314	
1315	This gene is a member of the inhibitor of apoptosis (IAP) gene family, which encode negative regulatory proteins that prevent apoptotic cell death. IAP family members usually contain multiple baculovirus IAP repeat (BIR) domains, but this gene encodes proteins with only a single BIR domain. The encoded proteins also lack a C-terminus RING finger domain. Gene expression is high during fetal development and in most tumors yet low in adult tissues. Antisense transcripts are involved in the regulation of this gene@#$%&s expression. At least four transcript variants encoding distinct isoforms have been found for this gene, but the full-length natures of only three of them have been determined.
1316	This gene encodes a DNA topoisomerase, an enzyme that controls and alters the topologic states of DNA during transcription. This nuclear enzyme is involved in processes such as chromosome condensation, chromatid separation, and the relief of torsional stress that occurs during DNA transcription and replication. It catalyzes the transient breaking and rejoining of two strands of duplex DNA which allows the strands to pass through one another, thus altering the topology of DNA. Two forms of this enzyme exist as likely products of a gene duplication event. The gene encoding this form, alpha, is localized to chromsome 17 and the beta gene is localized to chromosome 3. The gene encoding this enzyme functions as the target for several anticancer agents and a variety of mutations in this gene have been associated with the development of drug resistance. Reduced activity of this enzyme may also play a role in ataxia-telangiectasia.
1317	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the high sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
1318	
1319	PAFAH1B1 was identified as encoding a gene that when mutated or lost caused the lissencephaly associated with Miller-Dieker lissencephaly syndrome. PAFAH1B1 encodes the non-catalytic alpha subunit of the intracellular Ib isoform of platelet-activating factor acteylhydrolase, a heterotrimeric enzyme that specifically catalyzes the removal of the acetyl group at the SN-2 position of platelet-activating factor (identified as 1-O-alkyl-2-acetyl-sn-glyceryl-3-phosphorylcholine). Two other isoforms of intracellular platelet-activating factor acetylhydrolase exist: one composed of multiple subunits, the other, a single subunit. In addition, a single-subunit isoform of this enzyme is found in serum.
1320	
1321	
1322	
1323	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. The enzyme encoded by this gene is a lactosylceramide synthase important for glycolipid biosynthesis.
1324	
1325	
1326	
1327	
1328	
1329	
1330	
1331	
1332	
1333	Vasodilator-stimulated phosphoprotein (VASP) is a member of the Ena-VASP protein family. Ena-VASP family members contain an EHV1 N-terminal domain that binds proteins containing E/DFPPPPXD/E motifs and targets Ena-VASP proteins to focal adhesions. In the mid-region of the protein, family members have a proline-rich domain that binds SH3 and WW domain-containing proteins. Their C-terminal EVH2 domain mediates tetramerization and binds both G and F actin. VASP is associated with filamentous actin formation and likely plays a widespread role in cell adhesion and motility. VASP may also be involved in the intracellular signaling pathways that regulate integrin-extracellular matrix interactions. VASP is regulated by the cyclic nucleotide-dependent kinases PKA and PKG. While several transcript variants may exist for this gene, the full-length natures of only two have been described to date. These two represent the major variants of this gene and encode distinct isoforms.
1334	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S19E family of ribosomal proteins. It is located in the cytoplasm. Mutations in this gene cause Diamond-Blackfan anemia (DBA), a constitutional erythroblastopenia characterized by absent or decreased erythroid precursors, in a subset of patients. This suggests a possible extra-ribosomal function for this gene in erythropoietic differentiation and proliferation, in addition to its ribosomal function. Higher expression levels of this gene in some primary colon carcinomas compared to matched normal colon tissues has been observed. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
1335	This gene encodes a protein containing KELCH-1 like domains, as well as a BTB/POZ domain. Kelch-like ECH-associated protein 1 interacts with NF-E2-related factor 2 in a redox-sensitive manner and the dissociation of the proteins in the cytoplasm is followed by transportation of NF-E2-related factor 2 to the nucleus. This interaction results in the expression of the catalytic subunit of gamma-glutamylcysteine synthetase. Two alternatively spliced transcript variants encoding the same isoform have been found for this gene.
1336	The protein encoded by this gene is a member of the intercellular adhesion molecule (ICAM) family. All ICAM proteins are type I transmembrane glycoproteins, contain 2-9 immunoglobulin-like C2-type domains, and bind to the leukocyte adhesion LFA-1 protein. This protein is expressed on the surface of telencephalic neurons and displays two types of adhesion activity, homophilic binding between neurons and heterophilic binding between neurons and leukocytes. It may be a critical component in neuron-microglial cell interactions in the course of normal development or as part of neurodegenerative diseases.
1337	
1338	
1339	The protein encoded by this gene is a cytokine that belongs to the tumor necrosis factor (TNF) ligand family. This cytokine is a ligand for TNFRSF27/CD27. It is a surface antigen on activated, but not on resting, T and B lymphocytes. It induces proliferation of costimulated T cells, enhances the generation of cytolytic T cells, and contributes to T cell activation. This cytokine is also reported to play a role in regulating B-cell activation, cytotoxic function of natural killer cells, and immunoglobulin sythesis.
1340	
1341	This gene belongs to the family defined by the mouse resistin-like genes. The characteristic feature of this family is the C-terminal stretch of 10 cys residues with identical spacing. The mouse homolog of this protein is secreted by adipocytes, and may be the hormone potentially linking obesity to type II diabetes.
1342	
1343	
1344	This gene encodes a member of the DNase family. The protein, located in the lysosome, hydrolyzes DNA under acidic conditions and mediates the breakdown of DNA during erythropoiesis and apoptosis. Two codominant alleles have been characterized, DNASE2*L (low activity) and DNASE2*H (high activity), that differ at one nucleotide in the promoter region. The DNASE2*H allele is represented in this record.
1345	
1346	
1347	The nuclear pore complex is a massive structure that extends across the nuclear envelope, forming a gateway that regulates the flow of macromolecules between the nucleus and the cytoplasm. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells. The protein encoded by this gene is a member of the FG-repeat containing nucleoporins and is localized to the nuclear pore central plug. This protein associates with the importin alpha/beta complex which is involved in the import of proteins containing nuclear localization signals. Multiple transcript variants of this gene encode a single protein isoform.
1348	
1349	
1350	
1351	
1352	
1353	Pleckstrin homology, Sec7 and coiled/coil domains 2 (PSCD2) is a member of the PSCD family. Members of this family have identical structural organization that consists of an N-terminal coiled-coil motif, a central Sec7 domain, and a C-terminal pleckstrin homology (PH) domain. The coiled-coil motif is involved in homodimerization, the Sec7 domain contains guanine-nucleotide exchange protein (GEP) activity, and the PH domain interacts with phospholipids and is responsible for association of PSCDs with membranes. Members of this family appear to mediate the regulation of protein sorting and membrane trafficking. PSCD2 exhibits GEP activity in vitro with ARF1, ARF3, and ARF6. PSCD2 protein is 83% homologous to PSCD1. Two transcript variants encoding different isoforms have been found for this gene.
1354	
1355	
1356	
1357	The protein encoded by this gene binds to RNA polymerase II subunit 5 (RPB5) and negatively modulates transcription through its binding to RPB5. The encoded protein seems to have inhibitory effects on various types of activated transcription, but it requires the RPB5-binding region. This protein acts as a corepressor. It is suggested that it may require signaling processes for its function or that it negatively modulates genes in the chromatin structure. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
1358	
1359	
1360	
1361	This gene was identified by the induction of its expression in B lymphocytes by Epstein-Barr virus infection. The protein encoded by this gene is a secreted glycoprotein, which is a member of the hematopoietin receptor family related to the p40 subunit of interleukin 12 (IL-12). It might play a role in regulating cell-mediated immune responses.
1362	
1363	
1364	
1365	
1366	
1367	The protein encoded by this gene is a member of the trypsin family of peptidases. The encoded protein is a component of the alternative complement pathway best known for its role in humoral suppression of infectious agents. This protein is also a serine protease that is secreted by adipocytes into the bloodstream. Finally, the encoded protein has a high level of expression in fat, suggesting a role for adipose tissue in immune system biology.
1368	Action potentials in vertebrate neurons are followed by an afterhyperpolarization (AHP) that may persist for several seconds and may have profound consequences for the firing pattern of the neuron. Each component of the AHP is kinetically distinct and is mediated by different calcium-activated potassium channels. The protein encoded by this gene is activated before membrane hyperpolarization and is thought to regulate neuronal excitability by contributing to the slow component of synaptic AHP. The encoded protein is an integral membrane protein that forms a voltage-independent calcium-activated channel with three other calmodulin-binding subunits. This gene is a member of the KCNN family of potassium channel genes.
1369	
1370	
1371	This reference sequence was derived from multiple replicate ESTs and validated by similar human genomic sequence. This gene encodes a member of a family of small membrane proteins that share a 35-amino acid signature sequence domain, beginning with the sequence PFXYD and containing 7 invariant and 6 highly conserved amino acids. The approved human gene nomenclature for the family is FXYD-domain containing ion transport regulator. Transmembrane topology has been established for two family members (FXYD1 and FXYD2), with the N-terminus extracellular and the C-terminus on the cytoplasmic side of the membrane. FXYD2, also known as the gamma subunit of the Na,K-ATPase, regulates the properties of that enzyme. FXYD1 (phospholemman), FXYD2 (gamma), FXYD3 (MAT-8), FXYD4 (CHIF), and FXYD5 (RIC) have been shown to induce channel activity in experimental expression systems. This gene product, FXYD7, is novel and has not been characterized as a protein. [RefSeq curation by Kathleen J. Sweadner, Ph.D., sweadner@helix.mgh.harvard.edu.]
1372	SF4 is a member of the SURP family of splicing factors.[supplied by OMIM]
1373	
1374	Heterogeneous nuclear RNAs (hnRNAs) which include mRNA precursors and mature mRNAs are associated with specific proteins to form heterogenous ribonucleoprotein (hnRNP) complexes. Heterogeneous nuclear ribonucleoprotein L is among the proteins that are stably associated with hnRNP complexes and along with other hnRNP proteins is likely to play a major role in the formation, packaging, processing, and function of mRNA. Heterogeneous nuclear ribonucleoprotein L is present in the nucleoplasm as part of the HNRP complex. HNRP proteins have also been identified outside of the nucleoplasm. Exchange of hnRNP for mRNA-binding proteins accompanies transport of mRNA from the nucleus to the cytoplasm. Since HNRP proteins have been shown to shuttle between the nucleus and the cytoplasm, it is possible that they also have cytoplasmic functions. Two transcript variants encoding different isoforms have been found for this gene.
1375	
1376	
1377	
1378	
1379	
1380	
1381	In C. elegans, Ced4 binds and activates Ced3, an apoptotic initiator caspase, via caspase-associated recruitment domains (CARDs). Human Ced4 homologs include APAF1 (MIM 602233), NOD1 (CARD4; MIM 605980), and NOD2 (CARD15; MIM 605956). These proteins have at least 1 N-terminal CARD domain followed by a centrally located nucleotide-binding domain (NBD or NACHT) and a C-terminal regulatory domain, found only in mammals, that contains either WD40 repeats or leucine-rich repeats (LRRs). CARD12 is a member of the Ced4 family and can induce apoptosis.[supplied by OMIM]
1382	
1383	
1384	
1385	This gene encodes a protein belonging to the GCKR subfamily of the SIS (Sugar ISomerase) family of proteins. The gene product is a regulatory protein that inhibits glucokinase in liver and pancreatic islet cells by binding non-covalently to form an inactive complex with the enzyme. This gene is considered a susceptibility gene candidate for a form of maturity-onset diabetes of the young (MODY).
1386	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands.
1387	Nudix hydrolases, such as NUDT5, eliminate toxic nucleotide derivatives from the cell and regulate the levels of important signaling nucleotides and their metabolites (McLennan, 1999).[supplied by OMIM]
1388	
1389	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It is overexpressed in cervical cancer cell line and strongly coexpressed with another family member, WNT10A, in colorectal cancer cell line. The gene overexpression may play key roles in carcinogenesis. This gene and the WNT10A gene are clustered in the chromosome 2q35 region. The protein encoded by this gene is 97% identical to the mouse Wnt6 protein at the amino acid level.
1390	
1391	This gene encodes a pancreatic secretory protein that may be involved in cell proliferation or differentiation. It has similarity to the C-type lectin superfamily. The enhanced expression of this gene is observed during pancreatic inflammation and liver carcinogenesis. Multiple alternatively spliced transcript variants encoding the same protein have been described for this gene but the full length nature of some transcripts is not yet known.
1392	
1393	This gene encodes a member of the peroxiredoxin family of antioxidant enzymes, which reduce hydrogen peroxide and alkyl hydroperoxides. The encoded protein may play an antioxidant protective role in cells, and may contribute to the antiviral activity of CD8(+) T-cells. This protein may have a proliferative effect and play a role in cancer development or progression. Three transcript variants encoding the same protein have been identified for this gene.
1394	
1395	
1396	The nuclear body is a multiprotein complex that may have a role in the regulation of gene transcription. This gene is a member of the SP100/SP140 family of nuclear body proteins and encodes a leukocyte-specific nuclear body component. The protein can function as an activator of gene transcription and may serve as a nuclear hormone receptor coactivator. In addition, it has been suggested that the protein may play a role in ribosome biogenesis and in the induction of myeloid cell differentiation. Alternative splicing has been observed for this gene and three transcript variants, encoding distinct isoforms, have been identified.
1397	
1398	This gene encodes a scaffold protein that interacts with GTP-bound Rho proteins. Binding of this protein inhibits the GTPase activity of Rho proteins. This protein may interfere with the conversion of active, GTP-bound Rho to the inactive GDP-bound form by RhoGAP. Rho proteins regulate many important cellular processes, including cytokinesis, transcription, smooth muscle contraction, cell growth and transformation. Dysregulation of the Rho signal transduction pathway has been implicated in many forms of cancer. Alternative splicing results in multiple transcript variants encoding different isoforms.
1399	The first mRNA transcript isolated for this gene was part of an artificial chimera derived from two distinct gene transcripts and a primer used in the cloning process (see Genbank accession M29204). A positively charged amino terminus present only in the chimera was determined to bind GC-rich DNA, thus mistakenly thought to identify a transcription factor gene.
1400	Voltage-gated potassium channels form the largest and most diversified class of ion channels and are present in both excitable and nonexcitable cells. Their main functions are associated with the regulation of the resting membrane potential and the control of the shape and frequency of action potentials. The alpha subunits are of 2 types: those that are functional by themselves and those that are electrically silent but capable of modulating the activity of specific functional alpha subunits. The protein encoded by this gene is not functional by itself but can form heteromultimers with member 1 and with member 2 (and possibly other members) of the Shab-related subfamily of potassium voltage-gated channel proteins. This gene belongs to the S subfamily of the potassium channel family.
1401	
1402	
1403	The protein encoded by this gene belongs to the tigger subfamily of the pogo superfamily of DNA-mediated transposons in humans. These proteins are related to DNA transposons found in fungi and nematodes, and more distantly to the Tc1 and mariner transposases. They are also very similar to the major mammalian centromere protein B. The exact function of this gene is not known.
1404	
1405	The protein encoded by this gene belongs to the complex I 42kDA subunit family. Mammalian complex I is the first enzyme complex in the electron transport chain of mitochondria. It is composed of 45 different subunits. This protein is a component of the hydrophobic protein fraction and has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain.
1406	
1407	
1408	Although the function of this gene is not known, its existence is supported by mRNA and EST data. The predicted gene product contains a region similar to the STAR-related lipid transfer (START) domain, which is often present in proteins involved in the cell signaling mediated by lipid binding. Two alternatively spliced transcript variants that encode an identical protein have been reported.
1409	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXD genes located at 2q31-2q37 chromosome regions. Deletions that removed the entire HOXD gene cluster or 5@#$%& end of this cluster have been associated with severe limb and genital abnormalities. The product of this gene may play a role in the establishment of mesodermal derivatives during embryogenesis.
1410	
1411	
1412	
1413	This gene encodes a nuclear-encoded mitochondrial bifunctional enzyme with methylenetetrahydrofolate dehydrogenase and methenyltetrahydrofolate cyclohydrolase activities. The enzyme functions as a homodimer and is unique in its absolute requirement for magnesium and inorganic phosphate. Formation of the enzyme-magnesium complex allows binding of NAD.
1414	
1415	The protein encoded by this gene is highly similar to the metaxin 2 protein from mouse, which has been shown to interact with the mitochondrial membrane protein metaxin 1. Because of this similarity, it is thought that the encoded protein is peripherally associated with the cytosolic face of the outer mitochondrial membrane and be involved in the import of proteins into the mitochondrion. Two transcript variants encoding different isoforms have been found for this gene.
1416	
1417	
1418	
1419	This gene encodes a protein belonging to the 14-3-3 family of proteins, members of which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals. The encoded protein has been shown to interact with RAF1 and CDC25 phosphatases, suggesting that it may play a role in linking mitogenic signaling and the cell cycle machinery. Two transcript variants, which encode the same protein, have been identified for this gene.
1420	
1421	
1422	
1423	The protein encoded by this gene is a sequence-specific DNA-binding transcription factor involved in the activation of several developmental genes. The encoded protein can act as either a homodimer or heterodimer with other family members and is induced during retinoic acid-mediated differentiation. It plays a role in the development of the eyes, face, body wall, limbs, and neural tube.
1424	
1425	
1426	
1427	
1428	Analysis of cDNA clones indicates that ribosome binding protein 1 may exist in different forms due to removal of tandem repeats, or partial intraexonic splicing of RRBP1. The form presented here is lacking the canine p180 ribosome-binding domain, NQGKKAEGAQ, which is tandemly repeated close to the N-terminus in other forms that haven@#$%&t been fully characterized. RRBP1 has been excluded as a candidate gene in the cause of Alagille syndrome.
1429	The RAB5 protein is a small GTPase involved in membrane trafficking in the early endocytic pathway. The protein encoded by this gene binds the GTP-bound form of the RAB5 protein preferentially over the GDP-bound form, and functions as a guanine nucleotide exchange factor for RAB5. The encoded protein is found primarily as a tetramer in the cytoplasm and does not bind other members of the RAB family.
1430	An acetyl-CoA synthetase found in the mitochondrial matrix has been characterized in both cow and mouse. It is primarily a cardiac enzyme which produces acetyl-CoA mainly for the oxidation of acetate. The protein encoded by this gene is structurally similar to the cow and mouse proteins and therefore may serve a similar function in humans.
1431	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S24E family of ribosomal proteins. It is located in the cytoplasm. Alternative splice variants that encode different protein isoforms exist. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
1432	
1433	
1434	
1435	
1436	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S21E family of ribosomal proteins. It is located in the cytoplasm. Alternative splice variants that encode different protein isoforms have been described, but their existence has not been verified. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
1437	Tight junctions represent one mode of cell-to-cell adhesion in epithelial or endothelial cell sheets, forming continuous seals around cells and serving as a physical barrier to prevent solutes and water from passing freely through the paracellular space. The protein encoded by this immunoglobulin superfamily gene member is localized in the tight junctions between high endothelial cells. It acts as an adhesive ligand for interacting with a variety of immune cell types and may play a role in lymphocyte homing to secondary lymphoid organs.
1438	
1439	The minichromosome maintenance protein 3 (MCM3) is one of the MCM proteins essential for the initiation of DNA replication. The protein encoded by this gene is a MCM3 binding protein. It was reported to have phosphorylation-dependent DNA-primase activity, which was up-regulated in antigen immunization induced germinal center. This protein was demonstrated to be an acetyltransferase that acetylates MCM3 and plays a role in DNA replication. The mutagenesis of a nuclear localization signal of MCM3 affects the binding of this protein with MCM3, suggesting that this protein may also facilitate MCM3 nuclear localization.
1440	
1441	The Shaker gene family of Drosophila encodes components of voltage-gated potassium channels and is comprised of four subfamilies. Based on sequence similarity, this gene is similar to one of these subfamilies, namely the Shaw subfamily. The protein encoded by this gene belongs to the delayed rectifier class of channel proteins and is an integral membrane protein that mediates the voltage-dependent potassium ion permeability of excitable membranes. The encoded protein generates atypical voltage-dependent transient current that may be important for neuronal excitability. Two transcript variants encoding different isoforms have been found for this gene.
1442	
1443	
1444	
1445	This gene encodes a protein that belongs to the CN hydrolase family. Beta-ureidopropionase catalyzes the last step in the pyrimidine degradation pathway. The pyrimidine bases uracil and thymine are degraded via the consecutive action of dihydropyrimidine dehydrogenase (DHPDH), dihydropyrimidinase (DHP) and beta-ureidopropionase (UP) to beta-alanine and beta-aminoisobutyric acid, respectively. UP deficiencies are associated with N-carbamyl-beta-amino aciduria and may lead to abnormalities in neurological activity.
1446	
1447	This gene encodes a transcription factor that regulates MHC class II genes by binding to a promoter element referred to as an X box. The protein contains two domains commonly found in transcription factors that confer DNA binding and dimerization capability. This gene product was also separately identified as a cellular transcription factor that binds to an enhancer in the promoter of the T cell leukemia virus type 1 promoter. It may increase expression of viral proteins by acting as the DNA binding partner of a viral transactivator. There is also a recent finding that the same agents that cause an ER stress response, the unfolded protein response, also increase transcription of this gene. This increase in transcription requires an ER stress response consensus binding element in the promoter. Adding to the list of functional possibilities is a recent report from studies of knockout mice showing that this gene is essential for hepatocyte growth. A pseudogene has been identified and localized to chromosome 5.
1448	NIPSNAP1 protein has a strong sequence similarity to the central portion of a hypothetical protein encoded by C. elegans chromosome III between a 4-nitrophenylphosphatase (NIP) domain and non-neuronal SNAP25-like protein.  The NIPSNAP1 gene is located between the NF2 and pK1.3 genes on 22q12. Its function is unknown.
1449	Sulfonation, an important step in the metabolism of many drugs, xenobiotics, hormones, and neurotransmitters, is catalyzed by sulfotransferases. The product of this gene is galactosylceramide sulfotransferase which catalyzes the conversion between 3@#$%&-phosphoadenylylsulfate + a galactosylceramide to adenosine 3@#$%&,5@#$%&-bisphosphate + galactosylceramide sulfate. Activity of this sulfotransferase is enhanced in renal cell carcinoma.
1450	Heme oxygenase, an essential enzyme in heme catabolism, cleaves heme to form biliverdin, which is subsequently converted to bilirubin by biliverdin reductase, and carbon monoxide, a putative neurotransmitter. Heme oxygenase activity is induced by its substrate heme and by various nonheme substances. Heme oxygenase occurs as 2 isozymes, an inducible heme oxygenase-1 and a constitutive heme oxygenase-2. HMOX1 and HMOX2 belong to the heme oxygenase family.
1451	CSF2RB is a common beta chain of the high affinity receptor for IL-3, IL-5 and CSF.  Defective CSF2RB has been reported to be associated with protein alveolar proteinosis
1452	The protein encoded by this gene is a soluble beta-galactoside binding lectin. The encoded protein is found as a homodimer and can bind to lymphotoxin-alpha. A single nucleotide polymorphism in an intron of this gene can alter the transcriptional level of the protein, with a resultant increased risk of myocardial infarction.
1453	
1454	
1455	This gene is a member of the cytidine deaminase gene family. It is one of seven related genes or pseudogenes found in a cluster, thought to result from gene duplication, on chromosome 22. Members of the cluster encode proteins that are structurally and functionally related to the C to U RNA-editing cytidine deaminase APOBEC1. It is thought that the proteins may be RNA editing enzymes and have roles in growth or cell cycle control. Alternatively spliced transcript variants encoding different isoforms have been identified.
1456	This gene encodes a transcription factor that was originally identified as a widely expressed mammalian DNA binding protein that could bind a tax-responsive enhancer element in the LTR of HTLV-1. The encoded protein was also isolated and characterized as the cAMP-response element binding protein 2 (CREB-2). The protein encoded by this gene belongs to a family of DNA-binding proteins that includes the AP-1 family of transcription factors, cAMP-response element binding proteins (CREBs) and CREB-like proteins. These transcription factors share a leucine zipper region that is involved in protein-protein interactions, located C-terminal to a stretch of basic amino acids that functions as a DNA binding domain. Two alternative transcripts encoding the same protein have been described. Two pseudogenes are located on the X chromsome at q28 in a region containing a large inverted duplication.
1457	This gene encodes a transcription factor that was originally identified as a widely expressed mammalian DNA binding protein that could bind a tax-responsive enhancer element in the LTR of HTLV-1. The encoded protein was also isolated and characterized as the cAMP-response element binding protein 2 (CREB-2). The protein encoded by this gene belongs to a family of DNA-binding proteins that includes the AP-1 family of transcription factors, cAMP-response element binding proteins (CREBs) and CREB-like proteins. These transcription factors share a leucine zipper region that is involved in protein-protein interactions, located C-terminal to a stretch of basic amino acids that functions as a DNA binding domain. Two alternative transcripts encoding the same protein have been described. Two pseudogenes are located on the X chromsome at q28 in a region containing a large inverted duplication.
1458	The p70/p80 autoantigen is a nuclear complex consisting of two subunits with molecular masses of approximately 70 and 80 kDa. The complex functions as a single-stranded DNA-dependent ATP-dependent helicase. The complex may be involved in the repair of nonhomologous DNA ends such as that required for double-strand break repair, transposition, and V(D)J recombination. High levels of autoantibodies to p70 and p80 have been found in some patients with systemic lupus erythematosus.
1459	The p70/p80 autoantigen is a nuclear complex consisting of two subunits with molecular masses of approximately 70 and 80 kDa. The complex functions as a single-stranded DNA-dependent ATP-dependent helicase. The complex may be involved in the repair of nonhomologous DNA ends such as that required for double-strand break repair, transposition, and V(D)J recombination. High levels of autoantibodies to p70 and p80 have been found in some patients with systemic lupus erythematosus.
1460	
1461	
1462	
1463	
1464	This gene encodes a protein that localizes to the nucleus and expression of this gene is induced in response to elevated levels of axin. The Wnt signalling pathway, which is negatively regulated by axin, is important in axis formation in early development and impaired regulation of this signalling pathway is often involved in tumors. A decreased level of expression of this gene in tumors compared to the level of expression in their corresponding normal tissues suggests that this gene product has a tumor suppressor function.
1465	Members of the F-box protein family, such as FBXO40, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
1466	
1467	
1468	
1469	This monomeric enzyme functions in glycogen symthesis by catalyzing the formation of alpha 1,6- glucosidic linkages. It is most highly expressed in liver and muscle.  Deficiency can result in glycogen storage disease IV (Andersen@#$%&s disease).
1470	This gene encodes a DNA topoisomerase, an enzyme that controls and alters the topologic states of DNA during transcription. This nuclear enzyme is involved in processes such as chromosome condensation, chromatid separation, and the relief of torsional stress that occurs during DNA transcription and replication. It catalyzes the transient breaking and rejoining of two strands of duplex DNA which allows the strands to pass through one another, thus altering the topology of DNA. Two forms of this enzyme exist as likely products of a gene duplication event. The gene encoding this form, beta, is localized to chromosome 3 and the alpha form is localized to chromosome 17. The gene encoding this enzyme functions as the target for several anticancer agents and a variety of mutations in this gene have been associated with the development of drug resistance. Reduced activity of this enzyme may also play a role in ataxia-telangiectasia. Alternative splicing of this gene results in two transcript variants; however, the second variant has not yet been fully described.
1471	
1472	G protein-coupled receptors play a role in cell communication. They are characterized by an extracellular N terminus, 7 transmembrane regions, and an intracellular C terminus.[supplied by OMIM]
1473	
1474	
1475	
1476	
1477	Calcium/calmodulin-dependent protein kinase I is expressed in many tissues and is a component of a calmodulin-dependent protein kinase cascade.  Calcium/calmodulin directly activates calcium/calmodulin-dependent protein kinase I by binding to the enzyme and indirectly promotes the phosphorylation and synergistic activation of the enzyme by calcium/calmodulin-dependent protein kinase I kinase.
1478	The protein encoded by this gene is highly similar to S. cerevisiae DNA damage repair protein Rad18. Yeast Rad18 functions through its interaction with Rad6, which is an ubiquitin-conjugating enzyme required for post-replication repair of damaged DNA. Similar to its yeast counterpart, this protein is able to interact with the human homolog of yeast Rad6 protein through a conserved ring-finger motif.  Mutation of this motif results in defective replication of UV-damaged DNA and hypersensitivity to multiple mutagens.
1479	
1480	
1481	The protein encoded by this gene is a cytokine that belongs to the tumor necrosis factor (TNF) ligand family. This protein preferentially induces apoptosis in transformed and tumor cells, but does not appear to kill normal cells although it is expressed at a significant level in most normal tissues. This protein binds to several members of TNF receptor superfamily including TNFRSF10A/TRAILR1, TNFRSF10B/TRAILR2, TNFRSF10C/TRAILR3, TNFRSF10D/TRAILR4, and possibly also to TNFRSF11B/OPG. The activity of this protein may be modulated by binding to the decoy receptors TNFRSF10C/TRAILR3, TNFRSF10D/TRAILR4, and TNFRSF11B/OPG that cannot induce apoptosis. The binding of this protein to its receptors has been shown to trigger the activation of MAPK8/JNK, caspase 8, and caspase 3.
1482	The protein encoded by this gene was identified by its interaction with the catalytic domain of protein kinase C-zeta. The encoded protein contains an actin-binding site and an ATP-binding site. It is most closely related to twinfilin (PTK9), a conserved actin monomer-binding protein.
1483	
1484	
1485	
1486	This gene encodes a member of the alpha-2/delta subunit family, a protein in the voltage-dependent calcium channel complex. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization and consist of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. Various versions of each of these subunits exist, either expressed from similar genes or the result of alternative splicing. Research on a highly similar protein in rabbit suggests the protein described in this record is cleaved into alpha-2 and delta subunits. Alternate transcriptional splice variants of this gene, encoding different isoforms, have been characterized.
1487	
1488	
1489	
1490	
1491	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L18P family of ribosomal proteins. It is located in the cytoplasm. The protein binds 5S rRNA to form a stable complex called the 5S ribonucleoprotein particle (RNP), which is necessary for the transport of nonribosome-associated cytoplasmic 5S rRNA to the nucleolus for assembly into ribosomes. The protein interacts specifically with the beta subunit of casein kinase II. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. This gene is co-transcribed with the small nucleolar RNA gene U21, which is located in its fifth intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
1492	
1493	CDC25A is a member of the CDC25 family of phosphatases. CDC25A is required for progression from G1 to the S phase of the cell cycle. It activates the cyclin-dependent kinase CDC2 by removing two phosphate groups. CDC25A is specifically degraded in response to DNA damage, which prevents cells with chromosomal abnormalities from progressing through cell division. CDC25A is an oncogene, although its exact role in oncogenesis has not been demonstrated. Two transcript variants encoding different isoforms have been found for this gene.
1494	Neuroligins, such as NLGN4Y, are cell adhesion molecules present at the postsynaptic side of the synapse and may be essential for the formation of functional synapses (Jamain et al., 2003).[supplied by OMIM]
1495	Megakaryocytopoiesis is the cellular development process that leads to platelet production. The protein encoded by this gene is a humoral growth factor that is necessary for megakaryocyte proliferation and maturation, as well as for thrombopoiesis. This protein is the ligand for MLP/C_MPL, the product of myeloproliferative leukemia virus oncogene. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
1496	
1497	The protein encoded by this gene is a type-1 membrane glycoprotein, which contains two immunoglobulin domains, and thus belongs to the immunoglobulin superfamily. Studies of the related genes in mouse and rat suggest that this gene may regulate myeloid cell activity and delivers an inhibitory signal for the macrophage lineage in diverse tissues. Multiple alternatively spliced transcript variants that encode different isoforms have been found for this gene.
1498	
1499	This gene encodes a transmembrane protein. Mutations in this gene are associated with an autosomal recessive syndrome characterized by insulin-dependent diabetes mellitus and bilateral progressive optic atrophy, usually presenting in childhood or early adult life. Diverse neurologic symptoms, including a predisposition to psychiatric illness, may also be associated with this disorder. A large number and variety of mutations in this gene, particularly in exon 8, can be associated with this syndrome. Mutations in this gene can also cause autosomal dominant deafness 6 (DFNA6), also known as DFNA14 or DFNA38.
1500	
1501	
1502	
1503	
1504	
1505	Soluble guanylate cyclase (sGC), a heterodimeric protein consisting of an alpha and a beta subunit, catalyzes the conversion of GTP to the second messenger cGMP and functions as the main receptor for nitric oxide and nitrovasodilator drugs (Zabel et al., 1998).[supplied by OMIM]
1506	
1507	HERC3 belongs to the HERC family of ubiquitin ligases, all of which contain a HECT domain and at least 1 RCC1 (MIM 179710)-like domain (RLD). The 350-amino acid HECT domain is predicted to catalyze the formation of a thioester with ubiquitin before transferring it to a substrate, and the RLD is predicted to act as a guanine nucleotide exchange factor for small G proteins (Hochrainer et al., 2005).[supplied by OMIM]
1508	The protein encoded by this gene is a platelet-derived growth factor that belongs to the CXC chemokine family. This growth factor is a potent chemoattractant and activator of neutrophils. It has been shown to stimulate various cellular processes including DNA synthesis, mitosis, glycolysis, intracellular cAMP accumulation, prostaglandin E2 secretion, and sythesis of hyaluronic acid and sulfated glycosaminoglycan. It also stimulates the formation and secretion of plasminogen activator by synovial cells.
1509	Histone acetylation plays a key role in the regulation of eukaryotic gene expression. Histone acetylation and deacetylation are catalyzed by multisubunit complexes. The protein encoded by this gene is a component of the histone deacetylase complex which includes SIN3, SAP18, HDAC1, HDAC2, RbAp46, RbAp48 and other polypeptides. This protein containing complex is active in deacetylating core histone octamers, but inactive in deacetylating nucleosomal histones.
1510	
1511	Sulfotransferase enzymes catalyze the sulfate conjugation of many hormones, neurotransmitters, drugs, and xenobiotic compounds. These cytosolic enzymes are different in their tissue distributions and substrate specificities. The gene structure (number and length of exons) is similar among family members. However, the total genomic length of this gene is greater than that of other SULT1 genes.
1512	This gene encodes a member of the trypsin family of serine proteases. Studies in mouse suggest that the encoded enzyme may be involved in structural reorganizations associated with learning and memory. The enzyme is also expressed in Leydig cells in the testis, but its function in this tissue is unknown.
1513	
1514	Heparan sulfate biosynthetic enzymes are key components in generating a myriad of distinct heparan sulfate fine structures that carry out multiple biologic activities. The enzyme encoded by this gene is a member of the heparan sulfate biosynthetic enzyme family. It possesses both heparan sulfate glucosaminyl 3-O-sulfotransferase activity, anticoagulant heparan sulfate conversion activity, and is a rate limiting enzyme for synthesis of anticoagulant heparan. This enzyme is an intraluminal Golgi resident protein.
1515	This gene encodes one family member of vacuolar protein sorting 10 (VPS10) domain-containing receptor proteins. The VPS10 domain name comes from the yeast carboxypeptidase Y sorting receptor Vps10 protein. Members of this gene family are large with many exons but the CDS lengths are usually less than 3700 nt. Very large introns typically separate the exons encoding the VPS10 domain; the remaining exons are separated by much smaller-sized introns. These genes are strongly expressed in the central nervous system. Two of the five family members (sortilin and sortilin-related receptor) are synthesized as preproproteins; it is not yet known if this encoded protein is also a preproprotein.
1516	
1517	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the non-ATPase subunits of the 19S regulator lid. Two alternate transcripts encoding two different isoforms have been described. Pseudogenes have been identified on chromosomes 10 and 21.
1518	The protein encoded by this gene is a cytokine that is specifically induced by apoptosis. The release of this cytokine renders the tumor-associated vasculature sensitive to tumor necrosis factor. The precursor of SCYE1 (pro-SCYE1) is identical to the p43 subunit, which is associated with the multi-tRNA synthetase complex. Therefore, pro-SCYE1 may function in binding RNA as part of the tRNA synthetase complex in normal cells and in stimulating inflammatory responses after proteolytic cleavage in tumor cells.
1519	B lymphocyte chemoattractant, independently cloned and named Angie, is a CXC chemokine strongly expressed in the follicles of the spleen, lymph nodes, and Peyer@#$%&s patches. It preferentially promotes the migration of B lymphocytes (compared to T cells and macrophages), apparently by stimulating calcium influx into, and chemotaxis of, cells expressing Burkitt@#$%&s lymphoma receptor 1 (BLR-1). It may therefore function in the homing of B lymphocytes to follicles.
1520	
1521	This gene encodes a member of the annexin family.  Members of this calcium-dependent phospholipid-binding protein family play a role in the regulation of cellular growth and in signal transduction pathways.  This protein functions in the inhibition of phopholipase A2 and cleavage of inositol 1,2-cyclic phosphate to form inositol 1-phosphate. This protein may also play a role in anti-coagulation.
1522	
1523	
1524	FSHD region gene 1 maps 100 kb centromeric of the repeat units on chromosome 4q35 which are deleted in facioscapulohumeral muscular dystrophy. The FRG1 gene is evolutionarily conserved and has related sequences on multiple human chromosomes.  DNA sequence analysis did not reveal any homology to known genes.
1525	
1526	
1527	Arg and c-Abl represent the mammalian members of the Abelson family of non-receptor protein-tyrosine kinases.  They interact with the Arg/Abl binding proteins via the SH3 domains present in the carboxy end of the latter group of proteins.  The subcellular localization of Arg/Abl-interacting  protein 2 encoded by this gene in epithelial and cardiac muscle cells, suggests that it functions as an adapter protein to assemble signaling complexes in stress fibers, and that it is a potential link between Abl family kinases and the actin cytoskeleton.  Several transcript variants encoding different isoforms of Arg/Abl-interacting protein 2 have been described.
1528	
1529	
1530	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
1531	
1532	
1533	The protein encoded by this gene is a sperm protein, which interacts with A-kinase anchoring protein, AKAP3, through the amphipathic helix region of AKAP3. Type II regulatory subunit of cAMP-dependent protein kinase (PKARII) also binds to this helix domain of AKAP3, which allows PKARII to be targeted to specific subcellular compartments. It is suggested that sperm contains several proteins that bind to AKAPs in a manner similar to PKARII, and this encoded protein may be one of them.
1534	Apoptosis is a cell death process that removes toxic and/or useless cells during mammalian development. The apoptotic process is accompanied by shrinkage and fragmentation of the cells and nuclei and degradation of the chromosomal DNA into nucleosomal units. DNA fragmentation factor (DFF) is a heterodimeric protein of 40-kD (DFFB) and 45-kD (DFFA) subunits. DFFA is the substrate for caspase-3 and triggers DNA fragmentation during apoptosis. DFF becomes activated when DFFA is cleaved by caspase-3. The cleaved fragments of DFFA dissociate from DFFB, the active component of DFF. DFFB has been found to trigger both DNA fragmentation and chromatin condensation during apoptosis. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
1535	
1536	
1537	This gene encodes a selenoprotein that contains multiple selenocysteines. Selenocysteine is encoded by the usual stop codon UGA. Selenoprotein P is an extracellular glycoprotein and is the only selenoprotein known to contain multiple selenocysteine residues. Selenoprotein P binds to heparin and cell membranes and associates with endothelial cells. It has been implicated as an oxidant defense in the extracellular space and in the transport of selenium.
1538	The protein encoded by this gene is highly homologous to alpha-synuclein. These proteins are abundantly expressed in the brain and putatively inhibit phospholipase D2 selectively. The encoded protein, which may play a role in neuronal plasticity, is abundant in neurofibrillary lesions of patients with Alzheimer disease. This protein has been shown to be highly expressed in the substantia nigra of the brain, a region of neuronal degeneration in patients with Parkinson disease; however, no direct relation to Parkinson disease has been established. Two transcript variants encoding the same protein have been found for this gene.
1539	
1540	
1541	This gene encodes a hemopoietic cytokine proposed to signal through a heterodimeric receptor complex composed of the thymic stromal lymphopoietin receptor and the IL-7R alpha chain. It mainly impacts myeloid cells and induces the release of T cell-attracting chemokines from monocytes and enhances the maturation of CD11c(+) dendritic cells. Alternative splicing of this gene results in two transcript variants.
1542	
1543	
1544	
1545	
1546	This gene encodes the phosphatase 2A catalytic subunit. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. This gene encodes an alpha isoform of the catalytic subunit.
1547	
1548	
1549	
1550	
1551	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure, such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of the DEAD box protein family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a member of this family. The function of this member has not been determined. Based on studies in Drosophila, the abstrakt gene is widely required during post-transcriptional gene expression.
1552	
1553	
1554	
1555	
1556	The protein encoded by this gene functions together with DNA ligase IV and the DNA-dependent protein kinase in the repair of DNA double-strand break by non-homologous end joining and the completion of V(D)J recombination events. The non-homologous end-joining pathway is required both for normal development and for suppression of tumors. This gene functionally complements XR-1 Chinese hamster ovary cell mutant, which is impaired in DNA double-strand breaks produced by ionizing radiation and restriction enzymes. This gene contains 8 exons, and alternative transcription initiation and alternative splicing generates several transcript variants.
1557	The protein encoded by this gene is a regulatory protein involved in mitosis. The gene product complexes with p34(cdc2) to form the maturation-promoting factor (MPF). Two alternative transcripts have been found, a constitutively expressed transcript and a cell cycle-regulated transcript, that is expressed predominantly during G2/M phase. The different transcripts result from the use of alternate transcription initiation sites.
1558	
1559	The protein encoded by this gene is an extracellular copper enzyme that initiates the crosslinking of collagens and elastin. The enzyme catalyzes oxidative deamination of the epsilon-amino group in certain lysine and hydroxylysine residues of collagens and lysine residues of elastin. In addition to crosslinking extracellular matrix proteins, the encoded protein may have a role in tumor suppression.
1560	The multisubunit NADH:ubiquinone oxidoreductase (complex I) is the first enzyme complex in the electron transport chain of mitochondria. The iron-sulfur protein (IP) fraction is made up of 7 subunits, including NDUFS6. See NDUFS1 (MIM 157655).[supplied by OMIM]
1561	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein that belongs to the L22 ribosomal protein family. A pseudogene corresponding to this gene is found on chromosome 4q. Two transcript variants encoding different isoforms have been found for this gene.
1562	Nuclear pore complexes are extremely elaborate structures that mediate the regulated movement of macromolecules between the nucleus and cytoplasm. These complexes are composed of at least 100 different polypeptide subunits, many of which belong to the nucleoporin family. Nucleoporins are pore complex-specific glycoproteins characterized by cytoplasmically oriented O-linked N-acetylglucosamine residues and numerous repeats of the pentapeptide sequence XFXFG. The protein encoded by this gene has three distinct domains: a N-terminal region within which a pore targeting domain has been identified, a central region containing multiple zinc finger motifs, and a C-terminal region containing multiple XFXFG repeats.
1563	Histone deacetylase 2 (HDAC2), or transcriptional regulator homolog RPD3 L1, is highly homologous to the yeast transcription factor RPD3 (reduced potassium dependency 3) gene.  As in yeast, human HDA2 is likely to be involved in regulating chromatin structure during transcription.  It has been implicated to associate with YY1, a mammalian zinc-finger transcription factor, which negatively regulates transcription by tethering RPD3 to DNA as a cofactor.  This process is highly concerved from yeast to human.
1564	
1565	
1566	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S18P family. The encoded protein is one of three that has significant sequence similarity to bacterial S18 proteins. The primary sequences of the three human mitochondrial S18 proteins are no more closely related to each other than they are to the prokaryotic S18 proteins. A pseudogene corresponding to this gene is found on chromosome 3p.
1567	
1568	Mucolipins constitute a family of cation channel proteins with homologs in mouse, Drosophila, and C. elegans. Mutations in the human MCOLN1 gene (MIM 605248) cause mucolipodosis IV (MIM 262650).[supplied by OMIM]
1569	
1570	Butyrophilin is the major protein associated with fat droplets in the milk. This gene is a member of the BTN2 subfamily of genes, which encode proteins belonging to the butyrophilin protein family. The gene is located in a cluster on chromosome 6, consisting of seven genes belonging to the expanding B7/butyrophilin-like group, a subset of the immunoglobulin gene superfamily. The encoded protein is a type 1 receptor glycoprotein involved in lipid, fatty-acid and sterol metabolism. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some variants has not been determined.
1571	
1572	
1573	
1574	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to both the cytoplasm and the nucleus. Its function has not been identified. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
1575	
1576	
1577	The protein encoded by this gene is part of a complex termed negative elongation factor (NELF) which represses RNA polymerase II transcript elongation. This protein bears similarity to nuclear RNA-binding proteins; however, it has not been demonstrated that this protein binds RNA. The protein contains a tract of alternating basic and acidic residues, largely arginine (R) and aspartic acid (D). The gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6.
1578	
1579	
1580	
1581	
1582	
1583	
1584	
1585	
1586	
1587	
1588	
1589	
1590	This gene product is a member of the Vanin family of proteins which share extensive sequence similarity with each other, and also with biotinidase. The family includes secreted and membrane-associated proteins, a few of which have been reported to participate in hematopoietic cell trafficking. No biotinidase activity has been demonstrated for any of the vanin proteins, however, they possess pantetheinase activity, which may play a role in oxidative-stress response. The encoded protein is a GPI-anchored cell surface molecule that plays a role in transendothelial migration of neutrophils. This gene lies in close proximity to, and in same transcriptional orientation as two other vanin genes on chromosome 6q23-q24. Two transcript variants encoding different isoforms have been described for this gene.
1591	The product of this gene is a novel mitochondrial protein that interacts with reticulon 4, which is a potent inhibitor of regeneration following spinal cord injury. The interaction of reticulon 4 with mitochondrial proteins may provide insight into the mechanisms for reticulon-induced inhibition of neurite growth.
1592	This gene encodes a molecular chaperone that is member of the chaperonin containing TCP1 complex (CCT), also known as the TCP1 ring complex (TRiC). This complex consists of two identical stacked rings, each containing eight different proteins. Unfolded polypeptides enter the central cavity of the complex and are folded in an ATP-dependent manner. The complex folds various proteins, including actin and tubulin. Alternate transcriptional splice variants of this gene, encoding different isoforms, have been characterized.
1593	
1594	
1595	Calumenin (CALU) is a  calcium-binding protein localized in the endoplasmic reticulum (ER) and is involved in such ER functions as protein folding and sorting. Calumenin is a member of the EF-hand superfamily in the ER and Golgi apparatus named CERC. CERC is an acronym for its family members Cab-45, reticulocalbin, Erc-55, and calumenin. The CALU gene encodes a deduced 315-amino acid protein containing 6 EF-hand motifs, 1 potential N-glycosylation site, and a C-terminal ER retention signal. The human and mouse CALU proteins are 98% identical. CALU mRNA is ubiquitously expressed in human tissues and maps to 7q32.
1596	This gene product belongs to the DYRK family of dual-specificity protein kinases that catalyze autophosphorylation on serine/threonine and tyrosine residues. The members of this family share structural similarity, however, differ in their substrate specificity, suggesting their involvement in different cellular functions. The encoded protein has been shown to autophosphorylate on tyrosine residue and catalyze phosphorylation of histones H3 and H2B in vitro. Alternatively spliced transcript variants encoding different isoforms have been identified.
1597	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
1598	The product of this gene belongs to a small family of adapter proteins that are known to interact with a number of receptor tyrosine kinases and signaling molecules. This gene encodes a growth factor receptor-binding protein that interacts with insulin receptors and insulin-like growth-factor receptors. Overexpression of some isoforms of the encoded protein inhibits tyrosine kinase activity and results in growth suppression. This gene is imprinted in a highly isoform- and tissue-specific manner. Alternatively spliced transcript variants encoding different isoforms have been identified.
1599	
1600	
1601	
1602	
1603	The inhibin beta A subunit joins the alpha subunit to form a pituitary FSH secretion inhibitor. Inhibin has been shown to regulate gonadal stromal cell proliferation negatively and to have tumor-suppressor activity. In addition, serum levels of inhibin have been shown to reflect the size of granulosa-cell tumors and can therefore be used as a marker for primary as well as recurrent disease. Because expression in gonadal and various extragonadal tissues may vary severalfold in a tissue-specific fashion, it is proposed that inhibin may be both a growth/differentiation factor and a hormone. Furthermore, the beta A subunit forms a homodimer, activin A, and also joins with a beta B subunit to form a heterodimer, activin AB, both of which stimulate FSH secretion. Finally, it has been shown that the beta A subunit mRNA is identical to the erythroid differentiation factor subunit mRNA and that only one gene for this mRNA exists in the human genome.
1604	The protein encoded by this gene interacts with the dedicator of cyto-kinesis 1 protein to promote phagocytosis and effect cell shape changes. Similarity to a C. elegans protein suggests that this protein may function in apoptosis and in cell migration. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
1605	
1606	
1607	
1608	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
1609	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
1610	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
1611	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 100% identical to the rat ortholog. It is induced by growth factors in human vascular smooth muscle cells, and is also highly expressed in skeletal and heart muscles, suggesting an important role for this protein in muscle tissue. It has been shown to interact with RAF1 and protein kinase C, proteins involved in various signal transduction pathways.
1612	
1613	
1614	
1615	The protein encoded by this gene is a major component of the inner surface of caveolae, small invaginations of the plasma membrane, and is involved in essential cellular functions, including signal transduction, lipid metabolism, cellular growth control and apoptosis. This protein may function as a tumor suppressor. CAV1 and CAV2 are located next to each other on chromosome 7 and express colocalizing proteins that form a stable hetero-oligomeric complex. Two transcript variants encoding distinct isoforms have been identified for this gene. By using alternative initiation codons in the same reading frame, two isoforms (alpha and beta) are encoded by one transcript.
1616	
1617	
1618	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. Alternate splicing of this gene results in many isoforms that localize to the centrosome and the Golgi apparatus, and interact with numerous signaling proteins from multiple signal transduction pathways. These signaling proteins include type II protein kinase A, serine/threonine kinase protein kinase N, protein phosphatase 1, protein phosphatase 2a, protein kinase C-epsilon and phosphodiesterase 4D3.
1619	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intra-cellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the MDR/TAP subfamily.  Members of the MDR/TAP subfamily are involved in multidrug resistance as well as antigen presentation.  This gene encodes a full transporter and member of the p-glycoprotein family of membrane proteins with phosphatidylcholine as its substrate.  The function of this protein has not yet been determined; however, it may involve transport of phospholipids from liver hepatocytes into bile.  Alternative splicing of this gene results in several products of undetermined function.
1620	
1621	This gene encodes one subunit of the 2-oxoglutarate dehydrogenase complex. This complex catalyzes the overall conversion of 2-oxoglutarate (alpha-ketoglutarate) to succinyl-CoA and CO(2) during the Krebs cycle. The protein is located in the mitocondrial matrix and uses thiamine pyrophosphate as a cofactor. A congential deficiency in 2-oxoglutarate dehydrogenase activity is believed to lead to hypotonia, metabolic acidosis, and hyperlactatemia.
1622	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
1623	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
1624	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
1625	The protein encoded by this gene is a transmembrane glycoprotein that functions as a sulfate transporter. It is localized to the mucosa of the lower intestinal tract, particularly to the apical membrane of columnar epithelium and some goblet cells. Mutations in this gene have been associated with congenital chloride diarrhea.
1626	
1627	
1628	
1629	
1630	Members of the @#$%&frizzled@#$%& gene family encode 7-transmembrane domain proteins that are receptors for Wnt signaling proteins.  The FZD6 protein contains a signal peptide, a cysteine-rich domain in the N-terminal extracellular region, and 7 transmembrane domains. However, unlike many other Fz family members, FDZ6 does not contain a C-terminal PDZ domain-binding motif.
1631	APH1 is a multipass transmembrane protein that interacts with presenilin (see PSEN1; MIM 104311) and nicastrin (APH2; MIM 605254) as a functional component of the gamma-secretase complex. The gamma-secretase complex is required for the intramembrane proteolysis of a number of membrane proteins, including the amyloid-beta precursor protein (APP; MIM 104760) and Notch (MIM 190198).[supplied by OMIM]
1632	
1633	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L30E family of ribosomal proteins. It is located in the cytoplasm. This gene is co-transcribed with the U72 small nucleolar RNA gene, which is located in its fourth intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
1634	This gene encodes a protein similar to members of the protein phosphatase 2C (PP2C) family. Studies of the rat counterpart of this protein suggested that this protein may reside within the mitochondrial matrix space and be responsible for dephosphorylation and reactivation of the pyruvate dehydrogenase complex (PDC).
1635	
1636	Protein arginine N-methyltransferases, such as PRMT6, catalyze the sequential transfer of a methyl group from S-adenosyl-L-methionine to the side chain nitrogens of arginine residues within proteins to form methylated arginine derivatives and S-adenosyl-L-homocysteine.[supplied by OMIM]
1637	
1638	The M channel is a slowly activating and deactivating potassium channel that plays a critical role in the regulation of neuronal excitability. The M channel is formed by the association of the protein encoded by this gene and one of two related proteins encoded by the KCNQ2 and KCNQ5 genes, both integral membrane proteins. M channel currents are inhibited by M1 muscarinic acetylcholine receptors and activated by retigabine, a novel anti-convulsant drug. Defects in this gene are a cause of benign familial neonatal convulsions type 2 (BFNC2), also known as epilepsy, benign neonatal type 2 (EBN2).
1639	The protein encoded by this gene contains a RING zinc finger, a motif known to be involved in protein-protein interactions. This protein associates transiently with microtubules, myosin, and titin during muscle sarcomere assembly. It may act as a transient adaptor and plays a regulatory role in the assembly of sarcomeres. Four alternatively spliced transcript variants encoding distinct isoforms have been described.
1640	Some high-mobility group (HMG) box proteins (e.g., LEF1; MIM 153245) contain a single HMG box motif and bind DNA in a sequence-specific manner, while other members of this family (e.g., HMG1; MIM 163905) have multiple HMG boxes and bind DNA in a sequence-independent but structure-dependent manner. All HMG box proteins are able to induce a sharp bend in DNA. TOX contains a single HMG box motif.[supplied by OMIM]
1641	
1642	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-transferase that belongs to the mu class. The mu class of enzymes functions in the detoxification of electrophilic compounds, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress, by conjugation with glutathione. The genes encoding the mu class of enzymes are organized in a gene cluster on chromosome 1p13.3 and are known to be highly polymorphic. These genetic variations can change an individual@#$%&s susceptibility to carcinogens and toxins as well as affect the toxicity and efficacy of certain drugs. Mutations of this class mu gene have been linked with a slight increase in a number of cancers, likely due to exposure with environmental toxins.
1643	
1644	
1645	
1646	
1647	
1648	
1649	
1650	
1651	
1652	The product of this gene belongs to the phosphatase 2 regulatory subunit B family. Protein phosphatase 2 is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes an alpha isoform of the regulatory subunit B55 subfamily.
1653	CHRAC1 is a histone-fold protein that interacts with other histone-fold proteins to bind DNA in a sequence-independent manner. These histone-fold protein dimers combine within larger enzymatic complexes for DNA transcription, replication, and packaging.[supplied by OMIM]
1654	
1655	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a cytoplasmic ribosomal protein that is a component of the 40S subunit. The protein belongs to the S6E family of ribosomal proteins. It is the major substrate of protein kinases in the ribosome, with subsets of five C-terminal serine residues phosphorylated by different protein kinases. Phosphorylation is induced by a wide range of stimuli, including growth factors, tumor-promoting agents, and mitogens. Dephosphorylation occurs at growth arrest. The protein may contribute to the control of cell growth and proliferation through the selective translation of particular classes of mRNA. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
1656	This gene encodes a protein which is similar to the Drosophila crumbs protein and localizes to the inner segment of mammalian photoreceptors. In Drosophila crumbs localizes to the stalk of the fly photoreceptor and may be a component of the molecular scaffold that controls proper development of polarity in the eye. Mutations in this gene are associated with a severe form of retinitis pigmentosa, RP12, and with Leber congenital amaurosis. Two alternative transcripts encoding different isoforms have been described.
1657	
1658	
1659	The enzyme system for cleavage of glycine (glycine cleavage system; GCS; EC 2.1.2.10), which is confined to the mitochondria, is composed of 4 protein components: P protein (a pyridoxal phosphate-dependent glycine decarboxylase), H protein (a lipoic acid-containing protein), T protein (a tetrahydrofolate-requiring enzyme), and L protein (a lipoamide dehydrogenase). Glycine encephalopathy (GCE; MIM 605899) may be due to a defect in any one of these enzymes; see MIM 238310, MIM 238330, and MIM 238331.[supplied by OMIM]
1660	This gene product is a protein tyrosine kinase involved in a specific subset of cytokine receptor signaling pathways. It has been found to be constituitively associated with the prolactin receptor and is required for responses to gamma interferon. Mice that do not express an active protein for this gene exhibit embryonic lethality associated with the absence of definitive erythropoiesis.
1661	
1662	The protein encoded by this gene is a receptor protein tyrosine kinase whose cellular role has not been determined. It is a type I membrane protein and belongs to the ROR subfamily of cell surface receptors. Studies of a similar protein in mouse suggest that this protein may interact with another receptor protein tyrosine kinase and may be involved in skeletal and cardiac development.
1663	
1664	
1665	
1666	
1667	The protein encoded by this gene belongs to the lipocalin family. Lipocalins are a group of extracellular proteins that are able to bind lipophiles by enclosure within their structures to minimize solvent contact. This protein may bind hydrophobic ligands and inhibit cysteine proteinases. It may also play a role in taste reception.
1668	
1669	
1670	
1671	
1672	The protein encoded by this gene is a glycoprotein secreted from the pancreas into the digestive tract and from the lactating mammary gland into human milk. The physiological role of this protein is in cholesterol and lipid-soluble vitamin ester hydrolysis and absorption. This encoded protein promotes large chylomicron production in the intestine. Also its presence in plasma suggests its interactions with cholesterol and oxidized lipoproteins to modulate the progression of atherosclerosis. In pancreatic tumoral cells, this encoded protein is thought to be sequestrated within the Golgi compartment and is probably not secreted. This gene contains a variable number of tandem repeat (VNTR) polymorphism in the coding region that may influence the function of the encoded protein.
1673	
1674	
1675	DAB2IP is a Ras (MIM 190020) GTPase-activating protein (GAP) that acts as a tumor suppressor gene and is inactivated by methylation in prostate and breast cancers (Yano et al., 2005).[supplied by OMIM]
1676	This gene is one of several CC cytokine genes clustered on the p-arm of chromosome 9. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene may play a role in normal lymphocyte recirculation and homing. It also plays an important role in trafficking of T cells in thymus, and in T cell and B cell migration to secondary lymphoid organs. It specifically binds to chemokine receptor CCR7.
1677	
1678	
1679	
1680	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains an extracellular domain, a single transmembrane segment and two tandem intracytoplasmic catalytic domains, and thus belongs to receptor type PTP. This gene is specifically expressed in hematopoietic cells. This PTP has been shown to be an essential regulator of T- and B-cell antigen receptor signaling. It functions through either direct interaction with components of the antigen receptor complexes, or by activating various Src family kinases required for the antigen receptor signaling. This PTP also suppresses JAK kinases, and thus functions as a regulator of cytokine receptor signaling. Four alternatively spliced transcripts variants of this gene, which encode distinct isoforms, have been reported.
1681	
1682	
1683	
1684	
1685	
1686	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. A pseudogene corresponding to this gene is found at 17p13.3.
1687	
1688	
1689	
1690	
1691	This gene encodes a novel calcium binding protein expressed in the epidermis and related to the calmodulin family of calcium binding proteins. Functional studies with recombinant protein demonstrate it does bind calcium and undergoes a conformational change when it does so. Abundant expression is detected only in reconstructed epidermis and is restricted to differentiating keratinocytes. In addition, it can associate with transglutaminase 3, shown to be a key enzyme in the terminal differentiation of keratinocytes.
1692	ITGAM encodes the integrin alpha M chain. Note that this gene was cloned by different groups which used different naming conventions. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. This I-domain containing alpha integrin combines with the beta 2 chain (ITGB2) to form a leukocyte-specific integrin referred to as macrophage receptor 1 (@#$%&Mac-1@#$%&), or inactivated-C3b (iC3b) receptor 3 (@#$%&CR3@#$%&). The alpha M beta 2 integrin is important in the adherence of neutrophils and monocytes to stimulated endothelium, and also in the phagocytosis of complement coated particles. Some integrin alpha M proteins contain an inserted amino acid (Q, between AA positions 499 and 500) resulting from a 3bp shift in a splice acceptor that inserts the codon CAG; receptors with or without the extra amino acid are functionally indistinguishable
1693	
1694	
1695	
1696	
1697	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. Expression of this gene is up-regulated by interferon. This gene is mappped to chromosome 11p15, where it resides within a TRIM gene cluster. Alternate splicing of this gene generates four transcript variants. Additionally, a read-through transcript transcribed from this gene and TRIM6 has been observed.
1698	The protein encoded by this gene is similar to ING1, a tumor suppressor protein that can interact with TP53, inhibit cell growth, and induce apoptosis. This protein contains a PHD-finger, which is a common motif in proteins involved in chromatin remodeling. This protein can bind TP53 and EP300/p300, a component of the histone acetyl transferase complex, suggesting its involvement in TP53-dependent regulatory pathway.
1699	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which is highly homologous to yeast Prp22. This protein facilitates nuclear export of spliced mRNA by releasing the RNA from the spliceosome.
1700	
1701	
1702	This gene encodes a member of the Ca2+/calmodulin-dependent protein kinase 1 subfamily of serine/threonine kinases. The encoded protein may be involved in the regulation of granulocyte function through the chemokine signal transduction pathway. Alternatively spliced transcript variants encoding different isoforms of this gene have been described.
1703	Vitamin A metabolism is important for vital processes such as vision, embryonic development, cell differentiation, and membrane and skin protection. The protein encoded by this gene is a key enzyme in beta-carotene metabolism to vitamin A. It catalyzes the oxidative cleavage of beta,beta-carotene into two retinal molecules.
1704	
1705	
1706	
1707	
1708	
1709	SHOX2 is a member of the homeo box family of genes that encode proteins containing a 60-amino acid residue motif that represents a DNA binding domain. Homeo box genes have been characterized extensively as transcriptional regulators involved in pattern formation in both invertebrate and vertebrate species. Several human genetic disorders are caused by aberrations in human homeo box genes. SHOX is a pseudoautosomal homeo box gene that is thought to be responsible for idiopathic short stature and implicated to play a role in the short stature phenotype of Turner syndrome patients. SHOX2 (also called SHOT for SHOX homologous gene on chromosome 3) has two alternatively spliced transcript variants, SHOX2a and SHOX2b, that have identical homeodomains and share a C-terminal 14-amino acid residue motif characteristic for craniofacially expressed homeodomain proteins. The differences between SHOX2a and SHOX2b reside within the N-termini and an alternatively spliced exon in the C termini. SHOX2 maps to 3q25-q26.1 and is considered to be a candidate gene for Cornelia de Lange syndrome.
1710	The product of this gene belongs to the ligand-gated ion channel receptor superfamily. This gene encodes subunit C of the type 3 receptor for 5-hydroxytryptamine (serotonin), a biogenic hormone that functions as a neurotransmitter, a hormone, and a mitogen. This receptor causes fast, depolarizing responses in neurons after activation. Genes encoding subunits C, D and E form a cluster on chromosome 3.
1711	This gene encodes a kinase involved in spindle checkpoint function. The kinase functions in part by phosphorylating a member of the miotic checkpoint complex and activating the spindle checkpoint. Mutations in this gene have been associated with aneuploidy and several forms of cancer.
1712	
1713	
1714	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains an N-terminal domain that shares a significant similarity with yeast SEC14, which is a protein that has phosphatidylinositol transfer activity and is required for protein secretion through the Golgi complex in yeast. This PTP was found to be activated by polyphosphoinositide, and is thought to be involved in signaling events regulating phagocytosis.
1715	
1716	Complement component receptor-2 (CR2) is the membrane protein on B lymphocytes to which the Epstein-Barr virus (EBV) binds during infection of these cells. See also MIM 120620. Yefenof et al. (1976) found complete overlapping of EBV receptors and C3 (MIM 120700) receptors on human B lymphocytes.[supplied by OMIM]
1717	
1718	Fer protein is a member of the FPS/FES family of nontransmembrane receptor tyrosine kinases. It regulates cell-cell adhesion and mediates signaling from the cell surface to the cytoskeleton via growth factor receptors.
1719	
1720	S100A14 is a member of a subfamily of proteins related by Ca(2+)-binding motifs to the EF-hand Ca(2+)-binding protein superfamily.[supplied by OMIM]
1721	This gene encodes a protein that contains domains of thioredoxin and ERV1, members of two long-standing gene families. The gene expression is induced as fibroblasts begin to exit the proliferative cycle and enter quiescence, suggesting that this gene plays an important role in growth regulation. Two transcript variants encoding two different isoforms have been found for this gene.
1722	
1723	
1724	
1725	The major pathway for the biosynthesis of phosphatidylcholine occurs via the CDP-choline pathway. The protein encoded by this gene is the initial enzyme in the sequence and may play a regulatory role. The encoded protein also catalyzes the phosphorylation of ethanolamine. Two transcript variants encoding different isoforms have been found for this gene.
1726	Nuclear hormone receptors regulate transcription in part by recruiting distinct transcription coregulatory complexes to target gene promoters. The thyroid hormone receptor-associated proteins (TRAPs) form a complex with the thyroid hormone receptor (TR) (see THRA, MIM 190120) and markedly enhance TR-mediated transcription in vitro.[supplied by OMIM]
1727	
1728	
1729	
1730	
1731	
1732	This gene is a metastasis suppressor gene that suppresses metastases of melanomas and breast carcinomas without affecting tumorigenicity. The encoded protein may function to inhibit chemotaxis and invasion, attenuating metastasis in malignant melanomas. Studies suggest a putative role in the regulation of events downstream of cell-matrix adhesion, perhaps involving cytoskeletal reorganization. A polymorphism in the terminal exon of this mRNA results in two protein isoforms. An adenosine present at the polymorphic site represents the third position in a stop codon. When the adenosine is absent, a downstream stop codon is utilized and the encoded protein extends for an additional seven amino acid residues.
1733	NPTX1 is a member of the neuronal pentraxin gene family. Neuronal pentraxin 1 is similar to the rat NP1 gene which encodes a binding protein for the snake venom toxin taipoxin. Human NPTX1 mRNA is exclusively localized to the nervous system.
1734	This gene encodes the puromycin-sensitive aminopeptidase, a zinc metallopeptidase which hydrolyzes amino acids from the N-terminus of its substrate. The protein has been localized to both the cytoplasm and to cellular membranes. This enzyme degrades enkaphalins in the brain, and studies in mouse suggest that it is involved in proteolytic events regulating the cell cycle.
1735	This gene encodes a member of the regulator of G-protein signaling family. This protein contains one RGS domain, two Raf-like Ras-binding domains (RBDs), and one GoLoco domain. The protein attenuates the signaling activity of G-proteins by binding, through its GoLoco domain, to specific types of activated, GTP-bound G alpha subunits. Acting as a GTPase activating protein (GAP), the protein increases the rate of conversion of the GTP to GDP. This hydrolysis allows the G alpha subunits to bind G beta/gamma subunit heterodimers, forming inactive G-protein heterotrimers, thereby terminating the signal. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
1736	
1737	
1738	
1739	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily H. This member is a pore-forming (alpha) subunit. The gene is brain-specific, and located in the neocortex and the striatum. It may be involved in cellular excitability of restricted neurons in the central nervous system.
1740	The protein encoded by this gene is a GTPase which belongs to the RAS superfamily of small GTP-binding proteins. Members of this superfamily appear to regulate a diverse array of cellular events, including the control of cell growth, cytoskeletal reorganization, and the activation of protein kinases.
1741	
1742	The nuclear envelope creates distinct nuclear and cytoplasmic compartments in eukaryotic cells. It consists of two concentric membranes perforated by nuclear pores, large protein complexes that form aqueous channels to regulate the flow of macromolecules between the nucleus and the cytoplasm. These complexes are composed of at least 100 different polypeptide subunits, many of which belong to the nucleoporin family. The nucleoporin protein encoded by this gene displays evolutionarily conserved interactions with other nucleoporins. This protein, which localizes to both sides of the nuclear pore complex at interphase, remains associated with the complex during mitosis and is targeted at early stages to the reforming nuclear envelope. This protein also localizes to kinetochores of mitotic cells.
1743	
1744	Angiogenesis is controlled by a local balance between stimulators and inhibitors of new vessel growth and is suppressed under normal physiologic conditions. Angiogenesis has been shown to be essential for growth and metastasis of solid tumors. In order to obtain blood supply for their growth, tumor cells are potently angiogenic and attract new vessels as results of increased secretion of inducers and decreased production of endogenous negative regulators.  BAI1 contains at least one @#$%&functional@#$%& p53-binding site within an intron, and its expression has been shown to be induced by wildtype p53. There are two other brain-specific angiogenesis inhibitor genes, designated BAI2 and BAI3 which along with BAI1 have similar tissue specificities and structures, however only BAI1 is transcriptionally regulated by p53. BAI1 is postulated to be a member of the secretin receptor family, an inhibitor of angiogenesis and a growth suppressor of glioblastomas
1745	GABA is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA-A receptors, which are ligand-gated chloride channels. Chloride conductance of these channels can be modulated by agents such as benzodiazepines that bind to the GABA-A receptor. At least 16 distinct subunits of GABA-A receptors have been identified.
1746	
1747	HLA-B belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exon 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-B alleles have been described
1748	
1749	
1750	
1751	This gene, a member of the histidine triad gene family, encodes a diadenosine 5@#$%&,5@#$%&@#$%&@#$%&-P1,P3-triphosphate hydrolase involved in purine metabolism. The gene encompasses the common fragile site FRA3B on chromosome 3, where carcinogen-induced damage can lead to translocations and aberrant transcripts of this gene. In fact, aberrant transcripts from this gene have been found in about half of all esophageal, stomach, and colon carcinomas.
1752	This gene, a member of the histidine triad gene family, encodes a diadenosine 5@#$%&,5@#$%&@#$%&@#$%&-P1,P3-triphosphate hydrolase involved in purine metabolism. The gene encompasses the common fragile site FRA3B on chromosome 3, where carcinogen-induced damage can lead to translocations and aberrant transcripts of this gene. In fact, aberrant transcripts from this gene have been found in about half of all esophageal, stomach, and colon carcinomas.
1753	
1754	
1755	
1756	
1757	The import of proteins into the nucleus is a process that involves at least 2 steps. The first is an energy-independent docking of the protein to the nuclear envelope and the second is an energy-dependent translocation through the nuclear pore complex. Imported proteins require a nuclear localization sequence (NLS) which generally consists of a short region of basic amino acids or 2 such regions spaced about 10 amino acids apart. Proteins involved in the first step of nuclear import have been identified in different systems. These include the Xenopus protein importin and its yeast homolog, SRP1 (a suppressor of certain temperature-sensitive mutations of RNA polymerase I in Saccharomyces cerevisiae), which bind to the NLS. KPNA2 protein interacts with the NLSs of DNA helicase Q1 and SV40 T antigen and may be involved in the nuclear transport of proteins. KPNA2 also may play a role in V(D)J recombination
1758	Chemokines are a group of small (approximately 8 to 14 kD), mostly basic, structurally related molecules that regulate cell trafficking of various types of leukocytes through interactions with a subset of 7-transmembrane, G protein-coupled receptors. Chemokines also play fundamental roles in the development, homeostasis, and function of the immune system, and they have effects on cells of the central nervous system as well as on endothelial cells involved in angiogenesis or angiostasis. Chemokines are divided into 2 major subfamilies, CXC and CC. This gene is a CXC member of the chemokine superfamily. Its encoded protein induces a chemotactic response in activated T-cells and is the dominant ligand for CXC receptor-3. The gene encoding this protein contains 4 exons and at least three polyadenylation signals which might reflect cell-specific regulation of expression. IFN-gamma is a potent inducer of transcription of this gene.
1759	
1760	Members of the immunoglobulin (Ig) superfamily (see MIM 147100), which includes IGSF1, have a variety of functions, but all appear to play a role in cell recognition and the regulation of cell behavior.[supplied by OMIM]
1761	
1762	
1763	This gene encodes a member of the theta class glutathione S-transferase-like (GSTTL) protein family. In mouse, the encoded protein acts as a small stress response protein, likely involved in cellular redox homeostasis.
1764	
1765	Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. It is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, F0, which comprises the proton channel. The F1 complex consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled in a ratio of 3 alpha, 3 beta, and a single representative of the other 3. The F0 seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the e subunit of the F0 complex.
1766	
1767	The gamma-aminobutyric acid (GABA) A receptor is a multisubunit chloride channel that mediates the fastest inhibitory synaptic transmission in the central nervous system. This gene encodes GABA A receptor, beta 1 subunit. It is mapped to chromosome 4p12 in a cluster comprised of genes encoding alpha 4, alpha 2 and gamma 1 subunits of the GABA A receptor. Alteration of this gene is implicated in the pathogenetics of schizophrenia.
1768	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains an extracellular domain, a single transmembrane segment and two tandem intracytoplasmic catalytic domains, and thus belongs to receptor type PTP. This gene is specifically expressed in hematopoietic cells. This PTP has been shown to be an essential regulator of T- and B-cell antigen receptor signaling. It functions through either direct interaction with components of the antigen receptor complexes, or by activating various Src family kinases required for the antigen receptor signaling. This PTP also suppresses JAK kinases, and thus functions as a regulator of cytokine receptor signaling. Four alternatively spliced transcripts variants of this gene, which encode distinct isoforms, have been reported.
1769	
1770	
1771	
1772	Cytoplasmic ribosomes, organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes ribosomal protein S4, a component of the 40S subunit. Ribosomal protein S4 is the only ribosomal protein known to be encoded by more than one gene, namely this gene and ribosomal protein S4, Y-linked (RPS4Y). The 2 isoforms encoded by these genes are not identical, but are functionally equivalent. Ribosomal protein S4 belongs to the S4E family of ribosomal proteins. This gene is not subject to X-inactivation. It has been suggested that haploinsufficiency of the ribosomal protein S4 genes plays a role in Turner syndrome; however, this hypothesis is controversial. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
1773	
1774	This gene encodes a member of the polycystin protein family. The encoded protein contains multiple transmembrane domains, and cytoplasmic N- and C-termini. The protein may be an integral membrane protein involved in cell-cell/matrix interactions. This protein functions as a calcium-regulated nonselective cation channel. Alternative splice variants have been described but their full length sequences have not been determined.
1775	
1776	
1777	
1778	
1779	GABA is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA-A receptors, which are ligand-gated chloride channels. Chloride conductance of these channels can be modulated by agents such as benzodiazepines that bind to the GABA-A receptor. At least 16 distinct subunits of GABA-A receptors have been identified
1780	
1781	
1782	
1783	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which interacts specifically with hepatitis C virus core protein resulting a change in intracellular location. This gene has a homolog located in the nonrecombining region of the Y chromosome. The protein sequence is 91% identical between this gene and the Y-linked homolog. Alternative splicing of this gene generates 2 transcript variants, which are divergent in the 3@#$%& UTR.
1784	
1785	
1786	
1787	
1788	
1789	
1790	
1791	
1792	
1793	
1794	The protein encoded by this gene contains RNA recognition motif found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. In vitro studies showed that the rat homolog bound to RNA homopolymers, with a preference for G and U polyribonucleotides. This gene is part of a gene cluster on chromosome Xp11.23, and its 3@#$%& end lies within 20 kb upstream of UBE1. Two transcript variants encoding different isoforms have been identified for this gene.
1795	
1796	
1797	
1798	The reciprocal translocation t(1;3)(p36;q21) occurs in a subset of myelodysplastic syndrome (MDS) and acute myeloid leukemia (AML). This gene is located near the 1p36.3 breakpoint and has been shown to be specifically expressed in the t(1:3)(p36,q21)-positive MDS/AML. The protein encoded by this gene is a zinc finger transcription factor and contains an N-terminal PR domain. The translocation results in the overexpression of a truncated version of this protein that lacks the PR domain, which may play an important role in the pathogenesis of MDS and AML. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
1799	
1800	The protein encoded by this gene is a member of the E2F family of transcription factors. The E2F family plays a crucial role in the control of cell cycle and action of tumor suppressor proteins and is also a target of the transforming proteins of small DNA tumor viruses. The E2F proteins contain several evolutionally conserved domains found in most members of the family. These domains include a DNA binding domain, a dimerization domain which determines interaction with the differentiation regulated transcription factor proteins (DP), a transactivation domain enriched in acidic amino acids, and a tumor suppressor protein association domain which is embedded within the transactivation domain. This protein and another 2 members, E2F1 and E2F3, have an additional cyclin binding domain. This protein binds specifically to retinoblastoma protein pRB in a cell-cycle dependent manner, and it exhibits overall 46% amino acid identity to E2F1.
1801	
1802	
1803	
1804	
1805	This gene encodes two proteins: sterol carrier protein X (SCPx) and sterol carrier protein 2 (SCP2), as a result of transcription initiation from 2 independently regulated promoters. The transcript initiated from the proximal promoter encodes the longer SCPx protein, and the transcript initiated from the distal promoter encodes the shorter SCP2 protein, with the 2 proteins sharing a common C-terminus. Evidence suggests that the SCPx protein is a peroxisome-associated thiolase that is involved in the oxidation of branched chain fatty acids, while the SCP2 protein is thought to be an intracellular lipid transfer protein. This gene is highly expressed in organs involved in lipid metabolism, and may play a role in Zellweger syndrome, in which cells are deficient in peroxisomes and have impaired bile acid synthesis. Alternative splicing of this gene produces multiple transcript variants, some encoding different isoforms. The full-length nature of all transcript variants has not been determined.
1806	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. The product of this gene has not been shown to be a functional channel, however, it may require other non-pore-forming proteins for activity.
1807	
1808	This gene encodes a cytoplasmic enzyme in the trans-sulfuration pathway that converts cystathione derived from methionine into cysteine. Glutathione synthesis in the liver is dependent upon the availability of cysteine. Mutations in this gene cause cystathioninuria. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
1809	
1810	This gene encodes a mitochondrial membrane protein that participates in mitochondrial fusion and contributes to the maintenance and operation of the mitochondrial network. This protein may play a role in the pathophysiology of obesity.
1811	The protein encoded by this gene is a member of neuron-specific calcium-binding proteins family found in the retina and brain. This protein is associated with the plasma membrane. It has similarities to proteins located in the photoreceptor cells that regulate photosignal transduction in a calcium-sensitive manner. This protein displays recoverin activity and a calcium-dependent inhibition of rhodopsin kinase. It is identical to the rat and mouse hippocalcin proteins and thought to play an important role in neurons of the central nervous system in a number of species.
1812	Action potentials in vertebrate neurons are followed by an afterhyperpolarization (AHP) that may persist for several seconds and may have profound consequences for the firing pattern of the neuron. Each component of the AHP is kinetically distinct and is mediated by different calcium-activated potassium channels. The protein encoded by this gene is activated before membrane hyperpolarization and is thought to regulate neuronal excitability by contributing to the slow component of synaptic AHP. The encoded protein is an integral membrane protein that forms a voltage-independent calcium-activated channel with three other calmodulin-binding subunits. This gene contains two CAG repeat regions in the coding sequence. It was thought that expansion of one or both of these repeats could lead to an increased susceptibility to schizophrenia or bipolar disorder, but studies indicate that this is probably not the case. This gene is a member of the KCNN family of potassium channel genes. Two transcript variants encoding two different isoforms have been found for this gene. One of the variants lacks the CAG repeat regions.
1813	This gene encodes a member of the membrane fatty acid desaturase family which is responsible for inserting double bonds into specific positions in fatty acids. This protein contains three His-containing consensus motifs that are characteristic of a group of membrane fatty acid desaturases. It is predicted to be a multiple membrane-spanning protein localized to the endoplasmic reticulum. Overexpression of this gene inhibited biosynthesis of the EGF receptor, suggesting a possible role of a fatty acid desaturase in regulating biosynthetic processing of the EGF receptor. Two splice variants have been identified.
1814	
1815	
1816	
1817	
1818	
1819	
1820	
1821	Chitotriosidase is secreted by activated human macrophages and is markedly elevated in plasma of Gaucher disease patients. The expression of chitotriosidase occurs only at a late stage of differentiation of monocytes to activated macrophages in culture. Human macrophages can synthesize a functional chitotriosidase, a highly conserved enzyme with a strongly regulated expression. This enzyme may play a role in the degradation of chitin-containing pathogens.
1822	
1823	
1824	
1825	The protein encoded by this gene belongs to a distinct class of mammalian adenylyl cyclase that is soluble and insensitive to G protein or forskolin regulation. It is localized in the cytoplasm and is thought to function as a general bicarbonate sensor throughout the body. It may also play an important role in the generation of cAMP in spermatozoa, implying possible roles in sperm maturation through the epididymis, capacitation, hypermotility, and/or the acrosome reaction.
1826	
1827	
1828	
1829	This gene encodes disintegrin and metalloprotease (ADAM) domain 30, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. This gene is testis-specific and contains a polymorphic region, resulting in isoforms with varying numbers of C-terminal repeats.
1830	
1831	The SET domain is a highly conserved, approximately 150-amino acid motif implicated in the modulation of chromatin structure. It was originally identified as part of a larger conserved region present in the Drosophila Trithorax protein and was subsequently identified in the Drosophila Su(var)3-9 and @#$%&Enhancer of zeste@#$%& proteins, from which the acronym SET is derived. Studies have suggested that the SET domain may be a signature of proteins that modulate transcriptionally active or repressed chromatin states through chromatin remodeling activities.[supplied by OMIM]
1832	The SET domain is a highly conserved, approximately 150-amino acid motif implicated in the modulation of chromatin structure. It was originally identified as part of a larger conserved region present in the Drosophila Trithorax protein and was subsequently identified in the Drosophila Su(var)3-9 and @#$%&Enhancer of zeste@#$%& proteins, from which the acronym SET is derived. Studies have suggested that the SET domain may be a signature of proteins that modulate transcriptionally active or repressed chromatin states through chromatin remodeling activities.[supplied by OMIM]
1833	
1834	DPH2L2 encodes diphthamide biosynthesis-like protein 2, one of two human genes similar to the yeast gene dph2. The yeast gene was identified by its ability to complement a diphthamide mutant strain, and thus probably functions in diphthamide biosynthesis. Diphthamide is a post-translationally modified histidine residue present in elongation factor 2 (EF2) that is the target of diphtheria toxin ADP-ribosylation. Two transcript variants encoding different isoforms have been found for this gene.
1835	
1836	
1837	
1838	The signal sequence receptor (SSR) is a glycosylated endoplasmic reticulum (ER) membrane receptor associated with protein translocation across the ER membrane. The SSR consists of 2 subunits, a 34-kD glycoprotein (alpha-SSR or SSR1) and a 22-kD glycoprotein (beta-SSR or SSR2). The human beta-signal sequence receptor gene (SSR2) maps to chromosome bands 1q21-q23.
1839	
1840	
1841	
1842	
1843	
1844	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in motility, invasion, and tubulin polymerization. Chromosomal rearrangements and altered expression of this gene have been implicated in tumor metastasis.  Alternative splicing of the 5@#$%& UTR results in two gene products.
1845	
1846	This gene encodes a member of the aquaglyceroporin family of integral membrane proteins. Members of this family function as water-permeable channels in the epithelia of organs that absorb and excrete water. This protein was shown to function as a water-selective channel, and could also permeate neutral solutes such as glycerol and urea.
1847	6-phosphogluconate dehydrogenase is the second dehydrogenase in the pentose phosphate shunt. Deficiency of this enzyme is generally asymptomatic, and the inheritance of this disorder is autosomal dominant. Hemolysis results from combined deficiency of 6-phosphogluconate dehydrogenase and 6-phosphogluconolactonase suggesting a synergism of the two enzymopathies.
1848	The piggyBac family of proteins, found in diverse animals, are transposases related to the transposase of the canonical piggyBac transposon from the moth, Trichoplusia ni. This family also includes genes in several genomes, including human, that appear to have been derived from the piggyBac transposons. This gene belongs to the subfamily of piggyBac transposable element derived (PGBD) genes. The PGBD proteins appear to be novel, with no obvious relationship to other transposases, or other known protein families. The exact function of this gene is not known.
1849	
1850	The protein encoded by this gene shares considerable sequence similarity to, and is structurally related to DNase II. The latter is a well characterized endonuclease that catalyzes DNA hydrolysis in the absence of divalent cations at acidic pH. Unlike DNase II which is ubiquitously expressed, expression of this gene product is restricted to the salivary gland and lungs. The gene has been localized to chromosome 1p22.3 adjacent (and in opposite orientation) to the uricase pseudogene. Two transcript variants encoding different isoforms have been described for this gene.
1851	This gene encodes a ubiquitin-specific protease, an enzyme that removes ubiquitin from ubiquitinated proteins. The encoded protein belongs to the C19 peptidase family, also known as family 2 of ubiquitin carboxyl-terminal hydrolases. This protein has been reported to be capable of removing NEDD8 from NEDD8 conjugates. Alternatively spliced transcript variants encoding different isoforms have been identified.
1852	Angiopoietins are members of the vascular endothelial growth factor family and the only known growth factors largely specific for vascular endothelium. Angiopoietin-1, angiopoietin-2, and angiopoietin-4 participate in the formation of blood vessels. The protein encoded by this gene is another member of the angiopoietin family that is widely expressed in adult tissues with mRNA levels highest in highly vascularized tissues. This protein was found to be a secretory protein that does not act as an endothelial cell mitogen in vitro.
1853	This gene encodes a mitochondrial ATPase inhibitor. Alternative splicing occurs at this locus and three transcript variants encoding distinct isoforms have been identified.
1854	
1855	The protein encoded by this gene is a cytokine produced primarily by monocytes and to a lesser extent by lymphocytes. This cytokine has pleiotropic effects in immunoregulation and inflammation. It down-regulates the expression of Th1 cytokines, MHC class II Ags, and costimulatory molecules on macrophages. It also enhances B cell survival, proliferation, and antibody production. This cytokine can block NF-kappa B activity, and is involved in the regulation of the JAK-STAT signaling pathway. Knockout studies in mice suggested the function of this cytokine as an essential immunoregulator in the intestinal tract.
1856	CAPZB is a member of the F-actin capping protein family. This gene encodes the beta subunit of the barbed-end actin binding protein.  The protein regulates growth of the actin filament by capping the barbed end of growing actin filaments.
1857	
1858	
1859	This gene encodes coagulation factor III which is a cell surface glycoprotein. This factor enables cells to initiate the blood coagulation cascades, and it functions as the high-affinity receptor for the coagulation factor VII. The resulting complex provides a catalytic event that is responsible for initiation of the coagulation protease cascades by specific limited proteolysis. Unlike the other cofactors of these protease cascades, which circulate as nonfunctional precursors, this factor is a potent initiator that is fully functional when expressed on cell surfaces. There are 3 distinct domains of this factor: extracellular, transmembrane, and cytoplasmic. This protein is the only one in the coagulation pathway for which a congenital deficiency has not been described.
1860	This gene belongs to the class II amino-acyl tRNA family. The encoded enzyme catalyzes the transfer of L-serine to tRNA (Ser) and is related to bacterial and yeast counterparts.
1861	Prosaposin is a 511-amino acid glycoprotein which is a precursor for 4 saposins that are designated A, B, C, and D (Morimoto et al., 1989). Each domain of the precursor protein is approximately 80 amino acid residues long; has nearly identical placement of cysteine residues, glycosylation sites, and helicoregions; and is flanked by proteolytic cleavage sites. Proteolytic cleavage of prosaposin at each domain boundary is predicted to give rise to 4 saposin proteins. The PSAP gene can be said to be polycistronic.[supplied by OMIM]
1862	This gene encodes one of seven subunits of the human Arp2/3 protein complex. The Arp2/3 protein complex has been implicated in the control of actin polymerization in cells and has been conserved through evolution. The exact role of the protein encoded by this gene, the p16 subunit, has yet to be determined.
1863	
1864	The protein encoded by this gene is a cytokine that belongs to the tumor necrosis factor (TNF) ligand family. This cytokine is a ligand for receptor TNFRSF4/OX4. It is found to be involved in T cell antigen-presenting cell (APC) interactions. In surface Ig- and CD40-stimulated B cells, this cytokine along with CD70 has been shown to provide CD28-independent costimulatory signals to T cells. This protein and its receptor are reported to directly mediate adhesion of activated T cells to vascular endothelial cells.
1865	
1866	Glial cell line-derived neurotrophic factor (GDNF) and neurturin (NTN) are two structurally related, potent neurotrophic factors that play key roles in the control of neuron survival and differentiation. The protein encoded by this gene is a member of the GDNF receptor family. It is a glycosylphosphatidylinositol(GPI)-linked cell surface receptor for both GDNF and NTN, and mediates activation of the RET tyrosine kinase receptor. This gene is a candidate gene for Hirschsprung disease. Two alternatively spliced transcipt variants encoding different isoforms have been described for this gene.
1867	KIF21B belongs to a family of plus end-directed kinesin (see MIM 600025) motor proteins. Neurons use kinesin and dynein (see MIM 600112) microtubule-dependent motor proteins to transport essential cellular components along axonal and dendritic microtubules.[supplied by OMIM]
1868	
1869	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptor was identified as a cellular mediator of herpes simplex virus (HSV) entry. Binding of HSV viral envelope glycoprotein D (gD) to this receptor protein has been shown to be part of the viral entry mechanism. The cytoplasmic region of this receptor was found to bind to several TRAF family members, which may mediate the signal transduction pathways that activate the immune response.
1870	
1871	
1872	
1873	The protein encoded by this gene has been found to form a complex with EPS8 and SOS1, and is thought to be involved in the transduction of signals from Ras to Rac. In addition, the encoded protein may play a role in the regulation of EGF-induced Erk pathway activation as well as cytoskeletal reorganization and EGFR signaling. Several transcript variants encoding multiple isoforms have been found for this gene.
1874	
1875	Cubilin (CUBN) acts as a receptor for intrinsic factor-vitamin B12 complexes. The role of receptor is supported by the presence of 27 CUB domains. Cubulin is located within the epithelium of intestine and kidney. Mutations in CUBN may play a role in autosomal recessive megaloblastic anemia.
1876	
1877	This gene encodes a protein belonging to the pantothenate kinase family. Pantothenate kinase is a key regulatory enzyme in the biosynthesis of coenzyme A (CoA) in bacteria and mammalian cells. It catalyzes the first committed step in the universal biosynthetic pathway leading to CoA and is itself subject to regulation through feedback inhibition by CoA. Alternative splicing has been observed at this locus and three variants, each encoding a distinct isoform, have been identified.
1878	Semaphorins are a large family of conserved secreted and membrane associated proteins which possess a semaphorin (Sema) domain and a PSI domain (found in plexins, semaphorins and integrins) in the N-terminal extracellular portion. Based on sequence and structural similarities, semaphorins are put into eight classes: invertebrates contain classes 1 and 2, viruses have class V, and vertebrates contain classes 3-7. Semaphorins serve as axon guidance ligands via multimeric receptor complexes, some (if not all) containing plexin proteins. This gene encodes a class 4 semaphorin. This gene and the gene for mitochondrial ribosomal protein L43 overlap at map location 10q24.31 and are transcribed in opposite directions.
1879	The protein encoded by this gene is similar to PPAR-gamma coactivator 1 (PPARGC1/PGC-1), a protein that can activate mitochondrial biogenesis in part through a direct interaction with nuclear respiratory factor 1 (NRF1). This protein has been shown to interact with NRF1. It is thought to be a functional relative of PPARGC1 that activates mitochondrial biogenesis through NRF1 in response to proliferative signals.
1880	
1881	
1882	
1883	
1884	
1885	
1886	
1887	
1888	
1889	
1890	Pre-mRNA splicing occurs in 2 sequential transesterification steps. The protein encoded by this gene is found to be essential for the catalytic step II in pre-mRNA splicing process. It is found in the spliceosome, and contains seven WD repeats, which function in protein-protein interactions. This protein has a sequence similarity to the yeast splicing factor Prp18.
1891	This gene encodes a member of a subclass of the HSP40/DnaJ protein family. Members of this family of proteins are associated with complex assembly, protein folding, and export. Two transcript variants encoding distinct isoforms have been identified for this gene.
1892	The protein encoded by this gene belongs to the GSK-3-binding protein family. It may function in tumor progression and in lymphomagenesis. Alternative splicing occurs at this locus and two transcript variants encoding the same isoform have been identified.
1893	
1894	
1895	The interleukin 2 (IL2) receptor alpha (IL2RA) and beta (IL2RB) chains, together with the common gamma chain (IL2RG), constitute the high-affinity IL2 receptor. Homodimeric alpha chains (IL2RA) result in low-affinity receptor, while homodimeric beta (IL2RB) chains produce a medium-affinity receptor. Normally an integral-membrane protein, soluble IL2RA has been isolated and determined to result from extracellular proteolyisis.  Alternately-spliced IL2RA mRNAs have been isolated, but the significance of each is presently unknown.
1896	
1897	The protein encoded by this gene is a member of the TGF-beta superfamily. Studies of the mouse counterpart suggested that this gene may be essential for mesoderm formation and subsequent organization of axial structures in early embryonic development.
1898	
1899	
1900	
1901	
1902	
1903	
1904	
1905	
1906	This gene encodes one of two high affinity forms of a receptor for melatonin, the primary hormone secreted by the pineal gland. This gene product is an integral membrane protein that is a G-protein coupled, 7-transmembrane receptor. It is found primarily in the retina and brain although this detection requires RT-PCR. It is thought to participate in light-dependent functions in the retina and may be involved in the neurobiological effects of melatonin.
1907	
1908	This gene is the cellular homolog of the fox sequence in the Finkel-Biskis-Reilly murine sarcoma virus (FBR-MuSV). It encodes a fusion protein consisting of the ubiquitin-like protein fubi at the N terminus and ribosomal protein S30 at the C terminus. It has been proposed that the fusion protein is post-translationally processed to generate free fubi and free ribosomal protein S30. Fubi is a member of the ubiquitin family, and ribosomal protein S30 belongs to the S30E family of ribosomal proteins. Whereas the function of fubi is currently unknown, ribosomal protein S30 is a component of the 40S subunit of the cytoplasmic ribosome. Pseudogenes derived from this gene are present in the genome. Similar to ribosomal protein S30, ribosomal proteins S27a and L40 are synthesized as fusion proteins with ubiquitin.
1909	The protein encoded by this gene is a member of the d4 domain family, characterized by a zinc finger-like structural motif. This protein functions as a transcription factor which is necessary for the apoptotic response following deprivation of survival factors. It likely serves a regulatory role in rapid hematopoietic cell growth and turnover. This gene is considered a candidate gene for multiple endocrine neoplasia type I, an inherited cancer syndrome involving multiple parathyroid, enteropancreatic, and pituitary tumors.
1910	The recognition of complex carbohydrate structures on glycoproteins is an important part of several biological processes, including cell-cell recognition, serum glycoprotein turnover, and neutralization of pathogens. The protein encoded by this gene is a type I membrane receptor that mediates the endocytosis of glycoproteins by macrophages. The protein has been shown to bind high-mannose structures on the surface of potentially pathogenic viruses, bacteria, and fungi so that they can be neutralized by phagocytic engulfment. This gene is in close proximity to MRC1L1. The gene loci including this gene, MRC1L1, as well as LOC340843 and LOC340893, consist of two nearly identical, tandemly linked genomic regions, which are thought to be a part of a duplicated region.
1911	The product of this gene is a mitogen induced GTPase activating protein (GAP). It exhibits a specific GAP activity for Ras-related regulatory proteins Rap1 and Rap2, but not for Ran or other small GTPases. This protein may also hamper mitogen-induced cell cycle progression when abnormally or prematurely expressed. It is localized to the perinuclear region. Two alternatively spliced variants encoding the same isoform have been characterized to date.
1912	
1913	Copper chaperone for superoxide dismutase specifically delivers Cu to copper/zinc superoxide dismutase and may activate copper/zinc superoxide dismutase through direct insertion of the Cu cofactor.
1914	
1915	This gene encodes gamma butyrobetaine hydroxylase which catalyzes the formation of L-carnitine from gamma-butyrobetaine, the last step in the L-carnitine biosynthetic pathway. Carnitine is essential for the transport of activated fatty acids across the mitochondrial membrane during mitochondrial beta-oxidation. This protein is 387 amino acids with a predicted molecular weight of 44.7 kDa. Its expression is high in kidney, moderate in liver, and very low in brain, while no expression was detected in other investigated tissues.
1916	
1917	
1918	
1919	
1920	LOXL4 is a member of the lysyl oxidase (LOX; MIM 153455) family of amine oxidases that catalyze the final enzymatic step in the lysine-derived cross-links of collagen fibrils and insoluble elastic fibers within the extracellular matrix. These enzymes may also contribute to developmental regulation, senescence, tumor suppression, cell growth control, and chemotaxis.[supplied by OMIM]
1921	
1922	
1923	
1924	N-methylation is one method by which drug and other xenobiotic compounds are metabolized by the liver. This gene encodes the protein responsible for this enzymatic activity which uses S-adenosyl methionine as the methyl donor.
1925	
1926	This gene encodes a member of the serine protease family. Serine proteases are known to be involved in a variety of biological processes, whose malfunction often leads to human diseases and disorders. This gene was identified as a gene overexpressed in pancreatic carcinoma. The encoded protein is membrane bound with a N-terminal anchor sequence and a glycosylated extracellular region containing the serine protease domain. Two alternative transcripts encoding different isoforms have been described.
1927	
1928	The protein encoded by this gene belongs to the eukaryotic diacylglycerol kinase family. It may attenuate protein kinase C activity by regulating diacylglycerol levels in intracellular signaling cascade and signal transduction. Alternative splicing occurs at this locus and three transcript variants encoding three distinct isoforms have been identified.
1929	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form a complex with G proteins and stimulate Rho-dependent signals. This protein is observed to form myeloid/lymphoid fusion partner in acute myeloid leukemia.
1930	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
1931	
1932	
1933	
1934	
1935	
1936	
1937	The protein encoded by this gene is a member of the lipophilin subfamily, part of the uteroglobin superfamily, and is an ortholog of prostatein, the major secretory glycoprotein of the rat ventral prostate gland. This gene product represents one component of a heterodimeric molecule present in human tears whose elution profile is consistent with prostatein, a tetrameric molecule composed of three peptide components in heterodimers. Assuming that human lipophilins are the functional counterparts of prostatein, they may be transcriptionally regulated by steroid hormones, with the ability to bind androgens, other steroids and possibly bind and concentrate estramustine, a chemotherapeutic agent widely used for prostate cancer. Although the gene has been reported to be on chromosome 15, this sequence appears to be from a cluster of genes on chromosome 11 that includes mammaglobin 2.
1938	
1939	
1940	
1941	
1942	
1943	The protein encoded by this intronless gene belongs to the GSK-3-binding protein family. Studies show that this protein plays a role as a positive regulator of the WNT signaling pathway. It may be upregulated in tumor progression.
1944	FXC1, or TIMM10B, belongs to a family of evolutionarily conserved proteins that are organized in heterooligomeric complexes in the mitochondrial intermembrane space. These proteins mediate the import and insertion of hydrophobic membrane proteins into the mitochondrial inner membrane.[supplied by OMIM]
1945	This gene is one of several tumor-suppressing subtransferable fragments located in the imprinted gene domain of 11p15.5, an important tumor-suppressor gene region. Alterations in this region have been associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocortical carcinoma, and lung, ovarian, and breast cancer. This gene may play a role in malignancies and disease that involve this region as well as the transport of chloroquine- and quinidine-related compounds in the kidney. Two alternative transcripts encoding the same isoform have been described.
1946	
1947	
1948	
1949	
1950	This gene encodes a nuclear protein, which is a tyrosine kinase belonging to the Ser/Thr family of protein kinases. This protein catalyzes the inhibitory tyrosine phosphorylation of CDC2/cyclin B kinase, and appears to coordinate the transition between DNA replication and mitosis by protecting the nucleus from cytoplasmically activated CDC2 kinase.
1951	Adrenomedullin, a hypotensive peptide found in human pheochromocytoma, consists of 52 amino acids, has 1 intramolecular disulfide bond, and shows a slight homology with the calcitonin gene-related peptide. It may function as a hormone in circulation control because it is found in blood in a considerable concentration. The precursor, called preproadrenomedullin, is 185 amino acids long. By RNA-blot analysis, human adrenomedullin mRNA was found to be highly expressed in several tissues. Genomic ADM DNA consists of 4 exons and 3 introns, with the 5-prime flanking region containing TATA, CAAT, and GC boxes. There are also multiple binding sites for activator protein-2 and a cAMP-regulated enhancer element.
1952	This gene encodes a component of prolyl 4-hydroxylase, a key enzyme in collagen synthesis composed of two identical alpha subunits and two beta subunits. The encoded protein is one of several different types of alpha subunits and provides the major part of the catalytic site of the active enzyme. In collagen and related proteins, prolyl 4-hydroxylase catalyzes the formation of 4-hydroxyproline that is essential to the proper three-dimensional folding of newly synthesized procollagen chains. Alternatively spliced transcript variants have been observed, but their full-length nature has not been determined.
1953	The protein encoded by this gene is a lysosomal prolylcarboxypeptidase, which cleaves C-terminal amino acids linked to proline in peptides such as angiotension II, III and des-Arg9-bradykinin. The cleavage occurs at acidic pH, but the enzyme activity is retained with some substrates at neutral pH. This enzyme has been shown to be an activator of the cell matrix-associated prekallikrein. The importance of angiotension II, one of the substrates of this enzyme, in regulating blood pressure and electrolyte balance suggests that this gene may be related to essential hypertension. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
1954	
1955	
1956	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
1957	The protein encoded by this gene belongs to the BCL2 protein family. BCL2 family members form hetero- or homodimers and act as anti- or pro-apoptotic regulators that are involved in a wide variety of cellular activities. Overexpression of this gene has been shown to induce apoptosis in cells. Four alternatively spliced transcript variants, which encode three distinct isoforms, have been reported for this gene.
1958	
1959	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein, which shares sequence similarity with the yeast ribosomal protein YL41, belongs to the L41E family of ribosomal proteins. It is located in the cytoplasm. The protein can interact with the beta subunit of protein kinase CKII and can stimulate the phosphorylation of DNA topoisomerase II-alpha by CKII. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
1960	The protein encoded by this gene is a member of the SWI/SNF family of proteins, whose members display helicase and ATPase activities and which are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI and contains a predicted leucine zipper motif typical of many transcription factors. Two transcript variants encoding different isoforms have been found for this gene.
1961	ITGA7 encodes integrin alpha chain 7. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. Alpha chain 7 undergoes post-translational cleavage within the extracellular domain to yield disulfide-linked light and heavy chains that join with beta 1 to form an integrin that binds to the extracellular matrix protein laminin-1. Alpha 7 beta 1 is the major integrin complex expressed in differentiated muscle cells. Splice variants of alpha 7 that differ in both the extracellular and cytoplasmic domains exist in the mouse; however, to date only a single human transcript type has been isolated: it contains extracellular and cytoplasmic domains corresponding to the mouse X2 and B variants, respectively. A unique extracellular splice variant has been identified in human, although it clearly represents a minor species and its biological significance is unclear.
1962	
1963	RAN (ras-related nuclear protein) is a small GTP binding protein belonging to the RAS superfamily that is essential for the translocation of RNA and proteins through the nuclear pore complex. The RAN protein is also involved in control of DNA synthesis and cell cycle progression. Nuclear localization of RAN requires the presence of regulator of chromosome condensation 1 (RCC1). Mutations in RAN disrupt DNA synthesis. Because of its many functions, it is likely that RAN interacts with several other proteins. RAN regulates formation and organization of the microtubule network independently of its role in the nucleus-cytosol exchange of macromolecules. RAN could be a key signaling molecule regulating microtubule polymerization during mitosis. RCC1 generates a high local concentration of RAN-GTP around chromatin which, in turn, induces the local nucleation of microtubules.  RAN is an androgen receptor (AR) coactivator that binds differentially with different lengths of polyglutamine within the androgen receptor. Polyglutamine repeat expansion in the AR is linked to Kennedy@#$%&s disease (X-linked spinal and bulbar muscular atrophy). RAN coactivation of the AR diminishes with polyglutamine expansion within the AR, and this weak coactivation may lead to partial androgen insensitivity during the development of Kennedy@#$%&s disease.
1964	Microtubules of the eukaryotic cytoskeleton perform essential and diverse functions and are composed of a heterodimer of alpha and beta tubulins. The genes encoding these microtubule constituents belong to the tubulin superfamily, which is composed of six distinct families. Genes from the alpha, beta and gamma tubulin families are found in all eukaryotes. The alpha and beta tubulins represent the major components of microtubules, while gamma tubulin plays a critical role in the nucleation of microtubule assembly. There are multiple alpha and beta tubulin genes, which are highly conserved among species. This gene encodes alpha tubulin and is highly similar to mouse and rat Tuba1 gene. Northern blotting studies have shown that the gene expression is predominantly found in morphologically differentiated neurologic cells. This gene is one of three alpha-tubulin genes in a cluster on chromosome 12q.
1965	
1966	This gene encodes a lymphoid-specific helicase. Other helicases function in processes involving DNA strand separation, including replication, repair, recombination, and transcription. This protein is thought to be involved with cellular proliferation and may play a role in leukemogenesis. Alternatively spliced transcript variants have been described, but their biological validity has not been determined.
1967	
1968	
1969	
1970	Prefoldin is a heterohexameric chaperone protein which assists in the correct folding of other proteins. It binds specifically to cytosolic chaperonin and transfers target proteins. Prefoldin may function by selectively targeting nascent actin and tubulin chains pending their transfer to cytosolic chaperonin for final folding and/or assembly. It promotes folding in an environment in which there are many competing pathways for nonnative proteins. This gene encodes a subunit of prefoldin, prefoldin 5. This subunit is associated with c-myc and is thought to repress the transcriptional activity of c-myc.
1971	The protein encoded by this gene is a member of the HEM family of tissue-specific transmembrane proteins which are highly conserved from invertebrates through mammals. This gene is only expressed in hematopoietic cells, while hematopoietic protein 2 is preferentially expressed in brain, heart, liver and testis. The function of the HEM1 product has not been established but it is thought to play an essential role in oogenesis.
1972	This gene encodes a member of the STAT-induced STAT inhibitor (SSI), also known as suppressor of cytokine signaling (SOCS), family. SSI family members are cytokine-inducible negative regulators of cytokine signaling. The expression of this gene can be induced by a subset of cytokines, including erythropoietin, GM-CSF, IL10 and interferon (IFN)-gamma. The protein encoded by this gene is found to interact with the cytoplasmic domain of insulin-like growth factor-1 receptor (IGF1R), and thus is thought to be involved in the regulation of IGF1R mediated cell signaling. Knockout studies in mice also suggested a regulatory role of this gene in IGF-1 related growth control.
1973	This gene encodes a member of the steroid-thyroid hormone-retinoid receptor superfamily. Expression is induced by phytohemagglutinin in human lymphocytes and by serum stimulation of arrested fibroblasts. The encoded protein acts as a nuclear transcription factor. Translocation of the protein from the nucleus to mitochondria induces apoptosis. Three transcript variants encoding two distinct isoforms have been identified for this gene.
1974	The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is specifically expressed in the spinous and granular layers of the epidermis with family member KRT10 and mutations in these genes have been associated with bullous congenital ichthyosiform erythroderma. The type II cytokeratins are clustered in a region of chromosome 12q12-q13.
1975	
1976	The protein encoded by this gene is a proline-rich salivary protein. This gene and five other genes that also encode salivary proline-rich proteins (PRPs), as well as a gene encoding a lacrimal gland PRP, form a PRP gene cluster in the chromosomal 12p13 region. Alternatively spliced transcript variants encoding distinct isoforms have been described.
1977	
1978	This gene is one of two human homologs of Drosophila bicaudal-D. It has been implicated in COPI-independent membrane transport from the Golgi apparatus to the endoplasmic reticulum. Two alternative splice variants have been described. Other alternative splice variants that encode different protein isoforms have been described but their full-length nature has not been determined.
1979	The protein encoded by this gene belongs to the family of P-type primary ion transport ATPases characterized by the formation of an aspartyl phosphate intermediate during the reaction cycle. These enzymes remove bivalent calcium ions from eukaryotic cells against very large concentration gradients and play a critical role in intracellular calcium homeostasis. The mammalian plasma membrane calcium ATPase isoforms are encoded by at least four separate genes and the diversity of these enzymes is further increased by alternative splicing of transcripts. The expression of different isoforms and splice variants is regulated in a developmental, tissue- and cell type-specific manner, suggesting that these pumps are functionally adapted to the physiological needs of particular cells and tissues. This gene encodes the plasma membrane calcium ATPase isoform 1. Alternatively spliced transcript variants encoding different isoforms have been identified.
1980	
1981	
1982	
1983	
1984	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds to the immunosuppressants FK506 and rapamycin. It has high structural and functional similarity to FK506-binding protein 1A (FKBP1A), but unlike FKBP1A, this protein does not have immunosuppressant activity when complexed with FK506. It interacts with interferon regulatory factor-4 and plays an important role in immunoregulatory gene expression in B and T lymphocytes. This encoded protein is known to associate with phytanoyl-CoA alpha-hydroxylase. It can also associate with two heat shock proteins (hsp90 and hsp70) and thus may play a role in the intracellular trafficking of hetero-oligomeric forms of the steroid hormone receptors. This protein correlates strongly with adeno-associated virus type 2 vectors (AAV) resulting in a significant increase in AAV-mediated transgene expression in human cell lines. Thus this encoded protein is thought to have important implications for the optimal use of AAV vectors in human gene therapy. This gene has been found to have multiple polyadenylation sites.
1985	
1986	This gene belongs to a group of vacuolar protein sorting (VPS) genes that, when functionally impaired, disrupt the efficient delivery of vacuolar hydrolases. The protein encoded by this gene is a component of a large multimeric complex, termed the retromer complex, which is involved in retrograde transport of proteins from endosomes to the trans-Golgi network. This VPS protein may be involved in the formation of the inner shell of the retromer coat for retrograde vesicles leaving the prevacuolar compartment. Alternative splice variants encoding different isoforms, and usage of multiple polyadenylation sites have been found for this gene.
1987	The protein encoded by this gene contains a RINF finger, a motif known to be involved in protein-protein and protein-DNA interactions. This protein interacts with DNAJA3/hTid-1, which is a DnaJ protein reported to function as a modulator of apoptosis. Overexpression of this gene in Hela cells was shown to confer the resistance to TNF-alpha induced apoptosis, suggesting an anti-apoptotic function of this protein. This protein can be cleaved by caspase-3 during the induction of apoptosis. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
1988	
1989	
1990	
1991	
1992	This gene encodes a member of the F-box protein family, members of which are characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into three classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbx class. It contains an F-box motif and seven conserved helicase motifs, and has both DNA-dependent ATPase and DNA unwinding activities. Alternatively spliced transcript variants encoding distinct isoforms have been identified for this gene.
1993	This gene was identified by its overexpression specifically in herpesvirus samimiri-transformed T cells. The encoded protein is a member of the IL10 family of cytokines. It is a secreted protein and may function as a homodimer. This protein is thought to contribute to the transformed phenotype of T cells after infection by herpesvirus samimiri.
1994	
1995	
1996	
1997	
1998	
1999	
2000	Histone acetylation plays a key role in the regulation of eukaryotic gene expression. Histone acetylation and deacetylation are catalyzed by multisubunit complexes. The protein encoded by this gene is a component of the histone deacetylase complex which includes SIN3, SAP30, HDAC1, HDAC2, RbAp46, RbAp48 and other polypeptides. This protein directly interacts with SIN3 and enhances SIN3-mediated transcriptional repression when tethered to the promoter.
2001	Microtubules of the eukaryotic cytoskeleton perform essential and diverse functions and are composed of a heterodimer of alpha and beta tubulin. The genes encoding these microtubule constituents are part of the tubulin superfamily, which is composed of six distinct families. Genes from the alpha, beta and gamma tubulin families are found in all eukaryotes. The alpha and beta tubulins represent the major components of microtubules, while gamma tubulin plays a critical role in the nucleation of microtubule assembly. There are multiple alpha and beta tubulin genes and they are highly conserved among and between species. This gene is an alpha tubulin gene that encodes a protein identical to the mouse testis-specific Tuba3 and Tuba7 gene products. This gene is located in the 13q11 region, which is associated with the genetic diseases Clouston hidrotic ectodermal dysplasia and Kabuki syndrome. Alternative splicing has been observed for this gene and two variants have been identified.
2002	
2003	
2004	
2005	This gene encodes a tumor suppressor protein that can induce cell growth arrest and apoptosis. The encoded protein is a nuclear protein that physically interacts with the tumor suppressor protein TP53 and is a component of the p53 signaling pathway. Reduced expression and rearrangement of this gene have been detected in various cancers. Multiple alternatively spliced transcript variants encoding distinct isoforms have been reported.
2006	
2007	
2008	This gene encodes a protein belonging to the sprouty family. The encoded protein contains a carboxyl-terminal cysteine-rich domain essential for the inhibitory activity on receptor tyrosine kinase signaling proteins and is required for growth factor stimulated translocation of the protein to membrane ruffles. In primary dermal endothelial cells this gene is transiently upregulated in response to fibroblast growth factor two. This protein is indirectly involved in the non-cell autonomous inhibitory effect on fibroblast growth factor two signaling. The protein interacts with Cas-Br-M (murine) ectropic retroviral transforming sequence, and can function as a bimodal regulator of epidermal growth factor receptor/mitogen-activated protein kinase signaling. This protein may play a role in alveoli branching during lung development as shown by a similar mouse protein.
2009	
2010	Arginase catalyzes the hydrolysis of arginine to ornithine and urea.  At least two isoforms of mammalian arginase exists (types I and II) which differ in their tissue distribution, subcellular localization, immunologic crossreactivity and physiologic function.  The type II isoform encoded by this gene, is located in the mitochondria and expressed in extra-hepatic tissues, especially kidney.  The physiologic role of this isoform is poorly understood; it is thought to play a role in nitric oxide and polyamine metabolism.  Transcript variants of the type II gene resulting from the use of alternative polyadenylation sites have been described.
2011	
2012	GTP-binding proteins are GTPases and function as molecular switches that can flip between two states: active, when GTP is bound, and inactive, when GDP is bound. @#$%&Active@#$%& in this context usually means that the molecule acts as a signal to trigger other events in the cell. When an extracellular ligand binds to a G-protein-linked receptor, the receptor changes its conformation and switches on the trimeric G proteins that associate with it by causing them to eject their GDP and replace it with GTP. The switch is turned off when the G protein hydrolyzes its own bound GTP, converting it back to GDP. But before that occurs, the active protein has an opportunity to diffuse away from the receptor and deliver its message for a prolonged period to its downstream target.
2013	This gene encodes a receptor for bradykinin. The 9 aa bradykinin peptide elicits many responses including vasodilation, edema, smooth muscle spasm and pain fiber stimulation. This receptor associates with G proteins that stimulate a phosphatidylinositol-calcium second messenger system. Alternate start codons result in two isoforms of the protein.
2014	
2015	
2016	This gene encodes a subfamily member of the G-protein couple receptor (GPCR) superfamily. The encoded protein is a high-affinity receptor for lysophosphatidylcholine (LPC), a major phospholipid component of oxidized low density lipoprotein. This protein may react to LPC levels at sites of inflammation to limit the expansion of tissue-infiltrating cells. A similar protein in mouse is involved in cell cycle progression.
2017	This gene encodes a member of the phosphoenolpyruvate carboxykinase (GTP) family. The protein is a mitochondrial enzyme that catalyzes the conversion of oxaloacetate to phosphoenolpyruvate in the presence of GTP. A cytosolic form encoded by a different gene has also been characterized and is the key enzyme of gluconeogenesis in the liver. The encoded protein may serve a similar function, although it is constitutively expressed and not modulated by hormones such as glucagon and insulin that regulate the cytosolic form. Alternatively spliced transcript variants have been described.
2018	This gene encodes an abundant nuclear protein that binds with high affinity to nascent poly(A) tails. The protein is required for progressive and efficient polymerization of poly(A) tails on the 3@#$%& ends of eukaryotic genes and controls the size of the poly(A) tail to about 250 nt. At steady-state, this protein is localized in the nucleus whereas a different poly(A) binding protein is localized in the cytoplasm. An expansion of the  trinucleotide (GCG) repeat from normal 6 to 8-13 at the 5@#$%& end of the coding region of this gene leads to autosomal dominant oculopharyngeal muscular dystrophy (OPMD) disease. Multiple splice variants have been described but their full-length nature is not known. One splice variant includes introns 1 and 6 but no protein is formed.
2019	This gene encodes a member of the guanine nucleotide-binding protein (G protein)-coupled receptor kinase subfamily of the Ser/Thr protein kinase family. The protein phosphorylates the activated forms of G protein-coupled receptors thus initiating their deactivation. It has also been shown to play a role in regulating the motility of polymorphonuclear leukocytes (PMNs).
2020	
2021	
2022	
2023	
2024	
2025	
2026	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the ATPase subunits, a member of the triple-A family of ATPases which have a chaperone-like activity. Pseudogenes have been identified on chromosomes 8 and 12.
2027	
2028	The FYVE domain mediates the recruitment of proteins involved in membrane trafficking and cell signaling to phosphatidylinositol 3-phosphate (PtdIns(3)P)-containing membranes. This gene encodes a protein which contains two zinc-binding FYVE domains in tandem. This protein displays a predominantly Golgi, endoplasmic reticulum and vesicular distribution. Alternatively spliced transcript variants have been found for this gene, and they encode two isoforms with different sizes.
2029	
2030	The protein encoded by this gene is a nuclear basic leucine zipper protein that belongs to the AP-1/ATF superfamily of transcription factors. The leucine zipper of this protein mediates dimerization with members of the Jun family of proteins. This protein is thought to be a negative regulator of AP-1/ATF transcriptional events.
2031	
2032	
2033	
2034	Machado-Joseph disease is an autosomal dominant neurologic disorder, and is now known to be the same as previously described spinocerebellar ataxia-3. The protein encoded by this gene contains (CAG)n repeats in the coding region, and the expansion of these repeats from the normal 13-36 to 68-79 is the cause of Machado-Joseph disease. There is a negative correlation between the age of onset and CAG repeat numbers. Transcript variants encoding different isoforms (not involving the (CAG)n repeats) and utilizing alternative polyA_sites have been described for this gene.
2035	
2036	
2037	
2038	NOX5 is a novel NADPH oxidase that generates superoxide and functions as a H+ channel in a Ca(2+)-dependent manner.[supplied by OMIM]
2039	Hexosaminidase A is the alpha subunit of the lysosomal enzyme beta-hexosaminidase that, together with the cofactor GM2 activator protein, catalyzes the degradation of the ganglioside GM2, and other molecules containing terminal N-acetyl hexosamines. Beta-hexosaminidase is composed of two subunits, alpha and beta, which are encoded by separate genes.  Both beta-hexosaminidase alpha and beta subunits are members of family 20 of glycosyl hydrolases.  Mutations in the alpha or beta subunit genes lead to an accumulation of GM2 ganglioside in neurons and neurodegenerative disorders termed the GM2 gangliosidoses.  Alpha subunit gene mutations lead to Tay-Sachs disease (GM2-gangliosidosis type I).
2040	
2041	
2042	This gene encodes a protein with sequence similarity to a mouse protein originally identified in embryonic stem cells. In mouse T-cell lines, this protein appears to be related to glucocorticoid- and staurine-induced apoptotic pathways, and to be linked to ceramide-mediated signalling. These observations suggest that this gene product is involved in specific apoptotic processes in T-cells.
2043	
2044	OCA2 encodes the human homologue of the mouse p (pink-eyed dilution) gene. The P protein is believed to be an integral membrane protein involved in small molecule transport, specifically tyrosine - a precursor of melanin. Mutations in OCA2 result in type 2 oculocutaneous albinism.
2045	
2046	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the mitochondrial inner membrane and catalyzes the conversion of cholesterol to pregnenolone, the first and rate-limiting step in the synthesis of the steroid hormones.
2047	
2048	
2049	This gene product is highly similar to Schizosaccharomyces pombe rad9, a cell cycle checkpoint protein required for cell cycle arrest and DNA damage repair in response to DNA damage. This protein is found to possess 3@#$%& to 5@#$%& exonuclease activity, which may contribute to its role in sensing and repairing DNA damage. It forms a checkpoint protein complex with RAD1 and HUS1. This complex is recruited by checkpoint protein RAD17 to the sites of DNA damage, which is thought to be important for triggering the checkpoint-signaling cascade. Use of alternative polyA sites has been noted for this gene.
2050	Isocitrate dehydrogenases catalyze the oxidative decarboxylation of isocitrate to 2-oxoglutarate. These enzymes belong to two distinct subclasses, one of which utilizes NAD(+) as the electron acceptor and the other NADP(+). Five isocitrate dehydrogenases have been reported: three NAD(+)-dependent isocitrate dehydrogenases, which localize to the mitochondrial matrix, and two NADP(+)-dependent isocitrate dehydrogenases, one of which is mitochondrial and the other predominantly cytosolic. Each NADP(+)-dependent isozyme is a homodimer. The protein encoded by this gene is the NADP(+)-dependent isocitrate dehydrogenase found in the mitochondria. It plays a role in intermediary metabolism and energy production. This protein may tightly associate or interact with the pyruvate dehydrogenase complex.
2051	This gene encodes the last enzyme in the tyrosine catabolism pathway. FAH deficiency is associated with Type 1 hereditary tyrosinemia (HT).
2052	
2053	
2054	
2055	
2056	
2057	
2058	
2059	Isovaleryl-CoA dehydrogenase (IVD) is a mitochondrial matrix enzyme that catalyzes the third step in leucine catabolism. The genetic deficiency of IVD results in an accumulation of isovaleric acid, which is toxic to the central nervous system and leads to isovaleric acidemia.
2060	Isovaleryl-CoA dehydrogenase (IVD) is a mitochondrial matrix enzyme that catalyzes the third step in leucine catabolism. The genetic deficiency of IVD results in an accumulation of isovaleric acid, which is toxic to the central nervous system and leads to isovaleric acidemia.
2061	
2062	
2063	
2064	
2065	
2066	
2067	
2068	
2069	
2070	
2071	
2072	
2073	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
2074	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
2075	This gene is a classical cadherin from the cadherin superfamily and is located in a six-cadherin cluster in a region on the long arm of chromosome 16 that is involved in loss of heterozygosity events in breast and prostate cancer. The encoded protein is a calcium-dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. Functioning as a classic cadherin by imparting to cells the ability to adhere in a homophilic manner, the protein may play an important role in endothelial cell biology through control of the cohesion and organization of the intercellular junctions.
2076	
2077	
2078	This gene encodes the alpha chain of the interleukin-4 receptor, a type I transmembrane protein that can bind interleukin 4 and interleukin 13 to regulate IgE production. The encoded protein also can bind interleukin 4 to promote differentiation of Th2 cells. A soluble form of the encoded protein can be produced by an alternate splice variant or by proteolysis of the membrane-bound protein, and this soluble form can inhibit IL4-mediated cell proliferation and IL5 upregulation by T-cells. Allelic variations in this gene have been associated with atopy, a condition that can manifest itself as allergic rhinitis, sinusitus, asthma, or eczema. Two transcript variants encoding different isoforms, a membrane-bound and a soluble form, have been found for this gene.
2079	This gene encodes a protein of unknown function.
2080	The protein encoded by this gene belongs to the tigger subfamily of the pogo superfamily of DNA-mediated transposons in humans. These proteins are related to DNA transposons found in fungi and nematodes, and more distantly to the Tc1 and mariner transposases. They are also very similar to the major mammalian centromere protein B. The exact function of this gene is not known.
2081	This gene encodes a cell-surface anchored serine protease, which is a member of the trypsin family of serine proteases. It is predicted to be active on peptide linkages involving the carboxyl group of lysine or arginine. The protein localizes to the cytoplasm and the plasma membrane of premeiotic testicular germ cells and it may be involved in progression of testicular tumors of germ cell origin. Alternative splicing of this gene results in three transcript variants encoding three different isoforms.
2082	
2083	The protein encoded by this gene is one subunit of a cleavage factor required for 3@#$%& RNA cleavage and polyadenylation processing. The interaction of the protein with the RNA is one of the earliest steps in the assembly of the 3@#$%& end processing complex and facilitates the recruitement of other processing factors. This gene encodes the 25kD subunit of the protein complex, which is composed of four polypeptides.
2084	
2085	FUT4 catalyzes the synthesis of the non-sialylated antigen, Lewis x (CD15). It has been detected in human embryos (5-10 weeks) suggesting a role in development.
2086	
2087	
2088	
2089	
2090	
2091	
2092	
2093	
2094	
2095	This gene encodes a transcription factor that binds to the sterol regulatory element-1 (SRE1), which is a decamer flanking the low density lipoprotein receptor gene and some genes involved in sterol biosynthesis. The protein is synthesized as a precursor that is attached to the nuclear membrane and endoplasmic reticulum. Following cleavage, the mature protein translocates to the nucleus and activates transcription by binding to the SRE1. Sterols inhibit the cleavage of the precursor, and the mature nuclear form is rapidly catabolized, thereby reducing transcription. The protein is a member of the basic helix-loop-helix-leucine zipper (bHLH-Zip) transcription factor family. This gene is located within the Smith-Magenis syndrome region on chromosome 17. Two transcript variants encoding different isoforms have been found for this gene.
2096	This gene encodes an inwardly rectifying K+ channel which may be blocked by divalent cations. This protein is thought to be one of multiple inwardly rectifying channels which contribute to the cardiac inward rectifier current (IK1). The gene is located within the Smith-Magenis syndrome region on chromosome 17.
2097	
2098	Midbrain dopaminergic neurons play a critical role in multiple brain functions, and abnormal signaling through dopaminergic pathways has been implicated in several major neurologic and psychiatric disorders. One well-studied target for the actions of dopamine is DARPP32. In the densely dopamine- and glutamate-innervated rat caudate-putamen, DARPP32 is expressed in medium-sized spiny neurons (Ouimet and Greengard, 1990) that also express dopamine D1 receptors (Walaas and Greengard, 1984). The function of DARPP32 seems to be regulated by receptor stimulation. Both dopaminergic and glutamatergic (NMDA) receptor stimulation regulate the extent of DARPP32 phosphorylation, but in opposite directions (Halpain et al., 1990). Dopamine D1 receptor stimulation enhances cAMP formation, resulting in the phosphorylation of DARPP32 (Walaas and Greengard, 1984); phosphorylated DARPP32 is a potent protein phosphatase-1 (see MIM 176875) inhibitor (Hemmings et al., 1984). NMDA receptor stimulation elevates intracellular calcium, which leads to activation of calcineurin and dephosphorylation of phospho-DARPP32, thereby reducing the phosphatase-1 inhibitory activity of DARPP32 (Halpain et al., 1990).[supplied by OMIM]
2099	
2100	This gene encodes a member of the peroxiredoxin family of antioxidant enzymes, which reduce hydrogen peroxide and alkyl hydroperoxides. The encoded protein may play an antioxidant protective role in different tissues under normal conditions and during inflammatory processes. This protein interacts with peroxisome receptor 1. The crystal structure of this protein in its reduced form has been resolved to 1.5 angstrom resolution. This gene uses alternate in-frame translation initiation sites to generate mitochondrial or peroxisomal/cytoplasmic forms. Three transcript variants encoding distinct isoforms have been identified for this gene.
2101	Aldehyde dehydrogenase isozymes are thought to play a major role in the detoxification of aldehydes generated by alcohol metabolism and lipid peroxidation. This gene product catalyzes the oxidation of long-chain aliphatic aldehydes to fatty acid. Mutations in the gene cause Sjogren-Larsson syndrome.
2102	
2103	
2104	
2105	
2106	The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and differentiation. Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. This gene maps in a region that contains the BRCA1 locus which confers susceptibility to breast and ovarian cancer. Although DUSP3 is expressed in both breast and ovarian tissues, mutation screening in breast cancer pedigrees and in sporadic tumors was negative, leading to the conclusion that this gene is not BRCA1.
2107	
2108	This gene encodes a hypothalamic neuropeptide precusor protein that gives rise to two mature neuropeptides, orexin A and orexin B, by proteolytic processing. Orexin A and orexin B, which bind to orphan G-protein coupled receptors HCRTR1 and HCRTR2, function in the regulation of sleep and arousal. This neuropeptide arrangement may also play a role in feeding behavior, metabolism, and homeostasis.
2109	
2110	
2111	
2112	Prohibitin is an evolutionarily conserved gene that is ubiquitously expressed.  It is thought to be a negative regulator of cell proliferation and may be a tumor suppressor.  Mutations in PHB have been linked to sporadic breast cancer.  Prohibitin is expressed as two transcripts with varying lengths of 3@#$%& untranslated region.  The longer transcript is present at higher levels in proliferating tissues and cells, suggesting that this longer 3@#$%& untranslated region may function as a trans-acting regulatory RNA.
2113	
2114	
2115	
2116	
2117	
2118	
2119	
2120	This gene encodes one of the three enolase isoenzymes found in mammals. This isoenzyme, a homodimer, is found in skeletal muscle cells in the adult. A switch from alpha enolase to beta enolase occurs in muscle tissue during development in rodents. Mutations in this gene can be associated with metabolic myopathies that may result from decreased stability of the enzyme. Two transcripts have been identified for this gene that differ only in their 5@#$%& UTR.
2121	
2122	TBKBP1 is an adaptor protein that binds to TBK1 (MIM 604834) and is part of the interaction network in the TNF (MIM 191160)/NFKB (see MIM 164011) pathway (Bouwmeester et al., 2004 [PubMed 14743216]).[supplied by OMIM]
2123	Acidic fibroblast growth factor is mitogenic for a variety of different cell types and acts by stimulating mitogenesis or inducing morphological changes and differentiation. The FIBP protein is an intracellular protein that binds selectively to acidic fibroblast growth factor (aFGF). It is postulated that FIBP may be involved in the mitogenic action of aFGF. Two transcript variants encoding different isoforms have been found for this gene.
2124	Chromosomal segregation during mitosis as well as meiosis is regulated by kinases and phosphatases. The Aurora kinases associate with microtubules during chromosome movement and segregation. Aurora kinase B localizes to microtubules near kinetochores, specifically to the specialized microtubules called K-fibers, and Aurora kinase A (MIM 603072) localizes to centrosomes (Lampson et al., 2004).[supplied by OMIM]
2125	
2126	This gene encodes an enzyme that catalyzes the NAD(P)H-dependent conversion of pyrroline-5-carboxylate to proline. This enzyme may also play a physiologic role in the generation of NADP(+) in some cell types. The protein forms a homopolymer and localizes to the mitochondrion. Alternate splicing results in two transcript variants encoding different isoforms.
2127	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the high sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
2128	
2129	
2130	
2131	The protein encoded by this gene, a member of the peptidase C1 family, is a cysteine proteinase that may have a specific function in the mechanism or regulation of T-cell cytolytic activity. The encoded protein is found associated with the membrane inside the endoplasmic reticulum of natural killer and cytotoxic T-cells. Expression of this gene is up-regulated by interleukin-2.
2132	
2133	
2134	
2135	
2136	
2137	
2138	
2139	
2140	
2141	
2142	ROCK1 is a protein serine/threonine kinase that is activated when bound to the GTP-bound form of Rho. The small GTPase Rho regulates formation of focal adhesions and stress fibers of fibroblasts, as well as adhesion and aggregation of platelets and lymphocytes by shuttling between the inactive GDP-bound form and the active GTP-bound form. Rho is also essential in cytokinesis and plays a role in transcriptional activation by serum response factor. ROCK, a downstream effector of Rho, phosphorylates and activates LIM kinase, which in turn, phosphorylates cofilin, inhibiting its actin-depolymerizing activity.
2143	
2144	
2145	
2146	
2147	
2148	
2149	The protein encoded by this gene is highly similar to Cdc 37, a cell division cycle control protein of Sacchromyces cerevisiae. This protein is a molecular chaperone with specific function in cell signal transduction. It has been shown to form complex with Hsp90 and a variety of protein kinases including CDK4, CDK6, SRC, RAF-1, MOK, as well as eIF2 alpha kinases. It is thought to play a critical role in directing Hsp90 to its target kinases.
2150	The protein encoded by this gene is a member of the intercellular adhesion molecule (ICAM) family. All ICAM proteins are type I transmembrane glycoproteins, contain 2-9 immunoglobulin-like C2-type domains, and bind to the leukocyte adhesion LFA-1 protein. Glutamine->Arginine polymorphism in this protein is responsible for the Landsteiner-Wiener blood group system (GLN=WB(A); ARG=WB(B). This gene consists of 3 exons and alternative splicing generates 2 transcript variants.
2151	
2152	
2153	
2154	This gene product shares a high degree of sequence similarity with DNA-dependent protein kinase catalytic subunit-interacting protein 2 in human and mouse, and like them may bind the catalytic subunit of DNA-dependent protein kinases. The exact function of this gene is not known.
2155	
2156	
2157	tRNA splicing is a fundamental process required for cell growth and division. SEN34 is a subunit of the tRNA splicing endonuclease, which catalyzes the removal of introns, the first step in tRNA splicing (Paushkin et al., 2004).[supplied by OMIM]
2158	
2159	
2160	
2161	
2162	
2163	
2164	
2165	
2166	
2167	NTT5 shows structural characteristics of an Na(+)- and Cl(-)-dependent neurotransmitter transporter, including 12 transmembrane (TM) domains, intracellular N and C termini, and large extracellular loops containing multiple N-glycosylation sites.[supplied by OMIM]
2168	The dickkopf protein family interacts with the Wnt signaling pathway and its members are characterized by two conserved cysteine-rich domains. This gene encodes a secreted protein that has high similarity to the N-terminus of the dickkopf-3 protein and moderate similarity to that protein@#$%&s C-terminus though the C-terminal cysteine residues are not conserved.
2169	DBP is a member of the PAR bZIP (proline and acidic amino acid-rich basic leucine zipper) transcription factor family (Khatib et al., 1994).[supplied by OMIM]
2170	DBP is a member of the PAR bZIP (proline and acidic amino acid-rich basic leucine zipper) transcription factor family (Khatib et al., 1994).[supplied by OMIM]
2171	Potassium channels are present in most mammalian cells, where they participate in a wide range of physiologic responses. The protein encoded by this gene is an integral membrane protein and inward-rectifier type potassium channel, and probably has a role in controlling the excitability of motor neurons. Two transcript variants encoding the same protein have been found for this gene.
2172	
2173	
2174	
2175	BARX2 is a member of the homeobox gene family which are regulators of place-dependent morphogenesis and play important roles in controlling the expression patterns of cell adhesion molecules. The homeodomain encoded by Barx2 is 87% identical to that of Barx1, and both genes are related to genes at the Bar locus of Drosophila melanogaster. Barx1 and Barx2 also encode an identical stretch of 17 residues downstream of the homeobox. It is suggested that Barx2 may differentially control the expression of L1 and other target genes during embryonic development.
2176	Human natural killer (NK) cells and activated lymphocytes express and store a distinct subset of neutral serine proteases together with proteoglycans and other immune effector molecules in large cytoplasmic granules. These serine proteases are collectively termed granzymes and include 4 distinct gene products: granzyme A, granzyme B, granzyme H, and Met-ase, also known as granzyme M.
2177	
2178	
2179	
2180	
2181	
2182	
2183	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. The enzyme encoded by this gene degrades proteoglycans and fibronectin. The gene is part of a cluster of MMP genes which localize to chromosome 11q22.3.
2184	
2185	Elastases form a subfamily of serine proteases that hydrolyze many proteins in addition to elastin. Humans have six elastase genes which encode the structurally similar proteins elastase 1, 2, 2A, 2B, 3A, and 3B. Elastase 2 hydrolyzes proteins within specialized neutrophil lysosomes, called azurophil granules, as well as proteins of the extracellular matrix following the protein@#$%&s release from activated neutrophils. Elastase 2 may play a role in degenerative and inflammatory diseases by its proteolysis of collagen-IV and elastin of the extracellular matrix. This protein degrades the outer membrane protein A (OmpA) of E. coli as well as the virulence factors of such bacteria as Shigella, Salmonella and Yersinia. Mutations in this gene are associated with cyclic neutropenia and severe congenital neutropenia (SCN). This gene is clustered with other serine protease gene family members, azurocidin 1 and proteinase 3 genes, at chromosome 19pter. All 3 genes are expressed coordinately and their protein products are packaged together into azurophil granules during neutrophil differentiation.
2186	
2187	
2188	This reference sequence was derived from AF161462.1 and ESTs; validated by multiple replicate ESTs and human genomic sequence. This gene encodes a member of a family of small membrane proteins that share a 35-amino acid signature sequence domain, beginning with the sequence PFXYD and containing 7 invariant and 6 highly conserved amino acids. The approved human gene nomenclature for the family is FXYD-domain containing ion transport regulator. Mouse FXYD5 has been termed RIC (Related to Ion Channel). FXYD2, also known as the gamma subunit of the Na,K-ATPase, regulates the properties of that enzyme. FXYD1 (phospholemman), FXYD2 (gamma), FXYD3 (MAT-8), FXYD4 (CHIF), and FXYD5 (RIC) have been shown to induce channel activity in experimental expression systems. Transmembrane topology has been established for two family members (FXYD1 and FXYD2), with the N-terminus extracellular and the C-terminus on the cytoplasmic side of the membrane. This gene product, FXYD5, has not been characterized as a protein. Two transcript variants have been found for this gene, and they are both predicted to encode the same protein. [RefSeq curation by Kathleen J. Sweadner, Ph.D., sweadner@helix.mgh.harvard.edu.]
2189	Caspases are cysteine proteases that cleave C-terminal aspartic acid residues on their substrate molecules. CASP12 is most highly related to members of the ICE subfamily (see CASP1; MIM 147678) of caspases that process inflammatory cytokines. In rodents, Casp12 mediates apoptosis in response to endoplasmic reticulum stress. Human CASP12, however, appears to be nonfunctional (Fischer et al., 2002).[supplied by OMIM]
2190	
2191	
2192	
2193	This gene, CYP4F8, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and functions as a 19-hydroxylase of prostaglandins in seminal vesicles. This gene is part of a cluster of cytochrome P450 genes on chromosome 19. Another member of this family, CYP4F3, is approximately 18 kb away.
2194	This gene is specifically expressed in the nucleus of haploid male germ cells. It encodes a protein that may be involved in the regulation of germ cell nuclear functions. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
2195	
2196	This gene encodes a mitochondrial DNA-directed RNA polymerase. The gene product is responsible for mitochondrial gene expression as well as for providing RNA primers for initiation of replication of the mitochondrial genome. Although this polypeptide has the same function as the three nuclear DNA-directed RNA polymerases, it is more closely related to RNA polymerases of phage and mitochondrial polymerases of lower eukaryotes.
2197	
2198	
2199	
2200	The origin recognition complex (ORC) is a highly conserved six subunit protein complex essential for the initiation of the DNA replication in eukaryotic cells. Studies in yeast demonstrated that ORC binds specifically to origins of replication and serves as a platform for the assembly of additional initiation factors such as Cdc6 and Mcm proteins. The protein encoded by this gene is a subunit of the ORC complex. It has been shown to form a core complex with ORC2L, -3L, and -5L. Three alternatively spliced transcript variants encoding the same protein have been reported.
2201	
2202	
2203	
2204	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Gamma-crystallins are a homogeneous group of highly symmetrical, monomeric proteins typically lacking connecting peptides and terminal extensions. They are differentially regulated after early development. Four gamma-crystallin genes (gamma-A through gamma-D) and three pseudogenes (gamma-E, gamma-F, gamma-G) are tandemly organized in a genomic segment as a gene cluster. Whether due to aging or mutations in specific genes, gamma-crystallins have been involved in cataract formation.
2205	
2206	
2207	
2208	
2209	
2210	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This mitochondrial protein oxidizes cholesterol intermediates as part of the bile synthesis pathway. Since the conversion of cholesterol to bile acids is the major route for removing cholesterol from the body, this protein is important for overall cholesterol homeostasis. Mutations in this gene cause cerebrotendinous xanthomatosis, a rare autosomal recessive lipid storage disease.
2211	
2212	The protein encoded by this gene is a member of the SWI/SNF family of proteins. Members of this family have helicase and ATPase activities and are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein shows sequence similarity to the E. coli RNA polymerase-binding protein HepA. Mutations in this gene are a cause of Schimke immunoosseous dysplasia (SIOD), an autosomal recessive disorder with the diagnostic features of spondyloepiphyseal dysplasia, renal dysfunction, and T-cell immunodeficiency.
2213	
2214	The protein encoded by this gene belongs to the complex I 75 kDa subunit family. Mammalian complex I is composed of 45 different subunits. It locates at the mitochondrial inner membrane. This protein has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone. This protein is the largest subunit of complex I and it is a component of the iron-sulfur (IP) fragment of the enzyme. It may form part of the active site crevice where NADH is oxidized. Mutations in this gene are associated with complex I deficiency.
2215	The protein encoded by this gene belongs to the complex I 75 kDa subunit family. Mammalian complex I is composed of 45 different subunits. It locates at the mitochondrial inner membrane. This protein has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone. This protein is the largest subunit of complex I and it is a component of the iron-sulfur (IP) fragment of the enzyme. It may form part of the active site crevice where NADH is oxidized. Mutations in this gene are associated with complex I deficiency.
2216	
2217	
2218	
2219	
2220	High density lipoprotein-binding protein, also known as vigilin, is a 110-kD protein that specifically binds HDL molecules and may function in the removal of excess cellular cholesterol.[supplied by OMIM]
2221	VAX2 gene was named based on its high seqeunce similarity to VAX1 gene and is almost exclusively expressed in the ventral portion of the retina during development. It may play an important role in morphogenetic processes underlying the correct development of the eye, particularly in the establishment of a correct dorsoventral pattern.
2222	
2223	
2224	
2225	
2226	TTL is a cytosolic enzyme involved in the posttranslational modification of alpha-tubulin (see MIM 602529). Alpha-tubulin within assembled microtubules is detyrosinated over time at the C terminus. After microtubule disassembly, TTL restores the tyrosine residues and consequently participates in a cycle of tubulin detyrosination and tyrosination.[supplied by OMIM]
2227	
2228	
2229	This intronless gene encodes a member of the G protein-coupled receptor (GPCR) family. Members of this protein family contain seven putative transmembrane domains and may mediate signaling processes to the interior of the cell via activation of heterotrimeric G proteins. This protein may function in the central nervous system.
2230	The protein encoded by this protein contains a RING finger, a motif known to be involved in protein-DNA and protein-protein interactions. The mouse counterpart of this protein has been shown to interact with Ube2l3/UbcM4, which is an ubiquitin-conjugating enzyme involved in embryonic development.
2231	The protein encoded by this gene is a member of the TGF-beta family of growth factors. Data suggest that the similar protein in mouse plays an important role in trabeculation of the embryonic heart. In human, this protein may signal through receptor serine/threonine kinases.
2232	
2233	
2234	
2235	
2236	This gene encodes a protein that is involved in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is a glycolipid which contains three mannose molecules in its core backbone. The GPI-anchor is found on many blood cells and serves to anchor proteins to the cell surface. This protein and another GPI synthesis protein, PIGO, function in the transfer of ethanolaminephosphate (EtNP) to the third mannose in GPI. At least two alternatively spliced transcripts encoding distinct isoforms have been found for this gene.
2237	RAN is a small GTP-binding protein of the RAS superfamily that is associated with the nuclear membrane and is thought to control a variety of cellular functions through its interactions with other proteins. This gene encodes a very large RAN-binding protein that immunolocalizes to the nuclear pore complex. The protein is a giant scaffold and mosaic cyclophilin-related nucleoporin implicated in the Ran-GTPase cycle. The encoded protein directly interacts with the E2 enzyme UBC9 and strongly enhances SUMO1 transfer from UBC9 to the SUMO1 target SP100. These findings place sumoylation at the cytoplasmic filaments of the nuclear pore complex and suggest that, for some substrates, modification and nuclear import are linked events. This gene is partially duplicated in a gene cluster that lies in a hot spot for recombination on chromosome 2q.
2238	This gene encodes a member of the G-protein coupled receptor family. Although this protein was earlier thought to be a receptor for vasoactive intestinal peptide (VIP), it is now considered to be an orphan receptor, in that its endogenous ligand has not been identified. The protein is also a coreceptor for human immunodeficiency viruses (HIV). Translocations involving this gene and HMGA2 on chromosome 12 have been observed in lipomas.
2239	The protein encoded by this gene is a member of the RP/EB family of genes.  The protein localizes to the cytoplasmic microtubule network and binds APCL, a homolog of the adenomatous polyposis coli tumor suppressor gene.
2240	
2241	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 99% identical to the mouse and rat orthologs. This gene is upregulated in patients with amyotrophic lateral sclerosis. It contains in its 5@#$%& UTR a 6 bp tandem repeat sequence which is polymorphic, however, there is no correlation between the repeat number and the disease.
2242	This gene encodes for a protein that in yeast forms part of a trimeric vacuolar-protein-sorting complex that is required for retrograde transport of proteins from prevacuoles to the late Golgi compartment. As in yeast, mammalian Vps54 proteins contain a coiled-coil region and dileucine motifs. Alternative splicing results in multiple transcript variants encoding different isoforms
2243	
2244	
2245	MIZF interacts with methyl-CpG-binding protein-2 (MBD2; MIM 603547), a component of the MeCP1 histone deacetylase (HDAC) complex, and plays a role in DNA methylation and transcription repression.[supplied by OMIM]
2246	
2247	
2248	The bactericidal permeability increasing protein is a lipopolysaccharide binding protein. It is associated with human neutrophil granules and has bactericidal activity on gram-negative organisms.
2249	The bactericidal permeability increasing protein is a lipopolysaccharide binding protein. It is associated with human neutrophil granules and has bactericidal activity on gram-negative organisms.
2250	This gene encodes a serine/threonine protein kinase. Although this gene product is similar to serum- and glucocorticoid-induced protein kinase (SGK), this gene is not  induced by serum or glucocorticoids. This gene is induced in response to signals that activate phosphatidylinositol 3-kinase, which is also true for SGK. Two alternate transcripts encoding two different isoforms have been described.
2251	
2252	Troponin (Tn), a key protein complex in the regulation of striated muscle contraction, is composed of 3 subunits. The Tn-I subunit inhibits actomyosin ATPase, the Tn-T subunit binds tropomyosin and Tn-C, while the Tn-C subunit binds calcium and overcomes the inhibitory action of the troponin complex on actin filaments. The protein encoded by this gene is the Tn-C subunit.
2253	This gene encodes one of two glycosyltransferases involved in the chain elongation step of heparan sulfate biosynthesis. Mutations in this gene cause the type II form of multiple exostoses.
2254	
2255	The Drosophila embryonic protein snail is a zinc finger transcriptional repressor which downregulates the expression of ectodermal genes within the mesoderm. The nuclear protein encoded by this gene is structurally similar to the Drosophila snail protein, and is also thought to be critical for mesoderm formation in the developing embryo. At least two variants of a similar processed pseudogene have been found on chromosome 2.
2256	
2257	
2258	
2259	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds the immunosuppressants FK506 and rapamycin. It interacts with several intracellular signal transduction proteins including type I TGF-beta receptor. It also interacts with multiple intracellular calcium release channels including the tetrameric skeletal muscle ryanodine receptor. In mouse, deletion of this homologous gene causes congenital heart disorder known as noncompaction of left ventricular myocardium. There is evidence of multiple alternatively spliced transcript variants for this gene, but the full length nature of some variants has not been determined.
2260	
2261	
2262	
2263	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein binds to fanconi anemia complementation group A protein, but its function is unknown. This gene results in two transcript variants encoding the same protein.
2264	The crooked neck (crn) gene of Drosophila is essential for embryogenesis and is thought to be involved in cell cycle progression and pre-mRNA splicing. This gene is similar in sequence to crn and encodes a protein which can localize to pre-mRNA splicing complexes in the nucleus. The encoded protein, which contains many tetratricopeptide repeats, is required for pre-mRNA splicing.
2265	The protein encoded by this gene contains a paired-like homeodomain and binds to the core of the locus control region of the red/green visual pigment gene cluster. The encoded protein may regulate expression of the cone opsin genes early in development. Mutations in this gene can cause posterior polymorphous corneal dystrophy (PPCD) and keratoconus. Two transcript variants encoding different isoforms have been found for this gene.
2266	
2267	
2268	Atrophin-1 contains a polyglutamine repeat, expansion of which is responsible for dentatorubral and pallidoluysian atrophy. The protein encoded by this gene interacts with atrophin-1. This encoded protein is a closely related member of the NEDD4-like protein family. This family of proteins are E3 ubiquitin-ligase molecules and regulate key trafficking decisions, including targeting of proteins to proteosomes or lysosomes. This encoded protein contains four tandem WW domains and a HECT (homologous to the E6-associated protein carboxyl terminus) domain. It can act as a transcriptional corepressor of p45/NFE2 and may participate in the regulation of immune responses by modifying Notch-mediated signaling. It is highly similar to the mouse Itch protein, which has been implicated in the regulation and differentiation of erythroid and lymphoid cells.
2269	
2270	The mammalian sperm flagellum contains two cytoskeletal structures associated with the axoneme: the outer dense fibers surrounding the axoneme in the midpiece and principal piece and the fibrous sheath surrounding the outer dense fibers in the principal piece of the tail. Defects in these structures are associated with abnormal tail morphology, reduced sperm motility, and infertility. In the rat, the protein encoded by this gene associates with an outer dense fiber protein via a leucine zipper motif and localizes to the microtubules of the manchette and axoneme during sperm tail development.
2271	
2272	
2273	
2274	
2275	
2276	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Alpha crystallins are composed of two gene products: alpha-A and alpha-B, for acidic and basic, respectively. Alpha crystallins can be induced by heat shock and are members of the small heat shock protein (sHSP also known as the HSP20) family. They act as molecular chaperones although they do not renature proteins and release them in the fashion of a true chaperone; instead they hold them in large soluble aggregates. Post-translational modifications decrease the ability to chaperone. These heterogeneous aggregates consist of 30-40 subunits; the alpha-A and alpha-B subunits have a 3:1 ratio, respectively. Two additional functions of alpha crystallins are an autokinase activity and participation in the intracellular architecture. Alpha-A and alpha-B gene products are differentially expressed; alpha-A is preferentially restricted to the lens and alpha-B is expressed widely in many tissues and organs. Defects in this gene cause autosomal dominant congenital cataract (ADCC).
2277	USP18, a member of the deubiquitinating protease family of enzymes, removes ubiquitin adducts from a broad range of protein substrates.[supplied by OMIM]
2278	This gene encodes the receptor for reticulon 4, oligodendrocyte myelin glycoprotein and myelin-associated glycoprotein. This receptor mediates axonal growth inhibition and may play a role in regulating axonal regeneration and plasticity in the adult central nervous system.
2279	The protein encoded by this gene is similar to  SCARF1/SREC-I, a scavenger receptor protein that mediates the binding and degradation of acetylated low density lipoprotein (Ac-LDL). This protein has only little activity of internalizing modified low density lipoproteins (LDL), but it can interact with SCARF1 through its extracellular domain. The association of this protein with SCARF1 is suppressed by the presence of scavenger ligands. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
2280	
2281	Human gamma-glutamyltransferase catalyzes the transfer of the glutamyl moiety of glutathione to a variety of amino acids and dipeptide acceptors. This heteroduplex enzyme is composed of a heavy chain and a light chain, which are derived from a single precursor protein, and is present in tissues involved in absorption and secretion. This enzyme is a member of the gamma-glutamyltransferase protein family, of which many members have not yet been fully characterized. This gene encodes several transcript variants; studies suggest that many transcripts of this gene family may be non-functional or represent pseudogenes. The functional transcripts which have been fully characterized have been grouped and classified as type I gamma-glutamyltransferase. Complex splicing events may take place in a tissue-specific manner, resulting in marked dissimilarity in the 5@#$%&UTRs. Several 5@#$%&UTR transcript variants of the type I gene have been identified in different tissues and cancer cells.
2282	Reticulocalbin 1 is a calcium-binding protein located in the lumen of the ER. The protein contains six conserved regions with similarity to a high affinity Ca(+2)-binding motif, the EF-hand. High conservation of amino acid residues outside of these motifs, in comparison to mouse reticulocalbin, is consistent with a possible biochemical function besides that of calcium binding.
2283	The protein encoded by this gene catalyzes the O-sulfation of tyrosine residues within acidic regions of proteins. The encoded protein is a type II integral membrane protein found in the Golgi body. Two transcript variants encoding the same protein have been found for this gene.
2284	
2285	
2286	This gene encodes a protein that is abnormally elevated in malignant tumors of astrocytic origin. It is a strongly conserved gene containing a BRCT domain that is essential for the activity of this gene product. The gene plays a crucial role in cell proliferation and may be necessary for oncogenic transformation and tumor progression.
2287	The protein encoded by this gene is a nucleocytoplasmic shuttle protein for the translation initiation factor eIF4E. This shuttle protein interacts with the importin alpha-beta complex to mediate nuclear import of eIF4E. It is predominantly cytoplasmic;its own nuclear import is regulated by a nuclear localization signal and nuclear export signals.
2288	The protein encoded by this gene is structurally very similar to the CDC46 protein from S. cerevisiae, a protein involved in the initiation of DNA replication. The encoded protein is a member of the MCM family of chromatin-binding proteins and can interact with at least two other members of this family. The encoded protein is upregulated in the transition from the G0 to G1/S phase of the cell cycle and may actively participate in cell cycle regulation.
2289	This gene encodes a protein which can function as a monomer or as a disulfide-linked homodimer and which catalyzes the transfer of a sulfur ion from 3-mercaptopyruvate to cyanide or other thiol compounds. It may be involved in cyanide degradation and in thiosulfate biosynthesis. The encoded cytoplasmic protein is a member of the rhodanese family but is not rhodanese itself, which is a mitochondrial protein. Alternatively spliced transcript variants encoding the same protein have been identified.
2290	This gene encodes a member of the Golgi-localized, gamma adaptin ear-containing, ARF-binding (GGA) protein family. Members of this family are ubiquitous coat proteins that regulate the trafficking of proteins between the trans-Golgi network and the lysosome. These proteins share an amino-terminal VHS domain which mediates sorting of the mannose 6-phosphate receptors at the trans-Golgi network. They also contain a carboxy-terminal region with homology to the ear domain of gamma-adaptins. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
2291	
2292	This gene is a member of the cytidine deaminase gene family. It is one of seven related genes or pseudogenes found in a cluster, thought to result from gene duplication, on chromosome 22. Members of the cluster encode proteins that are structurally and functionally related to the C to U RNA-editing cytidine deaminase APOBEC1. It is thought that the proteins may be RNA editing enzymes and have roles in growth or cell cycle control.
2293	
2294	
2295	The protein encoded by this gene is found exclusively in the mitochondrion, where it catalyzes the transfer of a malonyl group from malonyl-CoA to the mitochondrial acyl carrier protein. The encoded protein may be part of a fatty acid synthase complex that is more like the type II prokaryotic and plastid complexes rather than the type I human cytosolic complex. Two transcript variants encoding different isoforms have been found for this gene.
2296	
2297	The protein encoded by this gene is a member of the flamingo subfamily, part of the cadherin superfamily. The flamingo subfamily consists of nonclassic-type cadherins; a subpopulation that does not interact with catenins. The flamingo cadherins are located at the plasma membrane and have nine cadherin domains, seven epidermal growth factor-like repeats and two laminin A G-type repeats in their ectodomain. They also have seven transmembrane domains, a characteristic unique to this subfamily. It is postulated that these proteins are receptors involved in contact-mediated communication, with cadherin domains acting as homophilic binding regions and the EGF-like domains involved in cell adhesion and receptor-ligand interactions. This particular member is a developmentally regulated, neural-specific gene which plays an unspecified role in early embryogenesis.
2298	Mammalian cytochrome c oxidase (COX) catalyzes the transfer of reducing equivalents from cytochrome c to molecular oxygen and pumps protons across the inner mitochondrial membrane. In yeast, 2 related COX assembly genes, SCO1 and SCO2 (synthesis of cytochrome c oxidase), enable subunits 1 and 2 to be incorporated into the holoprotein. This gene is the human homolog of the yeast SCO2 gene.
2299	
2300	
2301	
2302	
2303	
2304	
2305	
2306	
2307	Fibroblast growth factors (FGFs; see MIM 603726) are secreted proteins involved in cellular proliferation, migration, differentiation, and survival. FGF activity is negatively regulated by members of the @#$%&sprouty@#$%& family (e.g., SPRY1, MIM 602465). The SEF protein is a modulator of FGF signaling.[supplied by OMIM]
2308	
2309	
2310	
2311	
2312	
2313	Von Hippel-Lindau syndrome (VHL) is a dominantly inherited familial cancer syndrome predisposing to a variety of malignant and benign tumors. A germline mutation of this gene is the basis of familial inheritance of VHL syndrome. The protein encoded by this gene is a component of the protein complex that includes elongin B, elongin C, and cullin-2, and possesses ubiquitin ligase E3 activity. This protein is involved in the ubiquitination and degradation of hypoxia-inducible-factor (HIF), which is a transcription factor that plays a central role in the regulation of gene expression by oxygen. RNA polymerase II subunit POLR2G/RPB7 is also reported to be a target of this protein. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
2314	The protein encoded by this gene belongs to the G-protein coupled receptor family and acts as a receptor for oxytocin. Its activity is mediated by G proteins which activate a phosphatidylinositol-calcium second messenger system. The oxytocin-oxytocin receptor system plays an important role in the uterus during parturition.
2315	
2316	The protein encoded by this gene is a member of the Toll-like receptor (TLR) family which plays a fundamental role in pathogen recognition and activation of innate immunity. TLRs are highly conserved from Drosophila to humans and share structural and functional similarities. They recognize pathogen-associated molecular patterns (PAMPs) that are expressed on infectious agents, and mediate the production of cytokines necessary for the development of effective immunity. The various TLRs exhibit different patterns of expression. This gene is preferentially expressed in immune cell rich tissues, such as spleen, lymph node, bone marrow and peripheral blood leukocytes. Studies in mice and human indicate that this receptor mediates cellular response to unmethylated CpG dinucleotides in bacterial DNA to mount an innate immune response.
2317	
2318	
2319	
2320	The semaphorin/collapsin family of molecules plays a critical role in the guidance of growth cones during neuronal development. The secreted protein encoded by this gene family member is important in axonal guidance and has been shown to act as a tumor suppressor by inducing apoptosis.
2321	This gene encodes a member of the organic-cation transporter family. It is located in a gene cluster with another member of the family, organic cation transporter like 3. The encoded protein is a transmembrane protein which is thought to transport small molecules and since this protein is conserved among several species, it is suggested to have a fundamental role in mammalian systems.
2322	
2323	
2324	The protein encoded by this gene is an RNA binding protein that interacts with the functionally-similar proteins FMR1 and FXR2. These proteins shuttle between the nucleus and cytoplasm and associate with polyribosomes, predominantly with the 60S ribosomal subunit. Three transcript variants encoding different isoforms have been found for this gene.
2325	This gene encodes a highly conserved protein whose function is not yet known. The protein was initially thought to be longer at the N-terminus and to contain an arginine-rich region but transcribed evidence indicates a smaller open reading frame that does not encode the arginine tract. The presence of a specific mutation changing the previously numbered codon 50 from ATG to AGG, or deletion of that codon, has been reported in a variety of solid tumors. With the protein size correction, this codon is now identified as the initiation codon.
2326	This gene encodes a type I integral membrane protein found only in the rough endoplasmic reticulum. The encoded protein is part of an N-oligosaccharyl transferase complex that links high mannose oligosaccharides to asparagine residues found in the Asn-X-Ser/Thr consensus motif of nascent polypeptide chains. This protein forms part of the regulatory subunit of the 26S proteasome and may mediate binding of ubiquitin-like domains to this proteasome.
2327	SEP1 localizes to cytoplasmic foci containing a complex of mRNA-degrading enzymes. In addition to mRNA metabolism, yeast Sep1 has been implicated in a variety of nuclear and cytoplasmic functions, including homologous recombination, meiosis, telomere maintenance, and microtubule assembly.[supplied by OMIM]
2328	The protein encoded by this gene is a member of the serine protease inhibitor (serpin) superfamily made up of proteins which play central roles in the regulation of a wide variety of physiological processes, including coagulation, fibrinolysis, development, malignancy and inflammation. The gene product may have a role in a growth-control, possibly growth-suppressing pathway and, when impaired, may be involved in pancreatic carcinogenesis. The protein is a member of the plasminogen activator inhibitor-1 family, a subset of the serpin superfamily whose members act as tissue-specific tPA inhibitors. Two alternatively spliced transcript variants encoding distinct protein isoforms have been found for this gene.
2329	
2330	
2331	Sm-like proteins were identified in a variety of organisms based on sequence homology with the Sm protein family (see SNRPD2; MIM 601061). Sm-like proteins contain the Sm sequence motif, which consists of 2 regions separated by a linker of variable length that folds as a loop. The Sm-like proteins are thought to form a stable heteromer present in tri-snRNP particles, which are important for pre-mRNA splicing.[supplied by OMIM]
2332	
2333	This gene encodes a member of a family of cytosolic phosphoproteins expressed exclusively in the nervous system. The encoded protein is thought to be a part of the semaphorin signal transduction pathway implicated in semaphorin-induced growth cone collapse during neural development. Alternative splicing results in multiple transcript variants.
2334	
2335	
2336	The protein encoded by this gene contains three cysteine-rich domains, a proline-rich region, and a pleckstrin homology domain with an overlapping Ras-associating domain. It is localized in the speckle domains of the nucleus, and mediates the regeneration of phosphatidylinositol (PI) from diacylglycerol in the PI-cycle during cell signal transduction.
2337	
2338	
2339	
2340	UCHL1 is a member of a gene family whose products hydrolyze small C-terminal adducts of ubiquitin to generate the ubiquitin monomer. Expression of UCHL1 is highly specific to neurons and to cells of the diffuse neuroendocrine system and their tumors. It is present in all neurons (Doran et al., 1983).[supplied by OMIM]
2341	
2342	
2343	
2344	Ankyrins are a family of proteins that are believed to link the integral membrane proteins to the underlying spectrin-actin cytoskeleton and play key roles in activities such as cell motility, activation, proliferation, contact and the maintenance of specialized membrane domains. Multiple isoforms of ankyrin with different affinities for various target proteins are expressed in a tissue-specific, developmentally regulated manner. Most ankyrins are typically composed of three structural domains: an amino-terminal domain containing multiple ankyrin repeats; a central region with a highly conserved spectrin binding domain; and a carboxy-terminal regulatory domain which is the least conserved and subject to variation. Ankyrin 2 is the major ankyrin protein in the brain. It is a distinct gene product from ankyrin 1, with which it shares similarity in the amino-terminal ankyrin repeat and the spectrin binding domains; however, the carboxy-terminus is unique. Alternatively spliced transcripts, encoding different isoforms of ankyrin 2, have been described.
2345	
2346	
2347	
2348	
2349	
2350	Unlike most eukaryotic kinases, alpha kinases, such as LAK, recognize phosphorylation sites in which the surrounding peptides have an alpha-helical conformation.[supplied by OMIM]
2351	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene encodes a replication-independent member of the histone H2A family that is distinct from other members of the family. Studies in mice have shown that this particular histone is required for embryonic development and indicate that lack of functional histone H2A leads to embryonic lethality.
2352	
2353	
2354	
2355	
2356	The protein encoded by this gene was identified by its RED repeat, a stretch of repeated arginine, glutamic acid and aspartic acid residues. The protein localizes to discrete dots within the nucleus, excluding the nucleolus. Its function is unknown. This gene maps to chromosome 5; however, a pseudogene may exist on chromosome 2.
2357	
2358	
2359	
2360	
2361	
2362	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that may be a functional partner of the death associated protein 3 (DAP3).
2363	This gene is a member of the protocadherin family, which represents a subset of the larger cadherin superfamily. The members of the protocadherin family encode non-classical cadherins that function as calcium-dependent cell-cell adhesion molecules. The kidney-expressed gene product has a signal peptide, nine cadherin repeat domains and a unique cytoplasmic region. This protocadherin represents a new candidate for tumor suppression.
2364	
2365	This gene encodes a protein that is involved in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is a glycolipid which contains three mannose molecules in its core backbone. The GPI-anchor is found on many blood cells and serves to anchor proteins to the cell surface. This protein and another GPI synthesis protein, PIGO, function in the transfer of ethanolaminephosphate (EtNP) to the third mannose in GPI. At least two alternatively spliced transcripts encoding distinct isoforms have been found for this gene.
2366	This gene encodes a protein required for transactivation by the HIV-1 transactivator Tat. Although the encoded protein is not a TBP-associated factor (TAF), it is associated with the RNA polymerase II holoenzyme and is involved in the regulation of transcription elongation. This gene product has motifs found in transcription factors that permit protein-protein interactions, including a glutamine-alanine repeat not observed in mammalian transcription factors.
2367	
2368	
2369	
2370	The protein encoded by this gene is a cytokine that controls the production, differentiation, and function of granulocytes and macrophages. The active form of the protein is found extracellularly as a homodimer. This gene has been localized to a cluster of related genes at chromosome region 5q31, which is known to be associated with interstitial deletions in the 5q- syndrome and acute myelogenous leukemia. Other genes in the cluster include those encoding interleukins 4, 5, and 13.
2371	This gene encodes an F-box protein which functions as a substrate recognition component of the SCF ubiquitin ligase complex. It binds to cyclin F, S-phase kinase-associated protein 2, and other regulatory proteins involved in ubiquitin proteolysis through an F-box motif. The encoded protein also collaborates with a network of proteins to control beta-catenin levels and affects the activity level of beta-catenin dependent TCF transcription factors. Studies have also characterized the protein as an RNA polymerase II elongation factor. Alternative splicing of this gene results in two transcript variants. A related pseudogene has been identified on chromosome 7.
2372	This gene encodes an F-box protein which functions as a substrate recognition component of the SCF ubiquitin ligase complex. It binds to cyclin F, S-phase kinase-associated protein 2, and other regulatory proteins involved in ubiquitin proteolysis through an F-box motif. The encoded protein also collaborates with a network of proteins to control beta-catenin levels and affects the activity level of beta-catenin dependent TCF transcription factors. Studies have also characterized the protein as an RNA polymerase II elongation factor. Alternative splicing of this gene results in two transcript variants. A related pseudogene has been identified on chromosome 7.
2373	
2374	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit contains an N-terminal glutathione transferase domain, which may be involved in regulating the assembly of multisubunit complexes containing this elongation factor and aminoacyl-tRNA synthetases.
2375	Cytolytic T lymphocytes (CTL) and natural killer (NK) cells share the remarkable ability to recognize, bind, and lyse specific target cells. They are thought to protect their host by lysing cells bearing on their surface @#$%&nonself@#$%& antigens, usually peptides or proteins resulting from infection by intracellular pathogens. The protein described here is a T cell- and natural killer cell-specific serine protease that may function as a common component necessary for lysis of target cells by cytotoxic T lymphocytes and natural killer cells.
2376	IRX1 is a member of the Iroquois homeobox gene family. Members of this family appear to play multiple roles during pattern formation of vertebrate embryos.[supplied by OMIM]
2377	
2378	Glutathione peroxidase catalyzes the reduction of hydrogen peroxide, organic hydroperoxide, and lipid peroxides by reduced glutathione and functions in the protection of cells against oxidative damage. Human plasma glutathione peroxidase has been shown to be a selenium-containing enzyme. GPX3 expression appears to be tissue-specific.
2379	
2380	The leukemia inhibitory factor is a polyfunctional cytokine that affects the differentiation, survival, and proliferation of a wide variety of cells in the adult and the embryo. LIF action appears to be mediated through a high-affinity receptor complex composed of a low-affinity LIF binding chain (LIF receptor) and a high-affinity converter subunit, gp130. Both LIFR and gp130 are members of a family of cytokine receptors that includes components of the receptors for the majority of hematopoietic cytokines and for cytokines that affect other systems, including the ciliary neurotrophic factor, growth hormone and prolactin.
2381	
2382	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind RNAs, followed by a glycine-rich C-terminus.
2383	
2384	The protein encoded by this gene is a member of the cyclin-dependent protein kinase (CDK) family. CDK family members are highly similar to the gene products of Saccharomyces cerevisiae cdc28, and Schizosaccharomyces pombe cdc2, and are known to be important regulators of cell cycle progression. This protein forms a trimeric complex with cyclin H and MAT1, which functions as a Cdk-activating kinase (CAK). It is an essential component of the transcription factor TFIIH, that is involved in transcription initiation and DNA repair. This protein is thought to serve as a direct link between the regulation of transcription and the cell cycle.
2385	
2386	Members of the ribonuclease III superfamily of double-stranded (ds) RNA-specific endoribonucleases participate in diverse RNA maturation and decay pathways in eukaryotic and prokaryotic cells (Fortin et al., 2002). The RNase III Drosha is the core nuclease that executes the initiation step of microRNA (miRNA) processing in the nucleus (Lee et al., 2003).[supplied by OMIM]
2387	
2388	The protein encoded by this gene shares strong similarity with Saccharomyces cerevisiae Cdc23, a protein essential for cell cycle progression through the G2/M transition. This protein is a component of anaphase-promoting complex (APC), which is composed of eight protein subunits and highly conserved in eucaryotic cells. APC catalyzes the formation of cyclin B-ubiquitin conjugate that is responsible for the ubiquitin-mediated proteolysis of B-type cyclins. This protein and 3 other members of the APC complex contain the TPR (tetratricopeptide repeat), a protein domain important for protein-protein interaction.
2389	
2390	Nucleobindin-2 is a calcium-binding EF-hand protein.[supplied by OMIM]
2391	
2392	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes.  They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis.  Laminins are composed of 3 non identical chains: laminin alpha, beta and gamma (formerly A, B1, and B2, respectively) and they form a cruciform structure consisting of 3 short arms, each formed by a different chain, and a long arm composed of all 3 chains.  Each laminin chain is a multidomain protein encoded by a distinct gene.  Several isoforms of each chain have been described.  Different alpha, beta and gamma chain isomers combine to give rise to different heterotrimeric laminin isoforms which are designated by Arabic numerals in the order of their discovery, i.e. alpha1beta1gamma1 heterotrimer is laminin 1.  The biological functions of the different chains and trimer molecules are largely unknown, but some of the chains have been shown to differ with respect to their tissue distribution, presumably reflecting diverse functions in vivo.  This gene encodes the alpha chain isoform laminin, alpha 4.  The domain structure of alpha 4 is similar to that of alpha 3, both of which resemble truncated versions of alpha 1 and alpha 2, in that approximately 1,200 residues at the N-terminus (domains IV, V and VI) have been lost. Laminin, alpha 4 contains the C-terminal G domain which distinguishes all alpha chains from the beta and gamma chains.  The RNA analysis from adult and fetal tissues revealed developmental regulation of expression, however, the exact function of laminin, alpha 4 is not known.  Tissue-specific utilization of alternative polyA-signal has been described in literature.  Also, alternative splicing involving the first intron in the 5@#$%& UTR, and laminin alpha 4 like isoforms have been noted, however, the full-length nature of these products is not known.
2393	RRAGD is a monomeric guanine nucleotide-binding protein, or G protein. By binding GTP or GDP, small G proteins act as molecular switches in numerous cell processes and signaling pathways.[supplied by OMIM]
2394	
2395	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This endoplasmic reticulum protein is involved in the conversion of cholesterol to bile acids. Its substrates include the oxysterols 25-hydroxycholesterol, 27-hydroxycholesterol and 24-hydroxycholesterol.
2396	This gene encodes a DNA-binding protein with a gcm-motif (glial cell missing motif). The encoded protein is a homolog of the Drosophila glial cells missing gene (gcm). This protein binds to the GCM-motif (A/G)CCCGCAT, a novel sequence among known targets of DNA-binding proteins. The N-terminal DNA-binding domain confers the unique DNA-binding activity of this protein.
2397	The protein encoded by this gene is a cytosolic prolyl endopeptidase that cleaves peptide bonds on the C-terminal side of prolyl residues within peptides that are up to approximately 30 amino acids long. Prolyl endopeptidases have been reported to be involved in the maturation and degradation of peptide hormones and neuropeptides.
2398	
2399	
2400	
2401	
2402	This gene encodes an intracellular F-actin binding protein. The protein is expressed in lymphocytes, neutrophils, macrophages, and endothelium and may regulate neutrophil motility, adhesion to fibrinogen matrix proteins, and transendothelial migration. Alternative splicing results in multiple transcript variants encoding different isoforms.
2403	
2404	
2405	
2406	The product of this gene is a membrane protein expressed on the oligodendrocyte cell surface and the outermost surface of myelin sheaths. Due to this localization, it is a primary target antigen involved in immune-mediated demyelination. This protein may be involved in completion and maintenance of the myelin sheath and in cell-cell communication. Alternatively spliced transcript variants encoding different isoforms have been identified.
2407	
2408	
2409	Component C2 is part of the classical pathway of complement system.  Activated C1 cleaves C2 into C2a and C2b.  C2a leads to activation of C3.  Deficiency of C2 has been reported to associated with certain autoimmune diseases.
2410	A cluster of genes, BAT1-BAT5, has been localized in the vicinity of the genes for TNF alpha and TNF beta. These genes are all within the human major histocompatibility complex class III region. The protein encoded by this gene is thought to be involved in some aspects of immunity.
2411	
2412	
2413	This gene is a member of the cyclophilin family of peptidylprolyl isomerases (PPIases). The cyclophilins are a highly conserved, ubiquitous family, members of which play an important role in protein folding, immunosuppression by cyclosporin A, and infection of HIV-1 virions. Based on similarity to other PPIases, this protein could accelerate the folding of proteins and might catalyze the cis-trans isomerization of proline imidic peptide bonds in oligopeptides.
2414	
2415	
2416	
2417	
2418	
2419	Chromodomain Y is a primate-specific Y-chromosomal gene family expressed exclusively in the testis and implicated in infertility. Although the Y-linked genes are testis-specific, this autosomal gene is ubiquitously expressed. The Y-linked genes arose by retrotransposition of an mRNA from this gene, followed by amplification of the retroposed gene. Proteins encoded by this gene superfamily possess a chromodomain, a motif implicated in chromatin binding and gene suppression, and a catalytic domain believed to be involved in histone acetylation. Multiple proteins are encoded by transcript variants of this gene.
2420	
2421	
2422	This gene encodes a protein, which has high sequence similarity to rat, xenopus and zebrafish proteins. The protein function is unknown.
2423	Mitogen-activated protein kinase (MAPK) signaling cascades include MAPK or extracellular signal-regulated kinase (ERK), MAPK kinase (MKK or MEK), and MAPK kinase kinase (MAPKKK or MEKK). MAPKK kinase/MEKK phosphorylates and activates its downstream protein kinase, MAPK kinase/MEK, which in turn activates MAPK. The kinases of these signaling cascades are highly conserved, and homologs exist in yeast, Drosophila, and mammalian cells. MAPKKK5 contains 1,374 amino acids with all 11 kinase subdomains. Northern blot analysis shows that MAPKKK5 transcript is abundantly expressed in human heart and pancreas. The MAPKKK5 protein phosphorylates and activates MKK4 (aliases SERK1, MAPKK4) in vitro, and activates c-Jun N-terminal kinase (JNK)/stress-activated protein kinase (SAPK) during transient expression in COS and 293 cells; MAPKKK5 does not activate MAPK/ERK.
2424	The protein encoded by this gene is a brain fatty acid binding protein. Fatty acid binding proteins (FABPs) are a family of small, highly conserved, cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. FABPs are thought to play roles in fatty acid uptake, transport, and metabolism.
2425	
2426	
2427	
2428	
2429	
2430	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
2431	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
2432	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
2433	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein that is known to complex with integrins. This protein appears to promote muscle cell fusion and support myotube maintenance. Also it may be involved in signal transduction. This gene is localized in the tumor-suppressor gene region and thus it is a candidate gene for malignancies.
2434	
2435	
2436	This gene was identified by the rapid tyrosine-phosphorylation of its product in response to cytokine stimulation. The encoded protein contains a SH3 domain and the immunoreceptor tyrosine-based activation motif (ITAM). This protein associates with JAK3 and JAK2 kinases via its ITAM region, and is phosphorylated by the JAK kinases upon cytokine stimulation, which suggests the function of this protein is as an adaptor molecule involved in the downstream signaling of cytokine receptors. HGS/HRS (hepatocyte grwoth factor-regulated tyrosine kinase substrate) has been found to bind and counteract the function of this protein.
2437	
2438	
2439	The protein encoded by this gene is a carboxypeptidase and bears strong sequence similarity to serine carboxypeptidases. Carboxypeptidases are a large class of proteases that act to cleave a single amino acid from the carboxy termini of proteins or peptides. The exact function of this protein, however, has not been determined. At least two alternatively spliced transcripts which encode the same protein have been observed.
2440	
2441	
2442	
2443	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
2444	
2445	
2446	The protein encoded by this gene is a type I transmembrane glycoprotein which shows homology to the pMEL17 precursor, a melanocyte-specific protein. GPNMB shows expression in the lowly metastatic human melanoma cell lines and xenografts but does not show expression in the highly metastatic cell lines. GPNMB may be involved in growth delay and reduction of metastatic potential. Two transcript variants encoding different isoforms have been found for this gene.
2447	
2448	
2449	
2450	
2451	
2452	This gene product is one of the SNARE recognition molecules implicated in vesicular transport between secretory compartments. It is a membrane associated, isoprenylated protein that functions at the endoplasmic reticulum-Golgi transport step. This protein is highly conserved from yeast to human and can functionally complement the loss of the yeast homolog in the yeast secretory pathway.
2453	
2454	
2455	Calcitonin gene-related peptide (CGRP; MIM 114130), a 37-amino acid neuropeptide, induces increased intracellular cAMP levels. This occurs when CGRP binds to a CGRP receptor (CRLR, or CGRPR; MIM 114190) after the latter is transported to the cell surface by RAMP1 (MIM 605153). CGRPR component protein, or CGRPRCP, modulates CGRP responsiveness in a variety of cell types.[supplied by OMIM]
2456	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It contains two transcript variants diverging at the 5@#$%& termini. These two variants are proposed to be the products of separate promoters and not to be splice variants from a single promoter. They are differentially expressed in normal tissues, one of which (variant 2) is expressed at significant levels only in the pancreas, whereas another one (variant 1) is expressed more ubiquitously with highest levels in adult kidney, placenta, brain, heart, and spleen.
2457	Multiprotein complexes are key determinants of Golgi apparatus structure and its capacity for intracellular transport and glycoprotein modification. Several complexes have been identified, including the Golgi transport complex (GTC), the LDLC complex, which is involved in glycosylation reactions, and the SEC34 complex, which is involved in vesicular transport. These 3 complexes are identical and have been termed the conserved oligomeric Golgi (COG) complex, which includes COG5 (Ungar et al., 2002).[supplied by OMIM]
2458	
2459	
2460	The protein encoded by this gene is a member of an epithelium-specific subclass of the Ets transcritpion factor family. In addition to its role in regulating the later stages of terminal differentiation of keratinocytes, it appears to regulate a number of epithelium-specific genes found in tissues containing glandular epithelium such as salivary gland and prostate. It has very low affinity to DNA due to its negative regulatory domain at the amino terminus. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
2461	
2462	
2463	
2464	This gene encodes the phosphatase 2A catalytic subunit. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. This gene encodes a beta isoform of the catalytic subunit. Two transcript variants encoding the same protein have been identified for this gene.
2465	
2466	
2467	
2468	
2469	The protein encoded by this gene contains a RING finger, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions.
2470	
2471	
2472	
2473	
2474	Cellular responses to tumor necrosis factor (TNF; MIM 191160) are initiated by the interaction of TNF with 2 distinct cell surface receptors that transmit signals to the cytoplasm and nucleus, leading to profound alterations in transcriptional programs. The initiation of intracellular signaling events through 1 of these receptors, the 55-kD tumor necrosis factor receptor-1 (TNFR1; MIM 191190), appears to depend on protein intermediates that interact with specific cytoplasmic domains of TNFR1 (Adam-Klages et al., 1996).[supplied by OMIM]
2475	The protein encoded by this gene contains a THAP domain, a conserved DNA-binding domain. This protein colocalizes with the apoptosis response protein PAWR/PAR-4 in promyelocytic leukemia (PML) nuclear bodies, and functions as a proapoptotic factor that links PAWR to PML nuclear bodies. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
2476	
2477	
2478	
2479	The protein encoded by this gene is constitutively tyrosine phosphorylated in hematopoietic progenitors isolated from chronic myelogenous leukemia (CML) patients in the chronic phase. It may be a critical substrate for p210(bcr/abl), a chimeric protein whose presence is associated with CML. This encoded protein binds p120 (RasGAP) from CML cells. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
2480	
2481	
2482	This gene catalyzes the hydrolysis of folylpoly-gamma-glutamates and antifolylpoly-gamma-glutamates by the removal of gamma-linked polyglutamates and glutamate.
2483	
2484	This gene is a member of the BCL2/adenovirus E1B 19 kd-interacting protein (BNIP) family.  It interacts with the E1B 19 kDa protein which is responsible for the protection of virally-induced cell death, as well as E1B 19 kDa-like sequences of BCL2, also an apoptotic protector.  The protein encoded by this gene is a functional homolog of BNIP3, a proapoptotic protein.  This protein may function simultaneously with BNIP3 and may play a role in tumor suppression.
2485	The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and differentiation. Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. This gene product inactivates ERK1, ERK2 and JNK, is expressed in a variety of tissues, and is localized in the nucleus. Two alternatively spliced transcript variants, encoding distinct isoforms, have been observed for this gene. In addition, multiple polyadenylation sites have been reported.
2486	
2487	Adipocyte differentiation-related protein is associated with the globule surface membrane material. This protein is a major constituent of the globule surface. Increase in mRNA levels is one of the earliest indications of adipocyte differentiation
2488	
2489	This gene encodes a nuclear protein which is involved in cell-cycle regulation. The encoded protein is a ubiquitin-ligase capable of ubiquinating PCNP (PEST-containing nuclear protein), and together they may play a role in tumorigenesis. Alternative splicing of this gene results in several transcripts; however, not all variants have been fully described.
2490	
2491	
2492	
2493	
2494	Cytoplasmic ribosomes, organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L7AE family of ribosomal proteins. It can interact with a subclass of nuclear hormone receptors, including thyroid hormone receptor, and inhibit their ability to transactivate by preventing their binding to their DNA response elements. This gene is included in the surfeit gene cluster, a group of very tightly linked genes that do not share sequence similarity. It is co-transcribed with the U24, U36a, U36b, and U36c small nucleolar RNA genes, which are located in its second, fifth, fourth, and sixth introns, respectively. This gene rearranges with the trk proto-oncogene to form the chimeric oncogene trk-2h, which encodes an oncoprotein consisting of the N terminus of ribosomal protein L7a fused to the receptor tyrosine kinase domain of trk. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
2495	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S2 family.
2496	
2497	
2498	
2499	
2500	
2501	
2502	
2503	
2504	This gene is one of several CC cytokine genes clustered on the p-arm of chromosome 9. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The protein encoded by this gene is chemotactic for skin-associated memory T lymphocytes. This cytokine may also play a role in mediating homing of lymphocytes to cutaneous sites. It specifically binds to chemokine receptor 10 (CCR10). Studies of a similar murine protein indicate that these protein-receptor interactions have a pivotal role in T cell-mediated skin inflammation.
2505	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. This gene is unique among the beta4GalT genes because it encodes an enzyme that participates both in glycoconjugate and lactose biosynthesis. For the first activity, the enzyme adds galactose to N-acetylglucosamine residues that are either monosaccharides or the nonreducing ends of glycoprotein carbohydrate chains. The second activity is restricted to lactating mammary tissues where the enzyme forms a heterodimer with alpha-lactalbumin to catalyze UDP-galactose + D-glucose <=> UDP + lactose. The two enzymatic forms result from alternate transcription initiation sites and post-translational processing. Two transcripts, which differ only at the 5@#$%& end, with approximate lengths of 4.1 kb and 3.9 kb encode the same protein. The longer transcript encodes the type II membrane-bound, trans-Golgi resident protein involved in glycoconjugate biosynthesis. The shorter transcript encodes a protein which is cleaved to form the soluble lactose synthase.
2506	Dynamins represent one of the subfamilies of GTP-binding proteins. These proteins share considerable sequence similarity over the N-terminal portion of the molecule, which contains the GTPase domain. Dynamins are associated with microtubules. They have been implicated in cell processes such as endocytosis and cell motility, and in alterations of the membrane that accompany certain activities such as bone resorption by osteoclasts. Dynamins bind many proteins that bind actin and other cytoskeletal proteins. Dynamins can also self-assemble, a process that stimulates GTPase activity. This gene is located on chromosome 9. More than sixty highly conserved copies of the 3@#$%& region of this gene are found on other chromosomes, particularly chromosomes Y and 15. These copies are referred to as DNM1DN duplicons. Multiple splice variants exist for this gene but only two alternatively spliced transcripts encoding distinct isoforms have been described completely.
2507	This protein belongs to the aldehyde dehydrogenases family of proteins. Aldehyde dehydrogenase is the second enzyme of the major oxidative pathway of alcohol metabolism. This gene does not contain introns in the coding sequence. The variation of this locus may affect the development of alcohol-related problems.
2508	
2509	
2510	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. Oligomers of the Sec61 complex form a transmembrane channel where proteins are translocated across and integrated into the ER membrane. This complex consists of three membrane proteins- alpha, beta, and gamma. This gene encodes the beta-subunit protein. The Sec61 subunits are also observed in the post-ER compartment, suggesting that these proteins can escape the ER and recycle back. There is evidence for multiple polyadenylated sites for this transcript.
2511	
2512	
2513	
2514	
2515	
2516	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. These enzymes function in the detoxification of electrophilic compounds, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress, by conjugation with glutathione. The genes encoding these enzymes are known to be highly polymorphic. These genetic variations can change an individual@#$%&s susceptibility to carcinogens and toxins as well as affect the toxicity and efficacy of some drugs. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-tranferase belonging to the alpha class. The alpha class genes, located in a cluster mapped to chromosome 6, are the most abundantly expressed glutathione S-transferases in liver. In addition to metabolizing bilirubin and certain anti-cancer drugs in the liver, the alpha class of these enzymes exhibit glutathione peroxidase activity thereby protecting the cells from reactive oxygen species and the products of peroxidation.
2517	The protein encoded by this gene is a component of the exocyst complex, a multiple protein complex essential for targeting exocytic vesicles to specific docking sites on the plasma membrane. Though best characterized in yeast, the component proteins and functions of exocyst complex have been demonstrated to be highly conserved in higher eukaryotes. At least eight components of the exocyst complex, including this protein, are found to interact with the v-ral simian leukemia viral oncogene homolog A (RALA), a Ras-related small GTPase involved in controlling actin cytoskeletal remodeling and vesicle transport. The exocyst complex containing this protein is required for targeting the glucose transporter GLUT4 to the plasma membrane by insulin. The protein complex containing the similar protein in rat was reported to be involved in synaptic vesicle docking and fusion at the presynaptic terminal.
2518	This gene encodes a protein localized in mitochondria and closely related to paraplegin. The paraplegin gene is responsible for an autosomal recessive form of hereditary spastic paraplegia. This gene is a candidate gene for other hereditary spastic paraplegias or neurodegenerative disorders.
2519	
2520	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein.
2521	
2522	
2523	Chloride channels are a diverse group of proteins that regulate fundamental cellular processes including stabilization of cell membrane potential, transepithelial transport, maintenance of intracellular pH, and regulation of cell volume. Chloride intracellular channel 4 (CLIC4) protein, encoded by the CLIC4 gene, is a member of the p64 family; the gene is expressed in many tissues and exhibits a intracellular vesicular pattern in Panc-1 cells (pancreatic cancer cells).
2524	Chloride channels are a diverse group of proteins that regulate fundamental cellular processes including stabilization of cell membrane potential, transepithelial transport, maintenance of intracellular pH, and regulation of cell volume. Chloride intracellular channel 4 (CLIC4) protein, encoded by the CLIC4 gene, is a member of the p64 family; the gene is expressed in many tissues and exhibits a intracellular vesicular pattern in Panc-1 cells (pancreatic cancer cells).
2525	The product of this gene belongs to the ALG6/ALG8 glucosyltransferase family. This enzyme adds the second glucose residue to the lipid-linked oligosaccharide precursor for N-linked glycosylation. Defects in this gene are the cause of congenital disorder of glycosylation type Ih (CDG-Ih). Alternatively spliced transcript variants encoding different isoforms have been identified.
2526	Dihydropyrimidine dehydrogenase  (DPYD) is a pyrimidine  catabolic enzyme and the initial and rate-limiting step in the pathway of  uracil and thymidine catabolism and also in the pathway leading to the  formation of beta-alanine. The DPYD gene is a large gene of approximately 150  kb consisting of 23 exons encoding a protein of approximately 111-kDa. Genetic  deficiency of DPYD enzyme results in an error in pyrimidine metabolism  associated with thymine-uraciluria and an increased risk of toxicity in  cancer patients receiving 5-flourouracil chemotherapy.
2527	
2528	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem intracellular catalytic domains, and thus represents a receptor-type PTP. The extracellular region contains a meprin-A5 antigen-PTP (MAM) domain, Ig-like and fibronectin type III-like repeats. The protein domain structure and the expression pattern of the mouse counterpart of this PTP suggest its roles in both signal transduction and cellular adhesion in the central nervous system. Two alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported.
2529	
2530	The protein encoded by this gene is similar to ING1, a tumor suppressor protein that can interact with TP53, inhibit cell growth, and induce apoptosis. This protein contains a PHD-finger, which is a common motif in proteins involved in chromatin remodeling. This protein can bind TP53 and EP300/p300, a component of the histone acetyl transferase complex, suggesting its involvement in TP53-dependent regulatory pathway.
2531	
2532	
2533	This gene is a member of the melanoma antigen gene (MAGE) family. Most of the genes of this family encode tumor specific antigens that are not expressed in normal adult tissues except testis. Although the protein encoded by this gene shares strong homology with members of the MAGE family, it is expressed in almost all normal adult tissues. This gene has been demonstrated to be involved in the p75 neurotrophin receptor mediated programmed cell death pathway. Three transcript variants encoding two different isoforms have been found for this gene.
2534	
2535	
2536	The protein encoded by this gene is a member of the mammalian members of the EGF family of growth factors. It is synthesized primarily as a transmembrane precursor, which is then processed to mature molecule by proteolytic events. This encoded protein is a ligand for the EGF receptor. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
2537	
2538	
2539	The protein encoded by this gene is a member of the G-protein-coupled receptor family. This protein is a receptor for interleukin 8 (IL8). It binds to IL8 with high affinity, and transduces the signal through a G-protein activated second messenger system. This receptor also binds to chemokine (C-X-C motif) ligand 1 (CXCL1/MGSA), a protein with melanoma growth stimulating activity, and has been shown to be a major component required for serum-dependent melanoma cell growth. This receptor mediates neutrophil migration to sites of inflammation. The angiogenic effects of IL8 in intestinal microvascular endothelial cells are found to be mediated by this receptor. Knockout studies in mice suggested that this receptor controls the positioning of oligodendrocyte precursors in developing spinal cord by arresting their migration. This gene, IL8RA, a gene encoding another high affinity IL8 receptor, as well as IL8RBP, a pseudogene of IL8RB, form a gene cluster in a region mapped to chromosome 2q33-q36.
2540	Beta actin is one of six different actin isoforms which have been identified.  ACTB is one of the two nonmuscle cytoskeletal actins.  Actins are highly conserved proteins that are involved in cell motility, structure and integrity.  Alpha actins are a major constituent of the contractile apparatus.
2541	
2542	
2543	PROP1 has both DNA-binding and transcriptional activation ability. Its expression leads to ontogenesis of pituitary gonadotropes, as well as somatotropes, lactotropes, and caudomedial thyrotropes. Inactivating mutations in PROP1 result in deficiencies of luteinizing hormone (LH; MIM 152780), follicle-stimulating hormone (FSH; MIM 136530), growth hormone (GH; MIM 139250), prolactin (PRL; MIM 176760), and thyroid-stimulating hormone (TSH; MIM 188540).[supplied by OMIM]
2544	
2545	Spire proteins, such as SPIRE1, are highly conserved between species. They belong to the family of Wiskott-Aldrich homology region-2 (WH2) proteins, which are involved in actin organization (Kerkhoff et al., 2001).[supplied by OMIM]
2546	This gene encodes subunit 3 of the splicing factor 3b protein complex. Splicing factor 3b, together with splicing factor 3a and a 12S RNA unit, forms the U2 small nuclear ribonucleoproteins complex (U2 snRNP). The splicing factor 3b/3a complex binds pre-mRNA upstream of the intron@#$%&s branch site in a sequence independent manner and may anchor the U2 snRNP to the pre-mRNA. Splicing factor 3b is also a component of the minor U12-type spliceosome. Subunit 3 has also been identified as a component of the STAGA (SPT3-TAF(II)31-GCN5L acetylase) transcription coactivator-HAT (histone acetyltransferase) complex, and the TFTC (TATA-binding-protein-free TAF(II)-containing complex). These complexes may function in chromatin modification, transcription, splicing, and DNA repair.
2547	
2548	Mitochondrial uncoupling proteins (UCP) are members of the larger family of mitochondrial anion carrier proteins (MACP). UCPs separate oxidative phosphorylation from ATP synthesis with energy dissipated as heat, also referred to as the mitochondrial proton leak. UCPs facilitate the transfer of anions from the inner to the outer mitochondrial membrane and the return transfer of protons from the outer to the inner mitochondrial membrane. They also reduce the mitochondrial membrane potential in mammalian cells. Tissue specificity occurs for the different UCPs and the exact methods of how UCPs transfer H+/OH- are not known. UCPs contain the three homologous protein domains of MACPs. This gene is widely expressed in many tissues with the greatest abundance in brain and testis. The gene product has an N-terminal hydrophobic domain that is not present in other UCPs. Two splice variants have been found for this gene.
2549	
2550	
2551	
2552	
2553	
2554	
2555	
2556	The protein encoded by this gene belongs the PI3/PI4-kinase family, and is most closely related to ATM, a protein kinase encoded by the gene mutated in ataxia telangiectasia. This protein and ATM share similarity with Schizosaccharomyces pombe rad3, a cell cycle checkpoint gene required for cell cycle arrest and DNA damage repair in response to DNA damage. This kinase has been shown to phosphorylate checkpoint kinase CHK1, checkpoint proteins RAD17, and RAD9, as well as tumor suppressor protein BRCA1. Mutations of this gene are associated with Seckel syndrome. An alternatively spliced transcript variant of this gene has been reported, however, its full length nature is not known. Transcript variants utilizing alternative polyA sites exist.
2557	
2558	
2559	
2560	The major pathway for the biosynthesis of phosphatidylcholine occurs via the CDP-choline pathway. The protein encoded by this gene is the initial enzyme in the sequence and may play a regulatory role. The encoded protein also catalyzes the phosphorylation of ethanolamine. Two transcript variants encoding different isoforms have been found for this gene.
2561	
2562	The protein encoded by this gene is the alpha subunit of the heterodimeric receptor for colony stimulating factor 2, a cytokine which controls the production, differentiation, and function of granulocytes and macrophages. The encoded protein is a member of the cytokine family of receptors. This gene is found in the pseudoautosomal region (PAR) of the X and Y chromosomes. Six transcript variants encoding five different isoforms have been found for this gene, with some of the isoforms being membrane-bound and others being soluble.
2563	This gene is a member of the sauvagine/corticotropin-releasing factor/urotensin I family. It is structurally related to the corticotropin-releasing factor (CRF) gene and the encoded product is an endogenous ligand for CRF type 2 receptors. In the brain it may be responsible for the effects of stress on appetite. In spite of the gene family name similarity, the product of this gene has no sequence similarity to urotensin II.
2564	
2565	
2566	
2567	
2568	The protein encoded by this gene is one of the four known components of the sarcoglycan complex, which is a subcomplex of the dystrophin-glycoprotein complex (DGC). DGC forms a link between the F-actin cytoskeleton and the extracellular matrix. This protein is expressed most abundantly in skeletal and cardiac muscle. The mutations in this gene have been associated with autosomal recessive limb-girdle muscular dystrophy and dilated cardiomyopathy. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
2569	
2570	
2571	
2572	This gene encodes a WASP interacting protein (WIP)-related protein. It has been shown that this protein has a role in the WASP-mediated organization of the actin cytoskeleton and that this protein is a potential link between the activated platelet-derived growth factor receptor and the actin polymerization machinery.
2573	This gene encodes a member of the EPS8 family. This protein contains one PH domain and one SH3 domain. It functions as part of the EGFR pathway, though its exact role has not been determined. Highly similar proteins in other organisms are involved in the transduction of signals from Ras to Rac and growth factor-mediated actin remodeling. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
2574	Skinny hedgehog@#$%& (SKI1) encodes an enzyme that acts within the secretory pathway to catalyze amino-terminal palmitoylation of @#$%&hedgehog@#$%& (see MIM 600725).[supplied by OMIM]
2575	
2576	
2577	This gene encodes a cell surface membrane protein of the immunoglobulin superfamily. This protein is expressed in pro-B-cells and is thought to play a role in their differentiation. In mice, expression of this gene is induced in the thymus when anti-CD3 antibodies are injected and large numbers of thymocytes undergo apoptosis. Mice deficient for this gene bred on a BALB/c  background developed dilated cardiomyopathy and died from congestive heart failure. These studies suggest that this gene product may also be important in T cell function and contribute to the prevention of autoimmune diseases.
2578	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. This protein encoded by this gene is considered a member of the membrane-type MMP (MT-MMP) subfamily. However, this protein is unique among the MT-MMP@#$%&s in that it is a GPI-anchored protein rather than a transmembrane protein. The protein activates MMP-2 by cleavage.
2579	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This protein functions as a modifier of endothelial cell migration and proliferation, as well as an angiogenic factor. It acts as a mitogen for a variety of mesoderm- and neuroectoderm-derived cells in vitro, thus is thought to be involved in organogenesis. Three alternatively spliced variants encoding different isoforms have been described.
2580	
2581	IRF2 encodes interferon regulatory factor 2, a member of the interferon regulatory transcription factor (IRF) family. IRF2 competitively inhibits the IRF1-mediated transcriptional activation of interferons alpha and beta, and presumably other genes that employ IRF1 for transcription activation. However, IRF2 also functions as a transcriptional activator of histone H4.
2582	Neuregulin 1 (NRG1) was originally identified as a 44-kD glycoprotein that interacts with the NEU/ERBB2 receptor tyrosine kinase to increase its phosphorylation on tyrosine residues. It is known that an extraordinary variety of different isoforms are produced from the NRG1 gene by alternative splicing. These isoforms include heregulins (HRGs), glial growth factors (GGFs) and sensory and motor neuron-derived factor (SMDF). They are tissue-specifically expressed and differ significantly in their structure. The HRG isoforms all contain immunoglobulin (Ig) and epidermal growth factor-like (EGF-like) domains. GGF and GGF2 isoforms contain a kringle-like sequence plus Ig and EGF-like domains; and the SMDF isoform shares only the EGF-like domain with other isoforms. The receptors for all NRG1 isoforms are the ERBB family of tyrosine kinase transmembrane receptors. Through interaction with ERBB receptors, NRG1 isoforms induce the growth and differentialtion of epithelial, neuronal, glial, and other types of cells.
2583	SLC25A21 is a homolog of the S. cerevisiae ODC proteins, mitochondrial carriers that transport C5-C7 oxodicarboxylates across inner mitochondrial membranes. One of the species transported by ODC is 2-oxoadipate, a common intermediate in the catabolism of lysine, tryptophan, and hydroxylysine in mammals. Within mitochondria, 2-oxoadipate is converted into acetyl-CoA.[supplied by OMIM]
2584	
2585	
2586	
2587	Ganglioside GM3 is known to participate in the induction of cell differentiation, modulation of cell proliferation, maintenance of fibroblast morphology, signal transduction, and integrin-mediated cell adhesion. The protein encoded by this gene is a type II membrane protein which catalyzes the formation of GM3 using lactosylceramide as the substrate. The encoded protein is a member of glycosyltransferase family 29 and may be localized to the Golgi apparatus.
2588	BRDT is similar to the RING3 protein family. It possesses 2 bromodomain motifs and a PEST sequence (a cluster of proline, glutamic acid, serine, and threonine residues), characteristic of proteins that undergo rapid intracellular degradation. The bromodomain is found in proteins that regulate transcription. Two transcript variants encoding the same protein have been found for this gene.
2589	
2590	
2591	
2592	
2593	This gene encodes a protein that may play a role in organelle biogenesis associated with melanosomes, platelet dense granules, and lysosomes. The encoded protein is part of the heterotetrameric AP-3 protein complex which interacts with the scaffolding protein clathrin. Mutations in this gene are associated with Hermansky-Pudlak syndrome type 2.
2594	The UGTs (EC 2.4.1.17) serve a major role in the conjugation and subsequent elimination of potentially toxic xenobiotics and endogenous compounds. UGT2B7 has unique specificity for 3,4-catechol estrogens and estriol, suggesting that it may play an important role in regulating the level and activity of these potent estrogen metabolites. Its subcellular location is the microsome.[supplied by OMIM]
2595	HLA-DQB1 belongs to the HLA class II beta chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DQA) and a beta chain (DQB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa and it contains 6 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DQ molecule both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to 4 different molecules. Typing for these polymorphisms is routinely done for bone marrow transplantation.
2596	Guanine nucleotide-binding proteins are a family of heterotrimeric proteins that couple cell surface, 7-transmembrane domain receptors to intracellular signaling pathways. Receptor activation catalyzes the exchange of GTP for GDP bound to the inactive G protein alpha subunit resulting in a conformational change and dissociation of the complex. The G protein alpha and beta-gamma subunits are capable of regulating various cellular effectors. Activation is terminated by a GTPase intrinsic to the G-alpha subunit. G-alpha-q is the alpha subunit of one of the heterotrimeric GTP-binding proteins that mediates stimulation of phospholipase C-beta (MIM 600230).[supplied by OMIM]
2597	
2598	
2599	
2600	
2601	Apolipoprotein D (Apo-D) is a component of high density lipoprotein that has no marked similarity to other apolipoprotein sequences. It has a high degree of homology to plasma retinol-binding protein and other members of the alpha 2 microglobulin protein superfamily of carrier proteins, also known as lipocalins. It is a glycoprotein of estimated molecular weight 33 KDa. Apo-D is closely associated with the enzyme lecithin:cholesterol acyltransferase - an enzyme involved in lipoprotein metabolism.
2602	
2603	
2604	
2605	This gene encodes a protein that is one of many involved in mechanisms to ensure proper chromosome segregation during cell division. Experimental evidence indicated that the encoded protein functioned in a similar manner to that of the Drosophila rough deal protein.
2606	
2607	This gene encodes a disintegrin and metalloprotease (ADAM) domain 2, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. This member is a subunit of an integral sperm membrane glycoprotein (called fertilin), which plays an important role in sperm-egg interactions.
2608	This gene is a member of the MAGEA gene family. The members of this family encode proteins with 50 to 80% sequence identity to each other. The promoters and first exons of the MAGEA genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. The MAGEA genes are clustered at chromosomal location Xq28. They have been implicated in some hereditary disorders, such as dyskeratosis congenita. Two transcript variants encoding the same protein have been identified for this gene.
2609	The protein encoded by this gene contains an ATPase/helicase domain, and thus it belongs to the SWI/SNF family of chromatin remodeling proteins. The mutations of this gene are associated with an X-linked mental retardation (XLMR) syndrome most often accompanied by alpha-thalassemia (ATRX) syndrome. Theses mutations have been shown to cause diverse changes in the pattern of DNA methylation, which may provide a link between chromatin remodeling, DNA methylation, and gene expression in developmental processes. This protein is found to undergo cell cycle-dependent phosphorylation, which regulates its nuclear matrix and chromatin association, and suggests its involvement in the gene regulation at interphase and chromosomal segregation in mitosis. Three alternatively spliced transcript variants encoding distinct isoforms have been reported.
2610	
2611	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This gene is widely expressed in various tissues, and located in chromosome X. The function of this gene has not been determined.
2612	Synaptophysin (p38) is an integral membrane protein of small synaptic vesicles in brain and endocrine cells.[supplied by OMIM]
2613	
2614	
2615	This gene encodes a protein with a series of six RCC1-like domains (RLDs), characteristic of the highly conserved guanine nucleotide exchange factors. Mutations in this gene have been associated with X-linked retinitis pigmentosa (XLRP). Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
2616	
2617	
2618	This gene is a member of the MAGEA gene family. The members of this family encode proteins with 50 to 80% sequence identity to each other. The promoters and first exons of the MAGEA genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. The MAGEA genes are clustered at chromosomal location Xq28. They have been implicated in some hereditary disorders, such as dyskeratosis congenita. Two transcript variants encoding the same protein have been found for this gene.
2619	
2620	Spermidine/spermine N(1)-acetyltransferase (SPD/SPM acetyltransferase) is a rate-limiting enzyme in the catabolic pathway of polyamine metabolism. It catalyzes the N(1)-acetylation of spermidine and spermine and, by the successive activity of polyamine oxidase, spermine can be converted to spermidine and spermidine to putrescine.[supplied by OMIM]
2621	
2622	This gene encodes a member of the cupin superfamily. The encoded protein is an Fe(II)-containing nuclear protein expressed in all tissues of the body and concentrated within dot-like subnuclear structures. Interactions with nuclear factor I/CCAAT box transcription factor as well as B cell lymphoma 3-encoded oncoprotein suggest the encoded protein may act as a transcriptional cofactor and be involved in the regulation of DNA transcription and replication. Alternatively spliced transcript variants have been described.
2623	
2624	
2625	
2626	
2627	
2628	
2629	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a type III intermediate filament protein found in the peripheral nervous system neurons.
2630	Defects in the Bruton tyrosine kinase (BTK) gene cause Agammaglobulinemia.  Agammaglobulinemia is an X-linked immunodeficiency characterized by failure to produce mature B lymphocyte cells and associated with a failure of Ig heavy chain rearrangement.
2631	This gene is a member of the H/ACA snoRNPs (small nucleolar ribonucleoproteins) gene family. snoRNPs are involved in various aspects of rRNA processing and modification and have been classified into two families: C/D and H/ACA. The H/ACA snoRNPs also include the NOLA1, 2 and 3 proteins. The protein encoded by this gene and the three NOLA proteins localize to the dense fibrillar components of nucleoli and to coiled (Cajal) bodies in the nucleus. Both 18S rRNA production and rRNA pseudouridylation are impaired if any one of the four proteins is depleted. These four H/ACA snoRNP proteins are also components of the telomerase complex. The protein encoded by this gene is related to the Saccharomyces cerevisiae Cbf5p and Drosophila melanogaster Nop60B proteins. The gene lies in a tail-to-tail orientation with the palmitoylated erythrocyte membrane protein (MPP1) gene and is transcribed in a telomere to centromere direction. Both nucleotide substitutions and single trinucleotide repeat polymorphisms have been found in this gene. Mutations in this gene cause X-linked dyskeratosis congenita disease.
2632	The gene product inhibits renin activity by forming a dimer with renin, a complex known as high molecular weight renin. The encoded protein contains a leucine zipper domain, which is essential for its dimerization with renin. The gene product can catalyze the interconversion of N-acetylglucosamine to N-acetylmannosamine, indicating that it is a GlcNAc 2-epimerase. Transcript variants utilizing alternative promoters have been described in the literature.
2633	
2634	
2635	The protein encoded by this gene is a subunit of the interleukin 13 receptor. This subunit forms a receptor complex with IL4 receptor alpha, a subunit shared by IL13 and IL4 receptors. This subunit serves as a primary IL13-binding subunit of the IL13 receptor, and may also be a component of IL4 receptors. This protein has been shown to bind tyrosine kinase TYK2, and thus may mediate the signaling processes that lead to the activation of JAK1, STAT3 and STAT6 induced by IL13 and IL4.
2636	This gene encodes a protein containing zinc finger domains. A short peptide derived from this protein is a minor histocompatibility antigen which can lead to graft rejection of male donor cells in a female recipient.
2637	
2638	V7-3 shows structural characteristics of an Na(+) and Cl(-)-dependent neurotransmitter transporter, including 12 transmembrane (TM) domains, intracellular N and C termini, and large extracellular loops containing multiple N-glycosylation sites.[supplied by OMIM]
2639	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. Alternative splicing of this gene results in at least 3 transcript variants encoding different isoforms containing a dbl oncogene homology (DH) domain and a pleckstrin homology (PH) domain. The DH domain is associated with guanine nucleotide exchange activation for the Rho/Rac family of small GTP binding proteins, resulting in the conversion of the inactive GTPase to the active form capable of transducing signals. The PH domain has multiple functions. Therefore, these isoforms function as scaffolding proteins to coordinate a Rho signaling pathway and, in addition, function as protein kinase A-anchoring proteins.
2640	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This family member is 89% identical to the mouse Wdr8 protein at the amino acid level. The function of this protein is not known, and the mouse studies suggest that the Wdr8 protein may play a role in the process of ossification.
2641	This gene encodes a major constituent of the human complement subcomponent C1q. C1q associates with C1r and C1s in order to yield the first component of the serum complement system. Deficiency of C1q has been associated with lupus erythematosus and glomerulonephritis. C1q is composed of 18 polypeptide chains: six A-chains, six B-chains, and six C-chains. Each chain contains a collagen-like region located near the N terminus and a C-terminal globular region. The A-, B-, and C-chains are arranged in the order A-C-B on chromosome 1. This gene encodes the B-chain polypeptide of human complement subcomponent C1q
2642	Members of the ID family of helix-loop-helix (HLH) proteins lack a basic DNA-binding domain and inhibit transcription through formation of nonfunctional dimers that are incapable of binding to DNA.[supplied by OMIM]
2643	This gene encodes one of three enolase isoenzymes found in mammals; it encodes alpha-enolase, a homodimeric soluble enzyme, and also encodes a shorter monomeric structural lens protein, tau-crystallin. The two proteins are made from the same message. The full length protein, the isoenzyme, is found in the cytoplasm. The shorter protein is produced from an alternative translation start, is localized to the nucleus, and has been found to bind to an element in the c-myc promoter. A pseudogene has been identified that is located on the other arm of the same chromosome.
2644	
2645	
2646	
2647	
2648	The protein encoded by this gene is a zinc metalloprotease similar to yeast Ste24p. It is an integral membrane protein belonging to peptidase family M48 and is found in the endoplasmic reticulum and possibly in the Golgi compartment. It is thought to be involved in the proteolytic processing of farnesylated proteins.
2649	The protein encoded by this gene contains a RING-H2 finger motif, which is known to be important for protein-protein interactions. The expression of this gene has been shown to be induced by mutant RET proteins (MEN2A/MEN2B). The germline mutations in RET gene are known to be responsible for the development of multiple endocrine neoplasia (MEN).
2650	
2651	
2652	
2653	
2654	
2655	Lysyl hydroxylase is a membrane-bound homodimeric protein localized to the cisternae of the endoplasmic reticulum. The enzyme (cofactors iron and ascorbate) catalyzes the hydroxylation of lysyl residues in collagen-like peptides. The resultant hydroxylysyl groups are attachment sites for carbohydrates in collagen and thus are critical for the stability of intermolecular crosslinks. Some patients with Ehlers-Danlos syndrome type VI have deficiencies in lysyl hydroxylase activity.
2656	This gene is a member of KDWK gene family. The product of this gene associates with GMEB2 protein, and the complex is essential for parvovirus DNA replication. Study of rat homolog implicates the role of this gene in modulation of transactivation by the glucocorticoid receptor bound to glucocorticoid response elements. Two alternative spliced transcript variants encoding different isoforms exist.
2657	
2658	
2659	
2660	
2661	SELP is a platelet alpha-granule membrane protein of molecular weight 140,000 that redistributes to the plasma membrane during platelet activation and degranulation. It is a member of a family of adhesion/homing receptors. Alternative splice variants may occur but are not well documented
2662	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. This gene is the human ortholog of mouse Tbx19/Tpit gene. Studies in mouse show that Tpit protein is present only in the two pituitary pro-opiomelanocortin (POMC)-expressing lineages, the corticotrophs and melanotrophs. Mutations in the human ortholog were found in patients with isolated deficiency of pituitary POMC-derived ACTH, suggesting an essential role for this gene in differentiation of the pituitary POMC lineage.
2663	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. This protein is a transcriptional repressor and is thought to play a role in the anterior/posterior axis of the tetrapod forelimb. Mutations in this gene cause ulnar-mammary syndrome, affecting limb, apocrine gland, tooth, hair, and genital development. Alternative splicing of this gene results in three transcript variants encoding different isoforms; however, the full length nature of one variant has not been determined.
2664	
2665	
2666	PAFAH2 encodes platelet-activating factor acetylhydrolase isoform 2, a single-subunit intracellular enzyme expressed in liver and kidney that catalyzes the removal of the acetyl group at the SN-2 position of platelet-activating factor (identified as 1-O-alkyl-2-acetyl-sn-glyceryl-3-phosphorylcholine). However, this lipase exhibits a broader substrate specificity than simply platelet activating factor. Two other isoforms of intracellular platelet-activating factor acetylhydrolase exist, and both are multi-subunit enzymes. Additionally, there is a single-subunit serum isoform of this enzyme.
2667	
2668	
2669	
2670	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is part of the transmembrane V0 domain and is the human counterpart of yeast VMA16.
2671	
2672	
2673	
2674	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein which is more than twice the size of its E.coli counterpart (EcoL24). Sequence analysis identified two transcript variants that encode the same protein.
2675	This gene is a member of the mucin family and encodes a membrane bound, glycosylated phosphoprotein. This protein is anchored to the apical surface of many epithelia by a transmembrane domain, with the degree of glycosylation varying with cell type. The protein serves a protective function, binding to pathogens, as well as functioning in a cell signaling capacity. Overexpression, aberrant intracellular localization, and changes in glycosylation of this protein have been associated with carcinomas. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
2676	
2677	This gene encodes an essential eukaryotic translation initiation factor. The protein is required for the binding of the 43S complex (a 40S subunit, eIF2/GTP/Met-tRNAi and eIF3) to the 5@#$%& end of capped RNA.
2678	
2679	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H3 family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element. This gene is located separately from the other H3 genes that are in the histone gene cluster on chromosome 6p22-p21.3.
2680	There are 3 types of adenosine receptors, each of which contains 7 transmembrane domains and interacts with G proteins. The A1 receptors inhibit adenylate cyclase while the type A2 receptors stimulate activity. Each adenosine receptor has a specific pattern of ligand binding and a unique tissue distribution. Adenosine released during cardiac ischemia exerts a potent, protective effect in the heart. The adenosine A3 receptor is expressed on cardiac ventricular cells, and its activation protects the ventricular heart cell against injury during a subsequent exposure to ischemia.
2681	The myeloid cell nuclear differentiation antigen (MNDA) is detected only in nuclei of cells of the granulocyte-monocyte lineage. A 200-amino acid region of human MNDA is strikingly similar to a region in the proteins encoded by a family of interferon-inducible mouse genes, designated Ifi-201, Ifi-202, and Ifi-203, that are not regulated in a cell- or tissue-specific fashion. The 1.8-kb MNDA mRNA, which contains an interferon-stimulated response element in the 5-prime untranslated region, was significantly upregulated in human monocytes exposed to interferon alpha. MNDA is located within 2,200 kb of FCER1A, APCS, CRP, and SPTA1. In its pattern of expression and/or regulation, MNDA resembles IFI16, suggesting that these genes participate in blood cell-specific responses to interferons.
2682	
2683	
2684	
2685	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in exocytosis and endocytosis.
2686	This gene product is a member of the semaphorin family of proteins. Semaphorins represent important molecular signals controlling multiple aspects of the cellular response that follows CNS injury, and thus may play an important role in neural regeneration.
2687	
2688	
2689	
2690	
2691	Myelin protein-zero is the major structural protein of peripheral myelin.[supplied by OMIM]
2692	ABL2 is a cytoplasmic tyrosine kinase which is closely related to but distinct from ABL1.  The similarity of the proteins includes the tyrosine kinase domains and extends amino-terminal to include the SH2 and SH3 domains.  ABL2 is expressed in both normal and tumor cells.  The ABL2 gene product is expressed as two variants bearing different amino termini, both approximately 12-kb in length.
2693	
2694	
2695	This gene encodes a member of the IL10 family of cytokines. It was identified as a gene induced during terminal differentiation in melanoma cells. The protein encoded by this gene can induce apoptosis selectively in various cancer cells. Overexpression of this gene leads to elevated expression of several GADD family genes, which correlates with the induction of apoptosis. The phosphorylation of mitogen-activated protein kinase 14 (MAPK7/P38), and heat shock 27kDa protein 1 (HSPB2/HSP27) are found to be induced by this gene in melanoma cells, but not in normal immortal melanocytes. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
2696	
2697	
2698	
2699	CNN3 contains a markedly acidic C terminus; the basic N terminus is highly homologous to the CNN1 N terminus.  CNN1, CNN2, and CNN3 all contain similar tandemly repeated motifs.  CNN3 is associated with the cytoskeleton but is not involved in contraction.
2700	This gene encodes neutrophil cytosolic factor 2, the 67-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in this gene, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
2701	
2702	
2703	
2704	Integrins are integral membrane proteins composed of an alpha chain and a beta chain, and are known to participate in cell adhesion as well as cell-surface mediated signalling. The I-domain containing alpha 10 combines with the integrin beta 1 chain (ITGB1) to form a novel collagen type II-binding integrin expressed in cartilage tissue.
2705	The protein encoded by this gene is a member of the neutral endopeptidase (NEP) or membrane metallo-endopeptidase (MME) family. Family members play important roles in pain perception, arterial pressure regulation, phosphate metabolism and homeostasis. This protein is a type II transmembrane protein and is thought to be expressed as a secreted protein. This gene is expressed mainly in testis with weak expression in the brain, kidney, and heart.
2706	
2707	
2708	
2709	
2710	
2711	
2712	
2713	
2714	Twinkle is a mitochondrial protein with structural similarity to the phage T7 primase/helicase (GP4) and other hexameric ring helicases. The twinkle protein colocalizes with mtDNA in mitochondrial nucleoids, and the protein@#$%&s name derives from the unusual localization pattern reminiscent of twinkling stars (Spelbrink et al., 2001).[supplied by OMIM]
2715	
2716	
2717	
2718	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. Two alternatively spliced transcript variants of this gene have been reported, one of which encodes a receptor-type PTP that possesses a short extracellular domain, a single transmembrane region, and two tandem intracytoplasmic catalytic domains; Another one encodes a PTP that contains a distinct hydrophilic N-terminus, and thus represents a nonreceptor-type isoform of this PTP. Studies of the similar gene in mice suggested the regulatory roles of this PTP in RAS related signal transduction pathways, cytokines induced SATA signaling, as well as the activation of voltage-gated K+ channels.
2719	The protein encoded by this gene is a member of the Ser/Thr protein kinase family. This protein is a catalytic subunit of the highly conserved protein kinase complex known as M-phase promoting factor (MPF), which is essential for G1/S and G2/M phase transitions of eukaryotic cell cycle. Mitotic cyclins stably associate with this protein and function as regulatory subunits. The kinase activity of this protein is controlled by cyclin accumulation and destruction through the cell cycle. The phosphorylation and dephosphorylation of this protein also play important regulatory roles in cell cycle control.
2720	
2721	
2722	
2723	This gene encodes a member of the aldo/keto reductase superfamily, which consists of more than 40 known enzymes and proteins. These enzymes catalyze the conversion of aldehydes and ketones to their corresponding alcohols by utilizing NADH and/or NADPH as cofactors. The enzymes display overlapping but distinct substrate specificity. This enzyme catalyzes the reduction of prostaglandin (PG) D2, PGH2 and phenanthrenequinone (PQ), and the oxidation of 9alpha,11beta-PGF2 to PGD2. It may play an important role in the pathogenesis of allergic diseases such as asthma, and may also have a role in controlling cell growth and/or differentiation. This gene shares high sequence identity with three other gene members and is clustered with those three genes at chromosome 10p15-p14.
2724	The protein encoded by this gene has significant homology to NADH oxidoreductases and the apoptosis-inducing factor PDCD8/AIF. Overexpression of this gene has been shown to induce apoptosis. The expression of this gene is found to be induced by tumor suppressor protein p53 in colon caner cells.
2725	This gene encodes an arginyltransferase, an enzyme that is involved in posttranslational conjugation of arginine to N-terminal aspartate or glutamate residues. Conjugation of arginine to the N-terminal aspartate or glutamate targets proteins for ubiquitin-dependent degradation. Alternative splicing results in two transcript variants encoding distinct isoforms.
2726	This gene belongs to the SMC3 subfamily of SMC proteins. The encoded protein occurs in certain cell types as either an intracellular, nuclear protein or a secreted protein. The nuclear form, known as structural maintenance of chromosomes 3, is a component of the multimeric cohesin complex that holds together sister chromatids during mitosis, enabling proper chromosome segregation. Post-translational modification of the encoded protein by the addition of chondroitin sulfate chains gives rise to the secreted proteoglycan bamacan, an abundant basement membrane protein.
2727	This gene is a member of the BCL2/adenovirus E1B 19 kd-interacting protein (BNIP) family.  It interacts with the E1B 19 kDa protein which is responsible for the protection of virally-induced cell death, as well as E1B 19 kDa-like sequences of BCL2, also an apoptotic protector.  This gene contains a BH3 domain and a transmembrane domain, which have been associated with pro-apoptotic function.  The dimeric mitochondrial protein encoded by this gene is known to induce apoptosis, even in the presence of BCL2.
2728	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This endoplasmic reticulum protein acts on retinoids, including all-trans-retinoic acid (RA), with both 4-hydroxylation and 18-hydroxylation activities. This enzyme regulates the cellular level of retinoic acid which is involved in regulation of gene expression in both embryonic and adult tissues. Two alternatively spliced transcript variants of this gene, which encode the distinct isoforms, have been reported.
2729	
2730	L-glutamate dehydrogenase (EC 1.4.1.3) has a central role in nitrogen metabolism in plants and animals. Glutamate dehydrogenase is found in all organisms and catalyzes the oxidative deamination of 1-glutamate to 2-oxoglutarate (Smith et al., 2001). Glutamate, the main substrate of GLUD, is present in brain in concentrations higher than in other organs. In nervous tissue, GLUD appears to function in both the synthesis and the catabolism of glutamate and perhaps in ammonia detoxification (Mavrothalassitis et al., 1988).[supplied by OMIM]
2731	The protein encoded by this gene is a cytokine receptor that specifically binds IL15 with high affinity. The receptors of IL15 and IL2 share two subunits, the IL2R beta and IL2R gamma chains. This forms the basis of many overlapping biological activities of IL15 and IL2. The IL2 receptor requires an additional IL2-specific alpha subunit for high affinity IL2 binding. This protein is structurally related to IL2R alpha, but is capable of binding IL15 with high affinity independent of other subunits, which suggests the distinct roles between IL15 and IL2. This receptor is reported to enhance cell proliferation and expression of apoptosis inhibitor BCL2L1/BCL2-XL and BCL2. Multiple alternatively spliced transcript variants of this gene have been reported. The full length sequences of only two variants encoding distinct isoforms are available.
2732	N-myristoyltransferase (NMT) catalyzes the reaction of N-terminal myristoylation of many signaling proteins. It transfers myristic acid from myristoyl coenzyme A to the amino group of a protein@#$%&s N-terminal glycine residue. Biochemical evidence indicates the presence of several distinct NMTs, varying in apparent molecular weight and /or subcellular distribution. The predicted 498-amino acid of human NMT2 protein shares 77% and 96%  sequence identity with human NMT1 and mouse Nmt2 comprise two  distinct families of N-myristoyltransferases.
2733	N-myristoyltransferase (NMT) catalyzes the reaction of N-terminal myristoylation of many signaling proteins. It transfers myristic acid from myristoyl coenzyme A to the amino group of a protein@#$%&s N-terminal glycine residue. Biochemical evidence indicates the presence of several distinct NMTs, varying in apparent molecular weight and /or subcellular distribution. The predicted 498-amino acid of human NMT2 protein shares 77% and 96%  sequence identity with human NMT1 and mouse Nmt2 comprise two  distinct families of N-myristoyltransferases.
2734	
2735	Alpha-2-adrenergic receptors are members of the G protein-coupled receptor superfamily. They include 3 highly homologous subtypes: alpha2A, alpha2B, and alpha2C. These receptors have a critical role in regulating neurotransmitter release from sympathetic nerves and from adrenergic neurons in the central nervous system. Studies in mouse revealed that both the alpha2A and alpha2C subtypes were required for normal presynaptic control of transmitter release from sympathetic nerves in the heart and from central noradrenergic neurons; the alpha2A subtype inhibited transmitter release at high stimulation frequencies, whereas the alpha2C subtype modulated neurotransmission at lower levels of nerve activity. This gene encodes alpha2A subtype and it contains no introns in either its coding or untranslated sequences.
2736	Regulator of G protein signaling (RGS) family members are regulatory molecules that act as GTPase activating proteins (GAPs) for G alpha subunits of heterotrimeric G proteins. RGS proteins are able to deactivate G protein subunits of the Gi alpha, Go alpha and Gq alpha subtypes. They drive G proteins into their inactive GDP-bound forms. Regulator of G protein signaling 10 belongs to this family. All RGS proteins share a conserved 120-amino acid sequence termed the RGS domain. This protein associates specifically with the activated forms of the two related G-protein subunits, G-alphai3 and G-alphaz but fails to interact with the structurally and functionally distinct G-alpha subunits. Regulator of G protein signaling 10 protein is localized in the nucleus. Two transcript variants encoding different isoforms have been found for this gene.
2737	
2738	CLC belongs to the interleukin-6 (IL6; MIM 147620) family of cytokines, which are involved in cell signaling through phosphorylation of gp130 (MIM 162820). IL6 family members share similarity in gene structure and have a 4-helix bundle in their protein structure.[supplied by OMIM]
2739	Lysosomal acid phosphatase is comprised of two subunits, alpha and beta, and is chemically and genetically distinct from red cell acid phosphatase. Lysosomal acid phosphatase 2 is a member of a family of distinct isoenzymes which hydrolyze orthophosphoric monoesters to alcohol and phosphate. Acid phosphatase deficiency is caused by mutations in the ACP2 (beta subunit) and ACP3 (alpha subunit) genes.
2740	The protein encoded by this gene is a peripheral membrane protein that is a component of tight junctions or TJs. TJs form an apical junctional structure and act to control paracellular permeability and maintain cell polarity. This protein is related to angiomotin, an angiostatin binding protein that regulates endothelial cell migration and capillary formation.
2741	
2742	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. This gene and the gene for the HRD1 protein use in their respective 3@#$%& UTRs some of the same genomic sequence. Pseudogenes corresponding to this gene are found on chromosomes 5q and 8p.
2743	
2744	The calpains, calcium-activated neutral proteases, are nonlysosomal, intracellular cysteine proteases. The mammalian calpains include ubiquitous, stomach-specific, and muscle-specific proteins. The ubiquitous enzymes consist of heterodimers with distinct large, catalytic subunits associated with a common small, regulatory subunit. This gene encodes the large subunit of the ubiquitous enzyme, calpain 1.
2745	The protein encoded by this gene belongs to the MYST family of histone acetyl transferases (HATs) and was originally isolated as an HIV-1 TAT-interactive protein. HATs play important roles in regulating chromatin remodeling, transcription and other nuclear processes by acetylating histone and nonhistone proteins. This protein is a histone acetylase that has a role in DNA repair and apoptosis and is thought to play an important role in signal transduction. Alternative splicing of this gene results in multiple transcript variants.
2746	Dentatorubral pallidoluysian atrophy is a rare neurodegenerative disorder characterized by cerebellar ataxia, myoclonic epilepsy, choreoathetosis, and dementia. The disorder is related to the expansion of a trinucleotide repeat within this gene. The encoded protein includes a serine repeat and a region of alternating acidic and basic amino acids, as well as the variable glutamine repeat. Alternative splicing results in two transcripts variants that encode the same protein.
2747	Transcriptional repression is a general mechanism for regulating transcriptional initiation in organisms ranging from yeast to humans.  Accurate initiation of transcription from eukaryotic protein-encoding genes requires the assembly of a large multiprotein complex consisting of RNA polymerase II and general transcription factors such as TFIIA, TFIIB, and TFIID. DR1 is a repressor that interacts with the TATA-binding protein (TBP) of TFIID and prevents the formation of an active transcription complex by precluding the entry of TFIIA and/or TFIIB into the preinitiation complex. The protein encoded by this gene is a corepressor of transcription that interacts with DR1 to enhance DR1-mediated repression. The interaction between this corepressor and DR1 is required for corepressor function and appears to stabilize the TBP-DR1-DNA complex.
2748	Alpha-actinin is an actin-binding protein with multiple roles in different cell types. ACTN3 expression is limited to skeletal muscle. It is localized to the Z-disc and analogous dense bodies, where it helps to anchor the myofibrillar actin filaments
2749	
2750	
2751	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure, such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of the DEAD box protein family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a member of this family. The encoded protein is a gonadotropin-regulated and developmentally expressed testicular RNA helicase. It may serve to maintain testicular functions related to steroidogenesis and spermatogenesis.
2752	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development,reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. The protein encoded by this gene cleaves type II collagen more efficiently than types I and III. It may be involved in articular cartilage turnover and cartilage pathophysiology associated with osteoarthritis. The gene is part of a cluster of MMP genes which localize to chromosome 11q22.3.
2753	Progesterone plays a central role in the reproductive events associated with the establishment and maintenance of pregnancy. Progesterone receptor, a member of the steroid receptor superfamily, mediates the physiologic effects of progesterone. The PGR gene uses separate promoters and translational start sites to produce 2 isoforms, PRA and PRB, which are identical except for an additional 165 amino acids present only in the N terminus of PRB. Although PRA and PRB share several structural domains, they are distinct transcription factors that mediate their own response genes and physiologic effects with little overlap.[supplied by OMIM]
2754	
2755	Complex II of the respiratory chain, which is specifically involved in the oxidation of succinate, carries electrons from FADH to CoQ.  The complex is composed of four nuclear-encoded subunits and is localized in the mitochondrial inner membrane.  The subunit D protein is one of two integral membrane proteins anchoring the complex to the matrix side of the membrane.  Mutations in SDHD have been linked to hereditary paraganglioma.
2756	
2757	
2758	Glyceraldehyde-3-phosphate dehydrogenase catalyzes an important energy-yielding step in carbohydrate metabolism, the reversible oxidative phosphorylation of glyceraldehyde-3-phosphate in the presence of inorganic phosphate and nicotinamide adenine dinucleotide (NAD). The enzyme exists as a tetramer of identical chains.  A GAPD pseudogene has been mapped to Xp21-p11 and 15 GAPD-like loci have been identified.
2759	
2760	
2761	The protein encoded by this gene contains a RING finger, a motif known to be involved in protein-DNA and protein-protein interactions. Abundant expression of this gene was found in the testicular tissue of fertile men, but was not detected in azoospermic patients. Studies of the mouse counterpart suggest that this gene may function as a testis specific transcription factor during spermatogenesis.
2762	The somatomedins, or insulin-like growth factors (IGFs), comprise a family of peptides that play important roles in mammalian growth and development. IGF1 mediates many of the growth-promoting effects of growth hormone (GH; MIM 139250). Early studies showed that growth hormone did not directly stimulate the incorporation of sulfate into cartilage, but rather acted through a serum factor, termed @#$%&sulfation factor,@#$%& which later became known as @#$%&somatomedin@#$%& (Daughaday et al., 1972). Three main somatomedins have been characterized: somatomedin C (IGF1), somatomedin A (IGF2; MIM 147470), and somatomedin B (MIM 193190) (Rotwein, 1986; Rosenfeld, 2003).[supplied by OMIM]
2763	
2764	
2765	The protein encoded by this gene inhibits activated C1r and C1s of the first complement component and thus regulates complement activation. Deficiency of this protein is associated with hereditary angioneurotic oedema (HANE).
2766	This gene encodes a DEAD box protein. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division.
2767	
2768	
2769	
2770	G proteins are heterotrimers of alpha, beta, and gamma subunits. Gamma subunits, such as GNG3, contribute to the specificity of the hundreds of receptor signaling pathways involving G proteins (Schwindinger et al., 2004).[supplied by OMIM]
2771	
2772	
2773	
2774	This gene is one of several tumor-suppressing subtransferable fragments located in the imprinted gene domain of 11p15.5, an important tumor-suppressor gene region. Alterations in this region have been associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocortical carcinoma, and lung, ovarian, and breast cancer. This gene may play a role in malignancies and disease that involve this region as well as the transport of chloroquine- and quinidine-related compounds in the kidney. Two alternative transcripts encoding the same isoform have been described.
2775	
2776	
2777	This gene encodes a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, namely a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region.
2778	
2779	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
2780	
2781	
2782	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit, where it forms part of the domain where translation is initiated. The protein belongs to the S3P family of ribosomal proteins. Studies of the mouse and rat proteins have demonstrated that the protein has an extraribosomal role as an endonuclease involved in the repair of UV-induced DNA damage. The protein appears to be located in both the cytoplasm and nucleus but not in the nucleolus. Higher levels of expression of this gene in colon adenocarcinomas and adenomatous polyps compared to adjacent normal colonic mucosa have been observed. This gene is co-transcribed with the small nucleolar RNA genes U15A and U15B, which are located in its first and fifth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
2783	HBXAP is involved in transcription repression, transcription coactivation when associated with hepatitis B virus X protein (HBX), and chromatin remodeling and spacing when associated with SNF2H (MIM 603375).[supplied by OMIM]
2784	
2785	
2786	The protein encoded by this gene is a proline-rich salivary protein. This gene and five other genes that also encode salivary proline-rich proteins (PRPs), as well as a gene encoding a lacrimal gland PRP, form a PRP gene cluster in the chromosomal 12p13 region. Alternatively spliced transcript variants encoding distinct isoforms have been described.
2787	T-cell activation requires stimulation of the T-cell receptor (TCR; see MIM 186880)-CD3 (see CD3Z; MIM 186780) complex, followed by recruitment of an array of intracellular signaling proteins (e.g., GRB2 (MIM 108355) and PLCG1 (MIM 172420)). Mediating the interaction between the extracellular receptors and intracellular signaling pathways are adaptor proteins such as LAT (MIM 602354), TRIM (MIM 604962), and LNK.[supplied by OMIM]
2788	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S26E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
2789	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein that is known to complex with integrins. It may function as a blood platelet activation marker. Deficiency of this protein is associated with Hermansky-Pudlak syndrome. Also this gene has been associated with tumor progression. The use of alternate polyadenylation sites has been found for this gene.
2790	The protein encoded by this gene is a specific CDK2-associated protein, which is thought to negatively regulate CDK2 activity by sequestering monomeric CDK2, and targeting CDK2 for proteolysis. This protein was found to also interact with DNA polymerase alpha/primase and mediate the phosphorylation of the large p180 subunit, which suggested the regulatory role in DNA replication during S phase of the cell cycle. A similar gene in hamster was isolated from, and functions as a growth suppressor of normal keratinocytes.
2791	
2792	
2793	
2794	Microtubules of the eukaryotic cytoskeleton perform essential and diverse functions and are composed of a heterodimer of alpha and beta tubulins. The genes encoding these microtubule constituents belong to the tubulin superfamily, which is composed of six distinct families. Genes from the alpha, beta and gamma tubulin families are found in all eukaryotes. The alpha and beta tubulins represent the major components of microtubules, while gamma tubulin plays a critical role in the nucleation of microtubule assembly. There are multiple alpha and beta tubulin genes, which are highly conserved among species. This gene encodes alpha tubulin and is highly similar to mouse and rat Tuba1 gene. Northern blotting studies have shown that the gene expression is predominantly found in morphologically differentiated neurologic cells. This gene is one of three alpha-tubulin genes in a cluster on chromosome 12q.
2795	
2796	
2797	
2798	The protein encoded by this gene is member of the GAP1 family of GTPase-activating proteins. These proteins stimulate the GTPase activity of normal RAS p21 but not its oncogenic counterpart. Acting as a suppressor of RAS function, the protein enhances the weak intrinsic GTPase activity of RAS proteins resulting in the inactive GDP-bound form of RAS, thereby allowing control of cellular proliferation and differentiation. This particular family member contains domains which are characteristic of the GAP1 subfamily of RasGAP proteins but, in contrast to the other GAP1 family members, this protein is strongly and selectively expressed in endocrine tissues.
2799	The product of this gene is a lysosomal enzyme found in all cells. It is involved in the catabolism of heparin, heparan sulphate, and keratan sulphate. Deficiency of this enzyme results in the accumulation of undegraded substrate and the lysosomal storage disorder mucopolysaccharidosis type IIID (Sanfilippo D syndrome). Mucopolysaccharidosis type IIID is the least common of the four subtypes of Sanfilippo syndrome.
2800	Islet, or insulinoma, amyloid polypeptide (IAPP, or amylin) is commonly found in pancreatic islets of patients suffering diabetes mellitus type II, or harboring an insulinoma.  While the assosciation of amylin with the development of type II diabetes has been known for some time, a direct causative role for amylin has been harder to establish.  Recent results suggest that amylin, like the related beta-amyloid (Abeta) assosciated with Alzheimer@#$%&s disease, can induce apoptotic cell-death in particular cultured cells, an effect that may be relevant to the development of type II diabetes.
2801	
2802	
2803	
2804	The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and differentiation. Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. This gene product inactivates ERK2, is expressed in a variety of tissues with the highest levels in heart and pancreas, and unlike most other members of this family, is localized in the cytoplasm. Two transcript variants encoding different isoforms have been found for this gene.
2805	The protein encoded by this gene is a member of the TNF-receptor superfamily. This protein is one of the major receptors for the tumor necrosis factor-alpha. This receptor can activate NF-kappaB, mediate apoptosis, and function as a regulator of inflammation. Antiapoptotic protein BCL2-associated athanogene 4 (BAG4/SODD) and adaptor proteins TRADD and TRAF2 have been shown to interact with this receptor, and thus play regulatory roles in the signal transduction mediated by the receptor. Germline mutations of the extracellular domains of this receptor were found to be associated with the autosomal dominant periodic fever syndrome. The impaired receptor clearance is thought to be a mechanism of the disease.
2806	
2807	Interleukin-1 receptor (see IL1R; MIM 147810)-associated kinases (e.g., IRAK1; MIM 300283) are important mediators in the signal transduction of Toll-like receptor (TLR, e.g., TLR4; MIM 603030) and IL1R family members, collectively referred to as TIRs. IRAK4 functions in this signal transduction pathway.[supplied by OMIM]
2808	
2809	This gene encodes a protein containing two conserved tandem RNA recognition motifs. Similar proteins in other species function as RNA-binding proteins and play central roles in posttranscriptional gene regulation. Expression of this gene has been correlated with the grade of the malignancy and proliferative activity in gliomas and melanomas. A pseudogene for this gene is located on chromosome 11q13.
2810	
2811	The Shaker gene family of Drosophila encodes components of voltage-gated potassium channels and is comprised of four subfamilies. Based on sequence similarity, this gene is similar to one of these subfamilies, namely the Shaw subfamily. The protein encoded by this gene belongs to the delayed rectifier class of channel proteins and is an integral membrane protein that mediates the voltage-dependent potassium ion permeability of excitable membranes. Three transcript variants encoding three different isoforms have been found for this gene.
2812	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin forms a complex with and functions as a regulatory subunit of CDK4 or CDK6, whose activtiy is required for cell cycle G1/S transition. This protein has been shown to interact with and be involved in the phosphorylation of tumor suppressor protein Rb. Knockout studies of the homologous gene in mouse suggest the essential roles of this gene in ovarian granulosa and germ cell proliferation. High level expression of this gene was observed in ovarian and testicular tumors.
2813	
2814	The protein encoded by this gene is a membrane-bound arginine/lysine carboxypeptidase. Its expression is associated with monocyte to macrophage differentiation. This encoded protein contains hydrophobic regions at the amino and carboxy termini and has 6 potential asparagine-linked glycosylation sites. The active site residues of carboxypeptidases A and B are conserved in this protein. Three alternatively spliced transcript variants encoding the same protein have been described for this gene.
2815	
2816	
2817	This gene encodes a reticuloplasmin, a protein which resides in the lumen of the endoplasmic reticulum (ER). The protein shows sequence similarity to the protein disulfide isomerase family. However, it lacks the thioredoxin motif characteristic of this family, suggesting that this protein does not function as a disulfide isomerase. The protein dimerizes and is thought to play a role in the processing of secretory proteins within the ER.
2818	The transport of molecules between the nucleus and the cytoplasm in eukaryotic cells is mediated by the nuclear pore complex (NPC) which consists of 60-100 proteins and is probably 120 million daltons in molecular size. Small molecules (up to 70 kD) can pass through the nuclear pore by nonselective diffusion; larger molecules are transported by an active process. Most nuclear proteins contain short basic amino acid sequences known as nuclear localization signals (NLSs). KPNA3, encodes a protein similar to certain nuclear transport proteins of Xenopus and human. The predicted amino acid sequence shows similarity to Xenopus importin, yeast SRP1, and human RCH1 (KPNA2), respectively. The similarities among these proteins suggests that karyopherin alpha-3 may be involved in the nuclear transport system.
2819	The protein encoded by this gene is a member of the cyclin-dependent protein kinase (CDK) family. CDK family members are highly similar to the gene products of Saccharomyces cerevisiae cdc28, and Schizosaccharomyces pombe cdc2, and are known to be important regulators of cell cycle progression. This kinase and its regulatory subunit cyclin C are components of the RNA polymerase II holoenzyme complex, which phosphorylates the carboxy-terminal domain (CTD) of the largest subunit of RNA polymerase II. This kinase has also been shown to regulate transcription by targeting the CDK7/cyclin H subunits of the general transcription initiation factor IIH (TFIIH), thus providing a link between the @#$%&Mediator-like@#$%& protein complexes and the basal transcription machinery.
2820	
2821	
2822	
2823	G protein-coupled receptors (GPCRs, or GPRs) contain 7 transmembrane domains and transduce extracellular signals through heterotrimeric G proteins.[supplied by OMIM]
2824	
2825	
2826	
2827	This gene encodes WW domain-containing binding protein 4. The WW domain represents a small and compact globular structure that interacts with proline-rich ligands. This encoded protein is a general spliceosomal protein that may play a role in cross-intron bridging of U1 and U2 snRNPs in the spliceosomal complex A.
2828	
2829	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains several tandem leucine-rich repeats and is localized in the nucleus.
2830	
2831	
2832	
2833	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This gene encodes the V1 domain D subunit protein.
2834	
2835	
2836	The protein encoded by this gene is a member of the SEC23 subfamily of the SEC23/SEC24 family. It is part of a protein complex and found in the ribosome-free transitional face of the endoplasmic reticulum (ER) and associated vesicles. This protein has similarity to yeast Sec23p component of COPII. COPII is the coat protein complex responsible for vesicle budding from the ER. The encoded protein is suggested to play a role in the ER-Golgi protein trafficking.
2837	
2838	This gene encodes a member of the phosphoenolpyruvate carboxykinase (GTP) family. The protein is a mitochondrial enzyme that catalyzes the conversion of oxaloacetate to phosphoenolpyruvate in the presence of GTP. A cytosolic form encoded by a different gene has also been characterized and is the key enzyme of gluconeogenesis in the liver. The encoded protein may serve a similar function, although it is constitutively expressed and not modulated by hormones such as glucagon and insulin that regulate the cytosolic form. Alternatively spliced transcript variants have been described.
2839	
2840	NP encodes the enzyme purine nucleoside phosphorylase that together with adenosine deaminase (ADA) serves a key role in purine catabolism, referred to as the salvage pathway. Mutations in either enzyme result in a severe combined immunodeficiency (SCID).
2841	cAMP response element (CRE)-binding protein-like-2 (CREBL2) was identified in a search to find genes in a commonly deleted region on chromosome 12p13 flanked by ETV6 and CDKN1B genes, frequently associated with hematopoietic malignancies, as well as breast, non-small-cell lung and ovarian cancers. CREBL2 shares a 41% identity with CRE-binding protein (CREB) over a 48-base long region which encodes the bZip domain of CREB. The bZip domain consists of about 30 amino acids rich in basic residues involved in DNA binding, followed by a leucine zipper motif involved in protein dimerization. This suggests that CREBL2 encodes a protein with DNA binding capabilities. The occurance of CREBL2 deletion in malignancy suggests that CREBL2 may act as a tumor suppressor gene.
2842	
2843	This gene belongs to the family of reticulon encoding genes. Reticulons are associated with the endoplasmic reticulum, and are involved in neuroendocrine secretion or in membrane trafficking in neuroendocrine cells. Alternatively spliced transcript variants encoding different isoforms have been identified. Multiple promoters rather than alternative splicing of internal exons seem to be involved in this diversity.
2844	
2845	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the peptidase T1A family, that is a 20S core alpha subunit. Two alternative transcripts encoding different isoforms have been identified.
2846	
2847	
2848	
2849	
2850	
2851	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family, and is expressed in testis only. The encoded protein contains an RII-binding domain, and is predicted to participate in protein-protein interactions with the R-subunit of the PKA. This protein is localized to the ribs of the fibrous sheath in the principal piece of the sperm tail. It may function as a regulator of both motility- and head-associated functions such as capacitation and the acrosome reaction.
2852	
2853	This gene encodes an oxygen-binding protein that is distantly related to members of the globin gene family. It is highly conserved among other vertebrates. It is expressed in the central and peripheral nervous system where it may be involved in increasing oxygen availability and providing protection under hypoxic/ischemic conditions.
2854	The protein encoded by this gene, a member of the peptidase S1 protein family, is found in azurophil granules of neutrophilic polymorphonuclear leukocytes. The encoded protease has a specificity similar to that of chymotrypsin C, and may participate in the killing and digestion of engulfed pathogens, and in connective tissue remodeling at sites of inflammation. Transcript variants utilizing alternative polyadenylation signals exist for this gene.
2855	The product of this gene is a member of the nuclear factors of activated T cells DNA-binding transcription complex. This complex consists of at least two components: a preexisting cytosolic component that translocates to the nucleus upon T cell receptor (TCR) stimulation and an inducible nuclear component. Other members of this family of nuclear factors of activated T cells also participate in the formation of this complex. The product of this gene plays a role in the inducible expression of cytokine genes in T cells, especially in the induction of the IL-2 and IL-4.
2856	Checkpoint suppressor 1 is a member of the forkhead/winged helix transcription factor family. Checkpoints are eukaryotic DNA damage-inducible cell cycle arrests at G1 and G2. Checkpoint suppressor 1 suppresses multiple yeast checkpoint mutations including mec1, rad9, rad53 and dun1 by activating a MEC1-independent checkpoint pathway.
2857	
2858	
2859	
2860	The protein encoded by this gene is an isozyme of long-chain fatty-acid-coenzyme A ligase family. Although differing in substrate specificity, subcellular localization, and tissue distribution, all isozymes of this family convert free long-chain fatty acids into fatty acyl-CoA esters, and thereby play a key role in lipid biosynthesis and fatty acid degradation. This isozyme activates long-chain, branched-chain and very-long-chain fatty acids containing 22 or more carbons to their CoA derivatives. It is expressed primarily in liver and kidney, and is present in both endoplasmic reticulum and peroxisomes but not in mitochondria. Its decreased peroxisomal enzyme activity is in part responsible for the biochemical pathology in X-linked adrenoleukodystrophy.
2861	
2862	The increased expression level of this gene is associated with HER-2/neu proto-oncogene overexpression. Amplification and resulting overexpression of this proto-oncogene are found in approximately 30% of human breast and 20% of human ovarian cancers. Further characterization of the product of this gene may yield insight into the fundamental biology and pathogenetic effects of HER-2/neu overexpression in human breast and ovarian cancer cells. Alternatively spliced variants encoding different isoforms have been identified for this gene.
2863	
2864	
2865	The protein encoded by this gene contains a structural module related to that of the seven transmembrane domain G protein-coupled receptor superfamily. This protein has sequence and structural similarities to the beta-amyloid binding protein (BBP), but, unlike BBP, it does not regulate a response to beta-amyloid peptide. This protein may have regulatory roles in cell death or proliferation signal cascades. Several alternatively spliced transcript variants of this gene are described but the full length nature of some variants has not been determined. Multiple polyadenylation sites have been found in this gene.
2866	
2867	
2868	This gene encodes a protein containing an alpha/beta hydrolase fold, which is a catalytic domain found in a very wide range of enzymes. The function of this protein has not been determined. Alternative splicing of this gene results in two transcript variants encoding the same protein.
2869	
2870	
2871	This gene encodes a member of the nucleoporin family. The protein is localized to the nuclear rim and is an essential component of the nuclear pore complex (NPC). All molecules entering or leaving the nucleus either diffuse through or are actively transported by the NPC. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
2872	
2873	
2874	
2875	Isocitrate dehydrogenases catalyze the oxidative decarboxylation of isocitrate to 2-oxoglutarate. These enzymes belong to two distinct subclasses, one of which utilizes NAD(+) as the electron acceptor and the other NADP(+). Five isocitrate dehydrogenases have been reported: three NAD(+)-dependent isocitrate dehydrogenases, which localize to the mitochondrial matrix, and two NADP(+)-dependent isocitrate dehydrogenases, one of which is mitochondrial and the other predominantly cytosolic. NAD(+)-dependent isocitrate dehydrogenases catalyze the allosterically regulated rate-limiting step of the tricarboxylic acid cycle. Each isozyme is a heterotetramer that is composed of two alpha subunits, one beta subunit, and one gamma subunit. The protein encoded by this gene is the alpha subunit of one isozyme of NAD(+)-dependent isocitrate dehydrogenase.
2876	Erythrocyte membrane protein band 4.2 is an ATP-binding protein which may regulate the association of protein 3 with ankyrin. It probably has a role in erythrocyte shape and mechanical property regulation. Mutations in the EPB42 gene are associated with recessive spherocytic elliptocytosis and recessively transmitted hereditary hemolytic anemia.
2877	
2878	
2879	Tryptases comprise a family of trypsin-like serine proteases, the peptidase family S1. Tryptases are enzymatically active only as heparin-stabilized tetramers, and they are resistant to all known endogenous proteinase inhibitors. Several tryptase genes are clustered on chromosome 16p13.3. These genes are characterized by several distinct features. They have a highly conserved 3@#$%& UTR and contain tandem repeat sequences at the 5@#$%& flank and 3@#$%& UTR which are thought to play a role in regulation of the mRNA stability. These genes have an intron immediately upstream of the initiator Met codon, which separates the site of transcription initiation from protein coding sequence. This feature is characteristic of tryptases but is unusual in other genes. The alleles of this gene exhibit an unusual amount of sequence variation, such that the alleles were once thought to represent two separate genes, alpha and beta 1. Beta tryptases appear to be the main isoenzymes expressed in mast cells; whereas in basophils, alpha tryptases predominate. Tryptases have been implicated as mediators in the pathogenesis of asthma and other allergic and inflammatory disorders.
2880	
2881	
2882	
2883	This gene is a member of the N-myc downregulated gene family which belongs to the alpha/beta hydrolase superfamily. The protein encoded by this gene is a cytoplasmic protein that may be involved in the regulation of mitogenic signalling in vascular smooth muscles cells. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
2884	
2885	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit. Proteolytic processing is required to generate a mature subunit. Expression of this gene is induced by gamma interferon, and this gene product replaces catalytic subunit 2 (proteasome beta 7 subunit) in the immunoproteasome.
2886	
2887	The protein encoded by this gene is similar to a C. elegans protein that functions in phagocytosis of apoptotic cells and in cell migration. A similar human protein interacts with the dedicator of cyto-kinesis 1 protein to promote phagocytosis and effect cell shape changes.
2888	
2889	Ataxin-2 binding protein 1 has an RNP motif that is highly conserved among RNA-binding proteins. This protein binds to the C-terminus of ataxin-2 and may contribute to the restricted pathology of spinocerebellar ataxia type 2 (SCA2). Ataxin-2 is the gene product of the SCA2 gene which causes familial neurodegenerative diseases. Ataxin-2 binding protein 1 and ataxin-2 are both localized in the trans-Golgi network. Four alternatively spliced transcript variants have been found for this gene. Additional transcript variants have been found but their full length nature has not been determined.
2890	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intracellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the ABC1 subfamily.  Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes.  The full transporter encoded by this gene may be involved in development of resistance to xenobiotics and engulfment during programmed cell death.
2891	
2892	
2893	
2894	
2895	
2896	MEFV was identified as the gene that when mutated causes Mediterranean fever, a hereditary periodic fever syndrome. MEFV is expressed in granulocytes and myeloid bone marrow precursors.
2897	
2898	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which exhibits RNA-dependent ATPase and ATP-dependent RNA-unwinding activities. This protein is recruited to the cytoplasmic fibrils of the nuclear pore complex, where it participates in the export of mRNA from the nucleus. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
2899	
2900	
2901	Cytochrome c oxidase (COX) is the terminal enzyme of the mitochondrial respiratory chain. It is a multi-subunit enzyme complex that couples the transfer of electrons from cytochrome c to molecular oxygen and contributes to a proton electrochemical gradient across the inner mitochondrial membrane. The complex consists of 13 mitochondrial- and nuclear-encoded subunits. The mitochondrially-encoded subunits perform the electron transfer and proton pumping activities. The functions of the nuclear-encoded subunits are unknown but they may play a role in the regulation and assembly of the complex. This gene encodes the nuclear-encoded subunit IV isoform 1 of the human mitochondrial respiratory chain enzyme. It is located at the 3@#$%& of the NOC4 (neighbor of COX4) gene in a head-to-head orientation, and shares a promoter with it.
2902	Cytochrome c oxidase (COX) is the terminal enzyme of the mitochondrial respiratory chain. It is a multi-subunit enzyme complex that couples the transfer of electrons from cytochrome c to molecular oxygen and contributes to a proton electrochemical gradient across the inner mitochondrial membrane. The complex consists of 13 mitochondrial- and nuclear-encoded subunits. The mitochondrially-encoded subunits perform the electron transfer and proton pumping activities. The functions of the nuclear-encoded subunits are unknown but they may play a role in the regulation and assembly of the complex. This gene encodes the nuclear-encoded subunit IV isoform 1 of the human mitochondrial respiratory chain enzyme. It is located at the 3@#$%& of the NOC4 (neighbor of COX4) gene in a head-to-head orientation, and shares a promoter with it.
2903	
2904	This gene encodes a member of the RCC1-related GEF family. The N-terminal half of the encoded amino acid sequence shows similarity to the regulator of chromosome condensation RCC1, which acts as a guanine nucleotide exchange factor (GEF) protein for the Ras-related GTPase Ran.
2905	
2906	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This ABC full transporter is a member of the MRP subfamily which is involved in multi-drug resistance. It is expressed at low levels in all tissues, except kidney, spleen, and colon. This gene and family member ABCC12 are determined to be derived by duplication and are both localized to chromosome 16q12.1. Their chromosomal localization, potential function, and expression patterns identify them as candidates for paroxysmal kinesigenic choreoathetosis, a disorder characterized by attacks of involuntary movements and postures, chorea, and dystonia. Multiple alternatively spliced transcript variants have been described for this gene.
2907	This gene is a member of the ADAR (adenosine deaminase acting on RNA) family.  Using site-specific adenosine modification, proteins encoded by these genes participate in the pre-mRNA editing of nuclear transcripts.  The protein encoded by this gene, tRNA-specific adenosine deaminase 1, is responsible for the deamination of adenosine 37 to inosine in eukaryotic tRNA.
2908	Galactokinase is a major enzyme for the metabolism of galactose and its deficiency causes congenital cataracts during infancy and presenile cataracts in the adult population.
2909	Diacylglycerol kinases are thought to be involved mainly in the regeneration of phosphatidylinositol (PI) from diacylglycerol in the PI-cycle during cell signal transduction.  When expressed in mammalian cells, DGK-epsilon shows specificity for arachidonyl-containing diacylglycerol. DGK-epsilon is expressed predominantly in testis.
2910	Myeloperoxidase (MPO) is a heme protein synthesized during myeloid differentiation that constitutes the major component of neutrophil azurophilic granules. Produced as a single chain precursor, myeloperoxidase is subsequently cleaved into a light and heavy chain.  The mature myeloperoxidase is a tetramer composed of 2 light chains and 2 heavy chains. This enzyme produces hypohalous acids central to the microbicidal activity of netrophils.
2911	
2912	
2913	The adult colon epithelium contains 3 differentiated cell types that arise from a multipotent stem cell. Deviation from the normal maturation pathway by neoplastic transformation is thought to initiate in stem cells or their early descendants. One potential marker is ICT1 whose mRNA and protein were more highly expressed in undifferentiated than in differentiated cells.
2914	This gene is located within the Smith-Magenis syndrome region on chromosome 17. It encodes a protein of unknown function.
2915	
2916	
2917	
2918	The GLP2 receptor (GLP2R) is a G protein-coupled receptor superfamily member closely related to the glucagon receptor ans GLP1 receptor. Glucagon-like peptide-2 (GLP2) is a 33-amino acid proglucagon-derived peptide produced by intestinal enteroendocrine cells. Like glucagon-like peptide-1 (GLP1) and glucagon itself, it is derived from the proglucagon peptide encoded by the GCG gene. GLP2 stimulates intestinal growth and upregulates villus height in the small intestine, concomitant with increased crypt cell proliferation and decreased enterocyte apoptosis. Moreover, GLP2 prevents intestinal hypoplasia resulting from total parenteral nutrition. GLP2R, a G protein-coupled receptor superfamily member is expressed in the gut and closely related to the glucagon receptor (GCGR) and the receptor for GLP1 (GLP1R).
2919	
2920	
2921	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 non-identical kinase catalytic domains and phosphorylates several residues of the S6 ribosomal protein. The kinase activity of this protein leads to an increase in protein synthesis and cell proliferation. Amplification of the region of DNA encoding this gene and overexpression of this kinase are seen in some breast cancer cell lines. Alternate translational start sites have been described and alternate transcriptional splice variants have been observed but have not been thoroughly characterized.
2922	
2923	
2924	This gene product is a member of a family of membrane-associated proteins termed MAGUKs (membrane-associated guanylate kinase homologs). MAGUKs interact with the cytoskeleton and regulate cell proliferation, signaling pathways, and intracellular junctions. This protein contains a conserved sequence, called the SH3 (src homology 3) motif, found in several other proteins that associate with the cytoskeleton and are suspected to play important roles in signal transduction.
2925	Members of the @#$%&frizzled@#$%& gene family encode 7-transmembrane domain proteins that are receptors for Wnt signaling proteins.  The expression of the FZD2 gene appears to be developmentally regulated, with high levels of expression in fetal kidney and lung and in adult colon and ovary.
2926	Core binding factor (CBF) is a heterodimeric transcription factor that binds to the core element of many enhancers and promoters. The protein encoded by this gene represents the alpha subunit of CBF and is thought to be involved in the development of normal hematopoiesis. Chromosomal translocations involving this gene are well-documented and have been associated with several types of leukemia. Two transcript variants encoding different isoforms have been found for this gene.
2927	
2928	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L27E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
2929	WD40 repeat proteins are key components of many essential biologic functions. They regulate the assembly of multiprotein complexes by presenting a beta-propeller platform for simultaneous and reversible protein-protein interactions. Members of the WIPI subfamily of WD40 repeat proteins, such as WIPI1, have a 7-bladed propeller structure and contain a conserved motif for interaction with phospholipids (Proikas-Cezanne et al., 2004).[supplied by OMIM]
2930	NME1 was identified because of its reduced mRNA transcript levels in highly metastatic cells. NME1 encodes the @#$%&A@#$%& isoform of nucleoside diphosphate kinase (NDK). NDK exists as a hexamer composed of the @#$%&A@#$%& (NME1) and @#$%&B@#$%& (encoded by NME2) isoforms. Mutations in NME1 have been identified in aggressive neuroblastomas. Two transcript variants encoding different isoforms have been found for this gene.
2931	
2932	This gene encodes a member of an adaptor protein family. Members of this family encode proteins containing a homeobox homology domain, proline rich region and Src-homology 3 (SH3) domain. The encoded protein inhibits ectopic metastasis of tumor cells as well as cell migration. This may be accomplished through interaction with p21-activated kinase.
2933	
2934	The protein encoded by this gene contains a RNA recognition motif identified in the RAD52, a gene known to be important for DNA recombination and repair.
2935	This gene encodes an incretin hormone and belongs to the glucagon superfamily. The encoded protein is important in maintaining glucose homeostasis as it is a potent stimulator of insulin secretion from pancreatic beta-cells following food ingestion and nutrient absorption. This gene stimulates insulin secretion via its G protein-coupled receptor activation of adenylyl cyclase and other signal transduction pathways. It is a relatively poor inhibitor of gastric acid secretion.
2936	The protein encoded by this gene (p35) is a neuron-specific activator of cyclin-dependent kinase 5 (CDK5); the activation of CDK5 is required for proper development of the central nervous system. The p35 form of this protein is proteolytically cleaved by calpain, generating a p25 form. The cleavage of p35 into p25 results in relocalization of the protein from the cell periphery to nuclear and perinuclear regions. P25 deregulates CDK5 activity by prolonging its activation and changing its cellular location. The p25 form accumulates in the brain neurons of patients with Alzheimer@#$%&s disease. This accumulation correlates with an increase in CDK5 kinase activity, and may lead to aberrantly phosphorylated forms of the microtubule-associated protein tau, which contributes to Alzheimer@#$%&s disease.
2937	
2938	This gene encodes a member of the C-type lectin/C-type lectin-like domain (CTL/CTLD) superfamily. Members of this family share a common protein fold and have diverse functions, such as cell adhesion, cell-cell signalling, glycoprotein turnover, and roles in inflammation and immune response. The encoded type 2 transmembrane protein may function as a cell surface antigen. Two transcript variants encoding distinct isoforms have been identified for this gene.
2939	This gene encodes a member of the C-type lectin/C-type lectin-like domain (CTL/CTLD) superfamily. Members of this family share a common protein fold and have diverse functions, such as cell adhesion, cell-cell signalling, glycoprotein turnover, and roles in inflammation and immune response. The encoded type 2 transmembrane protein may function as a cell surface antigen. Two transcript variants encoding distinct isoforms have been identified for this gene.
2940	
2941	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S7E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
2942	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. This gene is the human ortholog of mouse Tbx21/Tbet gene. Studies in mouse show that Tbx21 protein is a Th1 cell-specific transcription factor that controls the expression of the hallmark Th1 cytokine, interferon-gamma (IFNG). Expression of the human ortholog also correlates with IFNG expression in Th1 and natural killer cells, suggesting a role for this gene in initiating Th1 lineage development from naive Th precursor cells.
2943	
2944	
2945	Bleomycin hydrolase (BMH) is a cytoplasmic cysteine peptidase that is highly conserved through evolution; however, the only known activity of the enzyme is metabolic inactivation of the glycopeptide bleomycin (BLM), an essential component of combination chemotherapy regimens for cancer. The protein contains the signature active site residues of the cysteine protease papain superfamily.
2946	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the high sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
2947	This gene encodes a member of the DEAD box protein family. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure, such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The protein encoded by this gene is a nuclear matrix protein. Its amino acid sequence is highly similar to the amino acid sequences of the translation initiation factors eIF4AI and eIF4AII, two other members of the DEAD box protein family.
2948	
2949	The DRG2 gene encodes the developmentally regulated GTP-binding protein 2, a name derived from the fact that it shares significant similarity to known GTP-binding proteins. DRG2 was identified because it is expressed in normal fibroblasts but not in SV40-transformed fibroblasts. Read-through transcripts containing this gene and a downstream gene have been identified, but they are not thought to encode a fusion protein. This gene is located within the Smith-Magenis syndrome region on chromosome 17.
2950	
2951	
2952	
2953	
2954	
2955	
2956	The protein encoded by this gene is a cytokine that belongs to the tumor necrosis factor (TNF) ligand family. This cytokine is a ligand for receptors TNFRSF13B/TACI, TNFRSF17/BCMA, and TNFRSF13C/BAFFR. This cytokine is expressed in B cell lineage cells, and acts as a potent B cell activator. It has been also shown to play an important role in the proliferation and differentiation of B cells.
2957	NSF and SNAPs are general elements of the cellular membrane transport apparatus. The sequence of the predicted 312-amino acid human protein encoded by NAPG is 95% identical to that of bovine gamma-SNAP. NAPG mediates platelet exocytosis and controls the membrane fusion events of this process.
2958	NSF and SNAPs are general elements of the cellular membrane transport apparatus. The sequence of the predicted 312-amino acid human protein encoded by NAPG is 95% identical to that of bovine gamma-SNAP. NAPG mediates platelet exocytosis and controls the membrane fusion events of this process.
2959	Proteins that contain a CXXC motif within their DNA-binding domain, such as CXXC1, recognize CpG sequences and regulate gene expression (Carlone and Skalnik, 2001).[supplied by OMIM]
2960	
2961	
2962	The protein encoded by this gene contains a type 1 thrombospondin domain, which is found in thrombospondin, a number of proteins involved in the complement pathway, as well as extracellular matrix proteins. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
2963	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. The encoded protein is linked with a ubiquitin-like protein, NEDD8, which can be conjugated to cellular proteins, such as Cdc53/culin.
2964	
2965	
2966	
2967	
2968	
2969	The elongation of primed DNA templates by DNA polymerase delta and DNA polymerase epsilon requires the accessory proteins proliferating cell nuclear antigen (PCNA) and replication factor C (RFC). RFC, also named activator 1, is a protein complex consisting of five distinct subunits of 140, 40, 38, 37, and 36 kD. This gene encodes the 38 kD subunit. This subunit is essential for the interaction between the 140 kD subunit and the core complex that consists of the 36, 37, and 40 kD subunits. Alternatively spliced transcript variants encoding distinct isoforms have been described.
2970	Transforming growth factor-beta, or TGFB (see TGFB1; MIM 190180), remains biologically latent after secretion, blocked from activity by its N-terminal propeptide (the latency-associated peptide, or LAP). TGFB is activated by its dissociation from LAP. The latent form of TGFB is secreted in association with a latent TGFB-binding protein, or LTBP (see LTBP1; MIM 150390), and assembled into the extracellular matrix. LTBPs are mainly composed of epidermal growth factor (EGF; MIM 131530)-like repeats and protein domains with 8-cysteine repeats, both of which are also found in fibrillins (e.g., MIM 134797). See Oklu and Hesketh (2000) for a review of the LTBP gene family.[supplied by OMIM]
2971	The protein encoded by this gene was identified in a yeast two-hybrid assay employing the second subunit of human replication protein A as bait. It is localized to the nucleus and its expression is significantly higher in cancer cell lines compared to normal cell lines. This protein has also been shown to be a strong transcriptional co-activator. Alternative splicing has been observed at this locus and two variants, both encoding the same protein, have been identified.
2972	
2973	
2974	
2975	
2976	Retinitis pigmentosa (RP) is a clinically and genetically heterogeneous group of retinal dystrophies characterized by a progressive degeneration of photoreceptors, eventually resulting in severe visual impairment. For further background and phenotypic information on RP, see MIM 268000.[supplied by OMIM]
2977	Retinitis pigmentosa (RP) is a clinically and genetically heterogeneous group of retinal dystrophies characterized by a progressive degeneration of photoreceptors, eventually resulting in severe visual impairment. For further background and phenotypic information on RP, see MIM 268000.[supplied by OMIM]
2978	
2979	This gene is a member of the regulatory factor X gene family, which encodes transcription factors that contain a highly-conserved winged helix DNA binding domain. The protein encoded by this gene is structurally related to regulatory factors X2, X3, X4, and X5. It is a transcriptional activator that can bind DNA as a monomer or as a heterodimer with RFX family members X2, X3, and X5, but not with X4. This protein binds to the X-boxes of MHC class II genes and is essential for their expression.  Also, it can bind to an inverted repeat that is required for expression of hepatitis B virus genes.
2980	This gene encodes a member of the peroxiredoxin family of antioxidant enzymes, which reduce hydrogen peroxide and alkyl hydroperoxides. The encoded protein may play an antioxidant protective role in cells, and may contribute to the antiviral activity of CD8(+) T-cells. This protein may have a proliferative effect and play a role in cancer development or progression. The crystal structure of this protein has been resolved to 2.7 angstroms. Three transcript variants encoding distinct isoforms have been identified for this gene.
2981	
2982	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein might couple to signal transduction pathways and possibly modulate cellular activation and adhesion in haemopoietic and neural tissue.
2983	Acid phosphatase 5 is an iron containing glycoprotein which catalyzes the conversion of orthophosphoric monoester to alcohol and orthophosphate.  ACP5 is the most basic of the acid phosphatases and is the only form not inhibited by L(+)-tartrate.
2984	
2985	Histidine-rich calcium-binding protein is a luminal sarcoplasmic reticulum protein of 165 kD identified by its ability to bind low-density lipoprotein with high affinity
2986	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L18E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
2987	
2988	
2989	
2990	This gene encodes a member of the forkhead class of DNA-binding proteins. These hepatocyte nuclear factors are transcriptional activators for liver-specific transcripts such as albumin and transthyretin, and they also interact with chromatin. Similar family members in mice have roles in the regulation of metabolism and in the differentiation of the pancreas and liver. The crystal structure of a similar protein in rat has been resolved.
2991	
2992	
2993	Leukocyte Ig-like receptors (LIRs) are a family of immunoreceptors expressed predominantly on monocytes and B cells and at lower levels on dendritic cells and natural killer (NK) cells. All LIRs in subfamily B have an inhibitory function (see, e.g., LILRB1, MIM 604811). LIRs in subfamily A, with short cytoplasmic domains lacking an immunoreceptor tyrosine-based inhibitory motif (ITIM) and with transmembrane regions containing a charged arginine residue, may initiate stimulatory cascades. One member of subfamily A (LILRA3; MIM 604818) lacks a transmembrane region and is presumed to be a soluble receptor.[supplied by OMIM]
2994	
2995	
2996	
2997	
2998	
2999	This gene encodes a protein that is expressed in the thyroid.
3000	
3001	
3002	
3003	
3004	
3005	
3006	
3007	
3008	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. The encoded protein is located in the nucleus during interphase and is distinctly redistributed during mitosis. This protein has a cell cycle-dependent interaction with the RII subunit of PKA.
3009	
3010	
3011	
3012	
3013	
3014	This gene belongs to the ADAMTS (A Disintegrin And Metalloproteinase domain, with ThromboSpondin type-1 modules) family of zinc-dependent proteases. ADAMTS proteases are complex secreted enzymes containing a prometalloprotease domain of the reprolysin type attached to an ancillary domain with a highly conserved structure that includes at least one thrombospondin type 1 repeat. They have been demonstrated to have important roles in connective tissue organization, coagulation, inflammation, arthritis, angiogenesis and cell migration. The product of this gene plays a major role in growth and in skin, lens, and heart development. It is also a candidate gene for autosomal recessive Weill-Marchesani syndrome.
3015	
3016	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S28E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
3017	
3018	This gene is a member of a group of genes whose transcript levels are increased following stressful growth arrest conditions and treatment with DNA-damaging agents. The genes in this group respond to environmental stresses by mediating activation of the p38/JNK pathway.  This activation is mediated via their proteins binding and activating MTK1/MEKK4 kinase, which is an upstream activator of both p38 and JNK MAPKs. The function of these genes or their protein products is involved in the regulation of growth and apoptosis. These genes are regulated by different mechanisms, but they are often coordinately expressed and can function cooperatively in inhibiting cell growth.
3019	
3020	
3021	
3022	The SMADIP1 gene (also known as SIP1) is a member of the delta-EF1 (TCF8; MIM 189909)/Zfh1 family of 2-handed zinc finger/homeodomain proteins. SMADIP1 interacts with receptor-mediated, activated full-length SMADs (see MIM 605568).[supplied by OMIM]
3023	
3024	The protein encoded by this gene is a protein serine/threonine phosphatase of the PP2C family. This protein can interact with integrin-linked kinase (ILK/ILK1), a regulator of integrin mediated signaling, and regulate the kinase activity of ILK. Through the interaction with ILK, this protein may selectively affect the signaling process of ILK-mediated glycogen synthase kinase 3 beta (GSK3beta), and thus participate in Wnt signaling pathway. Alternatively spliced transcript variants encoding distinct isoforms have been described.
3025	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Gamma-crystallins are a homogeneous group of highly symmetrical, monomeric proteins typically lacking connecting peptides and terminal extensions. They are differentially regulated after early development. Four gamma-crystallin genes (gamma-A through gamma-D) and three pseudogenes (gamma-E, gamma-F, gamma-G) are tandemly organized in a genomic segment as a gene cluster. Whether due to aging or mutations in specific genes, gamma-crystallins have been involved in cataract formation.
3026	This gene encodes a protein with a BIR (baculoviral inhibition of apoptosis protein repeat) domain and a UBCc (ubiquitin-conjugating enzyme E2, catalytic) domain. This protein inhibits apoptosis by facilitating the degradation of apoptotic proteins by ubiquitination.
3027	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds the immunosuppressants FK506 and rapamycin. It is highly similar to the FK506-binding protein 1A. Its physiological role is thought to be in excitation-contraction coupling in cardiac muscle. There are two alternatively spliced transcript variants of this gene encoding different isoforms.
3028	
3029	
3030	The protein encoded by this gene is an aminopeptidase which prefers acidic amino acids, and specifically favors aspartic acid over glutamic acid. It is thought to be a cytosolic protein involved in general metabolism of intracellular proteins.
3031	
3032	
3033	
3034	
3035	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L37AE family of ribosomal proteins. It is located in the cytoplasm. The protein contains a C4-type zinc finger-like domain. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
3036	
3037	
3038	
3039	
3040	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily F. This gene is intronless and expressed in all tissues tested, including the heart, skeletal muscle, brain, kidney, and pancreas.
3041	
3042	
3043	High density lipoprotein-binding protein, also known as vigilin, is a 110-kD protein that specifically binds HDL molecules and may function in the removal of excess cellular cholesterol.[supplied by OMIM]
3044	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is one of two V1 domain B subunit isoforms and is found in the kidney. Mutations in this gene cause distal renal tubular acidosis associated with sensorineural deafness.
3045	
3046	This gene encodes the largest subunit of dynactin, a macromolecular complex consisting of 10-11 subunits ranging in size from 22 to 150 kD. Dynactin binds to both microtubules and cytoplasmic dynein. It is involved in a diverse array of cellular functions, including ER-to-Golgi transport, the centripetal movement of lysosomes and endosomes, spindle formation, chromosome movement, nuclear positioning, and axonogenesis. This subunit interacts with dynein intermediate chain by its domains directly binding to dynein. Alternative splicing of this gene results in at least 2 functionally distinct isoforms: a ubiquitously expressed one and a brain-specific one. Based on its cytogenetic location, this gene is considered as a candidate gene for limb-girdle muscular dystrophy.
3047	RHD11, a member of the short-chain dehydrogenase/reductase (SDR) superfamily of oxidoreductases, is expressed at high levels in prostate epithelium, and its expression is regulated by androgens.[supplied by OMIM]
3048	Members of the Rho family of Ras-related GTPases, such as ARHE, regulate the organization of the actin cytoskeleton in response to extracellular growth factors. Like Ras (MIM 190020), Rho family members appear to cycle between an inactive GDP-bound form and an active GTP-bound form. Three major regulators of Rho activity have been identified: RhoGDIs, which interact with the GDP-bound Rho proteins to keep them in a resting complex (see MIM 601925); GEFs, which promote GDP/GTP exchange leading to activation of Rho proteins (see MIM 601855); and GAPs, which stimulate GTP hydrolysis and return the activated Rho protein to its inactive form (see MIM 602680) (Nobes et al., 1998 [PubMed 9531558]).[supplied by OMIM]
3049	
3050	
3051	
3052	
3053	
3054	
3055	
3056	
3057	
3058	
3059	
3060	
3061	
3062	
3063	
3064	
3065	
3066	
3067	
3068	This gene encodes a disintegrin and metalloprotease (ADAM) domain 17, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. The member encoded by this gene functions as a tumor necrosis factor-alpha converting enzyme; binds mitotic arrest deficient 2 protein; and also plays a prominent role in the activation of the Notch signaling pathway. This gene contains 19 exons and alternative splicing generates 2 transcripts.
3069	
3070	The protein encoded by this gene belongs to the inhibitor of DNA binding (ID) family, members of which are transcriptional regulators that contain a helix-loop-helix (HLH) domain but not a basic domain. Members of the ID family inhibit the functions of basic helix-loop-helix transcription factors in a dominant-negative manner by suppressing their heterodimerization partners through the HLH domains. This protein may play a role in negatively regulating cell differentiation. A pseudogene has been identified for this gene.
3071	
3072	The protein encoded by this gene is an integral membrane protein involved in the acute-phase immunologic response to gram-negative bacterial infections. Gram-negative bacteria contain a glycolipid, lipopolysaccharide (LPS), on their outer cell wall. Together with bactericidal permeability-increasing protein (BPI), the encoded protein binds LPS and interacts with the CD14 receptor, probably playing a role in regulating LPS-dependent monocyte responses. Studies in mice suggest that the encoded protein is necessary for the rapid acute-phase response to LPS but not for the clearance of LPS from circulation. This protein is part of a family of structurally and functionally related proteins, including BPI, plasma cholesteryl ester transfer protein (CETP), and phospholipid transfer protein (PLTP). Finally, this gene is found on chromosome 20, immediately downstream of the BPI gene.
3073	The protein encoded by this gene, a member of the MYB family of transcription factor genes, is a nuclear protein involved in cell cycle progression. The encoded protein is phosphorylated by cyclin A/cyclin-dependent kinase 2 during the S-phase of the cell cycle and possesses both activator and repressor activities. It has been shown to activate the cell division cycle 2, cyclin D1, and insulin-like growth factor-binding protein 5 genes. Transcript variants may exist for this gene, but their full-length natures have not been determined.
3074	The protein encoded by this gene is a peroxisomal thioesterase that appears to be involved more in the oxidation of fatty acids rather than in their formation. The encoded protein can bind to the human immunodeficiency virus-1 protein Nef, and mediate Nef-induced down-regulation of CD4 in T-cells. Multiple transcript variants encoding several different isoforms have been found for this gene.
3075	The protein encoded by this intronless gene is a bZIP transcription factor which can bind as a homodimer to certain DNA regulatory regions. It can also form heterodimers with the related proteins CEBP-alpha, CEBP-delta, and CEBP-gamma. The encoded protein is important in the regulation of genes involved in immune and inflammatory responses and has been shown to bind to the IL-1 response element in the IL-6 gene, as well as to regulatory regions of several acute-phase and cytokine genes. In addition, the encoded protein can bind the promoter and upstream element and stimulate the expression of the collagen type I gene.
3076	
3077	
3078	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds the immunosuppressants FK506 and rapamycin. It interacts with several intracellular signal transduction proteins including type I TGF-beta receptor. It also interacts with multiple intracellular calcium release channels including the tetrameric skeletal muscle ryanodine receptor. In mouse, deletion of this homologous gene causes congenital heart disorder known as noncompaction of left ventricular myocardium. There is evidence of multiple alternatively spliced transcript variants for this gene, but the full length nature of some variants has not been determined.
3079	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains an extracellular domain, a single transmembrane segment and two tandem intracytoplasmic catalytic domains, and thus represents a receptor-type PTP. This PTP has been shown to dephosphorylate and activate Src family tyrosine kinases, and is implicated in the regulation of integrin signaling, cell adhesion and proliferation. Three alternatively spliced variants of this gene, which encode two distinct isoforms, have been reported.
3080	This gene is found on chromosome 20, approximately 20 kbp downstream of the gene encoding cellular prion protein, to which it is biochemically and structurally similar. The protein encoded by this gene is a membrane glycosylphosphatidylinositol-anchored glycoprotein that is found predominantly in testis. Mutations in this gene may lead to neurological disorders.
3081	BMP2 belongs to the transforming growth factor-beta (TGFB) superfamily.  Bone morphogenic protein induces bone formation. BMP2 is a candidate gene for the autosomal dominant disease of fibrodysplasia (myositis) ossificans progressiva.
3082	
3083	
3084	
3085	This gene is the human homolog of murine plunc, and like the mouse gene, is specifically expressed in the upper airways and nasopharyngeal regions. The exact biological function of this gene is not known, however, it has been suggested to be involved in inflammatory responses to irritants in the upper airways. It may also serve as a potential molecular marker for detection of micrometastasis in non-small-cell lung cancer. Multiple transcript variants resulting from alternative splicing in the 3@#$%& UTR have been detected, but the full-length nature of only two is known.
3086	
3087	
3088	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. The encoded protein is highly expressed in various brain regions and cardiac and skeletal muscle. It is specifically localized to the sarcoplasmic reticulum and nuclear membrane, and is involved in anchoring PKA to the nuclear membrane or sarcoplasmic reticulum.
3089	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Two transcript variants encoding distinct isoforms have been described. A pseudogene corresponding to this gene is found on chromosome 5q.
3090	
3091	The protein encoded by this gene is a component of the F-type ATPase found in the mitochondrial matrix. F-type ATPases are composed of a catalytic core and a membrane proton channel. The encoded protein appears to be part of the connector linking these two components and may be involved in transmission of conformational changes or proton conductance.
3092	This gene is involved in the first step in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is a glycolipid found on many blood cells and serves to anchor proteins to the cell surface. This gene encodes a N-acetylglucosaminyl transferase component that is part of the complex that catalyzes transfer of N-acetylglucosamine (GlcNAc) from UDP-GlcNAc to phosphatidylinositol (PI). This gene is located in the Down Syndrome critical region (DSCR) on chromosome 21 and is a candidate for the pathogenesis of Down syndrome. At least three transcript variants have been described for this gene.
3093	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. The encoded protein is highly expressed in various brain regions and cardiac and skeletal muscle. It is specifically localized to the sarcoplasmic reticulum and nuclear membrane, and is involved in anchoring PKA to the nuclear membrane or sarcoplasmic reticulum.
3094	HSF2 binding protein (HSF2BP) associates with HSF2. The interaction occurs between the trimerization domain of HSF2 and the amino terminal hydrophilic region of HSF2BP that comprises two leucine zipper motifs. HSF2BP may therefore be involved in modulating HSF2 activation.
3095	
3096	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21; however, this gene is located at 21q22.3. This protein may function in Neurite extension, proliferation of melanoma cells, stimulation of Ca2+ fluxes, inhibition of PKC-mediated phosphorylation, astrocytosis and axonal proliferation, and inhibition of microtubule assembly. Chromosomal rearrangements and altered expression of this gene have been implicated in several neurological, neoplastic, and other types of diseases, including Alzheimer@#$%&s disease, Down@#$%&s syndrome, epilepsy, amyotrophic lateral sclerosis, melanoma, and type I diabetes.
3097	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This gene is excluded as a candidate for a form of nonsyndromic deafness (DFNB10), but is still a candidate for other disorders mapped to 21q22.3 as well as for the development of Down syndrome phenotypes. Two transcript variants encoding the same protein have been found for this gene.
3098	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is one of two V1 domain E subunit proteins and is found ubiquitously. Pseudogenes for this gene have been found in the genome.
3099	Clathrin heavy chain is the main structural protein of the polyhedral lattice surrounding coated pits and coated vesicles, organelles that are involved in intracellular transport of membrane-associated components. CLTCL1 is ubiquitously expressed in fetal tissue, but expressed only in certain adult tissues, particularly skeletal muscle, indicating that certain specialized functions of clathrins may lie at the level of gene diversity. Two isoforms of CLTCL which differ in the presence of one alternatively spliced exon have been isolated. The additional 56 amino acids in trancript variant 2 overlap the alpha-helical coiled-coil domain involved in light chain interactions and are immediately adjacent to the trimerization domain.
3100	This gene, the result of a duplication at this locus, is one of two functional genes encoding nearly identical proteins that have similar expression patterns. The product of this gene is a protein that shares homology with the Drosophila gonadal protein, expressed in gonadal tissues and germ cells, and with the human laminin gamma-1 chain that functions in cell attachment and migration. This gene is located in a region of chromosome 22 implicated in the DiGeorge syndrome, one facet of a broader collection of anomalies referred to as the CATCH 22 syndrome.
3101	
3102	
3103	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Beta-crystallins, the most heterogeneous, differ by the presence of the C-terminal extension (present in the basic group, none in the acidic group). Beta-crystallins form aggregates of different sizes and are able to self-associate to form dimers or to form heterodimers with other beta-crystallins. This gene, a beta basic group member, undergoes extensive cleavage at its N-terminal extension during lens maturation. It is also a member of a gene cluster with beta-A4, beta-B2, and beta-B3.
3104	
3105	
3106	This gene encodes a Ras-related protein that enriched in striatum. The product of this gene binds to GTP and possesses intrinsic GTPase activity. The gene belongs to the Ras superfamily of small GTPases. The exact function of this gene is unknown, but most striatum-specific mRNAs characterized to date encode components of signal transduction cascades.
3107	
3108	
3109	
3110	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional activator after forming a protein complex with other proteins. This protein acts as a nucleocytoplasmic shuttle protein and is important for neural crest and peripheral nervous system development. Mutations in this gene are associated with Waardenburg-Shah and Waardenburg-Hirschsprung disease.
3111	This gene is a member of the cytidine deaminase gene family. It is one of seven related genes or pseudogenes found in a cluster, thought to result from gene duplication, on chromosome 22. Members of the cluster encode proteins that are structurally and functionally related to the C to U RNA-editing cytidine deaminase APOBEC1. It is thought that the proteins may be RNA editing enzymes and have roles in growth or cell cycle control. Alternatively spliced transcript variants encoding different isoforms have been identified.
3112	This gene encodes a member of the GRB2/Sem5/Drk family. This member is an adaptor-like protein involved in leukocyte-specific protein-tyrosine kinase signaling. Like its related family member, GRB2-related adaptor protein (GRAP), this protein contains an SH2 domain flanked by two SH3 domains. This protein interacts with other proteins, such as GRB2-associated binding protein 1 (GAB1) and the SLP-76 leukocyte protein (LCP2), through its SH3 domains. Transcript variants utilizing alternative polyA sites exist.
3113	
3114	
3115	
3116	
3117	
3118	
3119	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form complex with G proteins and stimulate Rho-dependent signals. This protein is similar to the NET1A protein.
3120	
3121	
3122	The protein encoded by this gene belongs to the family of P-type cation transport ATPases. This magnesium-dependent enzyme catalyzes the hydrolysis of ATP coupled with the transport of the calcium. Defects in this gene cause Hailey-Hailey disease, an autosomal dominant disorder. Alternatively spliced transcript variants encoding different isoforms have been identified.
3123	
3124	The glycosylphosphatidylinositol (GPI) anchor is a glycolipid found on many blood cells that serves to anchor proteins to the cell surface. This gene encodes a protein thought to be involved in GPI anchor biosynthesis. The yeast homolog is a member of a family of dolichol-phosphate-mannose (Dol-P-Man)-dependent mannosyltransferases that is required for the addition of the side-branching fourth mannose during the assembly of GPI anchors.
3125	
3126	
3127	This gene encodes an extracellular matrix protein, which belongs to the fibulin family. This protein binds various extracellular ligands and calcium. It may play a role during organ development, in particular, during the differentiation of heart, skeletal and neuronal structures. Alternatively spliced transcript variants encoding different isoforms have been identified.
3128	This gene was identified based on homology to Pichia pastoris GSA7 and Saccharomyces cerevisiae APG7. In the yeast, the protein appears to be required for fusion of peroxisomal and vaculuolar membranes. The protein shows homlogy to the ATP-binding and catalytic sites of the E1 ubiquitin activating enzymes.
3129	
3130	The protein encoded by this gene is a member of the immunoglobulin superfamily. It is a glycosylphosphatidylinositol (GPI)-anchored neuronal membrane protein that functions as a cell adhesion molecule. It may play a role in the formation of axon connections in the developing nervous system. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
3131	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play distinct roles in cells. The protein encoded by this gene is one of the PKC family members. Studies both in human and mice demonstrate that this kinase is involved in B cell signaling and in the regulation of growth, apoptosis, and differentiation of a variety of cell types. Alternatively spliced transcript variants encoding the same protein have been observed.
3132	This gene encodes the alpha chain of type VII collagen. The type VII collagen fibril, composed of three identical alpha collagen chains, is restricted to the basement zone beneath stratified squamous epithelia. It functions as an anchoring fibril between the external epithelia and the underlying stroma. Mutations in this gene are associated with all forms of dystrophic epidermolysis bullosa. In the absence of mutations, however, an acquired form of this disease can result from an autoimmune response made to type VII collagen.
3133	
3134	
3135	This gene encodes a member of the organic-cation transporter family. It is located in a gene cluster with another member of the family, organic cation transporter like 4. The encoded protein is a transmembrane protein which is thought to transport small molecules and since this protein is conserved among several species, it is suggested to have a fundamental role in mammalian systems.
3136	The protein encoded by this gene is a member of the fetuin family, part of the cystatin superfamily of cysteine protease inhibitors. Fetuins have been implicated in several diverse functions, including osteogenesis and bone resorption, regulation of the insulin and hepatocyte growth factor receptors, and response to systemic inflammation. This protein may be secreted by cells.
3137	
3138	
3139	
3140	
3141	The encoded protein is required for the formation of O-acetylated (Ac) gangliosides. It is predicted to contain 6 to 10 transmembrane domains, and a leucine zipper motif in transmembrane domain III. Studies indicate that the protein is localized to the cytoplasm.
3142	
3143	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. The protein encoded by this gene and SEC63 protein are found to be associated with ribosome-free SEC61 complex. It is speculated that Sec61-Sec62-Sec63 may perform post-translational protein translocation into the ER. The Sec61-Sec62-Sec63 complex might also perform the backward transport of ER proteins that are subject to the ubiquitin-proteasome-dependent degradation pathway. The encoded protein is an integral membrane protein located in the rough ER.
3144	
3145	The protein encoded by this gene is one of the eight subunits of COP9 signalosome, a highly conserved protein complex that functions as an important regulator in multiple signaling pathways. The structure and function of COP9 signalosome is similar to that of the 19S regulatory particle of 26S proteasome. COP9 signalosome has been shown to interact with SCF-type E3 ubiquitin ligases and act as a positive regulator of E3 ubiquitin ligases. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
3146	The translocase of outer mitochondrial membrane (TOM) complex is a multisubunit complex involved in the recognition, unfolding, and translocation of preproteins into the mitochondria. See TIM17A (MIM 605057).[supplied by OMIM]
3147	
3148	
3149	This gene encodes the second largest subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. This subunit, in combination with at least two other polymerase subunits, forms a structure within the polymerase that maintains contact in the active site of the enzyme between the DNA template and the newly synthesized RNA.
3150	
3151	
3152	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation. Multiple alternatively spliced transcript variants have been described for this gene but their full length sequences are not known.
3153	The protein encoded by this gene is highly similar to Nocturnin, a gene identified as a circadian clock regulated gene in Xenopus laevis. This protein and Nocturnin protein share similarity with the C-terminal domain of a yeast transcription factor, carbon catabolite repression 4 (CCR4). The mRNA abundance of a similar gene in mouse has been shown to exhibit circadian rhythmicity, which suggests a role for this protein in clock function or as a circadian clock effector.
3154	
3155	
3156	
3157	This gene encodes a protein with ankyrin repeats, which are associated with protein-protein interactions. Studies in mice suggest that this protein is involved in liver development. Two transcript variants encoding different isoforms have been found for this gene.
3158	This gene encodes GalNAc transferase 7, a member of the GalNAc-transferase family. The enzyme encoded by this gene controls the initiation step of mucin-type O-linked protein glycosylation and transfer of N-acetylgalactosamine to serine and threonine amino acid residues. This enzyme is a type II transmembrane protein and shares common sequence motifs with other family members. Unlike other family members, this enzyme shows exclusive specificity for partially GalNAc-glycosylated acceptor substrates and shows no activity with non-glycosylated peptides. This protein may function as a follow-up enzyme in the initiation step of O-glycosylation.
3159	
3160	Human airway trypsin-like protease (HAT) is a trypsin-like serine protease released from the submucosal serous glands onto mucous membrane. It is a type II integral membrane protein and has 29-38% identity in the sequence of the catalytic region with human hepsin, enteropeptidase, acrosin, and mast cell tryptase. The noncatalytic region had little similarity to other known proteins. HAT may play some biological role in the host defense system on the mucous membrane independently of or in cooperation with other substances in airway mucous or bronchial secretions.
3161	
3162	
3163	Three-prime-phosphoadenosine 5-prime-phosphosulfate (PAPS) is the sulfate donor cosubstrate for all sulfotransferase (SULT) enzymes (Xu et al., 2000). SULTs catalyze the sulfate conjugation of many endogenous and exogenous compounds, including drugs and other xenobiotics. In humans, PAPS is synthesized from adenosine 5-prime triphosphate (ATP) and inorganic sulfate by 2 isoforms, PAPSS1 and PAPSS2 (MIM 603005).[supplied by OMIM]
3164	
3165	
3166	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S3AE family of ribosomal proteins. It is located in the cytoplasm. Disruption of the gene encoding rat ribosomal protein S3a, also named v-fos transformation effector protein, in v-fos-transformed rat cells results in reversion of the transformed phenotype. Transcript variants utilizing alternative transcription start sites have been described. This gene is co-transcribed with the U73A and U73B small nucleolar RNA genes, which are located in its fourth and third introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
3167	This gene encodes a disintegrin and metalloproteinase with thrombospondin (ADAMTS) motifs-3, which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The protein encoded by this gene is the major procollagen II N-propeptidase. A deficiency of this protein may be responsible for dermatosparaxis, a genetic defect of connective tissues.
3168	Unlike most eukaryotic kinases, alpha kinases, such as LAK, recognize phosphorylation sites in which the surrounding peptides have an alpha-helical conformation.[supplied by OMIM]
3169	The protein encoded by this gene is a major structural protein of the bone matrix. It constitutes approximately 12% of the noncollagenous proteins in human bone and is synthesized by skeletal-associated cell types, including hypertrophic chondrocytes, osteoblasts, osteocytes, and osteoclasts. The only extraskeletal site of its synthesis is the trophoblast. This protein binds to calcium and hydroxyapatite via its acidic amino acid clusters, and mediates cell attachment through an RGD sequence that recognizes the vitronectin receptor.
3170	
3171	
3172	
3173	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
3174	
3175	The protein encoded by this gene is a component of the mammalian counterpart of the yeast Exocyst complex, which is essential for post-Golgi traffic.
3176	
3177	
3178	
3179	The protein encoded by this gene is highly similar to the gene product of Schizosaccharomyces pombe rad1, a cell cycle checkpoint gene required for cell cycle arrest and DNA damage repair in response to DNA damage. This protein has been shown to associate with RAD9 and HUS1 proteins, and forms a cell cycle checkpoint complex. The complex can be recruited by checkpoint protein RAD17 to the sites of DNA damage, which is thought to be important for triggering the checkpoint-signaling cascade. Three alternatively spliced transcript variants, which encode two distinct proteins, have been reported.
3180	
3181	Coagulation factor II (thrombin) receptor-like 2 (F2RL2) is a member of the large family of 7-transmembrane-region receptors that couple to guanosine-nucleotide-binding proteins. F2RL2 is also a member of the protease-activated receptor family and activated by thrombin. F2RL2 is activated by proteolytic cleavage of its extracellular amino terminus. The new amino terminus functions as a tethered ligand and activates the receptor. F2RL2 is a cofactor for F2RL3 activation by thrombin. It mediates thrombin-triggered phosphoinositide hydrolysis and is expressed in a variety of tissues.
3182	
3183	
3184	Mitochondrial creatine kinase (MtCK) is responsible for the transfer of high energy phosphate from mitochondria to the cytosolic carrier, creatine. It belongs to the creatine kinase isoenzyme family. It exists as two isoenzymes, sarcomeric MtCK and ubiquitous MtCK, encoded by separate genes. Mitochondrial creatine kinase occurs in two different oligomeric forms: dimers and octamers, in contrast to the exclusively dimeric cytosolic creatine kinase isoenzymes. Sarcomeric mitochondrial creatine kinase has 80% homology with the coding exons of ubiquitous mitochondrial creatine kinase. This gene contains sequences homologous to several motifs that are shared among some nuclear genes encoding mitochondrial proteins and thus may be essential for the coordinated activation of these genes during mitochondrial biogenesis.
3185	
3186	SEPT8 is a member of the highly conserved septin family. Septins are 40- to 60-kD GTPases that assemble as filamentous scaffolds. They are involved in the organization of submembranous structures, in neuronal polarity, and in vesicle trafficking (Blaser et al., 2003).[supplied by OMIM]
3187	
3188	
3189	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains several tandem leucine-rich repeats.
3190	This gene encodes a secreted protein which is mainly expressed in the endothelial cells in human lung and kidney tissues. The expression of this gene is regulated by cytokines, suggesting that it may play a role in endothelium-dependent pathological disorders. The transcript contains multiple polyadenylation and mRNA instability signals.
3191	This gene encodes a type II classical cadherin from the cadherin superfamily of integral membrane proteins that mediate calcium-dependent cell-cell adhesion. Mature cadherin proteins are composed of a large N-terminal extracellular domain, a single membrane-spanning domain, and a small, highly conserved C-terminal cytoplasmic domain. Type II (atypical) cadherins are defined based on their lack of a HAV cell adhesion recognition sequence specific to type I cadherins. This particular cadherin appears to be expressed specifically in the brain and its temporal pattern of expression would be consistent with a role during a critical period of neuronal development, perhaps specifically during synaptogenesis.
3192	The protein encoded by this gene is a small glycolipid transport protein which acts as a substrate specific co-factor for the lysosomal enzyme beta-hexosaminidase A. Beta-hexosaminidase A, together with GM2 ganglioside activator, catalyzes the degradation of the ganglioside GM2, and other molecules containing terminal N-acetyl hexosamines. Mutations in this gene result in GM2-gangliosidosis type AB or the AB variant of Tay-Sachs disease.
3193	The protein encoded by this gene is a small glycolipid transport protein which acts as a substrate specific co-factor for the lysosomal enzyme beta-hexosaminidase A. Beta-hexosaminidase A, together with GM2 ganglioside activator, catalyzes the degradation of the ganglioside GM2, and other molecules containing terminal N-acetyl hexosamines. Mutations in this gene result in GM2-gangliosidosis type AB or the AB variant of Tay-Sachs disease.
3194	
3195	
3196	
3197	
3198	
3199	
3200	
3201	
3202	
3203	In addition to regulating thyroid-specific expression of genes, thyroid transcription factor (TITF1; MIM 600635) controls the transcription of genes specifically expressed in lung, such as surfactant proteins (e.g., SFTPA1; MIM 178630) and uteroglobin (UGB; MIM 192020). Mice lacking Ttf1 die immediately after birth from respiratory failure caused by profoundly hypoplastic lungs (Kimura et al., 1996). The UGRP1 gene encodes a uteroglobin-related protein and is a downstream target of TITF1.[supplied by OMIM]
3204	The protein encoded by this gene interacts with thyroid hormone receptor in a ligand-dependent manner and enhances thyroid hormone-dependent activation from thyroid response elements. This protein contains a bromodomain and is thought to be a nuclear receptor coactivator. Three alternatively spliced transcript variants that encode distinct isoforms have been identified.
3205	This gene encodes beta-2-adrenergic receptor which is a member of the G protein-coupled receptor superfamily. This receptor is directly associated with one of its ultimate effectors, the class C L-type calcium channel Ca(V)1.2.  This receptor-channel complex also contains a G protein, an adenylyl cyclase, cAMP-dependent kinase, and the counterbalancing phosphatase, PP2A. The assembly of the signaling complex provides a mechanism that ensures specific and rapid signaling by this G protein-coupled receptor. This gene contains no introns in either its coding or untranslated sequences. Different polymorphic forms, point mutations, and/or downregulation of this gene are associated with nocturnal asthma, obesity and type 2 diabetes.
3206	
3207	
3208	
3209	
3210	
3211	
3212	The PLA2G7 gene encodes platelet-activating factor (PAF) acetylhydrolase (EC 3.1.1.47), a secreted enzyme that catalyzes the degradation of PAF to inactive products by hydrolysis of the acetyl group at the sn-2 position, producing the biologically inactive products LYSO-PAF and acetate.[supplied by OMIM]
3213	Glutamate-cysteine ligase, also known as gamma-glutamylcysteine synthetase is the first rate limiting enzyme of glutathione synthesis. The enzyme consists of two subunits, a heavy catalytic subunit and a light regulatory subunit. The gene encoding the catalytic subunit encodes a protein of 367 amino acids with a calculated molecular weight of 72.773 kDa and maps to chromosome 6. The regulatory subunit is derived from a different gene located on chromosome 1p22-p21. Deficiency of gamma-glutamylcysteine synthetase in human is associated with enzymopathic hemolytic anemia.
3214	This gene encodes a largely hydrophilic protein postulated to be a leucine-rich protein family member. A t(6;8)(q27;p11) chromosomal translocation, fusing this gene and the fibroblast growth factor receptor 1 (FGFR1) gene, has been found in cases of myeloproliferative disorder. The resulting chimeric protein contains the N-terminal leucine-rich region of this encoded protein fused to the catalytic domain of FGFR1. This gene is thought to play an important role in normal proliferation and differentiation of the erythroid lineage. Alternatively spliced transcript variants that encode different proteins have been identified.
3215	This gene encodes a nuclear protein expressed in a variety of tissues. The rat homolog, Rp8, is transiently expressed in immature thymocytes and is thought to be involved in programmed cell death. Expression of the human gene has been shown to be repressed by BCL6, a transcriptional repressor required for lymph node germinal center development, suggesting that BCL6 regulates apoptosis by its effects on PDCD2. This gene is closely linked on chromosome 6 to the gene for TBP, the TATA binding protein. Two transcripts encoding different proteins have been identified.
3216	
3217	This gene encodes a nuclear protein expressed in a variety of tissues. The rat homolog, Rp8, is transiently expressed in immature thymocytes and is thought to be involved in programmed cell death. Expression of the human gene has been shown to be repressed by BCL6, a transcriptional repressor required for lymph node germinal center development, suggesting that BCL6 regulates apoptosis by its effects on PDCD2. This gene is closely linked on chromosome 6 to the gene for TBP, the TATA binding protein. Two transcripts encoding different proteins have been identified.
3218	
3219	Bystin is expressed as a 2-kb major transcript and a 3.6-kb minor transcript in SNG-M cells and in human trophoblastic teratocarcinoma HT-H cells. Protein binding assays determined that bystin binds directly to trophinin and tastin, and that binding is enhanced when cytokeratins 8 and 18 are present. Immunocytochemistry of HT-H cells showed that bystin colocalizes with trophinin, tastin, and the cytokeratins, suggesting that these molecules form a complex in trophectoderm cells at the time of implantation. Using immunohistochemistry it was determined that trophinin and bystin are found in the placenta from the sixth week of pregnancy. Both proteins were localized in the cytoplasm of the syncytiotrophoblast in the chorionic villi and in endometrial decidual cells at the uteroplacental interface. After week 10, the levels of trophinin, tastin, and bystin decreased and then disappeared from placental villi.
3220	
3221	
3222	
3223	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene encodes a member of the histone H2B family. This gene is found in the histone microcluster on chromosome 6p21.33.
3224	
3225	
3226	HLA-F belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-F binds a restricted subset of peptides derived from the leader peptides of other class I molecules. The heavy chain is approximately 42 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region and exons 6 and 7 encode the cytoplasmic tail. The HLA-F gene exhibits few polymorphisms.
3227	The GNL1 gene, identified in the human major histocompatibility complex class I region, shows a high degree of similarity with its mouse counterpart. The GNL1 gene is located less than 2 kb centromeric to HLA-E, in the same transcriptional orientation. GNL1 is telomeric to HLA-B and HLA-C.
3228	This gene encodes a divergent member of the I-kappa-B family of proteins. Its function has not been determined. The gene lies within the major histocompatibility complex (MHC) class I region on chromosome 6.
3229	
3230	HLA-DRB1 belongs to the HLA class II beta chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DRA) and a beta chain (DRB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa. It is encoded by 6 exons, exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DR molecule the beta chain contains all the polymorphisms specifying the peptide binding specificities. Hundreds of DRB1 alleles have been described and typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. DRB1 is expressed at a level five times higher than its paralogues DRB3, DRB4 and DRB5. DRB1 is present in all individuals. Allelic variants of DRB1 are linked with either none or one of the genes DRB3, DRB4 and DRB5. There are 4 related pseudogenes: DRB2, DRB6, DRB7, DRB8 and DRB9.
3231	
3232	
3233	
3234	The protein encoded by this gene is a member of the MAP kinase family. MAP kinases act as an integration point for multiple biochemical signals, and are involved in a wide variety of cellular processes such as proliferation, differentiation, transcription regulation and development. This kinase is closely related to p38 MAP kinase, both of which can be activated by proinflammatory cytokines and cellular stress. MAP kinase kinases 3, and 6 can phosphorylate and activate this kinase. Transcription factor ATF2, and microtubule dynamics regulator stathmin have been shown to be the substrates of this kinase.
3235	
3236	The protein encoded by this gene is a member of the cyclin family of proteins. The encoded protein interacts with cyclin-dependent kinase 8 and induces the phophorylation of the carboxy-terminal domain of the large subunit of RNA polymerase II. The level of mRNAs for this gene peaks in the G1 phase of the cell cycle. Two transcript variants encoding different isoforms have been found for this gene.
3237	
3238	
3239	
3240	
3241	The protein encoded by this gene is a receptor for interleukin 20 (IL20), a cytokine that may be involved in epidermal function. The receptor of IL20 is a heterodimeric receptor complex consisting of this protein and interleukin 20 receptor beta (IL20B). This gene and IL20B are highly expressed in skin. The expression of both genes is found to be upregulated in Psoriasis.
3242	The protein encoded by this gene is a member of the cAMP-dependent protein kinase inhibitor family. Studies of a similar protein in rat suggest that this protein may interact with the catalytic subunit of cAMP-dependent protein kinase and act as a competitive inhibitor. At least three alternatively spliced transcript variants encoding the same protein have been reported.
3243	
3244	Polyspecific organic cation transporters in the liver, kidney, intestine, and other organs are critical for elimination of many endogenous small organic cations as well as a wide array of drugs and environmental toxins. This gene is one of three similar cation transporter genes located in a cluster on chromosome 6. The encoded protein contains twelve putative transmembrane domains and is a plasma integral membrane protein. Two transcript variants encoding two different isoforms have been found for this gene, but only the longer variant encodes a functional transporter.
3245	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. The encoded protein sequence is 100% identical to the mouse homolog and 98% identical to the frog and zebrafish homologs. Two alternatively spliced transcript variants have been found for this gene and they encode distinct isoforms.
3246	
3247	The muscarinic cholinergic receptors belong to a larger family of G protein-coupled receptors. The functional diversity of these receptors is defined by the binding of acetylcholine to these receptors and includes cellular responses such as adenylate cyclase inhibition, phosphoinositide degeneration, and potassium channel mediation. Muscarinic receptors influence many effects of acetylcholine in the central and peripheral nervous system. The muscarinic cholinergic receptor 2 is involved in mediation of bradycardia and a decrease in cardiac contractility. Multiple alternatively spliced transcript variants have been described for this gene.
3248	
3249	
3250	
3251	
3252	
3253	The protein encoded by this gene is a member of a family of actin-related proteins (ARPs) which share significant amino acid sequence identity to conventional actins. Both actins and ARPs have an actin fold, which is an ATP-binding cleft, as a common feature. The ARPs are involved in diverse cellular processes, including vesicular transport, spindle orientation, nuclear migration and chromatin remodeling. This gene encodes a subunit of the BAF (BRG1/brm-associated factor) complex in mammals, which is functionally related to SWI/SNF complex in S. cerevisiae and Drosophila; the latter is thought to facilitate transcriptional activation of specific genes by antagonizing chromatin-mediated transcriptional repression. This subunit may be involved in the regulation of genes by structural modulation of their chromatin, specifically in the brain.
3254	The protein encoded by this gene is involved in the synthesis of asparagine. This gene complements a mutation in the temperature-sensitive hamster mutant ts11, which blocks progression through the G1 phase of the cell cycle at nonpermissive temperature. There are three alternatively spliced transcript variants encoding the same protein described for this gene.
3255	
3256	Acyloxyacyl hydrolase (AOAH) is a 2-subunit lipase which selectively hydrolyzes the secondary (acyloxyacyl-linked) fatty acyl chains from the lipid A region of bacterial endotoxins.  AOAH may modulate host inflammatory responses to gram-negative bacterial invasion. The 2 subunits are encoded by a single mRNA.
3257	
3258	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. The encoded protein may be involved in the placement of hindbrain segments in the proper location along the anterior-posterior axis during development. Two transcript variants encoding two different isoforms have been found for this gene, with only one of the isoforms containing the homeodomain region.
3259	ADP-ribosylation factor-like 4 is a member of the ADP-ribosylation factor family of GTP-binding proteins. ARL4 is similar to ARL7 and ARL6 and each has a nuclear localization signal and an unusually high guaninine nucleotide exchange rate. ARL4 is located in both the nuclear and extranuclear cell compartments. Two transcript variants encoding the same protein have been found for this gene.
3260	The protein encoded by this gene belongs to the complex I 9kDa subunit family. Mammalian complex I of mitochondrial respiratory chain is composed of 45 different subunits. This protein has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone.
3261	
3262	
3263	
3264	The proto-oncogene MET product is the hepatocyte growth factor receptor and encodes tyrosine-kinase activity. The primary single chain precursor protein is post-translationally cleaved to produce the alpha and beta subunits, which are disulfide linked to form the mature receptor. Various mutations in the MET gene are associated with papillary renal carcinoma.
3265	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in motility, invasion, and tubulin polymerization. Chromosomal rearrangements and altered expression of this gene have been implicated in tumor metastasis.  Alternative splicing of the 5@#$%& UTR results in two gene products.
3266	
3267	
3268	
3269	
3270	
3271	The adipocyte enhancer binding protein 1 is a transcriptional repressor with carboxypeptidase (CP) activity. This protein binds to a regulatory sequence, adipocyte enhancer 1 (AE-1), located in the proximal promoter region of the adipose P2 (aP2) gene, which encodes the adipocyte fatty-acid binding protein. It is characterized as a member of the regulatory B-like CP family. This protein seems to be activated by a novel mechanism, whereby the direct binding of DNA enhances its protease activity. Adipocyte-enhancer binding protein 1 may play a role in differentiated vascular smooth muscle cells.
3272	Ephrin receptors and their ligands, the ephrins, mediate numerous developmental processes, particularly in the nervous system. Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. The Eph family of receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. Ephrin receptors make up the largest subgroup of the receptor tyrosine kinase (RTK) family. The ephrin receptor encoded by this gene lacks the kinase activity of most receptor tyrosine kinases and binds to ephrin-B ligands.
3273	Thiamine pyrophosphokinase (TPK; EC 2.7.6.2) is a cellular enzyme involved in the regulation of thiamine metabolism. TPK catalyzes the conversion of thiamine, a form of vitamin B1, to thiamine pyrophosphate (TDP, or TPP). TDP is an active cofactor for enzymes involved in glycolysis and energy production, including transketolase, pyruvate dehydrogenase, and alpha-ketoglutarate dehydrogenase.[supplied by OMIM]
3274	
3275	Vasoactive intestinal peptide (VIP; MIM 192320) and pituitary adenylate cyclase activating polypeptide (PACAP; MIM 102980) are homologous peptides that function as neurotransmitters and neuroendocrine hormones. While the receptors for VIP and PACAP share homology, they differ in their substrate specificities and expression patterns. See VIPR1 (MIM 192321) and ADCYAP1R1(MIM 102981).[supplied by OMIM]
3276	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility.
3277	
3278	
3279	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is one of two V1 domain C subunit proteins and is found ubiquitously. This C subunit is analogous but not homologous to gamma subunit of F-ATPases. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene. Previously, this gene was designated ATP6D.
3280	The correlation of anti-sperm antibodies with cases of unexplained infertility implicates a role for these antibodies in blocking fertilization. Improved diagnosis and treatment of immunologic infertility, as well as identification of proteins for targeted contraception, are dependent on the identification and characterization of relevant sperm antigens. The protein expressed by this gene is recognized by anti-sperm agglutinating antibodies from an infertile woman. Furthermore, immunization of female rats with the recombinant human protein reduced fertility. This protein localizes to the plasma membrane of germ cells in the testis and to the post-acrosomal plasma membrane of mature spermatozoa. Recombinant polypeptide binds GTP and exhibits GTPase activity. Thus, this protein may regulate GTP signal transduction pathways involved in spermatogenesis and fertilization. Two transcript variants of this gene encode the same protein.
3281	
3282	
3283	This gene encodes the coiled-coil containing protein optineurin. Optineurin may play a role in normal-tension glaucoma and adult-onset primary open angle glaucoma. Optineurin interacts with adenovirus E3-14.7K protein and may utilize tumor necrosis factor-alpha or Fas-ligand pathways to mediate apoptosis, inflammation or vasoconstriction. Optineurin may also function in cellular morphogenesis and membrane trafficking, vesicle trafficking, and transcription activation through its interactions with the RAB8, huntingtin, and transcription factor IIIA proteins. Alternative splicing results in multiple transcript variants encoding the same protein.
3284	
3285	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This protein is a potent epithelial cell-specific growth factor, whose mitogenic activity is predominantly exhibited in keratinocytes but not in fibroblasts and endothelial cells. Studies of mouse and rat homologs of this gene implicated roles in morphogenesis of epithelium, reepithelialization of wounds, hair development and early lung organogenesis.
3286	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium voltage-gated channel subfamily V. This protein is essentially present in the brain, and its role might be to inhibit the function of a particular class of outward rectifier potassium channel types.
3287	
3288	
3289	
3290	
3291	LPL encodes lipoprotein lipase, which is expressed in heart, muscle, and adipose tissue. LPL functions as a homodimer, and has the dual functions of triglyceride hydrolase and ligand/bridging factor for receptor-mediated lipoprotein uptake. Severe mutations that cause LPL deficiency result in type I hyperlipoproteinemia, while less extreme mutations in LPL are linked to many disorders of lipoprotein metabolism.
3292	The transport of protein and large RNAs through the nuclear pore complexes (NPC) is an energy-dependent and regulated process. The import of proteins with a nuclear localization signal (NLS) is accomplished by recognition of one or more clusters of basic amino acids by the importin-alpha/beta complex; see MIM 600685 and MIM 602738. The small GTPase RAN (MIM 601179) plays a key role in NLS-dependent protein import. RAN-binding protein-16 is a member of the importin-beta superfamily of nuclear transport receptors.[supplied by OMIM]
3293	Superior cervical ganglion-10 is a neuronal growth-associated protein that shares significant amino acid sequence similarity with the phosphoprotein stathmin (MIM 151442).[supplied by OMIM]
3294	Squalene epoxidase catalyzes the first oxygenation step in sterol biosynthesis and is thought to be one of the rate-limiting enzymes in this pathway.
3295	
3296	
3297	
3298	
3299	O-mannosylation is an important protein modification in eukaryotes that is initiated by an evolutionarily conserved family of protein O-mannosyltransferases. POMT1 shares sequence similarity with protein O-mannosyltransferases of S. cerevisiae. In yeast, these enzymes are located in the endoplasmic reticulum (ER) and are required for cell integrity and cell wall rigidity. POMT1 also shows similarity to the Drosophila @#$%&rotated abdomen@#$%& (rt) gene, which when mutated causes defects in myogenesis and muscle structure.[supplied by OMIM]
3300	
3301	O-mannosylation is an important protein modification in eukaryotes that is initiated by an evolutionarily conserved family of protein O-mannosyltransferases. POMT1 shares sequence similarity with protein O-mannosyltransferases of S. cerevisiae. In yeast, these enzymes are located in the endoplasmic reticulum (ER) and are required for cell integrity and cell wall rigidity. POMT1 also shows similarity to the Drosophila @#$%&rotated abdomen@#$%& (rt) gene, which when mutated causes defects in myogenesis and muscle structure.[supplied by OMIM]
3302	
3303	
3304	
3305	
3306	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein contains 5 WD repeats, and has a structural similarity with actin-binding proteins: the D. discoideum coronin and the human p57 protein, suggesting that this protein may also be an actin-binding protein that regulates cell motility. Alternative splicing of this gene generates 2 transcript variants.
3307	The protein encoded by this gene, a member of the peptidase C1 family, is a lysosomal cysteine proteinase that may play an important role in corneal physiology. This gene is expressed in colorectal and breast carcinomas but not in normal colon, mammary gland, or peritumoral tissues, suggesting a possible role for this gene in tumor processes.
3308	VAV2 is the second member of the VAV oncogene family. Unlike VAV1, which is expressed exclusively in hematopoietic cells, VAV2 transcripts were found in most tissues.
3309	This gene is located in the surfeit gene cluster, a group of very tightly linked genes that do not share sequence similarity. The gene shares a bidirectional promoter with SURF1, which is located on the opposite strand. The function of the encoded protein from this housekeeping gene has not yet been determined.
3310	
3311	
3312	The Golgi complex plays a key role in the sorting and modification of proteins exported from the endoplasmic reticulum. The protein encoded by this gene is a type II Golgi transmembrane protein. It processes protein synthesized in the rough endoplasmic reticulum and assists in the transport of protein cargo through the Golgi apparatus. The expression of this encoded protein has been observed to be upregulated in response to viral infection. Two alternatively spliced transcript variants encoding the same protein have been described for this gene.
3313	
3314	The protein encoded by this gene shares similarity with the yeast Mob1 protein. Yeast Mob1 binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. This gene is located on the opposite strand as the interferon kappa precursor (IFNK) gene.
3315	The protein encoded by this gene is a glutathione-independent prostaglandin D synthase that catalyzes the conversion of prostaglandin H2 (PGH2) to postaglandin D2 (PGD2). PGD2 functions as a neuromodulator as well as a trophic factor in the central nervous system. PGD2 is also involved in smooth muscle contraction/relaxation and is a potent inhibitor of platelet aggregation. This gene is preferentially expressed in brain. Studies with transgenic mice overexpressing this gene suggest that this gene may be also involved in the regulation of non-rapid eye movement sleep.
3316	
3317	This gene encodes a member of the histo-blood group ABO gene family that encodes glycosyltransferases with related but distinct substrate specificity. This protein plays a role in synthesizing Forssman glycolipid (FG), a member of the globoseries glycolipid family. Human cells do not normally produce FG but produce the precursor glycolipids globotriaosylceramide and globoside. This protein may be involved in the tropism and binding of pathogenic organisms.
3318	
3319	This gene encodes a member of the annexin family. Members of this calcium-dependent phospholipid-binding protein family play a role in the regulation of cellular growth and in signal transduction pathways. This protein functions as an autocrine factor which heightens osteoclast formation and bone resorption. This gene has three pseudogenes located on chromosomes 4, 9 and 10, respectively. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
3320	The oncogene BCL2 is a membrane protein that blocks a step in a pathway leading to apoptosis or programmed cell death. The protein encoded by this gene binds to BCL2 and is referred to as BCL2-associated athanogene. It enhances the anti-apoptotic effects of BCL2 and represents a link between growth factor receptors and anti-apoptotic mechanisms. At least three protein isoforms are encoded by this mRNA through the use of alternative translation initiation sites, including a non-AUG site.
3321	
3322	
3323	
3324	The protein encoded by this gene is a multi-membrane spanning protein containing a RING-H2 finger. This protein is located in the endoplasmic reticulum, and has been shown to possess ubiquitin ligase activity. This gene was found to be interrupted by a t(3:8) translocation in a family with hereditary renal and non-medulary thyroid cancer. Studies of the Drosophila counterpart suggested that this protein may interact with tumor suppressor protein VHL, as well as with  COPS5/JAB1, a protein responsible for the degradation of tumor suppressor CDKN1B/P27KIP.
3325	Nebulin is a giant protein component of the cytoskeletal matrix that coexists with the thick and thin filaments within the sarcomeres of skeletal muscle. In most vertebrates, nebulin accounts for 3 to 4% of the total myofibrillar protein and its size varies from 600 to 800 kD in a manner that is tissue-, species-,and developmental stage-specific. The 6,669-residue predicted protein, the short form, contains 185 copies of 35-amino acid modules that can be classified into 7 types.  It is suggested that alternative splicing may explain the developmental or tissue-specific size variants of nebulin. However, the nucleotide sequence information is not available for other spliced products.  The mutations in the NEB gene are associated with recessive nemaline myopathy.
3326	This protein encodes an enzyme that synthesizes selenophosphate from selenide and ATP. Selenophosphate is the selenium donor used to synthesize selenocysteine, which is co-translationally incorporated into selenoproteins at in-frame UGA codons. This protein itself contains a selenocysteine residue in its predicted active site. The 3@#$%& UTR of the gene has a stem-loop secondary structure called a selenocysteine insertion sequence (SECIS) element, which allows UGA to direct the incorporation of selenocysteine rather than signal a translational stop. Alternatively spliced transcripts have been identified, but their biological validity has not been determined.
3327	
3328	The dicarboxylate carrier catalyzes the transport of dicarboxylates such as malate and succinate across the mitochondrial membrane in exchange for phosphate, sulfate, and thiosulfate, thus supplying substrates for the Krebs cycle, gluconeogenesis, urea synthesis, and sulfur metabolism.[supplied by OMIM]
3329	
3330	
3331	
3332	
3333	
3334	
3335	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and the kininogens. The type 2 cystatin proteins are a class of cysteine proteinase inhibitors found in a variety of human fluids and secretions, where they appear to provide protective functions. This gene encodes a cystatin from the type 2 family, which is down-regulated in metastatic breast tumor cells as compared to primary tumor cells. Loss of expression is likely associated with the progression of a primary tumor to a metastatic phenotype.
3336	
3337	WW domain-containing proteins are found in all eukaryotes and play an important role in the regulation of a wide variety of cellular functions such as protein degradation, transcription, and RNA splicing. This gene encodes a protein which contains 4 tandem WW domains and a HECT (homologous to the E6-associated protein carboxyl terminus) domain. The encoded protein belongs to a family of NEDD4-like proteins, which are E3 ubiquitin-ligase molecules and regulate key trafficking decisions, including targeting of proteins to proteosomes or lysosomes. Alternative splicing of this gene generates at least 6 transcript variants; however, the full length nature of these transcripts has not been defined.
3338	
3339	The protein encoded by this gene belongs to the small GTPase superfamily, Rab family. It is GTP-binding protein and may be involved in vesicle transport.
3340	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. This gene encodes a protein that binds ephrin-A ligands. Two alternatively spliced transcript variants have been described for this gene.
3341	
3342	
3343	
3344	
3345	
3346	
3347	This gene encodes a member of the alpha-2/delta subunit family, a protein in the voltage-dependent calcium channel complex. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization and consist of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. Various versions of each of these subunits exist, either expressed from similar genes or the result of alternative splicing. Research on a highly similar protein in rabbit suggests the protein described in this record is cleaved into alpha-2 and delta subunits. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
3348	
3349	Zinc finger proteins interact with nucleic acids and have diverse functions. The zinc finger domain is a conserved amino acid sequence motif containing 2 specifically positioned cysteines and 2 histidines that are involved in coordinating zinc. Kruppel-related proteins form 1 family of zinc finger proteins. See ZFP93 (MIM 604749) for additional information on zinc finger proteins.[supplied by OMIM]
3350	
3351	This gene encodes a member of the WD-protein subfamily. The encoded protein contains five WD-repeats spanning most of the protein and an SOCS box in the C-terminus. The SOCS box may act as a bridge between specific substrate-binding domains and E3 ubiquitin protein ligases.
3352	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Like most members, the encoded protein contains an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain.
3353	tRNA splicing is a fundamental process required for cell growth and division. SEN54 is a subunit of the tRNA splicing endonuclease, which catalyzes the removal of introns, the first step in tRNA splicing (Paushkin et al., 2004).[supplied by OMIM]
3354	
3355	The protein encoded by this gene has structural and functional similarities to complement component 9 (C9). Like C9, this protein creates transmembrane tubules and is capable of lysing non-specifically a variety of target cells. This protein is one of the main cytolytic proteins of cytolytic granules, and it is known to be a key effector molecule for T-cell- and natural killer-cell-mediated cytolysis. Defects in this gene cause familial hemophagocytic lymphohistiocytosis type 2 (HPLH2), a rare and lethal autosomal recessive disorder of early childhood.
3356	
3357	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. The encoded protein consists of an extracellular domain containing 7 cadherin repeats, a transmembrane domain and a cytoplasmic tail that differs from those of the classical cadherins. The gene is located in a major X/Y block of homology and its most closely related cadherin superfamily member is located in this homologous region on the X chromosome. The protein is thought to play a fundamental role in cell-cell recognition essential for the segmental development and function of the central nervous system. Transcripts arising from alternative splicing encode isoforms with N- and C-terminal variation.
3358	
3359	This gene encodes a protein sharing sequence similarity with ribosomal protein S27. It is not currently known whether the encoded protein is a functional ribosomal protein or whether it has evolved a function that is independent of the ribosome.
3360	
3361	
3362	
3363	
3364	The protein encoded by this gene belongs to the ADAM-TS/metallospondin family of proteins, which are structurally homologous to the ADAM proteins, but they also contain at least 1 C-terminal thrombospondin type 1 (TSP1) repeat, and are secreted, instead of being membrane bound. Members of the ADAM-TS family have been implicated in the cleavage of proteoglycans, the control of organ shape during development, and the inhibition of angiogenesis. This gene is localized to chromosome 3p14.3-p14.2, an area known to be lost in hereditary renal tumors. Three transcript variants encoding different isoforms have been identified for this locus.
3365	
3366	This gene encodes an ataxin type 2 related protein of unknown function. This protein is a member of the spinocerebellar ataxia (SCAs) family, which is associated with a complex group of neurodegenerative disorders. At least 6 alternatively spliced transcripts are encoded by this gene.
3367	The protein encoded by this gene was identified by a two-hybrid screen using CD4 as the bait. It binds to the hydrophobic C-terminal amino acids of CD4 which are involved in repression of T cell activation. The interaction with CD4 is mediated by the noncatalytic alpha/beta hydrolase fold domain of this protein. It is thus proposed that this gene product modulates the stimulatory activity of CD4.
3368	The function of this gene is unknown; however, it may play a role in the ubiquitin-mediated proteolytic pathway.  Mutations in this gene are known to cause autosomal recessive juvenile parkinsonism.  Alternative splicing of this gene produces three known products of undetermined function.
3369	
3370	
3371	This gene belongs to the hyaluronan and proteoglycan binding link protein gene family. The protein encoded by this gene may function in hyaluronic acid binding and cell adhesion.
3372	
3373	
3374	
3375	The protein encoded by this gene is found in the nucleus, where it functions as an SII class transcription elongation factor. Elongation factors in this class are responsible for releasing RNA polymerase II ternary complexes from transcriptional arrest at template-encoded arresting sites. The encoded protein has been shown to interact with general transcription factor IIB, a basal transcription factor. Two transcript variants encoding different isoforms have been found for this gene.
3376	The protein encoded by this gene is one of the four known components of the sarcoglycan complex, which is a subcomplex of the dystrophin-glycoprotein complex (DGC). DGC forms a link between the F-actin cytoskeleton and the extracellular matrix. This protein is expressed most abundantly in skeletal and cardiac muscle. The mutations in this gene have been associated with autosomal recessive limb-girdle muscular dystrophy and dilated cardiomyopathy. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
3377	
3378	This gene encodes the human cellular counterpart of a feline sarcoma retrovirus protein with transforming capabilities. The gene product has tyrosine-specific protein kinase activity and that activity is required for maintenance of cellular transformation. Its chromosomal location has linked it to a specific translocation event identified in patients with acute promyelocytic leukemia but it is also involved in normal hematopoiesis. A truncated transcript has been identified that is generated utilizing a start site in one of the far downstream exons but a protein product associated with this transcript has not been identified.
3379	This gene encodes a protein with high similarity to the calcium-binding proteins of the calmodulin family. The encoded protein contains two EF-hand domains and potential calcium-binding sites. Alternative splicing results in multiple transcript variants.
3380	NPR1 is a membrane-bound guanylate cyclase that serves as the receptor for both atrial and brain natriuretic peptides (ANP (MIM 108780) and BNP (MIM 600295), respectively).[supplied by OMIM]
3381	
3382	The protein encoded by this gene is a lysosomal cysteine proteinase important in the overall degradation of lysosomal proteins. It is composed of a dimer of disulfide-linked heavy and light chains, both produced from a single protein precursor. The encoded protein, which belongs to the peptidase C1 protein family, can act both as an aminopeptidase and as an endopeptidase. Increased expression of this gene has been correlated with malignant progression of prostate tumors. Two transcript variants encoding different isoforms have been found for this gene.
3383	Apelin is a neuropeptide expressed in the supraoptic and paraventricular nuclei that acts on specific receptors located on vasopressinergic neurons (De Mota et al., 2004).[supplied by OMIM]
3384	This gene is a member of the Snail family of C2H2-type zinc finger transcription factors. It codes for a neural-specific transcriptional repressor that binds to E-box motifs. Scrt1 may promote neural differention and may be involved in cancers with neuroendocrine features.
3385	
3386	
3387	
3388	
3389	
3390	
3391	
3392	
3393	
3394	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein and is similar in sequence to its family member CD53 antigen. It is known to complex with integrins and other transmembrane 4 superfamily proteins.
3395	POU6F2 is a member of a gene family characterized by the presence of a bipartite DNA-binding domain, consisting of a POU-specific domain and a POU heterodomain, separated by a variable polylinker. POU domain family members are transcriptional regulators, many of which show highly restricted patterns of expression and are known to control cell type-specific differentiation pathways (see review by Phillips and Luisi, 2000).[supplied by OMIM]
3396	
3397	
3398	
3399	
3400	
3401	
3402	The Axin-related protein, Axin2, presumably plays an important role in the regulation of the stability of beta-catenin in the Wnt signaling pathway, like its rodent homologs, mouse conductin/rat axil. In mouse, conductin organizes a multiprotein complex of APC (adenomatous polyposis of the colon), beta-catenin, glycogen synthase kinase 3-beta, and conductin, which leads to the degradation of beta-catenin. Apparently, the deregulation of beta-catenin is an important event in the genesis of a number of malignancies. The AXIN2 gene has been mapped to 17q23-q24, a region that shows frequent loss of heterozygosity in breast cancer, neuroblastoma, and other tumors. Mutations in this gene have been associated with colorectal cancer with defective mismatch repair.
3403	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. The use of alternate polyadenylation sites has been found for this gene. Two alternative transcripts encoding different isoforms have been described.
3404	
3405	The protein encoded by this gene is the gamma component of fibrinogen, a blood-borne glycoprotein comprised of three pairs of nonidentical polypeptide chains. Following vascular injury, fibrinogen is cleaved by thrombin to form fibrin which is the most abundant component of blood clots. In addition, various cleavage products of fibrinogen and fibrin regulate cell adhesion and spreading, display vasoconstrictor and chemotactic activities, and are mitogens for several cell types. Mutations in this gene lead to several disorders, including dysfibrinogenemia, hypofibrinogenemia and thrombophilia. Alternative splicing results in two isoforms, varying in the carboxy-terminus.
3406	
3407	
3408	
3409	
3410	
3411	
3412	
3413	The gamma-aminobutyric acid (GABA) A receptor is a multisubunit chloride channel that mediates the fastest inhibitory synaptic transmission in the central nervous system. This gene is mapped to chromosome 15q11.2-q12 in a cluster that includes the alpha 5, and gamma 3 subunit genes of the GABA receptor. Alternative splicing generates 2 transcript variants. Deletion mutation of this gene may be involved in the pathogenesis of Angelman syndrome and Prader-Willi syndrome.
3414	
3415	
3416	
3417	
3418	
3419	The protein encoded by this gene is a type I transmembrane protein that localizes to the endocytic pathway. This protein contains a RING zinc-finger motif and has been shown to possess E3 ubiquitin ligase activity. Expression of this gene in retrovirally transduced T cell hybridoma significantly inhibits activation-induced IL2 and IL4 cytokine production. Induced expression of this gene was observed in anergic CD4(+) T cells, which suggested a role in the induction of anergic phenotype. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
3420	CSTF2 is one of three (including CSTF1 and CSTF3) cleavage stimulation factors which combine to form CSTF which is involved in the polyadenylation and 3@#$%&end cleavage of pre-mRNAs.  CSTF2 contains a ribonucleoprotein-type RNA binding domain.  CSTF2 is upregulated during activation of B cells which results in the switch of IgM heavy chain mRNA from membrane bound form to the secreted form.
3421	This gene is similar to a mouse gene that encodes a ferritin heavy polypeptide-like protein.
3422	
3423	
3424	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme is required for post-replicative DNA damage repair. Multiple alternatively spliced transcript variants have been found for this gene and they encode distinct isoforms.
3425	
3426	Kinesins, such as KIF4A, are microtubule-based motor proteins that generate directional movement along microtubules. They are involved in many crucial cellular processes, including cell division (Zhu and Jiang, 2005).[supplied by OMIM]
3427	The protein encoded by this gene is a RING-H2 zinc finger protein. It has been shown to be a ubiquitin protein ligase that targets LIM domain binding 1 (LDB1/CLIM), and causes proteasome-dependent degradation of LDB1. This protein and LDB1 are co-repressors of LHX1/LIM-1, a homeodomain transcription factor. Alternatively spliced transcript variants encoding the same protein have been reported.
3428	
3429	
3430	The protein encoded by this gene is a member of the ros/insulin receptor family of tyrosine kinases. Tyrosine-specific phosphorylation of proteins is a key to the control of diverse pathways leading to cell growth and differentiation. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
3431	
3432	
3433	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the ALD subfamily, which is involved in peroxisomal import of fatty acids and/or fatty acyl-CoAs in the organelle. All known peroxisomal ABC transporters are half transporters which require a partner half transporter molecule to form a functional homodimeric or heterodimeric transporter. This peroxisomal membrane protein is likely involved in the peroxisomal transport or catabolism of very long chain fatty acids. Defects in this gene have been identified as the underlying cause of adrenoleukodystrophy, an X-chromosome recessively inherited demyelinating disorder of the nervous system.
3434	This gene is one of a family of nuclear RNA export factor genes. It encodes a protein that is involved in mRNA export, is located in the nucleoplasm, and is associated with the nuclear envelope. Alternative splicing seems to be a common mechanism in this gene family. Two variants have been found for this gene.
3435	
3436	The protein encoded by this gene catalyzes the first step in ubiquitin conjugation to mark cellular proteins for degradation. This gene complements an X-linked mouse temperature-sensitive defect in DNA synthesis, and thus may function in DNA repair. It is part of a gene cluster on chromosome Xp11.23. Alternative splicing results in 2 transcript variants encoding the same protein, but with different 5@#$%& UTR.
3437	The protein encoded by this gene catalyzes the first step in ubiquitin conjugation to mark cellular proteins for degradation. This gene complements an X-linked mouse temperature-sensitive defect in DNA synthesis, and thus may function in DNA repair. It is part of a gene cluster on chromosome Xp11.23. Alternative splicing results in 2 transcript variants encoding the same protein, but with different 5@#$%& UTR.
3438	This gene encodes a protein similar to eukaryotic translation initiation factor 1A (EIF1A). EIF1A is required for the binding of the 43S complex (a 40S subunit, eIF2/GTP/Met-tRNAi and eIF3) to the 5@#$%& end of capped RNA.
3439	
3440	
3441	
3442	This gene encodes coagulation factor V which is an essential factor of the blood coagulation cascade. This factor circulates in plasma, and is converted to the active form by the release of the activation peptide by thrombin during coagulation. This generates a heavy chain and a light chain which are held together by calcium ions. The active factor V is a cofactor that participates with activated coagulation factor X to activate prothrombin to thrombin. Defects in this gene result in either an autosomal recessive hemorrhagic diathesis or an autosomal dominant form of thrombophilia, which is known as activated protein C resistance.
3443	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. This gene is the human ortholog of mouse Tbx19/Tpit gene. Studies in mouse show that Tpit protein is present only in the two pituitary pro-opiomelanocortin (POMC)-expressing lineages, the corticotrophs and melanotrophs. Mutations in the human ortholog were found in patients with isolated deficiency of pituitary POMC-derived ACTH, suggesting an essential role for this gene in differentiation of the pituitary POMC lineage.
3444	
3445	The product encoded by this gene is predominantly localized in the nucleus and is a cell division cycle protein with kinase activity. Although expression levels of the protein appear to be constant throughout the cell cycle, the protein kinase activity appears to increase during S phase. It has been suggested that the protein is essential for initiation of DNA replication and that it plays a role in regulating cell cycle progression. Overexpression of this gene product may be associated with neoplastic transformation for some tumors. Additional transcript sizes have been detected, suggesting the presence of alternative splicing.
3446	
3447	
3448	
3449	
3450	UDP-N-acetylglucosamine:lysosomal-enzyme N-acetylglucosamine-1-phosphotransferase (GlcNAc-phosphotransferase) catalyzes the initial step in the synthesis of the mannose 6-phosphate determinant required for efficient intracellular targeting of newly synthesized lysosomal hydrolases to the lysosome. GlcNAc-phosphotransferase is an alpha-2/beta-2/gamma-2 hexameric complex. Also see GNPTA (MIM 607840).[supplied by OMIM]
3451	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of Na+/K+ -ATPases. Na+/K+ -ATPase is an integral membrane protein responsible for establishing and maintaining the electrochemical gradients of Na and K ions across the plasma membrane. These gradients are essential for osmoregulation, for sodium-coupled transport of a variety of organic and inorganic molecules, and for electrical excitability of nerve and muscle. This enzyme is composed of two subunits, a large catalytic subunit (alpha) and a smaller glycoprotein subunit (beta). The catalytic subunit of Na+/K+ -ATPase is encoded by multiple genes. This gene encodes an alpha 2 subunit.
3452	
3453	Carnitine palmitoyltransferase II precursor (CPT2) is a nuclear protein which is transported to the mitochondrial inner membrane.  CPT2 together with carnitine palmitoyltransferase I oxidizes long-chain fatty acids in the mitochondria.   Defects in this gene are associated with mitochondrial long-chain fatty-acid (LCFA) oxidation disorders.
3454	
3455	
3456	The protein encoded by this gene is a beta-amyloid peptide-binding protein. It contains a structural module related to that of the seven transmembrane domain G protein-coupled receptor superfamily and known to be important in heterotrimeric G protein activation. Beta-amyloid peptide has been established to be a causative factor in neuron death and the consequent dimunition of cognitive abilities observed in Alzheimer@#$%&s disease. This protein may be a target of neurotoxic beta-amyloid peptide, and may mediate cellular vulnerability to beta-amyloid peptide toxicity through a G protein-regulated program of cell death.
3457	
3458	
3459	This gene encodes a large extracellular member of the immunoglobulin superfamily. A similar protein in C. elegans forms long, fine tracks at specific extracellular sites which are involved in many processes such as stabilization of the germline syncytium, anchorage of mechanosensory neurons to the epidermis, and organization of hemidesmosomes in the epidermis.
3460	
3461	BAI1, a p53-target gene, encodes brain-specific angiogenesis inhibitor, a seven-span transmembrane protein and is thought to be a member of the secretin receptor family. Brain-specific angiogenesis proteins BAI2 and BAI3 are similar to BAI1 in structure, have similar tissue specificities and may also play a role in angiogenesis.
3462	
3463	
3464	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem intracellular catalytic domains, and thus represents a receptor-type PTP. The extracellular region contains a meprin-A5 antigen-PTP (MAM) domain, Ig-like and fibronectin type III-like repeats. This PTP was thought to play roles in cell-cell recognition and adhesion. Studies of the similar gene in mice suggested the role of this PTP in early neural development. The expression of this gene was reported to be regulated by phorbol myristate acetate (PMA) or calcium ionophore in Jurkat T lymphoma cells. Three alternatively spliced transcript variants, which encode distinct proteins, have been reported.
3465	
3466	This gene encodes a protein that belongs to the ligand-gated ionic channel family. It can coassemble with either GRIK4 or GRIK5 to form heteromeric receptors and acts as an excitatory neurotransmitter at many synapses in the central nervous system. RNA editing in the mRNA has been reported.
3467	
3468	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains an N-terminal noncatalytic domain similar to that of band 4.1 superfamily cytoskeleton-associated proteins, which suggested the membrane or cytoskeleton localization of this protein. The specific function of this PTP has not yet been determined.
3469	This gene is a member of the ras superfamily, and is expressed in normal ovarian and breast epithelial cells but not in ovarian and breast cancers.  It is a maternally imprinted gene and its expression is associated with growth suppression.  Thus, this gene appears to be a putative tumor suppressor gene whose function is abrogated in ovarian and breast cancers.
3470	
3471	
3472	
3473	
3474	
3475	CDC20 appears to act as a regulatory protein interacting with several other proteins at multiple points in the cell cycle.  It is required for two microtubule-dependent processes, nuclear movement prior to anaphase and chromosome separation.
3476	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. The enzyme encoded by this gene synthesizes N-acetyllactosamine in glycolipids and glycoproteins. Its substrate specificity is affected by alpha-lactalbumin but it is not expressed in lactating mammary tissue. Two transcript variants encoding the same protein have been found for this gene.
3477	
3478	
3479	
3480	
3481	
3482	
3483	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. This gene encodes a protein that binds ephrin-A ligands.
3484	
3485	This gene encodes a protein tyrosine phosphatase which is expressed primarily in lymphoid tissues. This enzyme associates with the molecular adapter protein CBL and may be involved in regulating CBL function in the T-cell receptor signaling pathway. Alternative splicing of this gene results in two transcript variants encoding distinct isoforms.
3486	
3487	
3488	
3489	
3490	Calpains are ubiquitous, well-conserved family of calcium-dependent, cysteine proteases. The calpain proteins are heterodimers consisting of an invariant small subunit and variable large subunits. The large subunit possesses a cysteine protease domain, and both subunits possess calcium-binding domains. Calpains have been implicated in neurodegenerative processes, as their activation can be triggered by calcium influx and oxidative stress. The protein encoded by this gene is expressed predominantly in stomach and small intestine and may have specialized functions in the digestive tract. This gene is thought to be associated with gastric cancer. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
3491	
3492	
3493	The encoded protein has a central role in the regulation of lipid metabolism in cells. It is a sterol-regulated subtilisin-like serine protease that cleaves ER membrane-bound sterol regulatory element-binding proteins (SREBPs), a reaction that initiates the two-step proteolytic process by which transcriptionally active fragments of SREBPs are released from the membrane for translocation to the nucleus. The gene product is an integral membrane ER protein, with the bulk located in the ER lumen. It is synthesized as an inactive preproprotein that is self-activated by an intramolecular cleavage that generates the mature protein. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
3494	Mitochondrial complex I (NADH-ubiquinone reductase; EC 1.6.5.3) is the first multimeric complex of the respiratory chain that catalyzes the NADH oxidation with concomitant ubiquinone reduction and proton ejection out of the mitochondria. Mammalian mitochondrial complex I is an assembly of at least 43 different subunits. Seven of the subunits are encoded by the mitochondrial genome; the remainder are the products of nuclear genes. The iron-sulfur protein (IP) fraction of complex I is made up of 7 subunits, including NDUFS2. See NDUFS1 (MIM 157655).[supplied by OMIM]
3495	
3496	The encoded protein has a central role in the regulation of lipid metabolism in cells. It is a sterol-regulated subtilisin-like serine protease that cleaves ER membrane-bound sterol regulatory element-binding proteins (SREBPs), a reaction that initiates the two-step proteolytic process by which transcriptionally active fragments of SREBPs are released from the membrane for translocation to the nucleus. The gene product is an integral membrane ER protein, with the bulk located in the ER lumen. It is synthesized as an inactive preproprotein that is self-activated by an intramolecular cleavage that generates the mature protein. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
3497	
3498	
3499	
3500	The poly-Ig receptor is expressed on several glandular epithelia including those of liver and breast. It mediates transcellular transport of polymeric immunoglobulin molecules. It is a member of the immunoglobulin superfamily (Hood et al., 1985). The receptor has 5 units with homology to the variable (V) units of immunoglobulins and a transmembrane region, which also has some homology to certain immunoglobulin variable regions.[supplied by OMIM]
3501	This intronless gene encodes a carcinoma-associated antigen defined by the monoclonal antibody GA733. This antigen is a member of a family including at least two type I membrane proteins. It transduces an intracellular calcium signal and acts as a cell surface receptor. Mutations of this gene result in gelatinous drop-like corneal dystrophy, an autosomal recessive disorder characterized by severe corneal amyloidosis leading to blindness.
3502	
3503	
3504	Metabolic N-oxidation of the diet-derived amino-trimethylamine (TMA) is mediated by flavin-containing monooxygenase and is subject to an inherited FMO3 polymorphism in man resulting in a small subpopulation with reduced TMA N-oxidation capacity resulting in fish odor syndrome Trimethylaminuria. Three forms of the enzyme, FMO1 found in fetal liver, FMO2 found in adult liver, and FMO3 are encoded by genes clustered in the 1q23-q25 region.   Flavin-containing monooxygenases are NADPH-dependent flavoenzymes that catalyzes the oxidation of soft nucleophilic heteroatom centers in drugs, pesticides, and xenobiotics.
3505	
3506	Complex II of the respiratory chain, which is specifically involved in the oxidation of succinate, carries electrons from FADH to CoQ.  The complex is composed of four nuclear-encoded subunits and is localized in the mitochondrial inner membrane.  The iron-sulfur subunit is highly conserved and contains three cysteine-rich clusters which may comprise the iron-sulfur centers of the enzyme.
3507	
3508	
3509	
3510	This gene encodes a protein belonging to the pantothenate kinase family. Pantothenate kinase is a key regulatory enzyme in the biosynthesis of coenzyme A (CoA) in bacteria and mammalian cells. It catalyzes the first committed step in the universal biosynthetic pathway leading to CoA and is itself subject to regulation through feedback inhibition by CoA. This family member is most abundant in muscle but is expressed in all tissues.
3511	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of Na+/K+ -ATPases. Na+/K+ -ATPase is an integral membrane protein responsible for establishing and maintaining the electrochemical gradients of Na and K ions across the plasma membrane. These gradients are essential for osmoregulation, for sodium-coupled transport of a variety of organic and inorganic molecules, and for electrical excitability of nerve and muscle. This enzyme is composed of two subunits, a large catalytic subunit (alpha) and a smaller glycoprotein subunit (beta). The catalytic subunit of Na+/K+ -ATPase is encoded by multiple genes. This gene encodes an alpha 1 subunit. Alternatively spliced transcript variants encoding different isoforms have been identified.
3512	
3513	
3514	
3515	
3516	
3517	This gene encodes a protein that is involved in the Ras signal transduction pathway, growth inhibition, and nerve-growth factor induced differentiation processes, as determined in mouse and human cell line studies. In mouse, the encoded protein was initially isolated based on its ability to inhibit v-Ras transformation. Multiple alternatively spliced transcript variants for this gene have been reported; one of these variants was found only in glioma tumors.
3518	This gene encodes a protein that is involved in the Ras signal transduction pathway, growth inhibition, and nerve-growth factor induced differentiation processes, as determined in mouse and human cell line studies. In mouse, the encoded protein was initially isolated based on its ability to inhibit v-Ras transformation. Multiple alternatively spliced transcript variants for this gene have been reported; one of these variants was found only in glioma tumors.
3519	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein is located in the perinuclear area. It interacts with translation release factors and the proteins that are functional homologs of yeast Upf1p and Upf3p. Two splice variants have been found for this gene; both variants encode the same protein.
3520	
3521	
3522	
3523	
3524	
3525	
3526	
3527	This gene encodes a member of the aldo/keto reductase superfamily, which consists of more than 40 known enzymes and proteins. These enzymes catalyze the conversion of aldehydes and ketones to their corresponding alcohols by utilizing NADH and/or NADPH as cofactors. The enzymes display overlapping but distinct substrate specificity. This enzyme catalyzes the bioreduction of chlordecone, a toxic organochlorine pesticide, to chlordecone alcohol in liver. This gene shares high sequence identity with three other gene members and is clustered with those three genes at chromosome 10p15-p14.
3528	
3529	This gene is a member of the cadherin superfamily of calcium-dependent cell-cell adhesion molecules. The encoded protein has a signal peptide, six cadherin repeat domains and a unique cytoplasmic region. This non-classical cadherin appears to be exclusively expressed in the mitral and tufted cells in the main and accessory olfactory bulbs of the brain, suggesting a possible role in the formation and maintenance of neuronal networks.
3530	
3531	
3532	
3533	The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily.  These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues.  They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and differentiation.  Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli.  This gene product inactivates ERK1, is expressed in a variety of tissues with the highest levels in pancreas and brain, and is localized in the nucleus.
3534	This gene encodes a member of the ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) protein family. ADAMTS family members share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The protein encoded by this gene has a high sequence similarity to the protein encoded by gene ADAMTS16, another family member. Alternative splicing results in two transcript variants encoding distinct isoforms.
3535	Actin alpha 2, the human aortic smooth muscle actin gene, is one of six different actin isoforms which have been identified.  Actins are highly conserved proteins that are involved in cell motility, structure and integrity.  Alpha actins are a major constituent of the contractile apparatus.
3536	
3537	
3538	
3539	
3540	
3541	This gene encodes the D4 subtype of the dopamine receptor. The D4 subtype is a G-protein coupled receptor which inhibits adenylyl cyclase. It is a target for drugs which treat schizophrenia and Parkinson disease. Mutations in this gene have been associated with various behavioral phenotypes, including autonomic nervous system dysfunction, attention deficit/hyperactivity disorder, and the personality trait of novelty seeking. This gene contains a polymorphic number (2-10 copies) of tandem 48 nt repeats; the sequence shown contains four repeats.
3542	
3543	This gene encodes a nuclear protein involved in homologous recombination, telomere length maintenance, and DNA double-strand break repair. By itself, the protein has 3@#$%& to 5@#$%& exonuclease activity and endonuclease activity. The protein forms a complex with the RAD50 homolog; this complex is required for nonhomologous joining of DNA ends and possesses increased single-stranded DNA endonuclease and 3@#$%& to 5@#$%& exonuclease activities. In conjunction with a DNA ligase, this protein promotes the joining of noncomplementary ends in vitro using short homologies near the ends of the DNA fragments. This gene has a pseudogene on chromosome 3. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
3544	
3545	The protein encoded by this gene is a member of the BCL-2 family. BCL-2 family members are known to be regulators of programmed cell death. This protein positively regulates cell apoptosis by forming heterodimers with BCL-xL and BCL-2, and reversing their death repressor activity. Proapoptotic activity of this protein is regulated through its phosphorylation. Protein kinases AKT and MAP kinase, as well as protein phosphatase calcineurin were found to be involved in the regulation of this protein. Alternative splicing of this gene results in two transcript variants which encode the same isoform.
3546	Hepatitis delta virus (HDV) is a pathogenic human virus whose RNA genome and replication cycle resemble those of plant viroids. Delta-interacting protein A (DIPA), a cellular gene product, has been found to have homology to hepatitis delta virus antigen (HDAg). DIPA interacts with the viral antigen, HDAg, and can affect HDV replication in vitro.
3547	
3548	
3549	
3550	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the peptidase T1A family, that is a 20S core alpha subunit. Two alternative transcripts encoding different isoforms have been identified.
3551	
3552	There is evidence that this intronless gene is transcribed but the protein is predicted. The gene function is unknown.
3553	
3554	
3555	Nucleases are components of DNA and RNA metabolism that carry out functions in DNA repair, replication, and recombination and in RNA processing and degradation. SFN is a homolog of Orn, a 3-prime-to-5-prime exoribonuclease of E. coli that attacks the free 3-prime hydroxyl group on single-stranded RNA, releasing 5-prime mononucleotides in a sequential manner.[supplied by OMIM]
3556	Thymus development depends on a complex series of interactions between thymocytes and the stromal component of the organ. Epithelial V-like antigen (EVA) is expressed in thymus epithelium and strongly downregulated by thymocyte developmental progression. This gene is expressed in the thymus and in several epithelial structures early in embryogenesis. It is highly homologous to the myelin protein zero and, in thymus-derived epithelial cell lines, is poorly soluble in nonionic detergents, strongly suggesting an association to the cytoskeleton. Its capacity to mediate cell adhesion through a homophilic interaction and its selective regulation by T cell maturation might imply the participation of EVA in the earliest phases of thymus organogenesis. The protein bears a characteristic V-type domain and two potential N-glycosylation sites in the extracellular domain; a putative serine phosphorylation site for casein kinase 2 is also present in the cytoplasmic tail. Two transcript variants encoding the same protein have been found for this gene.
3557	Vesicle mediated protein sorting plays an important role in segregation of intracellular molecules into distinct organelles. Genetic studies in yeast have identified more than 40 vacuolar protein sorting (VPS) genes involved in vesicle transport to vacuoles. This gene encodes the human homolog of yeast class C Vps11 protein. The mammalian class C Vps proteins are predominantly associated with late endosomes/lysosomes, and like their yeast counterparts, may mediate vesicle trafficking steps in the endosome/lysosome pathway.
3558	
3559	
3560	
3561	The zinc finger protein encoded by this gene is one of several cellular transcription factors whose DNA-binding activities are regulated through the action of adenovirus E1A. A 50-kDa amino-terminal product is generated from the full-length protein through proteolytic cleavage. The protein is differentially regulated by E1A-induced phosphorylation. The full-length gene product represses transcription from the E4 promoter in the absence of E1A, while the 50-kDa form acts as a transcriptional activator in its presence.
3562	This gene encodes a type I integral membrane glycoprotein. The encoded protein acts as a receptor and binds to both soluble and immobilized hyaluronan. This protein may function in lymphatic hyaluronan transport and have a role in tumor metastasis.
3563	
3564	
3565	
3566	
3567	Zeta-globin (HBZ ) is an alpha-like hemoglobin. The zeta-globin polypeptide is synthesized in the yolk sac of the early embryo, while alpha-globin is produced throughout fetal and adult like. The zeta-globin gene is a member of the human alpha-globin gene cluster that involves 4 functional genes and 3 nonfunctional pseudogenes. The order of genes is: 5@#$%&-zeta -- pseudozeta -- pseudoalpha2 -- pseudoalpha1 -- alpha2 -- alph1 -- theta1-3@#$%&.
3568	The protein encoded by this gene is a member of the lipophilin subfamily, part of the uteroglobin superfamily, and is an ortholog of prostatein, the major secretory glycoprotein of the rat ventral prostate gland. Lipophilin gene products are widely expressed in normal tissues, especially in endocrine-responsive organs. Assuming that human lipophilins are the functional counterparts of prostatein, they may be transcriptionally regulated by steroid hormones, with the ability to bind androgens, other steroids and possibly bind and concentrate estramustine, a chemotherapeutic agent widely used for prostate cancer. Although the gene has been reported to be on chromosome 10, this sequence appears to be from a cluster of genes on chromosome 11 that includes mammaglobin 2.
3569	
3570	
3571	Tryptophan hydroxylase (TPH; EC 1.14.16.4) catalyzes the biopterin-dependent monooxygenation of tryptophan to 5-hydroxytryptophan (5HT), which is subsequently decarboxylated to form the neurotransmitter serotonin. It is thus the rate-limiting enzyme in the biosynthesis of serotonin. TPH expression is limited to a few specialized tissues: raphe neurons, pinealocytes, mast cells, mononuclear leukocytes, beta-cells of the islets of Langerhans, and intestinal and pancreatic enterochromaffin cells.[supplied by OMIM]
3572	
3573	The protein encoded by this gene is a member of the tripartite motif (TRIM) family, also called the @#$%&RING-B-box-coiled-coil@#$%& (RBCC) subgroup of RING finger proteins. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein localizes to cytoplasmic filaments. It is similar to a rat protein which is a specific partner for the tail domain of myosin V, a class of myosins which are involved in the targeted transport of organelles. The rat protein can also interact with alpha-actinin-4. Thus it is suggested that this human protein may play a role in myosin V-mediated cargo transport. Alternatively spliced transcript variants encoding the same isoform have been identified.
3574	
3575	
3576	
3577	The protein encoded by this gene is the human ortholog of mouse and Drosophila four-jointed gene product. The Drosophila protein is important for growth and differentiation of legs and wings, and for proper development of the eyes. The exact function of this gene in humans is not known.
3578	ARHG is a member of the RAS superfamily of genes, which encode GTP-binding proteins that act in the pathway of signal transduction and play a key role in the regulation of cellular functions.[supplied by OMIM]
3579	
3580	
3581	
3582	This gene encodes a member of the trypsin family of serine proteases. Its specific function is unknown.
3583	
3584	
3585	
3586	
3587	
3588	The protein encoded by this gene shares sequence similarity with Hir1p and Hir2p, the two corepressors of histone gene transcription characterized in the yeast, Saccharomyces cerevisiae. The structural features of this protein suggest that it may function as part of a multiprotein complex. Several cDNAs encoding interacting proteins, HIRIPs, have been identified. HIRIP4 was isolated by virtue of its interaction with this protein; however, its exact function is not known. The sequence of HIRIP4 protein is highly homologous to the human DNJ3/CPR3, mouse Dj3 and rat Dj2 gene products.
3589	
3590	
3591	
3592	
3593	The VMD2 gene (MIM 607854) underlying vitelliform macular dystrophy and the 3 VMD2-like genes, VMD2L1 (MIM 607335), VMD2L2 (MIM 607336), and VMD2L3, encode transmembrane (TM) spanning proteins that share a homology region with a high content of aromatic residues including an invariant arginine (R), phenylalanine (F), and proline (P) motif. VMD2 and the 3 VMD2-related genes share a conserved gene structure with almost identical sizes of the 8 RFP-TM domain encoding exons and highly conserved positions of their corresponding exon-intron boundaries. Each of the 4 paralogous genes contains a unique 3-prime end of variable length without significant homology to known proteins or motifs (Stohr et al., 2002).[supplied by OMIM]
3594	
3595	
3596	The MAN antigens, such as MAN1, are nuclear membrane proteins that are recognized by autoantibodies from a patient with a collagen vascular disease. Because MAN1 contains an LEM domain, the gene is also called LEMD3 (for @#$%&LEM domain-containing-3@#$%&).[supplied by OMIM]
3597	Islet, or insulinoma, amyloid polypeptide (IAPP, or amylin) is commonly found in pancreatic islets of patients suffering diabetes mellitus type II, or harboring an insulinoma.  While the assosciation of amylin with the development of type II diabetes has been known for some time, a direct causative role for amylin has been harder to establish.  Recent results suggest that amylin, like the related beta-amyloid (Abeta) assosciated with Alzheimer@#$%&s disease, can induce apoptotic cell-death in particular cultured cells, an effect that may be relevant to the development of type II diabetes.
3598	Natural killer (NK) cells are lymphocytes that can mediate lysis of certain tumor cells and virus-infected cells without previous activation. They can also regulate specific humoral and cell-mediated immunity. The protein encoded by this gene belongs to the killer cell lectin-like receptor family, also called NKG2 family, which is a group of transmembrane proteins preferentially expressed in NK cells. This family of proteins is characterized by the type II membrane orientation and the presence of a C-type lectin domain. This protein forms a complex with another family member, KLRD1/CD94, and has been implicated in the recognition of the MHC class I HLA-E molecules in NK cells. The genes of NKG2 family members form a killer cell lectin-like receptor gene cluster on chromosome 12. Four alternatively spliced transcript variants encoding two distinct isoforms have been observed.
3599	
3600	
3601	
3602	
3603	
3604	
3605	
3606	A cluster of genes, BAT1-BAT5, has been localized in the vicinity of the genes for TNF alpha and TNF beta. These genes are all within the human major histocompatibility complex class III region. The protein encoded by this gene is a nuclear protein. It has been implicated in the control of apoptosis and regulating heat shock protein. There are three alternatively spliced transcript variants described for this gene.
3607	RICH1 is a GTPase-activating protein (GAP). GAPs stimulate the intrinsic GTP hydrolysis of small G proteins, such as RHOA (MIM 165390), RAC1 (MIM 602048), and CDC42 (MIM 116952).[supplied by OMIM]
3608	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility.
3609	
3610	
3611	
3612	
3613	
3614	
3615	
3616	
3617	EPLIN is a cytoskeleton-associated protein that inhibits actin filament depolymerization and cross-links filaments in bundles (Maul et al., 2003).[supplied by OMIM]
3618	
3619	
3620	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L6E family of ribosomal proteins. It is located in the cytoplasm. The protein can bind specifically to domain C of the tax-responsive enhancer element of human T-cell leukemia virus type 1, and it has been suggested that the protein may participate in tax-mediated transactivation of transcription. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
3621	
3622	
3623	
3624	
3625	
3626	
3627	
3628	This gene belongs to the forkhead family of transcription factors which are characterized by a distinct forkhead domain.  The specific function of this gene has not yet been determined; however, it may play a role in myogenic growth and differentiation. Translocation of this gene with PAX3 has been associated with alveolar rhabdomyosarcoma.
3629	
3630	
3631	This gene encodes a member of the cytokine family. The protein contains a tyrosine sulfation site, 3 potential N-myristoylation sites, multiple putative phosphorylation sites, and an RGD cell-attachment sequence. Expression of this protein is increased after the activation of T-cells by mitogens or the activation of NK cells by IL-2. This protein induces the production of TNFalpha from macrophage cells. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
3632	
3633	
3634	
3635	
3636	
3637	
3638	
3639	This gene encodes a brain-specific member of the copine family, which is composed of calcium-dependent membrane-binding proteins. The gene product contains two N-terminal C2 domains, and one von Willebrand factor A domain. It may have a role in synaptic plasticity.
3640	This gene encodes a brain-specific member of the copine family, which is composed of calcium-dependent membrane-binding proteins. The gene product contains two N-terminal C2 domains, and one von Willebrand factor A domain. It may have a role in synaptic plasticity.
3641	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. The immunoproteasome contains an alternate regulator, referred to as the 11S regulator or PA28, that replaces the 19S regulator. Three subunits (alpha, beta and gamma) of the 11S regulator have been identified. This gene encodes the alpha subunit of the 11S regulator, one of the two 11S subunits that is induced by gamma-interferon. Three alpha and three beta subunits combine to form a heterohexameric ring. Two transcripts encoding different isoforms have been identified.
3642	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. The immunoproteasome contains an alternate regulator, referred to as the 11S regulator or PA28, that replaces the 19S regulator. Three subunits (alpha, beta and gamma) of the 11S regulator have been identified. This gene encodes the alpha subunit of the 11S regulator, one of the two 11S subunits that is induced by gamma-interferon. Three alpha and three beta subunits combine to form a heterohexameric ring. Two transcripts encoding different isoforms have been identified.
3643	Transcription of protein-coding genes can be reconstituted on naked DNA with only the general transcription factors and RNA polymerase II. However, this minimal system cannot transcribe DNA packaged into chromatin, indicating that accessory factors may facilitate access to DNA. One such factor, FACT (facilitates chromatin transcription), interacts specifically with histones H2A/H2B to effect nucleosome disassembly and transcription elongation. FACT is composed of an 80 kDa subunit and a 140 kDa subunit, the latter of which is the protein encoded by this gene.
3644	
3645	Apurinic/apyrimidinic (AP) sites occur frequently in DNA molecules by spontaneous hydrolysis, by DNA damaging agents or by DNA glycosylases that remove specific abnormal bases. AP sites are pre-mutagenic lesions that can prevent normal DNA replication so the cell contains systems to identify and repair such sites. Class II AP endonucleases cleave the phosphodiester backbone 5@#$%& to the AP site. This gene encodes the major AP endonuclease in human cells. Splice variants have been found for this gene; all encode the same protein.
3646	Apurinic/apyrimidinic (AP) sites occur frequently in DNA molecules by spontaneous hydrolysis, by DNA damaging agents or by DNA glycosylases that remove specific abnormal bases. AP sites are pre-mutagenic lesions that can prevent normal DNA replication so the cell contains systems to identify and repair such sites. Class II AP endonucleases cleave the phosphodiester backbone 5@#$%& to the AP site. This gene encodes the major AP endonuclease in human cells. Splice variants have been found for this gene; all encode the same protein.
3647	The protein encoded by this gene is a member of the PP2C family of Ser/Thr protein phosphatases. PP2C family members are known to be negative regulators of cell stress response pathways. This phosphatase dephosphorylates, and negatively regulates the activities of, MAP kinases and MAP kinase kinases. It has been shown to inhibit the activation of p38 and JNK kinase cascades induced by environmental stresses. This phosphatase can also dephosphorylate cyclin-dependent kinases, and thus may be involved in cell cycle control. Overexpression of this phosphatase is reported to activate the expression of the tumor suppressor gene TP53/p53, which leads to G2/M cell cycle arrest and apoptosis. Three alternatively spliced transcript variants encoding two distinct isoforms have been described.
3648	
3649	
3650	
3651	
3652	This gene product is a chymotryptic serine proteinase that belongs to the peptidase family S1. It is expressed in mast cells and thought to function in the degradation of the extracellular matrix, the regulation of submucosal gland secretion, and the generation of vasoactive peptides. In the heart and blood vessels, this protein, rather than angiotensin converting enzyme, is largely responsible for converting angiotensin I to the vasoactive peptide angiotensin II. Angiotensin II has been implicated in blood pressure control and in the pathogenesis of hypertension, cardiac hypertrophy, and heart failure. Thus, this gene product is a target for cardiovascular disease therapies. This gene maps to 14q11.2 in a cluster of genes encoding other proteases.
3653	
3654	The protein encoded by this gene is a secreted, extracellular matrix protein containing an Arg-Gly-Asp (RGD) motif and calcium-binding EGF-like domains. It promotes adhesion of endothelial cells through interaction of integrins and the RGD motif. It is prominently expressed in developing arteries but less so in adult vessels. However, its expression is reinduced in balloon-injured vessels and atherosclerotic lesions, notably in intimal vascular smooth muscle cells and endothelial cells. Therefore, the protein encoded by this gene may play a role in vascular development and remodeling.
3655	
3656	
3657	The protein encoded by this gene is a glycoprotein and a member of the NADPH oxidase family. The synthesis of thyroid hormone is catalyzed by a protein complex located at the apical membrane of thyroid follicular cells. This complex contains an iodide transporter, thyroperoxidase, and a peroxide generating system that includes this encoded protein and DUOX1. This protein is known as dual oxidase because it has both a peroxidase homology domain and a gp91phox domain.
3658	
3659	
3660	Major alterations in the composition of the cartilage extracellular matrix occur in joint disease, such as osteoarthrosis. The synthesis of cartilage intermediate layer protein (CILP), which was identified and purified from human articular cartilage, increases in early osteoarthrosis cartilage.  The C-terminal 460 amino acids of the protein show 90% similarity to the pig ectonucleotide pyrophosphohydrolase NTPPHase; this region is preceded by a furin protease consensus cleavage site. Thus, the CILP gene is thought to encode a protein precursor for 2 different proteins, namely CILP and a homolog of NTPPHase.
3661	The protein encoded by this gene belongs to the subtilisin-like proprotein convertase family. The members of this family are proprotein convertases that process latent precursor proteins into their biologically active products. This encoded protein is a calcium-dependent serine endoprotease that can cleave precursor protein at their paired basic amino acid processing sites. Some of its substrates are - transforming growth factor beta related proteins, proalbumin, and von Willebrand factor. This gene is thought to play a role in tumor progression. Alternatively spliced transcript variants encoding different isoforms have been identified.
3662	This enzyme catalyzes O-glycan branch synthesis of the core 2 and core 4 type in mucins and controls expression of core 2 branched oligosaccharides and I antigens on the cell surface.
3663	
3664	
3665	
3666	This gene encodes a member of the BCL-2 protein family. The proteins of this family form hetero- or homodimers and act as anti- and pro-apoptotic regulators that are involved in a wide variety of cellular activities such as embryonic development, homeostasis and tumorigenesis. The protein encoded by this gene is able to reduce the release of pro-apoptotic cytochrome c from mitochondria and block caspase activation. This gene is a direct transcription target of NF-kappa B in response to inflammatory mediators, and has been shown to be up-regulated by different extracellular signals, such as granulocyte-macrophage colony-stimulating factor (GM-CSF), CD40, phorbol ester and inflammatory cytokine TNF and IL-1, which suggests a cytoprotective function that is essential for lymphocyte activation as well as cell survival.
3667	The protein encoded by this gene catalyzes the fourth enzymatic step, the ubiquinone-mediated oxidation of dihydroorotate to orotate, in de novo pyrimidine biosynthesis. This protein is a mitochondrial protein located on the outer surface of the inner mitochondrial membrane.
3668	
3669	
3670	
3671	
3672	
3673	
3674	
3675	
3676	
3677	
3678	The enzyme mevalonate pyrophosphate decarboxylase catalyzes the conversion of mevalonate pyrophosphate into isopentenyl pyrophosphate in one of the early steps in cholesterol biosynthesis. It decarboxylates and dehydrates its substrate while hydrolyzing ATP.
3679	
3680	
3681	
3682	
3683	
3684	
3685	The protein encoded by this gene is a member of the E2F family of transcription factors. The E2F family plays a crucial role in the control of cell cycle and action of tumor suppressor proteins and is also a target of the transforming proteins of small DNA tumor viruses. The E2F proteins contain several evolutionally conserved domains found in most members of the family. These domains include a DNA binding domain, a dimerization domain which determines interaction with the differentiation regulated transcription factor proteins (DP), a transactivation domain enriched in acidic amino acids, and a tumor suppressor protein association domain which is embedded within the transactivation domain. This protein binds to all three of the  tumor suppressor proteins pRB, p107 and p130, but with higher affinity to the last two. It plays an important role in the suppression of proliferation-associated genes, and its gene mutation and increased expression may be associated with human cancer.
3686	This gene belongs to the chemokine-like factor gene superfamily, a novel family that links the chemokine gene family and the transmembrane 4 superfamily. The protein encoded by this gene may play an important role in testicular development.
3687	
3688	The protein encoded by this gene belongs to the CDK subfamily of the Ser/Thr protein kinase family. The CDK subfamily members are highly similar to the gene products of S. cerevisiae cdc28, and S. pombe cdc2, and are known to be essential for cell cycle progression. This kinase has been shown to play a role in cellular proliferation. Its function is limited to cell cycle G2-M phase. At least three alternatively spliced transcript variants encoding different isoforms have been reported, two of which contain multiple non-AUG translation initiation sites.
3689	The protein encoded by this gene belongs to the CDK subfamily of the Ser/Thr protein kinase family. The CDK subfamily members are highly similar to the gene products of S. cerevisiae cdc28, and S. pombe cdc2, and are known to be essential for cell cycle progression. This kinase has been shown to play a role in cellular proliferation. Its function is limited to cell cycle G2-M phase. At least three alternatively spliced transcript variants encoding different isoforms have been reported, two of which contain multiple non-AUG translation initiation sites.
3690	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein binds to the mRNA and remains bound after nuclear export, acting as a nucleocytoplasmic shuttling protein. This protein contains many serine residues. Two splice variants have been found for this gene; both variants encode the same protein.
3691	
3692	Ras homolog, or Rho, proteins interact with protein kinases and may serve as targets for activated GTPase. They play a critical role in muscle differentiation. The protein encoded by this gene binds GTP and is a member of the small GTPase superfamily. It is involved in endosome dynamics and reorganization of the actin cytoskeleton, and it may coordinate membrane transport with the function of the cytoskeleton.
3693	This gene encodes a type II classical cadherin from the cadherin superfamily, integral membrane proteins that mediate calcium-dependent cell-cell adhesion. Mature cadherin proteins are composed of a large N-terminal extracellular domain, a single membrane-spanning domain, and a small, highly conserved C-terminal cytoplasmic domain. Type II (atypical) cadherins are defined based on their lack of a HAV cell adhesion recognition sequence specific to type I cadherins. Expression of this particular cadherin in osteoblastic cell lines, and its upregulation during differentiation, suggests a specific function in bone development and maintenance. Two splice variants have been identified, one of which encodes an isoform with a truncated cytoplasmic domain.
3694	The protein encoded by this gene is similar to a mitochondrial di-iron containing hydroxylase in Saccharomyces cerevisiae that is involved with ubiquinone biosynthesis. Mutations in the yeast gene lead to slower development and longer life span. Transcript variants have been described for this gene, but their full-length sequences have not been determined.
3695	
3696	There are at least two protein isoforms of the Double C2 protein,  namely alpha (DOC2A) and beta (DOC2B), which contain two C2-like domains.  DOC2A and DOC2B are encoded by different genes; these genes are at times confused with the unrelated DAB2 gene which was initially named DOC-2. DOC2A is mainly expressed in brain and is suggested to be involved in Ca(2+)-dependent neurotransmitter release.
3697	
3698	
3699	
3700	
3701	
3702	
3703	
3704	
3705	Aminoacyl-tRNA synthetases are a class of enzymes that charge tRNAs with their cognate amino acids.  Lysyl-tRNA synthetase is a homodimer localized to the cytoplasm which belongs to the class II family of tRNA synthetases.  It has been shown to be a target of autoantibodies in the human autoimmune diseases, polymyositis or dermatomyositis
3706	
3707	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene contains introns and its mRNA is poyadenylated, unlike most histone genes. The protein encoded is a member of the histone H3 family.
3708	Arylalkylamine N-acetyltransferase belongs to the superfamily of acetyltransferases. It is the penultimate enzyme in melatonin synthesis and controls the night/day rhythm in melatonin production in the vertebrate pineal gland. Melatonin is essential for seasonal reproduction, modulates the function of the circadian clock in the suprachiasmatic nucleus, and influences activity and sleep. This enzyme is rapidly inactivated when animals are exposed to light at night. This protein is 80% identical to sheep and rat AA-NAT. Arylalkylamine N-acetyltransferase may contribute a multifactorial genetic diseases such as altered behavior in sleep/wake cycle.
3709	
3710	
3711	
3712	
3713	
3714	
3715	This gene encodes a member of the Asp-Glu-Ala-Asp (DEAD) box protein family. Members of this protein family are putative RNA helicases, and are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. Two transcript variants encoding the same protein have been identified for this gene.
3716	The protein encoded by this gene is a member of the intercellular adhesion molecule (ICAM) family. All ICAM proteins are type I transmembrane glycoproteins, contain 2-9 immunoglobulin-like C2-type domains, and bind to the leukocyte adhesion LFA-1 protein. This protein may play a role in lymphocyte recirculation by blocking LFA-1-dependent cell adhesion. It mediates adhesive interactions important for antigen-specific immune response, NK-cell mediated clearance, lymphocyte recirculation, and other cellular interactions important for immune response and surveillance.
3717	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes a protein which is not a structural subunit, but may be a heme A biosynthetic enzyme involved in COX formation, according to the yeast mutant studies. However, the studies in Rhodobacter sphaeroides suggest that this gene is not required for heme A biosynthesis, but required for stable formation of the Cu(B) and magnesium centers of COX. This human protein is predicted to contain a transmembrane domain localized in the mitochondrial inner membrane. A pseudogene COX11P has been found on chromosome 6.
3718	The B9 protein function is not known. Identification of a conserved mouse homolog and representation in the EST database indicates that this transcript does represent a gene. This gene is located within the Smith-Magenis syndrome region on chromosome 17.
3719	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. This protein functions as a modulator of ATP-sensitive potassium channels and insulin release. Mutations and deficiencies in this protein have been observed in patients with hyperinsulinemic hypoglycemia of infancy, an autosomal recessive disorder of unregulated and high insulin secretion. Mutations have also been associated with non-insulin-dependent diabetes mellitus type II, an autosomal dominant disease of defective insulin secretion. Alternative splicing of this gene has been observed; however, the transcript variants have not yet been fully described.
3720	
3721	
3722	
3723	NME1 was identified because of its reduced mRNA transcript levels in highly metastatic cells. NME1 encodes the @#$%&A@#$%& isoform of nucleoside diphosphate kinase (NDK). NDK exists as a hexamer composed of the @#$%&A@#$%& (NME1) and @#$%&B@#$%& (encoded by NME2) isoforms. Mutations in NME1 have been identified in aggressive neuroblastomas. Two transcript variants encoding different isoforms have been found for this gene.
3724	
3725	
3726	This gene is one of several CC cytokine genes clustered on the q-arm of chromosome 17. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene functions as a chemoattractant for blood monocytes, memory T helper cells and eosinophils. It causes the release of histamine from basophils and activates eosinophils. This cytokine is one of the major HIV-suppressive factors produced by CD8+ cells. It functions as one of the natural ligands for the chemokine receptor CCR5 and it suppresses in vitro replication of the R5 strains of HIV-1, which use CCR5 as a coreceptor.
3727	
3728	
3729	
3730	This gene encodes a NIMA-related kinase. Members of this serine/threonine protein kinase family are structurally-related to NIMA (never in mitosis, gene A) which controls mitotic signaling in Aspergillus nidulans.
3731	
3732	
3733	
3734	The nuclear pore complex is a massive structure that extends across the nuclear envelope, forming a gateway that regulates the flow of macromolecules between the nucleus and the cytoplasm. Nucleoporins, a family of 50 to 100 proteins, are the main components of the nuclear pore complex in eukaryotic cells. The protein encoded by this gene belongs to the nucleoporin family and is associated with the oncogenic nucleoporin CAN/Nup214 in a dynamic subcomplex. This protein is also overexpressed in a large number of malignant neoplasms and precancerous dysplasias.
3735	
3736	
3737	
3738	The protein encoded by this gene is a heat stable, nuclear protein and functions as a molecular chaperone. It is thought to regulate dimerization, DNA binding, and transcriptional activity of basic region-leucine zipper (bZIP) proteins.
3739	
3740	
3741	This gene encodes an integral membrane protein that transports the neurotransmitter serotonin from synaptic spaces into presynaptic neurons. The encoded protein terminates the action of serotonin and recycles it in a sodium-dependent manner. This protein is a target of psychomotor stimulants, such as amphetamines and cocaine, and is a member of the sodium:neurotransmitter symporter family. A repeat length polymorphism in the promoter of this gene has been shown to affect the rate of serotonin uptake and may play a role in sudden infant death syndrome, aggressive behavior in Alzheimer disease patients, and depression-susceptibility in people experiencing emotional trauma.
3742	
3743	This gene encodes acid alpha-glucosidase, which is essential for the degradation of glycogen to glucose in lysosomes. Different forms of acid alpha-glucosidase are obtained by proteolytic processing. Defects in this gene are the cause of glycogen storage disease II, also known as Pompe@#$%&s disease, which is an autosomal recessive disorder with a broad clinical spectrum.
3744	
3745	
3746	
3747	
3748	
3749	
3750	
3751	Desmosomes are cell-cell junctions between epithelial, myocardial, and certain other cell types. Desmoglein 3 is a calcium-binding transmembrane glycoprotein component of desmosomes in vertebrate epithelial cells. Currently, three desmoglein subfamily members have been identified and all are members of the cadherin cell adhesion molecule superfamily. These desmoglein gene family members are located in a cluster on chromosome 18. This protein has been identified as the autoantigen of the autoimmune skin blistering disease pemphigus vulgaris.
3752	
3753	
3754	This gene is a type II classical cadherin from the cadherin superfamily. The encoded membrane protein is a calcium dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. Type II (atypical) cadherins are defined based on their lack of a HAV cell adhesion recognition sequence specific to type I cadherins. Cadherins mediate cell-cell binding in a homophilic manner, contributing to the sorting of heterogeneous cell types and the maintenance of orderly structures. Alternative splicing in the 5@#$%& UTR of this gene yields variant transcripts encoding the same protein.
3755	
3756	
3757	
3758	
3759	
3760	This gene encodes a zinc finger protein containing five C2H2-type zinc finger domains, one or two Kruppel-associated box A (KRAB A) domains, and a leucine-rich domain. The encoded protein has been suggested to be a transcriptional repressor. It localizes predominantly to the nucleolus. Alternatively spliced transcript variants encoding different isoforms exist. These variants utilize alternative polyadenylation signals.
3761	
3762	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to the cytoplasm. The presence of potential DNA-binding and dimerization-transactivation domains suggests that this protein may act as a transcription factor, similar to several other members of the TRIM family. Expression of the gene is upregulated in response to estrogen, and it is thought to mediate estrogen actions in breast cancer as a primary response gene.
3763	
3764	
3765	
3766	The protein encoded by this gene is a member of the intercellular adhesion molecule (ICAM) family. All ICAM proteins are type I transmembrane glycoproteins, contain 2-9 immunoglobulin-like C2-type domains, and bind to the leukocyte adhesion LFA-1 protein. This protein is constitutively and abundantly expressed by all leucocytes and may be the most important ligand for LFA-1 in the initiation of the immune response. It functions not only as an adhesion molecule, but also as a potent signalling molecule.
3767	ICAM1 (CD54) is typically expressed on endothelial cells and cells of the immune system.  ICAM1 binds to integrins of type CD11a / CD18, or CD11b / CD18.  ICAM1 is also exploited by Rhinovirus as a receptor.
3768	
3769	
3770	T-cell activation requires 2 signals: recognition of antigen by the T-cell receptor (see TCR; MIM 186880) and a costimulatory signal provided primarily by CD28 (MIM 186760) in naive T cells. HSH2 is a target of both of these signaling pathways (Greene et al., 2003).[supplied by OMIM]
3771	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L36E family of ribosomal proteins. It is located in the cytoplasm. Transcript variants derived from alternative splicing exist; they encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
3772	
3773	
3774	Hook proteins are cytosolic coiled-coil proteins that contain conserved N-terminal domains, which attach to microtubules, and more divergent C-terminal domains, which mediate binding to organelles. The Drosophila Hook protein is a component of the endocytic compartment.[supplied by OMIM]
3775	
3776	
3777	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. Its expression is up-regulated by estrogens and progestins. The encoded protein is secreted and may be involved in desquamation in the epidermis.
3778	This gene encodes a member of the C-type lectin superfamily. The encoded protein is a secreted sulfated glycoprotein and functions as a growth factor for primitive hematopoietic progenitor cells. An alternative splice variant has been described but its biological nature has not been determined.
3779	
3780	
3781	
3782	
3783	
3784	N-methyl-D-aspartate (NMDA) receptors are a class of ionotropic glutamate receptors.  NMDA channel has been shown to be involved in long-term potentiation, an activity-dependent increase in the efficiency of synaptic transmission thought to underlie certain kinds of memory and learning.  NMDA receptor channels are heteromers composed of the key receptor subunit NMDAR1 (GRIN1) and 1 or more of the 4 NMDAR2 subunits: NMDAR2A (GRIN2A), NMDAR2B (GRIN2B), NMDAR2C (GRIN2C), and NMDAR2D (GRIN2D).
3785	
3786	The galectins are a family of beta-galactoside-binding proteins implicated in modulating cell-cell and cell-matrix interactions. LGALS3BP has been found elevated in the serum of patients with cancer and in those infected by the human immunodeficiency virus (HIV). It appears to be implicated in immune response associated with natural killer (NK) and lymphokine-activated killer (LAK) cell cytotoxicity. Using fluorescence in situ hybridization the full length 90K cDNA has been localized to chromosome 17q25. The native protein binds specifically to a human macrophage-associated lectin known as Mac-2 and also binds galectin 1.
3787	
3788	The protein encoded by this gene is the alpha-3 subunit of one of several alpha/beta-subunit heterodimeric transcription factors that regulate many adaptive responses to low oxygen tension (hypoxia). The alpha-3 subunit lacks the transactivation domain found in factors containing either the alpha-1 or alpha-2 subunits. It is thought that factors containing the alpha-3 subunit are negative regulators of hypoxia-inducible gene expression. At least four transcript variants encoding four different isoforms have been found for this gene.
3789	
3790	The protein encoded by this gene is an endothelial cell adhesion molecule that interacts preferentially with the leukocyte beta7 integrin LPAM-1 (alpha4beta7), L-selectin, and VLA-4 (alpha4beta1) on myeloid cells to direct leukocytes into mucosal and inflamed tissues. It is a member of the immunoglobulin family and is similar to ICAM1 and VCAM1. At least seven alternatively spliced transcripts encoding different protein isoforms have been found for this gene, but the full-length nature of some variants has not been determined.
3791	
3792	
3793	
3794	
3795	
3796	The C-terminus of the protein encoded by this gene binds topoisomerase I. The N-terminus contains a proline-rich region and a BTB/POZ domain (broad-complex, Tramtrack and bric a brac/Pox virus and Zinc finger), both of which are typically involved in protein-protein interactions. Subcellularly, the protein localizes to cytoplasmic bodies.
3797	Azurophil granules, specialized lysosomes of the neutrophil, contain at least 10 proteins implicated in the killing of microorganisms. The protein encoded by this gene is an azurophil granule antibiotic protein, with monocyte chemotactic and antibacterial activity. It is also an important multifunctional inflammatory mediator. This encoded protein is a member of the serine protease gene family but it is not a serine proteinase, because the active site serine and histidine residues are replaced. The genes encoding this protein, neutrophil elastase 2, and proteinase 3 are in a cluster located at chromosome 19pter. All 3 genes are expressed coordinately and their protein products are packaged together into azurophil granules during neutrophil differentiation.
3798	
3799	
3800	Sm-like proteins were identified in a variety of organisms based on sequence homology with the Sm protein family (see SNRPD2; MIM 601061). Sm-like proteins contain the Sm sequence motif, which consists of 2 regions separated by a linker of variable length that folds as a loop. The Sm-like proteins are thought to form a stable heteromer present in tri-snRNP particles, which are important for pre-mRNA splicing.[supplied by OMIM]
3801	
3802	This gene encodes a member of the basic helix-loop-helix leucine zipper family, and can function as a cellular transcription factor. The encoded protein can activate transcription through pyrimidine-rich initiator (Inr) elements and E-box motifs. Two transcript variants encoding distinct isoforms have been identified for this gene.
3803	Neurocan is a chondroitin sulfate proteoglycan thought to be involved in the modulation of cell adhesion and migration.[supplied by OMIM]
3804	
3805	This gene encodes a member of the hydratase/isomerase superfamily. The gene product shows high sequence similarity to enoyl-coenzyme A (CoA) hydratases of several species, particularly within a conserved domain characteristic of these proteins. The encoded protein, which contains a C-terminal peroxisomal targeting sequence, localizes to the peroxisome. The rat ortholog, which localizes to the matrix of both the peroxisome and mitochondria, can isomerize 3-trans,5-cis-dienoyl-CoA to 2-trans,4-trans-dienoyl-CoA, indicating that it is a delta3,5-delta2,4-dienoyl-CoA isomerase. This enzyme functions in the auxiliary step of the fatty acid beta-oxidation pathway. Expression of the rat gene is induced by peroxisome proliferators.
3806	
3807	
3808	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities and are involved in a variety of biological processes including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. The mouse homolog of this gene was found to be preferentially expressed in the inner root sheath of the hair follicle, which suggested a role in hair development.
3809	
3810	
3811	
3812	
3813	
3814	
3815	
3816	The inhibin beta B subunit joins the alpha subunit to form a pituitary FSH secretion inhibitor.  Inhibin has been shown to regulate gonadal stromal cell proliferation negatively and to have tumour-suppressor activity.  In addition, serum levels of inhibin have been shown to reflect the size of granulosa-cell tumors and can therefore be used as a marker for primary as well as recurrent disease.   Because expression in gonadal and various extragonadal tissues may vary severalfold in a tissue-specific fashion, it is proposed that inhibin may be both a growth/differentiation factor and a hormone.  Furthermore, the beta B subunit forms a homodimer, activin B, and also joins with the beta A subunit to form a heterodimer, activin AB, both of which stimulate FSH secretion.
3817	
3818	
3819	
3820	
3821	
3822	Aldehyde oxidase produces hydrogen peroxide and, under certain conditions, can catalyze the formation of superoxide. Aldehyde oxidase is a candidate gene for amyotrophic lateral sclerosis.
3823	
3824	
3825	Microtubules of the eukaryotic cytoskeleton perform essential and diverse functions and are composed of a heterodimer of alpha and beta tubulin. The genes encoding these microtubule constituents are part of the tubulin superfamily, which is composed of six distinct families. Genes from the alpha, beta and gamma tubulin families are found in all eukaryotes. The alpha and beta tubulins represent the major components of microtubules, while gamma tubulin plays a critical role in the nucleation of microtubule assembly. There are multiple alpha and beta tubulin genes and they are highly conserved among and between species. This gene encodes an alpha tubulin that is a highly conserved homolog of a rat testis-specific alpha tubulin.
3826	Microtubules of the eukaryotic cytoskeleton perform essential and diverse functions and are composed of a heterodimer of alpha and beta tubulin. The genes encoding these microtubule constituents are part of the tubulin superfamily, which is composed of six distinct families. Genes from the alpha, beta and gamma tubulin families are found in all eukaryotes. The alpha and beta tubulins represent the major components of microtubules, while gamma tubulin plays a critical role in the nucleation of microtubule assembly. There are multiple alpha and beta tubulin genes and they are highly conserved among and between species. This gene encodes an alpha tubulin that is a highly conserved homolog of a rat testis-specific alpha tubulin.
3827	The human BCS1L gene encodes a homolog of S. cerevisiae bcs1 protein involved in the assembly of complex III of the mitochondrial respiratory chain.[supplied by OMIM]
3828	
3829	
3830	
3831	
3832	
3833	
3834	
3835	N-acetylglucosamine kinase (NAGK; EC 2.7.1.59) converts endogenous N-acetylglucosamine (GlcNAc), a major component of complex carbohydrates, from lysosomal degradation or nutritional sources into GlcNAc 6-phosphate. NAGK belongs to the group of N-acetylhexosamine kinases and is a prominent salvage enzyme of amino sugar metabolism in mammals.[supplied by OMIM]
3836	POLE4 is a histone-fold protein that interacts with other histone-fold proteins to bind DNA in a sequence-independent manner. These histone-fold protein dimers combine within larger enzymatic complexes for DNA transcription, replication, and packaging.[supplied by OMIM]
3837	NMYC interactor (NMI) encodes a protein that interacts with NMYC and CMYC (two members of the oncogene Myc family), and other transcription factors containing a Zip, HLH, or HLH-Zip motif. The NMI protein also interacts with all STATs except STAT2 and augments STAT-mediated transcription in response to cytokines IL2 and IFN-gamma. The NMI mRNA has low expression levels in all human fetal and adult tissues tested except brain and has high expression in cancer cell line-myeloid leukemias.
3838	
3839	
3840	
3841	
3842	
3843	The TRAF (tumor necrosis factor receptor-associated factor) family of proteins associate with and transduce signals from members of the tumor necrosis factor receptor superfamily. The protein encoded by this gene is found in the cytoplasm and can bind to TRAF1, TRAF2, or TRAF3, thereby inhibiting TRAF function by sequestering the TRAFs in a latent state in the cytoplasm. For example, the protein encoded by this gene can block TRAF2 binding to LMP1, the Epstein-Barr virus transforming protein, and inhibit LMP1-mediated NF-kappa-B activation. Two transcript variants encoding different isoforms have been found for this gene.
3844	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein.
3845	
3846	
3847	
3848	
3849	
3850	
3851	This gene, isolated using the differential display method to detect tissue-specific genes, is specifically expressed in kidney and liver. The encoded protein shows amino acid sequence similarity to three different bacterial acetyltransferases involved in drug resistance, however, its exact function is not known.
3852	
3853	
3854	
3855	
3856	
3857	
3858	CpG methylation is an epigenetic modification that is important for embryonic development, imprinting, and X-chromosome inactivation. Studies in mice have demonstrated that DNA methylation is required for mammalian development. This gene encodes a DNA methyltransferase that is thought to function in de novo methylation, rather than maintenance methylation. The protein localizes to the cytoplasm and nucleus and its expression is developmentally regulated. Alternative splicing results in multiple transcript variants encoding different isoforms.
3859	The cytoplasmic domains of integrins are essential for cell adhesion. The protein encoded by this gene binds to the beta1 integrin cytoplasmic domain. The interaction between this protein and beta1 integrin is highly specific. Two isoforms of this protein are derived from alternatively spliced transcripts. The shorter form of this protein does not interact with the beta1 integrin cytoplasmic domain. The longer form is a phosphoprotein and the extent of its phosphorylation is regulated by the cell-matrix interaction, suggesting an important role of this protein during integrin-dependent cell adhesion.
3860	
3861	
3862	The protein encoded by this gene appears to be involved in pre-mRNA splicing, possibly acting as a bridging factor between U5 and U4/U6 snRNPs in formation of the spliceosome. The encoded protein also can bind androgen receptor, providing a link between transcriptional activation and splicing.
3863	
3864	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members. The specific function of this protein has not been determined. Two transcript variants encoding distinct isoforms have been identified for this gene.
3865	
3866	
3867	The bone morphogenetic proteins (BMPs) are a family of secreted signaling molecules that can induce ectopic bone growth. Many BMPs are part of the transforming growth factor-beta (TGFB) superfamily. BMPs were originally identified by an ability of demineralized bone extract to induce endochondral osteogenesis in vivo in an extraskeletal site. Based on its expression early in embryogenesis, the BMP encoded by this gene has a proposed role in early development. In addition, the fact that this BMP is closely related to BMP5 and BMP7 has lead to speculation of possible bone inductive activity.
3868	Mammalian cytochrome c oxidase (COX) catalyzes the transfer of reducing equivalents from cytochrome c to molecular oxygen and pumps protons across the inner mitochondrial membrane. In yeast, 2 related COX assembly genes, SCO1 and SCO2 (synthesis of cytochrome c oxidase), enable subunits 1 and 2 to be incorporated into the holoprotein. This gene is the human homolog to the yeast SCO1 gene.
3869	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds the immunosuppressants FK506 and rapamycin. It interacts with several intracellular signal transduction proteins including type I TGF-beta receptor. It also interacts with multiple intracellular calcium release channels including the tetrameric skeletal muscle ryanodine receptor. In mouse, deletion of this homologous gene causes congenital heart disorder known as noncompaction of left ventricular myocardium. There is evidence of multiple alternatively spliced transcript variants for this gene, but the full length nature of some variants has not been determined.
3870	The protein encoded by this gene is one of several nuclear proteins that are found in common among U1, U2, U4/U6, and U5 small ribonucleoprotein particles (snRNPs). These snRNPs are involved in pre-mRNA splicing, and the encoded protein may also play a role in pre-mRNA splicing or snRNP structure. Autoantibodies from patients with systemic lupus erythematosus frequently recognize epitopes on the encoded protein. Two transcript variants encoding different isoforms (B and B@#$%&) have been found for this gene.
3871	
3872	Breakdown products of phosphoinositides are ubiquitous second messengers that function downstream of many G protein-coupled receptors and tyrosine kinases regulating cell growth, calcium metabolism, and protein kinase C activity. This gene encodes an enzyme which regulates the amount of phosphatidylinositol available for signaling by catalyzing the conversion of phosphatidic acid to CDP-diacylglycerol. This enzyme is an integral membrane protein localized to two subcellular domains, the matrix side of the inner mitochondrial membrane where it is thought to be involved in the synthesis of phosphatidylglycerol and cardiolipin and the cytoplasmic side of the endoplasmic reticulum where it functions in phosphatidylinositol biosynthesis. Two genes encoding this enzyme have been identified in humans, one mapping to human chromosome 4q21 and a second to 20p13.
3873	This gene is one of three related genes that have 2-hydroxyacid oxidase activity yet differ in encoded protein amino acid sequence, tissue expression and substrate preference. Subcellular location of the encoded protein is the peroxisome. Specifically, this gene is expressed primarily in liver and pancreas and the encoded protein is most active on glycolate, a two-carbon substrate. The protein is also active on 2-hydroxy fatty acids. The transcript detected at high levels in pancreas may represent an alternatively spliced form or the use of a multiple near-consensus upstream polyadenylation site.
3874	
3875	
3876	
3877	The protein encoded by this gene belongs to the non-receptor class of the protein-tyrosine phosphatase family. The encoded protein has both protein-tyrosine phophatase activity and serine/threonine-specific phosphatase activity, and therefore is known as a dual specificity phosphatase. Three transcript variants encoding two different isoforms have been found for this gene.
3878	Netrin is included in a family of laminin-related secreted proteins.  The function of this gene has not yet been defined; however, netrin is thought to be involved in axon guidance and cell migration during development.  Mutations and loss of expression of netrin suggest that variation in netrin may be involved in cancer development.
3879	The protein encoded by this gene may be involved in the innate immune response to bacterial exposure in the mouth, nasal cavities, and lungs. The encoded protein is secreted and is a member of the BPI/LBP/PLUNC protein superfamily. This gene is found with other members of the superfamily in a cluster on chromosome 20.
3880	MAP1A and MAP1B are microtubule-associated proteins which mediate the physical interactions between microtubules and components of the cytoskeleton. MAP1A and MAP1B each consist of a heavy chain subunit and multiple light chain subunits. The protein encoded by this gene is one of the light chain subunits and can associate with either MAP1A or MAP1B. Two transcript variants encoding different isoforms have been found for this gene.
3881	Erythrocyte membrane protein band 4.1 (EPB41) is a multifunctional protein that mediates interactions between the erythrocyte cytoskeleton and the overlying plasma membrane. The protein encoded by this gene is a neuronally-enriched protein that is structurally similar to EPB41. The encoded protein binds and stabilizes D2 and D3 dopamine receptors at the neuronal plasma membrane. Multiple transcript variants encoding different isoforms have been found for this gene, but the full-length nature of only two of them has been determined.
3882	
3883	Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. It is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, F0, which comprises the proton channel. The F1 complex consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled in a ratio of 3 alpha, 3 beta, and a single representative of the other 3. The F0 seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the F6 subunit of the F0 complex, required for F1 and F0 interactions. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
3884	The protein encoded by this gene binds copper and zinc ions and is one of two isozymes responsible for destroying free superoxide radicals in the body. The encoded isozyme is a soluble cytoplasmic protein, acting as a homodimer to convert naturally-occuring but harmful superoxide radicals to molecular oxygen and hydrogen peroxide. The other isozyme is a mitochondrial protein. Mutations in this gene have been implicated as causes of familial amyotrophic lateral sclerosis. Rare transcript variants have been reported for this gene.
3885	
3886	KCNE1 together with KCNQ1 encode the two subunits of the K+ channel responsible for the delayed-rectifier K+ current in cardiac myocytes (referred to as IsK).
3887	The region of chromosome 21 between genes CBR and ERG (CBR-ERG region), which spans 2.5 Mb on 21q22.2, has been defined by analysis of patients with partial trisomy 21. It contributes significantly to the pathogenesis of many characteristics of Down syndrome, including morphological features, hypotonia, and mental retardation. The DSCR3 (Down syndrome critical region gene 3) gene is found in this region and is predictated to contain eight exons. DSCR3 is expressed in most tissues examined.
3888	
3889	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and kininogens. This gene encodes a stefin that functions as an intracellular thiol protease inhibitor. The protein is able to form a dimer stabilized by noncovalent forces, inhibiting papain and cathepsins l, h and b. The protein is thought to play a role in protecting against the proteases leaking from lysosomes. Evidence indicates that mutations in this gene are responsible for the primary defects in patients with progressive myoclonic epilepsy (EPM1).
3890	
3891	This gene encodes a death agonist that heterodimerizes with either agonist BAX or antagonist BCL2. The encoded protein is a member of the BCL-2 family of cell death regulators. It is a mediator of mitochondrial damage induced by caspase-8 (CASP8); CASP8 cleaves this encoded protein, and the COOH-terminal part translocates to mitochondria where it triggers cytochrome c release. Multiple alternatively spliced transcript variants have been found, but the full-length nature of some variants has not been defined.
3892	
3893	The specific function of this gene has yet to be determined; however, it has been speculated to play a role in transcription regulation and/or chromatin and histone metabolism.  It is considered the primary candidate gene in some haploinsufficiency syndromes such as DiGeorge syndrome, and insufficient production of the gene may disrupt normal embryonic development.  This gene is mapped to 22q11.21, centromeric to COMT.
3894	
3895	This gene was predicted both by automated computational analysis and by similarity to a Drosophila gene and to predicted genes in other species (sheep, chimp, dog, cow). The predicted protein of this gene is similar to Drosophila suppressor of hairy wing protein, a leucine zipper protein which represses the function of transcriptional enhancers of the gypsy retrotransposon.
3896	
3897	
3898	
3899	This gene is a member of the apolipoprotein L gene family. The encoded protein is found in the cytoplasm, where it may affect the movement of lipids or allow the binding of lipids to organelles.
3900	This gene is a member of the apolipoprotein L gene family. The encoded protein is found in the cytoplasm, where it may affect the movement of lipids or allow the binding of lipids to organelles.
3901	Pleckstrin homology, Sec7 and coiled/coil domains 4 (PSCD4) is a member of the PSCD family. Members of this family have identical structural organization that consists of an N-terminal coiled-coil motif, a central Sec7 domain, and a C-terminal pleckstrin homology (PH) domain. The coiled-coil motif is involved in homodimerization, the Sec7 domain contains guanine-nucleotide exchange protein (GEP) activity, and the PH domain interacts with phospholipids and is responsible for association of PSCDs with membranes. Members of this family appear to mediate the regulation of protein sorting and membrane trafficking. The PSCD4 exhibits GEP activity in vitro with both ARF1 and ARF5 but is inactive with ARF6. The PSCD4 and PSCD1 gene structures are very similar.
3902	
3903	Adenylsuccinate lyase  is involved in both de novo synthesis of purines and formation of adenosine monophosphate from inosine monophosphate.  It catalyzes two reactions in AMP biosynthesis: the removal of a fumarate from succinylaminoimidazole carboxamide (SAICA) ribotide to give aminoimidazole carboxamide ribotide (AICA) and removal of fumarate from adenylosuccinate to give AMP. Adenylosuccinase deficiency results in succinylpurinemic autism, psychomotor retardation, and , in some cases, growth retardation associated with muscle wasting and epilepsy.
3904	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure, such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which is a RNA-dependent ATPase, and also a proliferation-associated nuclear antigen, specifically reacting with the simian virus 40 tumor antigen. This gene consists of 13 exons, and alternatively spliced transcripts containing several intron sequences have been detected, but no isoforms encoded by these transcripts have been identified.
3905	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and is known to metabolize as many as 20% of commonly prescribed drugs. Its substrates include debrisoquine, an adrenergic-blocking drug; sparteine and propafenone, both anti-arrythmic drugs; and amitryptiline, an anti-depressant. The gene is highly polymorphic in the population; certain alleles result in the poor metabolizer phenotype, characterized by a decreased ability to metabolize the enzyme@#$%&s substrates. The gene is located near two cytochrome P450 pseudogenes on chromosome 22q13.1.
3906	
3907	
3908	The protein encoded by this gene belongs to the innexin family. Innexin family members are the structural components of gap junctions. This protein and pannexin 1 are abundantly expressed in central nerve system (CNS) and are coexpressed in various neuronal populations. Studies in Xenopus oocytes suggest that this protein alone and in combination with pannexin 1 may form cell type-specific gap junctions with distinct properties.
3909	
3910	
3911	The protein encoded by this gene, a member of the peptidase C1 family, is a lysosomal cysteine proteinase that appears to be a central coordinator for activation of many serine proteinases in immune/inflammatory cells. It is composed of a dimer of disulfide-linked heavy and light chains, both produced from a single protein precursor. It requires chloride ions for activity and can degrade glucagon. Defects in the encoded protein have been shown to be a cause of Papillon-Lefevre syndrome, an autosomal recessive disorder characterized by palmoplantar keratosis and periodontitis. Two transcript variants encoding different isoforms have been found for this gene.
3912	The signal sequence receptor (SSR) is a glycosylated endoplasmic reticulum (ER) membrane receptor associated with protein translocation across the ER membrane. The SSR is comprised of four membrane proteins/subunits: alpha, beta, gamma, and delta. The first two are glycosylated subunits and the latter two are non-glycosylated subunits. The protein encoded by this gene is the gamma subunit and is predicted to span the membrane four times.
3913	
3914	This gene belongs to subfamily P of the forkhead box (FOX) transcription factor family. Forkhead box transcription factors play important roles in the regulation of tissue- and cell type-specific gene transcription during both development and adulthood. Forkhead box P1 protein contains both DNA-binding- and protein-protein binding-domains. This gene may act as a tumor suppressor as it is lost in several tumor types and maps to a chromosomal region (3p14.1) reported to contain a tumor suppressor gene(s). Alternative splicing results in multiple transcript variants encoding different isoforms.
3915	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and a SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation. Multiple alternatively spliced transcript variants have been described for this gene, but their full-length sequences are not known.
3916	CGGBP1 influences expression of the fragile X mental retardation gene, FMR1 (MIM 309550), by specifically interacting with the 5-prime (CGG)n-3-prime repeat in its 5-prime UTR.[supplied by OMIM]
3917	Eukaryotes contain two protein translational systems, one in the cytoplasm and one in the mitochondria. Mitochondrial translation is crucial for maintaining mitochondrial function and mutations in this system lead to a breakdown in the respiratory chain-oxidative phosphorylation system and to impaired maintenance of mitochondrial DNA. This gene encodes one of the mitochondrial translation elongation factors. Its role in the regulation of normal mitochondrial function and in different disease states attributed to mitochondrial dysfunction is not known.
3918	
3919	
3920	
3921	
3922	Biotinidase functions to recycle biotin in the body by cleaving biocytin (biotin-epsilon-lysine), a normal product of carboxylase degradation, resulting in regeneration of free biotin. Biotinidase has also been shown to have biotinyl-transferase activity. Defects in the biotinidase gene cause multiple carboxylase deficiency.
3923	The protein encoded by this gene is a member of the immunoglobulin superfamily. It is a glycosylphosphatidylinositol (GPI)-anchored neuronal membrane protein that functions as a cell adhesion molecule. It may play a role in the formation of axon connections in the developing nervous system.
3924	
3925	
3926	
3927	
3928	
3929	This gene is a member of the sauvagine/corticotropin-releasing factor/urotensin I family. It is structurally related to the corticotropin-releasing factor (CRF) gene and the encoded product is an endogenous ligand for CRF type 2 receptors. In the brain it may be responsible for the effects of stress on appetite. In spite of the gene family name similarity, the product of this gene has no sequence similarity to urotensin II.
3930	
3931	
3932	
3933	
3934	Alpha2-HS glycoprotein (AHSG), a glycoprotein present in the serum, is synthesized by hepatocytes. The AHSG molecule consists of two polypeptide chains, which are both cleaved from a proprotein encoded from a single mRNA. It is involved in several functions, such as endocytosis, brain development and the formation of bone tissue. The protein is commonly present in the cortical plate of the immature cerebral cortex and bone marrow hemopoietic matrix, and it has therefore been postulated that it participates in the development of the tissues. However, its exact significance is still obscure.
3935	
3936	
3937	The B-lymphocyte activation antigen B7-1 (formerly referred to as B7) provides regulatory signals for T lymphocytes as a consequence of binding to the CD28 (MIM 186760) and CTLA4 (MIM 123890) ligands of T cells.[supplied by OMIM]
3938	MYL3 encodes myosin light chain 3, an alkali light chain also referred to in the literature as both the ventricular isoform and the slow skeletal muscle isoform. Mutations in MYL3 have been identified as a cause of mid-left ventricular chamber type hypertrophic cardiomyopathy.
3939	
3940	The product of this gene belongs to the phosphatase 2 regulatory subunit B family. Protein phosphatase 2 is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes a gamma isoform of the regulatory subunit B55 subfamily. Alternatively spliced transcript variants encoding different isoforms have been identified.
3941	
3942	The protein encoded by this gene belongs to the ubiquitin-conjugating enzyme family. It binds selectively to a large region at the N terminus of huntingtin. This interaction is not influenced by the length of the huntingtin polyglutamine tract. This protein has been implicated in the degradation of huntingtin and suppression of apoptosis.
3943	
3944	In all eukaryotes, the cell cycle is governed by cyclin-dependent protein kinases (CDKs), whose activities are regulated by cyclins and CDK inhibitors in a diverse array of mechanisms that involve the control of phosphorylation and dephosphorylation of Ser, Thr or Tyr residues. Cyclins are molecules that possess a consensus domain called the @#$%&cyclin box.@#$%& In mammalian cells, 9 cyclin species have been identified, and they are referred to as cyclins A through I. Cyclin G is a direct transcriptional target of the p53 tumor suppressor gene product and thus functions downstream of p53. GAK is an association partner of cyclin G and CDK5.
3945	NEIL3 belongs to a class of DNA glycosylases homologous to the bacterial Fpg/Nei family. These glycosylases initiate the first step in base excision repair by cleaving bases damaged by reactive oxygen species and introducing a DNA strand break via the associated lyase reaction (Bandaru et al., 2002).[supplied by OMIM]
3946	
3947	
3948	This gene encodes a member of the non-histone chromosomal high mobility group protein family. The proteins of this family are chromatin-associated and ubiquitously distributed in the nucleus of higher eukaryotic cells. In vitro studies have demonstrated that this protein is able to efficiently bend DNA and form DNA circles.  These studies suggest a role in facilitating cooperative interactions between cis-acting proteins by promoting DNA flexibility. This protein was also reported to be involved in the final ligation step in DNA end-joining processes of DNA double-strand breaks repair and V(D)J recombination.
3949	Sulfotransferase enzymes catalyze the sulfate conjugation of many hormones, neurotransmitters, drugs, and xenobiotic compounds. These cytosolic enzymes are different in their tissue distributions and substrate specificities. The gene structure (number and length of exons) is similar among family members. This gene encodes a protein that transfers a sulfo moiety to and from estrone, which may control levels of estrogen receptors.
3950	This gene encodes the receptor for type 1 gonadotropin-releasing hormone. This receptor is a member of the seven-transmembrane, G-protein coupled receptor (GPCR) family. It is expressed on the surface of pituitary gonadotrope cells as well as lymphocytes, breast, ovary, and prostate. Following binding of gonadotropin-releasing hormone, the receptor associates with G-proteins that activate a phosphatidylinositol-calcium second messenger system. Activation of the receptor ultimately causes the release of gonadotropic luteinizing hormone (LH) and follicle stimulating hormone (FSH). Defects in this gene are a cause of hypogonadotropic hypogonadism (HH). Alternative splicing results in multiple transcript variants encoding different isoforms. More than 18 transcription initiation sites in the 5@#$%& region and multiple polyA signals in the 3@#$%& region have been identified for this gene.
3951	
3952	
3953	
3954	The protein encoded by this gene is an isoform of xylosyltransferase, which belongs to a family of glycosyltransferases. This enzyme transfers xylose from UDP-xylose to specific serine residues of the core protein and initiates the biosynthesis of glycosaminoglycan chains in proteoglycans including chondroitin sulfate, heparan sulfate, heparin and dermatan sulfate. The enzyme activity, which is increased in scleroderma patients, is a diagnostic marker for the determination of sclerotic activity in systemic sclerosis.
3955	
3956	
3957	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This enzyme is a hydroxylase that metabolizes arachidonic acid, docosahexaenoic acid, and other long chain fatty acids.
3958	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This enzyme is a hydroxylase that metabolizes arachidonic acid, docosahexaenoic acid, and other long chain fatty acids.
3959	This gene encodes a protein that is a member of the dickkopf family. The secreted protein contains two cysteine rich regions and is involved in embryonic development through its interactions with the Wnt signaling pathway. It can act as either an agonist or antagonist of Wnt/beta-catenin signaling, depending on the cellular context and the presence of the co-factor kremen 2. Activity of this protein is also modulated by binding to the Wnt co-receptor LDL-receptor related protein 6 (LRP6).
3960	FGF2 (MIM 134920) is a highly conserved, multifunctional heparin-binding growth factor involved in neuroectoderm development, angiogenesis, and wound healing. Elevated levels of FGF2 are associated with proliferation of smooth muscle in atherosclerosis and with proliferation of tumors. The FGF2 antisense gene, NUDT6, may regulate FGF2 expression.[supplied by OMIM]
3961	
3962	
3963	
3964	
3965	
3966	
3967	
3968	Epidermal growth factor has a profound effect on the differentiation of specific cells in vivo and is a potent mitogenic factor for a variety of cultured cells of both ectodermal and mesodermal origin.  The EGF precursor is believed to exist as a membrane-bound molecule which is proteolytically cleaved to generate the 53-amino acid peptide hormone that stimulates cells to divide.
3969	
3970	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
3971	The Golgi complex plays a key role in the sorting and modification of proteins exported from the endoplasmic reticulum. The protein encoded by this gene is a peripheral membrane protein of the Golgi stack and may have a regulatory role in Golgi trafficking. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of these variants has not been determined.
3972	IRX2 is a member of the Iroquois homeobox gene family. Members of this family appear to play multiple roles during pattern formation of vertebrate embryos.[supplied by OMIM]
3973	
3974	Hexosaminidase B is the beta subunit of the lysosomal enzyme beta-hexosaminidase that, together with the cofactor GM2 activator protein, catalyzes the degradation of the ganglioside GM2, and other molecules containing terminal N-acetyl hexosamines.  Beta-hexosaminidase is composed of two subunits, alpha and beta, which are encoded by separate genes.  Both beta-hexosaminidase alpha and beta subunits are members of family 20 of glycosyl hydrolases.  Mutations in the alpha or beta subunit genes lead to an accumulation of GM2 ganglioside in neurons and neurodegenerative disorders termed the GM2 gangliosidoses.  Beta subunit gene mutations lead to Sandhoff disease (GM2-gangliosidosis type II).
3975	This gene encodes a zinc-dependent aminopeptidase that cleaves vasopressin, oxytocin, lys-bradykinin, met-enkephalin, dynorphin A and other peptide hormones. The protein can be secreted in maternal serum, reside in intracellular vesicles with the insulin-responsive glucose transporter GLUT4, or form a type II integral membrane glycoprotein. The protein catalyzes the final step in the conversion of angiotensinogen to angiotensin IV (AT4) and is also a receptor for AT4. Alternative splicing results in multiple transcript variants encoding different isoforms.
3976	The protein encoded by this gene belongs to class III of the pyridoxal-phosphate-dependent mitochondrial aminotransferases.
3977	This gene encodes a protein that is a transmembrane receptor for growth hormone. Binding of growth hormone to the receptor leads to receptor dimerization and the activation of an intra- and intercellular signal transduction pathway leading to growth. A common alternate allele of this gene, called GHRd3, lacks exon three and has been well-characterized. Mutations in this gene have been associated with Laron syndrome, also known as the growth hormone insensitivity syndrome (GHIS), a disorder characterized by short stature. Other splice variants, including one encoding a soluble form of the protein (GHRtr), have been observed but have not been thoroughly characterized.
3978	This gene encodes the beta subunit of the karyopherin receptor complex which interacts with nuclear localization signals to target nuclear proteins to the nucleus. The karyopherin receptor complex is a heterodimer of an alpha subunit which recognizes the nuclear localization signal and a beta subunit which docks the complex at nucleoporins. Alternate splicing of this gene results in two transcript variants encoding the same protein.
3979	
3980	
3981	
3982	
3983	Polyspecific organic cation transporters in the liver, kidney, intestine, and other organs are critical for elimination of many endogenous small organic cations as well as a wide array of drugs and environmental toxins. The encoded protein is an organic cation transporter and plasma integral membrane protein containing eleven putative transmembrane domains as well as a nucleotide-binding site motif. Transport by this protein is at least partially ATP-dependent.
3984	
3985	This gene is reported to be specifically expressed in prostate, rectum and distal colon. Sequence analysis suggests that it may play a regulatory role in the nucleus.
3986	
3987	
3988	This gene is reported to be specifically expressed in prostate, rectum and distal colon. Sequence analysis suggests that it may play a regulatory role in the nucleus.
3989	This gene product is a member of a group of related serine proteases from the cytoplasmic granules of cytotoxic lymphocytes. Cytolytic T lymphocytes (CTL) and natural killer (NK) cells share the remarkable ability to recognize, bind, and lyse specific target cells. They are thought to protect their host by lysing cells bearing on their surface @#$%&nonself@#$%& antigens, usually peptides or proteins resulting from infection by intracellular pathogens. The protein described here lacks consensus sequences for N-glycosylation present in other granzymes.
3990	The inhibitory glycine receptor mediates postsynaptic inhibition in the spinal cord and other regions of the central nervous system. It is a pentameric receptor composed of alpha and beta subunits. The GLRB gene (MIM 138492) encodes the beta subunit of the receptor.[supplied by OMIM]
3991	The protein encoded by this gene is a cytoplasmic actin-binding protein thought to play a role in the process of neuronal growth. It is a member of the drebrin family of proteins that are developmentally regulated in the brain. A decrease in the amount of this protein in the brain has been implicated as a possible contributing factor in the pathogenesis of memory disturbance in Alzheimer@#$%&s disease. At least two alternative splice variants encoding different protein isoforms have been described for this gene.
3992	Oncostatin M is a member of the IL6 family of cytokines. Functional receptors for IL6 family cytokines are multisubunit complexes involving members of the hematopoietin receptor superfamily. Many IL6 cytokines utilize gp130 as a common receptor subunit. OSM binds to the gp130 receptor subunit and, in association with the leukemia inhibitory factor receptor, induces a proliferative response in permissive cells. OSMR is an alternative subunit (for an OSM receptor complex (a heterodimer of gp130 and OSMR) that is activated by OSM but not by LIF
3993	
3994	
3995	
3996	
3997	
3998	
3999	The transport of molecules between the nucleus and the cytoplasm in eukaryotic cells is mediated by the nuclear pore complex (NPC) which consists of 60-100 proteins and is probably 120 million daltons in molecular size. Small molecules (up to 70 kD) can pass through the nuclear pore by nonselective diffusion; larger molecules are transported by an active process. Most nuclear proteins contain short basic amino acid sequences known as nuclear localization signals (NLSs). KPNA5 protein belongs to the importin alpha protein family and is thought to be involved in NLS-dependent protein import into the nucleus
4000	The protein encoded by this gene is an ATP-dependent RNA helicase in the DEAD-box family and displays tumor-specific expression.
4001	
4002	The protein encoded by this gene shares a significant similarity with Schizosaccharomyces pombe cdc5 gene product, which is a cell cycle regulator important for G2/M transition. This protein has been demonstrated to act as a positive regulator of cell cycle G2/M progression. It was also found to be an essential component of a non-snRNA spliceosome, which contains at least five additional protein factors and is required for the second catalytic step of pre-mRNA splicing.
4003	
4004	
4005	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit. This gene is tightly linked to the TBP (TATA-binding protein) gene in human and in mouse, and is transcribed in the opposite orientation in both species.
4006	The protein encoded by this gene is a member of the TNF receptor associated factor (TRAF) protein family. TRAF proteins are associated with, and mediate signal transduction from members of the TNF receptor superfamily. This protein has been shown to interact with neurotrophin receptor, p75 (NTR/NTSR1), and negatively regulate NTR induced cell death and NF-kappa B activation. This protein has been found to bind to p47phox, a cytosolic regulatory factor included in a multi-protein complex known as NAD(P)H oxidase. This protein thus, is thought to be involved in the oxidative activation of MAPK8/JNK. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
4007	
4008	The protein encoded by this gene contains six repeated kelch motifs that are structurally similar to recombination activating gene 2 (RAG2), a protein involved in the activation of the V(D)J recombination. This gene is found to express specifically in testis. Its expression in pachytene spermatocytes is localized to cytoplasma and meiotic chromatin, which suggests that this gene may be involved in meiotic recombination.
4009	
4010	
4011	The piggyBac family of proteins, found in diverse animals, are transposases related to the transposase of the canonical piggyBac transposon from the moth, Trichoplusia ni. This family also includes genes in several genomes, including human, that appear to have been derived from the piggyBac transposons. This gene belongs to the subfamily of piggyBac transposable element derived (PGBD) genes. The PGBD proteins appear to be novel, with no obvious relationship to other transposases, or other known protein families. This gene product is specifically expressed in the brain, however, its exact function is not known.
4012	This gene encodes a protein with similarity to a rat protein that has an inhibitory effect on protein phosphatase-1 (PP1). The rat protein localizes to the nucleus and colocalizes with chromatin at distinct phases during mitosis. This gene lies within the major histocompatibility complex class I region on chromosome 6.
4013	Lymphotoxin alpha, a member of the tumor necrosis factor family, is a cytokine produced by lymphocytes. LTA is highly inducible, secreted, and exists as homotrimeric molecule.  LTA forms heterotrimers with lymphotoxin-beta which anchors lymphotoxin-alpha to the cell surface.  LTA mediates a large variety of inflammatory, immunostimulatory, and antiviral responses.  LTA is also involved in the formation of secondary lymphoid organs during development and plays a role in apoptosis.
4014	This gene encodes complement factor B, a component of the alternative pathway of complement activation. Factor B circulates in the blood as a single chain polypeptide. Upon activation of the alternative pathway, it is cleaved by complement factor D yielding the noncatalytic chain Ba and the catalytic subunit Bb. The active subunit Bb is a serine protease which associates with C3b to form the alternative pathway C3 convertase. Bb is involved in the proliferation of preactivated B lymphocytes, while Ba inhibits their proliferation. This gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6. This cluster includes several genes involved in regulation of the immune reaction. The polyadenylation site of this gene is 421 bp from the 5@#$%& end of the gene for complement component 2.
4015	
4016	A cluster of genes, BAT1-BAT5, has been localized in the vicinity of the genes for TNF alpha and TNF beta. These genes are all within the human major histocompatibility complex class III region. The protein encoded by this gene is thought to be involved in some aspects of immunity.
4017	This gene encodes a member of the tenascin family of extracellular matrix proteins. The tenascins have anti-adhesive effects, as opposed to fibronectin which is adhesive. The specific function of this protein is unknown, but its deficiency has been associated with the connective tissue disorder Ehlers-Danlos syndrome. This gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6. It is one of four genes in this cluster which have been duplicated. The duplicated copy of this gene is incomplete and is a pseudogene which is transcribed but does not encode a protein. The structure of this gene is unusual in that it overlaps the CREBL1 and CYP21A2 genes at its 5@#$%& and 3@#$%& ends, respectively. The gene expresses two transcripts.
4018	This gene encodes a protein with significant sequence similarity to Drosophila Polycomblike. The encoded protein contains a zinc finger-like PHD (plant homeodomain) finger which is distinct from other classes of zinc finger motifs and which shows the typical Cys4-His-Cys3 arrangement. PHD finger genes are thought to belong to a diverse group of transcriptional regulators possibly affecting eukaryotic gene expression by influencing chromatin structure. Two transcript variants have been found for this gene.
4019	
4020	
4021	
4022	
4023	
4024	
4025	
4026	
4027	
4028	
4029	PECI is an auxiliary enzyme that catalyzes an isomerization step required for the beta-oxidation of unsaturated fatty acids.[supplied by OMIM]
4030	This gene encodes a member of the regulator of G-protein signaling family. This protein contains a conserved, 120 amino acid motif called the RGS domain and a cysteine-rich region. The protein attenuates the signaling activity of G-proteins by binding to activated, GTP-bound G alpha subunits and acting as a GTPase activating protein (GAP), increasing the rate of conversion of the GTP to GDP. This hydrolysis allows the G alpha subunits to bind G beta/gamma subunit heterodimers, forming inactive G-protein heterotrimers, thereby terminating the signal.
4031	
4032	This gene is a member of the ecto-nucleotide pyrophosphatase/phosphodiesterase (ENPP) family. The encoded protein is a type II transmembrane glycoprotein comprising two identical disulfide-bonded subunits. This protein has broad specificity and cleaves a variety of substrates, including phosphodiester bonds of nucleotides and nucleotide sugars and pyrophosphate bonds of nucleotides and nucleotide sugars. This protein may function to hydrolyze nucleoside 5@#$%& triphosphates to their corresponding monophosphates and may also hydrolyze diadenosine polyphosphates. Mutations in this gene have been associated with @#$%&idiopathic@#$%& infantile arterial calcification, ossification of the posterior longitudinal ligament of the spine (OPLL), and insulin resistance.
4033	TCF21 encodes a transcription factor of the basic helix-loop-helix family. The TCF21 product is mesoderm specific, and expressed in embryonic epicardium, mesenchyme-derived tissues of lung, gut, gonad, and both mesenchymal and glomerular epithelial cells in the kidney. Two transcript variants encoding the same protein have been found for this gene.
4034	
4035	
4036	GPCR41 (MIM 607882) and GPCR42 act as receptors for porcine endogenous retrovirus subgroup A (PERV-A).[supplied by OMIM]
4037	Many proteins are tethered to the extracellular face of eukaryotic plasma membranes by a glycosylphosphatidylinositol (GPI) anchor. The GPI-anchor is a glycolipid found on many blood cells. The protein encoded by this gene is a GPI degrading enzyme. Glycosylphosphatidylinositol specific phospholipase D1 hydrolyzes the inositol phosphate linkage in proteins anchored by phosphatidylinositol glycans, thereby releasing the attached protein from the plasma membrane.
4038	CD222 is a 250kDa transmembrane protein with a short cytoplasmic tail containing an internalization signal. CD222 was originally identified as a receptor for IGFII and M6P-containing proteins (e.g. lysosomal hydrolases). Lysosomal enzymes are sorted to lysosomes via CD222 either from the Golgi, where the enzymes acquire M6P, or from the extracellular space. The majority of CD222 molecules (approximately 90-95%) are located intracellularly, only 5-10% is present on the cell membrane. The internalization rate seems to be enhanced by ligand induced dimerization of CD222 as well as by phosphorylation of its cytoplasmic serine. CD222 is also a receptor for TGFbeta latency associated peptide (LAP), proliferin and may bind several molecules independently of M6P, including plasminogen, CD87 or retinoic acid. It is involved in activation of latent TGFbeta [PROW]
4039	
4040	This gene encodes a member of the [alpha]/[beta] hydrolase fold family and has isoform specific imprinting. The loss of imprinting of this gene has been linked to certain types of cancer and may be due to promotor switching. The encoded protein may play a role in development. Three transcript variants encoding two distinct isoforms have been identified for this gene. A pseudogene for this locus is located on chromosome 6.
4041	The protein encoded by this gene is a component of an evolutionarily conserved, genotoxin-activated checkpoint complex that is involved in the cell cycle arrest in response to DNA damage. This protein forms a heterotrimeric complex with checkpoint proteins RAD9 and RAD1. In response to DNA damage, the trimeric complex interacts with another protein complex consisting of checkpoint protein RAD17 and four small subunits of the replication factor C (RFC), which loads the combined complex onto the chromatin. The DNA damage induced chromatin binding has been shown to depend on the activation of the checkpoint kinase ATM, and is thought to be an early checkpoint signaling event.
4042	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
4043	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
4044	
4045	
4046	
4047	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit in the proteasome. This catalytic subunit is not present in the immunoproteasome and is replaced by catalytic subunit 1i (proteasome beta 9 subunit).
4048	
4049	This gene encodes type I adenylate cyclase activating polypeptide receptor, which is a membrane-associated protein and shares significant homology with members of the glucagon/secretin receptor family. This receptor mediates diverse biological actions of adenylate cyclase activating polypeptide 1 and is positively coupled to adenylate cyclase. Alternative splicing of two exons of this gene generates four major splice variants, but their full-length nature has not been determined.
4050	The product of this gene belongs to the CRE (cAMP response element)-binding protein family. Members of this family contain zinc-finger and bZIP DNA-binding domains. The encoded protein specifically binds to CRE as a homodimer or a heterodimer with c-Jun or CRE-BP1, and functions as a CRE-dependent trans-activator. Alternatively spliced transcript variants encoding different isoforms have been identified.
4051	The protein encoded by this gene belongs to the src family kinases. This protein is similar to the src family associated phosphoprotein 1. It is an adaptor protein and thought to play an essential role in the src signaling pathway in various cells. In a mouse study the SCAP2 gene is implicated in the processes of myeloid differentiation and growth arrest.
4052	
4053	
4054	
4055	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
4056	
4057	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation. Multiple alternatively spliced transcript variants have been described for this gene but some of the full length sequences are not known.
4058	
4059	
4060	This gene is a member of the cyclophilin family of peptidylprolyl isomerases. The cyclophilins are a highly conserved ubiquitous family, members of which play an important role in protein folding, immunosuppression by cyclosporin A, and infection of HIV-1 virions. This protein interacts with the proteinase inhibitor eglin c and is localized in the nucleus. This gene encodes three transcript variants encoding two distinct isoforms.
4061	
4062	The FK506-binding proteins (FKBPs) were first identified as receptors for the immunosuppressants FK506 and rapamycin. All FKBPs display peptidyl-prolyl cis/trans isomerase (PPIase) activity and act as molecular chaperones.[supplied by OMIM]
4063	The adipocyte enhancer binding protein 1 is a transcriptional repressor with carboxypeptidase (CP) activity. This protein binds to a regulatory sequence, adipocyte enhancer 1 (AE-1), located in the proximal promoter region of the adipose P2 (aP2) gene, which encodes the adipocyte fatty-acid binding protein. It is characterized as a member of the regulatory B-like CP family. This protein seems to be activated by a novel mechanism, whereby the direct binding of DNA enhances its protease activity. Adipocyte-enhancer binding protein 1 may play a role in differentiated vascular smooth muscle cells.
4064	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. This gene is expressed in some human cancer cell lines and has been implicated in carcinogenesis.
4065	
4066	KRT17 encodes the type I intermediate filament chain keratin 17, expressed in nail bed, hair follicle, sebaceous glands, and other epidermal appendages. Mutations in this gene lead to Jackson-Lawler type pachyonychia congenita and steatocystoma multiplex.
4067	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. The 28S subunit of the mammalian mitoribosome may play a crucial and characteristic role in translation initiation. This gene encodes a 28S subunit protein that is one of the more highly conserved mitochondrial ribosomal proteins among mammals, Drosophila and C. elegans. Splice variants that differ in the 5@#$%& UTR have been found for this gene; all variants encode the same protein. Pseudogenes corresponding to this gene are found on chromosomes 1q, 4p, 4q, and 20q
4068	
4069	
4070	
4071	
4072	The protein encoded by this gene is part of the clathrin coat assembly complex which links clathrin to receptors in coated vesicles. These vesicles are involved in endocytosis and Golgi processing. This protein, as well as beta-prime-adaptin, gamma-adaptin, and the medium (mu) chain AP47, form the AP-1 assembly protein complex located at the Golgi vesicle. Two alternatively spliced transcript variants of this gene, which encode distinct isoforms, have been reported.
4073	
4074	
4075	Ornithine decarboxylase (ODC) catalyzes the conversion of ornithine to putrescine in the first and apparently rate-limiting step in polyamine biosynthesis. Ornithine decarboxylase antizymes play a role in the regulation of polyamine synthesis by binding to and inhibiting ornithine decarboxylase. The protein encoded by this gene is highly similar to ODC. It binds to ODC antizyme and stabilizes ODC, thus inhibiting antizyme-mediated ODC degradation. Two alternatively spliced transcript variants have been found for this gene.
4076	This gene encodes one of the smallest subunits of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. This subunit is shared by the other two DNA-directed RNA polymerases.
4077	
4078	
4079	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This gene encodes the regulatory H subunit of the V1 domain which is required for catalysis of ATP but not the assembly of V-ATPase. Three alternatively spliced transcript variants encode two isoforms of the H subunit.
4080	Calmodulin-dependent protein phosphatase, calcineurin, is involved in a wide range of biologic activities, acting as a Ca(2+)-dependent modifier of phosphorylation status. In testis, the motility of the sperm is thought to be controlled by cAMP-dependent phosphorylation and a unique form of calcineurin appears to be associated with the flagellum. The calcineurin holoenzyme is composed of catalytic and regulatory subunits of 60 and 18 kD, respectively. At least 3 genes, calcineurin A-alpha (CALNA1; MIM 114105), calcineurin A-beta (CALNA2; MIM 114106), and calcineurin A-gamma (CALNA3), have been cloned for the catalytic subunit. These genes have been identified in humans, mice, and rats, and are highly conserved between species (90 to 95% amino acid identity).[supplied by OMIM]
4081	The protein encoded by this gene contains a RING zinc finger, a motif known to be involved in protein-protein interactions. This gene has been found to be preferentially expressed in testis.
4082	The physiologic effects of glucagon (GCG; MIM 138030) are mediated through the glucagon receptor, a member of the superfamily of receptors characterized by a 7-transmembrane domain structure and by their coupling via GTP-binding proteins (G proteins) to adenyl cyclase.[supplied by OMIM]
4083	
4084	
4085	
4086	The homeodomain-containing transcription factor NKX3A is a putative prostate tumor suppressor that is expressed in a largely prostate-specific and androgen-regulated manner. Loss of NKX3A protein expression is a common finding in human prostate carcinomas and prostatic intraepithelial neoplasia.[supplied by OMIM]
4087	
4088	The protein encoded by this gene was initially identified as a molecule linking syndecan-mediated signaling to the cytoskeleton. The syntenin protein contains tandemly repeated PDZ domains that bind the cytoplasmic, C-terminal domains of a variety of transmembrane proteins. This protein may also affect cytoskeletal-membrane organization, cell adhesion, protein trafficking, and the activation of transcription factors. The protein is primarily localized to membrane-associated adherens junctions and focal adhesions but is also found at the endoplasmic reticulum and nucleus. Alternative splicing results in multiple transcript variants encoding different isoforms.
4089	
4090	
4091	
4092	Defensins form a family of microbicidal and cytotoxic peptides made by neutrophils. Members of the defensin family are highly similar in protein sequence. This gene encodes defensin, beta 4, an antibiotic peptide which is locally regulated by inflammation.
4093	
4094	
4095	
4096	
4097	
4098	
4099	
4100	
4101	
4102	
4103	
4104	Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide.  They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid.  They show extensive diversity in tissue distribution and in their subcellular localization.  CA IX is a transmembrane protein and the only tumor-associated carbonic anhydrase isoenzyme known.  It is expressed in all clear-cell renal cell carcinoma, but is not detected in normal kidney or most other normal tissues.  It may be involved in cell proliferation and transformation.  This gene was mapped to 17q21.2 by fluorescence in situ hybridization, however, radiation hybrid mapping localized it to 9p13-p12.
4105	
4106	
4107	
4108	
4109	
4110	
4111	The ficolin family of proteins are characterized by the presence of a leader peptide, a short N-terminal segment, followed by a collagen-like region, and a C-terminal fibrinogen-like domain. The collagen-like and the fibrinogen-like domains are also found separately in other proteins such as complement protein C1q, C-type lectins known as collectins, and tenascins.  However, all these proteins recognize different targets, and are functionally distinct.  Ficolin 1 encoded by FCN1 is predominantly expressed in the peripheral blood leukocytes, and has been postulated to function as a plasma protein with elastin-binding activity.
4112	
4113	
4114	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L24E family of ribosomal proteins. It is located in the cytoplasm. This gene has been referred to as ribosomal protein L30 because the encoded protein shares amino acid identity with the L30 ribosomal proteins from S. cerevisiae; however, its official name is ribosomal protein L24. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
4115	
4116	
4117	This gene encodes a protein containing multiple ankyrin domains and two IQ calmodulin-binding domains. The encoded protein may function in renal tubular development and function, and in left-right axis determination. This protein interacts with nephrocystin and infers a connection between primary cilia function and left-right axis determination. A similar protein in mice interacts with calmodulin. Mutations in this gene have been associated with nephronophthisis type 2. Two transcript variants encoding distinct isoforms have been identified for this gene.
4118	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
4119	
4120	This gene encodes the smallest subunit of dynactin, a macromolecular complex consisting of 10 subunits ranging in size from 22 to 150 kD. Dynactin binds to both microtubules and cytoplasmic dynein. It is involved in a diverse array of cellular functions, including ER-to-Golgi transport, the centripetal movement of lysosomes and endosomes, spindle formation, cytokinesis, chromosome movement, nuclear positioning, and axonogenesis. This subunit, like most other dynactin subunits, exists only as a part of the dynactin complex. It is primarily an alpha-helical protein with very little coiled coil, and binds directly to the largest subunit (p150) of dynactin. Alternative splicing of this gene generates 2 transcript variants.
4121	Aquaporins/major intrinsic protein (MIP) are a family of water-selective membrane channels. Aquaporin 7 has greater sequence similarity with AQP3 and AQP9 and they may be a subfamily. Aquaporin 7 and AQP3 are at the same chromosomal location suggesting that 9p13 may be a site of an aquaporin cluster. Aquaporin 7 facilitates water, glycerol and urea transport. It may play an important role in sperm function.
4122	
4123	The protein encoded by this gene belongs to a family of ankyrin repeat proteins that, along with four other protein families, contain a C-terminal SOCS box motif. Growing evidence suggests that the SOCS box, similar to the F-box, acts as a bridge between specific substrate-binding domains and the more generic proteins that comprise a large family of E3 ubiquitin protein ligases.
4124	The protein encoded by this gene is a member of the cyclin-dependent protein kinase (CDK) family. CDK family members are highly similar to the gene products of S. cerevisiae cdc28, and S. pombe cdc2, and known as important cell cycle regulators. This kinase was found to be a component of the multiprotein complex TAK/P-TEFb, which is an elongation factor for RNA polymerase II-directed transcription and functions by phosphorylating the C-terminal domain of the largest subunit of RNA polymerase II. This protein forms a complex with and is regulated by its regulatory subunit cyclin T or cyclin K. HIV-1 Tat protein was found to interact with this protein and cyclin T, which suggested a possible involvement of this protein in AIDS.
4125	Fibrillar collagens, such as COL27A1, compose one of the most ancient families of extracellular matrix molecules. They form major structural elements in extracellular matrices of cartilage, skin, and tendon (Boot-Handford et al., 2003).[supplied by OMIM]
4126	This gene is similar to a mouse gene that encodes a ubiquitin-specific protease. This gene appears to be intronless.
4127	This gene encodes a molecular chaperone that is member of the chaperonin containing TCP1 complex (CCT), also known as the TCP1 ring complex (TRiC). This complex consists of two identical stacked rings, each containing eight different proteins. Unfolded polypeptides enter the central cavity of the complex and are folded in an ATP-dependent manner. The complex folds various proteins, including actin and tubulin. Alternate transcriptional splice variants of this gene, encoding different isoforms, have been characterized.
4128	The protein encoded by this gene is a cytosolic protein which contains a phosphotyrosine binding (PTD) domain. The PTD domain has been found to interact with the cytoplasmic tail of the LDL receptor. Mutations in this gene lead to LDL receptor malfunction and cause the disorder autosomal recessive hypercholesterolaemia.
4129	The protein encoded by this gene is a member of the X11 protein family. It is a neuronal adaptor protein that interacts with the Alzheimer@#$%&s disease amyloid precursor protein (APP). It stabilises APP and inhibits production of proteolytic APP fragments including the A beta peptide that is deposited in the brains of Alzheimer@#$%&s disease patients. This gene product is believed to be involved in signal transduction processes. It is also regarded as a putative vesicular trafficking protein in the brain that can form a complex with the potential to couple synaptic vesicle exocytosis to neuronal cell adhesion.
4130	
4131	
4132	
4133	
4134	The protein encoded by this gene is a receptor protein tyrosine kinase and type I transmembrane protein that belongs to the ROR subfamily of cell surface receptors.  The protein may be involved in the early formation of the chondrocytes and may be required for cartilage and growth plate development.  Mutations in this gene can cause brachydactyly type B, a skeletal disorder characterized by hypoplasia/aplasia of distal phalanges and nails. In addition, mutations in this gene can cause the autosomal recessive form of Robinow syndrome, which is characterized by skeletal dysplasia with generalized limb bone shortening, segmental defects of the spine, brachydactyly, and a dysmorphic facial appearance.
4135	
4136	
4137	
4138	The PDE4D gene is complex and has at least 9 different isoforms that encode functional proteins. These proteins degrade the second messenger cAMP, which is a key signal transduction molecule in multiple cell types, including vascular cells (Dominiczak and McBride, 2003).[supplied by OMIM]
4139	The protein encoded by this gene is a member of the RecQ DEAH helicase family and interacts with the BRCT repeats of breast cancer, type 1 (BRCA1). The bound complex is important in the normal double-strand break repair function of breast cancer, type 1 (BRCA1). This gene may be a target of germline cancer-inducing mutations.
4140	
4141	
4142	
4143	
4144	The protein encoded by this gene is a nuclear receptor that is closely related to the estrogen receptor. This protein acts as a site-specific transcription regulator and has been also shown to interact with estrogen and the transcripton factor TFIIB by direct protein-protein contact. The binding and regulatory activities of this protein have been demonstrated in the regulation of a variety of genes including lactoferrin, osteopontin, medium-chain acyl coenzyme A dehydrogenase (MCAD) and thyroid hormone receptor genes. A processed pseudogene of ESRRA is located on chromosome 13q12.1.
4145	
4146	
4147	Short/branched chain acyl-CoA dehydrogenase(ACADSB) is a member of the acyl-CoA dehydrogenase family of enzymes that catalyze the dehydrogenation of acyl-CoA derivatives in the metabolism of fatty acids or branch chained amino acids. Substrate specificity is the primary characteristic used to define members of this gene family. The ACADSB gene product has the greatest activity towards the short branched chain acyl-CoA derivative, (S)-2-methylbutyryl-CoA, but also reacts significantly with other 2-methyl branched chain substrates and with short straight chain acyl-CoAs. The cDNA encodes for a mitochondrial precursor protein which is cleaved upon mitochondrial import and predicted to yield a mature peptide of approximately 43.7-KDa.
4148	This gene encodes an alpha chain for one of the low abundance fibrillar collagens. Fibrillar collagen molecules are trimers that can be composed of one or more types of alpha chains. Type V collagen is found in tissues containing type I collagen and appears to regulate the assembly of heterotypic fibers composed of both type I and type V collagen. This gene product is closely related to type XI collagen and it is possible that the collagen chains of types V and XI constitute a single collagen type with tissue-specific chain combinations. Mutations in this gene are associated with Ehlers-Danlos syndrome, types I and II.
4149	
4150	DNA methylation is the major modification of eukaryotic genomes and plays an essential role in mammalian development. Human proteins MECP2, MBD1, MBD2, MBD3, and MBD4 comprise a family of nuclear proteins related by the presence in each of a methyl-CpG binding domain (MBD). Each of these proteins, with the exception of MBD3, is capable of binding specifically to methylated DNA. MECP2, MBD1 and MBD2 can also repress transcription from methylated gene promoters. The protein encoded by this gene may function as a mediator of the biological consequences of the methylation signal. It is also reported that the this protein functions as a demethylase to activate transcription, as DNA methylation causes gene silencing.
4151	This gene is known to suppress G-protein and mitogen-activated signal transduction in mammalian cells. The encoded protein shares significant similarity with Arabidopsis FUS6, which is a regulator of light-mediated signal transduction in plant cells. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
4152	MaxiK channels are large conductance, voltage and calcium-sensitive potassium channels which are fundamental to the control of smooth muscle tone and neuronal excitability. MaxiK channels can be formed by 2 subunits: the pore-forming alpha subunit and the modulatory beta subunit. The protein encoded by this gene is an auxiliary beta subunit which decreases the activation time of MaxiK alpha subunit currents. Two variants encoding the same protein have been found for this gene.
4153	
4154	
4155	
4156	This gene encodes uromodulin, the most abundant protein in normal urine. Its excretion in urine follows proteolytic cleavage of the ectodomain of its glycosyl phosphatidylinosital-anchored counterpart that is situated on the luminal cell surface of the loop of Henle. Uromodulin may act as a constitutive inhibitor of calcium crystallization in renal fluids. Excretion of uromodulin in urine may provide defense against urinary tract infections caused by uropathogenic bacteria. Defects in this gene are associated with the autosomal dominant renal disorders medullary cystic kidney disease-2 (MCKD2) and familial juvenile hyperuricemic nephropathy (FJHN). These disorders are characterized by juvenile onset of hyperuricemia, gout, and progressive renal failure. While several transcript variants may exist for this gene, the full-length natures of only two have been described to date. These two represent the major variants of this gene and encode the same isoform.
4157	
4158	
4159	
4160	Members of arrestin/beta-arrestin protein family are thought to participate in agonist-mediated desensitization of G-protein-coupled receptors and cause specific dampening of cellular responses to stimuli such as hormones, neurotransmitters, or sensory signals. Arrestin beta 2, like arrestin beta 1, was shown to inhibit beta-adrenergic receptor function in vitro. It is expressed at high levels in the central nervous system and may play a role in the regulation of synaptic receptors. Besides the brain, a cDNA for arrestin beta 2 was isolated from thyroid gland, and thus it may also be involved in hormone-specific desensitization of TSH receptors. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
4161	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. The mRNA encodes a cadherin-related neuronal receptor thought to play a role in the establishment and function of specific cell-cell connections in the brain. This family member contains 6 extracellular cadherin domains, a transmembrane domain and a cytoplasmic tail differing from those of the classical cadherins. Alternatively spliced transcripts encode isoforms with unique cytoplasmic domains.
4162	
4163	Cholesterol homeostasis is regulated, at least in part, by sterol regulatory element (SRE)-binding proteins (e.g., SREBP1; MIM 184756) and by liver X receptors (e.g., LXRA; MIM 602423). Upon sterol depletion, LXRs are inactive and SREBPs are cleaved, after which they bind promoter SREs and activate genes involved in cholesterol biosynthesis and uptake. Sterol transport is mediated by vesicles or by soluble protein carriers, such as steroidogenic acute regulatory protein (STAR; MIM 600617). STAR is homologous to a family of proteins containing a 200- to 210-amino acid STAR-related lipid transfer (START) domain, including STARD5 (Soccio et al., 2002).[supplied by OMIM]
4164	
4165	
4166	
4167	
4168	
4169	The protein encoded by this gene is the medium subunit of AP-3, which is an adaptor-related protein complex associated with the Golgi region as well as more peripheral intracellular structures. AP-3 facilitates the budding of vesicles from the Golgi membrane and may be directly involved in protein sorting to the endosomal/lysosomal system. AP-3 is a heterotetrameric protein complex composed of two large subunits (delta and beta3), a medium subunit (mu3), and a small subunit (sigma 3). Mutations in one of the large subunits of AP-3 have been associated with the Hermansky-Pudlak syndrome, a genetic disorder characterized by defective lysosome-related organelles. Alternatively spliced transcript variants encoding the same protein have been observed.
4170	
4171	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. This protein is expressed in the canalicular (apical) part of the hepatocyte and functions in biliary transport. Substrates include anticancer drugs such as vinblastine; therefore, this protein appears to contribute to drug resistance in mammalian cells. Several different mutations in this gene have been observed in patients with Dubin-Johnson syndrome (DJS), an autosomal recessive disorder characterized by conjugated hyperbilirubinemia.
4172	
4173	
4174	
4175	
4176	
4177	
4178	
4179	
4180	
4181	
4182	
4183	AATK is a novel gene which is induced during apoptosis.  It encodes a protein with a tyrosine kinase domain at the N-terminal end and a proline-rich domain at the C-terminal end.  AATK expression may be a necessary pre-requisite for the induction of growth arrest and/or apoptosis of myeloid precursor cells
4184	AATK is a novel gene which is induced during apoptosis.  It encodes a protein with a tyrosine kinase domain at the N-terminal end and a proline-rich domain at the C-terminal end.  AATK expression may be a necessary pre-requisite for the induction of growth arrest and/or apoptosis of myeloid precursor cells
4185	Gastrin is hormone whose main function is to stimulate secretion of HCL by the gastric mucosa which results in gastrin formation inhibition.  GAS also acts as a mitogeneic factor for gastrointestinal epithelial cells.  Gastrin has two biologically active peptide forms, G34 and G17.
4186	
4187	
4188	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem catalytic domains, and thus represents a receptor-type PTP. The extracellular region contains a meprin-A5 antigen-PTP mu (MAM) domain, an Ig-like domain and four fibronectin type III-like repeats. This PTP has been shown to mediate cell-cell aggregation through the interaction with another molecule of this PTP on an adjacent cell. This PTP can interact with scaffolding protein RACK1/GNB2L1, which may be necessary for the downstream signaling in response to cell-cell adhesion.
4189	
4190	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes a subunit of TFIID present in a subset of TFIID complexes. Translocations involving chromosome 17 and chromosome 9, where the gene for the nuclear receptor CSMF is located, result in a gene fusion product that is an RNA binding protein associated with a subset of extraskeletal myxoid chondrosarcomas. Two transcripts encoding different isoforms have been identified.
4191	More than 99% of the vertebrate ocular lens is comprised of terminally differentiated lens fiber cells.  Two lens-specific intermediate filament-like proteins, the protein product of this gene (CP49 or phakinin) and filensin (also known as CP115), are expressed only after fiber cell differentiation has begun. Both proteins are found in a structurally unique cytoskeletal element that is referred to as the beaded filament (BF). Mutations in this gene have been associated with juvenile-onset, progressive cataracts and Dowling-Meara epidermolysis bullosa simplex.
4192	Peroxisome proliferator-activated receptor gamma is a nuclear receptor which regulates adipose differentiation and glucose homeostasis. It acts through ligand-dependent transcriptional activation of target genes. This gene is one of the targets of peroxisome proliferator-activated receptor gamma and encodes an angiopoietin-like secreted glycoprotein. This gene also has been referred to as ANGPTL2, although a different gene exists with that same designation. Two transcript variants encoding the same protein have been found for this gene.
4193	The CD8 antigen is a cell surface glycoprotein found on most cytotoxic T lymphocytes that mediates efficient cell-cell interactions within the immune system. The CD8 antigen, acting as a coreceptor, and the T-cell receptor on the T lymphocyte recognize antigen displayed by an antigen presenting cell (APC) in the context of class I MHC molecules. The functional coreceptor is either a homodimer composed of two alpha chains, or a heterodimer composed of one alpha and one beta chain. Both alpha and beta chains share significant homology to immunoglobulin variable light chains. This gene encodes the CD8 beta chain isoforms. Six alternative transcripts encoding different isoforms, four membrane associated and two secreted, have been identified. A pseudogene, also located on chromosome 2, has been identified.
4194	
4195	The protein encoded by this gene belongs to the SMAD, a family of proteins similar to the gene products of the Drosophila gene @#$%&mothers against decapentaplegic@#$%& (Mad) and the C. elegans gene Sma. SMAD proteins are signal transducers and transcriptional modulators that mediate multiple signaling pathways. This protein mediates the signal of the transforming growth factor (TGF)-beta, and thus regulates multiple cellular processes, such as cell proliferation, apoptosis, and differentiation. This protein is recruited to the TGF-beta receptors through its interaction with the SMAD anchor for receptor activation (SARA) protein. In response to TGF-beta signal, this protein is phosphorylated by the TGF-beta receptors. The phosphorylation induces the dissociation of this protein with SARA and the association with the family member SMAD4. The association with SMAD4 is important for the translocation of this protein into the nucleus, where it binds to target promoters and forms a transcription repressor complex with other cofactors. This protein can also be phosphorylated by activin type 1 receptor kinase, and mediates the signal from the activin. Alternatively spliced transcript variants encoding the same protein have been observed.
4196	The protein encoded by this gene is a member of the SEC24 subfamily of the SEC23/SEC24 family, which is involved in vesicle trafficking. The encoded protein has similarity to yeast Sec24p component of COPII. COPII is the coat protein complex responsible for vesicle budding from the ER. The role of this gene product is implicated in the shaping of the vesicle, and also in cargo selection and concentration.
4197	This gene encodes a binding protein which interacts with the C-terminal region of topoisomerase II beta. This interaction suggests a supportive role for this protein in the catalytic reactions of topoisomerase II beta through transient breakages of DNA strands.
4198	The Axin-related protein, Axin2, presumably plays an important role in the regulation of the stability of beta-catenin in the Wnt signaling pathway, like its rodent homologs, mouse conductin/rat axil. In mouse, conductin organizes a multiprotein complex of APC (adenomatous polyposis of the colon), beta-catenin, glycogen synthase kinase 3-beta, and conductin, which leads to the degradation of beta-catenin. Apparently, the deregulation of beta-catenin is an important event in the genesis of a number of malignancies. The AXIN2 gene has been mapped to 17q23-q24, a region that shows frequent loss of heterozygosity in breast cancer, neuroblastoma, and other tumors. Mutations in this gene have been associated with colorectal cancer with defective mismatch repair.
4199	
4200	
4201	
4202	
4203	
4204	
4205	
4206	
4207	
4208	Voltage-dependent Ca(2+) channels are multisubunit complexes found in the membrane of many excitable cells that regulate calcium entry (see MIM 601011). N-type calcium channels, which control neurotransmitter release from neurons, are dihydropyridine-insensitive and omega-conotoxin-sensitive. The alpha-1 subunit forms the pore through which calcium enters the cell, and is encoded by a family of at least 5 genes.[supplied by OMIM]
4209	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes subunit VIIb, which is highly similar to bovine COX VIIb protein and is found in all tissues. This gene may have several pseudogenes on chromosomes 1, 2, 20 and 22, respectively.
4210	This gene encodes a novel protein which is highly conserved in mouse. It interacts with the N-terminus of the androgen receptor and plays a role in facilitating receptor-induced transcriptional activation. It is also likely to be involved in tumorigenesis as it is abundantly expressed in tumor tissues. This gene is part of a gene cluster on chromosome Xp11.23. Alternative splicing results in 2 transcript variants encoding different isoforms.
4211	
4212	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. Mutations in this gene have been associated with the inherited X-linked disorder, Cleft palate with ankyloglossia, and it is believed to play a major role in human palatogenesis.
4213	
4214	
4215	
4216	
4217	The protein encoded by this gene was identified as an interacting corepressor of BCL6, a POZ/zinc finger transcription repressor that is required for germinal center formation and may influence apoptosis. This protein selectively interacts with the POZ domain of BCL6, but not with eight other POZ proteins. Specific class I and II histone deacetylases (HDACs) have been shown to interact with this protein, which suggests a possible link between the two classes of HDACs. At least two alternatively spliced transcript variants, which encode different isoforms, have been reported for this gene.
4218	
4219	
4220	
4221	The human NDUFA1 gene codes for an essential component of complex I of the respiratory chain, which transfers electrons from NADH to ubiquinone. It has been noted that the N-terminal hydrophobic domain has the potential to be folded into an alpha-helix spanning the inner mitochondrial membrane with a C-terminal hydrophilic domain interacting with globular subunits of complex I. The highly conserved two-domain structure suggests that this feature is critical for the protein function and might act as an anchor for the NADH:ubiquinone oxidoreductase complex at the inner mitochondrial membrane. However, the NDUFA1 peptide is one of about 31 components of the @#$%!hydrophobic protein@#$%! (HP) fraction of complex I which is involved in proton translocation. Thus the NDUFA1 peptide may also participate in that function.
4222	
4223	SLC7A3 is a member of the system y+ family of transporters characterized by sodium-independent transport of cationic amino acids.[supplied by OMIM]
4224	This gene encodes a member of the Rab family of proteins. Rab proteins are small GTPases that are involved in vesicular trafficking.
4225	
4226	This gene is related to members of the MAGED gene family, in terms of the sequence similarity and the chromosome location. This gene is expressed only in brain and ovary among normal tissues, and two transcript variants of this gene are specifically expressed in glioma cells among cancer cells. This gene and the other MAGED genes are clustered on chromosome Xp11. Multiple alternatively spliced transcript variants have been found for this gene, however, the full length nature of some variants has not been defined.
4227	Familial incontinentia pigmenti (IP) is a genodermatosis that segregates as an X-linked dominant disorder and is usually lethal prenatally in males (The International Incontinentia Pigmenti Consortium, 2000 [PubMed 10839543]). In affected females it causes highly variable abnormalities of the skin, hair, nails, teeth, eyes, and central nervous system. The prominent skin signs occur in 4 classic cutaneous stages: perinatal inflammatory vesicles, verrucous patches, a distinctive pattern of hyperpigmentation, and dermal scarring. Cells expressing the mutated X chromosome are eliminated selectively around the time of birth, so females with IP exhibit extremely skewed X-inactivation. Familial incontinentia pigmenti is caused by mutations in the NEMO gene and is here referred to as IP2, or @#$%&classical@#$%& incontinentia pigmenti. Sporadic incontinentia pigmenti, the so-called IP1, which maps to Xp11, is categorized as hypomelanosis of Ito (MIM 300337).[supplied by OMIM]
4228	
4229	
4230	
4231	
4232	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to microtubular structures in the cytoplasm. Its function has not been identified. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
4233	
4234	
4235	This gene encodes a protein containing tetratricopeptide repeats which are thought to be involved in protein-protein interactions. This protein is a minor histocompatibility antigen which may induce graft rejection of male stem cell grafts. Alternative splicing results in multiple transcript variants encoding different isoforms.
4236	
4237	
4238	
4239	
4240	
4241	This gene encodes UDP-galactose-4-epimerase which catalyzes 2 distinct but analogous reactions: the epimerization of UDP-glucose to UDP-galactose, and the epimerization of UDP-N-acetylglucosamine to UDP-N-acetylgalactosamine. The bifunctional nature of the enzyme has the important metabolic consequence that mutant cells (or individuals) are dependent not only on exogenous galactose, but also on exogenous N-acetylgalactosamine for necessary precursor for the synthesis of glycoproteins and glycolipids. The missense mutations in the GALE gene result in the epimerase-deficiency galactosemia. Alternative splicing has been observed at this locus and two variants, both encoding the same protein, have been identified.
4242	
4243	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility.
4244	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of Na+/K+ -ATPases. Na+/K+ -ATPase is an integral membrane protein responsible for establishing and maintaining the electrochemical gradients of Na and K ions across the plasma membrane. These gradients are essential for osmoregulation, for sodium-coupled transport of a variety of organic and inorganic molecules, and for electrical excitability of nerve and muscle. This enzyme is composed of two subunits, a large catalytic subunit (alpha) and a smaller glycoprotein subunit (beta). The catalytic subunit of Na+/K+ -ATPase is encoded by multiple genes. This gene encodes an alpha 4 subunit. Alternatively spliced transcript variants encoding different isoforms have been identified.
4245	CDC42, a small Rho GTPase, regulates the formation of F-actin-containing structures through its interaction with the downstream effector proteins. The protein encoded by this gene is a member of the Borg family of CDC42 effector proteins. Borg family proteins contain a CRIB (Cdc42/Rac interactive-binding) domain. They bind to, and negatively regulate the function of, CDC42. Coexpression of this protein with dominant negative mutant CDC42 protein in fibroblast was found to induce pseudopodia formation, which suggested a role of this protein in actin filament assembly and cell shape control.
4246	
4247	
4248	
4249	
4250	
4251	Angiopoietins are members of the vascular endothelial growth factor family and the only known growth factors largely specific for vascular endothelium. Angiopoietin-1, angiopoietin-2, and angiopoietin-4 participate in the formation of blood vessels. Angiopoietin-like 3 is predominantly expressed in the liver, however, its exact function is not known. The ANGPTL3 protein has the characteristic structure of angiopoietins: a signal peptide, an extended helical domain predicted to form dimeric or trimeric coiled-coils, a short linker peptide, and a globular fibrinogen homology domain (FHD). ANGPTL3 also does not contain the characteristic calcium-binding motif found in other angiopoietins.
4252	The protein encoded by this gene is an approximately 345-kD proteoglycan specifically synthesized by chondrocytes located at the surface of articular cartilage, and also by some synovial lining cells. This protein contains both chondroitin sulfate and keratan sulfate glycosaminoglycans. The cDNA encodes a protein of 1,404 amino acids with a somatomedin B homology domain, heparin-binding domains, multiple mucin-like repeats, a hemopexin domain, and an aggregation domain. There are 3 consensus sequences for N-glycosylation and 1 chondroitin sulfate substitution site. The locus for autosomal recessive camptodactyly-arthropathy-coxa vara-pericarditis syndrome maps to chromosome 1q25-q31. This protein may act as a joint/intimal cell lubricant, and cell overgrowth may be primary to the pathogenesis of this protein.
4253	
4254	This gene encodes the alpha chain of type XVI collagen, a member of the FACIT collagen family (fibril-associated collagens with interrupted helices). Members of this collagen family are found in association with fibril-forming collagens such as type I and II, and serve to maintain the integrity of the extracellular matrix. High levels of type XVI collagen have been found in fibroblasts and keratinocytes, and in smooth muscle and amnion.
4255	
4256	
4257	The protein encoded by this gene is a member of the AKT, also called PKB,  serine/threonine protein kinase family. AKT kinases are known to be regulators of cell signaling in response to insulin and growth factors. They are involved in a wide variety of biological processes including cell proliferation, differentiation, apoptosis, tumorigenesis, as well as glycogen synthesis and glucose uptake. This kinase has been shown to be stimulated by platelet-derived growth factor (PDGF), insulin, and insulin-like growth factor 1 (IGF1). Alternatively splice transcript variants encoding distinct isoforms have been described.
4258	Elliptocytosis is a hematologic disorder characterized by elliptically shaped erythrocytes and a variable degree of hemolytic anemia. Inherited as an autosomal dominant, elliptocytosis results from mutation in any one of several genes encoding proteins of the red cell membrane skeleton. The form discussed here is the one found in the 1950s to be linked to Rh blood group and more recently shown to be caused by a defect in protein 4.1. @#$%&Rh-unlinked@#$%& forms of elliptocytosis are caused by mutation in the alpha-spectrin gene (MIM 182860), the beta-spectrin gene (MIM 182870), or the band 3 gene (MIM 109270).[supplied by OMIM]
4259	
4260	
4261	
4262	
4263	Myogenin is a muscle-specific transcription factor that can induce myogenesis in a variety of cell types in tissue culture. It is a member of a large family of proteins related by sequence homology, the helix-loop-helix (HLH) proteins. It is essential for the development of functional skeletal muscle.
4264	
4265	
4266	
4267	
4268	Glutaredoxins (e.g., GLRX; MIM 600443) are a family of glutathione-dependent hydrogen donors that participate in a variety of cellular redox reactions.[supplied by OMIM]
4269	
4270	
4271	
4272	
4273	This gene encodes an extracellular protein containing motifs with a cysteine pattern characteristic of the cysteine pattern of the ligand-binding @#$%!double-loop@#$%! domains of the albumin protein family. This gene maps outside of the epidermal differentiation complex (EDC), a cluster of three gene families involved in epidermal differentiation. Alternatively spliced transcript variants encoding distinct isoforms have been described.
4274	
4275	
4276	
4277	
4278	This gene encodes a protein that belongs to the glutamate-gated ionic channel family. Glutamate functions as the major excitatory neurotransmitter in the central nervous system through activation of ligand-gated ion channels and G protein-coupled membrane receptors. The protein encoded by this gene forms functional heteromeric kainate-preferring ionic channels with the subunits encoded by related gene family members.
4279	This gene encodes a molecular chaperone that is member of the chaperonin containing TCP1 complex (CCT), also known as the TCP1 ring complex (TRiC). This complex consists of two identical stacked rings, each containing eight different proteins. Unfolded polypeptides enter the central cavity of the complex and are folded in an ATP-dependent manner. The complex folds various proteins, including actin and tubulin. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
4280	
4281	
4282	
4283	
4284	The heterotetrameric adaptor protein (AP) complexes sort integral membrane proteins at various stages of the endocytic and secretory pathways. AP4 is composed of 2 large chains, beta-4 (AP4B1) and epsilon-4 (AP4E1; MIM 607244), a medium chain, mu-4 (AP4M1; MIM 602296), and a small chain, sigma-4 (AP4S1; MIM 607243).[supplied by OMIM]
4285	The protein encoded by this gene is thought to play a role in coupling hepatocyte growth factor receptor signaling to the p70 S6 protein kinase pathway. It also interacts with the peptidyl prolyl isomerase FK506-binding protein 59 (FKBP59), FKBPP52 and FKBP12. FKBPs belong to the large family of immunophilins that bind the macrolide immunosuppressant drugs FK506 and rapamycin. This gene has two alternatively spliced transcript variants encoding different isoforms. There is evidence for another polyadenylation site further downstream for this gene.
4286	
4287	
4288	
4289	The protein encoded by this gene is an unusual orphan receptor that contains a putative ligand-binding domain but lacks a conventional DNA-binding domain. The gene product is a member of the nuclear hormone receptor family, a group of transcription factors regulated by small hydrophobic hormones, a subset of which do not have known ligands and are referred to as orphan nuclear receptors. The protein has been shown to interact with retinoid and thyroid hormone receptors, inhibiting their ligand-dependent transcriptional activation. In addition, interaction with estrogen receptors has been demonstrated, leading to inhibition of function. Studies suggest that the protein represses nuclear hormone receptor-mediated transactivation via two separate steps: competition with coactivators and the direct effects of its transcriptional repressor function.
4290	Multiprotein complexes are key determinants of Golgi apparatus structure and its capacity for intracellular transport and glycoprotein modification. Several complexes have been identified, including the Golgi transport complex (GTC), the LDLC complex, which is involved in glycosylation reactions, and the SEC34 complex, which is involved in vesicular transport. These 3 complexes are identical and have been termed the conserved oligomeric Golgi (COG) complex, which includes COG2 (Ungar et al., 2002).[supplied by OMIM]
4291	
4292	The high affinity IgE receptor is a key molecule involved in allergic reactions. It is a tetramer composed of 1 alpha, 1 beta, and 2 gamma chains. The gamma chains are also subunits of other Fc receptors.
4293	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene contains a RNA binding domain and scaffold-associated region (SAR)-specific bipartite DNA-binding domain. This protein is also thought to be involved in the packaging of hnRNA into large ribonucleoprotein complexes. During apoptosis, this protein is cleaved in a caspase-dependent way. Cleavage occurs at the SALD site, resulting in a loss of DNA-binding activity and a concomitant detachment of this protein from nuclear structural sites. But this cleavage does not affect the function of the encoded protein in RNA metabolism. Two alternatively spliced transcript variants have been described for this gene.
4294	
4295	
4296	
4297	DNA (cytosine-5-)-methyltransferase 1 has a role in the establishment and regulation of tissue-specific patterns of methylated cytosine residues. Aberrant methylation patterns are associated with certain human tumors and developmental abnormalities.
4298	The protein encoded by this gene is a G protein-coupled receptor and is a component of the heterodimeric amino acid taste receptor T1R1+3. The T1R1+3 receptor responds to L-amino acids but not to D-enantiomers or other compounds. Most amino acids that are perceived as sweet activate T1R1+3, and this activation is strictly dependent on an intact T1R1+3 heterodimer. Multiple transcript variants encoding several different isoforms have been found for this gene.
4299	
4300	
4301	
4302	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. This protein is a retina-specific ABC transporter with N-retinylidene-PE as a substrate. It is expressed exclusively in retina photoreceptor cells, indicating the gene product mediates transport of an essental molecule across the photoreceptor cell membrane. Mutations in this gene are found in patients diagnosed with Stargardt disease and are associated with retinitis pigmentosa-19 and macular degeneration age-related 2.
4303	The protein encoded by this gene is an evolutionarily conserved protein similar to yeast SSF1 as well as to the gene product of the Drosophila gene peter pan (ppan). SSF1 is known to be involved in the second step of mRNA splicing. Both SSF1 and ppan are essential for cell growth and proliferation. This gene was found to cotranscript with P2RY11/P2Y(11), an immediate downstream gene on the chromosome that encodes a ATP receptor. The chimeric transcripts of this gene and P2RY11 were found to be ubiquitously present and regulated during granulocytic differentiation. Exogenous expression of this gene was reported to reduce the anchorage-independent growth of some tumor cells.
4304	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes.  They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis.  Laminins are composed of 3 non identical chains: laminin alpha, beta and gamma (formerly A, B1, and B2, respectively) and they form a cruciform structure consisting of 3 short arms, each formed by a different chain, and a long arm composed of all 3 chains.  Each laminin chain is a multidomain protein encoded by a distinct gene.  Several isoforms of each chain have been described.  Different alpha, beta and gamma chain isomers combine to give rise to different heterotrimeric laminin isoforms which are designated by Arabic numerals in the order of their discovery, i.e. alpha1beta1gamma1 heterotrimer is laminin 1.  The biological functions of the different chains and trimer molecules are largely unknown, but some of the chains have been shown to differ with respect to their tissue distribution, presumably reflecting diverse functions in vivo. This gene encodes the gamma chain isoform laminin, gamma 2.  The gamma 2 chain, formerly thought to be a truncated version of beta chain (B2t), is highly homologous to the gamma 1 chain; however, it lacks domain VI, and domains V, IV and III are shorter.  It is expressed in several fetal tissues but differently from gamma 1, and is specifically localized to epithelial cells in skin, lung and kidney.  The gamma 2 chain together with alpha 3 and beta 3 chains constitute laminin 5 (earlier known as kalinin), which is an integral part of the anchoring filaments that connect epithelial cells to the underlying basement membrane.  The epithelium-specific expression of the gamma 2 chain implied its role as an epithelium attachment molecule, and mutations in this gene have been associated with junctional epidermolysis bullosa, a skin disease characterized by blisters due to disruption of the epidermal-dermal junction.  Two transcript variants resulting from alternative splicing of the 3@#$%& terminal exon, and encoding different isoforms of gamma 2 chain, have been described.  The two variants are differentially expressed in embryonic tissues, however, the biological significance of the two forms is not known.  Transcript variants utilizing alternative polyA_signal have also been noted in literature.
4305	
4306	SNAPAP is a component of the SNARE complex of proteins that is required for synaptic vesicle docking and fusion.[supplied by OMIM]
4307	
4308	
4309	
4310	
4311	
4312	
4313	This gene is a member of the Ets family of transcription factors and of the ternary complex factor (TCF) subfamily. Proteins of the TCF subfamily form a ternary complex by binding to the the serum response factor and the serum reponse element in the promoter of the c-fos proto-oncogene. The protein encoded by this gene is phosphorylated by the kinases, MAPK1 and MAPK8. Several transcript variants have been described for this gene.
4314	
4315	Leptin (LEP; MIM 164160), an adipocyte-specific hormone that regulates adipose-tissue mass through hypothalamic effects on satiety and energy expenditure, acts through the leptin receptor (LEPR), a single-transmembrane-domain receptor of the cytokine receptor family.[supplied by OMIM]
4316	The protein encoded by this gene is an enzyme that elongates the prenyl side-chain of coenzyme Q, or ubiquinone, one of the key elements in the respiratory chain. The gene product catalyzes the formation of all trans-polyprenyl pyrophosphates from isopentyl diphosphate in the assembly of polyisoprenoid side chains, the first step in coenzyme Q biosynthesis. The protein may be peripherally associated with the inner mitochondrial membrane, though no transit peptide has been definitively identified to date.
4317	
4318	
4319	
4320	
4321	Along with the microfilaments (actins) and microtubules (tubulins), the intermediate filaments represent a third class of well-characterized cytoskeletal elements. The subunits display a tissue-specific pattern of expression. Desmin (MIM 125660) is the subunit specific for muscle and vimentin the subunit specific for mesenchymal tissue.[supplied by OMIM]
4322	Along with the microfilaments (actins) and microtubules (tubulins), the intermediate filaments represent a third class of well-characterized cytoskeletal elements. The subunits display a tissue-specific pattern of expression. Desmin (MIM 125660) is the subunit specific for muscle and vimentin the subunit specific for mesenchymal tissue.[supplied by OMIM]
4323	The protein encoded by this gene is localized to the nucleus of endothelial cells and is induced by IL-1 and TNF-alpha stimulation.  Studies in rat cardiomyocytes suggest that this gene functions as a transcription factor.
4324	This gene, CYP2A6, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by phenobarbital. The enzyme is known to hydroxylate coumarin, and also metabolizes nicotine, aflatoxin B1, nitrosamines, and some pharmaceuticals. Individuals with certain allelic variants are said to have a poor metabolizer phenotype, meaning they do not efficiently metabolize coumarin or nicotine. This gene is part of a large cluster of cytochrome P450 genes from the CYP2A, CYP2B and CYP2F subfamilies on chromosome 19q. The gene was formerly referred to as CYP2A3; however, it has been renamed CYP2A6.
4325	
4326	
4327	
4328	
4329	Proteoglycans, which consist of a core protein and covalently linked glycosaminoglycans, are components of the extracellular matrix. SPOCK2 encodes a member of a novel Ca(2+)-binding proteoglycan family.[supplied by OMIM]
4330	
4331	
4332	
4333	The protein encoded by this gene is a member of the inorganic pyrophosphatase (PPase) family. PPases catalyze the hydrolysis of pyrophosphate to inorganic phosphate, which is important for the phosphate metabolism of cells. Studies of a similar protein in bovine suggested a cytoplasmic localization of this enzyme.
4334	The protein encoded by this gene contains a characteristic catalytic motif of the protein tyrosine phosphatases (PTPs) family. The PTP motif of this protein has the highly conserved arginine residue replaced by a proline residue; thus it may represent a distinct class of PTPs. Members of the PTP family are known to be signaling molecules that regulate a variety of cellular processes. This gene was found to preferentially expressed in both adult and fetal heart. A much lower expression level was detected in skeletal and smooth muscle tissues, and no expression was observed in other tissues. The tissue specific expression in the developing and adult heart suggests a role in regulating cardiac development and differentiation.
4335	
4336	
4337	Expression of the c-myc gene, which produces an oncogenic transcription factor, is tightly regulated in normal cells but is frequently deregulated in human cancers. The protein encoded by this gene is a transcriptional repressor thought to negatively regulate MYC function, and is therefore a potential tumor suppressor. This protein inhibits the transcriptional activity of MYC by competing for MAX, another basic helix-loop-helix protein that binds to MYC and is required for its function. Defects in this gene are frequently found in patients with prostate tumors. Three alternatively spliced transcripts encoding different isoforms have been described. Additional alternatively spliced transcripts may exist but the products of these transcripts have not been verified experimentally.
4338	
4339	
4340	
4341	APM2 gene is exclusively expressed in adipose tissue. Its function is currently unknown.
4342	GDP dissociation inhibitors are proteins that regulate the GDP-GTP exchange reaction of members of the rab family, small GTP-binding proteins of the ras superfamily, that are involved in vesicular trafficking of molecules between cellular organelles.  GDIs slow the rate of dissociation of GDP from rab proteins and release GDP from membrane-bound rabs.  GDI2 is ubiquitously expressed.  The GDI2 gene contains many repetitive elements indicating that it may be prone to inversion/deletion rearrangements.
4343	
4344	
4345	The protein encoded by this gene is a member of a family of RNA-binding proteins and possesses nucleolytic activity against cytotoxic lymphocyte target cells. The gene product is a cytotoxic granule-associated protein and has been shown to bind specifically to poly(A) homopolymers and to fragment DNA in permeabilized target cells. It has been suggested that members of this protein family may be involved in the induction of apoptosis. One isoform contains a lysosome-targeting motif in its C-terminal auxiliary domain; however, alternative splicing results in a T-cluster DNA-binding isoform, differing at the C-terminus where a hydrophobic sequence replaces the lysosome-targeting motif.
4346	Acidic fibroblast growth factor is mitogenic for a variety of different cell types and acts by stimulating mitogenesis or inducing morphological changes and differentiation. The FIBP protein is an intracellular protein that binds selectively to acidic fibroblast growth factor (aFGF). It is postulated that FIBP may be involved in the mitogenic action of aFGF. Two transcript variants encoding different isoforms have been found for this gene.
4347	
4348	
4349	
4350	
4351	
4352	
4353	This gene is a member of the N-acetylated alpha-linked acidic dipeptidase (NAALADase) gene family. The representative member of this family is the gene encoding human prostate-specific membrane antigen (PSM), which is a marker of prostatic carcinomas and is the first to be shown to possess NAALADase activity. NAALADase cleaves N-acetyl-L-aspartate-L-glutamate (NAAG), which is a neuropeptide expressed both in the central nervous systems and in the periphery and is thought to function as a neurotransmitter. The product of this gene is a type II integral membrane protein. Transient transfection of this gene confers both NAALADase and dipetidyl peptidase IV activities to mammalian cells. This gene is highly expressed in ovary and testis as well as within discrete brain areas.
4354	EMK (ELKL Motif Kinase) is a small family of ser/thr protein kinases involved in the control of cell polarity, microtubule stability and cancer. Several cDNA clones have been isolated that encoded two isoforms of the human ser/thr protein kinase EMK1. These isoforms were characterized by the presence of a 162-bp alternative exon that gave rise to two forms, one containing the exon and the other one lacking it. Both forms were found to be coexpressed in a number of selected cell lines and tissue samples. The human EMK1 was shown to be encoded by a single mRNA ubiquitously expressed.
4355	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds the immunosuppressants FK506 and rapamycin. It is thought to function as an ER chaperone and may also act as a component of membrane cytoskeletal scaffolds. This gene has two alternatively spliced transcript variants that encode the same isoform. Multiple polyadenylation sites have been described for this gene, but the full length nature of this gene has not been determined.
4356	
4357	The Fos gene family consists of 4 members: FOS, FOSB, FOSL1, and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation.
4358	
4359	
4360	This gene encodes pyruvate carboxylase, which requires biotin and ATP to catalyse the carboxylation of pyruvate to oxaloacetate. The active enzyme is a homotetramer arranged in a tetrahedron which is located exclusively in the mitochondrial matrix. Pyruvate carboxylase is involved in gluconeogenesis, lipogenesis, insulin secretion and synthesis of the neurotransmitter glutamate. Mutations in this gene have been associated with pyruvate carboxylase deficiency. Two transcript variants are encoded by this gene.
4361	SYT13 belongs to the large synaptotagmin protein family. All synaptotagmins show type I membrane topology, with an extracellular N terminus, a single transmembrane region, and a cytoplasmic C terminus containing tandem C2 domains. Major functions of synaptotagmins include vesicular traffic, exocytosis, and secretion.[supplied by OMIM]
4362	
4363	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. This gene encodes an enzyme which degrades fibronectin, laminin, collagens III, IV, IX, and X, and cartilage proteoglycans. The enzyme is thought to be involved in wound repair, progression of atherosclerosis, and tumor initiation. The gene is part of a cluster of MMP genes which localize to chromosome 11q22.3.
4364	
4365	
4366	
4367	This gene encodes a member of a family of small membrane proteins that share a 35-amino acid signature sequence domain, beginning with the sequence PFXYD and containing 7 invariant and 6 highly conserved amino acids. The approved human gene nomenclature for the family is FXYD-domain containing ion transport regulator. Mouse FXYD5 has been termed RIC (Related to Ion Channel). FXYD2, also known as the gamma subunit of the Na,K-ATPase, regulates the properties of that enzyme. FXYD1 (phospholemman), FXYD2 (gamma), FXYD3 (MAT-8), FXYD4 (CHIF), and FXYD5 (RIC) have been shown to induce channel activity in experimental expression systems. Transmembrane topology has been established for two family members (FXYD1 and FXYD2), with the N-terminus extracellular and the C-terminus on the cytoplasmic side of the membrane. The Type III integral membrane protein encoded by this gene is the gamma subunit of the Na,K-ATPase present on the plasma membrane. Although the Na,K-ATPase does not depend on the gamma subunit to be functional, it is thought that the gamma subunit modulates the enzyme@#$%&s activity by inducing ion channel activity. Mutations in this gene have been associated with renal hypomagnesaemia.
4368	
4369	
4370	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play distinct roles in cells. The protein encoded by this gene is one of the PKC family members. This protein kinase is expressed solely in the brain and spinal cord and its localization is restricted to neurons. It has been demonstrated that several neuronal functions, including long term potentiation (LTP) and long term depression (LTD), specifically require this kinase. Knockout studies in mice also suggest that this kinase may be involved in neuropathic pain development. Defects in this protein have been associated with neurodegenerative disorder spinocerebellar ataxia-14 (SCA14).
4371	This gene encodes a member of the membrane-spanning 4A gene family. Members of this protein family are characterized by common structural features and similar intron/exon splice boundaries and display unique expression patterns among hematopoietic cells and nonlymphoid tissues. This family member likely plays a role in signal transduction and may function as a subunit associated with receptor complexes. The gene encoding this protein is localized to 11q12, among a cluster of related family members. Alternative splicing may result in multiple transcript variants; however, not all variants have been fully described.
4372	
4373	Synaptotagmins, such as SYT7, are brain-specific calcium-dependent phospholipid-binding proteins that play a role in synaptic exocytosis and neurotransmitter release. See MIM 600782.[supplied by OMIM]
4374	
4375	
4376	
4377	
4378	
4379	
4380	This gene encodes a G-protein coupled receptor for gastrin and cholecystokinin (CCK), regulatory peptides of the brain and gastrointestinal tract. This protein is a type B gastrin receptor, which has a high affinity for both sulfated and nonsulfated CCK analogs and is found principally in the central nervous system and the gastrointestinal tract. A misspliced transcript variant including an intron has been observed in cells from colorectal and pancreatic tumors.
4381	
4382	
4383	
4384	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. A pseudogene corresponding to this gene is found on chromosome 6p.
4385	
4386	
4387	
4388	
4389	
4390	
4391	The VMD2 gene (MIM 607854) underlying vitelliform macular dystrophy and the 3 VMD2-like genes, VMD2L1, VMD2L2 (MIM 607336), and VMD2L3 (MIM 607337), encode transmembrane (TM) spanning proteins that share a homology region with a high content of aromatic residues including an invariant arginine (R), phenylalanine (F), and proline (P) motif. VMD2 and the 3 VMD2-related genes share a conserved gene structure with almost identical sizes of the 8 RFP-TM domain encoding exons and highly conserved positions of their corresponding exon-intron boundaries. Each of the 4 paralogous genes contains a unique 3-prime end of variable length without significant homology to known proteins or motifs (Stohr et al., 2002).[supplied by OMIM]
4392	This gene encodes a protein whose expression was found to increase in cultured cells at high density but not during growth arrest. This gene was also shown to have increased expression in cells overexpressing HER-2/neu proto-oncogene. The protein contains an SUI1 domain. In budding yeast, SUI1 is a translation initiation factor that along with eIF-2 and the initiator tRNA-Met, directs the ribosome to the proper translation start site. Proteins similar to SUI have been found in mammals, insects, and plants.
4393	
4394	The protein encoded by this gene belongs to the myosin superfamily. Myosins are molecular motors that, upon interaction with actin filaments, utilize energy from ATP hydrolysis to generate mechanical force. Each myosin has a conserved N-terminal motor domain that contains both ATP-binding and actin-binding sequences. Following the motor domain is a light-chain-binding @#$%&neck@#$%& region containing 1-6 copies of a repeat element, the IQ motif, that serves as a binding site for calmodulin or other members of the EF-hand superfamily of calcium-binding proteins. At the C-terminus, each myosin class has a distinct tail domain that serves in dimerization, membrane binding, protein binding, and/or enzymatic activities and targets each myosin to its particular subcellular location. The kidney epithelial cell line, LLC-PK1-CL4 (CL4), forms a well ordered brush border (BB) on its apical surface. Experiments indicate that the brush border population of the encoded protein turns over rapidly, while its head and tail domains interact transiently with the core actin and plasma membrane, respectively. A rapidly exchanging pool of the protein encoded by this gene envelops an actin core bundle that, by comparison, is static in structure.
4395	This gene encodes a member of a small family of proteins which bind single stranded DNA/RNA. These proteins are characterized by the presence of two sets of ribonucleoprotein consensus sequence (RNP-CS) that contain conserved motifs, RNP1 and RNP2, originally described in RNA binding proteins, and required for DNA binding. These proteins have been implicated in such diverse functions as DNA replication, gene transcription, cell cycle progression and apoptosis. Multiple transcript variants, resulting from alternative splicing and encoding different isoforms, have been described. Several of these were isolated by virtue of their binding to either strand of an upstream element of c-myc (MSSPs), or by phenotypic complementation of cdc2 and cdc13 mutants of yeast (scr2), or as a potential human repressor of HIV-1 and ILR-2 alpha promoter transcription (YC1). A pseudogene for this locus is found on chromosome 12.
4396	This gene encodes a member of the Hedgehog family. The hedgehog gene family encodes signaling molecules that play an important role in regulating morphogenesis. This protein is predicted to be made as a precursor that is autocatalytically cleaved; the N-terminal portion is soluble and contains the signalling activity while the C-terminal portion is involved in precursor processing. More importantly, the C-terminal product covalently attaches a cholesterol moiety to the N-terminal product, restricting the N-terminal product to the cell surface and preventing it from freely diffusing throughout the organism. Defects in this protein have been associated with partial gonadal dysgenesis (PGD) accompanied by minifascicular polyneuropathy. This protein may be involved in both male gonadal differentiation and perineurial development.
4397	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It may be involved in breast cancer, and its protein signaling is likely a molecular switch that governs adipogenesis. This protein is 96% identical to the mouse Wnt10b protein at the amino acid level. This gene is clustered with another family member, WNT1, in the chromosome 12q13 region.
4398	
4399	This gene belongs to the homeobox family, members of which encode a highly conserved family of transcription factors that play important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, consisting of 9 to 11 genes arranged in tandem, and located on different chromosomes. This gene is one of several HOXC genes located in a cluster on chromosome 12. Three genes, HOXC5, HOXC4 and HOXC6, are co-transcribed from a primary transcript. Subsequent processing results in gene-specific transcripts, which sometimes share the 5@#$%& non-coding exon. The product of this gene may contribute to the breast cell phenotype through co-operative interactions with other HOX-derived proteins and repression of their target genes. Two alternatively spliced transcripts encoding different isoforms have been characterized to date.
4400	This gene encodes a member of the DEAD box protein family. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The protein encoded by this gene interacts in a hormone-dependent manner with nuclear receptors and represses their transcriptional activity.
4401	
4402	
4403	
4404	
4405	This gene encodes a member of the arm-repeat (armadillo) and plakophilin gene families. Plakophilin proteins contain numerous armadillo repeats, localize to cell desmosomes and nuclei, and participate in linking cadherins to intermediate filaments in the cytoskeleton. This gene product may regulate the signaling activity of beta-catenin. Two alternately spliced transcripts encoding two protein isoforms have been identified. A processed pseudogene with high similarity to this locus has been mapped to chromosome 12p13.
4406	
4407	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. N-terminal part of this PTP contains two tandem Src homolog (SH2) domains, which act as protein phospho-tyrosine binding domains, and mediate the interaction of this PTP with its substrates. This PTP is expressed primarily in hematopoietic cells, and functions as an important regulator of multiple signaling pathways in hematopoietic cells. This PTP has been shown to interact with, and dephosphorylate a wide spectrum of phospho-proteins involved in hematopoietic cell signaling. Three alternatively spliced variants of this gene, which encode distinct isoforms, have been reported.
4408	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. This protein is functionally conserved in its capacity to release the vasoactive peptide, Lys-bradykinin, from low molecular weight kininogen.
4409	
4410	
4411	MYL2 encodes the regulatory light chain associated with cardiac myosin beta (or slow) heavy chain. Ca+ triggers the phosphorylation of regulatory light chain that in turn triggers contraction.  Mutations in MYL2 are associated with mid-left ventricular chamber type hypertrophic cardiomyopathy.
4412	
4413	
4414	This gene encodes the nuclear hormone receptor for vitamin D3. This receptor also functions as a receptor for the secondary bile acid lithocholic acid. The receptor belongs to the family of trans-acting transcriptional regulatory factors and shows sequence similarity to the steroid and thyroid hormone receptors. Downstream targets of this nuclear hormone receptor are principally involved in mineral metabolism though the receptor regulates a variety of other metabolic pathways, such as those involved in the immune response and cancer. Mutations in this gene are associated with type II vitamin D-resistant rickets. A single nucleotide polymorphism in the initiation codon results in an alternate translation start site three codons downstream. Alternative splicing results in multiple transcript variants encoding the same protein.
4415	
4416	
4417	
4418	
4419	The protein encoded by this gene is a membrane-bound arginine/lysine carboxypeptidase. Its expression is associated with monocyte to macrophage differentiation. This encoded protein contains hydrophobic regions at the amino and carboxy termini and has 6 potential asparagine-linked glycosylation sites. The active site residues of carboxypeptidases A and B are conserved in this protein. Three alternatively spliced transcript variants encoding the same protein have been described for this gene.
4420	
4421	
4422	
4423	The protein encoded by this gene contains a RING-H2 finger motif. Deletions and mutations in this gene were detected in esophageal squamous cell carcinoma (ESCC), suggesting that this protein may be a potential tumor suppressor. Studies of the mouse counterpart suggested a role of this protein in the transcription regulation that controls germinal differentiation. Four alternatively spliced transcript variants encoding two distinct isoforms have been reported.
4424	TSC22 encodes a transcription factor and belongs to the large family of early response genes.[supplied by OMIM]
4425	
4426	This gene may play a role in testicular physiology, spermatogenesis or spermiogenesis. Expression of the encoded protein is highest in the testis and down-regulated in testicular cancer. The gene is localized to the region 19q13.3 among the related RAS viral oncogene homolog (RRAS) and interferon regulatory factor 3 (IRF3) genes, which are both involved in tumorigenesis pathways and progression.
4427	
4428	
4429	
4430	This gene is homologous to the yeast gene SGT1, which encodes a protein involved in kinetochore function and required for the G1/S and G2/M transitions. Complementation studies suggest that the human protein has similar functions.
4431	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. The 28S subunit of the mammalian mitoribosome may play a crucial and characteristic role in translation initiation. This gene encodes a 28S subunit protein that has also been associated with type 1 diabetes; however, its relationship to the etiology of this disease remains to be clarified. Pseudogenes corresponding to this gene have been found on chromosomes 3 and 13.
4432	
4433	
4434	The protein encoded by this gene is a member of a family of membrane glycoproteins.  This glycoprotein provides selectins with carbohydrate ligands.  It may also play a role in tumor cell metastasis.
4435	
4436	
4437	
4438	The protein encoded by this gene is a plasma protease inhibitor and member of the serine protease inhibitor class.  Polymorphisms in this protein appear to be tissue specific and influence protease targeting.  Variation in this protein@#$%&s sequence have been implicated in Alzheimer@#$%&s disease, and deficiency of this protein has been associated with liver disease.  Mutations have been identified in patients with Parkinson disease and chronic obstructive pulmonary disease.
4439	
4440	
4441	
4442	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has one RRM domain that binds to RNAs. Two alternatively spliced transcript variants have been described for this gene.
4443	
4444	This gene encodes  poly(ADP-ribosyl)transferase-like 2 protein, which contains a catalytic domain and is capable of catalyzing a poly(ADP-ribosyl)ation reaction. This protein has a catalytic domain which is homologous to that of poly (ADP-ribosyl) transferase, but lacks an N-terminal DNA binding domain which activates the C-terminal catalytic domain of poly (ADP-ribosyl) transferase. The basic residues within the N-terminal region of this protein may bear potential DNA-binding properties, and may be involved in the nuclear and/or nucleolar targeting of the protein. Transcript variants of this gene have been detected, but have not been fully described.
4445	
4446	
4447	Basement membranes, which are composed of type IV collagens (see MIM 120130), laminins (see LAMC1; MIM 150290), perlecan (HSPG2; MIM 142461), and nidogen (see NID1; MIM 131390), are thin pericellular protein matrices that control a large number of cellular activities, including adhesion, migration, differentiation, gene expression, and apoptosis.[supplied by OMIM]
4448	
4449	
4450	
4451	Acylphosphatase is a small cytosolic enzyme that catalyzes the hydrolysis of the carboxyl-phosphate bond of acylphosphates. Two isoenzymes have been isolated, called muscle acylphosphatase and erythrocyte acylphosphatase, on the basis of their tissue localization. This gene encodes the erythrocyte acylphosphatase isoenzyme. Alternatively spliced transcript variants that encode different proteins were identified through data analysis.
4452	
4453	
4454	
4455	
4456	
4457	
4458	
4459	The protein encoded by this gene belongs to the BCL-2 protein family. BCL-2 family members form hetero- or homodimers and act as anti- or pro-apoptotic regulators that are involved in a wide variety of cellular activities. The protein encoded by this gene contains conserved BH4, BH1 and BH2 domains. This protein can interact with other members of BCL-2 protein family including BCL2, BCL2L1/BCL-X(L), and BAX. Overexpression of this gene has been shown to suppress cell apoptosis possibly through the prevention of cytochrome C release from the mitochondria, and thus activating caspase-3 activation. The mouse counterpart of this protein is found to interact with Apaf1 and forms a protein complex with Caspase 9, which suggests the involvement of this protein in APAF1 and CASPASE 9 related apoptotic pathway.
4460	
4461	Mitochondrial creatine (MtCK) kinase is responsible for the transfer of high energy phosphate from mitochondria to the cytosolic carrier, creatine. It belongs to the creatine kinase isoenzyme family. It exists as two isoenzymes, sarcomeric MtCK and ubiquitous MtCK, encoded by separate genes. Mitochondrial creatine kinase occurs in two different oligomeric forms: dimers and octamers, in contrast to the exclusively dimeric cytosolic creatine kinase isoenzymes. Many malignant cancers with poor prognosis have shown overexpression of ubiquitous mitochondrial creatine kinase; this may be related to high energy turnover and failure to eliminate cancer cells via apoptosis. Ubiquitous mitochondrial creatine kinase has 80% homology with the coding exons of sarcomeric mitochondrial creatine kinase. Two genes located near each other on chromosome 15 have been identified which encode identical mitochondrial creatine kinase proteins.
4462	Mitochondrial creatine (MtCK) kinase is responsible for the transfer of high energy phosphate from mitochondria to the cytosolic carrier, creatine. It belongs to the creatine kinase isoenzyme family. It exists as two isoenzymes, sarcomeric MtCK and ubiquitous MtCK, encoded by separate genes. Mitochondrial creatine kinase occurs in two different oligomeric forms: dimers and octamers, in contrast to the exclusively dimeric cytosolic creatine kinase isoenzymes. Many malignant cancers with poor prognosis have shown overexpression of ubiquitous mitochondrial creatine kinase; this may be related to high energy turnover and failure to eliminate cancer cells via apoptosis. Ubiquitous mitochondrial creatine kinase has 80% homology with the coding exons of sarcomeric mitochondrial creatine kinase. Two genes located near each other on chromosome 15 have been identified which encode identical mitochondrial creatine kinase proteins.
4463	
4464	
4465	
4466	
4467	Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide. They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid. This gene product is a type I membrane protein that is highly expressed in normal tissues, such as kidney, colon and pancreas, and has been found to be overexpressed in 10% of clear cell renal carcinomas. Two transcript variants encoding different isoforms have been identified for this gene.
4468	This gene encodes a member of the mammalian PIAS [protein inhibitor of activated STAT-1 (signal transducer and activator of transcription-1)] family. This member contains a putative zinc-binding motif and a highly acidic region. It inhibits STAT1-mediated gene activation and the DNA binding activity, binds to Gu protein/RNA helicase II/DEAD box polypeptide 21, and interacts with androgen receptor (AR). It functions in testis as a nuclear receptor transcriptional coregulator and may have a role in AR initiation and maintenance of spermatogenesis.
4469	This gene encodes a member of the mammalian PIAS [protein inhibitor of activated STAT-1 (signal transducer and activator of transcription-1)] family. This member contains a putative zinc-binding motif and a highly acidic region. It inhibits STAT1-mediated gene activation and the DNA binding activity, binds to Gu protein/RNA helicase II/DEAD box polypeptide 21, and interacts with androgen receptor (AR). It functions in testis as a nuclear receptor transcriptional coregulator and may have a role in AR initiation and maintenance of spermatogenesis.
4470	The protein encoded by this gene is a 36-kD water-soluble protein which carries 11-cis-retinaldehyde or 11-cis-retinal as physiologic ligands. It may be a functional component of the visual cycle. Mutations of this gene have been associated with severe rod-cone dystrophy, Bothnia dystrophy (nonsyndromic autosomal recessive retinitis pigmentosa) and retinitis punctata albescens.
4471	
4472	The nicotinic acetylcholine receptors (nAChRs) are members of a superfamily of ligand-gated ion channels that mediate fast signal transmission at synapses. The nAChRs are thought to be hetero-pentamers composed of homologous subunits. The proposed structure for each subunit is a conserved N-terminal extracellular domain followed by three conserved transmembrane domains, a variable cytoplasmic loop, a fourth conserved transmembrane domain, and a short C-terminal extracellular region. The protein encoded by this gene forms a homo-oligomeric channel, displays marked permeability to calcium ions and is a major component of brain nicotinic receptors that are blocked by, and highly sensitive to, alpha-bungarotoxin. Once this receptor binds acetylcholine, it undergoes an extensive change in conformation that affects all subunits and leads to opening of an ion-conducting channel across the plasma membrane. This gene is located in a region identified as a major susceptibility locus for juvenile myoclonic epilepsy and a chromosomal location involved in the genetic transmission of schizophrenia. An evolutionarily recent partial duplication event in this region results in a hybrid containing sequence from this gene and a novel FAM7A gene.
4473	This gene, CYP1A1, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by some polycyclic aromatic hydrocarbons (PAHs), some of which are found in cigarette smoke. The enzyme@#$%&s endogenous substrate is unknown; however, it is able to metabolize some PAHs to carcinogenic intermediates. The gene has been associated with lung cancer risk. A related family member, CYP1A2, is located approximately 25 kb away from CYP1A1 on chromosome 15.
4474	
4475	
4476	
4477	
4478	
4479	
4480	
4481	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein. Alternate splice variants of this gene have been described but their full-length nature has not been determined.
4482	Cytochrome b is comprised of a light chain (alpha) and a heavy chain (beta). This gene encodes the light, alpha subunit which has been proposed as a primary component of the microbicidal oxidase system of phagocytes. Mutations in this gene are associated with autosomal recessive chronic granulomatous disease (CGD), that is characterized by the failure of activated phagocytes to generate superoxide, which is important for the microbicidal activity of these cells.
4483	
4484	
4485	Initiation of transcription by RNA polymerase I requires the formation of a complex composed of the TATA-binding protein (TBP) and three TBP-associated factors (TAFs) specific for RNA polymerase I. This complex, known as SL1, binds to the core promoter of ribosomal RNA genes to position the polymerase properly and acts as a channel for regulatory signals. This gene encodes the largest SL1-specific TAF. Two transcripts encoding different isoforms have been identified.
4486	
4487	
4488	
4489	
4490	This gene encodes a member of the hydratase/isomerase superfamily. The protein encoded is a key mitochondrial enzyme involved in beta-oxidation of unsaturated fatty acids. It catalyzes the transformation of 3-cis and 3-trans-enoyl-CoA esters arising during the stepwise degradation of cis-, mono-, and polyunsaturated fatty acids to the 2-trans-enoyl-CoA intermediates. Alternatively spliced transcript variants have been described, but their full-length nature has not been determined.
4491	This gene encodes a secreted multidomain protein consisting of a signal peptide, a WAP domain, a follistatin domain, an immunoglobulin domain, two tandem Kunitz domains, and an NTR domain. These domains have been implicated frequently in inhibition of various types of proteases, suggesting that the encoded protein may be a multivalent protease inhibitor and may control the action of multiple types of serine proteases as well as metalloproteinases.
4492	
4493	
4494	
4495	This gene encodes a member of the Golgi-localized, gamma adaptin ear-containing, ARF-binding (GGA) family. This family includes ubiquitous coat proteins that regulate the trafficking of proteins between the trans-Golgi network and the lysosome. These proteins share an amino-terminal VHS domain which mediates sorting of the mannose 6-phosphate receptors at the trans-Golgi network. They also contain a carboxy-terminal region with homology to the ear domain of gamma-adaptins. This family member may play a significant role in cargo molecules regulation and clathrin-coated vesicle assembly. Alternative splicing of this gene results in two transcript variants.
4496	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
4497	
4498	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling,as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. This gene encodes an enzyme which degrades type IV collagen, the major structural component of basement membranes. The enzyme plays a role in endometrial menstrual breakdown, regulation of vascularization and the inflammatory response.
4499	
4500	
4501	
4502	
4503	
4504	
4505	In a study identifying genes in rat that are upregulated in response to nerve damage, a gene which is highly expressed in ganglia and in the central nervous system was found. The protein encoded by the rat gene contains both a zinc finger and a RING finger motif and is localized in the endosome/lysosome compartment, indicating that it may be involved in ubiquitin-mediated protein modification. The protein encoded by this human gene is highly similar in sequence to that encoded by the rat gene.
4506	
4507	
4508	This gene encodes an enzyme which converts phosphatidylethanolamine to phosphatidylcholine by sequential methylation in the liver. The protein localizes to the endoplasmic reticulum and mitochondria-associated membranes. The gene is within the Smith-Magenis syndrome region on chromosome 17. Alternate splicing of this gene results in three transcript variants encoding two different isoforms.
4509	
4510	
4511	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins.
4512	
4513	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the ATPase subunits, a member of the triple-A family of ATPases which have a chaperone-like activity. In addition to participation in proteasome functions, this subunit may participate in transcriptional regulation since it has been shown to interact with the thyroid hormone receptor and retinoid X receptor-alpha.
4514	The protein encoded by this gene is the ER to nucleus signalling 1 protein, a human homologue of the yeast Ire1 gene product. This protein possesses intrinsic kinase activity and an endoribonuclease activity and it is important in altering gene expression as a response to endoplasmic reticulum-based stress signals. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
4515	This protein is expressed by in vitro differentiated macrophages but not freshly isolated monocytes. Although sequence analysis identifies seven potential transmembrane domains, this protein has little homology to G-protein receptors and it has not been positively identified as a receptor. A suggested alternative function is that of an ion channel protein in maturing macrophages.
4516	This gene encodes a protein with similarity to a bovine microfibril-associated protein. The protein has binding specificities for both collagen and carbohydrate. It is thought to be an extracellular matrix protein which is involved in cell adhesion or intercellular interactions. The gene is located within the Smith-Magenis syndrome region.
4517	This gene encodes the heavy chain subunit of axonemal dynein, a large multi-subunit molecular motor. Axonemal dynein attaches to microtubules and hydrolyzes ATP to mediate the movement of cilia and flagella. The gene expresses at least two transcript variants; additional variants have been described, but their full length nature has not been determined.
4518	
4519	
4520	Calpains are a family of cytosolic calcium-activated cysteine proteases involved in a variety of cellular processes including apoptosis, cell division, modulation of integrin-cytoskeletal interactions, and synaptic plasticity (Dear et al., 2000). CAPN12 belongs to the calpain large subunit family.[supplied by OMIM]
4521	This gene encodes an integral membrane protein which is classified as a member of the metalloproteinase family.  This enzyme is thought to function in the maintenance and processing of CAAX-type prenylated proteins.
4522	
4523	
4524	The product of this gene belongs to the family of basic helix-loop-helix leucine zipper (bHLH-Zip) transcription factors. These factors form heterodimers with Mad proteins and play a role in proliferation, determination and differentiation. This gene product may act to diversify Mad family function by its restricted association with a subset of the Mad family of transcriptional repressors, namely, Mad1 and Mad4. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
4525	
4526	This gene encodes a member of the tob/btg1 family of anti-proliferative proteins that have the potential to regulate cell growth. When exogenously expressed, this protein supresses cell growth in tissue culture. The protein undergoes phophorylation by a serine/threonine kinase, 90 kDa ribosomal S6 kinase. Interactions of this protein with the v-erb-b2 erythroblastic leukemia viral oncogene homolog 2 gene product p185 interferes with growth suppression. This protein inhibits T cell proliferation and transcription of cytokines and cyclins. The protein interacts with both mothers against decapentaplegic Drosophila homolog 2 and 4 to enhance their DNA binding activity. This interaction inhibits interleukin 2 transcription in T cells.
4527	Many vertebrate homeo box-containing genes have been identified on the basis of their sequence similarity with Drosophila developmental genes. Members of the Dlx gene family contain a homeobox that is related to that of Distal-less (Dll), a gene expressed in the head and limbs of the developing fruit fly. The Distal-less (Dlx) family of genes comprises at least 6 different members, DLX1-DLX6. The DLX proteins are postulated to play a role in forebrain and craniofacial development. Three transcript variants have been described for this gene, however, the full length nature of one variant has not been described. Studies of the two splice variants revealed that one encoded isoform functions as a repressor of the beta-globin gene while the other isoform lacks that function.
4528	
4529	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). There are three separate genes which encode the subunit c of the proton channel and they specify precursors with different import sequences but identical mature proteins. The protein encoded by this gene is one of three precursors of subunit c. Alternatively spliced transcript variants encoding the same isoform have been identified.
4530	
4531	
4532	
4533	
4534	Claudins, such as CLDN7, are involved in the formation of tight junctions between epithelial cells. Tight junctions restrict lateral diffusion of lipids and membrane proteins, and thereby physically define the border between the apical and basolateral compartments of epithelial cells (Zheng et al., 2003).[supplied by OMIM]
4535	
4536	This gene encodes a serine/threonine kinase belonging to the germinal center kinase (GCK) family. The protein is structurally similar to the kinases that are related to NIK and may belong to a distinct subfamily of NIK-related kinases within the GCK family. Studies of the mouse homolog indicate an up-regulation of expression in the course of postnatal mouse cerebral development and activation of the cJun N-terminal kinase (JNK) and the p38 pathways. Alternative splicing occurs at this locus and three transcript variants encoding distinct isoforms have been identified.
4537	This gene encodes a serine/threonine kinase belonging to the germinal center kinase (GCK) family. The protein is structurally similar to the kinases that are related to NIK and may belong to a distinct subfamily of NIK-related kinases within the GCK family. Studies of the mouse homolog indicate an up-regulation of expression in the course of postnatal mouse cerebral development and activation of the cJun N-terminal kinase (JNK) and the p38 pathways. Alternative splicing occurs at this locus and three transcript variants encoding distinct isoforms have been identified.
4538	
4539	This gene encodes the puromycin-sensitive aminopeptidase, a zinc metallopeptidase which hydrolyzes amino acids from the N-terminus of its substrate. The protein has been localized to both the cytoplasm and to cellular membranes. This enzyme degrades enkaphalins in the brain, and studies in mouse suggest that it is involved in proteolytic events regulating the cell cycle.
4540	Synaptobrevins/VAMPs, syntaxins, and the 25-kD synaptosomal-associated protein SNAP25 are the main components of a protein complex involved in the docking and/or fusion of synaptic vesicles with the presynaptic membrane. VAMP2 is a member of the vesicle-associated membrane protein (VAMP)/synaptobrevin family. VAMP2 is a likely candidate gene for familial infantile myasthenia (FIMG) because of its map location and because it encodes a synaptic vesicle protein of the type that has been implicated in the pathogenesis of FIMG. VAMP2 is thought to participate in neurotransmitter release at a step between docking and fusion. The protein forms a stable complex with syntaxin, synaptosomal-associated protein, 25 kD, and synaptotagmin. It also forms a distinct complex with synaptophysin.
4541	This gene encodes a member of the delta protein ligand family. This family functions as Notch ligands that are characterized by a DSL domain, EGF repeats, and a transmembrane domain. Mutations in this gene cause autosomal recessive spondylocostal dysostosis 1. Two transcript variants encoding distinct isoforms have been identified for this gene.
4542	The protein encoded by this gene is part of the large ATP-dependent chromatin remodeling complex SWI/SNF, which is required for transcriptional activation of genes normally repressed by chromatin. The encoded protein, either alone or when in the SWI/SNF complex, can bind to 4-way junction DNA, which is thought to mimic the topology of DNA as it enters or exits the nucleosome. The protein contains a DNA-binding HMG domain, but disruption of this domain does not abolish the DNA-binding or nucleosome-displacement activities of the SWI/SNF complex. Unlike most of the SWI/SNF complex proteins, this protein has no yeast counterpart.
4543	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the high sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
4544	
4545	This gene encodes a protein of unknown function. The gene is located within the Smith-Magenis syndrome region on chromosome 17.
4546	The ASPA gene product, aspartoacylase, catalyzes the conversion of N-acetyl_L-aspartic acid (NAA) to aspartate and acetate.  NAA is abundant in the brain where hydrolysis by aspartoacylase is thought to help maintain white matter.  The ASPA gene product is an NAA scavenger in other tissues.  Defects in ASPA cause Canavan disease.
4547	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. This gene product is the human homolog of mouse Tbx2, and shares strong sequence similarity with Drosophila omb protein. Expression studies indicate that this gene may have a potential role in tumorigenesis as an immortalizing agent. Transcript heterogeneity due to alternative polyadenylation has been noted for this gene.
4548	
4549	Desmosomes are cell-cell junctions between epithelial, myocardial and certain other cell types. Desmoglein 1 is a calcium-binding transmembrane glycoprotein component of desmosomes in vertebrate epithelial cells. Currently, three desmoglein subfamily members have been identified and all are members of the cadherin cell adhesion molecule superfamily. These desmoglein gene family members are located in a cluster on chromosome 18. The protein encoded by this gene has been identified as the autoantigen of the autoimmune skin blistering disease pemphigus foliaceus.
4550	This gene is the cellular homolog of the Yamaguchi sarcoma virus oncogene. The encoded protein has tyrosine kinase activity and belongs to the src family of proteins. This gene lies in close proximity to thymidylate synthase gene on chromosome 18, and a corresponding pseudogene has been found on chromosome 22.
4551	
4552	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein contains multiple WD repeats. It has been demonstrated to be greatly elevated in numerous malignant, transformed human cell lines, and shows a striking correlation with both TGF-beta resistance and metastatic potential. Alternative splicing of this gene generates 2 transcript variants.
4553	
4554	Sialic acid-binding immunoglobulin-like lectins (SIGLECs) are a family of cell surface proteins belonging to the immunoglobulin superfamily. They mediate protein-carbohydrate interactions by selectively binding to different sialic acid moieties present on glycolipids and glycoproteins. This gene encodes a member of the SIGLEC3-like subfamily of SIGLECs. Members of this subfamily are characterized by an extracellular V-set immunoglobulin-like domain followed by two C2-set immunoglobulin-like domains, and the cytoplasmic tyrosine-based motifs ITIM and SLAM-like. The encoded protein, upon tyrosine phosphorylation, has been shown to recruit the Src homology 2 domain-containing protein-tyrosine phosphatases SHP1 and SHP2. It has been suggested that the protein is involved in the negative regulation of macrophage signaling by functioning as an inhibitory receptor. This gene is located in a cluster with other SIGLEC3-like genes on 19q13.4. Alternatively spliced transcript variants encoding distinct isoforms have been described for this gene.
4555	
4556	
4557	
4558	
4559	This gene encodes the receptor for urokinase plasminogen activator and, given its role in localizing and promoting plasmin formation, likely influences many normal and pathological processes related to cell-surface plasminogen activation and localized degradation of the extracellular matrix. It binds both the proprotein and mature forms of urokinase plasminogen activator and permits the activation of the receptor-bound pro-enzyme by plasmin. The protein lacks transmembrane or cytoplasmic domains and may be anchored to the plasma membrane by a glycosyl-phosphatidylinositol (GPI) moiety following cleavage of the nascent polypeptide near its carboxy-terminus. However, a soluble protein is also produced in some cell types. Alternative splicing results in multiple transcript variants encoding different isoforms. The proprotein experiences several post-translational cleavage reactions that have not yet been fully defined.
4560	The protein encoded by this gene is a member of the intercellular adhesion molecule (ICAM) family. All ICAM proteins are type I transmembrane glycoproteins, contain 2-9 immunoglobulin-like C2-type domains, and bind to the leukocyte adhesion LFA-1 protein. This protein is constitutively and abundantly expressed by all leucocytes and may be the most important ligand for LFA-1 in the initiation of the immune response. It functions not only as an adhesion molecule, but also as a potent signalling molecule.
4561	
4562	
4563	
4564	The activation of gene transcription is a multistep process that is triggered by factors that recognize transcriptional enhancer sites in DNA. These factors work with co-activators to direct transcriptional initiation by the RNA polymerase II apparatus. The protein encoded by this gene is a subunit of the CRSP (cofactor required for SP1 activation) complex, which, along with TFIID, is required for efficient activation by SP1. This protein is also a component of other multisubunit complexes e.g. thyroid hormone receptor-(TR-) associated proteins which interact with TR and facilitate TR function on DNA templates in conjunction with initiation factors and cofactors.
4565	The protein encoded by this gene is a member of the RAS superfamily which are small GTP/GDP-binding proteins with an average size of 200 amino acids. The RAS-related proteins of the RAB/YPT family may play a role in the transport of proteins from the endoplasmic reticulum to the Golgi and the plasma membrane. This protein shares 97%, 96%, and 51% similarity with the dog RAB8, mouse MEL, and mouse YPT1 proteins, respectively and contains the 4 GTP/GDP-binding sites that are present in all the RAS proteins. The putative effector-binding site of this protein is similar to that of the RAB/YPT proteins. However, this protein contains a C-terminal CAAX motif that is characteristic of many RAS superfamily members but which is not found in YPT1 and the majority of RAB proteins. Although this gene was isolated as a transforming gene from a melanoma cell line, no linkage between MEL and malignant melanoma has been demonstrable. This oncogene is located 800 kb distal to MY09B on chromosome 19p13.1.
4566	
4567	The human leukocyte differentiation antigen CD23 (FCE2) is a key molecule for B-cell activation and growth. It is the low-affinity receptor for IgE. The truncated molecule can be secreted, then functioning as a potent mitogenic growth factor.[supplied by OMIM]
4568	
4569	
4570	
4571	Of the multiple RNases H in mammals, RNase HI is the major enzyme and shows increased activity during DNA replication. It shows more homology to the RNase HII of Escherichia coli.
4572	This gene, which encodes a member of the serine/threonine kinase family, regulates cell polarity and functions as a tumor suppressor. Mutations in this gene have been associated with Peutz-Jeghers syndrome, an autosomal dominant disorder characterized by the growth of polyps in the gastrointestinal tract, pigmented macules on the skin and mouth, and other neoplasms. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
4573	
4574	Potassium channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. Four sequence-related potassium channel genes - shaker, shaw, shab, and shal - have been identified in Drosophila, and each has been shown to have human homolog(s). This gene encodes a member of the potassium channel, voltage-gated, shaker-related subfamily. This member contains six membrane-spanning domains with a shaker-type repeat in the fourth segment. The gene is expressed preferentially in skeletal muscle, heart and kidney. It is a candidate gene for inherited cardiac disorders.
4575	
4576	
4577	This gene encodes an integral membrane protein. The gene belongs to the synaptogyrin gene family. Like other members of the family the protein contains four transmembrane regions. The exact function of this protein is unclear.
4578	
4579	
4580	
4581	
4582	The protein encoded by this gene is a member of the Ras guanyl nucleotide-releasing protein (RasGRP) family of Ras guanine nucleotide exchange factors. It contains a Ras exchange motif, a diacylglycerol-binding domain, and two calcium-binding EF hands. This protein was shown to activate H-Ras in a cation-dependent manner in vitro. Expression of this protein in myeloid cell lines was found to be correlated with elevated level of activated RAS protein, and the RAS activation can be greatly enhanced by phorbol ester treatment, which suggested a role of this protein in diacylglycerol regulated cell signaling pathways. Studies of a mast cell leukemia cell line expressing substantial amounts of abnormal transcripts of this gene indicated that this gene may play an important role in the final stages of mast cell development. Four alternatively spliced transcript variants of this gene encoding three distinct isoforms have been reported.
4583	Death-associated protein kinase 3 (DAPK3) induces morphological changes in apoptosis when overexpressed in mammalian cells.  These results suggest that DAPK3 may play a role in the induction of apoptosis.
4584	Lysophospholipases are enzymes that act on biological membranes to regulate the multifunctional lysophospholipids. The protein encoded by this gene has lysophospholipase activity. It is composed of two identical subunits which are held together by disulfide bonds. This protein has structural similarity to several members of the beta-galactoside-binding S-type lectin family.
4585	
4586	
4587	
4588	
4589	
4590	The protein encoded by this gene shares similarity with several thiamine pyrophosphate-binding proteins identified in bacteria, yeast, and plants. The highest degree of similarity is found with bacterial acetolactate synthases (AHAS), which are enzymes that catalyze the first step in branched-chain amino acid biosynthesis. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
4591	This gene is a member of the ARID (AT-rich interaction domain) family of proteins which bind DNA. It was found by its homology to the Drosophila dead ringer gene, which is important for normal embryogenesis. Other ARID family members have roles in embryonic patterning, cell lineage gene regulation, cell cycle control, transcriptional regulation and possibly in chromatin structure modification.
4592	
4593	
4594	Major histocompatibility (MHC) class II molecules are transmembrane proteins that have a central role in development and control of the immune system. The protein encoded by this gene, along with regulatory factor X-associated protein and regulatory factor-5, forms a complex that binds to the X box motif of certain MHC class II gene promoters and activates their transcription. Once bound to the promoter, this complex associates with the non-DNA-binding factor MHC class II transactivator, which controls the cell type specificity and inducibility of MHC class II gene expression. This protein contains ankyrin repeats involved in protein-protein interactions. Mutations in this gene have been linked to bare lymphocyte syndrome type II, complementation group B. Two transcript variants encoding different isoforms have been described for this gene, with only one isoform showing activation activity.
4595	This gene encodes a protein expressed in the eye.
4596	The protein encoded by this gene is similar to proline dehydrogenase (oxidase) 1, a mitochondrial enzyme which catalyzes the first step in proline catabolism. The function of this protein has not been determined.
4597	
4598	
4599	Bone morphogenetic proteins (e.g., BMP5; MIM 112265) are members of the transforming growth factor-beta (see TGFB1; MIM 190180) superfamily and regulate tissue differentiation and maintenance. They are synthesized as precursor molecules that are processed at a dibasic cleavage site to release C-terminal domains containing a characteristic motif of 7 conserved cysteines in the mature protein.[supplied by OMIM]
4600	
4601	In mammals, the Y chromosome directs the development of the testes and plays an important role in spermatogenesis. A high percentage of infertile men have deletions that map to regions of the Y chromosome. The DAZ (deleted in azoospermia) gene cluster maps to the AZFc region of the Y chromosome and is deleted in many azoospermic and severely oligospermic men. It is thought that the DAZ gene cluster arose from the transposition, amplification, and pruning of the ancestral autosomal gene DAZL also involved in germ cell development and gametogenesis. This gene encodes a RNA-binding protein with two RNP motifs that was originally identified by its interaction with the infertility factors DAZ and DAZL. Two isoforms are encoded by transcript variants of this gene.
4602	This gene encodes the fifth largest subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. This subunit is shared by the other two DNA-directed RNA polymerases and is present in two-fold molar excess over the other polymerase subunits. An interaction between this subunit and a hepatitis virus transactivating protein has been demonstrated, suggesting that interaction between transcriptional activators and the polymerase can occur through this subunit. A pseudogene is located on chromosome 11.
4603	
4604	
4605	
4606	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This protein contains a C-terminal PTP domain and an N-terminal domain homologous to the band 4.1 superfamily of cytoskeletal-associated proteins. This PTP has been shown to interact with glutamate receptor delta 2 and epsilon subunits, and is thought to play a role in signalling downstream of the glutamate receptors through tyrosine dephosphorylation.
4607	
4608	
4609	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation.
4610	
4611	
4612	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Gamma-crystallins are a homogeneous group of highly symmetrical, monomeric proteins typically lacking connecting peptides and terminal extensions. They are differentially regulated after early development. Four gamma-crystallin genes (gamma-A through gamma-D) and three pseudogenes (gamma-E, gamma-F, gamma-G) are tandemly organized in a genomic segment as a gene cluster. Whether due to aging or mutations in specific genes, gamma-crystallins have been involved in cataract formation.
4613	
4614	This gene encodes a 14 kDa protein subunit of the splicing factor 3b complex. Splicing factor 3b associates with both the U2 and U11/U12 small nuclear ribonucleoprotein complexes (U2 snRNP) of spliceosomes. This 14 kDa protein interacts directly with subunit 1 of the splicing factor 3b complex. This 14 kDa protein also interacts directly with the adenosine that carries out the first transesterification step of splicing at the pre-mRNA branch site.
4615	
4616	
4617	
4618	
4619	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of the vertebrate eye, which function to maintain the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also defined as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Beta-crystallins, the most heterogeneous, differ by the presence of the C-terminal extension (present in the basic group but absent in the acidic group). Beta-crystallins form aggregates of different sizes and are able to form homodimers through self-association or heterodimers with other beta-crystallins. This gene is a beta acidic group member. Three alternatively spliced transcript variants encoding identical proteins have been reported.
4620	The Golgi complex plays a key role in the sorting and modification of proteins exported from the endoplasmic reticulum. The protein encoded by this gene is involved in establishing the stacked structure of the Golgi apparatus.
4621	
4622	
4623	This gene encodes a translation elongation factor. The protein is a guanine nucleotide exchange factor involved in the transfer of aminoacylated tRNAs to the ribosome. Alternative splicing results in two transcript variants which differ only in the 5@#$%& UTR.
4624	The protein encoded by this gene shares similarity with mouse Hs1bp3, an Hcls1/Hs1-interacting protein that may be involved in lymphocyte activation. Two alternatively spliced transcript variants of this gene, which encode distinct isoforms, have been reported.
4625	
4626	
4627	
4628	
4629	
4630	The protein encoded by this gene is a secretory protein that contains a hyaluronan-binding domain, and thus is a member of the hyaluronan-binding protein family. The hyaluronan-binding domain is known to be involved in extracellular matrix stability and cell migration. This protein has been shown to form a stable complex with inter-alpha-inhibitor (I alpha I), and thus enhance the serine protease inhibitory activity of I alpha I, which is important in the protease network associated with inflammation. The expression of this gene can be induced by tumor necrosis factor alpha and interleukin-1. The expression can also be induced by mechanical stimuli in vascular smooth muscle cells, and is found to be correlated with proteoglycan synthesis and aggregation.
4631	This gene encodes a member of the gelsolin/villin family of actin-regulatory proteins. The encoded protein reversibly blocks the barbed ends of F-actin filaments in a Ca2+ and phosphoinositide-regulated manner, but does not sever preformed actin filaments. By capping the barbed ends of actin filaments, the encoded protein contributes to the control of actin-based motility in non-muscle cells. Alternatively spliced transcript variants have been observed, but have not been fully described.
4632	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Sequence analysis identified three transcript variants. Pseudogenes corresponding to this gene are found on chromosomes 6p, 10q, and Xp.
4633	Retrovirus receptors allow infection of human and murine cells by various retroviruses. The receptors that have been identified at the molecular level include CD4 (MIM 186940) for human immunodeficiency virus, Rec1 for murine ecotropic virus, and GLVR1 for gibbon ape leukemia virus (see MIM 182090). These 3 proteins show no homology to one another at the DNA or protein level. GLVR1 is a sodium-dependent phosphate symporter.[supplied by OMIM]
4634	
4635	
4636	POH1 (pad one homolog-1) is a component of the 26S proteasome, a multiprotein complex that degrades proteins targeted for destruction by the ubiquitin pathway (Spataro et al., 1997).[supplied by OMIM]
4637	
4638	
4639	The protein encoded by this gene is a member of the bone morphogenetic protein (BMP) family and the TGF-beta superfamily. This group of proteins is characterized by a polybasic proteolytic processing site which is cleaved to produce a mature protein containing seven conserved cysteine residues. The members of this family are regulators of cell growth and differentiation in both embryonic and adult tissues. This gene is thought to encode a secreted protein which negatively regulates skeletal muscle growth.
4640	
4641	
4642	
4643	
4644	
4645	
4646	
4647	This gene encodes the rate-limiting enzyme of the polyamine biosynthesis pathway which catalyzes ornithine to putrescine. The activity level for the enzyme varies in response to growth-promoting stimuli and exhibits a high turnover rate in comparison to other mammalian proteins. Originally localized to both chromosomes 2 and 7, the gene encoding this enzyme has been determined to be located on 2p25, with a pseudogene located on 7q31-qter.
4648	
4649	
4650	Initiation of transcription by RNA polymerase I requires the formation of a complex composed of the TATA-binding protein (TBP) and three TBP-associated factors (TAFs) specific for RNA polymerase I. This complex, known as SL1, binds to the core promoter of ribosomal RNA genes to position the polymerase properly and acts as a channel for regulatory signals. This gene encodes one of the SL1-specific TAFs.
4651	
4652	The protein encoded by this gene belongs to the stathmin/oncoprotein 18 family of microtubule-destabilizing phosphoproteins. It is similar to the SCG10 protein and is involved in signal transduction and regulation of microtubule dynamics.
4653	
4654	
4655	
4656	
4657	
4658	The product of this gene belongs to the prefoldin beta subunit family. Prefoldin is a heterohexameric chaperone protein which assists in the correct folding of other proteins. It consists of two PFD-alpha type and four PFD-beta type subunits. It binds specifically to cytosolic chaperonin and transfers target proteins. Prefoldin may function by selectively targeting nascent actin and tubulin chains pending their transfer to cytosolic chaperonin for final folding and/or assembly. It promotes folding in an environment in which there are many competing pathways for nonnative proteins. This gene encodes the prefoldin beta subunit 4.
4659	Isocitrate dehydrogenases catalyze the oxidative decarboxylation of isocitrate to 2-oxoglutarate. These enzymes belong to two distinct subclasses, one of which utilizes NAD(+) as the electron acceptor and the other NADP(+). Five isocitrate dehydrogenases have been reported: three NAD(+)-dependent isocitrate dehydrogenases, which localize to the mitochondrial matrix, and two NADP(+)-dependent isocitrate dehydrogenases, one of which is mitochondrial and the other predominantly cytosolic. NAD(+)-dependent isocitrate dehydrogenases catalyze the allosterically regulated rate-limiting step of the tricarboxylic acid cycle. Each isozyme is a heterotetramer that is composed of two alpha subunits, one beta subunit, and one gamma subunit. The protein encoded by this gene is the beta subunit of one isozyme of NAD(+)-dependent isocitrate dehydrogenase. Three alternatively spliced transcript variants encoding different isoforms have been described for this gene.
4660	
4661	This gene encodes a protein that contains a Ras association domain. Similar to its cattle and sheep counterparts, this gene is located near the prion gene. The specific function of this gene has not yet been determined. Three alternatively spliced transcript variants of this gene encoding two distinct isoforms have been reported.
4662	
4663	FLRT1, FLRT2 and FLRT3 are members of the fibronectin leucine rich transmembrane protein (FLRT) family. They may function in cell adhesion and/or receptor signalling. Their protein structures resemble small leucine-rich proteoglycans found in the extracellular matrix. FLRT3 shares 55% amino acid sequence identity with FLRT1 and 44% identity with FLRT2. FLRT3 is expressed in kidney, brain, pancreas, skeletal muscle, lung, liver, placenta, and heart. Two alternatively spliced transcript variants encoding the same protein have been described for this gene.
4664	
4665	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has three repeats of quasi-RRM domains that bind to RNAs. This protein also constitutes a monomer of the N-acetylglucosamine-specific receptor which is postulated to trigger selective recycling of immature GlcNAc-bearing thyroglobulin molecules. Multiple alternatively spliced transcript variants are known for this gene but only two transcripts has been isolated.
4666	
4667	
4668	This gene encodes one of three GA-binding protein transcription factor subunits which functions as a DNA-binding subunit.  Since this subunit shares identity with a subunit encoding the nuclear respiratory factor 2 gene, it is likely involved in activation of cytochrome oxidase expression and nuclear control of mitochondrial function.  This subunit also shares identity with a subunit constituting the transcription factor E4TF1, responsible for expression of the adenovirus E4 gene.  Because of its chromosomal localization and ability to form heterodimers with other polypeptides, this gene may play a role in the Down Syndrome phenotype.
4669	
4670	The protein encoded by this gene interacts with calcineurin A and inhibits calcineurin-dependent signaling pathways, possibly affecting central nervous system development. This gene is located in the minimal candidate region for the Down syndrome phenotype, and is overexpressed in the brain of Down syndrome fetuses. Chronic overexpression of this gene may lead to neurofibrillary tangles such as those associated with Alzheimer disease. Three transcript variants encoding three different isoforms have been found for this gene.
4671	
4672	
4673	
4674	
4675	The protein encoded by this gene is involved in the transsulfuration pathway. The first step of this pathway, from homocysteine to cystathionine, is catalyzed by this protein. CBS deficiency can cause homocystinuria which affects many organs and tissues, including the eyes and the skeletal, vascular and central nervous systems.
4676	
4677	The protein encoded by this gene forms a complex with two other proteins, NPL4 and VCP, that is necessary for the degradation of ubiquitinated proteins. In addition, this complex controls the disassembly of the mitotic spindle and the formation of a closed nuclear envelope after mitosis. Mutations in this gene have been associated with Catch 22 syndrome as well as cardiac and craniofacial defects.
4678	
4679	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes (E1s), ubiquitin-conjugating enzymes (E2s) and ubiquitin-protein ligases (E3s). This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme is demonstrated to participate in the ubiquitination of p53, c-Fos, and the NF-kB precursor p105 in vitro. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
4680	
4681	The VPREB3 gene product is the human homologue of the mouse VpreB3 (8HS20) protein, and is specifically expressed in cell lines representative of all stages of B-cell differentiation.  It is also related to VPREB1 and other members of the immunoglobulin supergene family.  The VPREB3 protein associates with membrane mu heavy chains early in the course of pre-B cell receptor biosynthesis.  The precise function of VPREB3 is not known, but it may contribute to mu chain transport in pre-B cells.
4682	Gamma-glutamyltransferase-like activity 1 (GGTLA1) is a member of a gene family with at least 4 members (GGTLA1, GGTLA2, GGTLA3 and GGTLA4). The enzyme encoded by GGTLA1 is related to, but distinct from, gamma-glutamyl transpeptidase (GGT). The GGTLA1 enzyme consists of a heavy and a light chain and is able to hydrolyze the gamma-glutamyl moiety of glutathione. It converts leukotriene C4 to leukotriene D4, however, it doesn@#$%&t convert synthetic substrates that are commonly used to assay GGT. Its amino acid sequence shows an overall similarity of 39.5% with human GGT.
4683	
4684	Oncostatin M is a member of a cytokine family that includes leukemia-inhibitory factor, granulocyte colony-stimulating factor, and interleukin 6. This gene encodes a growth regulator which inhibits the proliferation of a number of tumor cell lines. It regulates cytokine production, including IL-6, G-CSF and GM-CSF from endothelial cells.
4685	
4686	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 99% identical to the mouse, rat and bovine orthologs. This gene contains a 7 bp repeat sequence in its 5@#$%& UTR, and changes in the number of this repeat has been associated with early-onset schizophrenia.
4687	This gene is a member of the apolipoprotein L gene family. The encoded protein is found in the cytoplasm, where it may affect the movement of lipids or allow the binding of lipids to organelles.
4688	The galectins are a family of beta-galactoside-binding proteins implicated in modulating cell-cell and cell-matrix interactions. LGALS1 may act as an autocrine negative growth factor that regulates cell proliferation.
4689	
4690	
4691	This gene encodes a ubiquitously expressed transcription factor that controls cholesterol homeostasis by stimulating transcription of sterol-regulated genes. The encoded protein contains a basic helix-loop-helix-leucine zipper (bHLH-Zip) domain.
4692	
4693	
4694	
4695	This gene encodes a protein of unknown function. The protein contains a bromodomain, a sequence motif often found in transcriptional coactivators, and localizes to the nucleus in testis and several other cell types.
4696	
4697	
4698	
4699	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this DEAD box protein family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a member of this family. The encoded protein has 97% sequence identity with the mouse HELG protein. Alternative splicing of this gene generates 3 transcript variants.
4700	
4701	
4702	Expression of this gene is affected by vitamin A. The encoded protein of this gene may be associated with the cytoskeleton. A similar protein in rats may play a role in the regulation of cell differentiation. The rat protein binds and inhibits the cell membrane glutamate transporter EAAC1. The expression of the rat gene is upregulated by retinoic acid, which results in a specific reduction in EAAC1-mediated glutamate transport.
4703	The specific function of the adaptor protein encoded by this gene is not known; however, it is speculated that it may recruit kinases AKT2 and PIK3 to the cell membrane.
4704	
4705	
4706	Angiomotin is a protein that binds angiostatin, a circulating inhibitor of the formation of new blood vessels (angiogenesis). Angiomotin mediates angiostatin inhibition of endothelial cell migration and tube formation in vitro. The protein encoded by this gene is related to angiomotin and is a member of the motins protein family.
4707	Synovial sarcomas occur most frequently in the extremities around large joints. More than 90% of cases have a recurrent and specific chromosomal translocation, t(X;18)(p11.2;q11.2), in which the 5-prime end of the SS18 gene (MIM 600192) is fused in-frame to the 3-prime end of the SSX1 (MIM 312820), SSX2 (MIM 300192), or SSX4 (MIM 300326) gene. The SS18L2 gene is homologous to SS18.[supplied by OMIM]
4708	
4709	
4710	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
4711	
4712	
4713	
4714	
4715	
4716	
4717	Decapping is a key step in general and regulated mRNA decay. The protein encoded by this gene is a decapping enzyme. This protein and another decapping enzyme form a decapping complex, which interacts with the nonsense-mediated decay factor hUpf1 and may be recruited to mRNAs containing premature termination codons. This protein also participates in the TGF-beta signaling pathway.
4718	
4719	With transferrin (MIM 190000) and melanoma tumor antigen p97 (MIM 155750), lactotransferrin belongs to a family of iron-binding proteins that modulate iron metabolism, hemopoiesis, and immunologic reactions. They are evolutionary products of gene duplication, and all 3 are encoded by genes on 3q.[supplied by OMIM]
4720	
4721	
4722	
4723	Carbamoyl phosphate synthetase I (EC 6.3.4.16) is the rate-limiting enzyme that catalyzes the first committed step of the hepatic urea cycle. The mitochondrial isozyme is designated CPS I and the cytoplasmic enzyme CPS II. CPS II is part of a multifunctional enzyme, called the CAD trifunctional protein of pyrimidine biosynthesis (CAD; MIM 114010), which has been mapped to 2p21.[supplied by OMIM]
4724	
4725	Ephrin receptors and their ligands, the ephrins, mediate numerous developmental processes, particularly in the nervous system. Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. The Eph family of receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. Ephrin receptors make up the largest subgroup of the receptor tyrosine kinase (RTK) family. The protein encoded by this gene is a receptor for ephrin-B family members.
4726	
4727	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes a protein which is not a structural subunit, but may be involved in the recruitment of copper to mitochondria for incorporation into the COX apoenzyme. This protein shares 92% amino acid sequence identity with mouse and rat Cox17 proteins. This gene is no longer considered to be a candidate gene for COX deficiency. A pseudogene COX17P has been found on chromosome 13.
4728	
4729	
4730	
4731	The protein encoded by this gene converts UDP-glucose to UDP-glucuronate and thereby participates in the biosynthesis of glycosaminoglycans such as hyaluronan, chondroitin sulfate, and heparan sulfate. These glycosylated compounds are common components of the extracellular matrix and likely play roles in signal transduction, cell migration, and cancer growth and metastasis. The expression of this gene is up-regulated by transforming growth factor beta and down-regulated by hypoxia.
4732	The protein encoded by this gene converts UDP-glucose to UDP-glucuronate and thereby participates in the biosynthesis of glycosaminoglycans such as hyaluronan, chondroitin sulfate, and heparan sulfate. These glycosylated compounds are common components of the extracellular matrix and likely play roles in signal transduction, cell migration, and cancer growth and metastasis. The expression of this gene is up-regulated by transforming growth factor beta and down-regulated by hypoxia.
4733	
4734	This gene encodes a transcriptional repressor which represses neuronal genes in non-neuronal tissues. It is a member of the Kruppel-type zinc finger transcription factor family. It represses transcription by binding a DNA sequence element called the neuron-restrictive silencer element. The protein is also found in undifferentiated neuronal progenitor cells, and it is thought that this repressor may act as a master negative regular of neurogenesis. Alternatively spliced transcript variants have been described; however, their full length nature has not been determined.
4735	This gene encodes an enzyme that hydrolyzes the teminal alpha-L-iduronic acid residues of two glycosaminoglycans, dermatan sulfate and heparan sulfate. This hydrolysis is required for the lysosomal degradation of these glycosaminoglycans. Mutations in this gene that result in enzymatic deficiency lead to the autosomal recessive disease Mucopolysaccharidosis Type I (MPS I).
4736	The protein encoded by this gene is a member of the platelet-derived growth factor/vascular endothelial growth factor (PDGF/VEGF) family, is active in angiogenesis and endothelial cell growth, and can also affect the permeability of blood vessels.  This secreted protein undergoes a complex proteolytic maturation, generating multiple processed forms which bind and activate VEGFR-3 receptors. Only the fully processed form can bind and activate VEGFR-2 receptors. This protein is structurally and functionally similar to vascular endothelial growth factor D.
4737	GABA is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA-A receptors, which are ligand-gated chloride channels. Chloride conductance of these channels can be modulated by agents such as benzodiazepines that bind to the GABA-A receptor. At least 16 distinct subunits of GABA-A receptors have been identified.
4738	This gene encodes a protein similar to the mouse hedgehog-interacting protein, a regulatory component of the hedgehog signalling pathway. Members of the hedgehog family are evolutionarily conserved proteins which are involved in many fundamental processes in embryonic development, including anteroposterior patterns of limbs and regulation of left-right asymmetry.
4739	Scrapie-responsive gene 1 is associated with neurodegenerative changes observed in transmissible spongiform encephalopathies. It may play a role in host response to prion-associated infections. The scrapie responsive protein 1 may be partly included in the membrane or secreted by the cells due to its hydrophobic N-terminus.
4740	
4741	The protein encoded by this gene is a microsomal enzyme that catalyzes the esterification of all-trans-retinol into all-trans-retinyl ester, an essential reaction for the retinoid cycle in visual system and vitamin A status in liver. Mutations in this gene have been associated with early-onset severe retinal dystrophy.
4742	Centromere protein C 1 is a centromere autoantigen and a component of the inner kinetochore plate. The protein is required for maintaining proper kinetochore size and a timely transition to anaphase. A putative pseudogene exists on chromosome 12.
4743	
4744	This gene encodes the enzyme dihydropteridine reductase, which catalyzes the NADH-mediated reduction of quinonoid dihydrobiopterin.  This enzyme is an essential component of the pterin-dependent aromatic amino acid hydroxylating systems. Mutations in this gene resulting in QDPR deficiency include aberrant splicing, amino acid substitutions, insertions, or premature terminations.  Dihydropteridine reductase deficiency presents as atypical phenylketonuria due to insufficient production of biopterin, a cofactor for phenylalanine hydroxylase.
4745	
4746	Short-chain L-3-hydroxyacyl-CoA dehydrogenase (SCHAD; EC 1.1.1.35) plays an essential role in the mitochondrial beta-oxidation of short chain fatty acids. It catalyzes the reversible dehydrogenation of 3-hydroxyacyl-CoAs to their corresponding 3-ketoacyl-CoAs with concomitant reduction of NAD to NADH and exerts it highest activity toward 3-hydroxybutyryl-CoA.[supplied by OMIM]
4747	
4748	
4749	
4750	
4751	
4752	
4753	
4754	
4755	The protein encoded by this gene contains a RING finger motif and acts as a transcription regulator. This protein has been shown to interact with, and inhibit the activity of, TRPS1, a transcription suppressor of GATA-mediated transcription. Transcription repressor ZNF278/PATZ is found to interact with this protein, and thus reduce the enhancement of androgen receptor-dependent transcription mediated by this protein. Studies of the mouse and rat counterparts suggested a role of this protein in spermatogenesis.
4756	The protein encoded by this gene contains a RING finger motif and acts as a transcription regulator. This protein has been shown to interact with, and inhibit the activity of, TRPS1, a transcription suppressor of GATA-mediated transcription. Transcription repressor ZNF278/PATZ is found to interact with this protein, and thus reduce the enhancement of androgen receptor-dependent transcription mediated by this protein. Studies of the mouse and rat counterparts suggested a role of this protein in spermatogenesis.
4757	This gene encodes a member of the polycystin protein family. The encoded protein contains multiple transmembrane domains, and cytoplasmic N- and C-termini. The protein may be an integral membrane protein involved in cell-cell/matrix interactions. The encoded protein may function in renal tubular development, morphology, and function, and may modulate intracellular calcium homoeostasis and other signal transduction pathways. This protein interacts with polycystin 1 to produce cation-permeable currents. Mutations in this gene have been associated with autosomal dominant polycystic kidney disease.
4758	CD38 is a novel multifunctional ectoenzyme widely expressed in cells and tissues especially in leukocytes. CD38 also functions in cell adhesion,signal transduction and calcium signaling.
4759	This gene encodes a GTP-binding protein that belongs to the small GTPase superfamily and Ras family of proteins. GTP-binding proteins mediate the transmembrane signaling initiated by the occupancy of certain cell surface receptors.
4760	
4761	This gene encodes a member of the RIEG/PITX homeobox family, which is in the bicoid class of homeodomain proteins. This protein acts as a transcription factor and regulates procollagen lysyl hydroxylase gene expression. Mutations in this gene are associated with Axenfeld-Rieger syndrome (ARS), iridogoniodysgenesis syndrome (IGDS), and sporadic cases of Peters anomaly. This protein plays a role in the terminal differentiation of somatotroph and lactotroph cell phenotypes. This protein is involved in the development of the eye, tooth and abdominal organs. This protein acts as a transcriptional regulator involved in basal and hormone-regulated activity of prolactin. A similar protein in other vertebrates is involved in the determination of left-right asymmetry during development. Three transcript variants encoding distinct isoforms have been identified for this gene.
4762	This gene encodes a phosphoinositide binding protein containing ARF-GAP, RHO-GAP, RAS-associating, and pleckstrin homology domains. The ARF-GAP and RHO-GAP domains cooperate in mediating rearrangements in the cell cytoskeleton and cell shape. It is a specific PtdIns(3,4,5)P3/PtdIns(3,4)P2-stimulated Arf6-GAP protein. An alternatively spliced transcript has been found for this gene, but its full-length nature has not been determined.
4763	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
4764	The protein encoded by this gene is a subunit of the CCR4-NOT core transcriptional regulation complex. The encoded protein has a 3@#$%&-5@#$%& RNase activity and prefers polyadenylated substates.
4765	Mannosyl (alpha-1,6-)-glycoprotein beta-1,6-N-acetyl-glucosaminyltransferase protein is a glycosyltransferase involved in the synthesis of protein-bound and lipid-bound oligosaccharides.  Alterations of the oligosaccharides on cell surface glycoproteins cause significant changes in the adhesive or migratory behavior of a cell.  Increase in GMAT5 enzyme activity may correlate with the progression of invasive malignancies.
4766	
4767	Aminopeptidases play a role in the metabolism of several peptides that may be involved in blood pressure and the pathogenesis of essential hypertension (MIM 145500). Adipocyte-derived leucine aminopeptidase (ALAP) is a member of the M1 family of zinc metallopeptidases.[supplied by OMIM]
4768	
4769	CART appears to have roles in reward, feeding, and stress, and it has the functional properties of an endogenous psychostimulant (Kuhar et al., 2002).[supplied by OMIM]
4770	Ephrin-A5, a member of the ephrin gene family, prevents axon bundling in cocultures of cortical neurons with astrocytes, a model of late stage nervous system development and differentiation. The EPH and EPH-related receptors comprise the largest subfamily of receptor protein-tyrosine kinases and have been implicated in mediating developmental events, particularly in the nervous system. EPH receptors typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin ligands and receptors have been named by the Eph Nomenclature Committee (1997). Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. The Eph family of receptors are similarly divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands.
4771	
4772	
4773	Proteins encoded by the complexin/synaphin gene family are cytosolic proteins that function in synaptic vesicle exocytosis. These proteins bind syntaxin, part of the SNAP receptor. The protein product of this gene binds to the SNAP receptor complex and disrupts it, allowing transmitter release. Two transcript variants encoding the same protein have been found for this gene.
4774	
4775	Dihydrofolate reductase converts dihydrofolate into tetrahydrofolate, a methyl group shuttle required for the de novo synthesis of purines, thymidylic acid, and certain amino acids.  While the functional dihydrofolate reductase gene has been mapped to chromosome 5, four intronless processed pseudogenes have been identified on separate chromosomes.  Dihydrofolate reductase deficiency has been linked to megaloblastic anemia.
4776	
4777	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme is required for post-replicative DNA damage repair. Its protein sequence is 100% identical to the mouse, rat, and rabbit homologs, which indicates that this enzyme is highly conserved in eukaryotic evolution.
4778	
4779	
4780	
4781	This gene encodes a member of the eukaryote-specific amino acid/auxin permease (AAAP) 1 transporter family. The encoded protein functions as a proton-dependent, small amino acid transporter. This gene is clustered with related family members on chromosome 5q33.1.
4782	This gene encodes coagulation factor XII which circulates in blood as a zymogen. This single chain zymogen is converted to a two-chain serine protease with an heavy chain (alpha-factor XIIa) and a light chain. The heavy chain contains two fibronectin-type domains, two epidermal growth factor (EGF)-like domains, a kringle domain and a proline-rich domain, whereas the light chain contains only a catalytic domain. On activation, further cleavages takes place in the heavy chain, resulting in the production of beta-factor XIIa light chain and the alpha-factor XIIa light chain becomes beta-factor XIIa heavy chain. Prekallikrein is cleaved by factor XII to form kallikrein, which then cleaves factor XII first to alpha-factor XIIa and then to beta-factor XIIa. The active factor XIIa participates in the initiation of blood coagulation, fibrinolysis, and the generation of bradykinin and angiotensin. It activates coagulation factors VII and XI. Defects in this gene do not cause any clinical symptoms and the sole effect is that whole-blood clotting time is prolonged.
4783	This gene encodes a highly conserved neurotrophic factor. The recombinant form of this protein was shown to promote the survival and differentiation of dopaminergic neurons in culture, and was able to prevent apoptosis of motor neurons induced by axotomy. The encoded protein is processed to a mature secreted form that exists as a homodimer. The mature form of the protein is a ligand for the product of the RET (rearranged during transfection) protooncogene. In addition to the transcript encoding GDNF, two additional alternative transcripts encoding distinct proteins, referred to as astrocyte-derived trophic factors, have also been described. Mutations in this gene may be associated with Hirschsprung disease.
4784	Cofactor A is one of four proteins (cofactors A, D, E, and C) involved in the pathway leading to correctly folded beta-tubulin from folding intermediates. Cofactors A and D are believed to play a role in capturing and stabilizing beta-tubulin intermediates in a quasi-native confirmation. Cofactor E binds to the cofactor D/beta-tubulin complex; interaction with cofactor C then causes the release of beta-tubulin polypeptides that are committed to the native state.
4785	This gene belongs to the polypeptide N-acetylgalactosaminyltransferase (pp-GalNAc-T) gene family. Polypeptide GalNAc transferases initiate the synthesis of mucin-type oligosaccharides by transferring GalNAc from UDP-GalNAc to the hydroxyl group of either a serine or threonine residue on the polypeptide acceptor. Following expression in insect cells, recombinant GalNAc transferase 10 showed significant GalNAcT activity toward mucin-derived peptides, and it utilized both nonglycosylated and glycosylated peptide substrates. Two transcript variants encoding distinct isoforms have been identified for this gene.
4786	
4787	
4788	
4789	
4790	This gene is a member of the KCNQ potassium channel gene family that is differentially expressed in subregions of the brain and in skeletal muscle. The protein encoded by this gene yields currents that activate slowly with depolarization and can form heteromeric channels with the protein encoded by the KCNQ3 gene. Currents expressed from this protein have voltage dependences and inhibitor sensitivities in common with M-currents. They are also inhibited by M1 muscarinic receptor activation. Three alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the full-length nature of only one has been determined.
4791	
4792	
4793	The protein encoded by this gene is a cytokine that shares sequence similarity with IL17. This cytokine is expressed by activated T cells, and has been shown to stimulate the production of several other cytokines, including IL6, IL8, and CSF2/GM_CSF. This cytokine is also found to inhibit the angiogenesis of endothelial cells and induce endothelial cells to produce IL2, TGFB1/TGFB, and monocyte chemoattractant protein-1.
4794	
4795	DLL1 is a human homolog of the Notch Delta ligand and is a member of the delta/serrate/jagged family.  It plays a role in mediating cell fate decisions during hematopoiesis. It may play a role in cell-to-cell communication.
4796	
4797	The protein encoded by this gene is a substrate for the fibroblast growth factor receptor. It is found in peripheral plasma membrane and functions in linking FGF receptor stimulation to activators of Ras.
4798	
4799	
4800	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3,  and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H2B family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6p22-p21.3.
4801	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3.
4802	
4803	
4804	
4805	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which is a human homologue of yeast SKI2 and may be involved in antiviral activity by blocking translation of poly(A) deficient mRNAs. This gene consists of 28 exons and the gene is located in the class III region of the major histocompatibility complex.
4806	
4807	This gene encodes a member of the palmitoyl-protein thioesterase family. The encoded glycosylated lysosomal protein has palmitoyl-CoA hydrolase activity in vitro, but does not hydrolyze palmitate from cysteine residues in proteins. Three transcript variants encoding different isoforms have been described for this gene, with one of the isoforms being inactive.
4808	This gene encodes a protein that belongs to the inositol phosphokinase (IPK) family. This protein is likely responsible for the conversion of inositol hexakisphosphate (InsP6) to diphosphoinositol pentakisphosphate (InsP7/PP-InsP5). It may also convert 1,3,4,5,6-pentakisphosphate (InsP5) to PP-InsP4.
4809	
4810	
4811	
4812	
4813	
4814	
4815	This gene encodes a protein-disulfide isomerase. Its expression is induced by hypoxia and its role may be to protect hypoxic cells from apoptosis. This gene can be co-transcribed with the upstream gene MU.
4816	Pre-mRNA splicing occurs in two sequential transesterification steps, and the protein encoded by this gene is thought to be involved in pre-mRNA splicing and in signal transduction. This protein belongs to a kinase family that includes serine/arginine-rich protein-specific kinases and cyclin-dependent kinases (CDKs). This protein is regarded as a CDK-like kinase (Clk) with homology to mitogen-activated protein kinases (MAPKs). Sequence analysis identified two transcript variants that encode the same protein.
4817	The protein encoded by this gene, a member of the Wiskott-Aldrich syndrome protein (WASP)-family, plays a critical role downstream of Rac, a Rho-family small GTPase, in regulating the actin cytoskeleton required for membrane ruffling. It is shown to associate with an actin nucleation core Arp2/3 complex while enhancing actin polymerization in vitro. Wiskott-Aldrich syndrome is a disease of the immune system, likely due to defects in regulation of actin cytoskeleton.
4818	TINAG is a basement membrane glycoprotein initially identified as a target of antibodies in some forms of immunologically mediated tubulointerstitial nephritis.[supplied by OMIM]
4819	
4820	The HEY2 gene encodes the homolog of the zebrafish @#$%&gridlock@#$%& gene, which is expressed in lateral plate mesoderm before vessel formation and is involved in the arterial-venous cell fate decision. For background information on hairy/enhancer of split-related proteins, see HEY1 (MIM 602953).[supplied by OMIM]
4821	
4822	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
4823	
4824	Ephrin receptors and their ligands, the ephrins, mediate numerous developmental processes, particularly in the nervous system. Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. The Eph family of receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. Ephrin receptors make up the largest subgroup of the receptor tyrosine kinase (RTK) family. The protein encoded by this gene binds to ephrin-B2 and plays an essential role in vascular development.
4825	Similarity to a related locus suggests that this gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. This subunit exists as a heterodimer with another polymerase subunit; together they form a core subassembly unit of the polymerase. Two similar genes are located downstream of this locus. Alternative splicing of this gene results in three transcript variants encoding different isoforms.
4826	
4827	The product of this gene belongs to the small GTPase superfamily, Ras family of proteins. GTP-binding proteins mediate the transmembrane signaling initiated by the occupancy of certain cell surface receptors. This gene encodes a low molecular mass ras-like GTP-binding protein that shares about 50% similarity with other ras proteins.
4828	
4829	
4830	
4831	
4832	A key step that determines the intensity and duration of monoamine neural signaling is the rate of reuptake of released monoamines. Plasma membrane transporters, such as SLC29A4, are responsible for the reuptake of monoamines into presynaptic neurons (Engel et al., 2004).[supplied by OMIM]
4833	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
4834	
4835	The zona pellucida is an extracellular matrix that surrounds the oocyte and early embryo. It is composed primarily of three or four glycoproteins with various functions during fertilization and preimplantation development. The protein encoded by this gene is a structural component of the zona pellucida and functions in primary binding and induction of the sperm acrosome reaction. The nascent protein contains a N-terminal signal peptide sequence, a conserved ZP domain, a C-terminal consensus furin cleavage site, and a transmembrane domain. It is hypothesized that furin cleavage results in release of the mature protein from the plasma membrane for subsequent incorporation into the zona pellucida matrix. However, the requirement for furin cleavage in this process remains controversial based on mouse studies. A variation in the last exon of this gene has previously served as the basis for an additional ZP3 locus; however, sequence and literature review reveals that there is only one full-length ZP3 locus in the human genome. Another locus encoding a bipartite transcript designated POMZP3 contains a duplication of the last four exons of ZP3, including the above described variation, and maps closely to this gene.
4836	The zona pellucida is an extracellular matrix that surrounds the oocyte and early embryo. It is composed primarily of three or four glycoproteins with various functions during fertilization and preimplantation development. The protein encoded by this gene is a structural component of the zona pellucida and functions in primary binding and induction of the sperm acrosome reaction. The nascent protein contains a N-terminal signal peptide sequence, a conserved ZP domain, a C-terminal consensus furin cleavage site, and a transmembrane domain. It is hypothesized that furin cleavage results in release of the mature protein from the plasma membrane for subsequent incorporation into the zona pellucida matrix. However, the requirement for furin cleavage in this process remains controversial based on mouse studies. A variation in the last exon of this gene has previously served as the basis for an additional ZP3 locus; however, sequence and literature review reveals that there is only one full-length ZP3 locus in the human genome. Another locus encoding a bipartite transcript designated POMZP3 contains a duplication of the last four exons of ZP3, including the above described variation, and maps closely to this gene.
4837	
4838	
4839	
4840	
4841	This gene encodes a member of a family of calcium-regulated actin-binding proteins.  This protein represents a dominant part of the brush border cytoskeleton which functions in the capping, severing, and bundling of actin filaments.  Two mRNAs of 2.7 kb and 3.5 kb have been observed; they result from utilization of alternate poly-adenylation signals present in the terminal exon.
4842	Carnitine octanoyltransferase (EC 2.3.1.137) is a carnitine acyltransferase that catalyzes the reversible transfer of fatty acyl groups between CoA and carnitine. This provides a crucial step in the transport of medium- and long-chain acyl-CoA out of the mammalian peroxisome to the cytosol and mitochondria. See also CRAT (MIM 600184). Van der Leij et al. (2000) reviewed the function, structural features, and phylogenetics of human carnitine acyltransferase genes, including CROT.[supplied by OMIM]
4843	This gene encodes a member of the DEAD box protein family. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The protein encoded by this gene shows ATPase activity in the presence of polynucleotides and associates with nucleoplasmic 65S preribosomal particles. This gene may be involved in ribosome synthesis, most likely during assembly of the large 60S ribosomal subunit.
4844	
4845	
4846	
4847	
4848	
4849	The protein encoded by this gene is a membrane-bound homodimeric enzyme that is localized to the cisternae of the rough endoplasmic reticulum. The enzyme (cofactors iron and ascorbate) catalyzes the hydroxylation of lysyl residues in collagen-like peptides. The resultant hydroxylysyl groups are attachment sites for carbohydrates in collagen and thus are critical for the stability of intermolecular crosslinks. Some patients with Ehlers-Danlos syndrome type VIB have deficiencies in lysyl hydroxylase activity.
4850	
4851	
4852	
4853	
4854	
4855	The protein encoded by this gene contains a C2H2 type of zinc finger, and thus may function as a transcription factor. This gene is located in a region close to ZNF7/KOX4, a gene also encoding a zinc finger protein, on chromosome 8.
4856	
4857	
4858	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10P family of ribosomal proteins. It is located in the cytoplasm. This gene is co-transcribed with the small nucleolar RNA gene U54, which is located in its second intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
4859	The zinc finger protein encoded by this gene is a widely expressed member of the FOG family of transcription factors. The family members modulate the activity of GATA family proteins, which are important regulators of hematopoiesis and cardiogenesis in mammals. It has been demonstrated that the protein can both activate and down-regulate expression of GATA-target genes, suggesting different modulation in different promoter contexts. A related mRNA suggests an alternatively spliced product but this information is not yet fully supported by the sequence.
4860	Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide.  They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid.  They show extensive diversity in tissue distribution and in their subcellular localization.  CA1 is closely linked to CA2 and CA3 genes on chromosome 8, and it encodes a cytosolic protein which is found at the highest level in erythrocytes.  Transcript variants of CA1 utilizing alternative polyA_sites have been described in literature.
4861	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the mitochondrial inner membrane and is involved in the conversion of progesterone to cortisol in the adrenal cortex. Mutations in this gene cause congenital adrenal hyperplasia due to 11-beta-hydroxylase deficiency.
4862	The members of the zinc fingers and homeoboxes gene family are nuclear homodimeric transcriptional repressors that interact with the A subunit of nuclear factor-Y (NF-YA) and contain two C2H2-type zinc fingers and five homeobox DNA-binding domains. This gene encodes member 2 of this gene family. In addition to forming homodimers, this protein heterodimerizes with member 1 of the zinc fingers and homeoboxes family.
4863	This gene encodes a serine/threonine protein kinase. Although this gene product is related to serum- and glucocorticoid-induced protein kinase (SGK), this gene is not induced by serum or glucocorticoids. This gene is induced in response to signals that activate phosphatidylinositol 3-kinase, which is also true for SGK. Two alternate transcripts encoding two different isoforms have been described.
4864	The protein encoded by this gene was initially identified as a molecule linking syndecan-mediated signaling to the cytoskeleton. The syntenin protein contains tandemly repeated PDZ domains that bind the cytoplasmic, C-terminal domains of a variety of transmembrane proteins. This protein may also affect cytoskeletal-membrane organization, cell adhesion, protein trafficking, and the activation of transcription factors. The protein is primarily localized to membrane-associated adherens junctions and focal adhesions but is also found at the endoplasmic reticulum and nucleus. Alternative splicing results in multiple transcript variants encoding different isoforms.
4865	The ADRB3 gene product, beta-3-adrenergic receptor, is located mainly in adipose tissue and is involved in the regulation of lipolysis and thermogenesis.  Beta adrenergic receptors are involved in the epenephrine and norepinephrine-induced activation of adenylate cyclase through the action of G proteins.
4866	
4867	
4868	Defensins form a family of microbicidal and cytotoxic peptides made by neutrophils. Members of the defensin family are highly similar in protein sequence. This gene encodes defensin, beta 103A, which has broad spectrum antimicrobial activity and may play an important role in innate epithelial defense.
4869	The protein encoded by this gene is a member of the TNF-receptor superfamily, and contains an intracelluar death domain. This receptor can be activated by tumor necrosis factor-related apoptosis inducing ligand (TNFSF10/TRAIL/APO-2L), and transduces apoptosis signal. Studies with FADD-deficient mice suggested that FADD, a death domain containing adaptor protein, is required for the apoptosis mediated by this protein.
4870	This gene encodes a member of the Snail family of C2H2-type zinc finger transcription factors. The encoded protein acts as a transcriptional repressor that binds to E-box motifs and is also likely to repress E-cadherin transcription in breast carcinoma. This protein is involved in epithelial-mesenchymal transitions and has antiapoptotic activity. Mutations in this gene may be associated with sporatic cases of neural tube defects.
4871	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that has been called mitochondrial ribosomal protein S35 in the literature.
4872	
4873	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This endoplasmic reticulum membrane protein catalyzes the first reaction in the cholesterol catabolic pathway of extrahepatic tissues, which converts cholesterol to bile acids. This enzyme likely plays a minor role in total bile acid synthesis, but may also be involved in the development of atherosclerosis, neurosteroid metabolism and sex hormone synthesis.
4874	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. This gene encodes a secreted enzyme which breaks down the interstitial collagens, types I, II, and III. The gene is part of a cluster of MMP genes which localize to chromosome 11q22.3.
4875	
4876	
4877	
4878	This gene encodes human pituitary glutaminyl cyclase, which is responsible for the presence of pyroglutamyl residues in many neuroendocrine peptides. The amino acid sequence of this enzyme is 86% identical to that of bovine glutaminyl cyclase.
4879	This gene product is a serine/threonine protein kinase that contains an N-terminal protein kinase domain and a C-terminal proline-rich domain. Its protein kinase domain is most closely related to those of the LIM motif-containing protein kinases (LIMKs). The encoded protein can phosphorylate myelin basic protein and histone in vitro. The testicular germ cell-specific expression and developmental pattern of expression of the mouse gene suggests that this gene plays an important role at and after the meiotic phase of spermatogenesis.
4880	The protein encoded by this gene is a scaffold protein and a regulator for 3 different kinases involved in proinflammatory signaling. This encoded protein can bind NF-kappa-B-inducing kinase (NIK) and IKKs through separate domains and assemble them into an active kinase complex. Mutations in this gene have been associated with familial dysautonomia.
4881	
4882	
4883	Hexose transport into mammalian cells is catalyzed by a family of membrane proteins, including SLC2A6, that contain 12 transmembrane domains and a number of critical conserved residues.[supplied by OMIM]
4884	The protein encoded by this gene is a cytokine that belongs to the tumor necrosis factor (TNF) ligand family. This cytokine is a ligand for TNFRSF8/CD30, which is a cell surface antigen and a marker for Hodgkin lymphoma and related hematologic malignancies. The engagement of this cytokine expressed on B cell surface plays an inhibitory role in modulating Ig class switch. This cytokine was shown to enhance cell proliferation of some lymphoma cell lines, while to induce cell death and reduce cell proliferation of other lymphoma cell lines. The pleiotropic biologic activities of this cytokine on different CD30+ lymphoma cell lines may play a pathophysiologic role in Hodgkin@#$%&s and some non-Hodgkin@#$%&s lymphomas.
4885	NNA1 is a zinc carboxypeptidase that contains nuclear localization signals and an ATP/GTP-binding motif that was initially cloned from regenerating spinal cord neurons of the mouse.[supplied by OMIM]
4886	This gene is found in a cluster with two other members of the gene family, having in common a zinc finger-like DNA-binding motif (DM domain). The DM domain is an ancient, conserved component of the vertebrate sex-determining pathway that is also a key regulator of male development in flies and nematodes. This gene exhibits a gonad-specific and sexually dimorphic expression pattern. Defective testicular development and XY feminization occur when this gene is hemizygous.
4887	The protein encoded by this gene is a member of the TNF receptor associated factor (TRAF) protein family. TRAF proteins associate with, and mediate the signal transduction from members of the TNF receptor superfamily. This protein directly interacts with TNF receptors, and forms a heterodimeric complex with TRAF1. This protein is required for TNF-alpha-mediated activation of MAPK8/JNK and NF-kappaB. The protein complex formed by this protein and TRAF1 interacts with the inhibitor-of-apoptosis proteins (IAPs), and functions as a mediator of the anti-apoptotic signals from TNF receptors. The interaction of this protein with TRADD, a TNF receptor associated apoptotic signal transducer, ensures the recruitment of IAPs for the direct inhibition of caspase activation. BIRC2/c-IAP1, an apoptosis inhibitor possessing ubiquitin ligase activity, can unbiquitinate and induce the degradation of this protein, and thus potentiate TNF-induced apoptosis. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of only one transcript has been determined.
4888	
4889	
4890	This gene encodes a protein that appears to belong to a family of evolutionarily related proteins (DUF78), that may share one or more domains in common. Members of this family are small archaebacterial proteins with no known function. Alternative splicing has been observed at this locus and two variants, both encoding the same protein, have been identified.
4891	Protein kinase CK2 is a ubiquitous and pleiotropic Ser/Thr protein kinase involved in cell growth and transformation. This gene encodes a protein similar to the E2 ubiquitin conjugating enzyme UBC3/CDC34. Studies suggest that CK2-dependent phosphorylation of this ubiquitin-conjugating enzyme functions by regulating beta-TrCP substrate recognition and induces its interaction with beta-TrCP, enhancing beta-catenin degradation.
4892	
4893	
4894	The protein encoded by this gene is a transmembrane (type I) heparan sulfate proteoglycan and is a member of the syndecan proteoglycan family. The syndecans mediate cell binding, cell signaling, and cytoskeletal organization and syndecan receptors are required for internalization of the HIV-1 tat protein. The syndecan-1 protein functions as an integral membrane protein and participates in cell proliferation, cell migration and cell-matrix interactions via its receptor for extracellular matrix proteins. Altered syndecan-1 expression has been detected in several different tumor types. While several transcript variants may exist for this gene, the full-length natures of only two have been described to date. These two represent the major variants of this gene and encode the same protein.
4895	The protein encoded by this gene is a member of the cyclin-dependent protein kinase (CDK) family. CDK family members are highly similar to the gene products of S. cerevisiae cdc28, and S. pombe cdc2, and known as important cell cycle regulators. This kinase was found to be a component of the multiprotein complex TAK/P-TEFb, which is an elongation factor for RNA polymerase II-directed transcription and functions by phosphorylating the C-terminal domain of the largest subunit of RNA polymerase II. This protein forms a complex with and is regulated by its regulatory subunit cyclin T or cyclin K. HIV-1 Tat protein was found to interact with this protein and cyclin T, which suggested a possible involvement of this protein in AIDS.
4896	
4897	This gene encodes a key acute phase plasma protein.  Because of its increase due to acute inflammation, this protein is classified as an acute-phase reactant.  The specific function of this protein has not yet been determined; however, it may be involved in aspects of immunosuppression.
4898	The protein encoded by this gene is a glycoprotein secreted from the pancreas into the digestive tract and from the lactating mammary gland into human milk. The physiological role of this protein is in cholesterol and lipid-soluble vitamin ester hydrolysis and absorption. This encoded protein promotes large chylomicron production in the intestine. Also its presence in plasma suggests its interactions with cholesterol and oxidized lipoproteins to modulate the progression of atherosclerosis. In pancreatic tumoral cells, this encoded protein is thought to be sequestrated within the Golgi compartment and is probably not secreted. This gene contains a variable number of tandem repeat (VNTR) polymorphism in the coding region that may influence the function of the encoded protein.
4899	The protein encoded by this gene is related to nucleoporins, a class of proteins that mediate nucleocytoplasmic transport. This encoded protein binds the Rev activation domain when Rev is assembled onto its RNA target and can significantly enhance Rev activity when overexpressed. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
4900	Multiple receptor subtypes of serotonin neurotransmitters with multiple physiologic functions have been recognized. The 5-HT-2 receptors mediate many of the central and peripheral physiologic functions of serotonin. Cardiovascular effects include contraction of blood vessels and shape changes in platelets; central nervous system effects include neuronal sensitization to tactile stimuli and mediation of hallucinogenic effects of phenylisopropylamine hallucinogens.[supplied by OMIM]
4901	
4902	
4903	
4904	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. The protein may play a role in tumorigenesis. A similar protein in mice is involved in the regulation of the wingless-type MMTV integration site family (Wnt) pathway.
4905	HIPK2 is a conserved serine/threonine nuclear kinase that interacts with homeodomain transcription factors.[supplied by OMIM]
4906	Annexin IV (ANX4) belongs to the annexin family of calcium-dependent phospholipid binding proteins. Although their functions are still not clearly defined, several members of the annexin family have been implicated in membrane-related events along exocytotic and endocytotic pathways.  ANX4 has 45 to 59% identity with other members of its family and shares a similar size and exon-intron organization.  Isolated from human placenta, ANX4 encodes a protein that has possible interactions with ATP, and has in vitro anticoagulant activity and also inhibits phospholipase A2 activity.  ANX4 is almost exclusively expressed in epithelial cells.
4907	
4908	
4909	
4910	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. This gene encodes a protein that binds ephrin-A ligands. Two alternatively spliced transcript variants have been described for this gene.
4911	
4912	
4913	
4914	
4915	Armadillo-like proteins are characterized by a series of armadillo repeats, first defined in the Drosophila @#$%&armadillo@#$%& gene product, that are typically 42 to 45 amino acids in length. These proteins can be divided into subfamilies based on their number of repeats, their overall sequence similarity, and the dispersion of the repeats throughout their sequences. Members of the p120(ctn)/plakophilin subfamily of Armadillo-like proteins, including CTNND1, CTNND2, PKP1, PKP2, PKP4, and ARVCF. PKP4 may be a component of desmosomal plaque and other adhesion plaques and is thought to be involved in regulating junctional plaque organization and cadherin function. Multiple transcript variants have been found for this gene, but the full-length nature of only two of them have been described so far. These two variants encode distinct isoforms.
4916	
4917	
4918	
4919	
4920	The protein encoded by this gene is a neuron-specific protein. The similar protein in pig enhances neuroblastoma cell differentiation in vitro and may be involved in neuronal differentiation in vivo.
4921	
4922	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit.
4923	
4924	
4925	
4926	
4927	
4928	
4929	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins, and kininogens. This gene encodes a stefin that functions as a cysteine protease inhibitor, forming tight complexes with papain and the cathepsins B, H, and L. The protein is one of the precursor proteins of cornified cell envelope in keratinocytes and plays a role in epidermal development and maintenance. Stefins have been proposed as prognostic and diagnostic tools for cancer.
4930	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. The mRNA encodes a cadherin-related neuronal receptor thought to play a role in the establishment and function of specific cell-cell connections in the brain. This family member contains 6 extracellular cadherin domains, a transmembrane domain and a cytoplasmic tail differing from those of the classical cadherins. Alternatively spliced transcripts encode isoforms with unique cytoplasmic domains.
4931	This gene encodes a phosphatase enzyme that is involved in actin polymerization and is found in the trans-Golgi network. Mutations in this gene cause oculocerebrorenal syndrome of Lowe and also Dent disease.
4932	The protein encoded by this gene is similar to yeast Sec31 protein. Yeast Sec31 protein is known to be a component of the COPII protein complex which is responsible for vesicle budding from endoplasmic reticulum (ER). This protein was found to colocalize with SEC13, one of the other components of COPII , in the subcellular structures corresponding to the vesicle transport function. An immunodepletion experiment confirmed that this protein is required for ER-Golgi transport. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
4933	
4934	
4935	
4936	The protein encoded by this gene is a gastric aspartyl protease that functions as a disulfide-linked homodimer. This protease, which is a member of the peptidase C1 family, has a specificity similar to that of pepsin A and cathepsin D. It is an intracellular proteinase that does not appear to be involved in the digestion of dietary protein and is found in highest concentration in the surface of epithelial mucus-producing cells of the stomach. It is the first aspartic proteinase expressed in the fetal stomach and is found in more than half of gastric cancers. It appears, therefore, to be an oncofetal antigen. Transcript variants utilizing alternative polyadenylation signals and two transcript variants encoding different isoforms exist for this gene.
4937	This gene encodes a subunit of the heterotetrameric adaptor-related protein comlex 3 (AP-3), which belongs to the adaptor complexes medium subunits family. The AP-3 complex plays a role in protein trafficking to lysosomes and specialized organelles.
4938	
4939	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein which forms homodimers. In prokaryotic ribosomes, two L7/L12 dimers and one L10 protein form the L8 protein complex.
4940	
4941	
4942	
4943	
4944	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and the kininogens. The type 2 cystatin proteins are a class of cysteine proteinase inhibitors found in a variety of human fluids and secretions. The cystatin locus on chromosome 20 contains the majority of the type 2 cystatin genes and pseudogenes. This gene is located in the cystatin locus and encodes a protein found in saliva and tears. The encoded protein may play a protective role against proteinases present in the oral cavity.
4945	
4946	
4947	This gene encodes a protein that contains an N-terminal RING finger motif and a putative coiled-coil domain. A similar murine protein interacts with TNFR-associated factor 1 (TRAF1), TNFR-associated factor 2 (TRAF2), and cylindromatosis. The interaction with TRAF2 inhibits TRAF2-mediated nuclear factor kappa-B, subunit 1 activation that is required for cell activation and protection against apoptosis.
4948	
4949	This gene encodes a protein which is necessary for normal skeletal development and brain function. Mutations in this gene are associated with two types of recessive osteochondrodysplasia, Dyggve-Melchior-Clausen (DMC) dysplasia and Smith-McCort (SMC) dysplasia, which involve both skeletal defects and mental retardation.
4950	The protein encoded by this gene is a member of the interleukin 1 cytokine family. The activity of this cytokine is mediated by interleukin 1 receptor-like 2 (IL1RL2/IL1R-rp2), and is specifically inhibited by interleukin 1 family, member 5 (IL1F5/IL-1 delta). Interferon-gamma, tumor necrosis factor-alpha and interleukin 1, beta (IL1B) are reported to stimulate the expression of this cytokine in keratinocytes. The expression of this cytokine in keratinocytes can also be induced by a contact hypersensitivity reaction or herpes simplex virus infection. This gene and eight other interleukin 1 family genes form a cytokine gene cluster on chromosome 2.
4951	The sialyl Lewis x oligosaccharide determinant is an essential component of leukocyte counterreceptors for E-selectin- (MIM 131210) and P-selectin- (MIM 173610) mediated adhesions of leukocytes. This oligosaccharide molecule is displayed on the surfaces of granulocytes, monocytes, and natural killer cells. Formation of leukocyte adhesions to these selectins is an early and important step in the process that ultimately allows leukocytes to leave the vascular tree and become recruited into lymphoid tissues and sites of inflammation.[supplied by OMIM]
4952	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily H. This member is a pore-forming (alpha) subunit. Several alternatively spliced transcript variants have been identified from this gene, but the full-length nature of only two transcript variants has been determined.
4953	
4954	
4955	
4956	
4957	
4958	This gene encodes a dual-specificity phosphatase that associates with polyribosomes. The encoded protein may be involved in the regulation of glycogen metabolism. Mutations in this gene have been associated with myoclonic epilepsy of Lafora. Alternative splicing results in multiple transcript variants.
4959	
4960	
4961	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily H. This member is a pore-forming (alpha) subunit.
4962	
4963	
4964	
4965	
4966	
4967	
4968	
4969	
4970	
4971	The protein encoded by this gene is a LIM domain protein. LIM domains are cysteine-rich double zinc fingers composed of 50 to 60 amino acids that are involved in protein-protein interactions. LIM domain-containing proteins are scaffolds for the formation of multiprotein complexes. The proteins are involved in cytoskeleton organization, cell lineage specification, organ development, and oncogenesis. The encoded protein is also a member of the Enigma class of proteins, a family of proteins that possess a 100-amino acid PDZ domain in the N terminus and 1 to 3 LIM domains in the C terminus. Multiple transcript variants encoding different isoforms have been found for this gene, although not all of them have been fully characterized.
4972	
4973	This gene encodes a member of the PUF family. The encoded protein contains a sequence-specific RNA binding domain comprised of eight repeats and N- and C-terminal flanking regions, and serves as a translational regulator of specific mRNAs by binding to their 3@#$%& untranslated regions. The evolutionarily conserved function of the encoded protein in invertebrates and lower vertebrates suggests that the human protein may be involved in translational regulation of embryogenesis, and cell development and differentiation. Alternatively spliced transcript variants encoding different isoforms have been described.
4974	
4975	
4976	Interleukin 1 induces synthesis of acute phase and proinflammatory proteins during infection, tissue damage, or stress, by forming a complex at the cell membrane with an interleukin 1 receptor and an accessory protein. This gene encodes an interleukin 1 receptor accessory protein. Alternative splicing of this gene results in two transcript variants encoding two different isoforms, one membrane-bound and one soluble.
4977	The protein encoded by this gene is the human ortholog of bovine and rat dipeptidylpeptidase VI. They are transmembrane proteins and likely members of the dipeptidyl aminopeptidase family. Two transcript variants encoding different isoforms have been identified in all 3 species. In bovine and rat, the longer form has been shown to be expressed almost exclusively in brain, whereas the shorter form has a more widespread expression.
4978	
4979	
4980	
4981	
4982	
4983	
4984	
4985	
4986	Emopamil-binding protein (EBP) is an integral membrane protein of the endoplasmic reticulum. It is a high affinity binding protein for the antiischemic phenylalkylamine Ca2+ antagonist [3H]emopamil and the photoaffinity label [3H]azidopamil. It is similar to sigma receptors and may be a member of a superfamily of high affinity drug-binding proteins in the endoplasmic reticulum of different tissues. EBP shares structural features with bacterial and eukaryontic drug transporting proteins. It has four putative transmembrane segments and contains two conserved glutamate residues which may be involved in the transport of cationic amphiphilics. Another prominent feature of EBP is its high content of aromatic amino acid residues (>23%) in its transmembrane segments. These aromatic amino acid residues have been suggested to be involved in the drug transport by the P-glycoprotein. Mutations in this gene cause Chondrodysplasia punctata 2 (CDPX2; also known as Conradi-Hunermann syndrome).
4987	
4988	EDA-A1 and EDA-A2 are two isoforms of ectodysplasin that are encoded by the anhidrotic ectodermal dysplasia (EDA) gene. Mutations in EDA give rise to a clinical syndrome characterized by loss of hair, sweat glands, and teeth. The protein encoded by this gene specifically binds to EDA-A2 isoform. This protein is a type III transmembrane protein of the TNFR (tumor necrosis factor receptor) superfamily, and contains 3 cysteine-rich repeats and a single transmembrane domain but lacks an N-terminal signal peptide. Multiple alternatively spliced transcript variants have been found for this gene, but some variants lack sufficient support.
4989	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein and is highly similar in sequence to the transmembrane 4 superfamily member 2. The use of alternate polyadenylation sites has been found for this gene.
4990	This gene encodes a member of the kelch family of proteins, which are characterized by kelch repeat motifs and a POZ/BTB protein-binding domain. It is thought that kelch repeats are actin binding domains. However, the specific function of this protein has not been determined. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
4991	
4992	
4993	The proliferation and differentiation of B cells is dependent upon a B-cell antigen receptor (BCR) complex. Binding of antigens to specific B-cell receptors results in a tyrosine phosphorylation reaction through the BCR complex and leads to multiple signal transduction pathways.
4994	The proliferation and differentiation of B cells is dependent upon a B-cell antigen receptor (BCR) complex. Binding of antigens to specific B-cell receptors results in a tyrosine phosphorylation reaction through the BCR complex and leads to multiple signal transduction pathways.
4995	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intra-cellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the MDR/TAP subfamily.  Members of the MDR/TAP subfamily are involved in multidrug resistance as well as antigen presentation.  This gene encodes a half-transporter involved in the transport of heme from the mitochondria to the cytosol.  With iron/sulfur cluster precursors as its substrates, this protein may play a role in metal homeostasis.  Mutations in this gene have been implicated in X-linked sideroblastic anemia with ataxia.
4996	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene encodes a member of the histone H2A family. This gene is part of a region that is repeated three times on chromosome X, once in intron 22 of the F8 gene and twice closer to the Xq telomere. This record represents the most telomeric copy.
4997	Palmitoylated membrane protein 1 is the prototype of a family of membrane-associated proteins termed MAGUKs (membrane-associated guanylate kinase homologs).  MAGUKs interact with the cytoskeleton and regulate cell proliferation, signaling pathways, and intracellular junctions.  Palmitoylated membrane protein 1 contains a conserved sequence, called the SH3 (src homology 3) motif, found in several other proteins that associate with the cytoskeleton and are suspected to play important roles in signal transduction.
4998	
4999	
5000	The protein encoded by this gene is an axonal glycoprotein belonging to the immunoglobulin supergene family. The ectodomain, consisting of several immunoglobulin-like domains and fibronectin-like repeats (type III), is linked via a single transmembrane sequence to a conserved cytoplasmic domain. This cell adhesion molecule plays an important role in nervous system development, including neuronal migration and differentiation. Mutations in the gene cause three X-linked neurological syndromes known by the acronym CRASH (corpus callosum hypoplasia, retardation, aphasia, spastic paraplegia and hydrocephalus). Alternative splicing of a neuron-specific exon is thought to be functionally relevant.
5001	This gene is one member of a family of nuclear RNA export factor genes. Common domain features of this family are a noncanonical RNP-type RNA-binding domain (RBD), 4 leucine-rich repeats (LRRs), a nuclear transport factor 2 (NTF2)-like domain that allows heterodimerization with NTF2-related export protein-1 (NXT1), and a ubiquitin-associated domain that mediates interactions with nucleoporins. The LRRs and NTF2-like domains are required for export activity. Alternative splicing seems to be a common mechanism in this gene family. The encoded protein of this gene has shortened LRR and ubiquitin-associated domains and its RDB is unable to bind RNA. It is located in the nucleoplasm but is not associated with either the nuclear envelope or the nucleolus.
5002	
5003	Protein ubiquitination controls many intracellular processes, including cell cycle progression, transcriptional activation, and signal transduction. It is a dynamic process involving enzymes that add (ubiquitin conjugating enzymes) and those that remove (deubiquitinating enzymes) ubiquitin. The latter are cysteine proteases that specifically cleave ubiquitin from ubiquitin-conjugated protein substrates. This gene encodes a member of the family of deubiquitinating enzymes, and lies in a gene cluster on chromosome Xp11.23. It was originally thought to be a candidate gene for X-linked retinal disorders, however, this has not proven to be true.
5004	This gene is a member of the plant homeodomain (PHD)-like finger (PHF) family. It encodes a protein with two PHD-type zinc finger domains, indicating a potential role in transcriptional regulation, that localizes to the nucleolus. Mutations affecting the coding region of this gene or the splicing of the transcript have been associated with Borjeson-Forssman-Lehmann syndrome (BFLS), a disorder characterized by mental retardation, epilepsy, hypogonadism, hypometabolism, obesity, swelling of subcutaneous tissue of the face, narrow palpebral fissures, and large ears. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
5005	This gene lies within the male specific region of chromosome Y. Its homolog on chromosome X escapes X inactivation and encodes an actin sequestering protein.
5006	There are two copies of this locus in the P3 palindrome on chromosome Y. This record represents the more telomeric copy.
5007	
5008	
5009	This locus shares sequence similarity with heterogeneous nuclear ribonuclear proteins and represents an apparent transcribed pseudogene of the HNRPA3 gene, which maps to chromosome 2.
5010	
5011	
5012	
5013	ANAPC1 is 1 of at least 10 subunits of the anaphase-promoting complex (APC), which functions at the metaphase-to-anaphase transition of the cell cycle and is regulated by spindle checkpoint proteins. The APC is an E3 ubiquitin ligase that targets cell cycle regulatory proteins for degradation by the proteasome, thereby allowing progression through the cell cycle.[supplied by OMIM]
5014	
5015	
5016	
5017	
5018	
5019	The protein encoded by this gene is a mitochondrial calcium-binding protein located in the luminal space of the terminal cisternae of the sarcoplasmic reticulum. The protein binds and putatively stores calcium ions. The protein is absent in patients with Duchenne and Becker types of muscular dystrophy.
5020	
5021	
5022	The protein encoded by this gene is a basic leucine zipper (bZIP) transcription factor that plays an important role in the regulation of lineage-specific hematopoiesis. The encoded nuclear protein represses ETS1-mediated transcription of erythroid-specific genes in myeloid cells. This gene contains no introns.
5023	The ganglioside GD3 synthase causes cell differentiation with neurite sprouting when transfected into the mouse neuroblastoma cell line Neuro2a. After differentiation, the expression of several genes is upregulated, including one that encodes a protein termed ganglioside-induced differentiation-associated protein 1 (Gdap1). A similar gene was found in humans, and mutations in the human gene are associated with Charcot-Marie-Tooth type 4A disease. The protein encoded by this gene is similar in sequence to the human GDAP1 protein.
5024	Proteins that carry a nuclear localization signal (NLS) are transported into the nucleus by the importin-alpha/beta heterodimer. Importin-alpha binds the NLS, while importin-beta mediates translocation through the nuclear pore complex. After translocation, RanGTP binds importin-beta and displaces importin-alpha. Importin-alpha must then be returned to the cytoplasm, leaving the NLS protein behind. The protein encoded by this gene binds strongly to NLS-free importin-alpha, and this binding is released in the cytoplasm by the combined action of RANBP1 and RANGAP1. In addition, the encoded protein may play a role both in apoptosis and in cell proliferation. Multiple transcript variants encoding several different isoforms have been described for this gene.
5025	
5026	Breast carcinoma amplified sequence 1 (BCAS1) was isolated from a region at 20q13 which is amplified in a variety of tumor types and associated with more aggressive tumor phenotypes. Among the genes identified from this region, BCAS1 was found to be highly expressed in three amplified breast cancer cell lines and in one breast tumor without amplification at 20q13.2. However, the BCAS1 gene is not in the common region of maximal amplification and its expression was not detected in the breast cancer cell line MCF7, in which this region is highly amplified. Although not consistently expressed, BCAS1 is a candidate oncogene. It is predicted to encode a protein of 584 amino acids with no significant homology to other proteins.
5027	
5028	
5029	
5030	The protein encoded by this gene is a member of the signal-regulatory-protein (SIRP) family, and also belongs to the immunoglobulin superfamily. SIRP family members are receptor-type transmembrane glycoproteins known to be involved in the negative regulation of receptor tyrosine kinase-coupled signaling processes. This protein was found to interact with TYROBP/DAP12, a protein bearing immunoreceptor tyrosine-based activation motifs. This protein was also reported to participate in the recruitment of tyrosine kinase SYK.
5031	The protein encoded by this gene is a member of the M14 family of zinc carboxypeptidases; however, the protein has no detectable carboxypeptidase activity. The encoded protein is thought to be an extracellular and/or membrane protein, and may be involved in cell-cell interactions.
5032	
5033	
5034	
5035	The protein encoded by this gene is similar in sequence to histidyl-tRNA synthetase, which hydrolyzes D-tyrosyl-tRNA(Tyr) into D-tyrosine and free tRNA(Tyr). The encoded protein is found in the cytoplasm.
5036	
5037	
5038	The protein encoded by this gene acts as a heterodimer with kinesin family member 3A to aid in chromosome movement during mitosis and meiosis. The encoded protein is a plus end-directed microtubule motor and can interact with the SMC3 subunit of the cohesin complex. In addition, the encoded protein may be involved in the intracellular movement of membranous organelles. This protein and kinesin family member 3A form the kinesin II subfamily of the kinesin superfamily.
5039	
5040	
5041	S-adenosylhomocysteine hydrolase catalyzes the reversible hydrolysis of S-adenosylhomocysteine (AdoHcy) to adenosine (Ado) and L-homocysteine (Hcy). Thus, it regulates the intracellular S-adenosylhomocysteine (SAH) concentration thought to be important for transmethylation reactions. Deficiency in this protein is one of the different causes of hypermethioninemia. S-adenosylhomocysteine hydrolase belongs to the adenosylhomocysteinase family.
5042	
5043	This gene is a classical cadherin from the cadherin superfamily. The encoded protein is a calcium-dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. Based on studies in chicken and mouse, this cadherin is thought to play an important role during brain segmentation and neuronal outgrowth. In addition, a role in kidney and muscle development is indicated. Of particular interest are studies showing stable cis-heterodimers of cadherins 2 and 4 in cotransfected cell lines. Previously thought to interact in an exclusively homophilic manner, this is the first evidence of cadherin heterodimerization.
5044	The protein encoded by this gene is a member of the heat shock protein 70 family and is found associated with microsomes. Members of this protein family play a role in the processing of cytosolic and secretory proteins, as well as in the removal of denatured or incorrectly-folded proteins. The encoded protein contains an ATPase domain and has been shown to associate with a ubiquitin-like protein.
5045	
5046	
5047	
5048	The protein encoded by this gene is a type I membrane protein that forms one of the two chains of a receptor for interferons alpha and beta. Binding and activation of the receptor stimulates Janus protein kinases, which in turn phosphorylate several proteins, including STAT1 and STAT2. The encoded protein also functions as an antiviral factor.
5049	Potassium channels are present in most mammalian cells, where they participate in a wide range of physiologic responses. The protein encoded by this gene is an integral membrane protein and inward-rectifier type potassium channel. The encoded protein has a greater tendency to allow potassium to flow into a cell rather than out of a cell. Three transcript variants encoding the same protein have been found for this gene.
5050	In mouse, the interferon-inducible Mx protein is responsible for a specific antiviral state against influenza virus infection. The protein encoded by this gene is similar to the mouse protein as determined by its antigenic relatedness, induction conditions, physicochemical properties, and amino acid analysis. This cytoplasmic protein is a member of both the dynamin family and the family of large GTPases.
5051	CpG methylation is an epigenetic modification that is important for embryonic development, imprinting, and X-chromosome inactivation. Studies in mice have demonstrated that DNA methylation is required for mammalian development. This gene encodes a nuclear protein with similarity to DNA methyltransferases. This protein is not thought to function as a DNA methyltransferase as it does not contain the amino acid residues necessary for methyltransferase activity. However, this protein does stimulate de novo methylation by DNA cytosine methyltransferase 3 alpha and it is thought to be required for the establishment of maternal genomic imprints. This protein also mediates transcriptional repression through interaction with histone deacetylase 1. Alternative splicing results in two transcript variants. An additional splice variant has been described but its biological validity has not been determined.
5052	SLC37A1, a member of the sugar-phosphate transport family, transports glycerol-3-phosphate (G3P) between cellular compartments for its utilization in several compartment-specific biochemical pathways.[supplied by OMIM]
5053	Interleukin 17 is known to induce nuclear factor kappa-B and enhance expression of interleukin 6, intercellular adhesion molecule-1, granulocyte macrophage colony-stimulating factor, and prostaglandin E2. In addition, interleukin 17 induces the maturation of CD34-positive hematopoietic precursors into neutrophils. The protein encoded by this gene is a type I membrane glycoprotein which binds with low affinity to interleukin 17, suggesting that additional components are involved in interleukin 17-induced signaling.
5054	This gene is a member of the septin gene family which encodes nucleotide binding proteins, originally described in yeast as cell division cycle regulatory proteins. Septins are highly conserved in yeast, Drosophila and mouse, and appear to regulate cytoskeletal organization. Disruption of septin function disturbs cytokinesis and results in large multinucleate or polyploid cells. This gene is mapped to 22q11, the region frequently deleted in DiGeorge and velocardiofacial syndromes, and a translocation involving MLL gene and this gene has also been reported in patients with acute myeloid leukemia. Two transcripts, a major one of 2.2 kb and a minor one of 3.5 kb, have been observed in the gene region. The 2.2 kb form which encodes this septin (369 amino acids) protein, results from the utilization of a non-consensus polyA_signal (AACAAT). In the absence of polyadenylation from this imperfect site, this gene uses the consensus polyA_signal within its 3@#$%& neighbour gene (GP1BB; platelet glycoprotein Ib), resulting in the 3.5 kb read-through transcript. A transcript variant with a different 5@#$%& end arising by alternative splicing, and possibly accompanied by alternative promoter usage, has also been identified.
5055	
5056	The protein encoded by this gene is an enzyme that catalyzes the first step in the dolichol-linked oligosaccharide pathway for glycoprotein biosynthesis. This enzyme belongs to the glycosyltransferase family 4. This protein is an integral membrane protein of the endoplasmic reticulum. The congenital disorder of glycosylation type Ij is caused by mutation in the gene encoding this enzyme. Alternatively spliced transcript variants encoding different isoforms have been identified.
5057	D-dopachrome tautomerase converts D-dopachrome into 5,6-dihydroxyindole. The DDT gene is related to the migration inhibitory factor (MIF) in terms of sequence, enzyme activity, and gene structure. DDT and MIF are closely linked on chromosome 22.
5058	
5059	The protein encoded by this gene is found in the cytoplasm, where it catalyzes the transfer of phosphatidylinositol and phosphatidylcholine between membranes.
5060	This gene encodes a cytosolic protein which belongs to a family of lipid-binding proteins including Sec14p, alpha-tocopherol transfer protein, and cellular retinol-binding protein. The encoded protein stimulates squalene monooxygenase which is a downstream enzyme in the cholesterol biosynthetic pathway.
5061	
5062	
5063	
5064	Glucose transporters are integral membrane proteins that mediate the transport of glucose and structurally-related substances across cellular membranes. Two families of glucose transporter have been identified: the facilitated-diffusion glucose transporter family (GLUT family), also known as @#$%&uniporters,@#$%& and the sodium-dependent glucose transporter family (SGLT family), also known as @#$%&cotransporters@#$%& or @#$%&symporters@#$%& (Wright et al., 1994). The SLC5A1 gene encodes a protein that is involved in the active transport of glucose and galactose into eukaryotic and some prokaryotic cells.[supplied by OMIM]
5065	
5066	The protein encoded by this gene is a high affinity calcium ion-binding protein that is structurally and functionally similar to calmodulin and troponin C. The encoded protein is thought to be involved in muscle relaxation.
5067	
5068	
5069	
5070	The protein encoded by this gene is a type I membrane receptor that activates cytokine gene promoters such as the IL-13 and TNF-alpha promoters. The encoded protein contains an immunoreceptor tyrosine-based activation motif (ITAM) and is thought to regulate the signaling and development of B-cells.
5071	The protein encoded by this gene is a triacylglycerol lipase that mediates triacylglycerol hydrolysis in adipocytes. The encoded protein, which appears to be membrane bound, may be involved in the balance of energy usage/storage in adipocytes.
5072	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains 12 tandem leucine-rich repeats.
5073	This protein belongs to the basic helix-loop-helix family of transcription factors. It is a transcriptional repressor of genes that require a bHLH protein for their transcription. The protein has a particular type of basic domain that contains a helix interrupting protein that binds to the N-box rather than the canonical E-box.
5074	Three different forms of human pancreatic procarboxypeptidase A have been isolated. The A3 form, is obtained as a binary complex of a procarboxypeptidase A with a proproteinase E. Mast cell carboxypeptidase A is a secretory granule metalloexopeptidase that has a pH optimum in the neutral to basic range. It resembles pancreatic carboxypeptidase A in cleaving COOH-terminal aromatic and aliphatic amino acid residues.
5075	
5076	
5077	This gene was identified as a retinoid acid (RA) receptor-responsive gene. It encodes a type 1 membrane protein. The expression of this gene is upregulated by tazarotene as well as by retinoic acid receptors. The expression of this gene is found to be downregulated in prostate cancer, which is caused by the methylation of its promoter and CpG island. Alternatively spliced transcript variant encoding distinct isoforms have been observed.
5078	
5079	
5080	This gene encodes a member of a family of neuronal cell surface proteins. Members of this family may act as splice site-specific ligands for beta-neurexins and may be involved in the formation and remodeling of central nervous system synapses.
5081	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein. A pseudogene corresponding to this gene is found on chromosome 4.
5082	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L32E family of ribosomal proteins. It is located in the cytoplasm. Although some studies have mapped this gene to 3q13.3-q21, it is believed to map to 3p25-p24. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome. Alternatively spliced transcript variants encoding the same protein isoform have been found for this gene.
5083	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
5084	The elongation of primed DNA templates by DNA polymerase delta and DNA polymerase epsilon requires the accessory proteins proliferating cell nuclear antigen (PCNA) and replication factor C (RFC). RFC, also named activator 1, is a protein complex consisting of five distinct subunits of 140, 40, 38, 37, and 36 kD. This gene encodes the 37 kD subunit. This subunit forms a core complex with the 36 and 40 kDa subunits. The core complex possesses DNA-dependent ATPase activity, which was found to be stimulated by PCNA in an in vitro system. Alternatively spliced transcript variants encoding the same protein have been reported.
5085	
5086	PAX6 encodes paired box gene 6, one of many human homologues of the Drosophila melanogaster gene prd. In addition to the hallmark feature of this gene family, a conserved paired box domain, PAX6 also contains a homeo box domain. Both domains are known to bind DNA, and function as regulators of gene transcription. PAX6 is expressed in the developing nervous system, and in developing eyes. Mutations in PAX6 are known to cause aniridia as well as Peter@#$%&s anomoly, both ocular diseases.
5087	G protein-coupled receptor 5 is a chemokine receptor belonging to the G protein-coupled receptor superfamily. They are characterized by the presence of 7 transmembrane domains and numerous conserved amino acids. This receptor is most closely related to RBS11 and the MIP1-alpha/RANTES receptor. The viral macrophage inflammatory protein -II is an antagonist of this receptor and blocks signaling.
5088	
5089	
5090	
5091	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L14E family of ribosomal proteins. It contains a basic region-leucine zipper (bZIP)-like domain. The protein is located in the cytoplasm. This gene contains a trinucleotide (GCT) repeat tract whose length is highly polymorphic; these triplet repeats result in a stretch of alanine residues in the encoded protein. Transcript variants utilizing alternative polyA signals exist. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
5092	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L14E family of ribosomal proteins. It contains a basic region-leucine zipper (bZIP)-like domain. The protein is located in the cytoplasm. This gene contains a trinucleotide (GCT) repeat tract whose length is highly polymorphic; these triplet repeats result in a stretch of alanine residues in the encoded protein. Transcript variants utilizing alternative polyA signals exist. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
5093	The protein encoded by this gene has high sequence similarity to counterparts in mouse, rat and Drosophila. The function of this protein has not been identified. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
5094	
5095	This gene encodes a protein, originally identified in a premyeloid cell line, with similarity to proteins that participate in apoptosis. Three alternative transcripts encoding the same protein, differing only in their 5@#$%& UTRs, have been identified for this gene.
5096	The protein encoded by this gene is a member of the glucuronyltransferase gene family, enzymes that exhibit strict acceptor specificity, recognizing nonreducing terminal sugars and their anomeric linkages. This gene product functions as the key enzyme in a glucuronyl transfer reaction during the biosynthesis of the carbohydrate epitope HNK-1 (CD57). Alternative splicing of this gene results in two transcript variants which differ in the 5@#$%& and 3@#$%& UTRs.
5097	This gene encodes an enzyme that is a member of the beta-1,3-N-acetylglucosaminyltransferase family. This enzyme is a type II membrane protein. It exhibits strong activity to transfer GlcNAc to glycolipid substrates and is identified as the most likely candidate for lactotriaosylceramide synthase. This enzyme is essential for the expression of Lewis X epitopes on glycolipids.
5098	
5099	The protein encoded by this gene belongs to the protein kinase C (PKC) family of serine/threonine protein kinases. The PKC family comprises at least eight members, which are differentially expressed and are involved in a wide variety of cellular processes. This protein kinase is calcium-independent and phospholipid-dependent. It is not activated by phorbolesters or diacylglycerol. This kinase can be recruited to vesicle tubular clusters (VTCs) by direct interaction with the small GTPase RAB2, where this kinase phosphorylates glyceraldehydes-3-phosphate dehydrogenase (GAPD/GAPDH) and plays a role in microtubule dynamics in the early secretory pathway. This kinase is found to be necessary for BCL-ABL-mediated resistance to drug-induced apoptosis and therefore protects leukemia cells against drug-induced apoptosis. There is a single exon pseudogene mapped on chromosome X.
5100	This gene encodes a protein that is highly similar to the rat CaM-KII inhibitory protein, an inhibitor of calcium/calmodulin-dependent protein kinase II (CAMKII). CAMKII regulates numerous physiological functions, including neuronal synaptic plasticity through the phosphorylation of alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid-type glutamate (AMPA) receptors. Studies of the similar protein in rat suggest that this protein may function as a negative regulator of CaM-KII and may act to inhibit the phosphorylation of AMPA receptors.
5101	
5102	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein that belongs to the L3P ribosomal protein family. A pseudogene corresponding to this gene is found on chromosome 13q.
5103	The protein encoded by this gene is a transcriptional coactivator that regulates the genes involved in energy metabolism. This protein interacts with PPARgamma, which permits the interaction of this protein with multiple transcription factors. This protein can interact with, and regulate the activities of, cAMP response element binding protein (CREB) and nuclear respiratory factors (NRFs). It provides a direct link between external physiological stimuli and the regulation of mitochondrial biogenesis, and is a major factor that regulates muscle fiber type determination. This protein may be also involved in controlling blood pressure, regulating cellular cholesterol homoeostasis, and the development of obesity.
5104	The function of this gene has not been specifically defined; however, it is thought to be involved in T cell trafficking. This gene has been localized to 4q21 with INP10, which is also a member of the chemokine family of cytokines.
5105	The protein encoded by this gene is the large subunit of replication factor C, which is a five subunit DNA polymerase accessory protein. Replication factor C is a DNA-dependent ATPase that is required for eukaryotic DNA replication and repair. The protein acts as an activator of DNA polymerases, binds to the 3@#$%& end of primers, and promotes coordinated synthesis of both strands. It also may have a role in telomere stability.
5106	
5107	This gene product mediates the attachment of erythroblasts to macrophages. This attachment promotes terminal maturation and enucleation of erythroblasts, presumably by suppressing apoptosis. This protein is an integral membrane protein with the N-terminus on the extracellular side and the C-terminus on the cytoplasmic side of the cell. Two immunologically related isoforms of erythroblast macrophage protein with apparent molecular weights of 33 kD and 36 kD were detected in macrophage membranes; this gene encodes the larger isoform. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants has not been determined.
5108	
5109	The protein encoded by this gene is a member of the IRS1-like multisubstrate docking protein family. The encoded protein is an important mediator of branching tubulogenesis and plays a central role in cellular growth response, transformation and apoptosis. Two transcript variants encoding different isoforms have been found for this gene.
5110	INPP4B encodes the inositol polyphosphate 4-phosphatase type II, one of the enzymes involved in phosphatidylinositol signaling pathways. This enzyme removes the phosphate group at position 4 of the inositol ring from inositol 3,4-bisphosphate. There is limited data to suggest that the human type II enzyme is subject to alternative splicing, as has been established for the type I enzyme.
5111	
5112	
5113	
5114	
5115	
5116	
5117	
5118	
5119	
5120	Cytosolic beta-glucosidase (EC 3.2.1.21) is a predominantly liver enzyme that efficiently hydrolyzes beta-D-glucoside and beta-D-galactoside but not any known physiologic beta-glycoside, suggesting that it may be involved in the detoxification of plant glycosides.[supplied by OMIM]
5121	Calmegin is a testis-specific endoplasmic reticulum chaperone protein.  CLGN may play a role in spermatogeneisis and infertility.
5122	
5123	
5124	The membrane-associated protein encoded by this gene is included in the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the White subfamily. Alternatively referred to as a breast cancer resistance protein, this protein functions as a xenobiotic transporter which may play a major role in multi-drug resistance. It likely serves as a cellular defense mechanism in response to mitoxantrone and anthracycline exposure. Significant expression of this protein has been observed in the placenta, which may suggest a potential role for this molecule in placenta tissue.
5125	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
5126	
5127	The protein encoded by this gene is a DNA polymerase that is likely involved in DNA repair. In addition, the encoded protein may be required for sister chromatid adhesion.
5128	This gene encodes the small subunit of  3-methylcrotonyl-CoA carboxylase. This enzyme functions as a heterodimer and catalyzes the carboxylation of 3-methylcrotonyl-CoA to form 3-methylglutaconyl-CoA. Mutations in this gene are associated with 3-Methylcrotonylglycinuria, an  autosomal recessive disorder of leucine catabolism.
5129	
5130	CD4 (MIM 186940)-positive T helper lymphocytes can be divided into types 1 (Th1) and 2 (Th2) on the basis of their cytokine secretion patterns. Th1 cells and their associated cytokines are involved in cell-mediated immunity to intracellular pathogens and delayed-type hypersensitivity reactions, whereas Th2 cells are involved in the control of extracellular helminthic infections and the promotion of atopic and allergic diseases. The 2 types of cells also cross-regulate the functions of the other. TIM3 is a Th1-specific cell surface protein that regulates macrophage activation and enhances the severity of experimental autoimmune encephalomyelitis in mice.[supplied by OMIM]
5131	This gene encodes a protein which is a member of family 38 of the glycosyl hydrolases. The protein is located in the Golgi and catalyzes the final hydrolytic step in the asparagine-linked oligosaccharide (N-glycan) maturation pathway. Mutations in the mouse homolog of this gene have been shown to cause a systemic autoimmune disease similar to human systemic lupus erythematosus.
5132	This gene encodes a component of prolyl 4-hydroxylase, a key enzyme in collagen synthesis composed of two identical alpha subunits and two beta subunits. The encoded protein is one of several different types of alpha subunits and provides the major part of the catalytic site of the active enzyme. In collagen and related proteins, prolyl 4-hydroxylase catalyzes the formation of 4-hydroxyproline that is essential to the proper three-dimensional folding of newly synthesized procollagen chains. Alternatively spliced transcript variants encoding different isoforms have been described.
5133	
5134	
5135	Synuclein alpha interacting protein contains several protein-protein interaction domains and interacts with alpha synuclein in neurons. Mutations of SNCAIP have been linked to Parkinson disease.
5136	
5137	
5138	The gamma-aminobutyric acid (GABA) A receptor is a multisubunit chloride channel that mediates the fastest inhibitory synaptic transmission in the central nervous system. This gene encodes GABA A receptor, beta 2 subunit. It is mapped to chromosome 5q34 in a cluster comprised of genes encoding alpha 1 and gamma 2 subunits of the GABA A receptor. Alternative splicing of this gene generates 2 transcript variants, differing by a 114 bp insertion.
5139	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are produced by RNA polymerase II and they form a component of the heterogeneous nuclear RNA (hnRNA) complexes. They are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It acts as the catalytic component of the multiprotein editosome complex which mediates apolipoprotein B mRNA editing. Two alternatively spliced transcript variants has been described for this gene which encode different isoforms.
5140	
5141	This gene belongs to the polypeptide N-acetylgalactosaminyltransferase (pp-GalNAc-T) gene family. Polypeptide GalNAc transferases initiate the synthesis of mucin-type oligosaccharides by transferring GalNAc from UDP-GalNAc to the hydroxyl group of either a serine or threonine residue on the polypeptide acceptor. Following expression in insect cells, recombinant GalNAc transferase 10 showed significant GalNAcT activity toward mucin-derived peptides, and it utilized both nonglycosylated and glycosylated peptide substrates. Two transcript variants encoding distinct isoforms have been identified for this gene.
5142	GSPT2 is closely related to GSPT1 (MIM 139259), a GTP-binding protein that plays an essential role at the G1- to S-phase transition of the cell cycle in yeast and human cells. GSPT1 is a positive regulator of translational accuracy and, in a binary complex with eRF1 (MIM 600285), functions as a polypeptide chain release factor.[supplied by OMIM]
5143	
5144	G protein-coupled receptors (GPCRs) are the largest superfamily of related proteins. All have 7-transmembrane (TM) segments and allow cells to sample and respond to their environment. VLGR1 is a large, calcium-binding GPCR expressed in the central nervous system (CNS).[supplied by OMIM]
5145	MaxiK channels are large conductance, voltage and calcium-sensitive potassium channels which are fundamental to the control of smooth muscle tone and neuronal excitability. MaxiK channels can be formed by 2 subunits: the pore-forming alpha subunit and the product of this gene, the modulatory beta subunit. Intracellular calcium regulates the physical association between the alpha and beta subunits.
5146	
5147	
5148	
5149	Lysophospholipases are enzymes that act on biological membranes to regulate the multifunctional lysophospholipids. The protein encoded by this gene hydrolyzes lysophosphatidylcholine in both monomeric and micellar forms. The use of alternate polyadenylation sites has been found for this gene. There are alternatively spliced transcript variants described for this gene but the full length nature is not known yet.
5150	
5151	
5152	The protein encoded by this gene is a tumor-rejection antigen. This antigen possesses tumor epitopes capable of inducing HLA-A24-restricted and tumor-specific cytotoxic T lymphocytes in cancer patients and may be useful for specific immunotherapy. This gene product is localized to the endoplasmic reticulum.
5153	
5154	
5155	
5156	
5157	
5158	
5159	
5160	
5161	
5162	
5163	
5164	This gene encodes a specific inhibitor of protein phosphatase-1 (PP1) with a differential sensitivity toward the metal-independent and metal-dependent forms of PP1. The gene is located within the major histocompatibility complex class I region on chromosome 6. Alternative splicing results in two transcript variants encoding different isoforms. Other alternatively spliced transcripts have been described, but their full length sequences have not been determined.
5165	The MDC1 protein contains a forkhead homology-associated (FHA) domain and 2 BRCA1 (MIM 113705) C-terminal motifs (BRCTs) and is required for the intra-S-phase DNA damage checkpoint.[supplied by OMIM]
5166	This gene encodes the acidic form of complement factor 4, part of the classical activation pathway. The protein is expressed as a single chain precursor which is proteolytically cleaved into a trimer of alpha, beta, and gamma chains prior to secretion. The trimer provides a surface for interaction between the antigen-antibody complex and other complement components. The alpha chain may be cleaved to release C4 anaphylatoxin, a mediator of local inflammation. Deficiency of this protein is associated with systemic lupus erythematosus and type I diabetes mellitus. This gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6. Varying haplotypes of this gene cluster exist, such that individuals may have 1, 2, or 3 copies of this gene.
5167	
5168	This gene belongs to the dimethylarginine dimethylaminohydrolase (DDAH) gene family. The encoded enzyme plays a role in nitric oxide generation by regulating cellular concentrations of methylarginines, which in turn inhibit nitric oxide synthase activity.
5169	
5170	
5171	
5172	This gene encodes a serine/arginine protein kinase specific for the SR (serine/arginine-rich domain) family of splicing factors. The protein localizes to the nucleus and the cytoplasm. It is thought to play a role in regulation of both constitutive and alternative splicing by regulating intracellular localization of splicing factors. A second alternatively spliced transcript variant for this gene has been described, but its full length nature has not been determined.
5173	
5174	
5175	
5176	The cytoplasmic peripheral membrane protein encoded by this gene functions as a protein-tyrosine kinase substrate in microvilli.  As a member of the ERM protein family, this protein serves as an intermediate between the plasma membrane and the actin cytoskeleton.  It plays a key role in cell surface structure adhesion, migration, and organization.
5177	
5178	
5179	Retinoids exert biologic effects such as potent growth inhibitory and cell differentiation activities and are used in the treatment of hyperproliferative dermatological diseases. These effects are mediated by specific nuclear receptor proteins that are members of the steroid and thyroid hormone receptor superfamily of transcriptional regulators. RARRES1, RARRES2, and RARRES3 are genes whose expression is upregulated by the synthetic retinoid tazarotene. RARRES3 is thought act as a tumor suppressor or growth regulator.
5180	The bone morphogenetic proteins (BMPs) are a family of secreted signaling molecules that can induce ectopic bone growth. Many BMPs are part of the transforming growth factor-beta (TGFB) superfamily. BMPs were originally identified by an ability of demineralized bone extract to induce endochondral osteogenesis in vivo in an extraskeletal site. Based on its expression early in embryogenesis, the BMP encoded by this gene has a proposed role in early development. In addition, the fact that this BMP is closely related to BMP5 and BMP7 has lead to speculation of possible bone inductive activity.
5181	The protein encoded by this gene belongs to the glucagon family. It stimulates myocardial contractility, causes vasodilation, increases glycogenolysis, lowers arterial blood pressure and relaxes the smooth muscle of trachea, stomach and gall bladder. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
5182	
5183	
5184	This gene encodes a serine/threonine protein kinase that is highly similar to the rat serum-and glucocorticoid-induced protein kinase (SGK). This gene was identified in a screen of hepatocellular genes regulated in response to cellular hydration or swelling. Cellular hydration is a catabolic signal, stimulating glycogenolysis and proteolysis, and inhibiting protein and glycogen synthesis. This kinase has been shown to be important in activating certain potassium, sodium, and chloride channels. Expression of this gene in hepatocytes is stimulated by transforming growth factor-beta (TGF-beta) which participates in the pathophysiology of diabetic complications. Since both TGF-beta expression and SGK expression are elevated in diabetic nephropathy, this suggests an involvement of SGK in the development of this condition.
5185	
5186	
5187	
5188	This gene encodes a member of the bone morphogenetic protein family which is part of the transforming growth factor-beta superfamily. The superfamily includes large families of growth and differentiation factors. Bone morphogenetic proteins were originally identified by an ability of demineralized bone extract to induce endochondral osteogenesis in vivo in an extraskeletal site. These proteins are synthesized as prepropeptides, cleaved, and then processed into dimeric proteins. This protein may act as an important signaling molecule within the trabecular meshwork and optic nerve head, and may play a potential role in glaucoma pathogenesis. This gene is differentially regulated during the formation of various tumors.
5189	
5190	Polyspecific organic cation transporters in the liver, kidney, intestine, and other organs are critical for elimination of many endogenous small organic cations as well as a wide array of drugs and environmental toxins. This gene is one of three similar cation transporter genes located in a cluster on chromosome 6. The encoded protein contains twelve putative transmembrane domains and is a plasma integral membrane protein.
5191	This gene is a member of the carboxypeptidase A/B subfamily, and it is located in a cluster with three other family members on chromosome 7. Carboxypeptidases are zinc-containing exopeptidases that catalyze the release of carboxy-terminal amino acids, and are synthesized as zymogens that are activated by proteolytic cleavage. This gene could be involved in the histone hyperacetylation pathway. It is imprinted and may be a strong candidate gene for prostate cancer aggressiveness.
5192	
5193	After removal of the precursor signal peptide, proinsulin is post-translationally cleaved into two chains (peptide A and peptide B) that are covalently linked via two disulfide bonds.  Binding of this mature form of insulin to the insulin receptor (INSR) stimulates glucose uptake.  A variety of mutant alleles with changes in the coding region have been identified.
5194	Cell signaling pathways rely on a dynamic interaction between activating and inhibiting processes. SHP-1-mediated dephosphorylation of protein tyrosine residues is central to the regulation of several cell signaling pathways. Two types of inhibitory receptor superfamily members are immunoreceptor tyrosine-based inhibitory motif (ITIM)-bearing receptors and their non-ITIM-bearing, activating counterparts. Control of cell signaling via SHP-1 is thought to occur through a balance between PILRalpha-mediated inhibition and PILRbeta-mediated activation. These paired immunoglobulin-like receptor genes are located in a tandem head-to-tail orientation on chromosome 7. This particular gene encodes the non-ITIM-bearing member of the receptor pair, which has a truncated cytoplasmic tail relative to its ITIM-bearing partner and functions in the activating role. Alternative splicing has been observed at this locus and three variants, encoding two distinct isoforms, are described. Additional transcript variants have been identified but their full-length nature has not been determined.
5195	
5196	Aquaporins are a family of small integral membrane proteins related to the major intrinsic protein (MIP or AQP0). This gene encodes an aquaporin which functions as a molecular water channel protein. It is a homotetramer with 6 bilayer spanning domains and N-glycosylation sites. The protein physically resembles channel proteins and is abundant in erythrocytes and renal tubes. The gene encoding this aquaporin is a possible candidate for disorders involving imbalance in ocular fluid movement. Alternative splicing results in two transcript variants which encode the same protein.
5197	Retention of resident soluble proteins in the lumen of the endoplasmic reticulum (ER) is achieved in both yeast and animal cells by their continual retrieval from the cis-Golgi, or a pre-Golgi compartment.  Sorting of these proteins is dependent on a C-terminal tetrapeptide signal, usually lys-asp-glu-leu (KDEL) in animal cells, and his-asp-glu-leu (HDEL) in S. cerevisiae.  This process is mediated by a receptor that recognizes, and binds the tetrapeptide-containing protein, and returns it to the ER.  In yeast, the sorting receptor encoded by a single gene, ERD2, is a seven-transmembrane protein.  Unlike yeast, several human homologs of the ERD2 gene, constituting the KDEL receptor gene family, have been described. KDELR2 was the second member of the family to be identified, and it encodes a protein which is 83% identical to the KDELR1 gene product.
5198	Retention of resident soluble proteins in the lumen of the endoplasmic reticulum (ER) is achieved in both yeast and animal cells by their continual retrieval from the cis-Golgi, or a pre-Golgi compartment.  Sorting of these proteins is dependent on a C-terminal tetrapeptide signal, usually lys-asp-glu-leu (KDEL) in animal cells, and his-asp-glu-leu (HDEL) in S. cerevisiae.  This process is mediated by a receptor that recognizes, and binds the tetrapeptide-containing protein, and returns it to the ER.  In yeast, the sorting receptor encoded by a single gene, ERD2, is a seven-transmembrane protein.  Unlike yeast, several human homologs of the ERD2 gene, constituting the KDEL receptor gene family, have been described. KDELR2 was the second member of the family to be identified, and it encodes a protein which is 83% identical to the KDELR1 gene product.
5199	The protein encoded by this gene is a member of the Fe65 protein family. It is an adaptor protein localized in the nucleus. It interacts with the Alzheimer@#$%&s disease amyloid precursor protein (APP), transcription factor CP2/LSF/LBP1 and the low-density lipoprotein receptor-related protein. APP functions as a cytosolic anchoring site that can prevent the gene product@#$%&s nuclear translocation. This encoded protein could play an important role in the pathogenesis of Alzheimer@#$%&s disease. It is thought to regulate transcription. Also it is observed to block cell cycle progression by downregulating thymidylate synthase expression. Multiple alternatively spliced transcript variants have been described for this gene but some of their full length sequence is not known.
5200	The protein encoded by this gene is primarily found in the nucleolus, where it can bind to the 5@#$%& UTR of the insulin-like growth factor II leader 3 mRNA and may repress translation of insulin-like growth factor II during late development. The encoded protein contains several KH domains, which are important in RNA binding and are known to be involved in RNA synthesis and metabolism. A pseudogene exists on chromosome 7, and there are putative pseudogenes on other chromosomes.
5201	
5202	This gene encodes a member of the adenylate kinase family, which is involved in regulating the adenine nucleotide composition within a cell by catalyzing the reversible transfer of phosphate groups among adenine nucleotides. This member is related to the UMP/CMP kinase of several species. It is located in the cytosol and expressed exclusively in brain. Alternatively spliced transcript variants encoding distinct isoforms have been identified for this gene.
5203	
5204	L-fucose is a key sugar in glycoproteins and other complex carbohydrates since it may be involved in many of the functional roles of these macromolecules, such as in cell-cell recognition. The fucosyl donor for these fucosylated oligosaccharides is GDP-beta-L-fucose. There are two alternate pathways for the biosynthesis of GDP-fucose; the major pathway converts GDP-alpha-D-mannose to GDP-beta-L-fucose. The protein encoded by this gene participates in an alternate pathway that is present in certain mammalian tissues, such as liver and kidney, and appears to function as a salvage pathway to reutilize L-fucose arising from the turnover of glycoproteins and glycolipids. This pathway involves the phosphorylation of L-fucose to form beta-L-fucose-1-phosphate, and then condensation of the beta-L-fucose-1-phosphate with GTP by fucose-1-phosphate guanylyltransferase to form GDP-beta-L-fucose.
5205	
5206	
5207	This gene encodes a member of the p34Cdc2 protein kinase family. p34Cdc2 kinase family members are known to be essential for eukaryotic cell cycle control. This gene is in close proximity to CDC2L1, a nearly identical gene in the same chromosomal region. The gene loci including this gene, CDC2L1, as well as metalloprotease MMP21/22, consist of two identical, tandemly linked genomic regions, which are thought to be a part of the larger region that has been duplicated. This gene and CDC2L1 were shown to be deleted or altered frequently in neuroblastoma with amplified MYCN genes. The protein kinase encoded by this gene could be cleaved by caspases and was demonstrated to play roles in cell apoptosis. A large number of the alternatively spliced variants of this gene have been reported.
5208	Ephrin receptors and their ligands, the ephrins, mediate numerous developmental processes, particularly in the nervous system. Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. The Eph family of receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. Ephrin receptors make up the largest subgroup of the receptor tyrosine kinase (RTK) family. The protein encoded by this gene is a receptor for ephrin-B family members.
5209	
5210	
5211	Heparan sulfate biosynthetic enzymes are key components in generating a myriad of distinct heparan sulfate fine structures that carry out multiple biologic activities. This gene encodes heparan sulfate 2-O-sulfotransferase, a member of the heparan sulfate biosynthetic enzyme family. This family member transfers sulfate to the 2 position of the iduronic acid residue of heparan sulfate. The disruption of this gene resulted in no kidney formation in knockout embryonic mice, indicating that the absence of this enzyme may interfere with the signaling required for kidney formation.
5212	This gene encodes a member of the CD2 family of cell surface proteins involved in lymphocyte activation. These proteins are characterized by Ig domains. This protein is expressed in lymphoid tissues, and studies of a similar protein in mouse suggest that it may function during B cell lineage commitment. The gene is found in a region of chromosome 1 containing many CD2 genes.
5213	The function of this gene has not yet been determined. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of these variants has not been determined.
5214	This gene is a member of the Regulator of Complement Activation (RCA) gene cluster and encodes a protein with twenty short concensus repeat (SCR) domains. This protein is secreted into the bloodstream and has an essential role in the regulation of complement activation, restricting this innate defense mechanism to microbial infections. Mutations in this gene have been associated with hemolytic-uremic syndrome (HUS) and chronic hypocomplementemic nephropathy. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
5215	
5216	
5217	
5218	Drosophila that have mutations in their mago nashi (grandchildless) gene produce progeny with defects in germplasm assembly and germline development. This gene encodes the mammalian mago nashi homolog. In mammals, mRNA expression is not limited to the germ plasm, but is expressed ubiquitously in adult tissues and can be induced by serum stimulation of quiescent fibroblasts.
5219	
5220	This gene encodes an apolipoprotein E receptor, a member of the low density lipoprotein receptor (LDLR) family. Apolipoprotein E is a small lipophilic plasma protein and a component of lipoproteins such as chylomicron remnants, very low density lipoprotein (VLDL), and high density lipoprotein (HDL). The apolipoprotein E receptor is involved in cellular recognition and internalization of these lipoproteins. Alternative splicing generates multiple transcript variants encoding distinct isoforms for this gene.
5221	
5222	
5223	
5224	
5225	
5226	
5227	
5228	
5229	
5230	
5231	
5232	Adenylate kinases are involved in regulating the adenine nucleotide composition within a cell by catalyzing the reversible transfer of phosphate groups among adenine nucleotides. Three isozymes of adenylate kinase, namely 1, 2, and 3, have been identified in vertebrates; this gene encodes isozyme 2. Expression of these isozymes is tissue-specific and developmentally regulated. Isozyme 2 is localized in the mitochondrial intermembrane space and may play a role in apoptosis. Transcript variants encoding distinct isoforms have been identified for this gene.
5233	
5234	This gene encodes the enzyme responsible for RNA editing by site-specific deamination of adenosines.  This enzyme destabilizes double stranded RNA through conversion of adenosine to inosine.  Alternative splicing of this gene results in several transcript variants, three which are characterized by deletion of exons 6 and/or 7.
5235	
5236	
5237	The Golgi complex plays a key role in the sorting and modification of proteins exported from the endoplasmic reticulum. The protein encoded by this gene is involved in the maintenance of Golgi structure and function through its interaction with the integral membrane protein giantin. It may also be involved in the hormonal regulation of steroid formation.
5238	DISCONTINUED: LocusID 29056 was defined by AF090941.1 and NM_014137.1 which do not appear to represent a protein coding gene.
5239	
5240	This gene encodes a member of the adenylate kinase family of enzymes. The encoded protein is localized to the mitochondrial matrix. Adenylate kinases regulate the adenine and guanine nucleotide compositions within a cell by catalyzing the reversible transfer of phosphate group among these nucleotides. Five isozymes of adenylate kinase have been identified in vertebrates. Expression of these isozymes is tissue-specific and developmentally regulated. A pseudogene for this gene has been located on chromosome 17. Three transcript variants encoding the same protein have been identified for this gene. Sequence alignment suggests that the gene defined by NM_013410, NM_203464, and NM_001005353 is located on chromosome 1.
5241	
5242	This gene encodes a protein containing domains that associate it with the peptidase family C19, also known as family 2 of ubiquitin carboxyl-terminal hydrolases. Family members function as deubiquitinating enzymes, recognizing and hydrolyzing the peptide bond at the C-terminal glycine of ubiquitin. Enzymes in peptidase family C19 are involved in the processing of poly-ubiquitin precursors as well as that of ubiquitinated proteins.
5243	The protein encoded by this gene is a small GTPase of the Rho-subfamily, which regulates signaling pathways that control diverse cellular functions including cell morphology, migration, endocytosis and cell cycle progression. This protein is highly similar to Saccharomyces cerevisiae Cdc 42, and is able to complement the yeast cdc42-1 mutant. The product of oncogene Dbl was reported to specifically catalyze the dissociation of GDP from this protein. This protein could regulate actin polymerization through its direct binding to Neural Wiskott-Aldrich syndrome protein (N-WASP), which subsequently activates Arp2/3 complex. Alternative splicing of this gene results in at least two transcript variants.
5244	Elastases form a subfamily of serine proteases that hydrolyze many proteins in addition to elastin. Humans have six elastase genes which encode the structurally similar proteins elastase 1, 2, 2A, 2B, 3A, and 3B. Unlike other elastases, elastase 3B has little elastolytic activity. Like most of the human elastases, elastase 3B is secreted from the pancreas as a zymogen and, like other serine proteases such as trypsin, chymotrypsin and kallikrein, it has a digestive function in the intestine. Elastase 3B preferentially cleaves proteins after alanine residues. Elastase 3B may also function in the intestinal transport and metabolism of cholesterol. Both elastase 3A and elastase 3B have been referred to as protease E and as elastase 1.
5245	
5246	
5247	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. It is also a member of the NFB42 (neural F Box 42 kDa) family, similar to F-box only protein 2 and F-box only protein 6. Four alternatively spliced transcript variants encoding two distinct isoforms have been found for this gene.
5248	
5249	L-glutamate is the major excitatory neurotransmitter in the central nervous system and activates both ionotropic and metabotropic glutamate receptors. Glutamatergic neurotransmission is involved in most aspects of normal brain function and can be perturbed in many neuropathologic conditions. The metabotropic glutamate receptors are a family of G protein-coupled receptors, that have been divided into 3 groups on the basis of sequence homology, putative signal transduction mechanisms, and pharmacologic properties. Group I includes GRM1 and GRM5 and these receptors have been shown to activate phospholipase C. Group II includes GRM2 and GRM3 while Group III includes GRM4, GRM6, GRM7 and GRM8. Group II and III receptors are linked to the inhibition of the cyclic AMP cascade but differ in their agonist selectivities.
5250	
5251	The protein encoded by this gene belongs to the phosphoinositide 3-kinase (PI3K) family. PI3-kinases play roles in signaling pathways involved in cell proliferation, oncogenic transformation, cell survival, cell migration, and intracellular protein trafficking. This protein contains a lipid kinase catalytic domain as well as a C-terminal C2 domain, a characteristic of class II PI3-kinases. C2 domains act as calcium-dependent phospholipid binding motifs that mediate translocation of proteins to membranes, and may also mediate protein-protein interactions. The PI3-kinase activity of this protein is sensitive to low nanomolar levels of the inhibitor wortmanin. The C2 domain of this protein was shown to bind phospholipids but not Ca2+, which suggests that this enzyme may function in a calcium-independent manner.
5252	
5253	Regulator of G protein signaling (RGS) family members are regulatory molecules that act as GTPase activating proteins (GAPs) for G alpha subunits of heterotrimeric G proteins. RGS proteins are able to deactivate G protein subunits of the Gi alpha, Go alpha and Gq alpha subtypes. They drive G proteins into their inactive GDP-bound forms. Regulator of G protein signaling 4 belongs to this family. All RGS proteins share a conserved 120-amino acid sequence termed the RGS domain. Regulator of G protein signaling 4 protein is 37% identical to RGS1 and 97% identical to rat Rgs4. This protein negatively regulate signaling upstream or at the level of the heterotrimeric G protein and is localized in the cytoplasm.
5254	Dermatopontin is an extracellular matrix protein with possible functions in cell-matrix interactions and matrix assembly. The protein is found in various tissues and many of its tyrosine residues are sulphated. Dermatopontin is postulated to modify the behavior of TGF-beta through interaction with decorin.
5255	
5256	
5257	
5258	
5259	This gene encodes a member of the Notch family. Members of this Type 1 transmembrane protein family share structural characteristics including an extracellular domain consisting of multiple epidermal growth factor-like (EGF) repeats, and an intracellular domain consisting of multiple, different domain types. Notch family members play a role in a variety of developmental processes by controlling cell fate decisions. The Notch signaling network is an evolutionarily conserved intercellular signaling pathway which regulates interactions between physically adjacent cells. In Drosophilia, notch interaction with its cell-bound ligands (delta, serrate) establishes an intercellular signaling pathway that plays a key role in development. Homologues of the notch-ligands have also been identified in human, but precise interactions between these ligands and the human notch homologues remain to be determined. This protein is cleaved in the trans-Golgi network, and presented on the cell surface as a heterodimer. This protein functions as a receptor for membrane bound ligands, and may play a role in vascular, renal and hepatic development.
5260	
5261	
5262	
5263	Opsins, including OPN3, are members of the superfamily of guanine nucleotide-binding protein (G protein)-coupled receptors, which function through the activation of a G protein and an effector enzyme. Opsin proteins consist of a single polypeptide chain of 340 to 500 amino acids that form 7 alpha-helical transmembrane regions connected by cytoplasmic and extracellular loops.[supplied by OMIM]
5264	
5265	This gene belongs to the stathmin family of genes. It encodes a ubiquitous cytosolic phosphoprotein proposed to function as an intracellular relay integrating regulatory signals of the cellular environment. The encoded protein is involved in the regulation of the microtubule filament system by destabilizing microtubules. It prevents assembly and promotes disassembly of microtubules. Alternatively spliced transcript variants encoding the same protein have been identified.
5266	
5267	The protein encoded by this gene belongs to a highly conserved cAMP-regulated phosphoprotein (ARPP) family. This protein was identified as an endogenous ligand for the sulfonylurea receptor, ABCC8/SUR1. ABCC8 is the regulatory subunit of the ATP-sensitive potassium (KATP) channel, which is located on the plasma membrane of pancreatic beta cells and plays a key role in the control of insulin release from pancreatic beta cells. This protein is thought to be an endogenous regulator of KATP channels. In vitro studies have demonstrated that this protein modulates insulin secretion through the interaction with KATP channel, and this gene has been proposed as a candidate gene for type 2 diabetes. At least eight alternatively spliced transcript variants encoding distinct isoforms have been observed.
5268	
5269	MTR encodes the enzyme 5-methyltetrahydrofolate-homocysteine methyltransferase. This enzyme, also known as cobalamin-dependent methionine synthase, catalyzes the final step in methionine biosynthesis. Mutations in MTR have been identified as the underlying cause of methylcobalamin deficiency complementation group G.
5270	MTR encodes the enzyme 5-methyltetrahydrofolate-homocysteine methyltransferase. This enzyme, also known as cobalamin-dependent methionine synthase, catalyzes the final step in methionine biosynthesis. Mutations in MTR have been identified as the underlying cause of methylcobalamin deficiency complementation group G.
5271	The protein encoded by this gene is a member of the peptidyl-prolyl cis-trans isomerase (PPIase) family. PPIases catalyze the cis-trans isomerization of proline imidic peptide bonds in oligopeptides and accelerate the folding of proteins. This protein is a specific component of the complex that includes pre-mRNA processing factors PRPF3, PRPF4, and PRPF18, as well as U4/U5/U6 tri-snRNP. This protein has been shown to possess PPIase activity and may act as a protein chaperone that mediates the interactions between different proteins inside the spliceosome.
5272	
5273	
5274	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem intracytoplasmic catalytic domains, and thus represents a receptor-type PTP. The extracellular region contains three Ig-like domains, and nine non-Ig like domains similar to that of neural-cell adhesion molecule. This PTP was shown to function in the regulation of epithelial cell-cell contacts at adherents junctions, as well as in the control of beta-catenin signaling. An increased expression level of this protein was found in the insulin-responsive tissue of obese, insulin-resistant individuals, and may contribute to the pathogenesis of insulin resistance. Two alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported.
5275	
5276	
5277	The protein encoded by this gene is a pyruvate kinase that catalyzes the production of phohsphoenolpyruvate from pyruvate and ATP. Defects in this enzyme, due to gene mutations or genetic variations, are the common cause of chronic hereditary nonspherocytic hemolytic anemia (CNSHA or HNSHA). Alternatively spliced transcript variants encoding distinct isoforms have been described.
5278	This gene encodes a lysosomal membrane protein that cleaves the beta-glucosidic linkage of glycosylceramide, an intermediate in glycolipid metabolism. Mutations in this gene cause Gaucher disease, a lysosomal storage disease characterized by an accumulation of glucocerebrosides. A related pseudogene is approximately 12 kb downstream of this gene on chromosome 1. Alternative splicing results in multiple transcript variants encoding the same protein.
5279	The protein encoded by this gene belongs to the thrombospondin family. Thrombospondin family members are adhesive glycoproteins that mediate cell-to-cell and cell-to-matrix interactions. This protein forms a pentameric molecule linked by a single disulfide bond. This gene shares a common promoter with metaxin 1.
5280	
5281	
5282	
5283	This tumor suppressor gene is a member of a nuclear histone/protein methyltransferase superfamily. It encodes a zinc finger protein that can bind to retinoblastoma protein, estrogen receptor, and the TPA-responsive element (MTE) of the heme-oxygenase-1 gene. Although the functions of this protein have not been fully characterized, it may (1) play a role in transcriptional regulation during neuronal differentiation and pathogenesis of retinoblastoma, (2) act as a transcriptional activator of the heme-oxygenase-1 gene, and (3) be a specific effector of estrogen action. Three transcript variants encoding different isoforms have been found for this gene.
5284	ADP-ribosylation factor 1 (ARF1) is a member of the human ARF gene family. The family members encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking as activators of phospholipase D. The gene products, including 6 ARF proteins and 11 ARF-like proteins, constitute a family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2 and ARF3), class II (ARF4 and ARF5) and class III (ARF6), and members of each class share a common gene organization. The ARF1 protein is localized to the Golgi apparatus and has a central role in intra-Golgi transport. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene.
5285	
5286	
5287	
5288	
5289	
5290	
5291	This gene encodes a protein tyrosine phosphatase which is expressed primarily in lymphoid tissues. This enzyme associates with the molecular adapter protein CBL and may be involved in regulating CBL function in the T-cell receptor signaling pathway. Alternative splicing of this gene results in two transcript variants encoding distinct isoforms.
5292	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility.
5293	
5294	This gene encodes a CCCH-type zinc finger protein that is thought to prevent infection by retroviruses. Studies of the rat homolog indicate that the protein may primarily function to inhibit viral gene expression and induce an innate immunity to viral infection. Alternative splicing occurs at this locus and two variants, each encoding distinct isoforms, are described.
5295	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. The encoded protein also contains a Ras association (RalGDS/AF-6) domain.
5296	This gene is one member of a family of sulfate/anion transporter genes. Family members are well conserved in their genomic (number and size of exons) and protein (aa length among species) structures yet have markedly different tissue expression patterns. Two splice variants that encode different isoforms have been identified.
5297	The coenzyme NAD and its derivatives are involved in hundreds of metabolic redox reactions and are utilized in protein ADP-ribosylation, histone deacetylation, and in some Ca(2+) signaling pathways. NMNAT (EC 2.7.7.1) is a central enzyme in NAD biosynthesis, catalyzing the condensation of nicotinamide mononucleotide (NMN) or nicotinic acid mononucleotide (NaMN) with the AMP moiety of ATP to form NAD or NaAD (Zhang et al., 2003).[supplied by OMIM]
5298	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes an additional conjugation factor, E4, which is involved in multiubiquitin chain assembly. This gene is also the strongest candidate in the neuroblastoma tumor suppressor genes.
5299	
5300	
5301	The pattern of cellular proliferation and differentiation that leads to normal development of embryonic structures often depends upon the localized production of secreted protein signals. Cells surrounding the source of a particular signal respond in a graded manner according to the effective concentration of the signal, and this response produces the pattern of cell types constituting the mature structure. A novel segment-polarity gene known as dispatched has been identified in Drosophila and its protein product is required for normal Hedgehog (Hh) signaling. This gene is one of two human homologs of Drosophila dispatched and, based on sequence identity to its mouse counterpart, the encoded protein may play an essential role in Hh patterning activities in the early embryo.
5302	
5303	DVL1, the human homolog of the Drosophila dishevelled gene (dsh) encodes a cytoplasmic phosphoprotein that regulates cell proliferation, acting as a transducer molecule for developmental processes, including segmentation and neuroblast specification. DVL1 is a candidate gene for neuroblastomatous transformation. The Schwartz-Jampel syndrome and Charcot-Marie-Tooth disease type 2A have been mapped to the same region as DVL1. The phenotypes of these diseases may be consistent with defects which might be expected from aberrant expression of a DVL gene during development. Three transcript variants encoding three different isoforms have been found for this gene.
5304	
5305	
5306	
5307	This gene belongs to the dimethylarginine dimethylaminohydrolase (DDAH) gene family. The encoded enzyme plays a role in nitric oxide generation by regulating cellular concentrations of methylarginines, which in turn inhibit nitric oxide synthase activity.
5308	
5309	This gene encodes the penultimate enzyme of heme biosynthesis, which catalyzes the 6-electron oxidation of protoporphyrinogen IX to form protoporphyrin IX. This protein is a flavoprotein associated with the outer surface of the inner mitochondrial membrane. Mutations in this gene cause variegate porphyria.
5310	Adenylosuccinate synthetase catalyzes the first committed step in the conversion of IMP to AMP
5311	
5312	This gene is an interferon-stimulated gene (ISG). The type I- (alpha and beta) interferons induce transcription while type II- (gamma) interferon is a poor transcription inducer for this gene. A minisatellite repeat region, of varying lengths, has been found at the end of exon 2 and the beginning of intron 2. Alternative splicing occurs due to the use of alternative splice donor sites provided by the minisatellite sequence. Three alternatively spliced transcript variants have been described. The encoded product is a hydrophobic protein and thought to be present in the membrane. It is postulated to be a precursor protein but its signal peptide region has not been confirmed. The function of the encoded protein has not been determined.
5313	This gene is a member of the Period family of genes and is expressed in a circadian pattern in the suprachiasmatic nucleus, the primary circadian pacemaker in the mammalian brain. Genes in this family encode components of the circadian rhythms of locomotor activity, metabolism, and behavior. Circadian expression in the suprachiasmatic nucleus continues in constant darkness, and a shift in the light/dark cycle evokes a proportional shift of gene expression in the suprachiasmatic nucleus. The specific function of this gene is not yet known.
5314	
5315	This gene encodes a member of the Ser/Thr protein kinase family. This kinase is regulated through direct phosphorylation by p38 MAP kinase. In conjunction with p38 MAP kinase, this kinase is known to be involved in many cellular processes including stress and inflammatory responses, nuclear export, gene expression regulation and cell proliferation. Heat shock protein HSP27 was shown to be one of the substrates of this kinase in vivo. Two transcript variants encoding two different isoforms have been found for this gene.
5316	The protein encoded by this gene is involved in both the synthesis and degradation of fructose-2,6-bisphosphate, a regulatory molecule that controls glycolysis in eukaryotes. The encoded protein has a 6-phosphofructo-2-kinase activity that catalyzes the synthesis of fructose-2,6-bisphosphate, and a fructose-2,6-biphosphatase activity that catalyzes the degradation of fructose-2,6-bisphosphate. This protein regulates fructose-2,6-bisphosphate levels in the heart, while a related enzyme encoded by a different gene regulates fructose-2,6-bisphosphate levels in the liver and muscle. This enzyme functions as a homodimer. Two transcript variants encoding two different isoforms have been found for this gene.
5317	
5318	
5319	This gene is the putative transforming gene of avian sarcoma virus 17. It encodes a protein which is highly similar to the viral protein, and which interacts directly with specific target DNA sequences to regulate gene expression. This gene is intronless and is mapped to 1p32-p31, a chromosomal region involved in both translocations and deletions in human malignancies.
5320	Netrin G1 (NTNG1) belongs to a conserved family of proteins that act as axon guidance cues during vertebrate nervous system development (Nakashiba et al., 2000).[supplied by OMIM]
5321	VAV3, a third representative of the Vav family, participates in signal transduction processes. It is also involved in the activation of several members of the Rho family. This activity, when deregulated, leads to marked cytoskeletal changes and to alterations in the process of cell division.
5322	
5323	The protein encoded by this gene is a member of the flamingo subfamily, part of the cadherin superfamily. The flamingo subfamily consists of nonclassic-type cadherins; a subpopulation that does not interact with catenins. The flamingo cadherins are located at the plasma membrane and have nine cadherin domains, seven epidermal growth factor-like repeats and two laminin A G-type repeats in their ectodomain. They also have seven transmembrane domains, a characteristic unique to this subfamily. It is postulated that these proteins are receptors involved in contact-mediated communication, with cadherin domains acting as homophilic binding regions and the EGF-like domains involved in cell adhesion and receptor-ligand interactions. The specific function of this particular member has not been determined.
5324	This gene encodes a protein that is a multi-ligand type-1 receptor with similarity to the yeast carboxypeptidase Y sorting receptor Vps10 protein. The encoded protein, a trans-Golgi network (TGN) transmembrane protein, binds a number of unrelated ligands that participate in a wide range of cellular processes; however, it lacks the typical features of a signalling receptor. In the TGN, furin mediates the activation of the mature binding form. The encoded protein consists of a large luminal domain, a single transmembrane segment and short C-terminal cytoplasmic tail. The luminal domain contains a cysteine-rich region similar to two corresponding segments in the yeast Vps10p; the cytoplasmic tail is similar to the corresponding segment of the cation-independent mannose 6-phosphate receptor and the tail also interacts with the VHS domains of GGA (Golgi-associated, gamma-adaptin homologous, ARF-interacting) proteins.
5325	
5326	NIMA-related kinases share high amino acid sequence identity with the gene product of the Aspergillus nidulans @#$%&never in mitosis A@#$%& gene, which controls initiation of mitosis.[supplied by OMIM]
5327	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes. They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis. Laminins are composed of 3 non identical chains: laminin alpha, beta and gamma (formerly A, B1, and B2, respectively) and they form a cruciform structure consisting of 3 short arms, each formed by a different chain, and a long arm composed of all 3 chains. Each laminin chain is a multidomain protein encoded by a distinct gene. Several isoforms of each chain have been described. Different alpha, beta and gamma chain isomers combine to give rise to different heterotrimeric laminin isoforms which are designated by Arabic numerals in the order of their discovery, i.e. alpha1beta1gamma1 heterotrimer is laminin 1. The biological functions of the different chains and trimer molecules are largely unknown, but some of the chains have been shown to differ with respect to their tissue distribution, presumably reflecting diverse functions in vivo. This gene encodes the gamma chain isoform laminin, gamma 1. The gamma 1 chain, formerly thought to be a beta chain, contains structural domains similar to beta chains, however, lacks the short alpha region separating domains I and II. The structural organization of this gene also suggested that it had diverged considerably from the beta chain genes. Embryos of transgenic mice in which both alleles of the gamma 1 chain gene were inactivated by homologous recombination, lacked basement membranes, indicating that laminin, gamma 1 chain is necessary for laminin heterotrimer assembly. It has been inferred by analogy with the strikingly similar 3@#$%& UTR sequence in mouse laminin gamma 1 cDNA, that multiple polyadenylation sites are utilized in human to generate the 2 different sized mRNAs (5.5 and 7.5 kb) seen on Northern analysis.
5328	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes.  They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis.  Laminins are composed of 3 non identical chains: laminin alpha, beta and gamma (formerly A, B1, and B2, respectively) and they form a cruciform structure consisting of 3 short arms, each formed by a different chain, and a long arm composed of all 3 chains.  Each laminin chain is a multidomain protein encoded by a distinct gene.  Several isoforms of each chain have been described.  Different alpha, beta and gamma chain isomers combine to give rise to different heterotrimeric laminin isoforms which are designated by Arabic numerals in the order of their discovery, i.e. alpha1beta1gamma1 heterotrimer is laminin 1.  The biological functions of the different chains and trimer molecules are largely unknown, but some of the chains have been shown to differ with respect to their tissue distribution, presumably reflecting diverse functions in vivo. This gene encodes the gamma chain isoform laminin, gamma 2.  The gamma 2 chain, formerly thought to be a truncated version of beta chain (B2t), is highly homologous to the gamma 1 chain; however, it lacks domain VI, and domains V, IV and III are shorter.  It is expressed in several fetal tissues but differently from gamma 1, and is specifically localized to epithelial cells in skin, lung and kidney.  The gamma 2 chain together with alpha 3 and beta 3 chains constitute laminin 5 (earlier known as kalinin), which is an integral part of the anchoring filaments that connect epithelial cells to the underlying basement membrane.  The epithelium-specific expression of the gamma 2 chain implied its role as an epithelium attachment molecule, and mutations in this gene have been associated with junctional epidermolysis bullosa, a skin disease characterized by blisters due to disruption of the epidermal-dermal junction.  Two transcript variants resulting from alternative splicing of the 3@#$%& terminal exon, and encoding different isoforms of gamma 2 chain, have been described.  The two variants are differentially expressed in embryonic tissues, however, the biological significance of the two forms is not known.  Transcript variants utilizing alternative polyA_signal have also been noted in literature.
5329	
5330	
5331	
5332	
5333	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may have a tumor suppressor function. Chromosomal rearrangements and altered expression of this gene have been implicated in breast cancer.
5334	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in stimulation of Ca2+-dependent insulin release, stimulation of prolactin secretion, and exocytosis. Chromosomal rearrangements and altered expression of this gene have been implicated in melanoma.
5335	The protein encoded by this gene is a member of the TNF receptor associated factor (TRAF) protein family. TRAF proteins are associated with, and mediate signal transduction from members of the TNF receptor superfamily. This protein has been found to be one of the components of a multiple protein complex associated with the CD40 cytoplasmic domain, which mediates TNF induced NF-kappa B activation and protection from cell death. This protein has also been implicated in the signaling events mediated by various other receptors including CD27, CD30, and lymphotoxin-beta receptor. Two alternatively spliced transcript variants encoding the same protein have been reported.
5336	Peptidylarginine deiminases (PADs; EC 3.5.3.15), such as PADI2, are a family of posttranslational modification enzymes that convert arginine residues in peptides to citrulline residues in the presence of calcium ion.[supplied by OMIM]
5337	
5338	Dehydrodolichyl diphosphate (dedol-PP) synthase catalyzes cis-prenyl chain elongation to produce the polyprenyl backbone of dolichol, a glycosyl carrier lipid required for the biosynthesis of several classes of glycoproteins.[supplied by OMIM]
5339	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This gene is preferentially expressed in a variety of hematopoietic cells, and is an early response gene in lymphokine stimulated cells. The noncatalytic N-terminus of this PTP can interact with MAP kinases and suppress the MAP kinase activities. This PTP was shown to be involved in the regulation of T cell antigen receptor (TCR) signaling, which was thought to function through dephosphorylating the molecules related to MAP kinase pathway. Three alternatively spliced transcript variants of this gene, which encode two distinct isoforms, are reported.
5340	Myosin light chain phosphatase (MLCP) consists of three subunits- catalytic subunit, large subunit/myosin binding subunit (MBS) and small subunit (sm-M20). This gene is a multi-functional gene which encodes both MBS and sm-M20. MLCP regulates myosins and the dephosphorylation is enhanced by the presence of MBS. The sm-M20 is suggested to play a regulatory role in muscle contraction by binding to MBS. MBS is also encoded by another gene, myosin light chain phosphatase target subunit 1. sm-M20 shows higher binding affinity to this gene product than to myosin light chain phosphatase target subunit 2-MBS even though the two MBS proteins are highly similar. Although both MBSs increase the activity of MLCP, myosin light chain phosphatase target subunit 1-MBS is a more efficient activator. There are four alternatively spliced transcript variants described; two alter the MBS coding region and two alter the sm-M20 coding region of this gene.
5341	The protein encoded by this gene is a member of the phosphatidic acid phosphatase (PAP) family. PAPs convert phosphatidic acid to diacylglycerol, and function in de novo synthesis of glycerolipids as well as in receptor-activated signal transduction mediated by phospholipase D. This protein is a membrane glycoprotein localized at the cell plasma membrane. It has been shown to actively hydrolyze extracellular lysophosphatidic acid and short-chain phosphatidic acid. The expression of this gene is found to be enhanced by epidermal growth factor in Hela cells. Alternatively spliced transcript variants encoding the same protein have been described.
5342	
5343	
5344	The balance between fission and fusion regulates the morphology of mitochondria. TTC11 is a component of a mitochondrial complex that promotes mitochondrial fission (James et al., 2003).[supplied by OMIM]
5345	
5346	
5347	
5348	The protein encoded by this gene is a ubiquitously expressed nuclear protein and belongs to a highly conserved subfamily of WD-repeat proteins. It is found among several proteins that bind directly to retinoblastoma protein, which regulates cell proliferation. The encoded protein interacts preferentially with the underphosphorylated retinoblastoma protein via the E1A-binding pocket B.
5349	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intra-cellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the MDR/TAP subfamily.  Members of the MDR/TAP subfamily are involved in multidrug resistance.  The function of this mitochondrial protein is unknown.
5350	
5351	The leiomodin 1 protein has a putative membrane-spanning region and 2 types of tandemly repeated blocks.  The transcript is expressed in all tissues tested, with the highest levels in thyroid, eye muscle, skeletal muscle, and ovary. Increased expression of leiomodin 1 may be linked to Graves@#$%& disease and thyroid-associated ophthalmopathy.
5352	This gene encodes a protein with sequence similarity to retinoblastoma-binding protein-1. The gene product is recognized by IgG antibody isolated from a breast cancer patient. Transcript variants encoding different isoforms and utilizing alternative polyA sites have been observed.
5353	This gene encodes one of seven subunits of the human Arp2/3 protein complex. This subunit is a member of the SOP2 family of proteins and is most similar to the protein encoded by gene ARPC1A. The similarity between these two proteins suggests that they both may function as p41 subunit of the human Arp2/3 complex that has been implicated in the control of actin polymerization in cells. It is possible that the p41 subunit is involved in assembling and maintaining the structure of the Arp2/3 complex.  Multiple versions of the p41 subunit may adapt the functions of the complex to different cell types or developmental stages.
5354	PARD proteins, which were first identified in C. elegans, are essential for asymmetric cell division and polarized growth, whereas CDC42 (MIM 116952) mediates the establishment of cell polarity. The CDC42 GTPase, which is controlled by nucleotide exchange factors (GEFs; see MIM 606057) and GTPase-activating proteins (GAPs; see MIM 604980), interacts with a large set of effector proteins that typically contain a CDC42/RAC (MIM 602048)-interactive binding (CRIB) domain.[supplied by OMIM]
5355	
5356	
5357	The protein encoded by this gene contains a WW domain, which is a protein module found in a wide range of signaling proteins. This domain mediates protein-protein interactions and binds proteins containing short linear peptide motifs that are proline-rich or contain at least one proline. This gene product shares 94% sequence identity with the WAC protein in mouse, however, its exact function is not known.
5358	
5359	
5360	TCF8 encodes a human zinc finger transcription factor that represses T-lymphocyte-specific IL2 gene (MIM 147680) expression by binding to a negative regulatory domain 100 nucleotides 5-prime of the IL2 transcription start site (Williams et al., 1991).[supplied by OMIM]
5361	
5362	
5363	
5364	The protein encoded by this gene is a nuclear protein that forms intranuclear foci during proliferation and is redistributed in the nucleoplasm during the cell cycle. Short-wave ultraviolet light provokes the relocalization of the protein, suggesting its participation in the cellular response to DNA damage. Originally selected based on protein-binding with RecA antibodies, the mouse protein presents a limited similarity with a functional domain of the bacterial RecA protein, a characteristic shared by this human ortholog.
5365	
5366	
5367	The protein encoded by this gene is a member of the peptidyl-prolyl cis-trans isomerase (PPIase) family. PPIases catalyze the cis-trans isomerization of proline imidic peptide bonds in oligopeptides and accelerate the folding of proteins. This protein is part of the mitochondrial permeability transition pore in the inner mitochondrial membrane. Activation of this pore is thought to be involved in the induction of apoptotic and necrotic cell death.
5368	
5369	
5370	
5371	NFKB has been detected in numerous cell types that express cytokines, chemokines, growth factors, cell adhesion molecules, and some acute phase proteins in health and in various disease states. NFKB is activated by a wide variety of stimuli such as cytokines, oxidant-free radicals, inhaled particles, ultraviolet irradiation, and bacterial or viral products. Inappropriate activation of NF-kappa-B has been linked to inflammatory events associated with autoimmune arthritis, asthma, septic shock, lung fibrosis, glomerulonephritis, atherosclerosis, and AIDS. In contrast, complete and persistent inhibition of NF-kappa-B has been linked directly to apoptosis, inappropriate immune cell development, and delayed cell growth. For reviews, see Chen et al. (1999) and Baldwin (1996).[supplied by OMIM]
5372	
5373	
5374	
5375	
5376	
5377	This gene encodes a mitochondrial transcription factor that is a key activator of mitochondrial transcription as well as a participant in mitochondrial genome replication. Studies in mice have demonstrated that this gene product is required to regulate the mitochondrial genome copy number and is essential for embryonic development. A mouse model for Kearns-Sayre syndrome was produced when expression of this gene was eliminated by targeted disruption in heart and muscle cells.
5378	Ankyrins are a family of proteins that are believed to link the integral membrane proteins to the underlying spectrin-actin cytoskeleton and play key roles in activities such as cell motility, activation, proliferation, contact, and the maintenance of specialized membrane domains. Multiple isoforms of ankyrin with different affinities for various target proteins are expressed in a tissue-specific, developmentally regulated manner. Most ankyrins are typically composed of three structural domains: an amino-terminal domain containing multiple ankyrin repeats; a central region with a highly conserved spectrin binding domain; and a carboxy-terminal regulatory domain which is the least conserved and subject to variation. Ankyrin 3 is an immunologically distinct gene product from ankyrins 1 and 2, and was originally found at the axonal initial segment and nodes of Ranvier of neurons in the central and peripheral nervous systems. Alternatively spliced variants may be expressed in other tissues. Although multiple transcript variants encoding several different isoforms have been found for this gene, the full-length nature of only two have been characterized.
5379	
5380	Vinculin is a cytoskeletal protein associated with cell-cell and cell-matrix junctions, where it is thought to function as one of several interacting proteins involved in anchoring F-actin to the membrane. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of some variants has not been determined. Human vinculin protein exhibits a greater than 95% sequence identity to the chicken vinculin protein.
5381	The protein encoded by this gene is a membrane-associated type I inositol 1,4,5-trisphosphate (InsP3) 5-phosphatase. InsP3 5-phosphatases hydrolyze Ins(1,4,5)P3, which mobilizes intracellular calcium and acts as a second messenger mediating cell responses to various stimulation.
5382	This gene encodes a protein involved in spindle checkpoint function. The encoded protein contains four WD repeat domains and has sequence similarity with the yeast BUB3 protein. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
5383	This gene encodes a disintegrin and metalloprotease (ADAM) domain 12, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving  cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. This gene has 2 alternative splicing transcript forms: a shorter secreted form and a longer membrane-bound form, that diverge at their 3@#$%& ends. The shorter form is found to stimulate myogenesis.
5384	
5385	
5386	This gene encodes a protein that may play a role in organelle biogenesis associated with melanosomes, platelet dense granules, and lysosomes. The encoded protein is a component of three different protein complexes termed biogenesis of lysosome-related organelles complex (BLOC)-3, BLOC4, and BLOC5. Mutations in this gene are associated with Hermansky-Pudlak syndrome type 1. Multiple transcript variants encoding distinct isoforms have been identified for this gene; the full-length sequences of some of these have not been determined yet.
5387	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes a protein which is not a structural subunit, but may be essential for the biogenesis of COX formation and may function in the hydroxylation of heme O, according to the yeast mutant studies. This protein is predicted to contain 5 transmembrane domains localized in the mitochondrial inner membrane. Alternative splicing of this gene generates several transcript variants diverging in the 3@#$%& region including alternate poly A sites. In total, 2 different isoforms are encoded by these variants.
5388	
5389	
5390	
5391	
5392	Adducins are heteromeric proteins composed of different subunits referred to as adducin alpha, beta and gamma. The three subunits are encoded by distinct genes and belong to a family of membrane skeletal proteins involved in the assembly of spectrin-actin network in erythrocytes and at sites of cell-cell contact in epithelial tissues. While adducins alpha and gamma are ubiquitously expressed, the expression of adducin beta is restricted to brain and hematopoietic tissues. Adducin, originally purified from human erythrocytes, was found to be a heterodimer of adducins alpha and beta. Polymorphisms resulting in amino acid substitutions in these two subunits have been associated with the regulation of blood pressure in an animal model of hypertension. Heterodimers consisting of alpha and gamma subunits have also been described. Structurally, each subunit is comprised of two distinct domains. The amino-terminal region is protease resistant and globular in shape, while the carboxy-terminal region is protease sensitive. The latter contains multiple phosphorylation sites for protein kinase C, the binding site for calmodulin, and is required for association with spectrin and actin. Alternatively spliced adducin gamma transcripts encoding different isoforms have been described. The functions of the different isoforms are not known.
5393	
5394	
5395	The protein encoded by this gene is a member of the dual specificity protein kinase family, which acts as a mitogen-activated protein (MAP) kinase kinase. MAP kinases, also known as extracellular signal-regulated kinases (ERKs), act as an integration point for multiple biochemical signals. This protein kinase lies upstream of MAP kinases and stimulates the enzymatic activity of MAP kinases upon wide variety of extra- and intracellular signals. As an essential component of MAP kinase signal transduction pathway, this kinase is involved in many cellular processes such as proliferation, differentiation, transcription regulation and development.
5396	
5397	
5398	This gene encodes a cytoskeletal LIM protein that binds to actin filaments via a domain that is homologous to erythrocyte dematin. LIM domains, found in over 60 proteins, play key roles in the regulation of developmental pathways. LIM domains also function as protein-binding interfaces, mediating specific protein-protein interactions. The protein encoded by this gene could mediate such interactions between actin filaments and cytoplasmic targets. Alternatively spliced transcript variants encoding different isoforms have been identified.
5399	
5400	
5401	
5402	This gene encodes a disintegrin and metalloproteinase with thrombospondin (ADAMTS) motifs-14, which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. This gene is highly similar to another 2 family members, ADAMTS2 and ADAMTS3, in the sequence and the gene structure, and the protein encoded by this gene shares the aminoprocollagen peptidase activity with the protein products encoded by ADAMTS2 and ADAMTS3. Various transcript variants of this gene have been identified. They result from the use of 2 different promoters and transcription initiation sites as well as alternative splicing sites. The full length nature of these transcripts has not been defined, but these transcripts are known to encode several isoforms differring in the N-terminus and/or an internal 3 aa peptide.
5403	
5404	
5405	
5406	This gene is one of several tumor-suppressing subtransferable fragments located in the imprinted gene domain of 11p15.5, an important tumor-suppressor gene region. Alterations in this region have been associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocortical carcinoma, and lung, ovarian, and breast cancer. This gene is located among several imprinted genes; however, this gene, as well as the pan-hematopoietic expression gene (PHEMX), escapes imprinting. This gene may play a role in malignancies and disease that involve this region.
5407	COPII-coated vesicles are involved in protein transport from the endoplasmic reticulum to the Golgi apparatus. The protein encoded by this gene was identified by its interaction with a mouse protein similar to yeast Sec23p, an essential component of the COPII. This protein shares significant similarity with phospholipid-modifying proteins, especially phosphatidic acid preferring-phospholipase A1. Overexpression of this protein has been shown to cause disorganization of the endoplasmic reticulum-Golgi intermediate compartment and Golgi apparatus, which suggests its role in the early secretory pathway.
5408	
5409	Glutathione S-transferases (GSTs) are a family of enzymes that play an important role in detoxification by catalyzing the conjugation of many hydrophobic and electrophilic compounds with reduced glutathione.  Based on their biochemical, immunologic, and structural properties, the soluble GSTs are categorized into 4 main classes: alpha, mu, pi, and theta.  The glutathione S-transferase pi gene (GSTP1) is a polymorphic gene encoding active, functionally different GSTP1 variant proteins that are thought to function in xenobiotic metabolism and play a role in susceptibility to cancer, and other diseases.
5410	
5411	
5412	
5413	This gene is a novel mucin-like gene that is a member of the cadherin superfamily. While encoding nonpolymorphic tandem repeats rich in proline, serine and threonine similar to mucin proteins, the gene also contains sequence encoding calcium-binding motifs found in all cadherins. The role of the hybrid extracellular region and the specific function of this protein have not yet been determined. Alternative splicing has been identified, with observed variation resulting in the presence or absence of domains. In addition, splicing occurs in the 5@#$%& UTR but transcripts including these variations have not been described completely.
5414	BHC80 is a component of a BRAF35 (MIM 605535)/histone deacetylase (HDAC; see MIM 601241) complex (BHC) that mediates repression of neuron-specific genes through the cis-regulatory element known as repressor element-1 (RE1) or neural restrictive silencer (NRS) (Hakimi et al., 2002).[supplied by OMIM]
5415	The function of this gene has not yet been defined, however, the encoded protein shares similarity with the human (41% identical) and mouse (34% identical) jerky gene products. This protein may act as a nuclear regulatory protein.
5416	
5417	
5418	
5419	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. The gene product is an outwardly rectifying channel that is expressed primarily in neural tissues and is stimulated by polyunsaturated fatty acids. Three splice variants have been identified for this gene.
5420	This gene encodes a protein highly similar to Drosophila and mouse proteins. In Drosophila the ovo protein plays a critical role in Drosophila oogenesis and cuticle formation. In mice the ovo like protein is involved in hair formation and spermatogenesis. The function of the human gene product has not been determined.
5421	This gene is overexpressed in breast cancer and squamous cell carcinomas of the head and neck. The encoded protein is localized in the cytoplasm and in areas of the cell-substratum contacts. This gene has two roles: (1) regulating the interactions between components of adherens-type junctions and (2) organizing the cytoskeleton and cell adhesion structures of epithelia and carcinoma cells. During apoptosis, the encoded protein is degraded in a caspase-dependent manner. The aberrant regulation of this gene contributes to tumor cell invasion and metastasis. Two splice variants that encode different isoforms have been identified for this gene.
5422	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance throughout the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin forms a complex with and functions as a regulatory subunit of CDK4 or CDK6, whose activity is required for cell cycle G1/S transition. This protein has been shown to interact with tumor suppressor protein Rb and the expression of this gene is regulated positively by Rb. Mutations, amplification and overexpression of this gene, which alters cell cycle progression, are observed frequently in a variety of tumors and may contribute to tumorigenesis.
5423	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Alternative splicing of this gene generates 2 transcript variants diverging at the 3@#$%& end.
5424	
5425	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This enzyme is a microsomal vitamin D hydroxylase that converts vitamin D into the active ligand for the vitamin D receptor. A mutation in this gene has been associated with selective 25-hydroxyvitamin D deficiency.
5426	This gene is an ortholog of the C. elegans unc-76 gene, which is necessary for normal axonal bundling and elongation within axon bundles. Expression of this gene in C. elegans unc-76 mutants can restore to the mutants partial locomotion and axonal fasciculation, suggesting that it also functions in axonal outgrowth. The N-terminal half of the gene product is highly acidic. Alternatively spliced transcript variants encoding different isoforms of this protein have been described.
5427	
5428	The innate immune system recognizes microbial pathogens through Toll-like receptors (TLRs), which identify pathogen-associated molecular patterns. Different TLRs recognize different pathogen-associated molecular patterns and all TLRs have a Toll-interleukin 1 receptor (TIR) domain, which is responsible for signal transduction. The protein encoded by this gene is a TIR adaptor protein involved in the TLR4 signaling pathway of the immune system. It activates NF-kappa-B, MAPK1, MAPK3 and JNK, which then results in cytokine secretion and the inflammatory response. Alternative splicing of this gene results in several transcript variants; however, not all variants have been fully described.
5429	
5430	
5431	
5432	
5433	PORIMIN (pro-oncosis receptor inducing membrane injury), a member of the cell membrane-associated mucin family, mediates oncosis-like cell death.[supplied by OMIM]
5434	
5435	The protein encoded by this gene is similar to Saccharomyces cerevisiae LYS5, which is required for the activation of the alpha-aminoadipate dehydrogenase in the biosynthetic pathway of lysine. Yeast alpha-aminoadipate dehydrogenase converts alpha-biosynthetic-aminoadipate semialdehyde to alpha-aminoadipate. It has been suggested that defects in the human gene result in pipecolic acidemia.
5436	
5437	
5438	
5439	Dihydrolipoamide acetyltransferase (DLAT; EC 2.3.1.12) is 1 of 6 components of the mammalian pyruvate dehydrogenase complex (PDC) of the inner mitochondrial membrane. Patients with primary biliary cirrhosis (PBC; MIM 109720) show autoantibodies to DLAT.[supplied by OMIM]
5440	
5441	
5442	The protein encoded by this gene belongs to the subtilisin-like proprotein convertase family. The members of this family are proprotein convertases that process latent precursor proteins into their biologically active products. This encoded protein is a calcium-dependent serine endoprotease. It is structurally related to its family members, PACE and PACE4. This protein is concentrated in the trans-Golgi network, associated with the membranes, and is not secreted. It can process proalbumin and is thought to be responsible for the activation of HIV envelope glycoproteins gp160 and gp140. This gene has been implicated in the transcriptional regulation of housekeeping genes. Multiple alternatively spliced transcripts are described for this gene but their full length nature is not yet known. Downstream of this gene@#$%&s map location at 11q23-q24, nucleotides that match part of this gene@#$%&s 3@#$%& end are duplicated and inverted. A translocation breakpoint associated with lymphoma occurs between this gene and its inverted counterpart.
5443	This gene maps in a region, which include the mixed lineage leukemia and Friend leukemia virus integration 1 genes, where multiple disease-associated chromosome translocations occur. It is an intracellular protein. Archain sequences are well conserved among eukaryotes and this protein may play a fundamental role in eukaryotic cell biology. It has similarities to heat shock proteins and clathrin-associated proteins, and may be involved in vesicle structure or trafficking.
5444	Dihydropyrimidinase catalyzes the conversion of 5,6-dihydrouracil to 3-ureidopropionate in pyrimidine metabolism.  Dihydropyrimidinase is expressed at a high level in liver and kidney as a major 2.5-kb transcript and a minor 3.8-kb transcript.  Defects in the DPYS gene are linked to dihydropyrimidinuria.
5445	
5446	
5447	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes an additional conjugation factor, E4, which is involved in multiubiquitin chain assembly.
5448	
5449	Carbonic anhydrase III (CAIII) is a member of a multigene family (at least six separate genes are known) that encode carbonic anhydrase isozymes. These carbonic anhydrases are a class of metalloenzymes that catalyze the reversible hydration of carbon dioxide and are differentially expressed in a number of cell types. The expression of the CA3 gene is strictly tissue specific and present at high levels in skeletal muscle and much lower levels in cardiac and smooth muscle. A proportion of carriers of Duchenne muscle dystrophy have a higher CA3 level than normal. The gene spans 10.3 kb and contains seven exons and six introns.
5450	The protein encoded by this gene is a receptor for interleukin 10. This protein is structurally related to interferon receptors. It has been shown to mediate the immunosuppressive signal of interleukin 10, and thus inhibits the synthesis of proinflammatory cytokines. This receptor is reported to promote survival of progenitor myeloid cells through the insulin receptor substrate-2/PI 3-kinase/AKT pathway. Activation of this receptor leads to tyrosine phosphorylation of JAK1 and TYK2 kinases.
5451	Apolipoprotein C-III is a very low density lipoprotein (VLDL) protein.  APOC3 inhibits lipoprotein lipase and hepatic lipase; it is thought to delay catabolism of triglyceride-rich particles.  The APOA1, APOC3 and APOA4 genes are closely linked in both rat and human genomes.  The A-I and A-IV genes are transcribed from the same strand, while the A-1 and C-III genes are convergently transcribed.  An increase in apoC-III levels induces the development of hypertriglyceridemia.
5452	APOA1 promotes cholesterol efflux from tissues to the liver for excretion. Apolipoprotein A-I is the major protein component of  high density lipoprotein (HDL) in the plasma. Synthesized in the liver and small intestine, it consists of two identical chains of 77 amino acids; an 18-amino acid signal peptide is removed   co-translationally and a 6-amino acid propeptide is cleaved  post-translationally.  Variation in the latter step, in addition to modifications leading to so-called isoforms, is responsible for some of the polymorphism observed.  APOA1 is a cofactor for lecithin cholesterolacyltransferase (LCAT) which is responsible for the formation of most plasma cholesteryl esters.  The APOA1, APOC3 and APOA4 genes are closely linked in both rat and human genomes.   The A-I and A-IV genes are transcribed from the same strand, while the C-III gene is transcribed convergently in relation to A-I.   Defects in the apolipoprotein A-1 gene are associated with HDL deficiency and Tangier disease.
5453	This gene was identified as a gene specifically expressed in Burkitt@#$%&s lymphoma and lymphatic tissues. The protein encoded by this gene is predicted to be a seven transmembrane G protein- coupled receptor and belongs to the CXC chemokine receptor family. BLC, a B-lymphocyte chemoattractant, was identified to be a specific ligand for this receptor. Studies of this gene and its mouse conterpart strongly suggest the essential function of this gene in B cell migration and localization within specific anatomic compartments, such as follicles in lymph nodes as well as in spleen. Two alternatively spliced variants of this gene exist.
5454	
5455	The protein encoded by this gene belongs to the TRIM protein family. It has multiple zinc finger motifs and a leucine zipper motif. It has been proposed to form homo- or heterodimers which are involved in nucleic acid binding. Thus, it may act as a transcriptional regulatory factor involved in carcinogenesis and/or differentiation. It may also function in the suppression of radiosensitivity since it is associated with ataxia telangiectasia phenotype. Multiple alternatively spliced transcript variants have been detected for this gene but the full-length nature of all of them has not been described yet. One of the transcripts of 2.4 kb, appears to be untranslated and probably nonfunctional.
5456	
5457	
5458	Reticulocalbin 1 is a calcium-binding protein located in the lumen of the ER. The protein contains six conserved regions with similarity to a high affinity Ca(+2)-binding motif, the EF-hand. High conservation of amino acid residues outside of these motifs, in comparison to mouse reticulocalbin, is consistent with a possible biochemical function besides that of calcium binding.
5459	
5460	
5461	
5462	
5463	
5464	
5465	
5466	This gene encodes a member of the membrane-spanning 4A gene family. Members of this nascent protein family are characterized by common structural features and similar intron/exon splice boundaries and display unique expression patterns among hematopoietic cells and nonlymphoid tissues. The gene encoding this protein is localized to 11q12.1, among a cluster of family members. Alternative splicing of this gene results in several transcript variants.
5467	
5468	
5469	
5470	
5471	The protein encoded by this gene is a member of the fatty acid desaturase (FADS) gene family. Desaturase enzymes regulate unsaturation of fatty acids through the introduction of double bonds between defined carbons of the fatty acyl chain. FADS family members are considered fusion products composed of an N-terminal cytochrome b5-like domain and a C-terminal multiple membrane-spanning desaturase portion, both of which are characterized by conserved histidine motifs. This gene is clustered with family members FADS1 and FADS2 at 11q12-q13.1; this cluster is thought to have arisen evolutionarily from gene duplication based on its similar exon/intron organization.
5472	This gene encodes a protein that has been identified as a component of NuRD, a nucleosome remodeling deacetylase complex identified in the nucleus of human cells. It shows a very broad expression pattern and is strongly expressed in many tissues. It may represent one member of a small gene family that encode different but related proteins involved either directly or indirectly in transcriptional regulation. Their indirect effects on transcriptional regulation may include chromatin remodeling. It is closely related to another member of this family, a protein that has been correlated with the metastatic potential of certain carcinomas. These two proteins are so closely related that they share the same types of domains. These domains include two DNA binding domains, a dimerization domain, and a domain commonly found in proteins that methylate DNA. One of the proteins known to be a target protein for this gene product is p53. Deacteylation of p53 is correlated with a loss of growth inhibition in transformed cells supporting a connection between these gene family members and metastasis.
5473	The Shaker gene family of Drosophila encodes components of voltage-gated potassium channels and is comprised of four subfamilies. Based on sequence similarity, this gene is similar to one of these subfamilies, namely the Shaw subfamily. The protein encoded by this gene belongs to the delayed rectifier class of channel proteins and is an integral membrane protein that mediates the voltage-dependent potassium ion permeability of excitable membranes.
5474	Transduction of extracellular matrix signals through integrins influences intracellular and extracellular functions, and appears to require interaction of integrin cytoplasmic domains with cellular proteins. Integrin-linked kinase (ILK), interacts with the cytoplasmic domain of beta-1 integrin. This gene encodes a serine/threonine protein kinase with 4 ankyrin-like repeats, which associates with the cytoplasmic domain of beta integrins and acts as a proximal receptor kinase regulating integrin-mediated signal transduction. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene.
5475	The protein encoded by this gene was identified as a binding protein of the protein kinase C, delta (PRKCD). The expression of this gene in cultured cell lines is strongly induced by serum starvation. The expression of this protein was found to be down-regulated in various cancer cell lines, suggesting the possible tumor suppressor function of this protein.
5476	
5477	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein localizes to the cytoplasm and its expression is induced by interferon. The protein down-regulates transcription from the HIV-1 LTR promoter region, suggesting that function of this protein may be to mediate interferon@#$%&s antiviral effects.
5478	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies. Its function has not been identified. Five alternatively spliced transcript variants for this gene have been described. However, the full length nature of two of the variants has not been determined.
5479	The alpha (HBA) and beta (HBB) loci determine the structure of the 2 types of polypeptide chains in adult hemoglobin, Hb A. The normal adult hemoglobin tetramer consists of two alpha chains and two beta chains. Mutant beta globin causes sickle cell anemia. Absence of beta chain causes beta-zero-thalassemia. Reduced amounts of detectable beta globin causes beta-plus-thalassemia. The order of the genes in the beta-globin cluster is 5@#$%&-epsilon -- gamma-G -- gamma-A -- delta -- beta--3@#$%&.
5480	
5481	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L15P family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, multiple processed pseudogenes derived from this gene are dispersed through the genome.
5482	Signal-mediated nuclear import and export proceed through the nuclear pore complex (NPC), which is comprised of approximately 50 unique proteins collectively known as nucleoporins. The 98 kD nucleoporin is generated through a biogenesis pathway that involves synthesis and proteolytic cleavage of a 186 kD precursor protein. This cleavage results in the 98 kD nucleoporin as well as a 96 kD nucleoporin, both of which are localized to the nucleoplasmic side of the NPC. Rat studies show that the 98 kD nucleoporin functions as one of several docking site nucleoporins of transport substrates. The human gene has been shown to fuse to several genes following chromsome translocatons in acute myelogenous leukemia (AML) and T-cell acute lymphocytic leukemia (T-ALL). This gene is one of several genes located in the imprinted gene domain of 11p15.5, an important tumor-suppressor gene region. Alterations in this region have been associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocortical carcinoma, and lung, ovarian, and breast cancer. Alternative splicing of this gene results in several transcript variants; however, not all variants have been fully described.
5483	Mitochondrial uncoupling proteins (UCP) are members of the larger family of mitochondrial anion carrier proteins (MACP). UCPs separate oxidative phosphorylation from ATP synthesis with energy dissipated as heat, also referred to as the mitochondrial proton leak. UCPs facilitate the transfer of anions from the inner to the outer mitochondrial membrane and the return transfer of protons from the outer to the inner mitochondrial membrane. They also reduce the mitochondrial membrane potential in mammalian cells. Tissue specificity occurs for the different UCPs and the exact methods of how UCPs transfer H+/OH- are not known. UCPs contain the three homologous protein domains of MACPs. This gene has tissue-specific transcription initiation with other transcription initiation sites upstream of SM-1 (major skeletal muscle site). Chromosomal order is 5@#$%&-UCP3-UCP2-3@#$%&. Two splice variants have been found for this gene.
5484	
5485	
5486	
5487	
5488	
5489	
5490	
5491	This gene product belongs to a family of glycohydrolytic enzymes which remove sialic acid residues from glycoproteins and glycolipids. It is localized in the plasma membrane, and its activity is specific for gangliosides. It may play a role in modulating the ganglioside content of the lipid bilayer.
5492	
5493	
5494	This gene is a member of the GRB2-associated binding protein (GAB) gene family and is similar to the GAB1 gene. These proteins contain pleckstrin homology (PH) domain, and bind SHP2 tyrosine phosphatase and GRB2 adapter protein. They act as adapters for transmitting various signals in response to stimuli through cytokine and growth factor receptors, and T- and B-cell antigen receptors. The protein encoded by this gene is the principal activator of phosphatidylinositol-3 kinase in response to activation of the high affinity IgE receptor. Two alternatively spliced transcripts encoding different isoforms have been described for this gene.
5495	
5496	
5497	This gene encodes a protein that is highly expressed in osteosarcomas. This protein binds to the hypoxia-inducible factor 1 (HIF-1), a key regulator of the hypoxic response and angiogenesis, and promotes the degradation of one of its subunits. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
5498	
5499	The Golgi apparatus, which participates in glycosylation and transport of proteins and lipids in the secretory pathway, consists of a series of stacked cisternae (flattened membrane sacs). Interactions between the Golgi and microtubules are thought to be important for the reorganization of the Golgi after it fragments during mitosis. The golgins are a family of proteins, of which the protein encoded by this gene is a member, that are localized to the Golgi. This encoded protein has been postulated to play a role in nuclear transport and Golgi apparatus localization. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of these variants has not been determined.
5500	Sm-like proteins were identified in a variety of organisms based on sequence homology with the Sm protein family (see SNRPD2; MIM 601061). Sm-like proteins contain the Sm sequence motif, which consists of 2 regions separated by a linker of variable length that folds as a loop. The Sm-like proteins are thought to form a stable heteromer present in tri-snRNP particles, which are important for pre-mRNA splicing.[supplied by OMIM]
5501	
5502	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development,reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. The function of the protein encoded by this gene has not been determined. This gene was previously referred to as MMP18 but has been renamed matrix metalloproteinase 19 (MMP19). Multiple transcript variants encoding distict isoforms have been identified for this gene.
5503	
5504	
5505	
5506	SSPN gene is expressed in a variety of tissues with highest levels in muscle, where alternative splice variants have been observed. In certain tumors KRAS2, SSPN, and ITPR2 are coamplified. The function of this gene is unknown.
5507	
5508	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which is an enzyme that possesses both ATPase and DNA helicase activities. This gene is a homolog of the yeast CHL1 gene, and may function to maintain chromosome transmission fidelity and genome stability. This gene consists of 28 exons and its alternative splicing generates at least 3 transcript variants.
5509	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which is an enzyme that possesses both ATPase and DNA helicase activities. This gene is a homolog of the yeast CHL1 gene, and may function to maintain chromosome transmission fidelity and genome stability. This gene consists of 28 exons and its alternative splicing generates at least 3 transcript variants.
5510	
5511	This gene is a member of the frizzled gene family. Members of this family encode 7-transmembrane domain proteins that are receptors for the Wingless type MMTV integration site family of signaling proteins. Most frizzled receptors are coupled to the beta-catenin canonical signaling pathway. Using array analysis, expression of this intronless gene is significantly up-regulated in two cases of primary colon cancer.
5512	
5513	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator.
5514	
5515	DYRK2 belongs to a family of protein kinases whose members are presumed to be involved in cellular growth and/or development.  The family is defined by structural similarity of their kinase domains and their capability to autophosphorylate on tyrosine residues.  DYRK2 has demonstrated tyrosine autophosphorylation and catalyzed phosphorylation of histones H3 and H2B in vitro.  Two isoforms of DYRK2 have been isolated.  The predominant isoform, isoform 1, lacks a 5@#$%& terminal insert.
5516	The protein encoded by this gene appears to be multifunctional. It along with PCBP-1 and hnRNPK corresponds to the major cellular poly(rC)-binding proteins. It contains three K-homologous (KH) domains which may be involved in RNA binding. This encoded protein together with PCBP-1 also functions as translational coactivators of poliovirus RNA via a sequence-specific interaction with stem-loop IV of the IRES and promote poliovirus RNA replication by binding to its 5@#$%&-terminal cloverleaf structure. It has also been implicated in translational control of the 15-lipoxygenase mRNA, human Papillomavirus type 16 L2 mRNA, and hepatitis A virus RNA. The encoded protein is also suggested to play a part in formation of a sequence-specific alpha-globin mRNP complex which is associated with alpha-globin mRNA stability. This multiexon structural mRNA is thought to be retrotransposed to generate PCBP-1 intronless gene which has similar functions. This gene and PCBP-1 has paralogues PCBP3 and PCBP4 which is thought to arose as a result of duplication events of entire genes. It also has two processed pseudogenes PCBP2P1 and PCBP2P2. There are presently two alternatively spliced transcript variants described for this gene.
5517	FMFRamide-related protein precursor plays a role in the regulation of heart rate and blood pressure and the modulation of morphine-induced antinociception.  FMRFAL encodes a preproprotein which is cleaved to form two active peptides with similar function.
5518	
5519	
5520	This gene encodes a member of the UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase (GalNAc-T) family of enzymes. GalNAc-Ts initiate mucin-type O-linked glycosylation in the Golgi apparatus by catalyzing the transfer of GalNAc to serine and threonine residues on target proteins. They are characterized by an N-terminal transmembrane domain, a stem region, a lumenal catalytic domain containing a GT1 motif and Gal/GalNAc transferase motif, and a C-terminal ricin/lectin-like domain. GalNAc-Ts have different, but overlapping, substrate specificities and patterns of expression. The encoded protein is capable of glycosylating fibronectin peptide in vitro and is expressed in a fibroblast cell line, indicating that it may be involved in the synthesis of oncofetal fibronectin.
5521	The protein encoded by this gene is a member of the keratin gene family. As a type II hair keratin, it is a basic protein which heterodimerizes with type I keratins to form hair and nails. The type II hair keratins are clustered in a region of chromosome 12q13 and are grouped into two distinct subfamilies based on structure similarity. One subfamily, consisting of KRTHB1, KRTHB3, and KRTHB6, is highly related. The other less-related subfamily includes KRTHB2, KRTHB4, and KRTHB5.
5522	
5523	
5524	Islet, or insulinoma, amyloid polypeptide (IAPP, or amylin) is commonly found in pancreatic islets of patients suffering diabetes mellitus type II, or harboring an insulinoma.  While the assosciation of amylin with the development of type II diabetes has been known for some time, a direct causative role for amylin has been harder to establish.  Recent results suggest that amylin, like the related beta-amyloid (Abeta) assosciated with Alzheimer@#$%&s disease, can induce apoptotic cell-death in particular cultured cells, an effect that may be relevant to the development of type II diabetes.
5525	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. This protein is thought to form ATP-sensitive potassium channels in cardiac, skeletal, and vascular and non-vascular smooth muscle. Protein structure suggests a role as the drug-binding channel-modulating subunit of the extrapancreatic ATP-sensitive potassium channels. No disease has been associated with this gene thus far. Alternative splicing of this gene results in several products, two of which result from differential usage of two terminal exons and one which results from exon deletion.
5526	Natural killer (NK) cells are a distinct lineage of lymphocytes that mediate cytotoxic activity and secrete cytokines upon immune stimulation. Several genes of the C-type lectin superfamily, including members of the NKG2 family, are expressed by NK cells and may be involved in the regulation of NK cell function. KLRD1 (CD94) is an antigen preferentially expressed on NK cells and is classified as a type II membrane protein because it has an external C terminus. KLRD1 has two alternatively spliced variants that differ in the presence or absence of exon 2 sequence.
5527	
5528	This gene is a homolog of the Drosophila polyhomeotic gene, which is a member of the Polycomb group of genes. The gene product is a component of a multimeric protein complex that contains EDR2 and the vertebrate Polycomb protein BMH1. The gene product, the EDR2 protein, and the Drosophila polyhomeotic protein share 2 highly conserved domains, named homology domains I and II. These domains are involved in protein-protein interactions and may mediate heterodimerization of the protein encoded by this gene and the EDR2 protein.
5529	The protein encoded by this gene is a receptor for mannose-6-phosphate groups on lysosomal enzymes. The receptor forms a homodimer or homotetramer for intracellular targeting of lysosomal enzymes and export of newly synthesized lysosomal enzymes into the cell secretions. The receptor is an integral membrane protein which localizes to the trans-Golgi reticulum, endosomes, and the plasma membrane.
5530	Ubiquitin (MIM 191339), a highly conserved protein involved in the regulation of intracellular protein breakdown, cell cycle regulation, and stress response, is released from degraded proteins by disassembly of the polyubiquitin chains. The disassembly process is mediated by ubiquitin-specific proteases (USPs). Also see USP1 (MIM 603478).[supplied by OMIM]
5531	RHOBTB2 is a member of the evolutionarily conserved RHOBTB subfamily of Rho GTPases. For background information on RHOBTBs, see RHOBTB1 (MIM 607351).[supplied by OMIM]
5532	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is located on chromosome 12 and encodes a variant H2A histone. The protein is divergent at the C-terminus compared to the consensus H2A histone family member. Two transcript variants encoding different isoforms have been found for this gene.
5533	
5534	
5535	
5536	
5537	
5538	This gene is a member of the regulatory factor X gene family, which encodes transcription factors that contain a highly-conserved winged helix DNA binding domain. The protein encoded by this gene is structurally related to regulatory factors X1, X2, X3, and X5. It has been shown to interact with itself as well as with regulatory factors X2 and X3, but it does not interact with regulatory factor X1. This protein may be a transcriptional repressor rather than a transcriptional activator. Three transcript variants encoding different isoforms have been described for this gene.
5539	This gene contains long CAG trinucleotide repeats coding consecutive glutamine residues. The gene product may function as a transcription factor, with a potential role in the regulation of neurodevelopment or neuroplasticity. Studies in mouse suggest that nuclear matrix transcription factors (NP/NMP4) may be part of a general mechanical pathway that couples cell construction and function during extracellular matrix remodeling. Multiple transcript variants have been detected for this gene, but their full-length natures have not been determined.
5540	
5541	The protein encoded by this gene belongs to the tigger subfamily of the pogo superfamily of DNA-mediated transposons in humans. These proteins are related to DNA transposons found in fungi and nematodes, and more distantly to the Tc1 and mariner transposases. They are also very similar to the major mammalian centromere protein B. The exact function of this gene is not known.
5542	
5543	
5544	The protein encoded by this gene is a ubiquitously expressed nuclear protein. It binds directly, with several other proteins, to retinoblastoma protein which regulates cell proliferation. This protein also interacts with rhombotin-2 which functions distinctly in erythropoiesis and in T-cell leukemogenesis. Rhombotin-2 is thought to either directly affect the activity of the encoded protein or may indirectly modulate the functions of the retinoblastoma protein by binding to this protein.
5545	The protein encoded by this gene is a ubiquitously expressed nuclear protein. It binds directly, with several other proteins, to retinoblastoma protein which regulates cell proliferation. This protein also interacts with rhombotin-2 which functions distinctly in erythropoiesis and in T-cell leukemogenesis. Rhombotin-2 is thought to either directly affect the activity of the encoded protein or may indirectly modulate the functions of the retinoblastoma protein by binding to this protein.
5546	
5547	
5548	
5549	
5550	This gene encodes a member of the GIT protein family. GIT proteins interact with G protein-coupled receptor kinases and possess ADP-ribosylation factor (ARF) GTPase-activating protein (GAP) activity. This gene undergoes extensive alternative splicing; although ten transcript variants have been described, the full length sequence has been determined for only four variants. The various isoforms have functional differences, with respect to ARF GAP activity and to G protein-coupled receptor kinase 2 binding.
5551	
5552	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein, which is the functional equivalent of the E. coli L10 ribosomal protein, belongs to the L10P family of ribosomal proteins. It is a neutral phosphoprotein with a C-terminal end that is nearly identical to the C-terminal ends of the acidic ribosomal phosphoproteins P1 and P2. The P0 protein can interact with P1 and P2 to form a pentameric complex consisting of P1 and P2 dimers, and a P0 monomer. The protein is located in the cytoplasm. Transcript variants derived from alternative splicing exist; they encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
5553	
5554	
5555	The protein encoded by this gene is a regulatory subunit of the AMP-activated protein kinase (AMPK). AMPK is a heterotrimer consisting of an alpha catalytic subunit, and non-catalytic beta and gamma subunits. AMPK is an important energy-sensing enzyme that monitors cellular energy status. In response to cellular metabolic stresses, AMPK is activated, and thus phosphorylates and inactivates acetyl-CoA carboxylase (ACC) and beta-hydroxy beta-methylglutaryl-CoA reductase (HMGCR), key enzymes involved in regulating de novo biosynthesis of fatty acid and cholesterol. This subunit may be a positive regulator of AMPK activity. The myristoylation and phosphorylation of this subunit have been shown to affect the enzyme activity and cellular localization of AMPK. This subunit may also serve as an adaptor molecule mediating the association of the AMPK complex.
5556	
5557	Myosin phosphatase target subunit 1, which is also called the myosin-binding subunit of myosin phosphatase, is one of the subunits of myosin phosphatase. Myosin phosphatase regulates the interaction of actin and myosin downstream of the guanosine triphosphatase Rho. The small guanosine triphosphatase Rho is implicated in myosin light chain (MLC) phosphorylation, which results in contraction of smooth muscle and interaction of actin and myosin in nonmuscle cells. The guanosine triphosphate (GTP)-bound, active form of RhoA (GTP.RhoA) specifically interacted with the myosin-binding subunit (MBS) of myosin phosphatase, which regulates the extent of phosphorylation of MLC. Rho-associated kinase (Rho-kinase), which is activated by GTP. RhoA, phosphorylated MBS and consequently inactivated myosin phosphatase. Overexpression of RhoA or activated RhoA in NIH 3T3 cells increased phosphorylation of MBS and MLC. Thus, Rho appears to inhibit myosin phosphatase through the action of Rho-kinase.
5558	
5559	
5560	
5561	
5562	The protein encoded by this gene is highly similar to the gene product of Schizosaccharomyces pombe rad21, a gene involved in the repair of DNA double-strand breaks, as well as in chromatid cohesion during mitosis. This protein is a nuclear phospho-protein, which becomes hyperphosphorylated in cell cycle M phase. The highly regulated association of this protein with mitotic chromatin specifically at the centromere region suggests its role in sister chromatid cohesion in mitotic cells.
5563	
5564	
5565	
5566	This gene encodes a member of the natural killer cell receptor C-type lectin family. The encoded protein inhibits osteoclast formation and contains a transmembrane domain near the N-terminus as well as the C-type lectin-like extracellular domain. Several alternatively spliced transcript variants have been identified, but the full-length nature of every transcript has not been defined.
5567	This gene encodes a cyclin-dependent kinase inhibitor, which shares a limited similarity with CDK inhibitor CDKN1A/p21. The encoded protein binds to and prevents the activation of cyclin E-CDK2 or cyclin D-CDK4 complexes, and thus controls the cell cycle progression at G1. The degradation of this protein, which is triggered by its CDK dependent phosphorylation and subsequent ubiquitination by SCF complexes, is required for the cellular transition from quiescence to the proliferative state.
5568	
5569	This gene encodes a water channel protein located in the kidney collecting tubule. It belongs to the MIP/aquaporin family, some members of which are clustered together on chromosome 12q13. Mutations in this gene have been linked to autosomal dominant, and recessive forms of nephrogenic diabetes insipidus.
5570	
5571	The protein encoded by this gene is a member of the SWI/SNF family of proteins, whose members display helicase and ATPase activities and which are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI and has sequence similarity to the yeast Swp73 protein. Two transcript variants encoding different isoforms have been found for this gene.
5572	This gene encodes a member of the degenerin/epithelial sodium channel (DEG/ENaC) superfamily. The members of this family are amiloride-sensitive sodium channels that contain intracellular N and C termini, 2 hydrophobic transmembrane regions, and a large extracellular loop, which has many cysteine residues with conserved spacing. The member encoded by this gene is expressed in most if not all brain neurons, and it may be an ion channel subunit; however, its function as an ion channel remains unknown. Alternative splicing of this gene generates 2 transcript products.
5573	
5574	
5575	This gene encodes one of the multiple subunits of the RNA polymerase III transcription factor complex required for transcription of genes with promoter elements upstream of the initiation site. The product of this gene, a TFIIB-like factor, is directly recruited to the TATA-box of polymerase III small nuclear RNA gene promoters through its interaction with the TATA-binding protein.
5576	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Like most members, the encoded protein contains an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain. Two transcript variants encoding different isoforms have been found for this gene.
5577	
5578	
5579	
5580	Plastins are a family of actin-binding proteins that are conserved throughout eukaryote evolution and expressed in most tissues of higher eukaryotes. In humans, two ubiquitous plastin isoforms (L and T) have been identified. Plastin 1 (otherwise known as Fimbrin) is a third distinct plastin isoform which is specifically expressed at high levels in the small intestine. The L isoform is expressed only in hemopoietic cell lineages, while the T isoform has been found in all other normal cells of solid tissues that have replicative potential (fibroblasts, endothelial cells, epithelial cells, melanocytes, etc.). However, L-plastin has been found in many types of malignant human cells of non-hemopoietic origin suggesting that its expression is induced accompanying tumorigenesis in solid tissues.
5581	Retinoblastoma (RB) is an embryonic malignant neoplasm of retinal origin. It almost always presents in early childhood and is often bilateral. Spontaneous regression (@#$%&cure@#$%&) occurs in some cases.[supplied by OMIM]
5582	
5583	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. This gene encodes a protein which contains 6 extracellular cadherin domains, a transmembrane domain and a cytoplasmic tail differing from those of the classical cadherins. Although its specific function is undetermined, the cadherin-related neuronal receptor is thought to play a role in the establishment and function of specific cell-cell connections in the brain.
5584	This gene encodes a member of the nucleoporin family that shares 87% sequence identity with rat nucleoporin p58. The protein is localized to the nuclear rim and is a component of the nuclear pore complex (NPC). All molecules entering or leaving the nucleus either diffuse through or are actively transported by the NPC. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
5585	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. The encoded protein is expressed at high levels throughout spermatogenesis and in mature sperm. It binds the RI and RII subunits of PKA in testis. It may serve a function in cell cycle control of both somatic cells and germ cells in addition to its putative role in spermatogenesis and sperm function. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
5586	
5587	This gene is a member of the N-myc downregulated gene family which belongs to the alpha/beta hydrolase superfamily. The protein encoded by this gene is a cytoplasmic protein involved in stress responses, hormone responses, cell growth, and differentiation. Mutation in this gene has been reported to be causative for hereditary motor and sensory neuropathy-Lom.
5588	Gap junctions were first characterized by electron microscopy as regionally specialized structures on plasma membranes of contacting adherent cells. These structures were shown to consist of cell-to-cell channels. Proteins, called connexins, purified from fractions of enriched gap junctions from different tissues differ. The connexins are designated by their molecular mass. Another system of nomenclature divides gap junction proteins into 2 categories, alpha and beta, according to sequence similarities at the nucleotide and amino acid levels. For example, CX43 (MIM 121014) is designated alpha-1 gap junction protein, whereas CX32 (GJB1; MIM 304040) and CX26 are called beta-1 and beta-2 gap junction proteins, respectively. This nomenclature emphasizes that CX32 and CX26 are more homologous to each other than either of them is to CX43.[supplied by OMIM]
5589	Gap junctions were first characterized by electron microscopy as regionally specialized structures on plasma membranes of contacting adherent cells. These structures were shown to consist of cell-to-cell channels. Proteins, called connexins, purified from fractions of enriched gap junctions from different tissues differ. The connexins are designated by their molecular mass. Another system of nomenclature divides gap junction proteins into 2 categories, alpha and beta, according to sequence similarities at the nucleotide and amino acid levels. For example, CX43 (MIM 121014) is designated alpha-1 gap junction protein, whereas CX32 (GJB1; MIM 304040) and CX26 are called beta-1 and beta-2 gap junction proteins, respectively. This nomenclature emphasizes that CX32 and CX26 are more homologous to each other than either of them is to CX43.[supplied by OMIM]
5590	
5591	
5592	Nucleocytoplasmic transport, a signal- and energy-dependent process, takes place through nuclear pore complexes embedded in the nuclear envelope. The import of proteins containing a nuclear localization signal (NLS) requires the NLS import receptor, a heterodimer of importin alpha and beta subunits also known as karyopherins. Importin alpha binds the NLS-containing cargo in the cytoplasm and importin beta docks the complex at the cytoplasmic side of the nuclear pore complex. In the presence of nucleoside triphosphates and the small GTP binding protein Ran, the complex moves into the nuclear pore complex and the importin subunits dissociate. Importin alpha enters the nucleoplasm with its passenger protein and importin beta remains at the pore. Interactions between importin beta and the FG repeats of nucleoporins are essential in translocation through the pore complex. The protein encoded by this gene is a member of the importin beta family.
5593	
5594	
5595	This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. The C-terminal portion of the protein, known as canstatin, is an inhibitor of angiogenesis and tumor growth. Like the other members of the type IV collagen gene family, this gene is organized in a head-to-head conformation with another type IV collagen gene so that each gene pair shares a common promoter.
5596	
5597	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. This gene encodes a protein which contains 6 extracellular cadherin domains, a transmembrane domain and a cytoplasmic tail differing from those of the classical cadherins. Although its specific function is undetermined, the cadherin-related neuronal receptor is thought to play a role in the establishment and function of specific cell-cell connections in the brain.
5598	
5599	
5600	Alternative splicing has been observed at this gene locus and two transcript variants have been identified. Both variants encode the same protein, which has an undetermined function.
5601	This gene encodes a protein containing a calponin homology (CH) domain, a PDZ domain, and a LIM domain. An F-box (FBX) domain is present in alternative splice variants. Members of the LIM protein family carry the LIM domain, a unique cysteine-rich zinc-binding domain. Members of the FBX protein family are involved in protein-protein interactions. The encoded protein may be involved in protein-protein interactions. Multiple alternative splice variants have been described but their full-length sequences have not been determined.
5602	
5603	This gene encodes the vitamin K-dependent coagulation factor X precursor of the blood coagulation cascade. This factor precursor is converted to a mature two-chain form by the excision of the tripeptide RKR. Two chains of the factor are held together by 1 or more disulfide bonds; the light chain contains 2 EGF-like domains, while the heavy chain contains the catalytic domain which is structurally homologous to those of the other hemostatic serine proteases. The mature factor is activated by the cleavage of the activation peptide by factor IXa (in the intrisic pathway), or by factor VIIa (in the extrinsic pathway). The activated factor then converts prothrombin to thrombin in the presence of factor Va, Ca+2, and phospholipid during blood clotting. Mutations of this gene result in factor X deficiency, a hemorrhagic condition of variable severity.
5604	
5605	
5606	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. The enzyme encoded by this gene degrades type IV collagen, fibronectin, fibrinogen and casein, and activates MMP9 by cleavage. The protein differs from most MMP family members in that it lacks a conserved C-terminal protein domain.
5607	
5608	
5609	The protein encoded by this gene belongs to the family of P-type cation transport ATPases. This gene encodes a polypeptide that acts as a membrane copper-transport protein. Defects in this gene are the cause of Wilson disease (WD). WD is an autosomal recessive disorder of copper metabolism in which copper cannot be incorporated into ceruloplasmin in liver, and cannot be excreted from the liver into the bile. Alternatively spliced transcript variants encoding different isoforms have been identified.
5610	The product of this gene belongs to the phosphatase 2A regulatory subunit B family. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes a gamma isoform of the regulatory subunit B56 subfamily. Alternatively spliced transcript variants encoding different isoforms have been identified.
5611	The RCOR gene encodes a functional corepressor required for regulation of neural-specific gene expression.[supplied by OMIM]
5612	
5613	
5614	
5615	The protein encoded by this gene belongs to the NCP family of cell-recognition molecules. This family represents a distinct subgroup of the neurexins. NCP proteins mediate neuron-glial interactions in vertebrates and glial-glial contact in invertebrates. The protein encoded by this gene may play a role in cell recognition within the nervous system. Alternatively spliced transcript variants encoding different isoforms have been described but their biological nature has not been determined.
5616	
5617	
5618	The protein encoded by this gene is a HEAT-like repeat-containing protein. The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins. Arrays of HEAT repeats form a rod-like helical structure and appear to function as protein-protein interaction surfaces. The repeat-containing region of this protein has some similarity to the constant regulatory domain of the protein phosphatase 2A PR65/A subunit. The function of this particular gene product has not been determined. Alternative splicing has been observed for this gene and two transcript variants encoding distinct isoforms have been identified.
5619	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation. This gene is induced by all-trans retinoic acid. In myeloid leukemia cells, the expression of this encoded protein has been shown to induce growth inhibition and chromatin condensation. Multiple alternatively spliced transcript variants have been described for this gene but their full length sequences are not known.
5620	The protein encoded by this gene was identified by its interaction with apoptosis regulator BAX protein. This protein contains a Bcl-2 homology 3 (BH3)-like motif, which is required for the association with BAX. When overexpressed, this gene has been shown to mediate caspase-dependent apoptosis.
5621	
5622	
5623	
5624	This gene belongs to the forkhead family of transcription factors which is characterized by a distinct forkhead domain.  The specific function of this gene has not yet been determined; however, it may play a role in the development of the brain and telencephalon.
5625	
5626	
5627	The function of this gene is not known, but its existence is supported by mRNAs and EST data. A similar gene in mice was found to be expressed preferentially in hematopoietic progenitors and mature blood cells. Two alternatively spliced transcript variants encoding the same protein have been noted for this gene.
5628	
5629	
5630	
5631	
5632	
5633	The protein encoded by this gene belongs to the pancreatic ribonuclease family. It plays an important role in mRNA cleavage and has marked specificity towards the 3@#$%& side of uridine nucleotides. Alternative splicing occurs at this locus and three transcript variants encoding the same protein have been identified. This gene and ANG share two exons in their 5@#$%& UTRs. The downstream exons are unique and contain the coding regions.
5634	The protein encoded by this gene is a member of the basic helix-loop-helix leucine zipper (bHLHZ) family of transcription factors. It is able to form homodimers and heterodimers with other family members, which include Mad, Mxi1 and Myc. Myc is an oncoprotein implicated in cell proliferation, differentiation and apoptosis. The homodimers and heterodimers compete for a common DNA target site (the E box) and rearrangement among these dimer forms provides a complex system of transcriptional regulation. Multiple alternatively spliced transcript variants have been described for this gene but the full length nature for some of them is unknown.
5635	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role in cells. The protein encoded by this gene is one of the PKC family members. It is a calcium-independent and phospholipids-dependent protein kinase. It is predominantly expressed in epithelial tissues and has been shown to reside specifically in the cell nucleus. This protein kinase can regulate keratinocyte differentiation by activating the MAP kinase MAPK13 (p38delta)-activated protein kinase cascade that targets CCAAT/enhancer-binding protein alpha (CEBPA). It is also found to mediate the transcription activation of the transglutaminase 1 (TGM1) gene.
5636	
5637	
5638	
5639	
5640	
5641	
5642	
5643	This gene encodes a member of the serine/threonine protein kinase family, that is highly similar to yeast SPS1/STE20 kinase. Yeast SPS1/STE20 functions near the beginning of the MAP kinase signal cascades that is essential for yeast pheromone response. This kinase was shown to activate Jun kinase in mammalian cells, which suggested a role in stress response. Two alternatively spliced transcript variants encoding the same protein have been described for this gene.
5644	
5645	
5646	Members of the RGS (regulator of G protein signaling) family have been shown to modulate the functioning of G proteins by activating the intrinsic GTPase activity of the alpha (guanine nucleotide-binding) subunits.[supplied by OMIM]
5647	Alzheimer@#$%&s disease (AD) patients with an inherited form of the disease carry mutations in the presenilin proteins (PSEN1; PSEN2) or the amyloid precursor protein (APP).  These disease-linked mutations result in increased production of the longer form of amyloid-beta (main component of amyloid deposits found in AD brains).  Presenilins are postulated to regulate APP processing through their effects on gamma-secretase, an enzyme that cleaves APP.  Also, it is thought that the presenilins are involved in the cleavage of the Notch receptor, such that they either directly regulate gamma-secretase activity or themselves are protease enzymes.  Three alternative splice variants of PSEN1 have been identified.
5648	
5649	STXBP6 binds components of the SNARE complex (see MIM 603215) and may be involved in regulating SNARE complex formation (Scales et al., 2002).[supplied by OMIM]
5650	This gene encodes a C2H2-type zinc finger protein and is closely related to BCL11A, a gene whose translocation may be associated with B-cell malignancies. The specific function of this gene has not yet been determined. Two alternatively spliced transcript variants, which encode distinct isoforms, have been reported.
5651	Human echinoderm microtubule-associated protein-like is a strong candidate for the Usher syndrome type 1A gene. Usher syndromes (USHs) are a group of genetic disorders consisting of congenital deafness, retinitis pigmentosa, and vestibular dysfunction of variable onset and severity depending on the genetic type. The disease process in USHs involves the entire brain and is not limited to the posterior fossa or auditory and visual systems. The USHs are catagorized as type I (USH1A, USH1B, USH1C, USH1D, USH1E and USH1F), type II (USH2A and USH2B) and type III (USH3). The type I is the most severe form. Gene loci responsible for these three types are all mapped. Two transcript variants encoding different isoforms have been found for this gene.
5652	
5653	Heterotrimeric guanine nucleotide-binding proteins (G proteins), which integrate signals between receptors and effector proteins, are composed of an alpha, a beta, and a gamma subunit. These subunits are encoded by families of related genes. This gene encodes a beta subunit. Beta subunits are important regulators of alpha subunits, as well as of certain signal transduction receptors and effectors. Alternatively spliced transcript variants encoding different isoforms exist.
5654	
5655	
5656	The protein encoded by this gene may play a role in intracellular vesicle trafficking. It interacts with Syntaxin 13 which mediates intracellular membrane fusion. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
5657	The protein encoded by this gene may play a role in intracellular vesicle trafficking. It interacts with Syntaxin 13 which mediates intracellular membrane fusion. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
5658	
5659	The C-terminus of the protein encoded by this gene binds topoisomerase I. The N-terminus contains a proline-rich region and a BTB/POZ domain (broad-complex, Tramtrack and bric a brac/Pox virus and Zinc finger), both of which are typically involved in protein-protein interactions. Subcellularly, the protein localizes to cytoplasmic bodies. Alternative splicing results in multiple transcript variants encoding different isoforms.
5660	
5661	
5662	This protein is part of a large family of nuclear receptor transcription factors involved in signaling pathways. Nuclear receptors have been shown to regulate pathways involved in embryonic development, as well as in maintenance of proper cell function in adults. Members of this family are characterized by discrete domains that function in DNA and ligand binding. This gene encodes a retinal nuclear receptor that is a ligand-dependent transcription factor. Defects in this gene are a cause of enhanced S cone syndrome. Alternatively spliced transcript variants encoding different isoforms have been identified.
5663	The SCAN domain is a highly conserved, leucine-rich motif of approximately 60 aa originally found within a subfamily of zinc finger proteins. This gene belongs to a family of genes that encode an isolated SCAN domain, but no zinc finger motif. Functional studies have established that the SCAN box is a protein interaction domain that mediates both hetero- and homoprotein associations, and maybe involved in regulation of transcriptional activity. Multiple transcript variants which encode the same isoform but differ only in their 3@#$%& UTRs, and another variant which encodes a truncated isoform have been described for this gene.
5664	Actins are highly conserved proteins that are involved in various types of cell motility. Polymerization of globular actin (G-actin) leads to a structural filament (F-actin) in the form of a two-stranded helix. Each actin can bind to four others. The protein encoded by this gene belongs to the actin family which is comprised of three main groups of actin isoforms, alpha, beta, and gamma. The alpha actins are found in muscle tissues and are a major constituent of the contractile apparatus. Defects in this gene have been associated with idiopathic dilated cardiomyopathy (IDC) and familial hypertrophic cardiomyopathy (FHC).
5665	This gene encodes a member of the neurotrophic tyrosine receptor kinase (NTRK) family. This kinase is a membrane-bound receptor that, upon neurotrophin binding, phosphorylates itself and members of the MAPK pathway. Signalling through this kinase leads to cell differentiation and may play a role in the development of proprioceptive neurons that sense body position. Mutations in this gene have been associated with medulloblastomas, secretory breast carcinomas and other cancers.
5666	
5667	
5668	
5669	
5670	Aldehyde dehydrogenase isozymes are thought to play a major role in the detoxification of aldehydes generated by alcohol metabolism and lipid peroxidation. The gene for this human isozyme spans about 37 kb and consists of 13 exons. It is expressed at low levels in many tissues and at higher levels in salivary gland, stomach and kidney. The major transcript is approximately 3.5 kb; however, there is a less abundant transcript that is smaller by 111 bp in the 3@#$%& non-coding region and uses an alternative polyadenylation signal. The significance of alternative splicing of this gene is currently unclear.
5671	APH1 is a multipass transmembrane protein that interacts with presenilin (see PSEN1; MIM 104311) and nicastrin (APH2; MIM 605254) as a functional component of the gamma-secretase complex. The gamma-secretase complex is required for the intramembrane proteolysis of a number of membrane proteins, including the amyloid-beta precursor protein (APP; MIM 104760) and Notch (MIM 190198).[supplied by OMIM]
5672	This gene is a member of the tropomyosin family of highly conserved, widely distributed actin-binding proteins involved in the contractile system of striated and smooth muscles and the cytoskeleton of non-muscle cells. Tropomyosin is composed of two alpha-helical chains arranged as a coiled-coil. It is polymerized end to end along the two grooves of actin filaments and provides stability to the filaments. The encoded protein is one type of alpha helical chain that forms the predominant tropomyosin of striated muscle, where it also functions in association with the troponin complex to regulate the calcium-dependent interaction of actin and myosin during muscle contraction. In smooth muscle and non-muscle cells, alternatively spliced transcript variants encoding a range of isoforms have been described. Mutations in this gene are associated with type 3 familial hypertrophic cardiomyopathy.
5673	This gene encodes an alpha integrin. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. This protein contains an I domain, is expressed in muscle tissue, dimerizes with beta 1 integrin in vitro, and appears to bind collagen in this form. Therefore, the protein may be involved in attaching muscle tissue to the extracellular matrix. Alternative transcriptional splice variants, encoding the same protein, have been characterized.
5674	
5675	This gene encodes a protein that is involved in cytokinesis. The encoded protein is at high level during S and G2/M and drop dramatically after cell exit mitosis and enter G1. It is located in the nucleus during interphase, and becomes associated with mitotic spindles in a highly dynamic manner during mitosis, and localizes to the cell mid-body during cytokinesis. This protein has been shown to be a substrate of several cyclin-dependent kinases (CDKs). At least three alternatively spliced transcript variants encoding distinct isoforms have been observed.
5676	NIPSNAP3A belongs to a family of proteins with putative roles in vesicular transport (Buechler et al., 2004).[supplied by OMIM]
5677	
5678	
5679	
5680	
5681	
5682	This gene, CYP1A2, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. The protein encoded by this gene localizes to the endoplasmic reticulum and its expression is induced by some polycyclic aromatic hydrocarbons (PAHs), some of which are found in cigarette smoke. The enzyme@#$%&s endogenous substrate is unknown; however, it is able to metabolize some PAHs to carcinogenic intermediates. Other xenobiotic substrates for this enzyme include caffeine, aflatoxin B1, and acetaminophen. The transcript from this gene contains four Alu sequences flanked by direct repeats in the 3@#$%& untranslated region. A related family member, CYP1A1, is located approximately 25 kb away from CYP1A2 on chromosome 15.
5683	This gene, CYP1A2, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. The protein encoded by this gene localizes to the endoplasmic reticulum and its expression is induced by some polycyclic aromatic hydrocarbons (PAHs), some of which are found in cigarette smoke. The enzyme@#$%&s endogenous substrate is unknown; however, it is able to metabolize some PAHs to carcinogenic intermediates. Other xenobiotic substrates for this enzyme include caffeine, aflatoxin B1, and acetaminophen. The transcript from this gene contains four Alu sequences flanked by direct repeats in the 3@#$%& untranslated region. A related family member, CYP1A1, is located approximately 25 kb away from CYP1A2 on chromosome 15.
5684	
5685	
5686	
5687	
5688	The protein encoded by this gene is a subunit of a disulfide-linked homotrimeric protein. This protein is an adhesive glycoprotein that mediates cell-to-cell and cell-to-matrix interactions. This protein can bind to fibrinogen, fibronectin, laminin, type V collagen and integrins alpha-V/beta-1. This protein has been shown to play roles in platelet aggregation, angiogenesis, and tumorigenesis.
5689	
5690	
5691	
5692	
5693	
5694	
5695	
5696	
5697	
5698	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily G. This member functions as a modulatory subunit. The gene has strong expression in brain. Alternative splicing results in two transcript variants encoding distinct isoforms.
5699	
5700	This gene is a classical cadherin from the cadherin superfamily. The encoded protein is a calcium dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. Mutations in this gene are correlated with gastric, breast, colorectal, thyroid and ovarian cancer. Loss of function is thought to contribute to progression in cancer by increasing proliferation, invasion, and/or metastasis. The ectodomain of this protein mediates bacterial adhesion to mammalian cells and the cytoplasmic domain is required for internalization. Identified transcript variants arise from mutation at consensus splice sites.
5701	
5702	
5703	The product of this gene is a cytokine. Cytokines are small proteins that have an essential role in the immune and inflammatory responses. This gene is one of several chemokine-like factor genes located in a cluster on chromosome 16. The protein encoded by this gene is a potent chemoattractant for neutrophils, monocytes and lymphocytes. It also can stimulate the proliferation of skeletal muscle cells. This protein may play important roles in inflammation and in the regeneration of skeletal muscle. Alternatively spliced transcript variants encoding different isoforms have been identified.
5704	
5705	WW domain-containing proteins are found in all eukaryotes and play an important role in the regulation of a wide variety of cellular functions such as protein degradation, transcription, and RNA splicing. This gene encodes a protein which contains 2 WW domains and a short-chain dehydrogenase/reductase domain (SRD). The highest normal expression of this gene is detected in hormonally regulated tissues such as testis, ovary, and prostate. This expression pattern and the presence of an SRD domain suggest a role for this gene in steroid metabolism. The encoded protein is more than 90% identical to the mouse protein, which is an essential mediator of tumor necrosis factor-alpha-induced apoptosis, suggesting a similar, important role in apoptosis for the human protein. In addition, there is evidence that this gene behaves as a suppressor of tumor growth. Alternative splicing of this gene generates 7 transcript variants, which encode different isoforms.
5706	
5707	The Fanconi anemia complementation group (FANC) currently includes FANCA, FANCB, FANCC, FANCD1 (also called BRCA2), FANCD2, FANCE, FANCF, FANCG, and FANCL. The previously defined group FANCH is the same as FANCA. Fanconi anemia is a genetically heterogeneous recessive disorder characterized by cytogenetic instability, hypersensitivity to DNA crosslinking agents, increased chromosomal breakage, and defective DNA repair. The members of the Fanconi anemia complementation group do not share sequence similarity; they are related by their assembly into a common nuclear protein complex. This gene encodes the protein for complementation group A. Alternative splicing results in multiple transcript variants encoding different isoforms. Mutations in this gene are the most common cause of Fanconi anemia.
5708	
5709	
5710	This gene encodes a protein localized to the inner mitochondrial membrane and thought to be involved in the biogenesis of the cytochrome c oxidase complex. The protein is a member of the SURF1 family, which includes the related yeast protein SHY1 and rickettsial protein RP733. The gene is located in the surfeit gene cluster, a group of very tightly linked genes that do not share sequence similarity, where it shares a bidirectional promoter with SURF2 on the opposite strand. Defects in this gene are a cause of Leigh syndrome, a severe neurological disorder that is commonly associated with systemic cytochrome c oxidase deficiency.
5711	The hepatotrophic factor designated augmenter of liver regeneration (ALR) is thought to be one of the factors responsible for the extraordinary regenerative capacity of mammalian liver. It has also been called hepatic regenerative stimulation substance (HSS). The gene resides on chromosome 16 in the interval containing the locus for polycystic kidney disease (PKD1). The putative gene product is 42% similar to the scERV1 protein of yeast. The yeast scERV1 gene had been found to be essential for oxidative phosphorylation, the maintenance of mitochondrial genomes, and the cell division cycle. The human gene is both the structural and functional homolog of the yeast scERV1 gene.
5712	
5713	
5714	
5715	
5716	
5717	
5718	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role in cells. The protein encoded by this gene is one of the PKC family members. This protein kinase has been reported to be involved in many different cellular functions, such as B cell activation, apoptosis induction, endothelial cell proliferation, and intestinal sugar absorption. Studies in mice also suggest that this kinase may also regulate neuronal functions and correlate fear-induced conflict behavior after stress. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
5719	
5720	
5721	
5722	
5723	
5724	
5725	This gene encodes a member of the NEDD4-like protein family. The family of proteins is known to possess ubiquitin-protein ligase activity. The encoded protein contains 4 tandem WW domains. The WW domain is a protein motif consisting of 35 to 40 amino acids and is characterized by 4 conserved aromatic residues. The WW domain may mediate specific protein-protein interactions. Three alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
5726	
5727	
5728	
5729	
5730	Carboxylesterase 1 is a member of a large multigene family.  The enzymes encoded by these genes are responsible for the hydrolysis of ester- and amide-bond-containing drugs such as cocaine and heroin.  They also hydrolize long-chain fatty acid esters and thioesters.  This enzyme is known to hydrolyze aromatic and aliphatic esters and is necessary for cellular cholesterol esterification.  It may also play a role in detoxification in the lung and/or protection of the central nervous system from ester or amide compounds.  Carboxylesterase deficiency may be associated with non-Hodgkin lymphoma or B-cell lymphocytic leukemia.
5731	
5732	
5733	
5734	This nuclear gene encodes a mitochondrial protein tyrosine aminotransferase which is present in the liver and catalyzes the conversion of L-tyrosine into p-hydroxyphenylpyruvate. Mutations in this gene cause tyrosinemia (type II, Richner-Hanhart syndrome), a disorder accompanied by major skin and corneal lesions, with possible mental retardation. A regulator gene for tyrosine aminotransferase is X-linked.
5735	Adaptins are important components of clathrin-coated vesicles transporting ligand-receptor complexes from the plasma membrane or from the trans-Golgi network to lysosomes. The adaptin family of proteins is composed of four classes of molecules named alpha, beta-, beta prime- and gamma- adaptins. Adaptins, together with medium and small subunits, form a heterotetrameric complex called an adaptor, whose role is to promote the formation of clathrin-coated pits and vesicles. The protein encoded by this gene is a gamma-adaptin protein and it belongs to the adaptor complexes large subunits family.
5736	
5737	ITGAL encodes the integrin alpha L chain. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. This I-domain containing alpha integrin combines with the beta 2 chain (ITGB2) to form the integrin lymphocyte function-associated antigen-1 (LFA-1), which is expressed on all leukocytes. LFA-1 plays a central role in leukocyte intercellular adhesion through interactions with its ligands, ICAMs 1-3 (intercellular adhesion molecules 1 through 3), and also functions in lymphocyte costimulatory signaling.
5738	
5739	Exportins, such as XPO6, recruit cargo in the nucleoplasm in the presence of RAN (MIM 601179)-GTP and form ternary export complexes. These complexes are transported through nuclear pore complexes to the cytoplasm, where GTP is hydrolyzed and the export complex is disassembled.[supplied by OMIM]
5740	Exportins, such as XPO6, recruit cargo in the nucleoplasm in the presence of RAN (MIM 601179)-GTP and form ternary export complexes. These complexes are transported through nuclear pore complexes to the cytoplasm, where GTP is hydrolyzed and the export complex is disassembled.[supplied by OMIM]
5741	
5742	
5743	
5744	
5745	
5746	The protein encoded by this gene is a smooth muscle myosin belonging to the myosin heavy chain family. The gene product is a subunit of a hexameric protein that consists of two heavy chain subunits and two pairs of non-identical light chain subunits. It functions as a major contractile protein, converting chemical energy into mechanical energy through the hydrolysis of ATP. The gene encoding a human ortholog of rat NUDE1 is transcribed from the reverse strand of this gene, and its 3@#$%& end overlaps with that of the latter. The pericentric inversion of chromosome 16 [inv(16)(p13q22)] produces a chimeric transcript that encodes a protein consisting of the first 165 residues from the N terminus of core-binding factor beta in a fusion with the C-terminal portion of the smooth muscle myosin heavy chain. This chromosomal rearrangement is associated with acute myeloid leukemia of the M4Eo subtype. Alternative splicing generates isoforms that are differentially expressed, with ratios changing during muscle cell maturation. Additional splice variants have been described, but their full-length natures have not been determined.
5747	The protein encoded by this gene is the first enzyme of the fatty acid beta-oxidation pathway, which catalyzes the desaturation of acyl-CoAs to 2-trans-enoyl-CoAs. ACOX1 donates electrons directly to molecular oxygen, thereby producing hydrogen peroxide. Defects in the ACOX1 gene result in pseudoneonatal adrenoleukodystrophy, a disease that is characterized by accumulation of very long chain fatty acids.
5748	The protein encoded by this gene belongs to the SEC14 cytosolic factor family. It has similarity to yeast SEC14 and to Japanese flying squid RALBP which suggests a possible role of the gene product in an intracellular transport system.
5749	
5750	Transcriptional regulatory proteins containing tandemly repeated zinc finger domains are thought to be involved in both normal and abnormal cellular proliferation and differentiation. ZNF161 is a C2H2-type zinc finger protein. See MIM 603971 for general information on zinc finger proteins.[supplied by OMIM]
5751	
5752	This gene is a member of the RAD51 family of related genes, which encode strand-transfer proteins thought to be involved in recombinational repair of damaged DNA and in meiotic recombination. This gene product interacts with two other DNA repair proteins, encoded by RAD51B and XRCC3, but not with itself. The protein copurifies with XRCC3 protein in a complex, reflecting their endogenous association and suggesting a cooperative role during recombinational repair. This gene is one of four localized to a region of chromosome 17q23 where amplification occurs frequently in breast tumors. Overexpression of the four genes during amplification has been observed and suggests a possible role in tumor progression. Alternative splicing has been observed for this gene and three variants encoding different isoforms have been identified.
5753	WD40 repeat proteins are key components of many essential biologic functions. They regulate the assembly of multiprotein complexes by presenting a beta-propeller platform for simultaneous and reversible protein-protein interactions. Members of the WIPI subfamily of WD40 repeat proteins, such as WIPI3, have a 7-bladed propeller structure and contain a conserved motif for interaction with phospholipids (Proikas-Cezanne et al., 2004).[supplied by OMIM]
5754	The CMRF35 antigen (CMRF35A; MIM 606786), which was identified by reactivity with a monoclonal antibody, is present on monocytes, neutrophils, and some T and B lymphocytes. CMRF35H is recognized by the same antibody and is distinct from CMRF35 (Green et al., 1998).[supplied by OMIM]
5755	
5756	
5757	This gene encodes a member of the STAT-induced STAT inhibitor (SSI), also known as suppressor of cytokine signaling (SOCS), family. SSI family members are cytokine-inducible negative regulators of cytokine signaling. The expression of this gene is induced by various cytokines, including IL6, IL10, and interferon (IFN)-gamma. The protein encoded by this gene can bind to JAK2 kinase, and inhibit the activity of JAK2 kinase. Studies of the mouse counterpart of this gene suggested the roles of this gene in the negative regulation of fetal liver hematopoiesis, and placental development.
5758	
5759	
5760	The muscle acetylcholine receptor is composed of five subunits: two alpha subunits and one beta, one gamma, and one delta subunit. This gene encodes the beta subunit of the acetylcholine receptor. The acetylcholine receptor changes conformation upon acetylcholine binding leading to the opening of an ion-conducting channel across the plasma membrane. Mutations in this gene are associated with slow-channel congenital myasthenic syndrome.
5761	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. The function of this gene has not yet been determined. The expression pattern of the mouse homolog implies a role in nervous system development.
5762	C8G is one of the three polypeptides that constitute C8, a component of the complement system.  C8 participates in the formation of Membrane Attack Complex (MAC).  Patients with deficiency in C8 are vulnerable to certain bacteria infection.
5763	This gene product is a 626-amino acid polypeptide that is 96.5% identical to mouse Mekk3. Its catalytic domain is closely related to those of several other kinases, including mouse Mekk2, tobacco NPK, and yeast Ste11. Northern blot analysis revealed a 4.6-kb transcript that appears to be ubiquitously expressed. This protein directly regulates the stress-activated protein kinase (SAPK) and extracellular signal-regulated protein kinase (ERK) pathways by activating SEK and MEK1/2 respectively; it does not regulate the p38 pathway. In cotransfection assays, it enhanced transcription from a nuclear factor kappa-B (NFKB)-dependent reporter gene, consistent with a role in the SAPK pathway. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
5764	This gene product is a 626-amino acid polypeptide that is 96.5% identical to mouse Mekk3. Its catalytic domain is closely related to those of several other kinases, including mouse Mekk2, tobacco NPK, and yeast Ste11. Northern blot analysis revealed a 4.6-kb transcript that appears to be ubiquitously expressed. This protein directly regulates the stress-activated protein kinase (SAPK) and extracellular signal-regulated protein kinase (ERK) pathways by activating SEK and MEK1/2 respectively; it does not regulate the p38 pathway. In cotransfection assays, it enhanced transcription from a nuclear factor kappa-B (NFKB)-dependent reporter gene, consistent with a role in the SAPK pathway. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
5765	
5766	The protein encoded by this gene is a member of the somatotropin/prolactin family of hormones and plays an important role in growth control. The gene is located at the growth hormone locus on chromosome 17 along with four other related genes in the same transcriptional orientation; an arrangement which is thought to have evolved by a series of gene duplications. Although the five genes share a remarkably high degree of sequence identity, they are expressed selectively in different tissues. Alternative splicing generates additional isoforms of each of the five growth hormones, leading to further diversity and potential for specialization. This particular family member is expressed mainly in the placenta and utilizes multiple transcription initiation sites. Expression of the identical mature proteins for chorionic somatomammotropin hormones 1 and 2 is upregulated during development, although the ratio of 1 to 2 increases by term. Mutations in this gene result in placental lactogen deficiency and Silver-Russell syndrome.
5767	
5768	
5769	The protein encoded by this gene is a member of the somatotropin/prolactin family of hormones and plays an important role in growth control. The gene is located at the growth hormone locus on chromosome 17 along with four other related genes in the same transcriptional orientation; an arrangement which is thought to have evolved by a series of gene duplications. Although the five genes share a remarkably high degree of sequence identity, they are expressed selectively in different tissues. Alternative splicing generates additional isoforms of each of the five growth hormones, leading to further diversity and potential for specialization. This particular family member is expressed mainly in the placenta and utilizes multiple transcription initiation sites. Expression of the identical mature proteins for chorionic somatomammotropin hormones 1 and 2 is upregulated during development, although the ratio of 1 to 2 increases by term. Mutations in this gene result in placental lactogen deficiency and Silver-Russell syndrome.
5770	
5771	The B lymphocyte antigen receptor is a multimeric complex which includes the antigen-specific component, surface immunoglobulin (Ig). Surface Ig non-covalently associates with 2 other proteins, Ig-alpha and Ig-beta, which are necessary for expression and function of the B-cell antigen receptor. This gene encodes the Ig-beta protein of the B-cell antigen component. Alternative splicing results in two known transcript variants.
5772	
5773	
5774	The protein encoded by this gene interacts with microtubules and is functionally associated with beta-amyloid precursor protein transport and/or processing. The beta-amyloid precursor protein is a cell surface protein with signal-transducing properties, and it is thought to play a role in the pathogenesis of Alzheimer@#$%&s disease. This gene has been found to be highly expressed in breast cancer. Multiple polyadenylation sites have been found for this gene.
5775	
5776	
5777	
5778	This gene encodes mitogen-activated protein kinase kinase kinase 14, which is a serine/threonine protein-kinase. This kinase binds to TRAF2 and stimulates NF-kappaB activity. It shares sequence similarity with several other MAPKK kinases. It participates in an NF-kappaB-inducing signalling cascade common to receptors of the tumour-necrosis/nerve-growth factor (TNF/NGF) family and to the interleukin-1 type-I receptor.
5779	
5780	Pancreatic polypeptide (PPY) was the first identified product of the dominating endocrine cell type of the duodenal pancreas (Kimmel et al., 1975; Larsson et al., 1976). This hormone of 36 amino acids seems to be involved in the regulation of exocrine pancreatic secretion and biliary tract motility (Schwartz, 1983). PPY may be important in regulation of food intake; genetically obese laboratory animals have altered PPY release and in New Zealand obese mice weight gain can be cured by infusion of PPY. Children with Prader-Willi syndrome (PWS; MIM 176270) have blunted secretion of PPY (Zipf et al., 1981).[supplied by OMIM]
5781	
5782	
5783	The protein encoded by this gene is a member of the STAT family of transcription factors. In response to cytokines and growth factors, STAT family members are phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. This protein is activated by, and mediates the responses of many cell ligands, such as IL2, IL3, IL7 GM-CSF, erythropoietin, thrombopoietin, and different growth hormones. Activation of this protein in myeloma and lymphoma associated with a TEL/JAK2 gene fusion is independent of cell stimulus and has been shown to be essential for the tumorigenesis. The mouse counterpart of this gene is found to induce the expression of BCL2L1/BCL-X(L), which suggests the antiapoptotic function of this gene in cells.
5784	
5785	The protein encoded by this gene was originally identified as an ovarian tumor antigen monitored in ovarian cancer. The encoded protein contains a B-box/coiled coil motif, which is present in many genes with transformation potential, but the function of this protein is unknown. This gene is located on a region of chromosome 17q21.1 that is in close proximity to tumor suppressor gene BRCA1. Three alternatively spliced variants encoding the same protein have been identified for this gene.
5786	
5787	
5788	This gene is one of several cytokine genes clustered on the q-arm of chromosome 17. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The protein encoded by this gene is structurally related to the CXC subfamily of cytokines. Members of this subfamily are characterized by two cysteines separated by a single amino acid. This cytokine displays chemotactic activity for monocytes, lymphocytes, basophils and eosinophils. By recruiting leukocytes to sites of inflammation this cytokine may contribute to tumor-associated leukocyte infiltration and to the antiviral state against HIV infection.
5789	Extracellular signals are transduced into cells through mitogen-activated protein kinases. The structural organization of these kinases into specific signaling domains is facilitated by scaffolding proteins involved in closely tethering different kinases so that successive phosphorylation events can occur. The protein encoded by this gene is a scaffolding protein that brings together mitogen-activated protein kinases and their transcription factor targets for the activation of specific signaling pathways. This gene which is abundantly expressed in testicular haploid germ cells encodes a protein that is recognized by sperm-agglutinating antibodies and implicated in infertility. Alternatively spliced variants that encode different protein isoforms have been described but the full-length sequences of only two have been determined.
5790	This gene encodes a cisplatin resistance-associated overexpressed protein (CROP). The N-terminal half of the CROP contains cysteine/histidine motifs and leucine zipper-like repeats, and the C-terminal half is rich in arginine and glutamate residues (RE domain) and arginine and serine residues (RS domain). This protein localizes with a speckled pattern in the nucleus, and could be involved in the formation of splicesome via the RE and RS domains. Two alternatively spliced transcript variants encoding the same protein have been found for this gene.
5791	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. The specific function of this protein has not yet been determined; however, this protein may play a role in the transport of biliary and intestinal excretion of organic anions. Alternative splicing of this gene results in three known transcript variants.
5792	The protein encoded by this gene contains a kinase domain most closely related to the cyclin-dependent protein kinases. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
5793	This gene encodes the major component of type I collagen, the fibrillar collagen found in most connective tissues, and the only component of the collagen found in cartilage. Mutations in this gene are associated with osteogenesis imperfecta, Ehlers-Danlos syndrome, and idiopathic osteoporosis. Reciprocal translocations between chromosomes 17 and 22, where this gene and the gene for platelet-derived growth factor beta are located, are associated with a particular type of skin tumor called dermatofibrosarcoma protuberans, resulting from unregulated expression of the growth factor. Two transcripts, resulting from the use of alternate polyadenylation signals, have been identified for this gene.
5794	In addition to antigen recognition by the T-cell receptor, T-cell activation requires a second signal from a costimulatory receptor, such as CD28 (MIM 186760), which interacts with B7-1 (CD80; MIM 112203) and B7-2 (CD86; MIM 601020) ligands on antigen-presenting cells. CD28 costimulation induces transcription of interleukin-2 (IL2; MIM 147680) and stabilizes newly synthesized IL2 through the activation of mitogen-activated protein kinases (MAPKs), such as ERK (e.g., MAP2K4; MIM 601335) and JNK (see MIM 601158), and the subsequent creation of AP1 transcription factor (see MIM 165160). DUSP14 is a negative regulator of CD28 signaling.[supplied by OMIM]
5795	
5796	TCF2 encodes transcription factor 2, a liver-specific factor of the homeobox-containing basic helix-turn-helix family. The TCF2 protein is believed to form heterodimers with another liver-specific member of this transcription factor family, TCF1; depending on the TCF2 isoform, the result may be to activate or inhibit transcription of target genes. Mutation of TCF2 that disrupts normal function has been identified as the cause of MODY5 (Maturity-Onset of Diabetes, Type 5). A third human transcript variant is believed to exist based on such a variant in the rat: however, to date such an mRNA species has not been isolated.
5797	
5798	This gene is one of several cytokine genes clustered on the q-arm of chromosome 17. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene displays chemotactic activity for lymphocytes and monocytes but not for neutrophils. This cytokine also shows a potent myelosuppressive activity and suppresses proliferation of myeloid progenitor cells. The expression of this gene is upregulated by IL-10.
5799	Mutations linked to neurofibromatosis type 1 led to the identification of NF1. NF1 encodes the protein neurofibromin, which appears to be a negative regulator of the ras signal transduction pathway. In addition to type 1 neurofibromatosis, mutations in NF1 can also lead to juvenile myelomonocytic leukemia. Alternatively spliced NF1 mRNA transcripts have been isolated, although their functions, if any, remain unclear.
5800	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator.
5801	
5802	This gene encodes a cytoplasmic protein that contains a coiled-coil structure and a BTB/POZ domain at its N-terminus, ankyrin repeats in the middle portion, and a FYVE-finger motif at its C-terminus. This protein belongs to a subgroup of double zinc finger proteins which may be involved in vesicle or protein transport. Alternative splicing has been observed at this locus and two variants, each encoding a distinct isoform, have been identified.
5803	
5804	The product of this gene belongs to the Serine/Threonine protein kinase family, and to the Ca(2+)/calmodulin-dependent protein kinase subfamily. This protein plays a role in the calcium/calmodulin-dependent (CaM) kinase cascade. Three transcript variants encoding two distinct isoforms have been identified for this gene.
5805	The ACADVL gene product is targetted to the inner mitochondrial membrane where it catalyzes the first step of the mitochondrial fatty acid beta-oxidation pathway.  This acyl-Coenzyme A dehydrogenase is specific to long-chain and very long-chain fatty acids. ACADVL deficiency reduces myocardial fatty acid beta-oxidation and is associated with cardiomyopathy.
5806	
5807	
5808	
5809	
5810	The protein encoded by this gene is a nuclear hormone receptor for triiodothyronine. It is one of the several receptors for thyroid hormone, and has been shown to mediate the biological activities of thyroid hormone. Knockout studies in mice suggest that the different receptors, while having certain extent of redundancy, may mediate different functions of thyroid hormone. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
5811	
5812	
5813	
5814	
5815	The protein encoded by this gene interacts with poly(A)-binding protein and with the cap-binding complex eIF4A. It is involved in translational initiation and protein biosynthesis. Overexpression of this gene in COS7 cells stimulates translation. Alternative splicing occurs at this locus and three transcript variants encoding three distinct isoforms have been identified.
5816	The metallocarboxypeptidase family of enzymes is divided into 2 subfamilies based on sequence similarities. The pancreatic carboxypeptidase-like and the regulatory B-type carboxypeptidase subfamilies. Carboxypeptidase D has been identified as a regulatory B-type carboxypeptidase. CPD is a homolog of duck gp180, a hepatitis B virus-binding protein. Transcript variants utilizing alternative polyadenylation signals exist for this gene.
5817	
5818	
5819	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
5820	The protein encoded by this gene belongs to the membrane-associated guanylate kinase (MAGUK) family, a class of proteins that functions as molecular scaffolds for the assembly of multiprotein complexes at specialized regions of the plasma membrane. This protein is also a member of the CARD protein family, which is defined by carrying a characteristic caspase-associated recruitment domain (CARD). This protein shares a similar domain structure with CARD11 protein. The CARD domains of both proteins have been shown to specifically interact with BCL10, a protein known to function as a positive regulator of cell apoptosis and NF-kappaB activation. When expressed in cells, this protein activated NF-kappaB and induced the phosphorylation of BCL10. Two alternatively spliced variants of this gene encoding distinct isoforms have been reported.
5821	
5822	This gene is a member of the casein kinase I (CKI) gene family whose members have been implicated in the control of cytoplasmic and nuclear processes, including DNA replication and repair. The encoded protein is highly similar to the mouse and rat CK1 delta homologs. Two transcript variants encoding different isoforms have been found for this gene.
5823	This gene encodes a transmembrane and secreted protein with characteristics of a type 1a transmembrane protein. It is found in a perinuclear Golgi-like pattern and thought to be involved in hematopoietic and/or immune system processes.
5824	This gene encodes an ion channel that is activated by high temperatures above 52 degrees Celsius. The protein may be involved in transduction of high-temperature heat responses in sensory ganglia. It is thought that in other tissues the channel may be activated by stimuli other than heat.
5825	
5826	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. Members of the PTP family share a highly conserved catalytic motif, which is essential for the catalytic activity. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. Epidermal growth factor receptor and the adaptor protein Shc were reported to be substrates of this PTP, which suggested the roles in growth factor mediated cell signaling. Three alternatively spliced variants of this gene, which encode isoforms differing at their extreme C-termini, have been described. The different C-termini are thought to determine the substrate specificity, as well as the cellular localization of the isoforms. Two highly related but distinctly processed pseudogenes that localize to distinct chromosomes have been reported.
5827	The protein encoded by this gene is a type IV membrane protein. It is present in the plasma membrane and intracellular vesicles. It may also be associated with the cytoskeleton. This protein may function in vesicle trafficking, membrane fusion, protein complex assembly and cell motility. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
5828	
5829	The protein encoded by this gene contains a SH2 domain and a CIS homolog domain. The protein thus belongs to the cytokine-induced STAT inhibitor (CIS), also known as suppressor of cytokine signaling (SOCS) or STAT-induced STAT inhibitor (SSI), protein family. CIS family members are known to be cytokine-inducible negative regulators of cytokine signaling. The expression of this gene can be induced by GM-CSF and EPO in hematopoietic cells. A high expression level of this gene was found in factor-independent chronic myelogenous leukemia (CML) and erythroleukemia (HEL) cell lines.
5830	
5831	
5832	
5833	The protein encoded by this gene is a calcium-dependent glycoprotein that is a member of the desmocollin subfamily of the cadherin superfamily. These desmosomal family members, along with the desmogleins, are found primarily in epithelial cells where they constitute the adhesive proteins of the desmosome cell-cell junction and are required for cell adhesion and desmosome formation. The desmosomal family members are arranged in two clusters on chromosome 18, occupying less than 650 kb combined. Alternative splicing results in two transcript variants encoding distinct isoforms.
5834	The protein encoded by this gene has homologs that have been identified in mouse and macaque. The mouse and human proteins have a putative prenyl group binding site (CAAX box) at their C-terminus. A diverse list of proteins are known or strongly presumed to be the target of post-translational modification by the attachment of either a farnesyl or a geranyl-geranyl group to a cysteine residue at the C-terminus. The function of this protein has not been determined.
5835	
5836	Mouse studies suggest that this gene functions during normal adipose tissue development and may play a role in human triglyceride metabolism. This gene represents a candidate gene for human lipodystrophy, characterized by loss of body fat, fatty liver, hypertriglyceridemia, and insulin resistance.
5837	
5838	
5839	
5840	
5841	This gene encodes an integral outer mitochondrial membrane protein that blocks the apoptotic death of some cells such as lymphocytes. Constitutive expression of BCL2, such as in the case of translocation of BCL2 to Ig heavy chain locus, is thought to be the cause of follicular lymphoma. Two transcript variants, produced by alternate splicing, differ in their C-terminal ends.
5842	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a protein containing RNA helicase-DEAD box protein motifs.
5843	
5844	The protein encoded by this gene belongs to the dystrobrevin subfamily and the dystrophin family. This protein is a component of the dystrophin-associated protein complex (DPC). The DPC consists of dystrophin and several integral and peripheral membrane proteins, including dystroglycans, sarcoglycans, syntrophins and alpha- and beta-dystrobrevin. The DPC localizes to the sarcolemma and its disruption is associated with various forms of muscular dystrophy. This protein may be involved in the formation and stability of synapses as well as the clustering of nicotinic acetylcholine receptors. Multiple alternatively spliced transcript variants encoding different isoforms have been identified.
5845	
5846	SIGLECs are members of the immunoglobulin superfamily that are expressed on the cell surface. Most SIGLECs have 1 or more cytoplasmic immune receptor tyrosine-based inhibitory motifs, or ITIMs. SIGLECs are typically expressed on cells of the innate immune system, with the exception of the B-cell expressed SIGLEC6 (MIM 604405).[supplied by OMIM]
5847	
5848	
5849	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
5850	
5851	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S7P family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
5852	
5853	
5854	The protein encoded by this gene is secreted in plasma where it is a component of very low density lipoprotein. This protein activates the enzyme lipoprotein lipase, which hydrolyzes triglycerides and thus provides free fatty acids for cells. Mutations in this gene cause hyperlipoproteinemia type IB, characterized by hypertriglyceridemia, xanthomas, and increased risk of pancreatitis and early atherosclerosis.
5855	
5856	
5857	
5858	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L28E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
5859	Inositol 1,4,5-trisphosphate 3-kinase (ITPK) catalyzes the phosphorylation of Ins(1,4,5)P3 to Ins(1,3,4,5)P4, both of which are modulators of calcium homeostasis. ITPK isoforms include ITPKA (MIM 147521), ITPKB (MIM 147522), and ITPKC, all of which contain a conserved catalytic unit in their C termini, but have unique N-terminal sequences and tissue distributions.[supplied by OMIM]
5860	This gene, CYP2B6, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by phenobarbital. The enzyme is known to metabolize some xenobiotics, such as the anti-cancer drugs cyclophosphamide and ifosphamide. Transcript variants for this gene have been described; however, it has not been resolved whether these transcripts are in fact produced by this gene or by a closely related pseudogene, CYP2B7. Both the gene and the pseudogene are located in the middle of a CYP2A pseudogene found in a large cluster of cytochrome P450 genes from the CYP2A, CYP2B and CYP2F subfamilies on chromosome 19q.
5861	The protein encoded by this gene is a member of the receptor tyrosine kinase subfamily. Although it is similar to other receptor tyrosine kinases, the Axl protein represents a unique structure of the extracellular region that juxtaposes IgL and FNIII repeats. It transduce signals from the extracellular matrix into the cytoplasm by binding growth factors like vitamin K-dependent protein growth-arrest-specific gene 6. It is involved in the stimulation of cell proliferation. This receptor can also mediate cell aggregation by homophilic binding. Axl is a chronic myelogenous leukemia-associated oncogene and also associated with colon cancer and melanoma. It is in close vicinity to the bcl3 oncogene which is at 19q13.1-q13.2. The Axl gene is evolutionarily conserved between vertebrate species. This gene has two different alternatively spliced transcript variants.
5862	
5863	Coagulation factor II (thrombin) receptor-like 3 (F2RL3) is a member of the large family of 7-transmembrane-region receptors that couple to guanosine-nucleotide-binding proteins. F2RL3 is also a member of the protease-activated receptor family. F2RL3 is activated by proteolytic cleavage of its extracellular amino terminus. The new amino terminus functions as a tethered ligand and activates the receptor. F2RL3 is activated by thrombin and trypsin.
5864	This gene encodes a DNA-binding protein that has high specificity for scaffold or matrix attachment region DNA elements (S/MAR DNA). This protein is thought to be involved in attaching the base of chromatin loops to the nuclear matrix but there is conflicting evidence as to whether this protein is a component of chromatin or a nuclear matrix protein. Scaffold attachment factors are a specific subset of nuclear matrix proteins (NMP) that specifically bind to S/MAR. This encoded protein is thought to serve as a molecular base to assemble a @#$%&transcriptosome complex@#$%& in the vicinity of actively transcribed genes. It is involved in the regulation of the heat shock protein 27 transcription and also can act as an estrogen receptor corepressor. This gene is a candidate gene for breast tumorigenesis.
5865	
5866	This gene encodes a mitochondrial matrix protein in the Lon family of ATP-dependent proteases. A similar E. coli protein regulates gene expression by targeting specific regulatory proteins for degradation. This protein binds a specific sequence in the light and heavy chain promoters of the mitochondrial genome which are involved in regulation of DNA replication and transcription.
5867	The protein encoded by this gene is a member of the ELAVL protein family. This encoded protein contains 3 RNA-binding domains and binds cis-acting AU-rich elements. It destabilizes mRNAs and thereby regulates gene expression.
5868	This gene encodes a type II integral membrane protein that is 77% identical to CD209 antigen, a HIV gp120-binding protein. This protein, like CD209, efficiently binds both intercellular adhesion molecule 3 (ICAM3) and HIV-1 gp120, and enhances HIV-1 infection of T cells. This gene is mapped to 19p13.3, in a cluster with the CD209 and CD23/FCER2 genes. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of some variants has not been determined.
5869	Leukocyte Ig-like receptors (LIRs) are a family of immunoreceptors expressed predominantly on monocytes and B cells and at lower levels on dendritic cells and natural killer (NK) cells. Activation of various immune cell types can be prevented by negative signaling receptors through interactions with specific ligands, such as MHC class I molecules by NK cells. All of the LIR inhibitory receptors, members of subfamily B, contain a cytoplasmic immunoreceptor tyrosine-based inhibitory motif (ITIM). Upon MHC class I (or other ligand) engagement and tyrosine phosphorylation of the ITIM, intracellular protein-tyrosine phosphatases such as SHP1 (PTPN6; MIM 176883) are recruited, and an inhibitory signal cascade ensues. Other LIR receptors, members of subfamily A, with short cytoplasmic regions containing no ITIMs and with transmembrane regions containing a charged arginine residue, may initiate stimulatory cascades (see, e.g., MIM 604810). One member of subfamily A (LILRA3; MIM 604818) lacks a transmembrane region and is presumed to be a soluble receptor.[supplied by OMIM]
5870	
5871	AU-specific RNA-binding enoyl-CoA hydratase (AUH) protein binds to the AU-rich element (ARE), a common element found in the 3@#$%& UTR of rapidly decaying mRNA such as c-fos, c-myc and granulocyte/ macrophage colony stimulating factor. ARE elements are involved in directing RNA to rapid degradation and deadenylation.  AUH is also homologous to enol-CoA hydratase, an enzyme involved in fatty acid degradation, and has been shown to have intrinsic hydratase enzymatic activity.  AUH is thus a bifunctional chimera between RNA binding and metabolic enzyme activity.  A possible subcellular localization in the mitochondria has been demonstrated for the mouse homolog of this protein which shares 92% identity with the human protein.  It has been suggested that AUH may have a novel role as a mitochondrial located AU-binding protein.  Human AUH is expressed as a single mRNA species of 1.8 kb, and translated as a 40-kDa precursor protein which is subsequently processed to a 32-kDa mature form.
5872	
5873	
5874	This gene encodes a member of the latrophilin subfamily of G-protein coupled receptors (GPCR). Latrophilins may function in both cell adhesion and signal transduction. In experiments with non-human species, endogenous proteolytic cleavage within a cysteine-rich GPS (G-protein-coupled-receptor proteolysis site) domain resulted in two subunits (a large extracellular N-terminal cell adhesion subunit and a subunit with substantial similarity to the secretin/calcitonin family of GPCRs) being non-covalently bound at the cell membrane. Latrophilin-1 has been shown to recruit the neurotoxin from black widow spider venom, alpha-latrotoxin, to the synapse plasma membrane.
5875	
5876	
5877	
5878	
5879	
5880	Angiopoietins are members of the vascular endothelial growth factor family and the only known growth factors largely specific for vascular endothelium. Angiopoietin-1, angiopoietin-2, and angiopoietin-4 participate in the formation of blood vessels. ANGPTL2 protein is a secreted glycoprotein with homology to the angiopoietins and may exert a function on endothelial cells through autocrine or paracrine action.
5881	
5882	
5883	
5884	This gene encodes a member of the forkhead class of DNA-binding proteins. These hepatocyte nuclear factors are transcriptional activators for liver-specific transcripts such as albumin and transthyretin, and they also interact with chromatin. Similar family members in mice have roles in the regulation of metabolism and in the differentiation of the pancreas and liver. The crystal structure of a similar protein in rat has been resolved.
5885	
5886	The protein encoded by this gene contains a characteristic motif of protein tyrosine phosphatases (PTPs). PTPs regulate activities of phosphoproteins through dephosphorylation. They are signaling molecules involved in the regulation of a wide variety of biological processes. The specific function of this protein has not yet been determined. Alternatively spliced transcript variants encoding distinct isoforms have been identified.
5887	
5888	This gene encodes a protein that is related to epidermal growth factor receptor pathway substrate 8 (EPS8), a substrate for the epidermal growth factor receptor. The function of this protein is unknown. Several alternatively spliced transcript variants encoding different isoforms exist.
5889	
5890	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is one of three V1 domain G subunit proteins. This gene had previous gene symbols of ATP6J and ATP6G1.
5891	The protein encoded by this gene is a dual specificity protein kinase that belongs to the MAP kinase kinase family. This kinase is known to play a critical role in mitogen growth factor signal transduction. It phosphorylates and thus activates MAPK1/ERK2 and MAPK2/ERK3. The activation of this kinase itself is dependent on the Ser/Thr phosphorylation by MAP kinase kinase kinases. The inhibition or degradation of this kinase is found to be involved in the pathogenesis of Yersinia and anthrax.
5892	This gene encodes a member of the sirtuin family of proteins, homologs to the yeast Sir2 protein. Members of the sirtuin family are characterized by a sirtuin core domain and grouped into four classes. The functions of human sirtuins have not yet been determined; however, yeast sirtuin proteins are known to regulate epigenetic gene silencing and suppress recombination of rDNA. Studies suggest that the human sirtuins may function as intracellular regulatory proteins with mono-ADP-ribosyltransferase activity. The protein encoded by this gene is included in class IV of the sirtuin family.
5893	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S9P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
5894	
5895	AKT2 is a putative oncogene encoding a protein belonging to a subfamily of serine/threonine kinases containing SH2-like (Src homology 2-like) domains. Furthermore, AKT2 was shown to be amplified and overexpressed in 2 of 8 ovarian carcinoma cell lines and 2 of 15 primary ovarian tumors. Overexpression of AKT2 contributes to the malignant phenotype of a subset of human ductal pancreatic cancers. AKT2 is a general protein kinase capable of phophorylating several known proteins.
5896	UHRF1 binds to an inverted CCAAT box in the promoter of topoisomerase II-alpha (TOP2A; MIM 126430). TOP2A introduces transient double-stranded breaks in DNA, which are required during the cell cycle for DNA replication, chromosome condensation, and segregation. UHRF1 is likely involved in TOPO2A expression (Hopfner et al., 2000).[supplied by OMIM]
5897	This gene encodes a member of the semaphorin family, a group of proteins characterized by the presence of a conserved semaphorin (sema) domain. Whereas some semaphorins are transmembrane proteins, others are secreted. Semaphorins play a major role in axon guidance. The protein encoded by this gene may be involved in both peripheral and central nervous system development. Alternatively spliced transcript variants encoding different isoforms exist.
5898	This gene encodes a protein that is a member of the S9B family in clan SC of the serine proteases. The protein has been shown to have post-proline dipeptidyl aminopeptidase activity, cleaving Xaa-Pro dipeptides from the N-termini of proteins. Although the activity of this protein is similar to that of dipeptidyl peptidase 4 (DPP4), it does not appear to be membrane bound. In general, dipeptidyl peptidases appear to be involved in the regulation of the activity of their substrates and have been linked to a variety of diseases including type 2 diabetes, obesity and cancer. Several transcript variants of this gene have been described but not fully characterized.
5899	
5900	
5901	
5902	
5903	PALM is a phosphoprotein associated with brain synaptic plasma membranes. It is also abundant in several endocrine tissues and appears to affect cell shape through a C-terminal lipid anchor (Burwinkel et al., 1998; Kutzleb et al., 1998).[supplied by OMIM]
5904	The protein encoded by this gene is an insulin like hormone produced mainly in gonadal tissues in males and females. Studies of the mouse counterpart suggest that this gene may be involved in the development of urogenital tract and female fertility. It may also act as a hormone to regulate growth and differentiation of gubernaculum, and thus mediating intra-abdominal testicular descent. The mutations in this gene may lead to, but not a frequent cause of, cryptorchidism.
5905	The protein encoded by this gene is homologous to the murine protein MCAP, which associates with chromosomes during mitosis, and to the human RING3 protein, a serine/threonine kinase. Each of these proteins contains two bromodomains, a conserved sequence motif which may be involved in chromatin targeting. This gene has been implicated as the chromosome 19 target of translocation t(15;19)(q13;p13.1), which defines an upper respiratory tract carcinoma in young people. Two alternatively spliced transcript variants have been described.
5906	
5907	
5908	
5909	The product of this gene is an epsilon subunit of coatomer protein complex. Coatomer is a cytosolic protein complex that binds to dilysine motifs and reversibly associates with Golgi non-clathrin-coated vesicles. It is required for budding from Golgi membranes, and is essential for the retrograde Golgi-to-ER transport of dilysine-tagged proteins. Coatomer complex consists of at least the alpha, beta, beta@#$%&, gamma, delta, epsilon and zeta subunits. Alternatively spliced transcript variants encoding different isoforms have been identified.
5910	The protein encoded by this gene is a member of the bone morphogenetic protein (BMP) family and the TGF-beta superfamily. This group of proteins is characterized by a polybasic proteolytic processing site which is cleaved to produce a mature protein containing seven conserved cysteine residues. The members of this family are regulators of cell growth and differentiation in both embryonic and adult tissues. Studies in yeast suggest that the protein is involved in aging. This protein is transcribed from both a monocistronic mRNA and a bicistronic mRNA that also encodes growth differentiation factor 1.
5911	This gene, CYP4F3, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum. The enzyme starts the process of inactivating and degrading leukotriene B4, a potent mediator of inflammation. This gene is part of a cluster of cytochrome P450 genes on chromosome 19. Another member of this family, CYP4F8, is approximately 18 kb away.
5912	The protein encoded by this gene is a member of the immunoglobulin superfamily. It was identified by its similarity to LAIR1, an inhibitory receptor present on mononuclear leukocytes. This gene maps to a region of 19q13.4, termed the leukocyte receptor cluster, which contains 29 genes in the immunoglobulin superfamily, including LAIR1. The function of this protein is unknown, although it is thought to be secreted and may help modulate mucosal tolerance. Two transcript variants encoding different isoforms have been found for this gene.
5913	
5914	
5915	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S19P family of ribosomal proteins. It is located in the cytoplasm. This gene has been found to be activated in various tumors, such as insulinomas, esophageal cancers, and colon cancers. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
5916	Follistatin-like 3 is a secreted glycoprotein of the follistatin-module-protein family. It may have a role in leukemogenesis.
5917	
5918	
5919	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin forms a complex with and functions as a regulatory subunit of CDK2, whose activity is required for cell cycle G1/S transition. This protein accumulates at the G1-S phase boundary and is degraded as cells progress through S phase. Overexpression of this gene has been observed in many tumors, which results in chromosome instability, and thus may contribute to tumorigenesis. This protein was found to associate with, and be involved in, the phosphorylation of NPAT protein (nuclear protein mapped to the ATM locus), which participates in cell-cycle regulated histone gene expression and plays a critical role in promoting cell-cycle progression in the absence of pRB. Two alternatively spliced transcript variants of this gene, which encode distinct isoforms, have been described. Two additional splice variants were reported but detailed nucleotide sequence information is not yet available.
5920	
5921	
5922	The protein encoded by this gene is a member of the PP2C family of Ser/Thr protein phosphatases. PP2C family members are known to be negative regulators of cell stress response pathways. This phosphatase has been shown to dephosphorylate cyclin-dependent kinases (CDKs), and thus may be involved in cell cycle control. Overexpression of this phosphatase is reported to cause cell-growth arrest or cell death. Three alternatively spliced transcript variants encoding distinct isoforms have been described.
5923	
5924	
5925	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin and its kinase partner CDK9 were found to be subunits of the transcription elongation factor p-TEFb. The p-TEFb complex containing this cyclin was reported to interact with, and act as a negative regulator of human immunodeficiency virus type 1 (HIV-1) Tat protein. Two alternatively spliced transcript variants, which encode distinct isoforms, have been described.
5926	
5927	This gene is a member of the Period family of genes and is expressed in a circadian pattern in the suprachiasmatic nucleus, the primary circadian pacemaker in the mammalian brain. Genes in this family encode components of the circadian rhythms of locomotor activity, metabolism, and behavior. Circadian expression in the suprachiasmatic nucleus continues in constant darkness, and a shift in the light/dark cycle evokes a proportional shift of gene expression in the suprachiasmatic nucleus. The specific function of this gene is not yet known.
5928	
5929	
5930	
5931	
5932	
5933	This gene encodes a protein which is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes which undergo proteolytic processing at conserved aspartic residues to produce 2 subunits, large and small, that dimerize to form the active enzyme. This protein was shown to cleave and activate caspases 3 and 7, and itself could be processed by caspase 8.  Mutations in this gene are associated with apoptosis defects seen in type II autoimmune lymphoproliferative syndrome. Four alternatively spliced transcript variants encoding different isoforms have been described for this gene.
5934	
5935	
5936	Members of the @#$%&frizzled@#$%& gene family encode 7-transmembrane domain proteins that are receptors for Wnt signaling proteins.  The FZD7 protein contains an N-terminal signal sequence, 10 cysteine residues typical of the cysteine-rich extracellular domain of Fz family members, 7 putative transmembrane domains, and an intracellular C-terminal tail with a PDZ domain-binding motif.  FZD7 gene expression may downregulate APC function and enhance beta-catenin-mediated signals in poorly differentiated human esophageal carcinomas.
5937	
5938	The protein encoded by this gene is a member of the PP2C family of Ser/Thr protein phosphatases. PP2C family members are known to be negative regulators of cell stress response pathways. This phosphatase is found to be responsible for the dephosphorylation of Pre-mRNA splicing factors, which is important for the formation of functional spliceosome. Studies of a similar gene in mice suggested a role of this phosphatase in regulating cell cycle progression. Alternatively spliced transcript variants encoding the same protein have been described.
5939	
5940	
5941	
5942	
5943	This gene encodes one of several forms of glutamic acid decarboxylase, identified as a major autoantigen in insulin-dependent diabetes. The enzyme encoded is responsible for catalyzing the production of gamma-aminobutyric acid from L-glutamic acid. A pathogenic role for this enzyme has been identified in the human pancreas since it has been identified as an autoantigen and an autoreactive T cell target in insulin-dependent diabetes. This gene may also play a role in the stiff man syndrome. Deficiency in this enzyme has been shown to lead to pyridoxine dependency with seizures. Alternative splicing of this gene results in two products, the predominant 67-kD form and a less-frequent 25-kD form.
5944	
5945	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. The enzyme encoded by this gene localizes to the endoplasmic reticulum and metabolizes procarcinogens such as polycyclic aromatic hydrocarbons and 17beta-estradiol. Mutations in this gene have been associated with primary congenital glaucoma; therefore it is thought that the enzyme also metabolizes a signaling molecule involved in eye development, possibly a steroid.
5946	CDC42, a small Rho GTPase, regulates the formation of F-actin-containing structures through its interaction with the downstream effector proteins. The protein encoded by this gene is a member of the Borg family of CDC42 effector proteins. Borg family proteins contain a CRIB (Cdc42/Rac interactive-binding) domain. They bind to, and negatively regulate the function of, CDC42. This protein can interact with CDC42, as well as with the ras homolog gene family, member Q (ARHQ/TC10). Expression of this protein in fibroblasts has been shown to induce pseudopodia formation.
5947	
5948	
5949	
5950	
5951	The protein encoded by this gene is a serine/threonine kinase that regulates cytokinesis, smooth muscle contraction, the formation of actin stress fibers and focal adhesions, and the activation of the c-fos serum response element. This protein, which is an isozyme of ROCK1 is a target for the small GTPase Rho.
5952	This gene encodes a member of the poly(A) polymerase family which catalyzes template-independent extension of the 3@#$%& end of a DNA/RNA strand. This enzyme shares 60% identity to the well characterized poly(A) polymerase II (PAPII) at the amino acid level. These two enzymes have similar organization of structural and functional domains. This enzyme is exclusively localized in the nucleus and exhibits both nonspecific and CPSF (cleavage and polyadenylation specificity factor)/AAUAAA-dependent polyadenylation activity. This gene is located on chromosome 2 in contrast to the PAPII gene, which is located on chromosome 14.
5953	The protein encoded by this gene contains a RING finger domain, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions. This gene is expressed in many cancer cell lines. Its expression in normal tissues, however, was found to be restricted to prostate. This gene was also found to be differentially expressed in androgen-dependent versus androgen-independent prostate cancer cells.
5954	
5955	
5956	
5957	
5958	
5959	
5960	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes the largest non-ATPase subunit of the 19S regulator lid, which is responsible for substrate recognition and binding.
5961	
5962	This gene encodes the fourth largest subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. In yeast, this polymerase subunit is associated with the polymerase under suboptimal growth conditions and may have a stress protective role. A sequence for a ribosomal pseudogene is contained within the 3@#$%& untranslated region of the transcript from this gene.
5963	
5964	
5965	
5966	
5967	This gene encodes a transcription factor that is a member of the leucine zipper family of DNA binding proteins. This protein binds to the cAMP-responsive element (CRE), an octameric palindrome. The protein forms a homodimer or heterodimer with c-Jun and stimulates CRE-dependent transcription. The protein is also a histone acetyltransferase (HAT) that specifically acetylates histones H2B and H4 in vitro; thus it may represent a class of sequence-specific factors that activate transcription by direct effects on chromatin components. Additional transcript variants have been identified but their biological validity has not been determined.
5968	
5969	Cell surface heparan sulfate proteoglycans are composed of a membrane-associated protein core substituted with a variable number of heparan sulfate chains. Members of the glypican-related integral membrane proteoglycan family (GRIPS) contain a core protein anchored to the cytoplasmic membrane via a glycosyl phosphatidylinositol linkage.  These proteins may play a role in the control of cell division and growth regulation.
5970	
5971	
5972	Granulysin is a protein present in cytotoxic granules of cytotoxic T lymphocytes and natural killer cells. Granulysin is a member of the saposin-like protein (SAPLIP) family and is located in the cytotoxic granules of T cells, which are released upon antigen stimulation. Granulysin has antimicrobial activity against M. tuberculosis and other organisms. Granulysin is alternatively spliced, resulting in the NKG5 and 519 transcripts.
5973	
5974	This gene is a member of the visinin/recoverin subfamily of neuronal calcium sensor proteins. The encoded protein is strongly expressed in granule cells of the cerebellum where it associates with membranes in a calcium-dependent manner and modulates intracellular signaling pathways of the central nervous system by directly or indirectly regulating the activity of adenylyl cyclase. Alternatively spliced transcript variants have been observed, but their full-length nature has not been determined.
5975	
5976	
5977	
5978	
5979	
5980	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to class II of the histone deacetylase/acuc/apha family. It possesses histone deacetylase activity and represses transcription when tethered to a promoter. This protein does not bind DNA directly, but through transcription factors MEF2C and MEF2D. It seems to interact in a multiprotein complex with RbAp48 and HDAC3.
5981	
5982	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S5P family. Pseudogenes corresponding to this gene are found on chromosomes 4q, 5q, and 18q.
5983	
5984	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases that catalyze many reactions involved in drug metabolism and the synthesis of cholesterol, steroids and other lipids. The enzyme encoded by this gene is involved in the specific inactivation of all-trans-retinoic acid to hydroxylated forms, such as 4-oxo-, 4-OH-, and 18-OH-all-trans-retinoic acid.
5985	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases that catalyze many reactions involved in drug metabolism and the synthesis of cholesterol, steroids and other lipids. The enzyme encoded by this gene is involved in the specific inactivation of all-trans-retinoic acid to hydroxylated forms, such as 4-oxo-, 4-OH-, and 18-OH-all-trans-retinoic acid.
5986	
5987	
5988	The ITGA6 protein product is the integrin alpha chain alpha 6. Integrins are integral cell-surface proteins composed of an alpha chain and a beta chain.  A given chain may combine with multiple partners resulting in different integrins.  For example, alpha 6 may combine with beta 4 in the integrin referred to as TSP180, or with beta 1 in the integrin VLA-6. Integrins are known to participate in cell adhesion as well as cell-surface mediated signalling.
5989	
5990	Members of the CRMP family, such as DPYSL5, are believed to play a role in growth cone guidance during neural development.[supplied by OMIM]
5991	This gene encodes a cytoplasmic enzyme which phosphorylates diacylglycerol to produce phosphatidic acid. Diacylglycerol and phosphatidic acid are two lipids which act as second messengers in signaling cascades. Their cellular concentrations are regulated by this enzyme, and so it is thought to play an important role in cellular signal transduction. Alternative splicing results in two transcript variants encoding different isoforms.
5992	This gene was identified based on its similarity with the mouse counterpart. Studies of the mouse counterpart suggest that the expression of this gene may be regulated during oocyte maturation and preimplantation following zygotic gene activation.  Alternatively spliced transcript variants encoding distinct isoforms have been observed.
5993	
5994	
5995	
5996	
5997	
5998	In the de novo synthesis of purine nucleotides, IMP is the branch point metabolite at which point the pathway diverges to the synthesis of either guanine or adenine nucleotides. In the guanine nucleotide pathway, there are 2 enzymes involved in converting IMP to GMP, namely IMP dehydrogenase (IMPD1), which catalyzes the oxidation of IMP to XMP, and GMP synthetase, which catalyzes the amination of XMP to GMP.
5999	
6000	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes a protein similar to polypeptides 1 and 2 of subunit VIIa in the C-terminal region, and also highly similar to the mouse Sig81 protein sequence. This gene is expressed in all tissues, and upregulated in a breast cancer cell line after estrogen treatment. It is possible that this gene represents a regulatory subunit of COX and mediates the higher level of energy production in target cells by estrogen.
6001	The protein encoded by this gene is an adaptor protein which contains five LIM domains, or double zinc fingers. The protein is likely involved in integrin signaling through its LIM domain-mediated interaction with integrin-linked kinase, found in focal adhesion plaques. It is also thought to act as a bridge linking integrin-linked kinase to NCK adaptor protein 2, which is involved in growth factor receptor kinase signaling pathways. Its localization to the periphery of spreading cells also suggests that this protein may play a role in integrin-mediated cell adhesion or spreading.
6002	
6003	The protein encoded by this gene is a GTPase-activating protein (GAP) which associates with the Golgi apparatus and which interacts with ADP-ribosylation factor 1 (ARF1). The encoded protein promotes hydrolysis of ARF1-bound GTP and is required for the dissociation of coat proteins from Golgi-derived membranes and vesicles. Dissociation of the coat proteins is required for the fusion of these vesicles with target compartments. The activity of this protein is stimulated by phosphoinosides and inhibited by phosphatidylcholine. Two transcript variants encoding different isoforms have been found for this gene.
6004	
6005	The M channel is a slowly activating and deactivating potassium channel that plays a critical role in the regulation of neuronal excitability.  The M channel is formed by the association of the protein encoded by this gene and a related protein encoded by the KCNQ3 gene, both integral membrane proteins. M channel currents are inhibited by M1 muscarinic acetylcholine receptors and activated by retigabine, a novel anti-convulsant drug. Defects in this gene are a cause of benign familial neonatal convulsions type 1 (BFNC), also known as epilepsy, benign neonatal type 1 (EBN1). At least five transcript variants encoding five different isoforms have been found for this gene.
6006	The product of this gene is a membrane-associated guanylate kinase found at the postsynaptic density in neuronal cells. It is a signaling molecule that can interact with potassium channels and receptors, as well as other signaling molecules. The protein encoded by this gene can interact with PSD-95 through its guanylate kinase domain and may be involved in clustering PSD-95 in the postsynaptic density region. The encoded protein is one of at least four similar proteins that have been found. Alternatively spliced transcript variants encoding different isoforms have been found for this gene.
6007	Myosin, a structural component of muscle, consists of two heavy chains and four light chains. The protein encoded by this gene is a myosin light chain that may regulate muscle contraction by modulating the ATPase activity of myosin heads. The encoded protein binds calcium and is activated by myosin light chain kinase. Two transcript variants encoding different isoforms have been found for this gene.
6008	This gene encodes the homolog of a protein identified in Drosophila as a suppressor of malignant transformation of neuroblasts and ganglion-mother cells in the optic centers of the brain. This gene product is localized to condensed chromosomes in mitotic cells. Overexpression of this gene in a glioma cell line results in improper nuclear segregation and cytokinesis producing multinucleated cells. Two transcripts have been identified for this gene.
6009	
6010	
6011	This gene encodes an elastase-specific inhibitor, which contains a WAP-type four-disulfide core (WFDC) domain, and is thus a member of the WFDC domain family. Most WFDC gene members are localized to chromosome 20q12-q13 in two clusters: centromeric and telomeric. This gene belongs to the centromeric cluster.
6012	Adenosine deaminase catalyzes the hydrolysis of adenosine to inosine. ADA deficiency causes one form of severe combined immunodeficiency disease (SCID), in which there is dysfunction of both B and T lymphocytes with impaired cellular immunity and decreased production of immunoglobulins.
6013	
6014	
6015	
6016	This gene is a member of the septin family of GTPases. Members of this family are required for cytokinesis. This gene encodes a protein associated with the tau-based paired helical filament core, and may contribute to the formation of neurofibrillary tangles in Alzheimer@#$%&s disease.
6017	
6018	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily G. This gene is abundantly expressed in skeletal muscle. Alternative splicing results in at least two transcript variants encoding distinct isoforms.
6019	
6020	
6021	This gene is a main control point for the regulation of gluconeogenesis. The cytosolic enzyme encoded by this gene, along with GTP, catalyzes the formation of phosphoenolpyruvate from oxaloacetate, with the release of carbon dioxide and GDP. The expression of this gene can be regulated by insulin, glucocorticoids, glucagon, cAMP, and diet. A mitochondrial isozyme of the encoded protein also has been characterized.
6022	The protein encoded by this gene is an integral membrane protein and tetrodotoxin-resistant voltage-gated sodium channel subunit. The encoded protein is found primarily in cardiac muscle and is responsible for the initial upstroke of the action potential in an electrocardiogram. Defects in this gene are a cause of long QT syndrome type 3 (LQT3), an autosomal dominant cardiac disease. Alternative splicing results in two transcript variants encoding separate isoforms which differ by a single amino acid. Mutation nomenclature has been assigned with respect to the longer isoform.
6023	
6024	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel consists of three main subunits (a, b, c). This gene encodes the epsilon subunit of the catalytic core. Two pseudogenes of this gene are located on chromosomes 4 and 13.
6025	The synaptonemal complex is a proteinaceous structure that links homologous chromosomes during the prophase of meiosis. The protein encoded by this gene is a major component of the synaptonemal complex and may bind DNA at scaffold attachment regions. The encoded protein requires synaptonemal complex protein 3, but not 1, for inclusion in the synaptonemal complex.
6026	
6027	The protein encoded by this gene is a putative protein kinase that is induced by the transcription factor NF-kappaB. The encoded protein is a negative regulator of NF-kappaB and can also sensitize cells to TNF- and TRAIL-induced apoptosis. In addition, this protein can negatively regulate the cell survival serine-threonine kinase AKT1.
6028	The protein encoded by this gene is a member of the signal-regulatory-protein (SIRP) family, and also belongs to the immunoglobulin superfamily. SIRP family members are receptor-type transmembrane glycoproteins known to be involved in the negative regulation of receptor tyrosine kinase-coupled signaling processes. This protein can be phosphorylated by tyrosine kinases. The phospho-tyrosine residues of this PTP have been shown to recruit SH2 domain containing tyrosine phosphatases (PTP), and serve as substrates of PTPs. This protein was found to participate in signal transduction mediated by various growth factor receptors. CD47 has been demonstrated to be a ligand for this receptor protein. This gene and its product share very high similarity with several other members of the SIRP family. These related genes are located in close proximity to each other on chromosome 20p13.
6029	
6030	This gene encodes a disintegrin and metalloprotease (ADAM) domain 33, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. This protein is a type I transmembrane protein implicated in asthma and bronchial hyperresponsiveness. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
6031	The protein encoded by this gene is similar to an uncharacterized protein from C. elegans. Although the function of the encoded protein is unknown, it shares weak sequence similarity to a region of S. cerevisiae translation initiation factor subunit Gcd10p.
6032	Synaptic vesicle membrane docking and fusion is mediated by SNAREs (soluble N-ethylmaleimide-sensitive factor attachment protein receptors) located on the vesicle membrane (v-SNAREs) and the target membrane (t-SNAREs). The assembled v-SNARE/t-SNARE complex consists of a bundle of four helices, one of which is supplied by v-SNARE and the other three by t-SNARE. For t-SNAREs on the plasma membrane, the protein syntaxin supplies one helix and the protein encoded by this gene contributes the other two. Therefore, this gene product is a presynaptic plasma membrane protein involved in the regulation of neurotransmitter release. Two alternative transcript variants encoding different protein isoforms have been described for this gene.
6033	The jagged 1 protein encoded by JAG1 is the human homolog of the Drosophilia jagged protein.  Human jagged 1 is the ligand for the receptor notch 1, the latter a human homolog of the Drosophilia jagged receptor notch.  Mutations that alter the jagged 1 protein cause Alagille syndrome.  Jagged 1 signalling through notch 1 has also been shown to play a role in hematopoiesis.
6034	
6035	This gene encodes a type I membrane protein. This protein acts as a receptor for complement protein C1q, mannose-binding lectin, and pulmonary surfactant protein A. This protein is a functional receptor involved in ligand-mediated enhancement of phagocytosis.
6036	
6037	The protein encoded by this gene is a glycogen phosphorylase found predominantly in the brain. The encoded protein forms homodimers which can associate into homotetramers, the enzymatically active form of glycogen phosphorylase. The activity of this enzyme is positively regulated by AMP and negatively regulated by ATP, ADP, and glucose-6-phosphate. This enzyme catalyzes the rate-determining step in glycogen degradation.
6038	
6039	This gene encodes a cytosolic enzyme that catalyzes the activation of acetate for use in lipid synthesis and energy generation. The protein acts as a monomer and produces acetyl-CoA from acetate in a reaction that requires ATP. Expression of this gene is regulated by sterol regulatory element-binding proteins, transcription factors that activate genes required for the synthesis of cholesterol and unsaturated fatty acids. Two transcript variants encoding different isoforms have been found for this gene.
6040	
6041	Calbindin 2 (calretinin), closely related to calbindin 1, is an intracellular calcium-binding protein belonging to the troponin C superfamily. Calbindin 1 is known to be involved in the vitamin-D-dependent calcium absorption through intestinal and renal epithelia, while the function of neuronal calbindin 1 and calbindin 2 is poorly understood. The sequence of the calbindin 2 cDNA reveals an open reading frame of 271 codons coding for a protein of 31,520 Da, and shares 58% identical residues with human calbindin 1. Calbindin 2 contains five presumably active and one presumably inactive calcium-binding domains. Comparison with the partial sequences available for chick and guinea pig calbindin 2 reveals that the protein is highly conserved in evolution. The calbindin 2 message was detected in the brain, while absent from heart muscle, kidney, liver, lung, spleen, stomach and thyroid gland. There are two additional forms of alternatively spliced calbindin 2 mRNAs encoding C-terminally truncated proteins. Exon 7 can splice to exon 9, resulting in a frame shift and a translational stop at the second codon of exon 9, and encoding calretinin-20k. Exon 7 can also splice to exon 10, resulting in a frame shift and a translational stop at codon 15 of exon 10, and encoding calretinin-22k. The truncated proteins are able to bind calcium.
6042	Glutathione is important for a variety of biological functions, including protection of cells from oxidative damage by free radicals, detoxification of xenobiotics, and membrane transport. The protein encoded by this gene functions as a homodimer to catalyze the second step of glutathione biosynthesis, which is the ATP-dependent conversion of gamma-L-glutamyl-L-cysteine to glutathione. Defects in this gene are a cause of glutathione synthetase deficiency.
6043	
6044	The SCAN domain is a highly conserved, leucine-rich motif of approximately 60 aa originally found within a subfamily of zinc finger proteins. This gene belongs to a family of genes that encode an isolated SCAN domain, but no zinc finger motif. Functional studies have established that the SCAN box is a protein interaction domain that mediates both hetero- and homoprotein associations, and maybe involved in regulation of transcriptional activity. Two transcript variants with different 5@#$%& UTRs, but encoding the same protein, have been described for this gene.
6045	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Most members contain an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain, although some members contain only the sterol-binding domain. This encoded protein contains only the sterol-binding domain. In vitro studies have shown that the encoded protein can bind strongly to phosphatic acid and weakly to phosphatidylinositol 3-phosphate, but cannot bind to 25-hydroxycholesterol. The protein associates with the Golgi apparatus. Transcript variants encoding different isoforms have been described.
6046	Nuclear receptor interacting protein 1 (NRIP1) is a nuclear protein that specifically interacts with the hormone-dependent activation domain AF2 of nuclear receptors. Also known as RIP140, this protein modulates transcriptional activity of the estrogen receptor.
6047	
6048	
6049	This gene encodes a disintegrin and metalloproteinase with thrombospondin motifs-1 (ADAMTS1), which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The protein encoded by this gene contains 2 disintegrin loops and 3 C-terminal TS motifs and has anti-angiogenic activity. The expression of this gene may be associated with various inflammatory processes as well as development of cancer cachexia. This gene is likely to be necessary for normal growth, fertility, and organ morphology and function.
6050	This gene encodes a transcription factor that belongs to the cap@#$%&n@#$%&collar type of basic region leucine zipper factor family (CNC-bZip). The encoded protein contains broad complex, tramtrack, bric-a-brac/poxvirus and zinc finger (BTB/POZ) domains, which is atypical of CNC-bZip family members. These BTB/POZ domains facilitate protein-protein interactions and formation of homo- and/or hetero-oligomers. When this encoded protein forms a heterodimer with MafK, it functions as a repressor of Maf recognition element (MARE) and transcription is repressed. Multiple alternatively spliced transcript variants have been identified for this gene. Some exons of this gene overlap with some exons from the C21orf41 gene, which is transcribed in an opposite orientation to this gene but does not seem to encode a protein.
6051	Similarity to a transcriptional repressor suggests that this gene@#$%&s protein product is involved in the regulation of transcription. Alternative splicing of this gene results in four transcript variants encoding different isoforms. Additional transcript variants have been described, but their full length sequences have not been determined.
6052	This gene encodes a basic helix-loop-helix transcription factor which is expressed in oligodendroglial tumors of the brain. The gene is involved in a chromosomal translocation t(14;21)(q11.2;q22) associated with T-cell acute lymphoblastic leukemia. Its chromosomal location is within a region of chromosome 21 which has been suggested to play a role in learning deficits associated with Down syndrome.
6053	The protein encoded by this gene is a type I membrane protein that forms one of the two chains of a receptor for interferons alpha and beta. Binding and activation of the receptor stimulates Janus protein kinases, which in turn phosphorylate several proteins, including STAT1 and STAT2. Multiple transcript variants encoding at least two different isoforms have been found for this gene.
6054	
6055	SIM1 and SIM2 genes are Drosophila single-minded (sim) gene homologs. The Drosophila sim gene encodes a transcription factor that is a master regulator of fruit fly neurogenesis. SIM2 maps within the so-called Down syndrome chromosomal region.  Based on the mapping position, its potential function as transcriptional repressor and similarity to Drosophila sim, it is proposed that SIM2 may contribute to some specific Down syndrome phenotypes
6056	
6057	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein contains 2 bromodomains and multiple WD repeats, and the function of this protein is not known. This gene is located within the Down syndrome region-2 on chromosome 21. Alternative splicing of this gene generates 3 transcript variants diverging at the 3@#$%& ends.
6058	
6059	
6060	The activation of gene transcription is a multistep process that is triggered by factors that recognize transcriptional enhancer sites in DNA. These factors work with co-activators to direct transcriptional initiation by the RNA polymerase II apparatus. The protein encoded by this gene is a subunit of the CRSP (cofactor required for SP1 activation) complex, which, along with TFIID, is required for efficient activation by SP1. This protein is also a component of other multisubunit complexes e.g. thyroid hormone receptor-(TR-) associated proteins which interact with TR and facilitate TR function on DNA templates in conjunction with initiation factors and cofactors. This protein contains a bipartite nuclear localization signal. This gene is known to escape chromosome X-inactivation.
6061	This gene encodes the enzyme responsible for pre-mRNA editing of the glutamate receptor subunit B by site-specific deamination of adenosines.  Alternative splicing of this gene results in several transcript variants, three which have been characterized by the presence or absence of an ALU cassette insert and a short or long C-terminal region.
6062	This gene encodes the alpha chain of type XVIII collagen. This collagen is one of the multiplexins, extracellular matrix proteins that contain multiple triple-helix domains (collagenous domains) interrupted by non-collagenous domains. The proteolytically produced C-terminal fragment of type XVIII collagen is endostatin, a potent antiangiogenic protein. Mutations in this gene are associated with Knobloch syndrome. The main features of this syndrome involve retinal abnormalities so type XVIII collagen may play an important role in retinal structure and in neural tube closure. Three transcripts have been identified for this gene.
6063	
6064	This gene encodes one of the three alpha chains of type VI collagen, a beaded filament collagen found in most connective tissues. The product of this gene contains several domains similar to von Willebrand Factor type A domains. These domains have been shown to bind extracellular matrix proteins, an interaction that explains the importance of this collagen in organizing matrix components. Mutations in this gene are associated with Bethlem myopathy and Ullrich scleroatonic muscular dystrophy. Three transcript variants have been identified for this gene.
6065	
6066	
6067	The protein encoded by this gene catalyzes the conversion of (S)-2,3 oxidosqualene to lanosterol. The encoded protein is a member of the terpene cyclase/mutase family and catalyzes the first step in the biosynthesis of cholesterol, steroid hormones, and vitamin D. Two transcript variants encoding the same protein have been found for this gene.
6068	The protein encoded by this gene is a serine/threonine protein kinase that interacts with protein kinase C-delta. The encoded protein can also activate NFkappaB and is required for keratinocyte differentiation. This kinase undergoes autophosphorylation.
6069	This gene encodes a protein that belongs to the serine protease family. The encoded protein contains a serine protease domain, a transmembrane domain, a LDL receptor-like domain, and a scavenger receptor cysteine-rich domain. Serine proteases are known to be involved in a variety of biological processes, whose malfunction often leads to human diseases and disorders. This gene was identified by its association with both congenital and childhood onset autosomal recessive deafness. This gene is expressed in fetal cochlea and many other tissues, and is thought to be involved in the development and maintenance of the inner ear or the contents of the perilymph and endolymph. This gene was also identified as a tumor associated gene that is overexpressed in ovarian tumors. Four alternatively spliced variants have been described, two of which encode identical products.
6070	The protein encoded by this gene is one of at least forty-one subunits that make up the NADH-ubiquinone oxidoreductase complex. This complex is part of the mitochondrial respiratory chain and serves to catalyze the rotenone-sensitive oxidation of NADH and the reduction of ubiquinone. The encoded protein is one of three proteins found in the flavoprotein fraction of the complex. The specific function of the encoded protein is unknown. Two transcript variants encoding different isoforms have been found for this gene.
6071	
6072	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This gene is excluded as a candidate for a form of nonsyndromic deafness (DFNB10), but is still a candidate for other disorders mapped to 21q22.3 as well as for the development of Down syndrome phenotypes. Two transcript variants encoding the same protein have been found for this gene.
6073	
6074	
6075	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. This gene product shares 98% amino acid sequence identity with the mouse ortholog. DiGeorge syndrome (DGS)/velocardiofacial syndrome (VCFS), a common congenital disorder characterized by neural-crest-related developmental defects, has been associated with deletions of chromosome 22q11.2, where this gene has been mapped. Studies using mouse models of DiGeorge syndrome suggest a major role for this gene in the molecular etiology of DGS/VCFS. Several alternatively spliced transcript variants encoding different isoforms have been described for this gene.
6076	
6077	The protein encoded by this gene is a member of a G protein subfamily that mediates signal transduction in pertussis toxin-insensitive systms. This encoded protein may play a role in maintaining the ionic balance of perilymphatic and endolymphatic cochlear fluids.
6078	
6079	This gene encodes a structural protein that is found exclusively in contractile smooth muscle cells. It associates with stress fibers and constitutes part of the cytoskeleton. This gene is localized to chromosome 22q12.3, distal to the TUPLE1 locus and outside the DiGeorge syndrome deletion. Alternative splicing of this gene results in three transcript variants.
6080	
6081	
6082	There are approximately 40 known eukaryotic LIM proteins, so named for the LIM domains they contain. LIM domains are highly  conserved cysteine-rich structures containing 2 zinc fingers.   Although zinc fingers usually function by binding to DNA or RNA, the LIM motif probably mediates protein-protein interactions. LIM kinase-1 and LIM kinase-2 belong to a small subfamily with a unique combination of 2 N-terminal LIM motifs and a C-terminal protein kinase domain. The LIMK2 protein is phosphorylated and activated by ROCK, a downstream effector of Rho, and LIM kinase 2, in turn, phosphorylates cofilin, inhibiting its actin-depolymerizing activity. It is thought that this pathway contributes to Rho-induced reorganization of the actin cytoskeleton. Two alternative splice variants of LIMK2 that utilize alternative promoters have been identified.
6083	The protein encoded by this gene contains an A-T hook DNA binding motif which usually binds to other DNA binding structures to play an important role in chromatin modeling and transcription regulation. Its Poz domain is thought to function as a site for protein-protein interaction and is required for transcriptional repression, and the zinc-fingers comprise the DNA binding domain. Since the encoded protein has typical features of a transcription factor, it is postulated to be a repressor of gene expression. In small round cell sarcoma, this gene is fused to EWS by a small inversion of 22q, then the hybrid is thought to be translocated (t(1;22)(p36.1;q12). The rearrangement of chromosome 22 involves intron 8 of EWS and exon 1 of this gene creating a chimeric sequence containing the transactivation domain of EWS fused to zinc finger domain of this protein. This is a distinct example of an intra-chromosomal rearrangement of chromosome 22. There are four alternatively spliced transcript variants described for this gene.
6084	
6085	The protein encoded by this gene is a type II transmembrane serine proteinase that is found attached to the cell surface. The encoded protein may be involved in matrix remodeling processes in the liver.
6086	Retention of resident soluble proteins in the lumen of the endoplasmic reticulum (ER) is achieved in both yeast and animal cells by their continual retrieval from the cis-Golgi, or a pre-Golgi compartment.  Sorting of these proteins is dependent on a C-terminal tetrapeptide signal, usually lys-asp-glu-leu (KDEL) in animal cells, and his-asp-glu-leu (HDEL) in S. cerevisiae.  This process is mediated by a receptor that recognizes, and binds the tetrapeptide-containing protein, and returns it to the ER.  In yeast, the sorting receptor encoded by a single gene, ERD2, is a seven-transmembrane protein.  Unlike yeast, several human homologs of the ERD2 gene, constituting the KDEL receptor gene family, have been described.  KDELR3 was the third member of the family to be identified, and it encodes a protein highly homologous to KDELR1 and KDELR2 proteins.  Two transcript variants of KDELR3 that arise by alternative splicing, and encode different isoforms of KDELR3 receptor, have been described.
6087	This gene encodes an integral membrane protein associated with presynaptic vesicles in neuronal cells. The exact function of this protein is unclear, but studies of a similar murine protein suggest that it functions in synaptic plasticity without being required for synaptic transmission. The gene product belongs to the synaptogyrin gene family. Three alternatively spliced variants encoding three different isoforms have been identified.
6088	The protein encoded by this gene, a member of the G protein-coupled receptor family 1, is an integral plasma membrane protein which binds melanin-concentrating hormone. The encoded protein can inhibit cAMP accumulation and stimulate intracellular calcium flux, and is probably involved in the neuronal regulation of food consumption. Although structurally similar to somatostatin receptors, this protein does not seem to bind somatostatin.
6089	
6090	This gene encodes an evolutionarily conserved protein that interacts with cullins. The protein plays a unique role in the ubiquitination reaction by heterodimerizing with cullin-1 to catalyze ubiquitin polymerization. It also may be involved in the regulation of protein turn-over.
6091	
6092	This gene belongs to the septin family of GTPases. Members of this family are required for cytokinesis. Expression is upregulated by retinoic acid in a human teratocarcinoma cell line. The specific function of this gene has not been determined. This gene encodes three transcript variants resulting in three distinct isoforms.
6093	Phosphomannomutase catalyzes the conversion between D-mannose 6-phosphate and D-mannose 1-phosphate which is a substrate for GDP-mannose synthesis. GDP-mannose is used for synthesis of dolichol-phosphate-mannose, which is essential for N-linked glycosylation and thus the secretion of several glycoproteins as well as for the synthesis of glycosyl-phosphatidyl-inositol (GPI) anchored proteins.
6094	
6095	
6096	
6097	The nuclear pore complex is a massive structure that extends across the nuclear envelope, forming a gateway that regulates the flow of macromolecules between the nucleus and the cytoplasm. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells. The protein encoded by this gene is a member of the FG-repeat containing nucleoporins that functions as a soluble cofactor in importin-alpha:beta-mediated nuclear protein import. Pseudogenes of this gene are found on chromosomes 5, 6, and 14. Three transcript variants encoding two different isoforms have been found for this gene.
6098	Fibulin 1 is a secreted glycoprotein that becomes incorporated into a fibrillar extracellular matrix. Calcium-binding is apparently required to mediate its binding to laminin and nidogen. It mediates platelet adhesion via binding fibrinogen. Four splice variants which differ in the 3@#$%& end have been identified. Each variant encodes a different isoform, but no functional distinctions have been identified among the four variants.
6099	
6100	
6101	The function of this gene product is unknown; however, homology to other proteins suggests that it may be an integral membrane transporter. Mutations in this gene have been associated with megalencephalic leukoencephalopathy with subcortical cysts, an autosomal recessive neurological disorder. Two transcript variants encoding the same protein but differing in the 5@#$%& UTR have been noted for this gene.
6102	
6103	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth, and invasion. This growth factor lacks the N-terminal signal sequence present in most of the FGF family members, but it contains clusters of basic residues that have been demonstrated to act as a nuclear localization signal. When transfected into mammalian cells, this protein accumulated in the nucleus, but was not secreted. The specific function of this gene has not yet been determined. Two alternatively spliced transcript variants encoding distinct isoforms have been reported.
6104	
6105	
6106	
6107	
6108	
6109	
6110	
6111	
6112	
6113	Angiotensin II is a potent vasopressor hormone and a primary regulator of aldosterone secretion. It is an important effector controlling blood pressure and volume in the cardiovascular system. It acts through at least two types of receptors. This gene encodes the type 1 receptor which is thought to mediate the major cardiovascular effects of angiotensin II. AGTR1 may play role in the generation of reperfusion arrhythmias following restoration of blood flow to ischemic or infarcted myocardium. It was previously thought that a related gene AGTR1B exists; however, it is now believed that there is only one AGTR1 gene. The gene expresses at least four transcript variants; additional variants have been described but their full length nature has not been determined. Exon 5 contains the entire coding sequence and is present in all transcript variants.
6114	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E1 ubiquitin-activating enzyme family. The encoded enzyme associates with AppBp1, an amyloid beta precursor protein binding protein, to form a heterodimer, and then the enzyme complex activates NEDD8, a ubiquitin-like protein, which regulates cell division, signaling and embryogenesis. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
6115	
6116	
6117	
6118	The protein encoded by this gene is a membrane-bound homodimeric enzyme that is localized to the cisternae of the rough endoplasmic reticulum. The enzyme (cofactors iron and ascorbate) catalyzes the hydroxylation of lysyl residues in collagen-like peptides. The resultant hydroxylysyl groups are attachment sites for carbohydrates in collagen and thus are critical for the stability of intermolecular crosslinks. Some patients with Ehlers-Danlos syndrome type VIB have deficiencies in lysyl hydroxylase activity. Mutations in the coding region of this gene are associated with Bruck syndrome. Alternative splicing results in multiple transcript variants encoding different isoforms.
6119	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It encodes a protein which shows 98%, 98% and 87% amino acid identity to the mouse, rat and the xenopus Wnt5A protein, respectively. The experiments performed in Xenopus laevis embryos identified that human frizzled-5 (hFz5) is the receptor for the Wnt5A ligand and the Wnt5A/hFz5 signaling mediates axis induction.
6120	The product of this gene belongs to the family of G-protein coupled receptors. This family has several receptor subtypes with different pharmacological selectivity, which overlaps in some cases, for various adenosine and uridine nucleotides. This receptor is activated by ADP. Two transcript variants encoding the same protein have been identified for this gene.
6121	
6122	
6123	
6124	The protein encoded by this gene is a member of the SEC22 family of vesicle trafficking proteins. It is localized at the endoplasmic reticulum and it is thought to play a role in the early stages of the ER-Golgi protein trafficking. There are two alternatively spliced transcript variants encoding different isoforms described for this gene.
6125	This gene encodes a membrane-anchored protein with a hydrophobic amino terminal domain and a cyclophilin-like PPIase domain. It is present on the surface of natural killer cells and facilitates their binding to targets. Alternate transcriptional splice variants, encoding different isoforms, have been characterized and expression of the splice variants is regulated by IL2 activation of the cells.
6126	
6127	
6128	The protein encoded by this gene is a cell-surface glycoprotein found on melanoma cells. The protein shares sequence similarity and iron-binding properties with members of the transferrin superfamily. The importance of the iron binding function has not yet been identified. This gene resides in the same region of chromosome 3 as members of the transferrin superfamily. Alternative splicing results in two transcript variants.
6129	Mutant alleles at the BCHE locus are responsible for suxamethonium sensitivity. Homozygous persons sustain prolonged apnea after administration of the muscle relaxant suxamethonium in connection with surgical anesthesia. The activity of pseudocholinesterase in the serum is low and its substrate behavior is atypical. In the absence of the relaxant, the homozygote is at no known disadvantage.
6130	
6131	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E1 ubiquitin-activating enzyme family. The encoded enzyme is a retinoid target that triggers promyelocytic leukemia (PML)/retinoic acid receptor alpha (RARalpha) degradation and apoptosis in acute promyelocytic leukemia.
6132	
6133	This gene is a member of the ankyrin-repeat family and is induced by lipopolysaccharide (LPS). The C-terminal portion of the encoded product which contains the ankyrin repeats, shares high sequence similarity with the I kappa B family of proteins. The latter are known to play a role in inflammatory responses to LPS by their interaction with NF-B proteins through ankyrin-repeat domains. Studies in mouse indicate that this gene product is one of the nuclear I kappa B proteins and an activator of IL-6 production. Two transcript variants encoding different isoforms have been found for this gene.
6134	The DAZ (Deleted in AZoospermia) gene family encodes potential RNA binding proteins that are expressed in prenatal and postnatal germ cells of males and females. The protein encoded by this gene is localized to the nucleus and cytoplasm of fetal germ cells and to the cytoplasm of developing oocytes. In the testis, this protein is localized to the nucleus of spermatogonia but relocates to the cytoplasm during meiosis where it persists in spermatids and spermatozoa. Transposition and amplification of this autosomal gene during primate evolution gave rise to the DAZ gene cluster on the Y chromosome. Mutations in this gene have been linked to severe spermatogenic failure and infertility in males.
6135	
6136	
6137	The nuclear pore complex is a massive structure that extends across the nuclear envelope, forming a gateway that regulates the flow of macromolecules between the nucleus and the cytoplasm. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells. The protein encoded by this gene is a membrane-spanning glycoprotein that is a major component of the nuclear pore complex.
6138	
6139	Histamine is a ubiquitous messenger molecule released from mast cells, enterochromaffin-like cells, and neurons. Its various actions are mediated by histamine receptors H1, H2, H3 and H4. This gene was thought to be intronless until recently. The protein encoded by this gene is an integral membrane protein and belongs to the family 1 of G protein-coupled receptors. It mediates the contraction of smooth muscles, increase in capillary permeability due to contraction of terminal venules, catecholamine release from adrenal medulla, and mediates neurotransmission in the central nervous system.
6140	Huntington@#$%&s disease (HD), a neurodegenerative disorder characterized by loss of striatal neurons, is caused by an expansion of a polyglutamine tract in the HD protein huntingtin. This gene encodes a protein belonging to a class of huntingtin interacting proteins characterized by WW motifs. Two transcripts encoding different isoforms have been completely described.
6141	
6142	The protein encoded by this gene is a member of the L1 gene family of neural cell adhesion molecules. It is a neural recognition molecule that may be involved in signal transduction pathways. The deletion of one copy of this gene may be responsible for mental defects in patients with 3p- syndrome. Several alternatively spliced transcript variants of this gene have been described, but their full length nature is not known.
6143	
6144	The Ra-reactive factor (RARF) is a complement-dependent bactericidal factor that binds to the Ra and R2 polysaccharides expressed by certain enterobacteria. Alternate splicing of this gene results in two transcript variants encoding two RARF components that are involved in the mannan-binding lectin (MBL) pathway of complement activation. Each isoform is cleaved into two chains which form a heterodimer linked by a disulfide bond. The encoded proteins are members of the trypsin family of peptidases.
6145	
6146	This gene encodes a member of the tripartite motif (TRIM) family, whose members are involved in diverse cellular functions such as developmental patterning and oncogenesis. The TRIM motif includes zinc-binding domains, a RING finger region, a B-box motif and a coiled-coil domain. The RING finger and B-box domains chelate zinc and might be involved in protein-protein and/or protein-nucleic acid interactions. The gene mutations are associated with mulibrey (muscle-liver-brain-eye) nanism, an autosomal recessive disorder that involves several tissues of mesodermal origin. Alternatively spliced transcript variants encoding the same protein have been identified.
6147	
6148	
6149	
6150	
6151	This gene encodes two isoforms of a receptor for monocyte chemoattractant protein-1, a chemokine which specifically mediates monocyte chemotaxis. Monocyte chemoattractant protein-1 is involved in monocyte infiltration in inflammatory diseases such as rheumatoid arthritis as well as in the inflammatory response against tumors. The receptors encoded by this gene mediate agonist-dependent calcium mobilization and inhibition of adenylyl cyclase. This gene is located in the chemokine receptor gene cluster region. Two alternatively spliced transcript variants are expressed by the gene.
6152	The protein encoded by this gene is a member of the beta chemokine receptor family. It is predicted to be a seven transmembrane protein similar to G protein-coupled receptors. Chemokines and their receptors are key regulators of the thymocytes migration and maturation in normal and inflammation conditions. The specific ligand of this receptor is CCL25. It has been found that this gene is differentially expressed by T lymphocytes of small intestine and colon, suggested a role in the thymocytes recruitment and development that may permit functional specialization of immune responses in different segment of the gastrointestinal tract. This gene is mapped to the chemokine receptor gene cluster region. Two alternatively spliced transcript variants have been described.
6153	Transport of small hydrophilic substances across cell membranes is mediated by substrate-specific transporter proteins which have been classified into several families of related genes. The protein encoded by this gene is a member of the subgroup of transporter with unidentified substrates within the Na+ and Cl- coupled transporter family. This gene is expressed in kidney, and its alternative splicing generates 2 transcript variants.
6154	
6155	This gene encodes a class 1 aminoacyl-tRNA synthetase, mitochondrial leucyl-tRNA synthetase. Each of the twenty aminoacyl-tRNA synthetases catalyzes the aminoacylation of a specific tRNA or tRNA isoaccepting family with the cognate amino acid.
6156	This gene encodes a lysosomal hyaluronidase. Hyaluronidases intracellularly degrade hyaluronan, one of the major glycosaminoglycans of the extracellular matrix. Hyaluronan is thought to be involved in cell proliferation, migration and differentiation. This enzyme is active at an acidic pH and is the major hyaluronidase in plasma. Mutations in this gene are associated with mucopolysaccharidosis type IX, or hyaluronidase deficiency. The gene is one of several related genes in a region of chromosome 3p21.3 associated with tumor suppression. Eight alternatively spliced transcript variants of this gene encoding six distinct isoforms have been described.
6157	
6158	
6159	The protein encoded by this gene belongs to the SECY/SEC61- alpha family. It appears to play a crucial role in the insertion of secretory and membrane polypeptides into the endoplasmic reticulum. This protein found to be tightly associated with membrane-bound ribosomes, either directly or through adaptor proteins. This gene encodes an alpha subunit of the heteromeric SEC61 complex, which also contains beta and gamma subunits.
6160	ENTPD3 is similar to E-type nucleotidases (NTPases). NTPases, such as CD39, mediate catabolism of extracellular nucleotides. ENTPD3 contains 4 apyrase-conserved regions which is characteristic of NTPases.
6161	
6162	
6163	
6164	
6165	
6166	The protein encoded by this gene is a glycoprotein with an approximate molecular weight of 76.5 kDa. It is thought to have been created as a result of an ancient gene duplication event that led to generation of homologous C and N-terminal domains each of which binds 1 ion of ferric iron. The function of this encoded protein is to transport iron from the intestine, reticuloendothelial system, and liver parenchymal cells to all proliferating cells in the body. In addition to its function in iron transport, this protein may also have a physiologic role as granulocyte/pollen-binding protein (GPBP) involved in the removal of certain organic matter/allergins from serum.
6167	P-type ATPases, such as ATP11B, are phosphorylated in their intermediate state and drive uphill transport of ions across membranes. Several subfamilies of P-type ATPases have been identified. One subfamily transports heavy metal ions, such as Cu(2+) or Cd(2+). Another subfamily transports non-heavy metal ions, such as H(+), Na(+), K(+), or Ca(+). A third subfamily transports amphipaths, such as phosphatidylserine.[supplied by OMIM]
6168	
6169	
6170	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation.
6171	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation.
6172	
6173	Dystroglycan is a laminin binding component of the dystrophin-glycoprotein complex which provides a linkage between the subsarcolemmal cytoskeleton and the extracellular matrix. Dystroglycan 1 is a candidate gene for the site of the mutation in autosomal recessive muscular dystrophies. The dramatic reduction of dystroglycan 1 in Duchenne muscular dystrophy leads to a loss of linkage between the sarcolemma and extracellular matrix, rendering muscle fibers more susceptible to necrosis. Dystroglycan also functions as dual receptor for agrin and laminin-2 in the Schwann cell membrane. The muscle and nonmuscle isoforms of dystroglycan differ by carbohydrate moieties but not protein sequence.
6174	
6175	
6176	
6177	
6178	
6179	
6180	
6181	This gene encodes a protein with similarity to follistatin, an activin-binding protein. It contains an FS module, a follistatin-like sequence containing 10 conserved cysteine residues. This gene product is thought to be an autoantigen associated with rheumatoid arthritis.
6182	
6183	
6184	
6185	Huntingtin is a disease gene linked to Huntington@#$%&s disease, a neurodegenerative disorder characterized by loss of striatal neurons.  This is thought to be caused by an expanded,  unstable trinucleotide repeat in the huntingtin gene, which translates as a polyglutamine repeat in the protein product.  The huntingtin locus is large, spanning 180 kb and consisting of 67 exons. The huntingtin gene is widely expressed and is required for normal development.  It is expressed as 2 alternatively polyadenylated forms displaying different relative abundance in various fetal and adult tissues.  The larger transcript is approximately 13.7 kb and is expressed predominantly in adult and fetal brain whereas the smaller transcript of approximately 10.3 kb is more widely expressed.  The genetic defect leading to Huntington@#$%&s disease may not necessarily eliminate transcription, but may confer a new property on the mRNA or alter the function of the protein. One candidate is the huntingtin-associated protein-1, highly expressed in brain, which has increased affinity for huntingtin protein with expanded polyglutamine repeats.
6186	This gene encodes a member of the guanine nucleotide-binding protein (G protein)-coupled receptor kinase subfamily of the Ser/Thr protein kinase family. The protein phosphorylates the activated forms of G protein-coupled receptors thus initiating its deactivation. This gene has been linked to both genetic and acquired hypertension.
6187	
6188	
6189	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This gene was identified as an oncogene, which confers transforming potential when transfected into mammalian cells. Targeted disruption of the homolog of this gene in mouse resulted in the phenotype of abnormally long hair, which suggested a function as an inhibitor of hair elongation. Two alternatively spliced variants encoding different isoforms have been described.
6190	Proteins encoded by the complexin/synaphin gene family are cytosolic proteins that function in synaptic vesicle exocytosis.  These proteins bind syntaxin, part of the SNAP receptor.  The protein product of this gene binds to the SNAP receptor complex and disrupts it, allowing transmitter release.
6191	The protein encoded by this gene is a member of the fibroblast growth factor receptor family, where amino acid sequence is highly conserved between members and throughout evolution. FGFR family members differ from one another in their ligand affinities and tissue distribution. A full-length representative protein would consist of an extracellular region, composed of three immunoglobulin-like domains, a single hydrophobic membrane-spanning segment and a cytoplasmic tyrosine kinase domain. The extracellular portion of the protein interacts with fibroblast growth factors, setting in motion a cascade of downstream signals, ultimately influencing mitogenesis and differentiation. This particular family member binds acidic and basic fibroblast growth hormone and plays a role in bone development and maintenance. Mutations in this gene lead to craniosynostosis and multiple types of skeletal dysplasia. Alternative splicing occurs and additional variants have been described, including those utilizing alternate exon 8 rather than 9, but their full-length nature has not been determined.
6192	The function of this gene has not yet been determined; however, it is speculated that it may be involved in cell growth and differentiation.  Expression of this gene is up-regulated in some cancer cell lines, and in embryonic day 15 in mice.
6193	Glycophorins A (GYPA) and B (GYPB) are major sialoglycoproteins of the human erythrocyte membrane which bear the antigenic determinants for the MN and Ss blood groups. GYPB gene consists of 5 exons and has 97% sequence homology with GYPA from the 5@#$%& UTR to the coding sequence encoding the first 45 amino acids. In addition to the M or N and S or s antigens, that commonly occur in all populations, about 40 related variant phenotypes have been identified. These variants include all the variants of the Miltenberger complex and several isoforms of Sta; also, Dantu, Sat, He, Mg, and deletion variants Ena, S-s-U- and Mk. Most of the variants are resulted from gene recombinations between GYPA and GYPB.
6194	GABA is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA-A receptors, which are ligand-gated chloride channels. Chloride conductance of these channels can be modulated by agents such as benzodiazepines that bind to the GABA-A receptor. At least 16 distinct subunits of GABA-A receptors have been identified.
6195	The protein encoded by this gene belongs to the SMAD, a family of proteins similar to the gene products of the Drosophila gene @#$%&mothers against decapentaplegic@#$%& (Mad) and the C. elegans gene Sma. SMAD proteins are signal transducers and transcriptional modulators that mediate multiple signaling pathways. This protein mediates the signals of the bone morphogenetic proteins (BMPs), which are involved in a range of biological activities including cell growth, apoptosis, morphogenesis, development and immune responses. In response to BMP ligands, this protein can be phosphorylated and activated by the BMP receptor kinase. The phosphorylated form of this protein forms a complex with SMAD4, which is important for its function in the transcription regulation. This protein is a target for SMAD-specific E3 ubiquitin ligases, such as SMURF1 and SMURF2, and undergoes ubiquitination and proteasome-mediated degradation. Alternatively spliced transcript variants encoding the same protein have been observed.
6196	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. The gene encodes a protein with an extracellular domain containing 7 cadherin repeats. The gene product is an integral membrane protein that is thought to function in cell-cell recognition and adhesion. Alternative splicing yields isoforms with unique cytoplasmic tails.
6197	
6198	
6199	
6200	The protein encoded by this gene is a cellular protein that binds RNAs containing the G-rich element. Using indirect immunofluorescence microscopy this protein was found to be localized in the cytoplasm. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
6201	Immunoglobulins recognize foreign antigens and initiate immune responses such as phagocytosis and the complement system. Each immunoglobulin molecule consists of two identical heavy chains and two identical light chains. There are two classes of light chains, kappa and lambda. This region represents the germline organization of the lambda light chain locus. The locus includes V (variable), J (joining), and C (constant) segments. During B cell development, a recombination event at the DNA level joins a single V segment with a J segment; the C segment is later joined by splicing at the RNA level. Recombination of many different V segments with several J segments provides a wide range of antigen recognition. Additional diversity is attained by junctional diversity, resulting from the random additional of nucleotides by terminal deoxynucleotidyltransferase, and by somatic hypermutation, which occurs during B cell maturation in the spleen and lymph nodes. Several V segments and three C segments are known to be incapable of encoding a protein and are considered pseudogenes. The locus also includes several non-immunoglobulin genes, many of which are pseudogenes or are predicted by automated computational analysis or homology to other species.
6202	The olfactory neuroepithelium, which lines the posterior nasal cavity, is exposed to a wide range of odorants and airborne toxic compounds. Odorants, which are mostly small lipophilic molecules, enter the mucus flow and reach the odorant receptors on sensory neurons. Odorant sensing is generally a transient process, requiring an effective signal termination, which could be provided by biotransformation of the odorant in the epithelial supporting cells. Xenobiotic-metabolizing enzymes in the olfactory epithelium have been suggested to catalyze inactivation and facilitate elimination of odorants.[supplied by OMIM]
6203	
6204	Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene. The smaller protein isoform encoded by the shorter transcript variant is found only in HIV-1 infected cells.
6205	Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene. The smaller protein isoform encoded by the shorter transcript variant is found only in HIV-1 infected cells.
6206	
6207	This gene encodes a protein containing 9 WD repeats. WD repeats are approximately 30- to 40-amino acid domains containing several conserved residues, mostly including a trp-asp at the C-terminal end. WD domains are involved in protein-protein interactions. The encoded protein may help induce the disassembly of actin filaments. Two transcript variants encoding different isoforms have been found for this gene.
6208	This gene encodes one family member of vacuolar protein sorting 10 (VPS10) domain-containing receptor proteins. The VPS10 domain name comes from the yeast carboxypeptidase Y sorting receptor Vps10 protein. Members of this gene family are large with many exons but the CDS lengths are usually less than 3700 nt. Very large introns typically separate the exons encoding the VPS10 domain; the remaining exons are separated by much smaller-sized introns. These genes are strongly expressed in the central nervous system. Two of the five family members (sortilin and sortilin-related receptor) are synthesized as preproproteins; it is not yet known if this encoded protein is also a preproprotein.
6209	
6210	Lymphoid enhancer-binding factor-1 (LEF1) is a 48-kD nuclear protein that is expressed in pre-B and T cells. It binds to a functionally important site in the T-cell receptor-alpha (TCRA; MIM 186880) enhancer and confers maximal enhancer activity. LEF1 belongs to a family of regulatory proteins that share homology with high mobility group protein-1 (HMG1; MIM 163905).[supplied by OMIM]
6211	
6212	
6213	
6214	
6215	Palladin is a component of actin-containing microfilaments that control cell shape, adhesion, and contraction.[supplied by OMIM]
6216	This gene is the human homolog of mouse BMP-2-inducible kinase. Bone morphogenic proteins (BMPs) play a key role in skeletal development and patterning. Expression of the mouse gene is increased during BMP-2 induced differentiation and the gene product is a putative serine/threonine protein kinase containing a nuclear localization signal. Therefore, the protein encoded by this human homolog is thought to be a protein kinase with a putative regulatory role in attenuating the program of osteoblast differentiation. Two transcript variants encoding different isoforms have been found for this gene.
6217	
6218	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic filaments. Its function has not been identified.
6219	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two RRM domains that bind to RNAs. Two alternatively spliced transcript variants have been described for this gene. The variants encode the same protein but have different 3@#$%& UTRs. This protein is similar to its family member HNRPD.
6220	
6221	
6222	MTP encodes the large subunit of the heterodimeric microsomal triglyceride transfer protein. Protein disulfide isomerase (PDI) completes the heterodimeric microsomal triglyceride transfer protein, which has been shown to play a central role in lipoprotein assembly. Mutations in MTP can cause abetalipoproteinemia.
6223	
6224	This gene encodes a protein that contains four domains present in other developmental proteins: a PWWP domain, an HMG box, a SET domain, and a PHD-type zinc finger. It is expressed ubiquitously in early development. Wolf-Hirschhorn syndrome (WHS) is a malformation syndrome associated with a hemizygous deletion of the distal short arm of chromosome 4. This gene maps to the 165 kb WHS critical region and has also been involved in the chromosomal translocation t(4;14)(p16.3;q32.3) in multiple myelomas. This gene contains 25 exons. Alternative splicing of this gene generates nine transcript variants, which encode four different isoforms.
6225	This gene is expressed ubiquitously with higher levels in fetal than in adult tissues. It encodes a protein sharing 93% sequence identity with the mouse protein. Wolf-Hirschhorn syndrome (WHS) is a malformation syndrome associated with a hemizygous deletion of the distal short arm of chromosome 4. This gene is mapped to the 165 kb WHS critical region, and may play a role in the phenotype of the WHS or Pitt-Rogers-Danks syndrome. The encoded protein is found to be capable of reacting with HLA-A2-restricted and tumor-specific cytotoxic T lymphocytes, suggesting a target for use in specific immunotherapy for a large number of cancer patients. This protein has also been shown to be a member of the NELF (negative elongation factor) protein complex that participates in the regulation of RNA polymerase II transcription elongation.
6226	The DNA-associated protein encoded by this gene is a member of the paired family of homeobox proteins localized to the nucleus. The protein functions as a transcription factor involved in the development of several major noradrenergic neuron populations and the determination of neurotransmitter phenotype. The gene product is linked to enhancement of second messenger-mediated activation of the dopamine beta-hydroylase, c-fos promoters and several enhancers, including cyclic amp-response element and serum-response element.
6227	
6228	Basic helix-loop-helix (bHLH) transcription factors have been implicated in cell lineage determination and differentiation. The protein encoded by this gene is a bHLH transcription factor and shares similarity with another bHLH transcription factor, Twist. It is thought that during osteoblast development this protein may inhibit osteoblast maturation and maintain cells in a preosteoblast phenotype.
6229	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains several tandem leucine-rich repeats. Alternative splicing of this gene generates 2 transcript variants.
6230	
6231	
6232	
6233	This gene encodes a disintegrin and metalloproteinase with thrombospondin motifs-6 (ADAMTS6), which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of some variants has not been determined.
6234	This gene encodes a protein with multiple ankyrin repeat domains and a single KH-domain. Co-transcription of this gene and the neighboring downstream gene (EIF4EBP3) generates a transcript (MASK-BP3) which encodes a fusion protein comprised of the MASK protein sequence for the majority of the protein and a different C-terminus due to an alternate reading frame for the EIF4EBP3 segments.
6235	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This protein functions as a modifier of endothelial cell migration and proliferation, as well as an angiogenic factor. It acts as a mitogen for a variety of mesoderm- and neuroectoderm-derived cells in vitro, thus is thought to be involved in organogenesis. Three alternatively spliced variants encoding different isoforms have been described.
6236	
6237	
6238	
6239	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
6240	This gene encodes a membrane bound protein with several transient phosphorylation sites and PEST motifs. Conservation of proteins with PEST sequences among different species supports their functional significance. PEST sequences typically occur in proteins with high turnover rates. Immunological characteristics of this protein are species specific. This protein also undergoes N-terminal myristoylation.
6241	
6242	Members of the semaphorin protein family, such as SEMA5A, are involved in axonal guidance during neural development (Adams et al., 1996).[supplied by OMIM]
6243	DNA damage and/or hyperproliferative signals activate wildtype p53 tumor suppressor protein (TP53; MIM 191170), inducing cell cycle arrest or apoptosis. Mutations that inactivate p53 occur in 50% of all tumors. Polyak et al. (1997) used serial analysis of gene expression (SAGE) to evaluate cellular mRNA levels in a colorectal cancer cell line transfected with p53. Of 7,202 transcripts identified, only 14 were expressed at levels more than 10-fold higher in p53-expressing cells than in control cells. Polyak et al. (1997) termed these genes @#$%&p53-induced genes,@#$%& or PIGs, several of which were predicted to encode redox-controlling proteins. They noted that reactive oxygen species (ROS) are potent inducers of apoptosis. Flow cytometric analysis showed that p53 expression induces ROS production, which increases as apoptosis progresses under some conditions. The authors stated that the PIG10 gene, also called ENC1, encodes an actin-binding protein.[supplied by OMIM]
6244	Eukaryotes contain two protein translational systems, one in the cytoplasm and one in the mitochondria. Mitochondrial translation is crucial for maintaining mitochondrial function and mutations in this system lead to a breakdown in the respiratory chain-oxidative phosphorylation system and to impaired maintenance of mitochondrial DNA. This gene encodes one of mitochondrial translation elongation factors. Its role in the regulation of normal mitochondrial function and in different disease states attributed to mitochondrial dysfunction is not known. Alternative splicing results in at least three transcript variants encoding distinct isoforms.
6245	
6246	
6247	
6248	
6249	
6250	
6251	The protein encoded by this gene belongs to the subtilisin-like proprotein convertase family. The members of this family are proprotein convertases that process latent precursor proteins into their biologically active products. This encoded protein is a type I proinsulin-processing enzyme that plays a key role in regulating insulin biosynthesis. It is also known to cleave proopiomelanocortin, prorenin, proenkephalin, prodynorphin, prosomatostatin and progastrin. Mutations in this gene are thought to cause obesity. This encoded protein is associated with carcinoid tumors. The use of alternate polyadenylation sites has been found for this gene. Multiple alternatively spliced transcript variants have been described for this gene but their full length nature is not known.
6252	The protein encoded by this gene is an endogenous calpain (calcium-dependent cysteine protease) inhibitor. It consists of an N-terminal domain L and four repetitive calpain-inhibition domains (domains 1-4), and it is involved in the proteolysis of amyloid precursor protein. The calpain/calpastatin system is involved in numerous membrane fusion events, such as neural vesicle exocytosis and platelet and red-cell aggregation. The encoded protein is also thought to affect the expression levels of genes encoding structural or regulatory proteins. Several alternatively spliced transcript variants of this gene have been described, but only some of their full length natures have been determined.
6253	
6254	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members, but contains a SH3 domain. The specific function of this protein has not been determined.
6255	
6256	Coagulation factor II receptor is a 7-transmembrane receptor involved in the regulation of thrombotic response.  Proteolytic cleavage leads to the activation of the receptor.  F2R is a G-protein coupled receptor family member.
6257	Hexokinases phosphorylate glucose to produce glucose-6-phosphate, thus committing glucose to the glycolytic pathway. This gene encodes hexokinase 3. Similar to hexokinases 1 and 2, this allosteric enzyme is inhibited by its product glucose-6-phosphate.
6258	The protein encoded by this gene contains a RING finger, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions.
6259	This gene encodes a disintegrin and metalloproteinase with thrombospondin motifs-2 (ADAMTS2), which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The enzyme encoded by this gene excises the N-propeptide of type I, type II and type V procollagens. Mutations in this gene cause Ehlers-Danlos syndrome type VIIC, a recessively inherited connective-tissue disorder. Alternative splicing of the gene generates 2 transcript variants. The short transcript encodes a protein, which has no significant procollagen N-peptidase activity.
6260	
6261	
6262	
6263	
6264	The protein encoded by this gene is a pleiotropic cytokine produced by activated T cells. This cytokine is a ligand for interleukin 4 receptor. The interleukin 4 receptor also binds to IL13, which may contribute to many overlapping functions of this cytokine and IL13. STAT6, a signal transducer and activator of transcription, has been shown to play a central role in mediating the immune regulatory signal of this cytokine. This gene, IL3, IL5, IL13, and CSF2 form a cytokine gene cluster on chromosome 5q, with this gene particularly close to IL13. This gene, IL13 and IL5 are found to be regulated coordinately by several long-range regulatory elements in an over 120 kilobase range on the chromosome. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
6265	This gene encodes a ubiquinone-binding protein of low molecular mass.  This function of this protein has not yet been determined; however, it is a homologue of bovine low molecular mass ubiquinone-binding protein gene.
6266	The immunosuppressant drug cyclosporin A blocks a calcium-dependent signal from the T-cell receptor (TCR) that normally leads to T-cell activation. When bound to cyclophilin B, cyclosporin A binds and inactivates the key signaling intermediate calcineurin. The protein encoded by this gene functions similarly to cyclosporin A, binding to cyclophilin B and acting downstream of the TCR and upstream of calcineurin by causing an influx of calcium. This integral membrane protein appears to be a new participant in the calcium signal transduction pathway, implicating cyclophilin B in calcium signaling, even in the absence of cyclosporin.
6267	This gene belongs to the cytokine gene family which encode secreted proteins involved in immunoregulatory and inflammatory processes. The protein encoded by this gene is structurally related to the CXC (Cys-X-Cys) subfamily of cytokines. Members of this subfamily are characterized by two cysteines separated by a single amino acid. This cytokine displays chemotactic activity for monocytes but not for lymphocytes, dendritic cells, neutrophils or macrophages. It has been implicated that this cytokine is involved in the homeostasis of monocyte-derived macrophages rather than in inflammation.
6268	
6269	The protein encoded by this gene interacts with atrophin-1. Atrophin-1 contains a polyglutamine repeat, expansion of which is responsible for dentatorubral and pallidoluysian atrophy. This encoded protein is characterized by two WW domains, a guanylate kinase-like domain, and multiple PDZ domains. It has structural similarity to the membrane-associated guanylate kinase homologue (MAGUK) family.
6270	This gene encodes a multipass transmembrane protein that is expressed in joints and other tissues and controls pyrophosphate levels in cultured cells. Mutation at the mouse @#$%&progressive ankylosis@#$%& (ank) locus causes a generalized, progressive form of arthritis accompanied by mineral deposition, formation of bony outgrowths, and joint destruction. The human homolog is virtually identical to the mouse protein and ANKH-mediated control of pyrophosphate levels has been suggested as a possible mechanism regulating tissue calcification and susceptibility to arthritis in higher animals.
6271	Synaptopodin is an actin-associated protein that may play a role in actin-based cell shape and motility. The name synaptopodin derives from the protein@#$%&s associations with postsynaptic densities and dendritic spines and with renal podocytes (Mundel et al., 1997).[supplied by OMIM]
6272	
6273	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes.  They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis.  Laminins are composed of 3 non identical chains: laminin alpha, beta and gamma (formerly A, B1, and B2, respectively) and they form a cruciform structure consisting of 3 short arms, each formed by a different chain, and a long arm composed of all 3 chains.  Each laminin chain is a multidomain protein encoded by a distinct gene.  Several isoforms of each chain have been described.  Different alpha, beta and gamma chain isomers combine to give rise to different heterotrimeric laminin isoforms which are designated by Arabic numerals in the order of their discovery, i.e. alpha1beta1gamma1 heterotrimer is laminin 1.  The biological functions of the different chains and trimer molecules are largely unknown, but some of the chains have been shown to differ with respect to their tissue distribution, presumably reflecting diverse functions in vivo.  This gene encodes the beta chain isoform laminin, beta 2.  The beta 2 chain contains the 7 structural domains typical of beta chains of laminin, including the short alpha region. However, unlike beta 1 chain, beta 2 has a more restricted tissue distribution.  It is enriched in the basement membrane of muscles at the neuromuscular junctions, kidney glomerulus and vascular smooth muscle.  Transgenic mice in which the beta 2 chain gene was inactivated by homologous recombination, showed defects in the maturation of neuromuscular junctions and impairment of glomerular filtration.  Alternative splicing involving a non consensus 5@#$%& splice site (gc) in the 5@#$%& UTR of this gene has been reported.  It was suggested that inefficient splicing of this first intron, which does not change the protein sequence, results in a greater abundance of the unspliced form of the transcript than the spliced form. The full-length nature of the spliced transcript is not known.
6274	
6275	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is possibly part of the V0 subunit. Since two nontranscribed pseudogenes have been found in dog, it is possible that the localization to chromosome 2 for this gene by radiation hybrid mapping is representing a pseudogene. Genomic mapping puts the chromosomal location on 5q35.3.
6276	C7 is a component of the complement system.  It participates in the formation of Membrane Attack Complex (MAC).  People with C7 deficiency are prone to bacterial infection.
6277	
6278	
6279	This gene encodes the homolog of the Drosophila melanogaster Nipped-B gene product and fungal Scc2-type sister chromatid cohesion proteins. The Drosophila protein facilitates enhancer-promoter communication of remote enhancers and plays a role in developmental regulation. It is also homologous to a family of chromosomal adherins with broad roles in sister chromatid cohesion, chromosome condensation, and DNA repair. The human protein has a bipartite nuclear targeting sequence and a putative HEAT repeat. Condensins, cohesins and other complexes with chromosome-related functions also contain HEAT repeats. Mutations in this gene result in Cornelia de Lange syndrome.
6280	
6281	
6282	
6283	
6284	
6285	
6286	The protein encoded by this gene belongs to the tigger subfamily of the pogo superfamily of DNA-mediated transposons in humans. These proteins are related to DNA transposons found in fungi and nematodes, and more distantly to the Tc1 and mariner transposases. They are also very similar to the major mammalian centromere protein B. The exact function of this gene is not known.
6287	This gene encodes a type II classical cadherin from the cadherin superfamily. The encoded membrane protein is a calcium dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. Cadherins mediate cell-cell binding in a homophilic manner, contributing to the sorting of heterogeneous cell types and the maintenance of orderly structures such as epithelium. Strong transcriptional expression of this gene has been observed in hepatocellular and renal carcinoma cell lines, suggesting a possible role in metastasis and invasion.
6288	
6289	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbws class and, in addition to an F-box, contains multiple WD40 repeats. This gene contains at least 14 exons, and its alternative splicing generates 3 transcript variants diverging at the presence/absence of two alternate exons.
6290	The protein encoded by this gene is a receptor for glucocorticoids and can act as both a transcription factor and a regulator of other transcription factors. The encoded protein can bind DNA as a homodimer or as a heterodimer with another protein such as the retinoid X receptor. This protein can also be found in heteromeric cytoplasmic complexes along with heat shock factors and immunophilins. The protein is typically found in the cytoplasm until it binds a ligand, which induces transport into the nucleus. Mutations in this gene are a cause of glucocorticoid resistance, or cortisol resistance. Alternate splicing, the use of at least three different promoters, and alternate translation initiation sites result in several transcript variants encoding the same protein or different isoforms, but the full-length nature of some variants has not been determined.
6291	
6292	
6293	The protein encoded by this gene belongs to the EGR family of C2H2-type zinc-finger proteins. It is a nuclear protein and functions as a transcriptional regulator. The products of target genes it activates are required for differentitation and mitogenesis. Studies suggest this is a cancer suppresor gene.
6294	Lysophospholipases are enzymes that act on biological membranes to regulate the multifunctional lysophospholipids. The protein encoded by this gene hydrolyzes lysophosphatidylcholine in both monomeric and micellar forms. The use of alternate polyadenylation sites has been found for this gene. There are alternatively spliced transcript variants described for this gene but the full length nature is not known yet.
6295	This gene encodes the enzyme that metabolizes thiopurine drugs via S-adenosyl-L-methionine as the S-methyl donor and S-adenosyl-L-homocysteine as a byproduct. Thiopurine drugs such as 6-mercaptopurine are used as chemotherapeutic agents. Genetic polymorphisms that affect this enzymatic activity are correlated with variations in sensitivity and toxicity to such drugs within individuals. A pseudogene for this locus is located on chromosome 18q.
6296	
6297	The product of this gene is an important intermediate enzyme in polyamine biosynthesis. The polyamines spermine, spermidine, and putrescine are low-molecular-weight aliphatic amines essential for cellular proliferation and tumor promotion. This gene spans 22 kb comprised of 9 exons and 8 introns and encoding two species of mRNA of 2.1 and 3.4-3.6 kb originating from the use of two different polyadenylation signals. The pro-protein is an approximate 38.3 kDa which is known to undergo processing at amino acid 68 to yield two fragments of 32- and 6-kDa.
6298	
6299	
6300	
6301	This gene encodes the alpha chain of type XII collagen, a member of the FACIT (fibril-associated collagens with interrupted triple helices) collagen family. Type XII collagen is a homotrimer found in association with type I collagen, an association that is thought to modify the interactions between collagen I fibrils and the surrounding matrix. Two transcript variants encoding different isoforms have been identified for this gene.
6302	CD109 is a GPI-linked cell surface antigen expressed by CD34+ acute myeloid leukemia cell lines, T-cell lines, activated T lymphoblasts, endothelial cells, and activated platelets (Lin et al., 2002). In addition, the platelet-specific Gov antigen system, implicated in refractoriness to platelet transfusion, neonatal alloimmune thrombocytopenia, and posttransfusion purpura, is carried by CD109 (Kelton et al., 1990; Lin et al., 2002).[supplied by OMIM]
6303	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme is located in the membrane of the endoplasmic reticulum (ER) and may contribute to quality control ER-associated degradation by the ubiquitin-proteasome system.
6304	
6305	The protein encoded by this gene is a substrate for protein kinase C. It is localized to the plasma membrane and is an actin filament crosslinking protein. Phosphorylation by protein kinase C or binding to calcium-calmodulin inhibits its association with actin and with the plasma membrane, leading to its presence in the cytoplasm. The protein is thought to be involved in cell motility, phagocytosis, membrane trafficking and mitogenesis.
6306	
6307	GPR119 is a member of the rhodopsin family of G protein-coupled receptors (GPRs) (Fredriksson et al., 2003).[supplied by OMIM]
6308	
6309	
6310	
6311	
6312	
6313	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. These enzymes function in the detoxification of electrophilic compounds, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress, by conjugation with glutathione. The genes encoding these enzymes are known to be highly polymorphic. These genetic variations can change an individual@#$%&s susceptibility to carcinogens and toxins as well as affect the toxicity and efficacy of some drugs. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-tranferase belonging to the alpha class. The alpha class genes, located in a cluster mapped to chromosome 6, are the most abundantly expressed glutathione S-transferases in liver. In addition to metabolizing bilirubin and certain anti-cancer drugs in the liver, the alpha class of these enzymes exhibit glutathione peroxidase activity thereby protecting the cells from reactive oxygen species and the products of peroxidation.
6314	
6315	
6316	The protein encoded by this gene is a component of the exocyst complex, a multiple protein complex essential for targeting exocytic vesicles to specific docking sites on the plasma membrane. Though best characterized in yeast, the component proteins and the functions of the exocyst complex have been demonstrated to be highly conserved in higher eukaryotes. At least eight components of the exocyst complex, including this protein, are found to interact with the v-ral simian leukemia viral oncogene homolog A (RALA), a Ras-related small GTPase involved in controlling actin cytoskeletal remodeling and vesicle transport. The binding of this protein to RALA has been shown to mediate filopodia formation in fibroblasts. Mutation studies of the similar gene in Drosophila suggested that this protein is required for neuronal membrane traffic.
6317	
6318	
6319	
6320	The enzyme encoded by this gene is responsible for the catalysis and formation of S-lactoyl-glutathione from methylglyoxal condensation and reduced glutatione.  Glyoxalase I is linked to HLA and is localized to 6p21.3-p21.1, between HLA and the centromere.
6321	The protein encoded by this gene is one subunit of a trimeric complex, forming a highly conserved transcription factor that binds to CCAAT motifs in the promoter regions in a variety of genes. Subunit A associates with a tight dimer composed of the B and C subunits, resulting in a trimer that binds to DNA with high specificity and affinity. The sequence specific interactions of the complex are made by the A subunit, suggesting a role as the regulatory subunit. In addition, there is evidence of post-transcriptional regulation in this gene product, either by protein degradation or control of translation. Further regulation is represented by alternative splicing in the glutamine-rich activation domain, with clear tissue-specific preferences for the two isoforms.
6322	
6323	
6324	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein found in the outer segment of both rod and cone photoreceptor cells. It may function as an adhesion molecule involved in stabilization and compaction of outer segment disks or in the maintenance of the curvature of the rim. This protein is essential for disk morphogenesis. Defects in this gene are associated with both central and peripheral retinal degenerations. Some of the various phenotypically different disorders are autosomal dominant retinitis pigmentosa, progressive macular degeneration, macular dystrophy and retinitis pigmentosa digenic.
6325	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin forms a complex with and functions as a regulatory subunit of CDK4 or CDK6, whose activtiy is required for cell cycle G1/S transition. This protein has been shown to interact with and be involved in the phosphorylation of tumor suppressor protein Rb. The CDK4 activity associated with this cyclin was reported to be necessary for cell cycle progression through G2 phase into mitosis after UV radiation.
6326	
6327	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order   structures. This gene is intronless and encodes a member of the histone H4 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6.
6328	
6329	
6330	Glutathione peroxidase (GPX)/glutathione reductase is part of the hydrogen peroxide scavenging system found within the epididymis in the mammalian male reproductive tract. GPX5 expression is epididymis-specific and the transcript is unique from other GPXs because it contains a deletion resulting in an mRNA that does not contain a selenocysteine (UGA) codon (an unusual amino acid present in other GPXs). This deletion also renders the mRNA incapable of encoding an active GPX isoenzyme. For this reason, GPX5 is selenium-independent and has very little activity towards hydrogen peroxide or organic hydroperoxides. GPX5, which is bound to the acrosome of sperm, may act to protect sperm from premature acrosome reaction in the epididymis.
6331	
6332	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein localizes to cytoplasmic bodies. Studies in mice suggest that this protein plays a role in terminal differentiation of erythroid cells. Alternate splicing of this gene generates two transcript variants encoding different isoforms.
6333	
6334	Caveolae are small domains on the inner cell membrane involved in vesicular trafficking and signal transduction.  FLOT1 encodes a caveolae-associated, integral membrane protein.  The function of flotillin 1 has not been determined.
6335	This gene is induced by cytokines and interferon. Its protein product is thought to be involved in negative regulation of growth of vascular smooth muscle cells, which contributes to the anti-inflammatory response to vessel wall trauma. The gene expresses three transcripts.
6336	A cluster of genes, BAT1-BAT5, has been localized in the vicinity of the genes for TNF alpha and TNF beta. This gene is found near this cluster; it was mapped near the gene for C2 within a 120-kb region that included a HSP70 gene pair. These genes are all within the human major histocompatibility complex class III region. This gene was thought to be two different genes, NG36 and G9a, adjacent to each other but a recent publication shows that there is only a single gene. The protein encoded by this gene is thought to be involved in intracellular protein-protein interaction. There are three alternatively spliced transcript variants of this gene but only two are fully described.
6337	This gene encodes a ubiquitously expressed member of the TALE/PBX homeobox family. It was identified by its similarity to a homeobox gene which is involved in t(1;19) translocation in acute pre-B-cell leukemias. This protein is a transcriptional activator which binds to the TLX1 promoter. The gene is located within the major histocompatibility complex (MHC) on chromosome 6.
6338	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S13P family of ribosomal proteins. It is located in the cytoplasm. The gene product of the E. coli ortholog (ribosomal protein S13) is involved in the binding of fMet-tRNA, and thus, in the initiation of translation. This gene is an ortholog of mouse Ke3. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
6339	
6340	The protein encoded by this gene is a member of the peroxisome proliferator-activated receptor (PPAR) family. PPARs are nuclear hormone receptors that bind peroxisome proliferators and control the size and number of peroxisomes produced by cells. PPARs mediate a variety of biological processes, and may be involved in the development of several chronic diseases, including diabetes, obesity, atherosclerosis, and cancer. This protein is a potent inhibitor of ligand-induced transcription activity of PPAR delta and PPAR gamma. It may function as an integrator of transcription repression and nuclear receptor signaling. The expression of this gene is found to be elevated in colorectal cancer cells. The elevated expression can be repressed by adenomatosis polyposis coli (APC), a tumor suppressor protein related to APC/beta-catenin signaling pathway. Knockout studies in mice suggested the role of this protein in myelination of the corpus callosum, lipid metabolism, and epidermal cell proliferation. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
6341	G protein-coupled receptors (GPCRs, or GPRs) contain 7 transmembrane domains and transduce extracellular signals through heterotrimeric G proteins.[supplied by OMIM]
6342	
6343	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains at least 9 tandem leucine-rich repeats.
6344	SIM1 and SIM2 genes are Drosophila single-minded (sim) gene homologs.  SIM1 transcript was detected only in fetal kidney out of various adult and fetal tissues tested.  Since the sim gene plays an important role in Drosophila development and has peak levels of expression during the period of neurogenesis,it was proposed that the human SIM gene is a candidate for involvement in certain dysmorphic features (particularly the facial and skull characteristics), abnormalities of brain development, and/or mental retardation of Down syndrome.
6345	
6346	
6347	This gene shares both structural and functional similarities with the dystrophin gene.  It contains an actin-binding N-terminus, a triple coiled-coil repeat central region, and a C-terminus that consists of protein-protein interaction motifs which interact with dystroglycan protein components.  The protein encoded by this gene is located at the neuromuscular synapse and myotendinous junctions, where it participates in post-synaptic membrane maintenance and acetylcholine receptor clustering.  Mouse studies suggest that this gene may serve as a functional substitute for the dystrophin gene and therefore, may serve as a potential therapeutic alternative to muscular dystrophy which caused by mutations in the dystrophin gene.  Alternative splicing of the utrophin gene has been described; however, the full-length nature of these variants has not yet been determined.
6348	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members, but does contain a SH3 domain near its N-terminus. This protein interacts with the cytoplasmic domains of the precursor but not the processed forms of a disintegrin and metalloprotease domain 9 and 15. This protein binds the beta-appendage domain of adaptor protein 2 and may function to assist adaptor protein 2 in its role at the plasma membrane. This protein interacts with activated Cdc42-associated kinase-2 to regulate the degradation of epidermal growth factor receptor protein.
6349	
6350	
6351	This gene encodes a member of the sirtuin family of proteins, homologs to the yeast Sir2 protein. Members of the sirtuin family are characterized by a sirtuin core domain and grouped into four classes. The functions of human sirtuins have not yet been determined; however, yeast sirtuin proteins are known to regulate epigenetic gene silencing and suppress recombination of rDNA. Studies suggest that the human sirtuins may function as intracellular regulatory proteins with mono-ADP-ribosyltransferase activity. The protein encoded by this gene is included in class III of the sirtuin family. Alternative splicing of this gene results in two transcript variants.
6352	
6353	
6354	
6355	
6356	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. The encoded protein is expressed in endothelial cells, cultured fibroblasts, and osteosarcoma cells. It associates with protein kinases A and C and phosphatase, and serves as a scaffold protein in signal transduction. This protein and RII PKA colocalize at the cell periphery. This protein is a cell growth-related protein. Antibodies to this protein can be produced by patients with myasthenia gravis. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
6357	
6358	
6359	This gene product is a member of the Vanin family of proteins which share extensive sequence similarity with each other, and also with biotinidase. The family includes secreted and membrane-associated proteins, a few of which have been reported to participate in hematopoietic cell trafficking. No biotinidase activity has been demonstrated for any of the vanin proteins, however, they possess pantetheinase activity, which may play a role in oxidative-stress response. The encoded protein is a secreted molecule whose exact function is not known. This gene lies in close proximity to, and in same transcriptional orientation as two other vanin genes on chromosome 6q23-q24. Several transcript variants encoding different isoforms have been described for this gene.
6360	This protein belongs to the aldehyde dehydrogenases family of proteins. It plays a role in a pathway of 9-cis-retinoic acid biosynthesis in vivo. This enzyme converts 9-cis-retinal into the retinoid X receptor ligand 9-cis-retinoic acid, and has approximately 40-fold higher activity with 9-cis-retinal than with all-trans-retinal. Therefore, it is the first known aldehyde dehydrogenase to show a preference for 9-cis-retinal relative to all-trans-retinal. Two transcript variants encoding distinct isoforms have been identified for this gene.
6361	
6362	
6363	
6364	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. The protein encoded by this gene and SEC62 protein are found to be associated with ribosome-free SEC61 complex. It is speculated that Sec61-Sec62-Sec63 may perform post-translational protein translocation into the ER. The Sec61-Sec62-Sec63 complex might also perform the backward transport of ER proteins that are subject to the ubiquitin-proteasome-dependent degradation pathway. The encoded protein is an integral membrane protein located in the rough ER.
6365	The protein encoded by this gene stimulates the formation of a non-mitotic multinucleate syncytium from proliferative cytotrophoblasts during trophoblast differentiation. An alternatively spliced transcript variant of this gene may exist.
6366	
6367	The Wilms tumor suppressor gene WT1 appears to play a role in both transcriptional and posttranscriptional regulation of certain cellular genes. This gene encodes a WT1-associating protein, which is a ubiquitously expressed nuclear protein. Like WT1 protein, this protein is localized throughout the nucleoplasm as well as in speckles and partially colocalizes with splicing factors. Alternative splicing of this gene results in three transcript variants, two of which encode the same isoform.
6368	
6369	
6370	This gene encodes a member of the sialomucin protein family. The encoded protein was originally identified as an important component of glomerular podocytes. Podocytes are highly differentiated epithelial cells with interdigitating foot processes covering the outer aspect of the glomerular basement membrane. Other biological activities of the encoded protein include: binding in a membrane protein complex with Na+/H+ exchanger regulatory factor to intracellular cytoskeletal elements, playing a role in hematopoetic cell differentiation, and being expressed in vascular endothelium cells and binding to L-selectin.
6371	
6372	
6373	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. The encoded protein sequence is 100% identical to the mouse homolog and 98% identical to the frog and zebrafish homologs. Two alternatively spliced transcript variants have been found for this gene and they encode distinct isoforms.
6374	
6375	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. The encoded protein is one of three A subunit proteins. Three splice variants have been described in the literature that encode the same protein. Additional splice variants were noted during sequence analysis but their full-length nature was not determined. Mutations in this gene were found in patients with renal tubular acidosis associated with normal audiometry.
6376	
6377	
6378	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
6379	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
6380	
6381	
6382	
6383	
6384	This gene encodes a member of the DNAJ protein family. DNAJ family members are characterized by a highly conserved amino acid stretch called the @#$%&J-domain@#$%& and function as one of the two major classes of molecular chaperones involved in a wide range of cellular events, such as protein folding and oligomeric protein complex assembly. This family member may also play a role in polyglutamine aggregation in specific neurons. Alternative splicing of this gene results in multiple transcript variants; however, not all variants have been fully described.
6385	
6386	Similarity to a related locus suggests that this gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. This subunit exists as a heterodimer with another polymerase subunit; together they form a core subassembly unit of the polymerase. Two similar genes are located downstream of this locus. Alternative splicing of this gene results in three transcript variants encoding different isoforms.
6387	
6388	
6389	This gene encodes four products of the tachykinin peptide hormone family, substance P and neurokinin A, as well as the related peptides, neuropeptide K and neuropeptide gamma.  These hormones are thought to function as neurotransmitters which interact with nerve receptors and smooth muscle cells.  They are known to induce behavioral responses and function as vasodilators and secretagogues.  Alternative splicing of exons 4 and/or 6 produces four known products of undetermined significance.
6390	Vesicle mediated protein sorting plays an important role in segregation of intracellular molecules into distinct organelles. Genetic studies in yeast have identified more than 40 vacuolar protein sorting (VPS) genes involved in vesicle transport to vacuoles. This gene encodes the human ortholog of yeast Vps41 protein which is also conserved in Drosophila, tomato, and Arabidopsis. Expression studies in yeast and human indicate that this protein may be involved in the formation and fusion of transport vesicles from the Golgi. Several transcript variants encoding different isoforms have been described for this gene, however, the full-length nature of not all is known.
6391	Secreted frizzled-related protein 4 (SFRP4) is a member of the SFRP family that contains a cysteine-rich domain homologous to the putative Wnt-binding site of Frizzled proteins. SFRPs act as soluble modulators of Wnt signaling. The expression of SFRP4 in ventricular myocardium correlates with apoptosis related gene expression.
6392	
6393	
6394	The protein encoded by this gene is a GTPase which belongs to the RAS superfamily of small GTP-binding proteins. Members of this superfamily appear to regulate a diverse array of cellular events, including the control of cell growth, cytoskeletal reorganization, and the activation of protein kinases. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
6395	
6396	WD40 repeat proteins are key components of many essential biologic functions. They regulate the assembly of multiprotein complexes by presenting a beta-propeller platform for simultaneous and reversible protein-protein interactions. Members of the WIPI subfamily of WD40 repeat proteins, such as WIPI2, have a 7-bladed propeller structure and contain a conserved motif for interaction with phospholipids (Proikas-Cezanne et al., 2004).[supplied by OMIM]
6397	
6398	
6399	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
6400	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
6401	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
6402	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
6403	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
6404	
6405	This gene is one of two Cys-Cys (CC) cytokine genes clustered on the q arm of chromosome 7. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene displays chemotactic activity for normal peripheral blood eosinophils and basophils. The product of this gene is one of three related chemokines that specifically activate chemokine receptor CCR3. This chemokine may contribute to the eosinophil accumulation in atopic diseases.
6406	This gene belongs to the subfamily of small cytokine CC genes. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene displays chemotactic activity on resting T lymphocytes, a minimal activity on neutrophils, and is negative on monocytes and activated T lymphocytes. The protein is also a strong suppressor of colony formation by a multipotential hematopoietic progenitor cell line.
6407	
6408	
6409	
6410	
6411	
6412	Aryl hydrocarbon receptor is a ligand-activated transcription factor involved in the regulation of biological responses to planar aromatic hydrocarbons.  AHR has been shown to regulate xenobiotic-metabolizing enzymes such as cytochrome P450.  AHR ligands included a variety of aromatic hydrocarbons.
6413	
6414	
6415	
6416	
6417	This gene encodes disintegrin and metalloprotease (ADAM) domain 22, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving  cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. This gene is highly expressed in the brain and may function as an integrin ligand in the brain. It has been described that alternative splicing of this gene generates several transcript variants.
6418	This gene is a member of the insulin-like growth factor binding protein (IGFBP) family and encodes a protein with an IGFBP domain and a thyroglobulin type-I domain. The protein forms a ternary complex with insulin-like growth factor acid-labile subunit (IGFALS) and either insulin-like growth factor (IGF) I or II. In this form, it circulates in the plasma, prolonging the half-life of IGFs and altering their interaction with cell surface receptors. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
6419	
6420	
6421	Vitamin B6, or pyridoxal 5-prime-phosphate (PLP), is critical for normal cellular function, and some cancer cells have notable differences in vitamin B6 metabolism compared to their normal counterparts. The rate-limiting enzyme in vitamin B6 synthesis is pyridoxine-5-prime-phosphate (PNP) oxidase (PNPO; EC 1.4.3.5).[supplied by OMIM]
6422	
6423	
6424	
6425	
6426	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. This protein functions as a chloride channel and controls the regulation of other transport pathways. Mutations in this gene have been observed in patients with the autosomal recessive disorders cystic fibrosis (CF) and congenital bilateral aplasia of the vas deferens (CBAVD). Alternative splice variants have been described, many of which result from mutations in the CFTR gene.
6427	The gene for this product maps to a region on chromosome 7 identified as an autism-susceptibility locus. Mutation screening of the entire coding region in autistic individuals failed to identify phenotype-specific variants, suggesting that coding mutations for this gene are unlikely to be involved in the etiology of autism. The function of this gene product has not been determined. Transcript variants encoding different isoforms of this protein have been described.
6428	
6429	The human NDUFA5 gene codes for the B13 subunit of complex I of the respiratory chain, which transfers electrons from NADH to ubiquinone. The high degree of conservation of NDUFA5 extending to plants and fungi indicates its functional significance in the enzyme complex. The protein localizes to the inner mitochondrial membrane as part of the 7 component-containing, water soluble @#$%!iron-sulfur protein@#$%! (IP) fraction of complex I, although its specific role is unknown. It is assumed to undergo post-translational removal of the initiator methionine and N-acetylation of the next amino acid. The predicted secondary structure is primarily alpha helix, but the carboxy-terminal half of the protein has high potential to adopt a coiled-coil form. The amino-terminal part contains a putative beta sheet rich in hydrophobic amino acids that may serve as mitochondrial import signal. Related pseudogenes have also been identified on four other chromosomes.
6430	
6431	
6432	
6433	
6434	This gene encodes a zinc finger protein. The zinc finger proteins are involved in DNA binding and protein-protein interactions. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
6435	
6436	Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. It is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, F0, which comprises the proton channel. The F1 complex consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled in a ratio of 3 alpha, 3 beta, and a single representative of the other 3. The F0 seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the f subunit of the F0 complex. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
6437	
6438	
6439	NCALD is a member of the EF-hand calcium-binding protein superfamily.[supplied by OMIM]
6440	The protein encoded this gene contains two RING-finger motifs and an IBR (in between RING fingers) motif. This protein is an E3 ubiquintin ligase that is localized in Lewy bodies (LBs), a characteristic neuronal inclusion in Parkinson@#$%&s disease (PD) brains. This protein interacts with UBE2L3/UBCH7 and UBE2E2/UBCH8, but not other ubiquitin-conjugating enzymes. This protein is found to bind and ubiquitylate synphilin 1 (SNCAIP), which is a interacting protein of alpha synuclein in neurons, and a major component of LB. Alternatively spliced transcript variants encoding the same protein have been reported.
6441	This gene encodes a macrophage scavenger receptor-like protein. This protein has been shown to deplete reactive oxygen species, and thus play an important role in protecting cells from oxidative stress. The expression of this gene is induced by oxidative stress. Alternatively spliced transcript variants encoding distinct isoforms have been described.
6442	
6443	
6444	
6445	
6446	
6447	
6448	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the mitochondrial inner membrane. The enzyme has steroid 18-hydroxylase activity to synthesize aldosterone and 18-oxocortisol as well as steroid 11 beta-hydroxylase activity. Mutations in this gene cause corticosterone methyl oxidase deficiency.
6449	
6450	Angiopoietins are proteins with important roles in vascular development and angiogenesis. All angiopoietins bind with similar affinity to an endothelial cell-specific tyrosine-protein kinase receptor. The protein encoded by this gene is a secreted glycoprotein that activates the receptor by inducing its tyrosine phosphorylation. It plays a critical role in mediating reciprocal interactions between the endothelium and surrounding matrix and mesenchyme. The protein also contributes to blood vessel maturation and stability, and may be involved in early development of the heart. Two transcript variants encoding different isoforms have been found for this gene.
6451	
6452	
6453	
6454	A large family of ATPases has been described, whose key feature is that they share a conserved region of about 220 amino acids that contains an ATP-binding site. The proteins that belong to this family either contain one or two AAA (ATPases Associated with diverse cellular Activities) domains. AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes. The protein encoded by this gene contains two AAA domains, as well as a bromodomain.
6455	
6456	Hook proteins are cytosolic coiled-coil proteins that contain conserved N-terminal domains, which attach to microtubules, and more divergent C-terminal domains, which mediate binding to organelles. The Drosophila Hook protein is a component of the endocytic compartment.[supplied by OMIM]
6457	This gene is thought to play an important role in calcium homeostasis. Alternative splicing of this gene results in five transcript variants which vary in protein translation, the coding of catalytic domains, and tissue expression. Variation among these transcripts impacts their functions which involve roles in the calcium storage and release process in the endoplasmic and sarcoplasmic reticulum as well as hydroxylation of aspartic acid and asparagine in epidermal growth factor-like domains of various proteins.
6458	This gene is related to the Wolf-Hirschhorn syndrome candidate-1 gene and encodes a protein with PWWP (proline-tryptophan-tryptophan-proline) domains. The function of the protein has not been determined. Two alternatively spliced variants have been described.
6459	This gene encodes a telomere specific protein which is a component of the telomere nucleoprotein complex. This protein is present at telomeres throughout the cell cycle and functions as an inhibitor of telomerase, acting in cis to limit the elongation of individual chromosome ends. The protein structure contains a C-terminal Myb motif, a dimerization domain near its N-terminus and an acidic N-terminus. Two transcripts of this gene are alternatively spliced products.
6460	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. This gene produces two transcripts, which encode a membrane-bound form and a soluble form of the protein. Both forms of the protein activate MMP2 by cleavage. This gene was once referred to as MT-MMP2, but was renamed as MT-MMP3 or MMP16.
6461	
6462	This gene encodes the class A macrophage scavenger receptors, which include three different types (1, 2, 3) generated by alternative splicing of this gene. These receptors or isoforms are macrophage-specific trimeric integral membrane glycoproteins and have been implicated in many macrophage-associated physiological and pathological processes including atherosclerosis, Alzheimer@#$%&s disease, and host defense. The isoforms type 1 and type 2 are functional receptors and are able to mediate the endocytosis of modified low density lipoproteins (LDLs). The isoform type 3 does not internalize modified LDL (acetyl-LDL) despite having the domain shown to mediate this function in the types 1 and 2 isoforms. It has an altered intracellular processing and is trapped within the endoplasmic reticulum, making it unable to perform endocytosis. The isoform type 3 can inhibit the function of isoforms type 1 and type 2 when co-expressed, indicating a dominant negative effect and suggesting a mechanism for regulation of scavenger receptor activity in macrophages.
6463	
6464	NFKB1 (MIM 164011) or NFKB2 (MIM 164012) is bound to REL (MIM 164910), RELA (MIM 164014), or RELB (MIM 604758) to form the NFKB complex. The NFKB complex is inhibited by I-kappa-B proteins (NFKBIA, MIM 164008, or NFKBIB, MIM 604495), which inactivate NF-kappa-B by trapping it in the cytoplasm. Phosphorylation of serine residues on the I-kappa-B proteins by kinases (IKBKA, MIM 600664, or IKBKB) marks them for destruction via the ubiquitination pathway, thereby allowing activation of the NF-kappa-B complex. Activated NFKB complex translocates into the nucleus and binds DNA at kappa-B-binding motifs such as 5-prime GGGRNNYYCC 3-prime or 5-prime HGGARNYYCC 3-prime (where H is A, C, or T; R is an A or G purine; and Y is a C or T pyrimidine).[supplied by OMIM]
6465	
6466	This gene encodes a member of the 1-acylglycerol-3-phosphate O-acyltransferase family. This integral membrane protein converts lysophosphatidic acid to phosphatidic acid, the second step in de novo phospholipid biosynthesis.
6467	
6468	The gene encodes a transcriptional regulator that belongs to the EGR family of C2H2-type zinc-finger proteins. It is an immediate-early growth response gene which is induced by mitogenic  stimulation. The protein encoded by this gene participates in the transcriptional regulation of genes in controling biological rhythm. It may also plays a role in muscle development.
6469	
6470	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form complex with G proteins and stimulate Rho-dependent signals.
6471	
6472	The protein encoded by this gene binds to an anti-proliferative protein, B-cell translocation protein 1, which negatively regulates cell proliferation. Binding of the two proteins, which is driven by phosphorylation of the anti-proliferative protein, causes signaling events in cell division that lead to changes in cell proliferation associated with cell-cell contact. The protein has both mouse and yeast orthologs. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
6473	This gene encodes a heterodimeric protein consisting of a nonglycosylated alpha subunit and a glycosylated beta subunit that is cleaved to the mature enzyme posttranslationally. The encoded protein catalyzes the synthesis and degradation of ceramide into sphingosine and fatty acid. Mutations in this gene have been associated with a lysosomal storage disorder known as Farber disease. Two transcript variants encoding distinct isoforms have been identified for this gene.
6474	
6475	This gene encodes a protein with slight similarity to members of the inhibitors of nuclear factor kappa B (IKB) family. The protein contains a region of ankyrin repeats, but its function is unknown. A second transcript variant has been described, but its full-length nature has not been determined.
6476	
6477	
6478	
6479	This gene encodes a member of the CHD family of proteins which are characterized by the presence of chromo (chromatin organization modifier) domains and SNF2-related helicase/ATPase domains. This protein is one of the components of a histone deacetylase complex referred to as the Mi-2/NuRD complex which participates in the remodeling of chromatin by deacetylating histones. Chromatin remodeling is essential for many processes including transcription. Autoantibodies against this protein are found in a subset of patients with dermatomyositis. Three alternatively spliced transcripts encoding different isoforms have been described.
6480	
6481	ASPN belongs to a family of leucine-rich repeat (LRR) proteins associated with the cartilage matrix. The name asporin reflects the unique aspartate-rich N terminus and the overall similarity to decorin (MIM 125255) (Lorenzo et al., 2001).[supplied by OMIM]
6482	
6483	
6484	Relaxins are known endocrine and autocrine/paracrine hormones, belonging to the insulin gene superfamily. In the human there are three non-allelic relaxin genes, RLN1, RLN2 and RLN3. RLN1 and RLN2 share high sequence homology. The active form of the encoded protein consists of an A chain and a B chain but their cleavage sites are not definitely described yet. Relaxin is produced by the ovary, and targets the mammalian reproductive system to ripen the cervix, elongate the pubic symphysis and inhibit uterine contraction. It may have additional roles in enhancing sperm motility, regulating blood pressure, controlling heart rate and releasing oxytocin and vasopressin. There are two alternatively spliced transcript variants encoding different isoforms described for this gene.
6485	
6486	
6487	The bifunctional enzyme, UDP-N-acetylglucosamine 2-epimerase (UDP-GlcNAc 2-epimerase/N-acetylmannosamine kinase) regulates and initiates biosynthesis of N-acetylneuraminic acid (NeuAc), a precursor of sialic acids. UDP-GlcNAc 2-epimerase activity is rate-limiting for the biosynthesis of sialic acid and is required for sialylation in hematopoietic cells. The activity of the enzyme can be controlled at the transcriptional level and can affect the sialylation and function of specific cell surface molecules expressed on B cells and myeloid cells. Modification of cell surface molecules with sialic acid is crucial for their function in many biologic processes, including cell adhesion and signal transduction. Differential sialylation of cell surface molecules is also implicated in the tumorigenicity and metastatic behavior of malignant cells. Sialuria is a rare inborn error of metabolism characterized by cytoplasmic accumulation and increased urinary excretion of free NeuAc.
6488	
6489	
6490	
6491	
6492	
6493	The PALM2-AKAP2 mRNA is a naturally occurring co-transcribed product of the neighboring PALM2 and AKAP2 genes. The significance of this co-transcribed mRNA and the function of its protein product have not yet been determined. Alternative splicing of this gene results in several transcript variants encoding different isoforms, but the full-length nature of some of these variants has not been defined.
6494	
6495	
6496	
6497	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies and its function has not been determined. Four alternatively spliced transcript variants for this gene have been described.
6498	This intronless gene belongs to the forkhead family of transcription factors, which is characterized by a distinct forkhead domain. This gene functions as a thyroid transcription factor which likely plays a crucial role in thyroid morphogenesis. Mutations in this gene are associated with congenital hypothyroidism and cleft palate with thyroid dysgenesis. The map localization of this gene suggests it may also be a candidate gene for squamous cell epithelioma and hereditary sensory neuropathy type I.
6499	This intronless gene belongs to the forkhead family of transcription factors, which is characterized by a distinct forkhead domain. This gene functions as a thyroid transcription factor which likely plays a crucial role in thyroid morphogenesis. Mutations in this gene are associated with congenital hypothyroidism and cleft palate with thyroid dysgenesis. The map localization of this gene suggests it may also be a candidate gene for squamous cell epithelioma and hereditary sensory neuropathy type I.
6500	The protein encoded by this gene is a member of the dual specificity protein tyrosine phosphatase family. This protein is highly similar to Saccharomyces cerevisiae Cdc14, a protein tyrosine phosphatase involved in the exit of cell mitosis and initiation of DNA replication, which suggests the role in cell cycle control. This protein has been shown to interact with and dephosphorylates tumor suppressor protein p53, and is thought to regulate the function of p53. Alternative splice of this gene results in 3 transcript variants encoding distinct isoforms.
6501	This gene was identified based on its homology to the gene encoding the RING3 protein, a serine/threonine kinase. The gene localizes to 9q34, a region which contains several major histocompatibility complex (MHC) genes. The function of the encoded protein is not known.
6502	
6503	
6504	
6505	This gene is located in the surfeit gene cluster, a group of very tightly linked genes that do not share sequence similarity. The gene demonstrates features of a housekeeping gene, being ubiquitously expressed, and the encoded protein has been localized to the nucleolus. The protein includes motifs found in both the mouse and fish orthologs, which suggests a putative function as a nucleolar-matrix protein with nucleic acid-binding properties, based on characteristics determined in mouse.
6506	
6507	The product of this gene belongs to the ficolin family of proteins. This family is characterized by the presence of a leader peptide, a short N-terminal segment, followed by a collagen-like region, and a C-terminal fibrinogen-like domain. This gene is predominantly expressed in the liver, and has been shown to have carbohydrate binding and opsonic activities. Alternatively spliced transcript variants encoding different isoforms have been identified.
6508	
6509	This gene encodes a member of the neurotrophic tyrosine receptor kinase (NTRK) family. This kinase is a membrane-bound receptor that, upon neurotrophin binding, phosphorylates itself and members of the MAPK pathway. Signalling through this kinase leads to cell differentiation. Mutations in this gene have been associated with obesity and mood disorders. Alternate transcriptional splice variants encoding different isoforms have been found for this gene, but only two of them have been characterized to date.
6510	The protein encoded by this gene is a member of the X11 protein family. It is a neuronal adaptor protein that interacts with the Alzheimer@#$%&s disease amyloid precursor protein (APP). It stabilises APP and inhibits production of proteolytic APP fragments including the A beta peptide that is deposited in the brains of Alzheimer@#$%&s disease patients. This gene product is believed to be involved in signal transduction processes. It is also regarded as a putative vesicular trafficking protein in the brain that can form a complex with the potential to couple synaptic vesicle exocytosis to neuronal cell adhesion.
6511	This gene lies adjacent to the tumor suppressor gene CDKN2A in a region that is frequently mutated and deleted in a wide variety of tumors. This gene encodes a cyclin-dependent kinase inhibitor, which forms a complex with CDK4 or CDK6, and prevents the activation of the CDK kinases, thus the encoded protein functions as a cell growth regulator that controls cell cycle G1 progression. The expression of this gene was found to be dramatically induced by TGF beta, which suggested its role in the TGF beta induced growth inhibition. Two alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported.
6512	The protein encoded by this gene is a member of the CARD protein family, which is defined by the presence of a characteristic caspase-associated recruitment domain (CARD). CARD is a protein interaction domain known to participate in activation or suppression of CARD containing members of the caspase family, and thus plays an important regulatory role in cell apoptosis. This protein was identified by its selective association with the CARD domain of BCL10, a postive regulator of apoptosis and NF-kappaB activation, and is thought to function as a molecular scaffold for the assembly of a BCL10 signaling complex that activates NF-kappaB. Several alternatively spliced transcript variants have been observed, but their full-length natures are not clearly defined.
6513	
6514	This gene encodes a protein that is highly similar to the yeast SIN1 protein, a stress-activated protein kinase. Alternatively spliced transcript variants encoding distinct isoforms have been described. Alternate polyadenylation sites as well as alternate 3@#$%& UTRs have been identified for transcripts of this gene.
6515	
6516	The Aspergillus nidulans @#$%&never in mitosis A@#$%& (NIMA) gene encodes a serine/threonine kinase that controls initiation of mitosis. NIMA-related kinases (NEKs) are a group of protein kinases that are homologous to NIMA. Evidence suggests that NEKs perform functions similar to those of NIMA.[supplied by OMIM]
6517	
6518	Phosducin-like protein is a putative modulator of heterotrimeric G proteins. The protein shares extensive amino acid sequence homology with phosducin, a phosphoprotein expressed in retina and pineal gland. Both phosducin-like protein and phosphoducin have been shown to regulate G-protein signaling by binding to the beta-gamma subunits of G proteins.
6519	Prostaglandin-endoperoxide synthase (PTGS), also known as cyclooxygenase, is the key enzyme in prostaglandin biosynthesis, and acts both as a dioxygenase and as a peroxidase. There are two isozymes of PTGS: a constitutive PTGS1 and an inducible PTGS2, which differ in their regulation of expression and tissue distribution. This gene encodes PTGS1, which regulates angiogenesis in endothelial cells, and is inhibited by nonsteroidal anti-inflammatory drugs such as aspirin. PTGS1 is thought to be involved in cell-cell signaling and maintaining tissue homeostasis. Alternative splicing of this gene generates two transcript variants. The expression of these two transcripts is differentially regulated by relevant cytokines and growth factors.
6520	The protein encoded by this gene is a member of the TNF receptor (TNFR) associated factor (TRAF) protein family. TRAF proteins associate with, and mediate the signal transduction from various receptors of the TNFR superfamily. This protein and TRAF2 form a heterodimeric complex, which is required for TNF-alpha-mediated activation of MAPK8/JNK and NF-kappaB. The protein complex formed by this protein and TRAF2 also interacts with inhibitor-of-apoptosis proteins (IAPs), and thus mediates the anti-apoptotic signals from TNF receptors. The expression of this protein can be induced by Epstein-Barr virus (EBV). EBV infection membrane protein 1 (LMP1) is found to interact with this and other TRAF proteins; this interaction is thought to link LMP1-mediated B lymphocyte transformation to the signal transduction from TNFR family receptors.
6521	
6522	This gene was isolated using microarray hybridization to screen for gene expression differences between fetal and adult testes. The conserved testis development-related genes found in both human and mouse testes may include genes that are likely to be involved in testicular functions, including spermatogenesis. This gene had higher expression in adults, compared to 6-month embryos. The specific function of this gene product has not been determined. Alternative splicing has been observed at this locus and three transcript variants, each encoding a distinct isoform, have been identified.
6523	
6524	
6525	This gene is a member of the neuronal calcium sensor gene family, which encode calcium-binding proteins expressed predominantly in neurons. The protein encoded by this gene regulates G protein-coupled receptor phosphorylation in a calcium-dependent manner and can substitute for calmodulin.  This protein is thought to be associated with secretory granules and may be involved in the regulation of neurosecretion.
6526	
6527	The catalytic conversion of UTP to CTP is accomplished by the enzyme cytidine-5-prime-triphosphate synthetase. The enzyme is important in the biosynthesis of phospholipids and nucleic acids, and plays a key role in cell growth, development, and tumorigenesis. The region to which the CTPS gene has been mapped is the location of breakpoints involved in several tumor types
6528	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L11P family of ribosomal proteins. It is located in the cytoplasm. The protein binds directly to the 26S rRNA. This gene is co-transcribed with the U65 snoRNA, which is located in its fourth intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
6529	Adenylate kinase is an enzyme involved in regulating the adenine nucleotide composition within a cell by catalyzing the reversible transfer of phosphate group among adinine nucleotides. Three isozymes of adenylate kinase have been identified in vertebrates, adenylate isozyme 1 (AK1), 2 (AK2) and 3 (AK3). AK1 is found in the cytosol of skeletal muscle, brain and erythrocytes, whereas AK2 and AK3 are found in the mitochondria of other tissues including liver and heart. AK1 was identified because of its association with a rare genetic disorder causing nonspherocytic hemolytic anemia where a mutation in the AK1 gene was found to reduce the catalytic activity of the enzyme. Alternative splicing of the AK1 gene yields two transcripts of 0.9 kb and 2.5 kb. The two transcripts differ in the 3-prime non-coding region and use two distinct polyadenylation signals.
6530	The protein encoded by this gene may control steps in the cycling of proteins through the trans-Golgi network to endosomes, lysosomes and the plasma membrane. Mutations in this gene cause the autosomal recessive disorder, chorea-acanthocytosis. Alternative splicing of this gene results in multiple transcript variants.
6531	
6532	
6533	
6534	This gene is a member of the melanoma antigen gene (MAGE) family. The proteins of this family are tumor-specific antigens that can be recognized by autologous cytolytic T lymphocytes. This protein contains a large number of unique short repetitive sequences in front of the MAGE-homologous sequence, and therefore is about 800 aa longer than the other MAGE proteins.
6535	
6536	
6537	
6538	
6539	Cytochrome b (-245) is composed of cytochrome b alpha (CYBA) and beta (CYBB) chain. It has been proposed as a primary component of the microbicidal oxidase system of phagocytes. CYBB deficiency is one of five described biochemical defects associated with chronic granulomatous disease (CGD). In this disorder, there is decreased activity of phagocyte NADPH oxidase; neutrophils are able to phagocytize bacteria but cannot kill them in the phagocytic vacuoles. The cause of the killing defect is an inability to increase the cell@#$%&s respiration and consequent failure to deliver activated oxygen into the phagocytic vacuole.
6540	
6541	
6542	Voltage-gated proton (hydrogen) channels play an important role in cellular defense against acidic stress. They are unique among ion channels with respect to their extremely high selectivity, marked temperature dependence, and unitary conductance, which is 3 orders of magnitude lower than that of most other ion channels. NOX1 is a homolog of the catalytic subunit of the superoxide-generating NADPH oxidase of phagocytes, gp91phox. Three splice variants of NOX1 have been identified, NOH-1L, NOH-1S and NOH-1Lv. The NOH-1S currents were reversibly blocked by zinc, a known H+ channel inhibitor. The NOH-1S variant does not contain an electron transport chain and it is thought that H+ conductance is its main physiologic function, whereas NOH-1L may conduct H+ ions as part of its electron transport mechanism.
6543	This gene encodes a zinc finger protein of unknown function. It bears similarity to a zinc finger protein which acts as a transcriptional activator. This gene lies in a region of the X chromosome which has been associated with mental retardation.
6544	
6545	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth, and invasion. This gene is located to a region associated with Borjeson-Forssman-Lehmann syndrome (BFLS), a syndromal X-linked mental retardation, which suggests it may be a candidate gene for familial cases of the BFL syndrome. The function of this gene has not yet been determined. Two alternatively spliced transcripts encoding different isoforms have been described for this gene.
6546	LIM proteins, named for @#$%&LIN11, ISL1, and MEC3,@#$%& are defined by the possession of a highly conserved double zinc finger motif called the LIM domain.[supplied by OMIM]
6547	
6548	This gene is a member of the MAGEB gene family. The members of this family have their entire coding sequences located in the last exon, and the encoded proteins show 50 to 68% sequence identity to each other. The promoters and first exons of the MAGEB genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. This gene is localized in the DSS (dosage-sensitive sex reversal) critical region, and expressed in testis and in a significant fraction of tumors of various histological types. This gene and other MAGEB members are clustered on chromosome Xp22-p21. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene, however, the full length nature of some variants has not been defined.
6549	Autophagy is the process by which endogenous proteins and damaged organelles are destroyed intracellularly. Autophagy is postulated to be essential for cell homeostasis and cell remodeling during differentiation, metamorphosis, non-apoptotic cell death, and aging. Reduced levels of autophagy have been described in some malignant tumors, and a role for autophagy in controlling the unregulated cell growth linked to cancer has been proposed. This gene encodes a member of the autophagin protein family. The encoded protein is also designated as a member of the C-54 family of cysteine proteases. Transcript variants that encode distinct isoforms have been identified.
6550	
6551	Adaptor protein complex 1 is found at the cytoplasmic face of coated vesicles located at the Golgi complex, where it mediates both the recruitment of clathrin to the membrane and the recognition of sorting signals within the cytosolic tails of transmembrane receptors. This complex is a heterotetramer composed of two large, one medium, and one small adaptin subunit. The protein encoded by this gene serves as the small subunit of this complex and is a member of the adaptin protein family. Transcript variants utilizing alternative polyadenylation signals exist for this gene.
6552	This gene encodes a protein required for synthesis of N-acetylglucosaminyl phosphatidylinositol (GlcNAc-PI), the first intermediate in the biosynthetic pathway of GPI anchor. The GPI anchor is a glycolipid found on many blood cells and which serves to anchor proteins to the cell surface. Paroxysmal nocturnal hemoglobinuria, an acquired hematologic disorder, has been shown to result from mutations in this gene. Three mRNA species have been described that result from alternative splicing of exon 2.
6553	This X-linked gene escapes X-inactivation and has a homolog on the Y chromosome. Homology to a Drosophila gene suggests that this gene encodes a ubiquitin-specific protease, which may regulate protein levels by removing ubiquitin from proteins marked for digestion by proteasomes. The protein may also be involved in oocyte proliferation. Two transcript variants for this gene exist.
6554	Proper cohesion of sister chromatids is a prerequisite for the correct segregation of chromosomes during cell division. The cohesin multiprotein complex is required for sister chromatid cohesion. This complex is composed partly of two structural maintenance of chromosomes (SMC) proteins, SMC3 and either SMC1L2 or the protein encoded by this gene. Most of the cohesin complexes dissociate from the chromosomes before mitosis, although those complexes at the kinetochore remain. Therefore, the encoded protein is thought to be an important part of functional kinetochores. In addition, this protein interacts with BRCA1 and is phosphorylated by ATM, indicating a potential role for this protein in DNA repair. This gene, which belongs to the SMC gene family, is located in an area of the X-chromosome that escapes X inactivation.
6555	ARHGAP6 is similar to the rhoGAP family of proteins which play a role in the regulation of actin polymerization at the plasma membrane of several cellular processes. Rho GTPase activating protein 6 is thought to have two independent functions, one as a GTPase-activating protein (GAP) with specificity for RhoA and another as a cytoskeletal protein that promotes actin remodeling. Rho GTPase activating protein 6 exists as five different splice variants through the use of alternatively spliced exons 1a, 1b, 11, and splicing of a cryptic splice donar site in exon 12.
6556	
6557	The protein encoded by this gene is a member of a family of membrane glycoproteins.  This glycoprotein provides selectins with carbohydrate ligands.  It may play a role in tumor cell metastasis.  It may also function in the protection, maintenance, and adhesion of the lysosome.  Alternative splicing of the gene produces two known products of unknown function.
6558	
6559	The product of this gene belongs to a subfamily of calcium binding proteins, which share similarity to calmodulin. Calcium binding proteins are an important component of calcium mediated cellular signal transduction. Expression of this gene is retina-specific. The mouse homolog of this protein has been shown to express in the inner nuclear layer of the retina, suggested its role in neuronal functioning. The specific function of this gene is unknown. Study of the transcripts and genomic structure revealed that the 5@#$%& end of this gene is complementary and reverse to that of CABP3 gene, and the sequence beyond the overlapping region is nearly identical to that of CABP3. Thus, these two genes encode the protein products with distinct N-terminal halves but identical C-terminal halves.
6560	Iduronate-2-sulfatase is required for the lysosomal degradation of heparan sulfate and dermatan sulfate. Mutations in this X-chromosome gene that result in enzymatic deficiency lead to the sex-linked Mucopolysaccharidosis Type II, also known as Hunter Syndrome. Iduronate-2-sulfatase has a strong sequence homology with human arylsulfatases A, B, and C, and human glucosamine-6-sulfatase. A splice variant of this gene has been described.
6561	
6562	Kinesins, such as KIF4A, are microtubule-based motor proteins that generate directional movement along microtubules. They are involved in many crucial cellular processes, including cell division (Zhu and Jiang, 2005).[supplied by OMIM]
6563	
6564	
6565	
6566	
6567	This gene is a member of the MAGEA gene family. The members of this family encode proteins with 50 to 80% sequence identity to each other. The promoters and first exons of the MAGEA genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. The MAGEA genes are clustered at chromosomal location Xq28. They have been implicated in some hereditary disorders, such as dyskeratosis congenita.
6568	The protein encoded by this gene is an isozyme of the long-chain fatty-acid-coenzyme A ligase family. Although differing in substrate specificity, subcellular localization, and tissue distribution, all isozymes of this family convert free long-chain fatty acids into fatty acyl-CoA esters, and thereby play a key role in lipid biosynthesis and fatty acid degradation. This isozyme preferentially utilizes arachidonate as substrate. The absence of this enzyme may contribute to the mental retardation or Alport syndrome. Alternative splicing of this gene generates 2 transcript variants.
6569	Calpains are ubiquitous, well-conserved family of calcium-dependent, cysteine proteases. The calpain proteins are heterodimers consisting of an invariant small subunit and variable large subunits. The large subunit possesses a cysteine protease domain, and both subunits possess calcium-binding domains. Calpains have been implicated in neurodegenerative processes, as their activation can be triggered by calcium influx and oxidative stress. The protein encoded by this gene is highly expressed in the placenta. Its C-terminal region lacks any homology to the calmodulin-like domain of other calpains. It also lacks critical active site residues and thus is suggested to be proteolytically inactive.
6570	This gene encodes disintegrin and metalloprotease (ADAM) domain 28, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. The member encoded by this gene is a lymphocyte-expressed ADAM protein. Alternative splicing of this gene generates two transcript variants. The shorter version encodes a secreted isoform, while the longer version encodes a transmembrane isoform.
6571	
6572	Cytoplasmic ribosomes, organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein, which shares sequence similarity with yeast ribosomal protein L44, belongs to the L44E (L36AE) family of ribosomal proteins. Although this gene has been referred to as ribosomal protein L44 (RPL44), its official name is ribosomal protein L36a (RPL36A). This gene and the human gene officially named ribosomal protein L36a-like (RPL36AL) encode nearly identical proteins; however, they are distinct genes. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
6573	This gene encodes a member of the ALEX family of proteins which may play a role in tumor suppression. The encoded protein contains a potential N-terminal transmembrane domain and a single Armadillo (arm) repeat. Other proteins containing the arm repeat are involved in development, maintenance of tissue integrity, and tumorigenesis. This gene is closely localized with other family members on the X chromosome. Three transcript variants encoding the same protein have been identified for this gene.
6574	This gene is similar to a mouse gene that encodes a TATA box binding protein-associated factor, and shows testis-specific expression.
6575	This translocase has similarity to yeast mitochondrial proteins that are involved in the import of metabolite transporters from the cytoplasm into the mitochondrial inner membrane. The gene is mutated in Deafness Dystonia Syndrome (MTS/DFN-1) and it is postulated that MTS/DFN-1 is a mitochondrial disease caused by a defective mitochondrial protein import system.
6576	This gene encodes coagulation factor VIII, which participates in the intrinsic pathway of blood coagulation; factor VIII is a cofactor for factor IXa which, in the presence of Ca+2 and phospholipids, converts factor X to the activated form Xa.  This gene produces two alternatively spliced transcripts. Transcript variant 1 encodes a large glycoprotein, isoform a, which circulates in plasma and associates with von Willebrand factor in a noncovalent complex. This protein undergoes multiple cleavage events. Transcript variant 2 encodes a putative small protein, isoform b, which consists primarily of the phospholipid binding domain of factor VIIIc. This binding domain is essential for coagulant activity. Defects in this gene results in hemophilia A, a common recessive X-linked coagulation disorder.
6577	This gene encodes a subunit of the BRCA1-BRCA2-containing complex (BRCC), which is an E3 ubiquitin ligase. This protein is also thought to be involved in the cellular response to ionizing radiation and progression through the G2/M checkpoint. Alternative splicing results in multiple transcript variants.
6578	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L10E family of ribosomal proteins. It is located in the cytoplasm. In vitro studies have shown that the chicken protein can bind to c-Jun and can repress c-Jun-mediated transcriptional activation, but these activities have not been demonstrated in vivo. This gene was initially identified as a candidate for a Wilms tumor suppressor gene, but later studies determined that this gene is not involved in the suppression of Wilms tumor. This gene has been referred to as @#$%&laminin receptor homolog@#$%& because a chimeric transcript consisting of sequence from this gene and sequence from the laminin receptor gene was isolated; however, it is not believed that this gene encodes a laminin receptor. Transcript variants utilizing alternative polyA signals exist. The variant with the longest 3@#$%& UTR overlaps the deoxyribonuclease I-like 1 gene on the opposite strand. This gene is co-transcribed with the small nucleolar RNA gene U70, which is located in its fifth intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
6579	The protein encoded by this gene shares significant sequence identity with the murine TSC-22 and Drosophila shs, both of which are leucine zipper proteins, that function as transcriptional regulators. The expression of this gene is stimulated by glucocorticoids and interleukin 10, and it appears to play a key role in the anti-inflammatory and immunosuppressive effects of this steroid and chemokine. Transcript variants encoding different isoforms have been identified for this gene.
6580	The pseudoautosomal region (PAR1) contains an interval of 170 kb that tends to be deleted in individuals with short stature and different rearrangements on Xp22 or Yp11.3. SHOX (Short stature HOmeoboX-containing gene) a homeo box-containing gene located in the PAR1, has at least 2 alternatively spliced forms encoding proteins with different patterns of expression. SHOX is involved in idiopathic growth retardation and in the short stature phenotype of Turner syndrome patients. The 2 mRNAs, SHOXa and SHOXb, code for proteins of 292 and 225 amino acids, respectively. Both transcripts have a common 5@#$%& end, diverging after exon IV. The SHOX gene is composed of 6 exons ranging in size from 58 bp to 1,146 bp. SHOX is highly conserved across species. The absence of the SHOX gene may be responsible for the growth failure in Turner syndrome females.
6581	
6582	The protein encoded by this gene is a member of the sulfatase family. Sulfatases are essential for the correct composition of bone and cartilage matrix. This encoded protein is postranslationally glycosylated and localized to the lysosome. Two splice variants of this gene have been identified.
6583	
6584	The CLCN family of voltage-dependent chloride channel genes comprises nine members (CLCN1-7, Ka and Kb) which demonstrate quite diverse functional characteristics while sharing significant sequence homology. Chloride channel 4 has an evolutionary conserved CpG island and is conserved in both mouse and hamster. This gene is mapped in close proximity to APXL (Apical protein Xenopus laevis-like) and OA1 (Ocular albinism type I), which are both located on the human X chromosome at band p22.3. The physiological role of chloride channel 4 remains unknown but may contribute to the pathogenesis of neuronal disorders.
6585	The protein encoded by this gene shares significant similarities with the apical protein from Xenopus laevis which is implicated in amiloride-sensitive sodium channel activity. This gene is a strong candidate gene for ocular albinism type 1 syndrome.
6586	
6587	This gene encodes an actin sequestering protein which plays a role in regulation of actin polymerization. The protein is also involved in cell proliferation, migration, and differentiation. This gene escapes X inactivation and has a homolog on chromosome Y.
6588	This gene encodes a nuclear protein and has similarity to drosophila male-specific lethal-3 gene. The drosophila protein plays a critical role in a dosage-compensation pathway, which equalizes X-linked gene expression in males and females. Thus this encoded protein is thought to play a similar function in chromatin remodeling and transcriptional regulation. This gene has been found to undergo X inactivation. There are four alternatively spliced transcript variants of this gene encoding different isoforms.
6589	
6590	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, and it has a homolog on the X chromosome. The gene mutation causes male infertility, Sertoli cell-only syndrome or severe hypospermatogenesis, suggesting that this gene plays a key role in the spermatogenic process. Alternative splicing of this gene generates 2 transcripts, which differ only in the length of the 3@#$%& UTR.
6591	
6592	This gene encodes a flavin adenine dinucleotide (FAD)-dependent oxidoreductase which catalyzes the reduction of the delta-24 double bond of sterol intermediates during cholesterol biosynthesis. The protein contains a leader sequence that directs it to the endoplasmic reticulum membrane. Missense mutations in this gene have been associated with desmosterolosis. Also, reduced expression of the gene occurs in the temporal cortex of Alzheimer disease patients and overexpression has been observed in adrenal gland cancer cells.
6593	
6594	The protein encoded by this gene belongs to the @#$%&regulator of G protein signaling@#$%& family. It inhibits signal transduction by increasing the GTPase activity of G protein alpha subunits. It also may play a role in regulating the kinetics of signaling in the phototransduction cascade.
6595	
6596	
6597	
6598	
6599	Tight junctions represent one mode of cell-to-cell adhesion in epithelial or endothelial cell sheets, forming continuous seals around cells and serving as a physical barrier to prevent solutes and water from passing freely through the paracellular space. The protein encoded by this immunoglobulin superfamily gene member is an important regulator of tight junction assembly in epithelia. In addition, the encoded protein can act as (1) a receptor for reovirus, (2) a ligand for the integrin LFA1, involved in leukocyte transmigration, and (3) a platelet receptor. Five transcript variants encoding two different isoforms have been found for this gene.
6600	
6601	
6602	
6603	This gene encodes a protein that contains a SET domain. SET domains appear to be protein-protein interaction domains that mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). The function of this gene has not been determined. Two alternatively spliced transcript variants have been found for this gene.
6604	
6605	Tight junctions represent one mode of cell-to-cell adhesion in epithelial or endothelial cell sheets, forming continuous seals around cells and serving as a physical barrier to prevent solutes and water from passing freely through the paracellular space. The protein encoded by this immunoglobulin superfamily gene member is localized in the tight junctions between high endothelial cells. Unlike other proteins in this family, the this protein is unable to adhere to leukocyte cell lines and only forms weak homotypic interactions. The encoded protein is a member of the junctional adhesion molecule protein family and acts as a receptor for another member of this family.
6606	
6607	
6608	The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is specifically expressed in the basal layer of the epidermis with family member KRT14. Mutations in these genes have been associated with a complex of diseases termed epidermolysis bullosa simplex. The type II cytokeratins are clustered in a region of chromosome 12q12-q13.
6609	Natural killer (NK) cells are lymphocytes that can mediate lysis of certain tumor cells and virus-infected cells without previous activation. They can also regulate specific humoral and  cell-mediated immunity. NK cells preferentially express several calcium-dependent (C-type) lectins, which have been implicated in the regulation of NK cell function. D12S2489E is a member of the NKG2 group which are expressed primarily in natural killer (NK) cells and encodes a family of transmembrane proteins characterized by a type II membrane orientation (extracellular C terminus) and the presence of a C-type lectin domain. The NKG2 gene family is located within the NK complex, a region that contains several C-type lectin genes preferentially expressed on NK cells. The first non-coding exon at the 5@#$%& end of the D12S2489E transcript is included in the 3@#$%& end of the KLRC4 transcript.
6610	
6611	
6612	
6613	
6614	
6615	This gene encodes a protein that contains two WD domains and an FYVE zinc finger region. The function of this gene is unknown. An alternatively spliced transcript variant of this gene may exist.
6616	
6617	This gene encodes coagulation factor VII which is a vitamin K-dependent factor essential for hemostasis. This factor circulates in the blood in a zymogen form, and is converted to an active form by either factor IXa, factor Xa, factor XIIa, or thrombin by minor proteolysis. Upon activation of the factor VII, a heavy chain containing a catalytic domain and a light chain containing 2 EGF-like domains are generated, and two chains are held together by a disulfide bond. In the presence of factor III and calcium ions, the activated factor then further activates the coagulation cascade by converting factor IX to factor IXa and/or factor X to factor Xa. Alternative splicing of this gene results in 2 transcripts. Defects in this gene can cause coagulopathy.
6618	
6619	The protein encoded by this gene belongs to the family of latent transforming growth factor (TGF)-beta binding proteins (LTBP), which are extracellular matrix proteins with multi-domain structure. This protein is the largest member of the LTBP family possessing unique regions and with most similarity to the fibrillins. It has thus been suggested that it may have multiple functions: as a member of the TGF-beta latent complex, as a structural component of microfibrils, and a role in cell adhesion.
6620	
6621	
6622	
6623	
6624	
6625	This gene encodes a key enzyme in catabolism of quinolinate, an intermediate in the tryptophan-nicotinamide adenine dinucleotide pathway. Quinolinate acts as a most potent endogenous exitotoxin to neurons. Elevation of quinolinate levels in the brain has been linked to the pathogenesis of neurodegenerative disorders such as epilepsy, Alzheimer@#$%&s disease, and Huntington@#$%&s disease.
6626	
6627	
6628	
6629	Several proteins have been found to be prenylated and methylated at their carboxyl-terminal ends. Prenylation was initially believed to be important only for membrane attachment. However, another role for prenylation appears to be its importance in protein-protein interactions. The only nuclear proteins known to be prenylated in mammalian cells are prelamin A- and B-type lamins. Prelamin A is farnesylated and carboxymethylated on the cysteine residue of a carboxyl-terminal CaaX motif. This post-translationally modified cysteine residue is removed from prelamin A when it is endoproteolytically processed into mature lamin A. The protein encoded by this gene binds to the prenylated prelamin A carboxyl-terminal tail domain. It may be a component of a prelamin A endoprotease complex. The encoded protein is located in the nucleus, where it partially colocalizes with the nuclear lamina. It shares limited sequence similarity with iron-only bacterial hydrogenases. Transcript variants encoding different isoforms and/or utilizing alternative polyadenylation sites exist.
6630	
6631	
6632	
6633	ITGAE encodes integrin alpha chain E. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. This I-domain containing alpha integrin undergoes post-translational cleavage in the extracellular domain that yields disulfide-linked heavy and light chains, In combination with beta 7 (ITGB7) these chains form the E-cadherin binding integrin known as @#$%&human mucosal lymphocyte-1@#$%& antigen (HML-1). This integrin is preferentially expressed on human intestinal intraepithelial lymphocytes (IEL), and in addition to adhesion, it may serve as an accessory molecule for IEL activation.
6634	
6635	This gene encodes a member of the membrane-associated guanylate kinase (MAGUK) family. It heteromultimerizes with DLG2. With DLG2 it is recruited into the same NMDA receptor and potassium channel clusters. These two MAGUK proteins may interact at postsynaptic sites to form a multimeric scaffold for the clustering of receptors, ion channels, and associated signaling proteins.
6636	This gene encodes a protein involved in G protein-mitogen-activated protein kinase (MAPK) signaling cascades. When overexpressed in mammalian cells, this gene could potently suppress a RAS- and MAPK-mediated signal and interfere with JNK activity, suggesting that the function of this gene may be signal repression. The encoded protein is an integral subunit of the NCOR1-HDAC3 (nuclear receptor corepressor 1-histone deacetylase 3) complex, and it was shown that the complex inhibits JNK activation through this subunit and thus could potentially provide an alternative mechanism for hormone-mediated antagonism of AP1 (activator protein 1) function.
6637	
6638	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Sequence analysis identified two transcript variants that encode different isoforms. A pseudogene corresponding to this gene is found on chromosome 5q.
6639	
6640	Nuclear factor of activated T-cells (NFAT) is a transcription factor required for T-cell expression of interleukin 2. NFAT binds to a sequence in the IL2 enhancer known as the antigen receptor response element 2. In addition, NFAT can bind RNA and is an essential component for encapsidation and protein priming of hepatitis B viral polymerase. NFAT is a heterodimer of 45 kDa and 90 kDa proteins, the larger of which is the product of this gene. The encoded protein, which is primarily localized to ribosomes, probably regulates transcription at the level of mRNA elongation. At least three transcript variants encoding three different isoforms have been found for this gene.
6641	
6642	
6643	
6644	
6645	
6646	
6647	
6648	
6649	
6650	Integrins are heterodimeric proteins made up of alpha and beta subunits. At least 18 alpha and 8 beta subunits have been described in mammals. Integrin family members are membrane receptors involved in cell adhesion and recognition in a variety of processes including embryogenesis, hemostasis, tissue repair, immune response and metatastatic diffusion of tumor cells. The protein encoded by this gene is a beta subunit. Six alternatively spliced variants have been found for this gene which encode five proteins with alternate carboxy termini.
6651	The Fos gene family consists of 4 members: FOS, FOSB, FOSL1, and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation.
6652	
6653	This gene encodes a C2H2 type zinc-finger protein by its similarity to the mouse Bcl11a/Evi9 protein. The corresponding mouse gene is a common site of retroviral integration in myeloid leukemia, and may function as a leukemia disease gene, in part, through its interaction with BCL6. During hematopoietic cell differentiation, this gene is down-regulated. It is possibly involved in lymphoma pathogenesis since translocations associated with B-cell malignancies also deregulates its expression. Five alternatively spliced transcript variants of this gene, which encode distinct isoforms, have been reported.
6654	This gene encodes a protein that functions as a translation initiation factor and is universally conserved. It encodes one of the two proteins identified as translation initiation factors, the other factor being a complex composed of three subunits. These translation factors act independently of each other, however, their function is the same: to position the initiator methionine tRNA on the start codon of the mRNA in association with the 40S ribosomal subunit and GTP so that translation initiates accurately.
6655	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Sequence analysis identified at least three transcript variants. Pseudogenes corresponding to this gene are found on chromosomes 6p and 12p.
6656	
6657	This gene belongs to the tumor necrosis factor receptor superfamily. The encoded protein is postulated to play a regulatory role in suppressing FasL- and LIGHT-mediated cell death. It acts as a decoy receptor that competes with death receptors for ligand binding. Overexpression of this gene has been noted in gastrointestinal tract tumors, and it is located in a gene-rich cluster on chromosome 20, with other potentially tumor-related genes. Two transcript variants encoding the same isoform, but differing in the 5@#$%& UTR, have been observed for this gene.
6658	This gene encodes a member of the WAP-type four-disulfide core (WFDC) domain family. Most WFDC proteins contain only one WFDC domain, and this encoded protein contains two WFDC domains. The WFDC domain, or WAP signature motif, contains eight cysteines forming four disulfide bonds at the core of the protein, and functions as a protease inhibitor. Most WFDC gene members are localized to chromosome 20q12-q13 in two clusters: centromeric and telomeric. This gene belongs to the centromeric cluster.
6659	
6660	
6661	
6662	
6663	This gene encodes a protein that has sequence similarity to zeta crystallin, also known as quinone oxidoreductase. This zeta crystallin-like protein also contains an NAD(P)H binding site. The specific function of this protein is not known. Alternatively spliced variants that encode different protein isoforms have been identified but the full-length nature of only some of them is known. This gene also seems to contain a rare AT-AC splice-site.
6664	The protein encoded by this gene is an acyltransferase that converts lysophosphatidic acid into phosphatidic acid, which is the second step in the de novo phospholipid biosynthetic pathway. The encoded protein may be an integral membrane protein.
6665	This gene encodes a G-protein beta-subunit-like polypeptide which is a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein contains 6 WD repeats and is highly expressed in the heart. The gene maps to the region on chromosome 22q11, which is deleted in DiGeorge syndrome, trisomic in derivative 22 syndrome and tetrasomic in cat-eye syndrome. Therefore, this gene may contribute to the etiology of those disorders.
6666	The protein encoded by this gene is a GTPase which belongs to the RAS superfamily of small GTP-binding proteins. Members of this superfamily appear to regulate a diverse array of cellular events, including the control of cell growth, cytoskeletal reorganization, and the activation of protein kinases.
6667	The protein encoded by this gene is a GTPase which belongs to the RAS superfamily of small GTP-binding proteins. Members of this superfamily appear to regulate a diverse array of cellular events, including the control of cell growth, cytoskeletal reorganization, and the activation of protein kinases.
6668	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and splicesosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which is an ATPase activated by a variety of RNA species but not by dsDNA. This protein and that encoded by DDX5 gene are more closely related to each other than to any other member of the DEAD box family. Alternative splicing of this gene generates 2 transcript variants encoding different isoforms with the longer transcript reported to also initiate translation at a non-AUG (CUG) start site.
6669	
6670	This gene encodes a protein that contains a tetratricopeptide repeat domain. The encoded protein also interacts with the rotavirus non-structural protein NSP3.
6671	
6672	
6673	The nuclear import of karyophilic proteins is directed by short amino acid sequences termed nuclear localization signals (NLSs). Karyopherins, or importins, are cytoplasmic proteins that recognize NLSs and dock NLS-containing proteins to the nuclear pore complex. The protein encoded by this gene shares the sequence similarity with Xenopus importin-alpha and Saccharomyces cerevisiae Srp1. This protein is found to interact with the NLSs of DNA helicase Q1 and SV40 T antigen.
6674	
6675	
6676	
6677	
6678	This gene is a member of a multi-gene family which shares strong similarity with the Drosophila dishevelled gene, dsh. The Drosophila dishevelled gene encodes a cytoplasmic phosphoprotein that regulates cell proliferation.
6679	
6680	
6681	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members bind heparin and possess broad mitogenic and angiogenic activities. This protein has been implicated in diverse biological processes, such as limb and nervous system development, wound healing, and tumor growth. The mRNA for this gene contains multiple polyadenylation sites, and is alternatively translated from AUG and non-AUG (CUG) initiation codons resulting in five different isoforms with distinct properties. The CUG-initiated isoforms are localized in the nucleus and are responsible for the intracrine effect, whereas, the AUG-initiated form is mostly cytosolic and is responsible for the paracrine and autocrine effects of this FGF.
6682	
6683	This gene is a member of the protocadherin alpha gene cluster, one of three related gene clusters tandemly linked on chromosome five that demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The alpha gene cluster is composed of 15 cadherin superfamily genes related to the mouse CNR genes and consists of 13 highly similar and 2 more distantly related coding sequences. The tandem array of 15 N-terminal exons, or variable exons, are followed by downstream C-terminal exons, or constant exons, which are shared by all genes in the cluster. The large, uninterrupted N-terminal exons each encode six cadherin ectodomains while the C-terminal exons encode the cytoplasmic domain. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins that most likely play a critical role in the establishment and function of specific cell-cell connections in the brain. Alternative splicing has been observed and additional variants have been suggested but their full-length nature has yet to be determined.
6684	
6685	
6686	
6687	This gene encodes a calcium-binding protein belonging to the penta-EF-hand protein family. Calcium binding is important for homodimerization and for conformational changes required for binding to other protein partners. This gene product participates in T cell receptor-,  Fas-, and glucocorticoid-induced programmed cell death. In mice deficient for this gene product, however, apoptosis was not blocked suggesting this gene product is functionally redundant.
6688	
6689	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. This structure consists of approximately 146 bp of DNA wrapped around a nucleosome, an octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6p22-p21.3.
6690	
6691	
6692	
6693	
6694	
6695	Clathrin is the main structural component of the lattice covering the cytoplasmic face of the coated pits and coated vesicles in which specific macromolecules are entrapped in the process of receptor-mediated endocytosis.  Clathrin is a large, soluble protein composed of heavy chains and light chains.  Two genes, CLTA and CLTB encode the clathrin light chain proteins. The light chains are believed to be regulatory elements in clathrin function.  The brain-specific light chain variant contains an insert ecoding 30 amino acids immediately downstream of the region encoding the heavy chain binding domain.
6696	This gene product shares extensive sequence similarity with the rat neuronal olfactomedin-related ER localized protein. While the exact function of the encoded protein is not known, its abundant expression in brain suggests that it may have an essential role in nerve tissue. Several alternatively spliced transcripts encoding different isoforms have been found for this gene.
6697	
6698	
6699	
6700	This gene encodes a protein with 3@#$%& exonuclease activity. Enzymes with this activity are involved in DNA replication, repair, and recombination. Similarity to an E. coli protien suggests that this enzyme may be a subunit of DNA polymerase III, which does not have intrinsic exonuclease activity.
6701	Alpha-1-adrenergic receptors (alpha-1-ARs) are members of the G protein-coupled receptor superfamily. They activate mitogenic responses and regulate growth and proliferation of many cells. There are 3 alpha-1-AR subtypes: alpha-1A, -1B and -1D, all of which signal through the Gq/11 family of G-proteins and different subtypes show different patterns of activation. This gene encodes alpha-1B-adrenergic receptor, which induces neoplastic transformation when transfected into NIH 3T3 fibroblasts and other cell lines. Thus, this normal cellular gene is identified as a protooncogene. This gene comprises 2 exons and a single large intron of at least 20 kb that interrupts the coding region.
6702	This gene belongs to a class of genes that are activated in tumors. These genes are expressed in tumors of different histologic types but not in normal tissues, except for spermatogenic cells and, for some, placenta. The proteins encoded by these genes appear to be strictly tumor specific, and hence may be excellent sources of antigens for cancer immunotherapy. This gene is expressed in sarcomas.
6703	A human melanoma-associated chondroitin sulfate proteoglycan plays a role in stabilizing cell-substratum interactions during early events of melanoma cell spreading on endothelial basement membranes. CSPG4 represents an integral membrane chondroitin sulfate proteoglycan expressed by human malignant melanoma cells.
6704	
6705	Oxysterols regulate cholesterol homeostasis through liver X receptor (LXR) and sterol regulatory element-binding protein (SREBP) mediated signaling pathway. This gene is an insulin-induced gene. It encodes an endoplasmic reticulum (ER) membrane protein that plays a critical role in regulating cholesterol concentrations in cells. This protein binds to the sterol-sensing domains of SREBP cleavage-activating protein (SCAP) and HMG CoA reductase, and is essential for the sterol-mediated trafficking of the two proteins. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
6706	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
6707	
6708	This gene encodes a cytoplasmic protein tyrosine kinase which is found concentrated in the focal adhesions that form between cells growing in the presence of extracellular matrix constituents. The encoded protein is a member of the FAK subfamily of protein tyrosine kinases but lacks significant sequence similarity to kinases from other subfamilies. Activation of this gene may be an important early step in cell growth and intracellular signal transduction pathways triggered in response to certain neural peptides or to cell interactions with the extracellular matrix. At least four transcript variants encoding four different isoforms have been found for this gene, but the full-length natures of only two of them have been determined.
6709	
6710	Huntington@#$%&s disease (HD), a neurodegenerative disorder characterized by loss of striatal neurons, is caused by an expansion of a polyglutamine tract in the HD protein huntingtin. This gene encodes a protein that interacts with huntingtin, with two cytoskeletal proteins (dynactin and pericentriolar autoantigen protein 1), and with a hepatocyte growth factor-regulated tyrosine kinase substrate. The interactions with cytoskeletal proteins and a kinase substrate suggest a role for this protein in vesicular trafficking or organelle transport. Two transcripts encoding different isoforms have been completely described. Although additional isoforms have been reported, the full-length nature of the transcripts encoding them has not been defined.
6711	The protein encoded by this gene is a zinc finger protein present in the basal cell layer of the epidermis and in hair follicles. It is also found in abundance in the germ cells of testis and ovary. This protein is thought to play a regulatory role in keratinocyte proliferation and it may also be a regulator for rRNA transcription. This gene seems to have multiple alternatively spliced transcript variants, but their full-length nature is not known yet. There seems to be evidence of multiple polyadenylation sites for this gene.
6712	Mice homozygous for the rd mutation display hereditary retinal degeneration which has been considered a model for human retinitis pigmentosa. In affected animals, the retinal rod photoreceptor cells begin degenerating at about postnatal day 8, and by 4 weeks no photoreceptors are left. Farber and Lolley (1974, 1976) showed that degeneration is preceded by accumulation of cyclic GMP in the retina and is correlated with deficient activity of the rod photoreceptor cGMP-phosphodiesterase. Bennett et al. (1996) tested the possibility of altering the course of retinal degeneration through subretinal injection of recombinant replication defective adenovirus that contained the murine cDNA for wildtype beta-PDE. Subretinal injection of rd mice was carried out 4 days after birth, before the onset of rd retinal degeneration. Following therapy, beta-PDE transcripts and enzyme activity were detected, and histologic studies revealed that photoreceptor cell death was significantly retarded.[supplied by OMIM]
6713	
6714	
6715	
6716	Sodium bicarbonate cotransporters (NBCs) mediate the coupled movement of sodium and bicarbonate ions across the plasma membrane of many cells. This is an electrogenic process with an apparent stoichiometry of 3 bicarbonate ions per sodium ion. Sodium bicarbonate cotransport is involved in bicarbonate secretion/absorption and intracellular pH regulation. Romero and Boron (1999) reviewed NBCs. Soleimani and Burnham (2000) reviewed NBCs and their regulation in physiologic and pathophysiologic states.[supplied by OMIM]
6717	
6718	Myosin is a hexameric ATPase cellular motor protein. It is composed of two heavy chains, two nonphosphorylatable alkali light chains, and two phosphorylatable regulatory light chains. This gene encodes a myosin alkali light chain expressed in both slow-twitch skeletal muscle and in nonmuscle tissue.
6719	Natural killer (NK) cells are lymphocytes that can mediate lysis of certain tumor cells and virus-infected cells without previous activation. They can also regulate specific humoral and cell-mediated immunity. NK cells preferentially express several calcium-dependent (C-type) lectins, which have been implicated in the regulation of NK cell function. The group, designated KLRC (NKG2) are expressed primarily in natural killer (NK) cells and encodes a family of transmembrane proteins characterized by a type II membrane orientation (extracellular C terminus) and the presence of a C-type lectin domain. The KLRC (NKG2) gene family is located within the NK complex, a region that contains several C-type lectin genes preferentially expressed on NK cells. KLRC2 alternative splice variants have been described but their full-length nature has not been determined.
6720	
6721	
6722	
6723	
6724	
6725	
6726	This intronless gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. The protein may function in the developing nervous system and play a role in tumorigenesis.
6727	
6728	
6729	
6730	
6731	The RP2 locus has been implicated as one cause of X-linked retinitis pigmentosa. The predicted gene product shows homology with human cofactor C, a protein involved in the ultimate step of beta-tubulin folding. Progressive retinal degeneration may therefore be due to the accumulation of incorrectly-folded photoreceptor or neuron-specific tubulin isoforms followed by progressive cell death
6732	Properdin (factor P) is a plasma protein that is active in the alternative complement pathway of the innate immune system. It is a positive regulatory factor that binds to many microbial surfaces to stabilize the C3b,Bb convertase. The C3b,Bb convertase then rapidly cleaves more C3 to C3b, which acts either as an opsonin or to reinitiate the pathway in an amplification loop that proceeds on the bacterial cell, but not on the host cell (Janeway et al., 2001). In the alternative pathway, C3 is thus activated through factor B, factor D, and properdin P, under the control of factors I and H (Fearon and Austen, 1980). Deficiency of properdin is associated in particular with a heightened susceptibility to Neisseria species (Janeway et al., 2001).[supplied by OMIM]
6733	
6734	The protein encoded by this gene is a member of the S-adenosylmethionine-binding protein family. It is a nucleolar protein and may be involved in the processing and modification of rRNA. Three alternatively spliced transcript variants encoding different isoforms have been described for this gene.
6735	Members of the subtilisin-like proprotein convertase family process latent precursor proteins into their biologically active products. The protein encoded by this gene appears to function as an endogenous inhibitor of proprotein convertase subtilisin/kexin type 1.
6736	
6737	The protein encoded by this gene is similar to an iron transport protein found in mouse. The mouse protein is similar to ceruloplasmin, a serum multi-copper ferroxidase, and is thought to be a membrane-bound protein responsible for transport of dietary iron from epithelial cells of the intestinal lumen into the circulatory system. In mouse, defects in this gene can lead to severe microcytic anemia. Two transcript variants encoding different isoforms have been described for this gene.
6738	The choroideremia gene encodes for a protein, the Rab escort protein-1 (REP1), which is involved in membrane trafficking.[supplied by OMIM]
6739	This gene encodes a member of the epidermal growth factor (EGF) repeat superfamily. Members of this superfamily are characterized by the presence of EGF-like repeats and are often involved in the regulation of cell cycle, proliferation, and developmental processes. The gene product contains a signal peptide, suggesting that it is secreted; an EGF repeat region consisting of 4 complete EGF-like repeats and 1 partial EGF-like repeat, 3 of which have a calcium-binding consensus sequence; an arg-gly-asp integrin association motif; and a MAM domain, which is believed to have an adhesive function. This gene is expressed early during development, and its expression has been detected in lung and meningioma tumors.
6740	This gene is a member of the septin family of GTPases. Members of this family are required for cytokinesis. One version of pediatric acute myeloid leukemia is the result of a reciprocal translocation between chromosomes 11 and X, with the breakpoint associated with the genes encoding the mixed-lineage leukemia and septin 2 proteins. This gene encodes four transcript variants encoding three distinct isoforms. An additional transcript variant has been identified, but its biological validity has not been determined.
6741	
6742	
6743	The cysteinyl leukotrienes LTC4, LTD4, and LTE4 are important mediators of human bronchial asthma. Pharmacologic studies have determined that cysteinyl leukotrienes activate at least 2 receptors, the protein encoded by this gene and CYSLTR2. This encoded receptor is a member of the superfamily of G protein-coupled receptors. Activation of this receptor by LTD4 results in contraction and proliferation of smooth muscle, oedema, eosinophil migration and damage to the mucus layer in the lung.
6744	
6745	
6746	This gene encodes a member of the ALEX family of proteins and may play a role in tumor suppression. The encoded protein contains a potential N-terminal transmembrane domain and two Armadillo (arm) repeats. Other proteins containing the arm repeat are involved in development, maintenance of tissue integrity, and tumorigenesis. This gene is closely localized with other family members, including ALEX2 and ALEX3, on the X chromosome.
6747	
6748	
6749	
6750	
6751	This gene is a member of the GAGE family, which is expressed in a variety of tumors and in some fetal and reproductive tissues. The protein encoded by this gene shares a sequence similarity with other GAGE/PAGE proteins. It may also belong to a family of CT (cancer-testis) antigens. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants have not been determined.
6752	The pseudoautosomal region (PAR1) contains an interval of 170 kb that tends to be deleted in individuals with short stature and different rearrangements on Xp22 or Yp11.3. SHOX (Short stature HOmeoboX-containing gene) a homeo box-containing gene located in the PAR1, has at least 2 alternatively spliced forms encoding proteins with different patterns of expression. SHOX is involved in idiopathic growth retardation and in the short stature phenotype of Turner syndrome patients. The 2 mRNAs, SHOXa and SHOXb, code for proteins of 292 and 225 amino acids, respectively. Both transcripts have a common 5@#$%& end, diverging after exon IV. The SHOX gene is composed of 6 exons ranging in size from 58 bp to 1,146 bp. SHOX is highly conserved across species. The absence of the SHOX gene may be responsible for the growth failure in Turner syndrome females.
6753	
6754	
6755	
6756	
6757	Heterotrimeric guanine nucleotide-binding proteins (G proteins), which integrate signals between receptors and effector proteins, are composed of an alpha, a beta, and a gamma subunit. These subunits are encoded by families of related genes. This gene encodes a beta subunit. Beta subunits are important regulators of alpha subunits, as well as of certain signal transduction receptors and effectors. This gene uses alternative polyadenylation signals.
6758	The protein encoded by this gene is a DNA helicase. A dominant-negative mutant of this protein blocks chromosomal DNA replication and suggests that its function is required for S phase entry.
6759	
6760	
6761	
6762	The protein encoded by this gene is one subunit of a trimeric complex, forming a highly conserved transcription factor that binds with high specificity to CCAAT motifs in the promoter regions in a variety of genes. This gene product, subunit C, forms a tight dimer with the B subunit, a prerequisite for subunit A association. The resulting trimer binds to DNA with high specificity and affinity. Subunits B and C each contain a histone-like motif. Observation of the histone nature of these subunits is supported by two types of evidence; protein sequence alignments and experiments with mutants. Additional regulation, preliminarily supported by the EST database, may be represented by alternative splicing in this subunit.
6763	
6764	
6765	
6766	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in the inhibition of casein kinase and altered expression of this protein is associated with the disease cystic fibrosis.
6767	
6768	
6769	The protein encoded by this gene belongs to a small class of the protein tyrosine phosphatase (PTP) family. PTPs are cell signaling molecules that play regulatory roles in a variety of cellular processes. PTPs in this class contain a protein tyrosine phosphatase catalytic domain and a characteristic C-terminal prenylation motif. This PTP has been shown to primarily associate with plasmic and endosomal membrane through its C-terminal prenylation. This PTP was found to interact with the beta-subunit of Rab geranylgeranyltransferase II (beta GGT II), and thus may function as a regulator of GGT II activity. Overexpression of this gene in mammalian cells conferred a transformed phenotype, which suggested its role in tumorigenesis. Three alternatively spliced transcript variants, which encode two distinct isoforms, have been described.
6770	
6771	
6772	
6773	This gene encodes a member of the Argonaute family of proteins which play a role in RNA interference. The encoded protein is highly basic containing PAZ and PIWI domains, and it may play a role in short-interfering-RNA-mediated gene silencing. This gene is located on chromosome 1 in a cluster of closely related family members including argonaute 3, and eukaryotic translation initiation factor 2C, 1.
6774	Heparan sulfate proteoglycan is a major component of basement membranes, where the molecule may be involved in the stabilization of other molecules as well as being involved with glomerular permeability to macromolecules and cell adhesion. This form of HSPG, known as HSPG2 or perlecan, is encoded by a gene that maps to chromosome 1. The gene for the form of HSPG associated with the cell surface of fibroblasts has been mapped to human chromosome 8 (MIM 142460).[supplied by OMIM]
6775	
6776	MAD2L2 is a component of the mitotic spindle assembly checkpoint that prevents the onset of anaphase until all chromosomes are properly aligned at the metaphase plate.  MAD2L2 is a homolog of MAD2L1.
6777	
6778	This gene is a member of a group of genes whose transcript levels are increased following stressful growth arrest conditions and treatment with DNA-damaging agents. The protein encoded by this gene responds to environmental stresses by mediating activation of the p38/JNK pathway via MTK1/MEKK4 kinase. The DNA damage-induced transcription of this gene is mediated by both p53-dependent and -independent mechanisms.
6779	
6780	
6781	This gene encodes a member of the retinoid X receptor (RXR) family of nuclear receptors which are involved in mediating the antiproliferative effects of retinoic acid (RA). This receptor forms dimers with the retinoic acid, thyroid hormone, and vitamin D receptors, increasing both DNA binding and transcriptional function on their respective response elements. This gene is expressed at significantly lower levels in non-small cell lung cancer cells. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
6782	This gene encodes a member of the arm-repeat (armadillo) and plakophilin gene families. Plakophilin proteins contain numerous armadillo repeats, localize to cell desmosomes and nuclei, and participate in linking cadherins to intermediate filaments in the cytoskeleton. This protein may be involved in molecular recruitment and stabilization during desmosome formation. Mutations in this gene have been associated with the ectodermal dysplasia/skin fragility syndrome.
6783	Metabolic N-oxidation of the diet-derived amino-trimethylamine (TMA) is mediated by flavin-containing monooxygenase and is subject to an inherited FMO3 polymorphism in man resulting in a small subpopulation with reduced TMA N-oxidation capacity resulting in fish odor syndrome Trimethylaminuria. Three forms of the enzyme, FMO1 found in fetal liver, FMO2 found in adult liver, and FMO3 are encoded by genes clustered in the 1q23-q25 region.   Flavin-containing monooxygenases are NADPH-dependent flavoenzymes that catalyzes the oxidation of soft nucleophilic heteroatom centers in drugs, pesticides, and xenobiotics.
6784	This gene encodes a protein with 5@#$%& to 3@#$%& exonuclease activity as well as an RNase H activity. It is similar to the Saccharomyces cerevisiae protein Exo1 which interacts with Msh2 and which is involved in mismatch repair and recombination. Alternative splicing of this gene results in three transcript variants encoding two different isoforms.
6785	This gene encodes a nuclear protein containing 10 WD repeats. WD repeats are approximately 30- to 40-amino acid domains containing several conserved residues, which usually include a trp-asp at the C-terminal end. Proteins belonging to the WD repeat family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
6786	This gene is a necessary element in receptor tyrosine kinase pathways, possibly as a tyrosine phosphorylation target.  It is involved in regulation of RAF in the MAPK pathway and may also play a role in a MAPK-independent pathway.
6787	The gene variously symbolized ALL1, HRX, or MLL located on 11q23 has been demonstrated to be fused with a number of translocation partners in cases of leukemia. t(1;11)(q21;q23) translocations that fused the MLL gene to a gene on chromosomal band 1q21, AF1Q, in 2 infants with acute myelomonocytic leukemia have been demonstrated. The N-terminal portion of the MLL gene is critical for leukemogenesis in translocations involving band 11q23. The AF1Q cDNA sequence encodes 90 amino acids. The predicted wildtype AF1Q product was a 9-kD protein with no similarity to any other protein in the data banks. It was found to be highly expressed in the thymus but not in peripheral lymphoid tissues. In contrast to its restricted distribution in normal hematopoietic tissue, AF1Q was expressed in all leukemic cell lines tested.
6788	
6789	
6790	In a subset of papillary renal cell carcinomas, a t(X;1)(p11;q21) chromosome translocation has been repeatedly reported and is thought to be the cause of the cancer. As a result of the translocation, the transcription factor TFE3 on the X chromosome becomes fused to this gene on chromosome 1. The fused gene results in the fusion of N-terminal proline-rich region of the protein encoded by this gene to the entire TFE3 protein. This protein has been shown to interact with the mitotic checkpoint protein MAD2B, which suggests that the dominant-negative effect of the fusion protein with TFE3 may lead to a mitotic checkpoint defect. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
6791	SEMA4A is a member of the semaphorin family of soluble and transmembrane proteins. Semaphorins are involved in guidance of axonal migration during neuronal development and in immune responses.[supplied by OMIM]
6792	This gene belongs to the homeobox family, members of which encode a highly conserved family of transcription factors that play important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, consisting of 9 to 11 genes arranged in tandem, and located on different chromosomes. This gene is one of several HOXC genes located in a cluster on chromosome 12. Three genes, HOXC5, HOXC4 and HOXC6, are co-transcribed from a primary transcript. Subsequent processing results in gene-specific transcripts, which sometimes share the 5@#$%& non-coding exon. The product of this gene may contribute to the breast cell phenotype through co-operative interactions with other HOX-derived proteins and repression of their target genes. Two alternatively spliced transcripts encoding different isoforms have been characterized to date.
6793	
6794	
6795	
6796	The protein encoded by this gene is thought to be a transcriptional corepressor. However, molecules that interact with this protein have not yet been identified. The protein is a member of the tripartite motif family. This motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. Three alternatively spliced transcript variants for this gene have been described, however, the full length nature of one variant has not been determined.
6797	Heterochromatin protein-1 (HP1) is a methyl-lysine binding protein localized at heterochromatin sites, where it mediates gene silencing.[supplied by OMIM]
6798	
6799	
6800	
6801	The protein encoded by this gene binds CTNNB1 and prevents interaction between CTNNB1 and TCF family members. The encoded protein is a negative regulator of the Wnt signaling pathway. Two transcript variants encoding the same protein have been found for this gene.
6802	
6803	Amylases are secreted proteins that hydrolyze 1,4-alpha-glucoside bonds in oligosaccharides and polysaccharides, and thus catalyze the first step in digestion of dietary starch and glycogen. The human genome has a cluster of several amylase genes that are expressed at high levels in either salivary gland or pancreas. This gene encodes an amylase isoenzyme produced by the salivary gland. Alternative splicing results in multiple transcript variants encoding the same protein.
6804	
6805	
6806	Mucolipins constitute a family of cation channel proteins with homologs in mouse, Drosophila, and C. elegans. Mutations in the human MCOLN1 gene (MIM 605248) cause mucolipodosis IV (MIM 262650).[supplied by OMIM]
6807	The protein encoded by this gene regulates the turnover of diphosphoinositol polyphosphates. The turnover of these high-energy diphosphoinositol polyphosphates represents a molecular switching activity with important regulatory consequences. Molecular switching by diphosphoinositol polyphosphates may contribute to regulating intracellular trafficking. Several alternatively spliced transcript variants have been described, but the full-length nature of some variants has not been determined. Isoforms DIPP2alpha and DIPP2beta are distinguishable from each other solely by DIPP2beta possessing one additional amino acid due to intron boundary skidding in alternate splicing.
6808	
6809	
6810	
6811	
6812	
6813	
6814	
6815	
6816	MYOC encodes the protein myocilin, which is believed to have a role in cytoskeletal function. MYOC is expressed in many occular tissues, including the trabecular meshwork, and was revealed to be the trabecular meshwork glucocorticoid-inducible response protein (TIGR). The trabecular meshwork is a specialized eye tissue essential in regulating intraocular pressure, and mutations in MYOC have been identified as the cause of hereditary juvenile-onset open-angle glaucoma.
6817	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This gene encodes one of three G subunit proteins. Transcript variants encoding different isoforms have been found for this gene.
6818	
6819	Elastases form a subfamily of serine proteases that hydrolyze many proteins in addition to elastin. Humans have six elastase genes which encode the structurally similar proteins elastase 1, 2, 2A, 2B, 3A, and 3B. Unlike other elastases, pancreatic elastase 1 is not expressed in the pancreas. To date, elastase 1 expression has only been detected in skin keratinocytes. Clinical literature that describes human elastase 1 activity in the pancreas is actually referring to elastase 2A.
6820	This gene encodes a member of the leucine-rich repeat-containing subgroup of the G protein-coupled 7-transmembrane protein superfamily. The encoded protein is a glycoprotein hormone receptor with a large N-terminal extracellular domain that contains leucine-rich repeats important for the formation of a horseshoe-shaped interaction motif for ligand binding. Alternative splicing of this gene results in multiple transcript variants.
6821	This gene encodes a member of the leucine-rich repeat-containing subgroup of the G protein-coupled 7-transmembrane protein superfamily. The encoded protein is a glycoprotein hormone receptor with a large N-terminal extracellular domain that contains leucine-rich repeats important for the formation of a horseshoe-shaped interaction motif for ligand binding. Alternative splicing of this gene results in multiple transcript variants.
6822	
6823	
6824	
6825	This gene encodes a dual serine/threonine and tyrosine protein kinase which is expressed in multiple tissues. Multiple alternatively spliced transcript variants have been found, but the biological validity of some variants has not been determined.
6826	The calpains, calcium-activated neutral proteases, are nonlysosomal, intracellular cysteine proteases. The mammalian calpains include ubiquitous, stomach-specific, and muscle-specific proteins. The ubiquitous enzymes consist of heterodimers with distinct large, catalytic subunits associated with a common small, regulatory subunit. This gene encodes the large subunit of the ubiquitous enzyme, calpain 2. Multiple heterogeneous transcriptional start sites in the 5@#$%& UTR have been reported.
6827	
6828	This gene was identified by its oncogenic transforming activity in cells. The encoded protein is a member of the serine/threonine protein kinase family. This kinase can activate both the MAP kinase and JNK kinase pathways. This kinase was shown to activate IkappaB kinases, and thus induce the nuclear production of NF-kappaB. This kinase was also found to promote the production of TNF-alpha and IL-2 during T lymphocyte activation. Studies of a similar gene in rat suggested the direct involvement of this kinase in the proteolysis of NF-kappaB1,p105 (NFKB1). This gene may also utilize a downstream in-frame translation start codon, and thus produce an isoform containing a shorter N-terminus. The shorter isoform has been shown to display weaker transforming activity.
6829	
6830	
6831	The RAS oncogene (MIM 190020) is mutated in nearly one-third of all human cancers. Members of the RAS superfamily are plasma membrane GTP-binding proteins that modulate intracellular signal transduction pathways. A subfamily of RAS effectors, including RASSF3, share a RAS association (RA) domain.[supplied by OMIM]
6832	
6833	
6834	MAT1A encodes methionine adenosyltransferase I (alpha isoform). MATIA catalyzes the formation of S-adenosylmethionine from methionine and ATP. Both the beta and alpha isoforms may be encoded by MATIA. Methionine adenosyltransferase deficiency is known to be caused by recessive as well as dominant mutations, the latter identified in autosomal dominant persistant hypermethioninemia.
6835	
6836	
6837	
6838	
6839	
6840	This gene encodes a serine protease involved in degradation of the extracellular matrix and possibly tumor cell migration and proliferation. A specific polymorphism in this gene may be associated with late-onset Alzheimer disease and also with decreased affinity for fibrin-binding. The protein encoded by this gene converts plasminogen to plasmin by specific cleavage of an Arg-Val bond in plasminogen. This gene@#$%&s proprotein is cleaved at a Lys-Ile bond by plasmin to form a two-chain derivative in which a single disulfide bond connects the amino-terminal A-chain to the catalytically active, carboxy-terminal B-chain. This two-chain derivative is also called HMW-uPA (high molecular weight uPA). HMW-uPA can be further processed into LMW-uPA (low molecular weight uPA) by cleavage of chain A into a short chain A (A1) and an amino-terminal fragment. LMW-uPA is proteolytically active but does not bind to the uPA receptor.
6841	This gene encodes a protein that is a member of the dickkopf family. It is a secreted protein with two cysteine rich regions and is involved in embryonic development through its inhibition of the WNT signaling pathway. Elevated levels of DKK1 in bone marrow plasma and peripheral blood is associated with the presence of osteolytic bone lesions in patients with multiple myeloma.
6842	This gene belongs to a family of genes that function as receptors for tachykinins.  Receptor affinities are specified by variations in the 5@#$%&-end of the sequence.  The receptors belonging to this family are characterized by interactions with G proteins and 7 hydrophobic transmembrane regions.  This gene encodes the receptor for the tachykinin neuropeptide substance K, also referred to as neurokinin A.
6843	
6844	
6845	
6846	
6847	
6848	X-prolyl aminopeptidase (EC 3.4.11.9) is a proline-specific metalloaminopeptidase that specifically catalyzes the removal of any unsubstituted N-terminal amino acid that is adjacent to a penultimate proline residue. Because of its specificity toward proline, it has been suggested that X-prolyl aminopeptidase is important in the maturation and degradation of peptide hormones, neuropeptides, and tachykinins, as well as in the digestion of otherwise resistant dietary protein fragments, thereby complementing the pancreatic peptidases. Deficiency of X-prolyl aminopeptidase results in excretion of large amounts of imino-oligopeptides in urine (Blau et al., 1988).[supplied by OMIM]
6849	This gene encodes a member of the matrix metalloproteinase (MMP) family. Proteins in this family are involved in the breakdown of extracellular matrix for both normal physiological processes, such as embryonic development, reproduction, and tissue remodeling,and disease processes, such as asthma and metastasis. The specific function of this protein has not been determined.
6850	
6851	
6852	
6853	
6854	
6855	
6856	
6857	
6858	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 nonidentical kinase catalytic domains and phosphorylates the S6 ribosomal protein and eucaryotic translation initiation factor 4B (eIF4B). Phosphorylation of S6 leads to an increase in protein synthesis and cell proliferation. Alternate transcriptional splice variants encoding different isoforms have been found for this gene, but only two of them have been characterized to date.
6859	
6860	Zinc is an essential cofactor for hundreds of enzymes. It is involved in protein, nucleic acid, carbohydrate, and lipid metabolism, as well as in the control of gene transcription, growth, development, and differentiation. SLC39A13 belongs to a subfamily of proteins that show structural characteristics of zinc transporters (Taylor and Nicholson, 2003).[supplied by OMIM]
6861	L-glutamate is the major excitatory neurotransmitter in the central nervous system and activates both ionotropic and metabotropic glutamate receptors.  Glutamatergic neurotransmission is involved in most aspects of normal brain function and can be perturbed in many neuropathologic conditions. The metabotropic glutamate receptors are a family of G protein-coupled receptors, that have been divided into 3 groups on the basis of sequence homology, putative signal transduction mechanisms, and pharmacologic properties.  Group I includes GRM1 and GRM5 and these receptors have been shown to activate phospholipase C.  Group II includes GRM2 and GRM3 while Group III includes GRM4, GRM6, GRM7 and GRM8.  Group II and III receptors are linked to the inhibition of the cyclic AMP cascade but differ in their agonist selectivities.  Alternative splice variants of GRM8 have been described but their full-length nature has not been determined.
6862	Otubains are deubiquitylating cysteine proteases (DUBs; see MIM 602519) that belong to the ovarian tumor (OTU) protein superfamily. Like other DUBs, otubains cleave proteins precisely at the ubiquitin (UB; see MIM 191339)-protein bond.[supplied by OMIM]
6863	
6864	
6865	
6866	Cofilin is a widely distributed intracellular actin-modulating protein that binds and depolymerizes filamentous F-actin and inhibits the polymerization of monomeric G-actin in a pH-dependent manner. It is involved in the translocation of actin-cofilin complex from cytoplasm to nucleus.[supplied by OMIM]
6867	Cathepsins are papain family cysteine proteinases that represent a major component of the lysosomal proteolytic system. Cathepsins generally contain a signal sequence, followed by a propeptide and then a catalytically active mature region. The very long (251 amino acid residues) proregion of the cathepsin F precursor contains a C-terminal domain similar to the pro-segment of cathepsin L-like enzymes, a 50-residue flexible linker peptide, and an N-terminal domain predicted to adopt a cystatin-like fold. The cathepsin F proregion is unique within the papain family cysteine proteases in that it contains this additional N-terminal segment predicted to share structural similarities with cysteine protease inhibitors of the cystatin superfamily. This cystatin-like domain contains some of the elements known to be important for inhibitory activity. CTSF encodes a predicted protein of 484 amino acids which contains a 19 residue signal peptide. Cathepsin F contains five potential N-glycosylation sites, and it may be targeted to the endosomal/lysosomal compartment via the mannose 6-phosphate receptor pathway. The cathepsin F gene is ubiquitously expressed, and it maps to chromosome 11q13, close to the gene encoding cathepsin W.
6868	
6869	NGL1 is a specific binding partner for netrin G1 (NTNG1; MIM 608818), which is a member of the netrin family of axon guidance molecules (Lin et al., 2003).[supplied by OMIM]
6870	
6871	This gene encodes a member of the CSRP family of LIM domain proteins, which may be involved in regulatory processes important for development and cellular differentiation. The LIM/double zinc-finger motif found in this protein is found in a group of proteins with critical functions in gene regulation, cell growth, and somatic differentiation. Mutations in this gene are thought to cause heritable forms of hypertrophic cardiomyopathy (HCM) and dilated cardiomyopathy (DCM) in humans.
6872	This gene encodes a protein that is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes composed of a prodomain and a large and small protease subunit. Activation of caspases requires proteolytic processing at conserved internal aspartic residues to generate a heterodimeric enzyme consisting of the large and small subunits. This caspase is able to cleave and activate its own, as well as caspase 1, precursor proteins. When overexpressed, this gene has been shown to induce cell apoptosis. Alternative splicing results in 3 transcript variants encoding distinct isoforms.
6873	This gene encodes a mitochondrially localized enzyme that catalyzes the reversible formation of acetoacetyl-CoA from two molecules of acetyl-CoA. This gene spans approximately 27 kb and contains 12 exons interrupted by 11 introns. Defects in this gene are associated with the alpha-methylacetoaceticaciduria disorder, an inborn error of isoleucine catabolism characterized by urinary excretion of 2-methyl-3-hydroxybutyric acid, 2-methylacetoacetic acid, tiglylglycine, and butanone.
6874	
6875	
6876	
6877	Guanylyl cyclase C (GCC, or GUCY2C; MIM 601330) produces cGMP following the binding of either endogenous ligands or heat-stable enterotoxins secreted by E. coli and other enteric bacteria. Activation of GCC initiates a signaling cascade that leads to phosphorylation of the cystic fibrosis transmembrane conductance regulator (CFTR; MIM 602421), followed by a net efflux of ions and water into the intestinal lumen. IKEPP is a regulatory protein that associates with GCC and regulates the amount of cGMP produced following receptor stimulation.[supplied by OMIM]
6878	The product encoded by this gene belongs to the heat shock protein 70 family which contains both heat-inducible and constitutively expressed members. The latter are called heat-shock cognate proteins. This gene encodes a heat-shock cognate protein. This protein binds to nascent polypeptides to facilitate correct folding. It also functions as an ATPase in the disassembly of clathrin-coated vesicles during transport of membrane components through the cell. Two alternatively spliced variants have been characterized to date.
6879	
6880	
6881	The protein encoded by this gene is similar to thymocyte protein Thy28, a highly conserved protein identified in vertebrates and plants. This protein has been implicated in the apoptosis of lymphocytes. Three alternatively spliced transcript variants encoding two distinct isoforms have been observed.
6882	
6883	Oxysterol binding protein is an intracellular protein that is believed to transport sterols from lysosomes to the nucleus where the sterol down-regulates the genes for the LDL receptor, HMG-CoA reductase, and HMG synthetase
6884	This gene encodes a protein which is highly similar to the rat Grp78-binding protein (GBP). Two transcript variants which encode proteins that differ by 1 aa have been identified.
6885	
6886	
6887	
6888	
6889	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S15P family of ribosomal proteins. It is located in the cytoplasm. The protein has been shown to bind to the 5.8S rRNA in rat. The gene product of the E. coli ortholog (ribosomal protein S15) functions at early steps in ribosome assembly. This gene is co-transcribed with two U14 small nucleolar RNA genes, which are located in its third and fifth introns. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
6890	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. This protein functions as a modulator of ATP-sensitive potassium channels and insulin release. Mutations and deficiencies in this protein have been observed in patients with hyperinsulinemic hypoglycemia of infancy, an autosomal recessive disorder of unregulated and high insulin secretion. Mutations have also been associated with non-insulin-dependent diabetes mellitus type II, an autosomal dominant disease of defective insulin secretion. Alternative splicing of this gene has been observed; however, the transcript variants have not yet been fully described.
6891	
6892	
6893	The protein encoded by this gene is a hormone secreted by parathyroid cells. This hormone elevates blood Ca2+ level by dissolving the salts in bone and preventing their renal excretion. Defects in this gene are a cause of familial isolated hypoparathyroidism (FIH).
6894	
6895	The protein encoded by this gene is a cell-surface glycoprotein involved in cell-cell interactions, cell adhesion and migration. It is a receptor for hyaluronic acid (HA) and can also interact with other ligands, such as osteopontin, collagens, and matrix metalloproteinases (MMPs). This protein participates in a wide variety of cellular functions including lymphocyte activation, recirculation and homing, hematopoiesis, and tumor metastasis. Transcripts for this gene undergo complex alternative splicing that results in many functionally distinct isoforms, however, the full length nature of some of these variants has not been determined. Alternative splicing is the basis for the structural and functional diversity of this protein, and may be related to tumor metastasis.
6896	This gene was identified by its ability to suppress the tumorigenicity of Hela cells in nude mice. The protein encoded by this gene contains a C-terminal region that shares similarity with the Rab 3 family of small GTP binding proteins. This protein preferentially binds to the SH3 domain of c-Abl kinase, and acts as a regulator of MAPK1/ERK2 kinase, which may contribute to its ability to reduce the tumorigenic phenotype in cells. Three alternatively spliced transcript variants of this gene encoding distinct isoforms are identified.
6897	
6898	The product of this gene belongs to the family of G-protein coupled receptors. This family has several receptor subtypes with different pharmacological selectivity, which overlaps in some cases, for various adenosine and uridine nucleotides. This receptor is responsive to both adenosine and uridine nucleotides. It may participate in control of the cell cycle of endometrial carcinoma cells. Three transcript variants encoding the same protein have been identified for this gene.
6899	
6900	
6901	Prenyltransferases attach either a farnesyl group or a geranylgeranyl group in thioether linkage to the cysteine residue of protein@#$%&s with a C-terminal CAAX box. CAAX geranylgeranyltransferase and CAAX farnesyltransferase are heterodimers that share the same alpha subunit but have different beta subunits. This gene encodes the alpha subunit of these transferases. Alternative splicing results in multiple transcript variants encoding different isoforms.
6902	
6903	
6904	Ubiquitin (MIM 191339), a highly conserved protein involved in the regulation of intracellular protein breakdown, cell cycle regulation, and stress response, is released from degraded proteins by disassembly of the polyubiquitin chains. The disassembly process is mediated by ubiquitin-specific proteases (USPs). Also see USP1 (MIM 603478).[supplied by OMIM]
6905	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is thought to be involved in growth-related cellular processes. This gene is associated with tumorigenesis and osteosarcoma.
6906	The protein encoded by this gene is a member of the Ser/Thr protein kinase family. This protein is highly similar to the gene products of S. cerevisiae cdc28 and S. pombe cdc2. It is a catalytic subunit of the protein kinase complex that is important for cell cycle G1 phase progression. The activity of this kinase is restricted to the G1-S phase, which is controlled by the regulatory subunits D-type cyclins and CDK inhibitor p16(INK4a). This kinase was shown to be responsible for the phosphorylation of retinoblastoma gene product (Rb). Mutations in this gene as well as in its related proteins including D-type cyclins, p16(INK4a) and Rb were all found to be associated with tumorigenesis of a variety of cancers. Multiple polyadenylation sites of this gene have been reported.
6907	
6908	
6909	Complex II of the respiratory chain, which is specifically involved in the oxidation of succinate, carries electrons from FADH to CoQ.  The complex is composed of four nuclear-encoded subunits and is localized in the mitochondrial inner membrane.  The 70 kDa flavoprotein subunit contains a characteristic cysteine triplet.  Mutations in the flavoprotein subunit gene have been linked to a form of mitochondrial respiratory chain deficiency known as Leigh syndrome.
6910	This gene encodes one of the selenoproteins containing selenocysteine at the active site. The selenocysteine is encoded by the nonsense (stop) codon TGA. This gene is expressed at low levels in a broad range of human tissues and organs.
6911	The protein encoded by this gene contains a RING finger, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions.
6912	
6913	PAH encodes the enzyme phenylalanine hydroxylase that is the rate-limiting step in phenylalanine catabolism. Deficiency of this enzyme activity results in the autosomal recessive disorder phenylketonuria.
6914	
6915	
6916	
6917	
6918	Homogentisate 1,2-dioxygenase (HGD) gene mutations are the molecular cause of alkaptonuria, a rare hereditary disorder of the phenylalanine catabolism. The highest expression of HGD is in the prostate, small intestine, colon, and liver. The HGD gene contains 14 exons. Conflicting reports have placed the gene at 3q2, 3q13.3-q21, 3q21-q24, 3q21-q23, or 3q25-q26.
6919	The replication of DNA in eukaryotic cells is carried out by a complex chromosomal replication apparatus, in which DNA polymerase alpha and primase are two key enzymatic components. Primase, which is a heterodimer of a small subunit and a large subunit, synthesizes small RNA primers for the Okazaki fragments made during discontinuous DNA replication. The protein encoded by this gene is the small, 49 kDa primase subunit.
6920	
6921	Hexokinases phosphorylate glucose to produce glucose-6-phosphate, thus committing glucose to the glycolytic pathway. Alternative splicing of this gene results in three tissue-specific forms of glucokinase, one found in pancreatic islet beta cells and two found in liver. The protein localizes to the outer membrane of mitochondria. In contrast to other forms of hexokinase, this enzyme is not inhibited by its product glucose-6-phosphate but remains active while glucose is abundant. Mutations in this gene have been associated with non-insulin dependent diabetes mellitus (NIDDM), also called maturity onset diabetes of the young, type 2 (MODY2); mutations have also been associated with persistent hyperinsulinemic hypoglycemia of infancy (PHHI).
6922	
6923	This gene encodes a protein which is similar in structure to hyaluronidases. Hyaluronidases intracellularly degrade hyaluronan, one of the major glycosaminoglycans of the extracellular matrix. Hyaluronan is thought to be involved in cell proliferation, migration and differentiation. However, this protein has not yet been shown to have hyaluronidase activity.
6924	
6925	
6926	
6927	
6928	The protein encoded by this gene belongs to a large family of proteins defined by an alpha/beta hydrolase fold, and contains three sequence motifs that correspond to a catalytic triad found in the esterase/lipase/thioesterase subfamily. It differs from other members of this subfamily in that its putative catalytic triad contains an asparagine instead of the serine residue. Mutations in this gene have been associated with Chanarin-Dorfman syndrome, a triglyceride storage disease with impaired long-chain fatty acid oxidation.
6929	The protein encoded by this gene has both oxidoreductase and epimerase activities and is involved in androgen catabolism. The oxidoreductase activity can convert 3 alpha-adiol to dihydrotestosterone, while the epimerase activity can convert androsterone to epi-androsterone. Both reactions use NAD+ as the preferred cofactor. This gene is a member of the retinol dehydrogenase family. Transcript variants utilizing alternative polyadenylation signals exist.
6930	
6931	
6932	The product of this gene belongs to the serine/threonine protein kinase family, and to the Ca(2+)/calmodulin-dependent protein kinase subfamily. This enzyme is a multifunctional serine/threonine protein kinase with limited tissue distribution, that has been implicated in transcriptional regulation in lymphocytes, neurons and male germ cells.
6933	HERC6 belongs to the HERC family of ubiquitin ligases, all of which contain a HECT domain and at least 1 RCC1 (MIM 179710)-like domain (RLD). The 350-amino acid HECT domain is predicted to catalyze the formation of a thioester with ubiquitin before transferring it to a substrate, and the RLD is predicted to act as a guanine nucleotide exchange factor for small G proteins (Hochrainer et al., 2005).[supplied by OMIM]
6934	HERC6 belongs to the HERC family of ubiquitin ligases, all of which contain a HECT domain and at least 1 RCC1 (MIM 179710)-like domain (RLD). The 350-amino acid HECT domain is predicted to catalyze the formation of a thioester with ubiquitin before transferring it to a substrate, and the RLD is predicted to act as a guanine nucleotide exchange factor for small G proteins (Hochrainer et al., 2005).[supplied by OMIM]
6935	
6936	
6937	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H1 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3.
6938	
6939	
6940	
6941	
6942	Dolichol-phosphate mannose (Dol-P-Man) serves as a donor of mannosyl residues on the lumenal side of the endoplasmic reticulum (ER). Lack of Dol-P-Man results in defective surface expression of GPI-anchored proteins. Dol-P-Man is synthesized from GDP-mannose and dolichol-phosphate on the cytosolic side of the ER by the enzyme dolichyl-phosphate mannosyltransferase. The protein encoded by this gene is a subunit of dolichyl-phosphate mannosyltransferase and acts as a stabilizer subunit of the dolichyl-phosphate mannosyltransferase complex.
6943	
6944	The VG5Q gene encodes a potent angiogenic factor that contains a forkhead-associated domain and a G-patch domain.[supplied by OMIM]
6945	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role in cells. The protein encoded by this gene is one of the PKC family members. This kinase has been shown to be involved in many different cellular functions, such as neuron channel activation, apoptosis, cardioprotection from ischemia, heat shock response, as well as insulin exocytosis. Knockout studies in mice suggest that this kinase is important for lipopolysaccharide (LPS)-mediated signaling in activated macrophages and may also play a role in controlling anxiety-like behavior.
6946	The protein encoded by this gene is a member of a superfamily of WD repeat proteins.  The function of this protein is not yet determined; however, other related family members perform functions involved in regulatory processes.
6947	
6948	Plastins are a family of actin-binding proteins that are conserved throughout eukaryote evolution and expressed in most tissues of higher eukaryotes. In humans, two ubiquitous plastin isoforms (L and T) have been identified. Plastin 1 (otherwise known as Fimbrin) is a third distinct plastin isoform which is specifically  expressed at high levels in the small intestine. The L isoform is expressed only in hemopoietic cell lineages, while the T isoform has been found in all other normal cells of solid tissues that have replicative potential (fibroblasts, endothelial cells, epithelial cells, melanocytes, etc.). The C-terminal 570 amino acids of the T-plastin and L-plastin proteins are 83% identical. It contains a potential calcium-binding site near the N terminus.
6949	
6950	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit. This gene is located in the class II region of the MHC (major histocompatibility complex). Expression of this gene is induced by gamma interferon and this gene product replaces catalytic subunit 3 (proteasome beta 5 subunit) in the immunoproteasome. Proteolytic processing is required to generate a mature subunit. Two alternative transcripts encoding two isoforms have been identified; both isoforms are processed to yield the same mature subunit.
6951	
6952	This gene encodes a member of the glutathione peroxidase family. Glutathione peroxidase functions in the detoxification of hydrogen peroxide, and is one of the most important antioxidant enzymes in humans. It has been reported that the protein encoded by this gene protects from CD95-induced apoptosis in cultured breast cancer cells and inhibits 5-lipoxygenase in blood cells, and its overexpression delays endothelial cell growth and increases resistance to toxic challenges. This protein is one of only a few proteins known in higher vertebrates to contain selenocysteine, which occurs at the active site of glutathione peroxidase and is coded by the nonsense (stop) codon TGA. In addition, this protein is characterized in a polyalanine sequence polymorphism in the N-terminal region, which includes three alleles with five, six or seven alanine (ALA) repeats in this sequence. The allele with five ALA repeats is significantly associated with breast cancer risk. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
6953	
6954	
6955	The WNK1 gene encodes a cytoplasmic serine-threonine kinase expressed in distal nephron.[supplied by OMIM]
6956	
6957	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that has had confusing nomenclature in the literature. Pseudogenes corresponding to this gene are found on chromosomes 3p, 5q, and 10q.
6958	
6959	
6960	Sialyltransferases catalyze the transfer of sialic acid from cytidine 5-prime monophospho-N-acetylneuraminic acid (CMP-NeuAc) to terminal positions of glycoprotein and glycolipid carbohydrate groups. Terminal NeuAc residues are key determinants of carbohydrate structures, such as the sialyl-Lewis X determinants, and are widely distributed in many cell types.[supplied by OMIM]
6961	This gene encodes a member of the RecQ subfamily and the DEAH (Asp-Glu-Ala-His) subfamily of DNA and RNA helicases. DNA helicases are involved in many aspects of DNA metabolism, including transcription, replication, recombination, and repair. This protein contains a nuclear localization signal in the C-terminus and shows a predominant nucleolar localization. It possesses an intrinsic 3@#$%& to 5@#$%& DNA helicase activity, and is also a 3@#$%& to 5@#$%& exonuclease. Based on interactions between this protein and Ku70/80 heterodimer in DNA end processing, this protein may be involved in the repair of double strand DNA breaks. Defects in this gene are the cause of werner syndrome, an autosomal recessive disorder characterized by premature aging.
6962	
6963	
6964	
6965	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
6966	This gene encodes an ubiquitin-like protein (ubiquilin) that shares high degree of similarity with related products in yeast, rat and frog. Ubiquilins contain a N-terminal ubiquitin-like domain and a C-terminal ubiquitin-associated domain. They physically associate with both proteasomes and ubiquitin ligases, and thus are thought to functionally link the ubiquitination machinery to the proteasome to affect in vivo protein degradation. This ubiquilin has also been shown to modulate accumulation of presenilin proteins, and is found in lesions associated with Alzheimer@#$%&s and Parkinson@#$%&s disease. Two transcript variants encoding different isoforms have been found for this gene.
6967	This gene encodes an ubiquitin-like protein (ubiquilin) that shares high degree of similarity with related products in yeast, rat and frog. Ubiquilins contain a N-terminal ubiquitin-like domain and a C-terminal ubiquitin-associated domain. They physically associate with both proteasomes and ubiquitin ligases, and thus are thought to functionally link the ubiquitination machinery to the proteasome to affect in vivo protein degradation. This ubiquilin has also been shown to modulate accumulation of presenilin proteins, and is found in lesions associated with Alzheimer@#$%&s and Parkinson@#$%&s disease. Two transcript variants encoding different isoforms have been found for this gene.
6968	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This gene encodes one of two V1 domain C subunit proteins.
6969	
6970	The diastrophic dysplasia sulfate transporter is a transmembrane glycoprotein implicated in the pathogenesis of several human chondrodysplasias. It apparently is critical in cartilage for sulfation of proteoglycans and matrix organization.
6971	
6972	
6973	
6974	
6975	
6976	This gene encodes an immunoregulatory cytokine produced primarily by activated Th2 cells. This cytokine is involved in several stages of B-cell maturation and differentiation. It up-regulates CD23 and MHC class II expression, and promotes IgE isotype switching of B cells. This cytokine down-regulates macrophage activity, thereby inhibits the production of pro-inflammatory cytokines and chemokines. This cytokine is found to be critical to the pathogenesis of allergen-induced asthma but operates through mechanisms independent of IgE and eosinophils. This gene, IL3, IL5, IL4, and CSF2 form a cytokine gene cluster on chromosome 5q, with this gene particularly close to IL4.
6977	
6978	
6979	
6980	The pyruvate dehydrogenase complex is a nuclear-encoded mitochondrial matrix multienzyme complex that provides the primary link between glycolysis and the tricarboxylic acid (TCA) cycle by catalyzing the irreversible conversion of pyruvate into acetyl-CoA. The PDH complex is composed of multiple copies of 3 enzymes: E1 (PDHA1); dihydrolipoyl transacetylase (DLAT; MIM 608770) (E2; EC 2.3.1.12); and dihydrolipoyl dehydrogenase (DLD; MIM 246900) (E3; EC 1.8.1.4). The E1 enzyme is a heterotetramer of 2 alpha and 2 beta subunits. The E1-alpha subunit contains the E1 active site and plays a key role in the function of the PDH complex (Brown et al., 1994).[supplied by OMIM]
6981	The dystrophin-glycoprotein complex (DGC) is a multisubunit protein complex that spans the sarcolemma and provides structural linkage between the subsarcolemmal cytoskeleton and the extracellular matrix of muscle cells. There are 3 main subcomplexes of the DGC: the cytoplasmic proteins dystrophin (DMD; MIM 300377) and syntrophin (SNTA1; MIM 601017), the alpha- and beta-dystroglycans (see MIM 128239), and the sarcoglycans (see, e.g., SGCA; MIM 600119) (Crosbie et al., 2000).[supplied by OMIM]
6982	
6983	This gene encodes a cytoplasmic protein that contains epidermal growth factor (EGF) -like repeats. The encoded heterotrimeric protein may be involved in cell growth regulation and differentiation. A similar protein in rodents is involved in craniosynostosis. An alternative splice variant has been described but its full-length sequence has not been determined.
6984	
6985	ADP-ribosylation factor 3 (ARF3) is a member of the human ARF gene family. These genes encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking and as activators of phospholipase D.  The gene products include 6 ARF proteins and 11 ARF-like proteins and constitute 1 family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2,and ARF3), class II (ARF4 and ARF5) and class III (ARF6) and members of each class share a common gene organization. The ARF3 gene contains five exons and four introns.
6986	This gene encodes a protein belonging to the vacuolar-protein-sorting-13 gene family. In yeast, vacuolar-protein-sorting-13 proteins are involved in trafficking of membrane proteins between the trans-Golgi network and the prevacuolar compartment. While several transcript variants may exist for this gene, the full-length natures of only two have been described to date. These two represent the major variants of this gene and encode distinct isoforms.
6987	
6988	Cofactor C is one of four proteins (cofactors A, D, E, and C) involved in the pathway leading to correctly folded beta-tubulin from folding intermediates. Cofactors A and D are believed to play a role in capturing and stabilizing beta-tubulin intermediates in a quasi-native confirmation. Cofactor E binds to the cofactor D/beta-tubulin complex; interaction with cofactor C then causes the release of beta-tubulin polypeptides that are committed to the native state.
6989	
6990	
6991	Synuclein-gamma is a member of the synuclein family of proteins which are believed to be involved in the pathogenesis of neurodegenerative diseases.  High levels of SNCG have been identified in advanced breast carcinomas suggesting a correlation between overexpression of SNCG and breast tumor development.
6992	
6993	
6994	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene but the biological validity of some variants has not been determined.
6995	Glycogen synthase kinase-3 (GSK3) is a proline-directed serine-threonine kinase that was initially identified as a phosphorylating and inactivating glycogen synthase. Two isoforms, alpha (GSK3A; MIM 606784) and beta, show a high degree of amino acid homology (Stambolic and Woodgett, 1994). GSK3B is involved in energy metabolism, neuronal cell development, and body pattern formation (Plyte et al., 1992).[supplied by OMIM]
6996	
6997	
6998	
6999	
7000	
7001	The encoded protein is a secreted calcium-binding protein which is found in the cytoplasm. It is related to calbindin D-28K and calretinin. This protein is thought to be involved in KCL-stimulated calcium flux and cell proliferation.
7002	This gene was identified as a c-Myc responsive gene, and behaves as a direct c-Myc target gene. Overexpression of this gene is found to enhance the transformation of lymphoblastoid cells, and it complements a transformation-defective Myc Box II mutant, suggesting its involvement in c-Myc-mediated cell transformation. Two alternatively spliced transcript variants encoding distinct isoforms have been reported.
7003	
7004	Mutations in this gene are associated with Nijmegen breakage syndrome, an autosomal recessive chromosomal instability syndrome characterized by microcephaly, growth retardation, immunodeficiency, and cancer predisposition. The encoded protein is a member of the MRE11/RAD50 double-strand break repair complex which consists of 5 proteins. This gene product is thought to be involved in DNA double-strand break repair and DNA damage-induced checkpoint activation.
7005	This gene encodes a member of the C-lectin family, proteins that possess collagen-like sequences and carbohydrate recognition domains. The other members of this family are secreted proteins and bind to carbohydrate antigens on microorganisms facilitating their recognition and removal. This gene product is a cytosolic protein, a characteristic that suggests that it may have different biological functions than other C-lectins.
7006	Fibrillar collagen types I-III are synthesized as precursor molecules known as procollagens. These precursors contain amino- and carboxyl-terminal peptide extensions known as N- and C-propeptides, respectively, which are cleaved, upon secretion of procollagen from the cell, to yield the mature triple helical, highly structured fibrils. This gene encodes a glycoprotein which binds and drives the enzymatic cleavage of type I procollagen and heightens C-proteinase activity.
7007	This gene belongs to the homeobox family, members of which encode a highly conserved family of transcription factors that play important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, consisting of 9 to 11 genes arranged in tandem, and located on different chromosomes. This gene is one of several HOXC genes located in a cluster on chromosome 12. Three genes, HOXC5, HOXC4 and HOXC6, are co-transcribed from a primary transcript. Subsequent processing results in gene-specific transcripts, which sometimes share the 5@#$%& non-coding exon. The product of this gene may contribute to the breast cell phenotype through co-operative interactions with other HOX-derived proteins and repression of their target genes. Two alternatively spliced transcripts encoding different isoforms have been characterized to date.
7008	
7009	
7010	This gene is a highly conserved lung cancer candidate gene. No other information about this gene is currently available.
7011	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L1P family of ribosomal proteins. It is located in the cytoplasm. The expression of this gene is downregulated in the thymus by cyclosporin-A (CsA), an immunosuppressive drug. Studies in mice have shown that the expression of the ribosomal protein L10a gene is downregulated in neural precursor cells during development. This gene previously was referred to as NEDD6 (neural precursor cell expressed, developmentally downregulated 6), but it has been renamed RPL10A (ribosomal protein 10a). As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
7012	
7013	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a protein identified as belonging to both the 28S and the 39S subunits. Further experiments will be needed to identify the specific subunit localization. Sequence analysis identified three transcript variants that encode two different isoforms. Pseudogenes corresponding to this gene are found on chromosomes 4q, 6p, 6q, 7p, and 15q.
7014	Catechol-O-methyltransferase catalyzes the transfer of a methyl group from S-adenosylmethionine to catecholamines, including the neurotransmitters dopamine, epinephrine, and norepinephrine.  This O-methylation results in one of the major degradative pathways of the catecholamine transmitters.  In addition to its role in the metabolism of endogenous substances, COMT is important in the metabolism of catechol drugs used in the treatment of hypertension, asthma, and Parkinson disease.  COMT is found in two forms in tissues, a soluble form (S-COMT) and a membrane-bound form (MB-COMT).  The differences between S-COMT and MB-COMT reside within the N-termini.  The transcript variants are formed through the use of alternative translation initiation sites and promoters.
7015	
7016	The protein encoded by this gene is located in the nuclear envelope. It has protein similarity to nuclear transport factor 2. This protein functions as a nuclear export factor in both RAN (Ras-related nuclear protein)- and CRM1 (required for chromosome region maintenance)-dependent pathways. It is found to stimulate the export of U1 snRNA in RAN- and CRM1-dependent pathways and the export of tRNA and mRNA in a CRM1-independent pathway. The encoded protein heterodimerizes with Tap protein and may regulate the ability of Tap protein to mediate nuclear mRNA export. The use of alternate polyadenylation sites has been found for this gene.
7017	ARHGEF9 belongs to a family of Rho-like GTPases that act as molecular switches by cycling from the active GTP-bound state to the inactive GDP-bound state. These proteins are key regulators of the actin cytoskeleton and are involved in cell signaling.[supplied by OMIM]
7018	WD40 repeat proteins are key components of many essential biologic functions. They regulate the assembly of multiprotein complexes by presenting a beta-propeller platform for simultaneous and reversible protein-protein interactions. Members of the WIPI subfamily of WD40 repeat proteins, such as WIPI4, have a 7-bladed propeller structure and contain a conserved motif for interaction with phospholipids (Proikas-Cezanne et al., 2004).[supplied by OMIM]
7019	
7020	BTG3 is a protein that interacts with CAF1, a component of the general transcription multisubunit complex. It is thought that BTG3 is involved in negative control of the cell cycle. The protein encoded by this gene binds BTG3. Studies with mouse homolog suggest that this protein may also interact with a specific nuclear matrix/scaffold-associated region (MAR). Transcript variants encoding different isoforms have been described for this gene.
7021	BTG3 is a protein that interacts with CAF1, a component of the general transcription multisubunit complex. It is thought that BTG3 is involved in negative control of the cell cycle. The protein encoded by this gene binds BTG3. Studies with mouse homolog suggest that this protein may also interact with a specific nuclear matrix/scaffold-associated region (MAR). Transcript variants encoding different isoforms have been described for this gene.
7022	PUS1 converts uridine into pseudouridine after the nucleotide has been incorporated into RNA. Pseudouridine may have a functional role in tRNAs and may assist in the peptidyl transfer reaction of rRNAs (Chen and Patton, 1999).[supplied by OMIM]
7023	
7024	
7025	
7026	
7027	
7028	Neurexins are a family of proteins that function in the vertebrate nervous system as cell adhesion molecules and receptors. They are encoded by several unlinked genes of which two, NRXN1 and NRXN3, are among the largest known human genes. Three of the genes (NRXN1-3) utilize two alternate promoters and include numerous alternatively spliced exons to generate thousands of distinct mRNA transcripts and protein isoforms. The majority of transcripts are  produced from the upstream promoter and encode alpha-neurexin isoforms; a much smaller number of transcripts are produced from the downstream promoter and encode beta-neurexin isoforms. The alpha-neurexins contain epidermal growth factor-like (EGF-like) sequences and laminin G domains, and have been shown to interact with neurexophilins. The beta-neurexins lack EGF-like sequences and contain fewer laminin G domains than alpha-neurexins.
7029	The protein encoded by this gene is a member of the keratin gene family. As a type II hair keratin, it is a basic protein which heterodimerizes with type I keratins to form hair and nails. The type II hair keratins are clustered in a region of chromosome 12q13 and are grouped into two distinct subfamilies based on structure similarity. One subfamily, consisting of KRTHB1, KRTHB3, and KRTHB6, is highly related. The other less-related subfamily includes KRTHB2, KRTHB4, and KRTHB5. All hair keratins are expressed in the hair follicle; this hair keratin, as well as KRTHB3 and KRTHB6, is found primarily in the hair cortex. Mutations in this gene and KRTHB6 have been observed in patients with a rare dominant hair disease, monilethrix.
7030	
7031	
7032	
7033	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit contains an N-terminal glutathione transferase domain, which may be involved in regulating the assembly of multisubunit complexes containing this elongation factor and aminoacyl-tRNA synthetases.
7034	
7035	Activator of apoptosis Hrk regulates apoptosis through interaction with death-repressor proteins Bcl-2 and Bcl-X(L). The HRK protein lacks significant homology to other BCL2 family members except for an 8-amino acid region that was similar to the BCL2 homology domain-3 (BH3) motif of BIK.  HRK interacts with BCL2 and BCLXL via the BH3 domain, but not with the death-promoting BCL2-related proteins BAX, BAK, or BCLXS. HRK localizes to membranes of intracellular organelles in a pattern similar to that previously reported for BCL2 and BCLXL.
7036	
7037	
7038	
7039	This gene encodes a double zinc finger (FYVE domain) protein that interacts directly with SMAD2 and SMAD3, and is involved in Alzheimer@#$%&s disease. SMAD proteins transmit signals from transmembrane serine/threonine kinase receptors to the nucleus. The FYVE domain has been identified in a number of unrelated signaling molecules. This protein functions to recruit SMAD2 to the transforming growth factor-beta receptor. The FYVE domain is required to maintain the normal localization of this protein but is not involved in mediating interaction with SMADs. The C-terminal domain of this protein interacts with the TGFB receptor. This protein is a component of the TGFB pathway that brings the SMAD substrate to the receptor. Three alternatively spliced transcripts encoding distinct isoforms have been found for this gene.
7040	
7041	
7042	The protein encoded by this gene catalyzes the transport of phosphate into the mitochondrial matrix, either by proton cotransport or in exchange for hydroxyl ions. The protein contains three related segments arranged in tandem which are related to those found in other characterized members of the mitochondrial carrier family. Both the N-terminal and C-terminal regions of this protein protrude toward the cytosol. Multiple alternatively spliced transcript variants have been isolated.
7043	
7044	
7045	
7046	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
7047	
7048	
7049	
7050	
7051	The protein encoded by this gene is a member of the RecQ DNA helicase family. DNA helicases are enzymes involved in various types of DNA repair, including mismatch repair, nucleotide excision repair and direct repair. Some members of this family are associated with genetic disorders with predisposition to malignancy and chromosomal instability. The biological function of this helicase has not yet been determined. Two alternatively spliced transcripts, which encode the same isoform but differ in their 5@#$%& and 3@#$%& UTRs, have been described.
7052	
7053	Death-associated protein kinase 1 is a positive mediator of gamma-interferon induced programmed cell death.  DAPK1 encodes a structurally unique 160-kD calmodulin dependent serine-threonine kinase that carries 8 ankyrin repeats and 2 putative P-loop consensus sites. It is a tumor suppressor candidate.
7054	This gene encodes an alpha integrin. Integrins are heterodimeric integral membrane glycoproteins composed of an alpha chain and a beta chain that mediate cell-cell and cell-matrix adhesion. The protein encoded by this gene, when bound to the beta 1 chain, forms an integrin that is a receptor for VCAM1, cytotactin and osteopontin. Expression of this gene has been found to be upregulated in small cell lung cancers.
7055	
7056	Methionine is an essential amino acid required for protein synthesis and one carbon metabolism. Its synthesis is catalyzed by the enzyme methionine synthase. Methionine synthase eventually becomes inactive due to the oxidation of its cob(I)alamin cofactor. The protein encoded by this gene regenerates a functional methionine synthase via reductive methylation. It is a member of the ferredoxin-NADP(+) reductase (FNR) family of electron transferases. Patients of the cbl-E complementation group of disorders of folate/cobalamin metabolism are defective in reductive activation of methionine synthase. This gene produces two transcripts.
7057	Plasma prekallikrein is a glycoprotein that participates in the surface-dependent activation of blood coagulation, fibrinolysis, kinin generation and inflammation.  It is synthesized in the liver and secreted into the blood as a single polypeptide chain.  Plasma prekallikrein is converted to plasma kallikrein by factor XIIa by the cleavage of an internal Arg-Ile bond.  Plasma kallikrein therefore is composed of a heavy chain and a light chain held together by a disulphide bond.  The heavy chain originates from the amino-terminal end of the zymogen and contains 4 tandem repeats of 90 or 91 amino acids.  Each repeat harbors a novel structure called the apple domain.  The heavy chain is required for the surface-dependent pro-coagulant activity of plasma kallikrein.  The light chain contains the active site or catalytic domain of the enzyme and is homologous to the trypsin family of serine proteases. Plasma prekallikrein deficiency causes a prolonged activated partial thromboplastin time in patients.
7058	
7059	
7060	
7061	
7062	The protein encoded by this gene contains a RING finger domain, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions. This gene is located in a chromosomal region known to be frequently deleted in patients with neurofibromatosis. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
7063	The protein encoded by this gene is a helix-loop-helix (HLH) protein that can form heterodimers with members of the basic HLH family of transcription factors. The encoded protein has no DNA binding activity and therefore can inhibit the DNA binding and transcriptional activation ability of basic HLH proteins with which it interacts. This protein may play a role in cell growth, senescence, and differentiation. Two transcript variants encoding different isoforms have been found for this gene.
7064	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel consists of three main subunits (a, b, c). This gene encodes the epsilon subunit of the catalytic core. Two pseudogenes of this gene are located on chromosomes 4 and 13.
7065	
7066	This gene encodes a member of the C-type lectin/C-type lectin-like domain (CTL/CTLD) superfamily. Members of this family share a common protein fold and have diverse functions, such as cell adhesion, cell-cell signalling, glycoprotein turnover, and roles in inflammation and immune response. This gene is closely linked to other CTL/CTLD superfamily members on chromosome 12p13 in the natural killer gene complex region.
7067	
7068	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that is similar to the chicken pro-apoptotic protein p52. Transcript variants using alternative promoters or polyA sites have been mentioned in the literature but the complete description of these sequences is not available.
7069	The protein encoded by this gene is a ubiquitously expressed nuclear protein. It is found among several proteins that bind directly to retinoblastoma protein, which regulates cell proliferation. This protein complexes with transcriptional co-repressor CTBP. It is also associated with BRCA1 and is thought to modulate the functions of BRCA1 in transcriptional regulation, DNA repair, and/or cell cycle checkpoint control. It is suggested that this gene may itself be a tumor suppressor acting in the same pathway as BRCA1. Three transcript variants encoding two different isoforms have been found for this gene. More transcript variants exist, but their full-length natures have not been determined.
7070	
7071	This gene encodes an alpha-1 subunit of a voltage-dependent calcium channel. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization. The alpha-1 subunit consists of 24 transmembrane segments and forms the pore through which ions pass into the cell. The calcium channel consists of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. There are multiple isoforms of each of these proteins, either encoded by different genes or the result of alternative splicing of transcripts. The protein encoded by the gene described in this record binds to and is inhibited by dihydropyridine. Many alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
7072	
7073	
7074	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind RNAs, followed by a glycine-rich C-terminus.
7075	The protein encoded by this gene acts as receptor for arginine vasopressin. This receptor belongs to the subfamily of G-protein coupled receptors which includes AVPR1B, V2R and OXT receptors. Its activity is mediated by G proteins which stimulate a phosphatidylinositol-calcium second messenger system. The receptor mediates cell contraction and proliferation, platelet aggregation, release of coagulation factor and glycogenolysis.
7076	
7077	
7078	
7079	
7080	
7081	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein. This gene lies adjacent to and downstream of the gonadotropin-releasing hormone precursor gene.
7082	
7083	This gene encodes a protein that has been directly linked to Bardet-Biedl syndrome. The primary features of this syndrome include retinal dystrophy, obesity, polydactyly, renal abnormalities and learning disabilities. Experimentation in non-human eukaryotes suggests that this gene is expressed in ciliated cells and that it is required for the formation of cilia. Alternate transcriptional splice variants have been observed but have not been fully characterized.
7084	
7085	The alpha-1,3-fucosyltransferases constitute a large family of glycosyltransferases with a high degree of homology. The enzymes of this family comprise 3 main activity patterns called myeloid, plasma, and Lewis, based on their capacity to transfer alpha-L-fucose to distinct oligosaccharide acceptors, their sensitivity to N-ethylmaleimide inhibition, their cation requirements, and their tissue-specific expression patterns. The different categories of alpha-1,3-fucosyltransferases are sequentially expressed during embryo-fetal development.[supplied by OMIM]
7086	
7087	
7088	
7089	
7090	
7091	This gene is deleted in the primary tumor of hepatocellular carcinoma. It maps to 8p22-p21.3, a region frequently deleted in solid tumors. It is suggested that this gene is a candidate tumor suppressor gene for human liver cancer, as well as for prostate, lung, colorectal, and breast cancers. Alternative splicing at this locus results in several transcript variants encoding different isoforms.
7092	
7093	
7094	This gene encodes a type II transmembrane glycoprotein that is the highly polymorphic Kell blood group antigen. The encoded protein contains sequence and structural similarity to members of a family of zinc metalloglycoproteins with neutral endopeptidase activity.
7095	The protein encoded by this gene is a member of the SWI/SNF family of proteins and is highly similar to the brahma protein of Drosophila. Members of this family have helicase and ATPase activities and are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI, which is required for transcriptional activation of genes normally repressed by chromatin. Two transcript variants encoding different isoforms have been found for this gene, which contains a trinucleotide repeat (CAG) length polymorphism.
7096	
7097	
7098	The globular WW domain is composed of 38 to 40 semiconserved amino acids shared by proteins of diverse functions including structural, regulatory, and signaling proteins. The domain is involved in mediating protein-protein interactions through the binding of polyproline ligands. This gene encodes a WW domain binding protein, which binds to the WW domain of Yes kinase-associated protein by a conserved region: XPPXY motif. The function of this protein has not been determined.
7099	The cyclic tetradecapeptide somatostatin is widely distributed throughout the body and is an important regulator of endocrine and nervous system function. It exerts its biologic actions by binding to specific high-affinity receptors on the cell surface.[supplied by OMIM]
7100	This gene encodes a protein with a sterol sensing domain (SSD) and seven WD domains. In the presence of cholesterol, this protein binds to sterol regulatory element binding proteins (SREBPs) and mediates their transport from the ER to the Golgi. The SREBPs are then proteolytically cleaved and regulate sterol biosynthesis.
7101	
7102	
7103	
7104	
7105	SIL1 is a resident endoplasmic reticulum (ER) glycoprotein that interacts with the ATPase domain of BIP (HSPA5; MIM 138120) and enhances nucleotide exchange.[supplied by OMIM]
7106	Mutations in this gene have been observed in patients with Bardet-Biedl syndrome type 7. The encoded protein may play a role in eye, limb, cardiac and reproductive system development. Two transcript variants encoding distinct isoforms have been identified for this gene.
7107	This gene is a member of the MAGEA gene family. The members of this family encode proteins with 50 to 80% sequence identity to each other. The promoters and first exons of the MAGEA genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. The MAGEA genes are clustered at chromosomal location Xq28. They have been implicated in some hereditary disorders, such as dyskeratosis congenita. This gene has two identical copies at different loci. Alternatively spliced transcript variants encoding the same protein have been identified for this gene.
7108	
7109	
7110	The protein encoded by this gene is a cytoplasmic membrane-associated protein that indirectly coordinates endocytic membrane traffic with the actin assembly machinery. In addition, the encoded protein may regulate the formation of clathrin-coated vesicles and could be involved in synaptic vesicle recycling. This protein has been shown to interact with dynamin, CDC42, SNAP23, SNAP25, SPIN90, EPS15, EPN1, EPN2, and STN2. Multiple transcript variants encoding different isoforms have been found for this gene, but the full-length nature of only two of them have been characterized so far.
7111	
7112	
7113	
7114	The enzyme encoded by this gene is an important intermediary in mammalian carbohydrate interconversions. It transfers a glucose moiety from glucose-1-phosphate to MgUTP and forms UDP-glucose and MgPPi. In liver and muscle tissue, UDP-glucose is a direct precursor of glycogen; in lactating mammary gland it is converted to UDP-galactose which is then converted to lactose. The eukaryotic enzyme has no significant sequence similarity to the prokaryotic enzyme. Two transcript variants encoding different isoforms have been found for this gene.
7115	This gene encodes a member of the basic helix-loop-helix (BHLH) family of transcription factors. The protein activates transcription by binding to the E box (5@#$%&-CANNTG-3@#$%&). Dimerization with other BHLH proteins is required for efficient DNA binding. This protein plays a role in the neuronal commitment and differentiation and in the generation of olfactory and autonomic neurons. It is highly expressed in medullary thyroid cancer and small cell lung cancer and may be a useful marker for these cancers. The presence of a CAG repeat in the gene suggests that it may also play a role in tumor formation.
7116	
7117	A large protein complex, termed the anaphase-promoting complex (APC), or the cyclosome, promotes metaphase-anaphase transition by ubiquitinating its specific substrates such as mitotic cyclins and anaphase inhibitor, which are subsequently degraded by the 26S proteasome. Biochemical studies have shown that the vertebrate APC contains eight subunits. The composition of the APC is highly conserved in organisms from yeast to humans. The product of this gene is a component of the complex and shares sequence similarity with a recently identified family of proteins called cullins, which may also be involved in ubiquitin-mediated degradation.
7118	
7119	The protein encoded by this gene is an interleukin 3 specific subunit of a heterodimeric cytokine receptor. The receptor is comprised of a ligand specific alpha subunit and a signal transducing beta subunit shared by the receptors for interleukin 3 (IL3), colony stimulating factor 2 (CSF2/GM-CSF), and interleukin 5 (IL5). The binding of this protein to IL3 depends on the beta subunit. The beta subunit is activated by the ligand binding, and is required for the biological activities of IL3. This gene and the gene encoding the colony stimulating factor 2 receptor alpha chain (CSF2RA) form a cytokine receptor gene cluster in a X-Y pseudoautosomal region on chromosomes X or Y.
7120	
7121	
7122	This gene encodes a protein that may function as a Rho GTPase-activating protein. Transcript variants encoding distinct isoforms have been identified.
7123	
7124	
7125	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L15E family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with the yeast ribosomal protein YL10 gene. Although this gene has been referred to as RPL10, its official symbol is RPL15. This gene has been shown to be overexpressed in some esophageal tumors compared to normal matched tissues. Transcript variants utilizing alternative polyA signals exist. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
7126	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form complex with G proteins and stimulate Rho-dependent signals. This protein is similar to rat collybistin protein. Alternative splicing of this gene generates two transcript variants which encode different isoforms. Also there is possibility for the usage of multiple polyadenylation sites for this gene.
7127	This gene encodes a protein which may function in signal transduction pathways and whose expression is elevated in germinal cell lymphomas. It contains a putative PDZ-interacting domain, an immunoreceptor tyrosine-based activation motif (ITAM), and two putative SH2 binding sites. In B cells, its expression is specifically induced by interleukin-4. Alternative splicing results in multiple transcript variants encoding different isoforms.
7128	
7129	The protein encoded by this gene is a ubiquitous actin monomer-binding protein belonging to the profilin family. It is thought to regulate actin polymerization in response to extracellular signals. There are two alternatively spliced transcript variants encoding different isoforms described for this gene.
7130	
7131	
7132	
7133	
7134	
7135	The protein encoded by this gene is similar to bovine and porcine proteins which accelerate transfer of certain glycosphingolipids and glyceroglycolipids between membranes. It is thought to be a cytoplasmic protein.
7136	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. More specifically, it may function in fertility, embryo viability, and regulation of hematopoietic lineage commitment. Two transcript variants encoding different isoforms have been found for this gene.
7137	The protein encoded by this gene is a member of the homeodomain family of DNA binding proteins. It may regulate gene expression, morphogenesis, and differentiation and it may also play a role in the cell cycle progession. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
7138	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
7139	
7140	This gene encodes a protein that contains a domain with homology to the ancient conserved region of the archain 1 gene and a domain that may be involved in binding ubiquitin-conjugating enzymes. The protein encoded by this gene has been shown to bind to the conserved membrane-proximal sequence of the cytoplasmic tail of integrin alpha(IIb) subunits. These subunits play a crucial role in the integrin alpha(IIb)beta(3) inside-out signalling in platelets and megakaryocytes that leads to platelet aggregation and thrombus formation. This gene overlaps the gene for mitochondrial serine protease 25. Three alternative transcripts encoding distinct isoforms have been described.
7141	
7142	The protein encoded by this gene has been termed a @#$%&growth@#$%& or @#$%&plasticity@#$%& protein because it is expressed at high levels in neuronal growth cones during development and axonal regeneration. This encoded protein is considered a crucial component of an effective regenerative response in the nervous system.
7143	
7144	
7145	This gene encodes a protein that is highly similar to mouse and rat kynurenine aminotransferase II. The rat protein is a homodimer with two transaminase activities. One activity is the transamination of alpha-aminoadipic acid, a final step in the saccaropine pathway which is the major pathway for L-lysine catabolism. The other activity involves the transamination of kynurenine to produce kynurenine acid, the precursor of kynurenic acid which has neuroprotective properties. Two alternative transcripts encoding the same isoform have been identified, however, additional alternative transcripts and isoforms may exist.
7146	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. These enzymes are involved in cellular defense against toxic, carcinogenic, and pharmacologically active electrophilic compounds. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-tranferase belonging to the alpha class genes that are located in a cluster mapped to chromosome 6. Genes of the alpha class are highly related and encode enzymes with glutathione peroxidase activity. However, during evolution, this alpha class gene diverged accumulating mutations in the active site that resulted in differences in substrate specificity and catalytic activity. The enzyme encoded by this gene catalyzes the double bond isomerization of precursors for progesterone and testosterone during the biosynthesis of steroid hormones. An additional transcript variant has been identified, but its full length sequence has not been determined.
7147	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. The product of this gene is expressed exclusively in the pancreas and is activated at alkaline pH.
7148	Centrosome-associated protein E is a kinesin-like motor protein that accumulates in the G2 phase of the cell cycle.  Unlike other centrosome-associated proteins, it is not present during interphase and first appears at the centromere region of chromosomes during prometaphase. CENPE is proposed to be one of the motors responsible for mammalian chromosome movement and/or spindle elongation.
7149	The family of natriuretic peptides (see MIM 108780) elicit a number of vascular, renal, and endocrine effects that are important in the maintenance of blood pressure and extracellular fluid volume. These effects are mediated by specific binding of the peptides to cell surface receptors in the vasculature, kidney, adrenal, and brain.[supplied by OMIM]
7150	The product of this gene belongs to the family of purinoceptors for ATP. This receptor functions as a ligand-gated ion channel and is responsible for ATP-dependent lysis of macrophages through the formation of membrane pores permeable to large molecules. Activation of this nuclear receptor by ATP in the cytoplasm may be a mechanism by which cellular activity can be coupled to changes in gene expression. Alternatively spliced variants encoding different isoforms have been identified.
7151	
7152	
7153	This gene encodes a subunit of the splicing factor 3b protein complex. Splicing factor 3b, together with splicing factor 3a and a 12S RNA unit, forms the U2 small nuclear ribonucleoproteins complex (U2 snRNP). The splicing factor 3b/3a complex binds pre-mRNA upstream of the intron@#$%&s branch site in a sequence-independent manner and may anchor the U2 snRNP to the pre-mRNA. The protein encoded by this gene contains a PHD-finger-like domain that is flanked by highly basic N- and C-termini. This protein belongs to the PHD-finger superfamily and may act as a chromatin-associated protein. This gene has several pseudogenes on different chromosomes.
7154	
7155	
7156	Tyrosine kinases are either receptor molecules, which contain transmembrane and extracellular domains, or nonreceptor proteins, which are located intracellularly. One family of nonreceptor TKs includes the genes TEC (MIM 600583), TXK (MIM 600058), ITK (MIM 186973), and BTK (MIM 300300). All of these proteins are homologs of the Drosophila Src28 TK and contain an SH3 and SH2 domain upstream of the TK domain.[supplied by OMIM]
7157	
7158	
7159	
7160	The MMAB gene encodes cob(I)alamin adenosyltransferase (EC 2.5.1.17), which catalyzes the final step in the synthesis of the coenzyme adenosylcobalamin (AdoCbl). AdoCbl is a vitamin B12-containing coenzyme for methylmalonyl-CoA mutase (MIM 609058).[supplied by OMIM]
7161	
7162	
7163	
7164	
7165	
7166	Cathelicidin antimicrobial protein is an antimicrobial protein found in specific granules of polymorphonuclear leukocytes (PMNs).[supplied by OMIM]
7167	HSF1 is a heat-shock transcription factor. Transcription of heat-shock genes is rapidly induced after temperature stress.  Hsp90, by itself and/or associated with multichaperone complexes, is a major repressor of HSF1.
7168	
7169	
7170	
7171	
7172	Members of the Rab protein family are nontransforming monomeric GTP-binding proteins of the Ras superfamily that contain 4 highly conserved regions involved in GTP binding and hydrolysis. Rabs are prenylated, membrane-bound proteins involved in vesicular fusion and trafficking. The mammalian RAB proteins show striking similarities to the S. cerevisiae YPT1 and SEC4 proteins, Ras-related GTP-binding proteins involved in the regulation of secretion.[supplied by OMIM]
7173	
7174	
7175	
7176	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. Methylation of this gene may result in the loss of its expression and, since the encoded protein upregulates the tumor suppressor p53, this protein may play an important role in tumorigenesis.
7177	
7178	
7179	
7180	Kinesins, such as KIF2, are microtubule-associated motor proteins. For background information on kinesins, see MIM 148760.[supplied by OMIM]
7181	
7182	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Alternative splicing of this gene generates at least 3 transcript variants diverging at the 5@#$%& terminus.
7183	Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. It is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, F0, which comprises the proton channel. The F1 complex consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled in a ratio of 3 alpha, 3 beta, and a single representative of the other 3. The F0 seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the f subunit of the F0 complex. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
7184	Retinoic acid plays a critical role in development, cellular growth, and differentiation. The specific function of this intronless, retinoic acid-induced gene has not yet been determined; however, it has been suggested to play a role in development. Localization of this gene designates it to be a candidate for diseases such as Nance-Horan syndrome, sensorineural deafness, non-specific X-linked mental retardation, oral-facial-digital syndrome, and Fried syndrome.
7185	Vasoactive intestinal peptide (VIP) is a neuroprotective factor that has been shown to have a stimulatory effect on the growth of some tumor cells and an inhibitory effect on others. The protein encoded by this gene is upregulated by VIP and may be involved in the stimulatory effect of VIP on certain tumor cells. The encoded protein contains one homeobox and nine zinc finger domains, suggesting that it functions as a transcription factor. This gene is also upregulated in normal proliferative tissues. Finally, the encoded protein may increase the viability of certain cell types through modulation of p53 activity. Two transcript variants encoding the same protein have been found for this gene.
7186	Adenosine deaminase catalyzes the hydrolysis of adenosine to inosine. ADA deficiency causes one form of severe combined immunodeficiency disease (SCID), in which there is dysfunction of both B and T lymphocytes with impaired cellular immunity and decreased production of immunoglobulins.
7187	
7188	This gene may play a role in the development and normal function of the ovaries. Mutations of this gene have been linked to premature ovarian failure. Alternative splicing results in two protein isoforms. Furthermore, each splice variant undergoes additional splicing in the 3@#$%& UTR. Overall 4 splice variants have been described.
7189	Human chromosomal region Xp22.3-p21.3 comprises the area between the pseudoautosomal boundary and the Duchenne muscular dystrophy gene (MIM 300377). This region harbors several disease loci, including OFD1 (MIM 311200), CFNS (MIM 304110), DFN6 (MIM 300066), and SEDT (MIM 313400). It also contains a region of homology with both the short and the long arms of the Y chromosome and undergoes frequent chromosomal rearrangements.[supplied by OMIM]
7190	A number sign (#) is used with this entry because the phenotype is caused by mutation in the gene encoding N-sulfoglucosamine sulfohydrolase (SGSH; MIM 605270). The Sanfilippo syndrome, or mucopolysaccharidosis III, is a lysosomal storage disease due to impaired degradation of heparan sulfate (Esposito et al, 2000 [PubMed 10727844]). MPS III includes 4 types, each due to the deficiency of a different enzyme: heparan N-sulfatase (type A); alpha-N-acetylglucosaminidase (type B; MIM 252920); acetyl CoA:alpha-glucosaminide acetyltransferase (type C; MIM 252930); and N-acetylglucosamine 6-sulfatase (type D; MIM 252940). The Sanfilippo syndrome is characterized by severe central nervous system degeneration, but only mild somatic disease. Onset of clinical features usually occurs between 2 and 6 years; severe neurologic degeneration occurs in most patients between 6 and 10 years of age, and death occurs typically during the second or third decade of life. Type A has been reported (van de Kamp et al., 1981 [PubMed 6796310]) to be the most severe, with earlier onset and rapid progression of symptoms and shorter survival.[supplied by OMIM]
7191	
7192	
7193	
7194	This gene encodes a type I integral membrane protein found only in the rough endoplasmic reticulum. The encoded protein is part of an N-oligosaccharyl transferase complex that links high mannose oligosaccharides to asparagine residues found in the Asn-X-Ser/Thr consensus motif of nascent polypeptide chains. This protein forms part of the regulatory subunit of the 26S proteasome and may mediate binding of ubiquitin-like domains to this proteasome.
7195	
7196	
7197	
7198	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes. They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis. Many of the effects of laminin are mediated through interactions with cell surface receptors. These receptors include members of the integrin family, as well as non-integrin laminin-binding proteins. This gene encodes a high-affinity, non-integrin family, laminin receptor 1. This receptor has been variously called 67 kD laminin receptor, 37 kD laminin receptor precursor (37LRP) and p40 ribosome-associated protein. The amino acid sequence of laminin receptor 1 is highly conserved through evolution, suggesting a key biological function. It has been observed that the level of the laminin receptor transcript is higher in colon carcinoma tissue and lung cancer cell line than their normal counterparts. Also, there is a correlation between the upregulation of this polypeptide in cancer cells and their invasive and metastatic phenotype. Multiple copies of this gene exist, however, most of them are pseudogenes thought to have arisen from retropositional events. Two alternatively spliced transcript variants encoding the same protein have been found for this gene.
7199	
7200	
7201	
7202	
7203	
7204	The protein encoded by this gene is a homeodomain protein that lacks certain conserved residues required for DNA binding. It was reported that choriocarcinoma cell lines and tissues failed to express this gene, which suggested the possible involvement of this gene in malignant conversion of placental trophoblasts. Studies in mice suggested that this protein may interact with serum response factor (SRF) and modulate SRF-dependent cardiac-specific gene expression and cardiac development. Multiple alternatively spliced transcript variants encoding the same protein have been observed, the full-length natures of only some have been determined.
7205	This gene encodes a member of the ephrin (EPH) family. The ephrins and EPH-related receptors comprise the largest subfamily of receptor protein-tyrosine kinases and have been implicated in mediating developmental events, especially in the nervous system and in erythropoiesis. Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. This gene encodes an EFNA class ephrin which binds to the EPHA2, EPHA4, EPHA5, EPHA6, and EPHA7 receptors. Two transcript variants that encode different isoforms were identified through sequence analysis.
7206	This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. This particular collagen IV subunit, however, is only found in a subset of basement membranes. Like the other members of the type IV collagen gene family, this gene is organized in a head-to-head conformation with another type IV collagen gene so that each gene pair shares a common promoter. Mutations in this gene are associated with type II autosomal recessive Alport syndrome (hereditary glomerulonephropathy) and with familial benign hematuria (thin basement membrane disease). Two transcripts, differing only in their transcription start sites, have been identified for this gene and, as is common for collagen genes, multiple polyadenylation sites are found in the 3@#$%& UTR.
7207	
7208	ADP-ribosylation factor 5 (ARF5) is a member of the human ARF gene family. These genes encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking and as activators of phospholipase D.  The gene products include 6 ARF proteins and 11 ARF-like proteins and constitute 1 family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2,and ARF3), class II (ARF4 and ARF5) and class III (ARF6). The members of each class share a common gene organization. The ARF5 gene spans approximately 3.2kb of genomic DNA and contains six exons and five introns.
7209	
7210	The SGCE gene encodes the epsilon member of the sarcoglycan family, transmembrane components of the dystrophin-glycoprotein complex, which links the cytoskeleton to the extracellular matrix.[supplied by OMIM]
7211	
7212	F-box proteins are an expanding family of eukaryotic proteins characterized by an approximately 40 amino acid motif, the F box. Some F-box proteins have been shown to be critical for the ubiquitin-mediated degradation of cellular regulatory proteins. In fact, F-box proteins are one of the four subunits of ubiquitin protein ligases, called SCFs. SCF ligases bring ubiquitin conjugating enzymes to substrates that are specifically recruited by the different F-box proteins. A large family of mammalian F-box proteins has recently been identified and classified into three groups based on the presence of either the WD-40 repeats, the leucine-rich repeats, or the presence or absence of other protein-protein interacting domains. The FBXW2 gene product, the second identified member of the F-box gene family, contains multiple WD-40 repeats.
7213	Chloride channels are a diverse group of proteins that regulate fundamental cellular processes including stabilization of cell membrane potential, transepithelial transport, maintenance of intracellular pH, and regulation of cell volume. Chloride intracellular channel 3 is a member of the p64 family and is predominantly localized in the nucleus and stimulates chloride ion channel activity. In addition, this protein may participate in cellular growth control, based on its association with ERK7, a member of the MAP kinase family.
7214	The protein encoded by this gene is actually a preproprotein that is cleaved into four distinct mature peptides. One of these, glucagon, is a pancreatic hormone that counteracts the glucose-lowering action of insulin by stimulating glycogenolysis and gluconeogenesis. Glucagon is a ligand for a specific G-protein linked receptor whose signalling pathway controls cell proliferation. Two of the other peptides are secreted from gut endocrine cells and promote nutrient absorption through distinct mechanisms. Finally, the fourth peptide is similar to glicentin, an active enteroglucagon.
7215	
7216	This gene encodes a member of the CHD family of proteins which are characterized by the presence of chromo (chromatin organization modifier) domains and SNF2-related helicase/ATPase domains. This protein is one of the components of a histone deacetylase complex referred to as the Mi-2/NuRD complex which participates in the remodeling of chromatin by deacetylating histones. Chromatin remodeling is essential for many processes including transcription. Autoantibodies against this protein are found in a subset of patients with dermatomyositis. Three alternatively spliced transcripts encoding different isoforms have been described.
7217	
7218	
7219	
7220	
7221	
7222	
7223	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes a protein that is similar to one of the small subunits of TFIID, TBP-associated factor 9.
7224	This gene encodes a member of the lipid transfer/lipopolysaccharide binding protein (LT/LBP) gene family. It is highly expressed in hypertrophic tonsils. This gene and three other members of the LT/LBP gene family form a cluster on the long arm of chromosome 20.
7225	The protein encoded by this gene catalyzes the formation of inositol 1,4,5-trisphosphate and diacylglycerol from phosphatidylinositol 4,5-bisphosphate. This reaction uses calcium as a cofactor and plays an important role in the intracellular transduction of receptor-mediated tyrosine kinase activators. For example, when activated by SRC, the encoded protein causes the Ras guanine nucleotide exchange factor RasGRP1 to translocate to the Golgi, where it activates Ras. Also, this protein has been shown to be a major substrate for heparin-binding growth factor 1 (acidic fibroblast growth factor)-activated tyrosine kinase. Two transcript variants encoding different isoforms have been found for this gene.
7226	The protein encoded by this gene is found in the nucleus and may be involved in the early transcriptional regulation of patterning of the mesoderm. The encoded basic helix-loop-helix protein requires dimerization with another basic helix-loop-helix protein for efficient DNA binding.
7227	This gene is a member of the MAGEB gene family. The members of this family have their entire coding sequences located in the last exon, and the encoded proteins show 50 to 68% sequence identity to each other. The promoters and first exons of the MAGEB genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. This gene is localized in the DSS (dosage-sensitive sex reversal) critical region. It is expressed in testis and placenta, and in a significant fraction of tumors of various histological types. The MAGEB genes are clustered on chromosome Xp22-p21.
7228	
7229	
7230	SLC38A4 is found predominantly in liver and transports both cationic and neutral amino acids. The transport of cationic amino acids by SLC38A4 is Na(+) and pH independent, while the transport of neutral amino acids is Na(+) and pH dependent (Hatanaka et al., 2001).[supplied by OMIM]
7231	Focal adhesions are actin-rich structures that enable cells to adhere to the extracellular matrix and at which protein complexes involved in signal transduction assemble. Zyxin is a zinc-binding phosphoprotein that concentrates at focal adhesions and along the actin cytoskeleton. Zyxin has an N-terminal proline-rich domain and three LIM domains in its C-terminal half. The proline-rich domain may interact with SH3 domains of proteins involved in signal transduction pathways while the LIM domains are likely involved in protein-protein binding. Zyxin may function as a messenger in the signal transduction pathway that mediates adhesion-stimulated changes in gene expression and may modulate the cytoskeletal organization of actin bundles. Alternative splicing results in multiple transcript variants that encode the same isoform.
7232	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. The rat homolog is predominantly expressed in embryonic brown adipose tissue and has significant mitogenic activity, which suggests a role in proliferation of embryonic brown adipose tissue.
7233	The galectins are a family of beta-galactoside-binding proteins implicated in modulating cell-cell and cell-matrix interactions. LGALS4 is an S-type lectin that is strongly underexpressed in colorectal cancer. The 323-amino acid LGALS4 protein contains 2 homologous, approximately 150-amino acid carbohydrate recognition domains and all amino acids typically conserved in galectins.
7234	Neurexins are a family of proteins that function in the vertebrate nervous system as cell adhesion molecules and receptors. They are encoded by several unlinked genes of which two, NRXN1 and NRXN3, are among the largest known human genes. Three of the genes (NRXN1-3) utilize two alternate promoters and include numerous alternatively spliced exons to generate thousands of distinct mRNA transcripts and protein isoforms. The majority of transcripts are  produced from the upstream promoter and encode alpha-neurexin isoforms; a much smaller number of transcripts are produced from the downstream promoter and encode beta-neurexin isoforms. The alpha-neurexins contain epidermal growth factor-like (EGF-like) sequences and laminin G domains, and have been shown to interact with neurexophilins. The beta-neurexins lack EGF-like sequences and contain fewer laminin G domains than alpha-neurexins.
7235	Glutathione S-transferase (GST) theta 1 (GSTT1) is a member of a superfamily of proteins that catalyze the conjugation of reduced glutathione to a variety of electrophilic and hydrophobic compounds. Human GSTs can be divided into five main classes: alpha, mu, pi, theta, and zeta. The theta class includes GSTT1 and GSTT2. The GSTT1 and GSTT2 share 55% amino acid sequence identity and both of them were claimed to have an important role in human carcinogenesis. The GSTT1 gene is located approximately 50kb away from the GSTT2 gene. The GSTT1 and GSTT2 genes have a similar structure, being composed of five exons with identical exon/intron boundaries.
7236	
7237	Two types of spliceosomes catalyze splicing of pre-mRNAs. The major U2-type spliceosome is found in all eukaryotes and removes U2-type introns, which represent more than 99% of pre-mRNA introns. The minor U12-type spliceosome is found in some eukaryotes and removes U12-type introns, which are rare and have distinct splice consensus signals. The U12-type spliceosome consists of several small nuclear RNAs and associated proteins. This gene encodes a 65K protein that is a component of the U12-type spliceosome. This protein contains two RNA recognition motifs (RRMs), suggesting that it may contact one of the small nuclear RNAs of the minor spliceosome.
7238	
7239	
7240	
7241	
7242	
7243	
7244	The protein encoded by this gene is a member of the interleukin 1 receptor family. This protein is similar to the interleukin 1 accessory proteins, and is most closely related to interleukin 1 receptor accessory protein-like 1 (IL1RAPL1). This gene and IL1RAPL1 are located at a region on chromosome X that is associated with X-linked non-syndromic mental retardation.
7245	
7246	
7247	
7248	
7249	
7250	
7251	
7252	The enzyme encoded by this gene is responsible for the catalysis of the 5-beta-reduction of bile acid intermediates and steroid hormones carrying a delta(4)-3-one structure.  Deficiency of this enzyme may contribute to hepatic dysfunction.
7253	
7254	
7255	
7256	This gene was identified by isolating cDNA from a size-fractionated library derived from brain in an attempt to characterize genes encoding large proteins. The function of the protein encoded by this gene has not yet been determined.
7257	
7258	The protein encoded by this gene is one of two human homologs of Saccharomyces cerevisiae Rad23, a protein involved in the nucleotide excision repair (NER). This protein was found to be a component of the protein complex that specifically complements the NER defect of xeroderma pigmentosum group C (XP-c) cell extracts in vitro. This protein was also shown to interact with, and elevate the nucleotide excision activity of 3-methyladenine-DNA glycosylase (MPG), which suggested a role in DNA damage recognition in base excision repair. This protein contains an N-terminal ubiquitin-like domain, which was reported to interact with 26S proteasome, and thus this protein may be involved in the ubiquitin mediated proteolytic pathway in cells.
7259	
7260	
7261	
7262	
7263	
7264	This gene product is a member of the Vanin family of proteins which share extensive sequence similarity with each other, and also with biotinidase. The family includes secreted and membrane-associated proteins, a few of which have been reported to participate in hematopoietic cell trafficking. No biotinidase activity has been demonstrated for any of the vanin proteins, however, they possess pantetheinase activity, which may play a role in oxidative-stress response. This protein, like its mouse homolog, is likely a GPI-anchored cell surface molecule. The mouse protein is expressed by the perivascular thymic stromal cells and regulates migration of T-cell progenitors to the thymus. This gene lies in close proximity to, and in same transcriptional orientation as two other vanin genes on chromosome 6q23-q24.
7265	
7266	
7267	
7268	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. This protein plays a role in hair, blood vessel, and lymphatic vessel development. Mutations in this gene have been associated with recessive and dominant forms of hypotrichosis-lymphedema-telangiectasia.
7269	The protein encoded by this gene hydrolyzes cerebroside sulfate to cerebroside and sulfate. Defects in this gene lead to metachromatic leucodystrophy (MLD), a progressive demyelination disease which results in a variety of neurological symptoms and ultimately death.
7270	
7271	
7272	
7273	
7274	
7275	
7276	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. The specific function of this protein has not yet been determined; however, this protein may play a role in cellular detoxification as a pump for its substrate, organic anions.
7277	This gene encodes a protein that is involved in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is a glycolipid which contains three mannose molecules in its core backbone. The GPI-anchor is found on many blood cells and serves to anchor proteins to the cell surface. This protein and another GPI synthesis protein, PIGO, function in the transfer of ethanolaminephosphate (EtNP) to the third mannose in GPI. At least two alternatively spliced transcripts encoding distinct isoforms have been found for this gene.
7278	
7279	
7280	
7281	
7282	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptor is activated by tumor necrosis factor-related apoptosis inducing ligand (TNFSF10/TRAIL), and thus transduces cell death signal and induces cell apoptosis. Studies with FADD-deficient mice suggested that FADD, a death domain containing adaptor protein, is required for the apoptosis mediated by this protein.
7283	This gene was identified by the up-regulation of its expression upon Epstein-Barr virus infection of primary B lymphocytes. This gene is predicted to encode a G protein-coupled receptor that is most closely related to the thrombin receptor. Expression of this gene was detected in B-lymphocyte cell lines and lymphoid tissues but not in T-lymphocyte cell lines or peripheral blood T lymphocytes. The function of this gene is unknown.
7284	
7285	
7286	
7287	
7288	
7289	The protein encoded by this gene is one of the signaling and transforming proteins containing Src homology 2 and 3 (SH2 and SH3) domains. It is located in the cytoplasm and is an adaptor protein involved in transducing signals from receptor tyrosine kinases to downstream signal recipients such as RAS.
7290	
7291	The protein encoded by this gene is a DNA ligase that joins single-strand breaks in a double-stranded polydeoxynucleotide in an ATP-dependent reaction. This protein is essential for V(D)J recombination and DNA double-strand break (DSB) repair through nonhomologous end joining (NHEJ). This protein forms a complex with the X-ray repair cross complementing protein 4 (XRCC4), and further interacts with the DNA-dependent protein kinase (DNA-PK). Both XRCC4 and DNA-PK are known to be required for NHEJ. The crystal structure of the complex formed by this protein and XRCC4 has been resolved. Defects in this gene are the cause of LIG4 syndrome. Alternatively spliced transcript variants encoding the same protein have been observed.
7292	
7293	
7294	
7295	
7296	This gene encodes a glycosylated transmembrane protein that is cleaved to form a mature, secreted protein. The N-terminus of the precursor protein shares characteristics with other surfactant proteins and is sometimes called chondrosurfactant protein although no biological activity has yet been defined for it. The C-terminus of the precursor protein contains a 25 kDa mature protein called leukocyte cell-derived chemotaxin-1 or chondromodulin-1. The mature protein promotes chondrocyte growth and inhibits angiogenesis. This gene is expressed in the avascular zone of prehypertrophic cartilage and its expression decreases during chondrocyte hypertrophy and vascular invasion. The mature protein likely plays a role in endochondral bone development by permitting cartilaginous anlagen to be vascularized and replaced by bone. It may be involved also in the broad control of tissue vascularization during development. Alternative splicing results in multiple transcript variants encoding different isoforms.
7297	The inner- and outer-arm dyneins, which bridge between the doublet microtubules in axonemes, are the force-generating proteins responsible for the sliding movement in axonemes. The intermediate and light chains, thought to form the base of the dynein arm, help mediate attachment and may also participate in regulating dynein activity. This gene encodes an intermediate chain dynein, belonging to the large family of motor proteins. Mutations in this gene result in abnormal ciliary ultrastructure and function associated with primary ciliary dyskinesia (PCD) and Kartagener syndrome.
7298	
7299	
7300	Myosin VI is a molecular motor involved in intracellular vesicle and organelle transport. It is one of the so-called unconventional myosins.[supplied by OMIM]
7301	
7302	
7303	
7304	This gene encodes an isoform of the alpha subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This isoform (alpha 2) is expressed in brain, heart and skeletal muscle, and the other isoform (alpha 1) is expressed in brain, placenta, lung, liver, kidney, and pancreas. This gene may be critical in the development of ovarian cancer.
7305	
7306	
7307	
7308	
7309	Arylsulfatase E is a member of the sulfatase family.  ARSE is glycosylated postranslationally and localized to the golgi apparatus. Sulfatases are essential for the correct composition of bone and cartilage matrix.  X-linked chondrodysplasia punctata (CDPX), a disease characterized by abnormalities in cartilage and bone development, has been linked to mutations in this gene.
7310	This gene encodes a protein whose function has been linked to hair growth. A similar protein in rat functions as a transcriptional corepressor for thyroid hormone and interacts with histone deacetylases. Mutations in this gene have been documented in cases of autosomal recessive congenital alopecia and atrichia with papular lesions.
7311	
7312	
7313	
7314	
7315	
7316	The protein encoded by this gene is a member of the Serine/Threonine protein kinase family. This kinase contains multiple functional domains. Its kinase domain is highly similar to that of the myotonic dystrophy protein kinase (DMPK). This kinase also contains a Rac interactive binding (CRIB) domain, and has been shown to bind CDC42. It may function as a CDC42 downstream effector mediating CDC42 induced peripheral actin formation, and promoting cytoskeletal reorganization. Multiple alternatively spliced transcript variants have been described, and the full-length nature of two of them has been reported.
7317	
7318	The protein encoded by this gene interacts with the Von Hippel-Lindau protein to form an intracellular complex.  Because it functions as a chaperone protein, it is suspected that it may play a role in the transport of the Von Hippel-Lindau protein from the perinuclear granules to the nucleus or cytoplasm.
7319	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class and it is upregulated in nasopharyngeal carcinoma.
7320	
7321	
7322	
7323	This gene encodes a protein which contains a conserved nuclear localization signal. The encoded protein may have a role in spermatogenesis, cell cycle control, and in meiotic cell division.
7324	
7325	
7326	
7327	
7328	
7329	This gene belongs to the chemokine-like factor gene superfamily, a novel superfamily that links the chemokine gene family and the transmembrane 4 superfamily. This gene is one of several chemokine-like factor genes located in a cluster on chromosome 3. The protein encoded by this gene is highly expressed in leukocytes, but its exact function is unknown. Alternatively spliced transcript variants encoding different isoforms have been identified.
7330	This encoded protein is thought to be a secreted protein belonging to the disintegrin metalloproteinase family. Its expression is upregulated during dendritic cells maturation. This protein may play an important role in dendritic cell function and their interactions with germinal center T cells.
7331	The product of this gene belongs to the phosphatase 2A regulatory subunit B family. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes an alpha isoform of the regulatory subunit B56 subfamily.
7332	
7333	
7334	
7335	
7336	
7337	
7338	
7339	
7340	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene is a candidate tumor suppressor gene and located in the critical region of loss of heterozygosity (LOH). It encodes a DEAD box protein, which is highly conserved in evolution, indicating an important cellular function.
7341	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Sequence analysis identified two transcript variants.
7342	
7343	
7344	
7345	
7346	
7347	
7348	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a membrane protein which has sequence similarity to the KCNE1 gene product, a member of the potassium channel, voltage-gated, isk-related subfamily. This intronless gene is deleted in AMME contiguous gene syndrome and may be involved in the cardiac and neurologic abnormalities found in the AMME contiguous gene syndrome.
7349	The neuropeptide galanin modulates a variety of physiologic processes including cognition/memory, sensory/pain processing, hormone secretion, and feeding behavior.  The human galanin receptors are G protein-coupled receptors that functionally couple to their intracellular effector through distinct signaling pathways.  GALR3 is found in many tissues and may be expressed as 1.4-, 2.4-, and 5-kb transcripts
7350	Apurinic/apyrimidinic (AP) sites occur frequently in DNA molecules by spontaneous hydrolysis, by DNA damaging agents or by DNA glycosylases that remove specific abnormal bases. AP sites are pre-mutagenic lesions that can prevent normal DNA replication so the cell contains systems to identify and repair such sites. Class II AP endonucleases cleave the phosphodiester backbone 5@#$%& to the AP site. This gene encodes a protein shown to have a weak class II AP endonuclease activity. Most of the encoded protein is located in the nucleus but some is also present in mitochondria. This protein may play an important role in both nuclear and mitochondrial base excision repair (BER).
7351	The minichromosome maintenance protein 3 (MCM3) is one of the MCM proteins essential for the initiation of DNA replication. The protein encoded by this gene is a MCM3 binding protein. It was reported to have phosphorylation-dependent DNA-primase activity, which was up-regulated in antigen immunization induced germinal center. This protein was demonstrated to be an acetyltransferase that acetylates MCM3 and plays a role in DNA replication. The mutagenesis of a nuclear localization signal of MCM3 affects the binding of this protein with MCM3, suggesting that this protein may also facilitate MCM3 nuclear localization.
7352	The protein encoded by this gene is a member of the SWI/SNF family of proteins. Members of this family have helicase and ATPase activities and are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The protein encoded by this gene is a component of the chromatin remodeling and spacing factor RSF, a facilitator of the transcription of class II genes by RNA polymerase II. The encoded protein is similar in sequence to the Drosophila ISWI chromatin remodeling protein.
7353	
7354	The protein encoded by this gene is a cytoplasmic enzyme involved in energy homeostasis. The encoded protein reversibly catalyzes the transfer of phosphate between ATP and various phosphogens such as creatine phosphate. It acts as a homodimer in brain as well as in other tissues, and as a heterodimer with a similar muscle isozyme in heart. The encoded protein is a member of the ATP:guanido phosphotransferase protein family. A pseudogene of this gene has been characterized.
7355	GABA is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA receptors, which are ligand-gated chloride channels. GABRR1 is a member of the rho subunit family.
7356	
7357	The nuclear import of the spliceosomal snRNPs U1, U2, U4 and U5, is dependent on the presence of a complex nuclear localization signal. The latter is composed of the 5@#$%&-2,2,7-terminal trimethylguanosine (m3G) cap structure of the U snRNA and the Sm core domain. The protein encoded by this gene interacts specifically with m3G-cap and functions as an snRNP-specific nuclear import receptor.
7358	This gene encodes a membrane glycoprotein found on peripheral blood cells, glomerular podocytes, and follicular dendritic cells. The protein is a receptor for complement components C3b and C4b and regulates the activity of the complement cascade. Variation in this protein is the basis of the Knops blood group system. The two most common alleles, F and S, differ by 8 exons and are thought to be the result of an unequal crossover event. A secreted form of the protein present in plasma has been described, but its full length nature has not been determined.
7359	
7360	RHD11, a member of the short-chain dehydrogenase/reductase (SDR) superfamily of oxidoreductases, is expressed at high levels in prostate epithelium, and its expression is regulated by androgens.[supplied by OMIM]
7361	Exportin-5 belongs to a large family of karyopherins (see MIM 602738) that mediate the transport of proteins and other cargo between the nuclear and cytoplasmic compartments.[supplied by OMIM]
7362	
7363	
7364	This gene encodes a mitogen-activated kinase which localizes to the nucleus. The gene maps to the major histocompatability complex (MHC) class II region on chromosome 6p21.3 but sequence comparison suggests that the protein is not involved in the immune response. Homology to the Drosophila gene female sterile homeotic suggests that this human gene may be part of a signal transduction pathway involved in growth control.
7365	
7366	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L24E family of ribosomal proteins. It is located in the cytoplasm. This gene has been referred to as ribosomal protein L30 because the encoded protein shares amino acid identity with the L30 ribosomal proteins from S. cerevisiae; however, its official name is ribosomal protein L24. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
7367	The predicted 218-amino acid protein contains a bHLH motif, a putative nuclear localization signal, a glycine-rich region, and a stretch of acidic residues. MSC is capable of binding an E-box element either as a homodimer or as a heterodimer with E2A in vitro, and forms heterodimers with E2A proteins in vivo. MSC contains a transcriptional repression domain and is capable of inhibiting the transactivation capability of E47, an E2A protein, in mammalian cells. MSC is a downstream target of the B-cell receptor signal transduction pathway.
7368	
7369	This gene appears to be an evolving pseudogene of L-threonine 3-dehydrogenase (TDH). In both prokaryotes and eukaryotes, TDH catalyzes the first of two steps in one of two L-threonine degradation pathways. However, in human, the single gene with sequence similarity to TDH is not capable of encoding a functional TDH protein; the predicted protein lacks most of the C-terminus and parts of the NAD+ binding motif when compared to other species@#$%& TDH proteins. This suggests that the human gene is therefore a pseudogene. Transcripts of this gene are found in all tissues and alternatively spliced transcripts have been described. It is not known if these transcripts are translated, or if the possible protein product provides any functional role.
7370	
7371	Acylphosphatase can hydrolyze the phosphoenzyme intermediate of different membrane pumps, particularly the Ca2+/Mg2+-ATPase from sarcoplasmic reticulum of skeletal muscle. Two isoenzymes have been isolated, called muscle acylphosphatase and erythrocyte acylphosphatase on the basis of their tissue localization. This gene encodes the muscle-type isoform (MT). An increase of the MT isoform is associated with muscle differentiation.
7372	The protein encoded by this gene belongs to the subtilisin-like proprotein convertase family. The members of this family are proprotein convertases that process latent precursor proteins into their biologically active products. This encoded protein mediates posttranslational endoproteolytic processing for several integrin alpha subunits. It is thought to process prorenin, pro-membrane type-1 matrix metalloproteinase and HIV-1 glycoprotein gp160. Two alternatively spliced transcripts are described for this gene but only one has its full length nature known.
7373	G proteins mediate a number of cellular processes. The protein encoded by this gene belongs to the RGS (regulators of G-protein signaling) family and specifically interacts with G protein, GAI3. This protein is a guanosine triphosphatase-activating protein that functions to down-regulate Galpha i/Galpha q-linked signaling.
7374	The protein encoded by this gene belongs to the RAD/GEM family of GTP-binding proteins. It is associated with the inner face of the plasma membrane and could play a role as a regulatory protein in receptor-mediated signal transduction. Alternative splicing occurs at this locus and two transcript variants encoding the same protein have been identified.
7375	
7376	
7377	
7378	The nuclear pore complex is a massive structure that extends across the nuclear envelope, forming a gateway that regulates the flow of macromolecules between the nucleus and the cytoplasm. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells. This gene is a member of the FG-repeat-containing nucleoporins. The protein encoded by this gene is localized to the cytoplasmic face of the nuclear pore complex where it is required for proper cell cycle progression and nucleocytoplasmic transport. The 3@#$%& portion of this gene forms a fusion gene with the DEK gene on chromosome 6 in a t(6,9) translocation associated with acute myeloid leukemia and myelodysplastic syndrome.
7379	
7380	Fructose-1,6-bisphosphatase 1, a gluconeogenesis regulatory enzyme, catalyzes the hydrolysis of fructose 1,6-bisphosphate to fructose 6-phosphate and inorganic phosphate. Fructose-1,6-diphosphatase deficiency is associated with hypoglycemia and metabolic acidosis.
7381	
7382	
7383	
7384	The enzyme system for cleavage of glycine (glycine cleavage system; EC 2.1.2.10), which is confined to the mitochondria, is composed of 4 protein components: P protein (a pyridoxal phosphate-dependent glycine decarboxylase; MIM 238300), H protein (a lipoic acid-containing protein; MIM 238330), T protein (a tetrahydrofolate-requiring enzyme), and L protein (a lipoamide dehydrogenase; MIM 238331). Glycine encephalopathy (GCE; MIM 605899) may be due to a defect in any one of these enzymes.[supplied by OMIM]
7385	
7386	Complex I, or NADH:ubiquinone oxidoreductase, the first multisubunit enzyme complex of the mitochondrial respiratory chain, plays a vital role in cellular ATP production, the primary source of energy for many crucial processes in living cells. It removes electrons from NADH and passes them by a series of different protein-coupled redox centers to the electron acceptor ubiquinone. In well-coupled mitochondria, the electron flux leads to ATP generation via the building of a proton gradient across the inner membrane. Complex I is composed of at least 41 subunits, of which 7 are encoded by the mitochondrial genome (ND1-6, ND4L) and the remainder by nuclear genes (van den Heuvel et al., 1998).[supplied by OMIM]
7387	
7388	
7389	
7390	
7391	This gene encodes a member of the guanine nucleotide-binding protein (G protein)-coupled receptor kinase subfamily of the Ser/Thr protein kinase family. The protein phosphorylates the activated forms of G protein-coupled receptors thus initiating their deactivation. Several transcript variants encoding different isoforms have been described for this gene.
7392	Members of the trefoil family are characterized by having at least one copy of the trefoil motif, a 40-amino acid domain that contains three conserved disulfides. They are stable secretory proteins expressed in gastrointestinal mucosa. Their functions are not defined, but they may protect the mucosa from insults, stabilize the mucus layer and affect healing of the epithelium. This gene is expressed in goblet cells of the intestines and colon. This gene and two other related trefoil family member genes are found in a cluster on chromosome 21.
7393	
7394	
7395	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the peptidase T1A family, that is a 20S core alpha subunit. A pseudogene has been identified on the Y chromosome.
7396	
7397	
7398	
7399	
7400	
7401	
7402	
7403	RDHE2 belongs to a family of short-chain alcohol dehydrogenases/reductases that catalyze the first and rate-limiting step that generates retinaldehyde from retinol (Matsuzaka et al., 2002).[supplied by OMIM]
7404	
7405	This gene encodes the acidic form of complement factor 4, part of the classical activation pathway. The protein is expressed as a single chain precursor which is proteolytically cleaved into a trimer of alpha, beta, and gamma chains prior to secretion. The trimer provides a surface for interaction between the antigen-antibody complex and other complement components. The alpha chain may be cleaved to release C4 anaphylatoxin, a mediator of local inflammation. Deficiency of this protein is associated with systemic lupus erythematosus and type I diabetes mellitus. This gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6. Varying haplotypes of this gene cluster exist, such that individuals may have 1, 2, or 3 copies of this gene.
7406	
7407	The protein encoded by this gene stimulates guanine nucleotide exchange on ARF1 and Rab proteins. This protein is throught to be involved in membrane transport processes
7408	MICA encodes the higly polymorphic MHC (HLA) class I chain-related gene A. The protein product is expressed on the cell surface, although unlike canonical class I molecules does not seem to associate with beta-2-microglobulin. It is thought that MICA functions as a stress-induced antigen that is broadly recognized by intestinal epithelial gamma delta T cells.
7409	The protein encoded by this gene is a retinoblastoma binding protein that may play a role in the regulation of cell proliferation and differentiation. Two alternatively spliced transcript variants of this gene encoding an identical protein have been reported.
7410	
7411	
7412	
7413	
7414	
7415	
7416	
7417	The protein encoded by this gene is a member of the calcium-binding protein family.  The specific function of this protein has not yet been determined; however this protein is known to interact with DNA-dependent protein kinase and may play a role in kinase-phosphatase regulation of DNA end joining.  This protein also interacts with integrin alpha(IIb)beta(3), which may implicate this protein as a regulatory molecule for alpha(IIb)beta(3).
7418	RBP1 is the carrier protein involved in the transport of retinol (vitamin A alcohol) from the liver storage site to peripheral tissue. Vitamin A is a fat-soluble vitamin necessary for growth, reproduction, differentiation of epithelial tissues, and vision. The gene harbors four exons encoding 24, 59, 33, and 16 amino acid residues respectively. The second intervening sequence alone occupies 19 kb of the 21 kb of the gene.
7419	
7420	This gene encodes a member of the Ly-6/neurotoxin gene family, a group of lymphocyte antigens that attach to the cell surface by a glycosylphosphatidylinositol anchor and have a unique structure showing conserved 8-10 cysteine residues with a characteristic spacing pattern. Functional analysis indicates that this protein is not a ligand or neurotransmitter but has the capacity to enhance nicotinic acetylcholine receptor function in the presence of acetylcholine. This gene may also play a role in the pathogenesis of psoriasis vulgaris. Alternatively spliced variants encoding different isoforms have been identified.
7421	
7422	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This endoplasmic reticulum protein participates in the synthesis of cholesterol by catalyzing the removal of the 14alpha-methyl group from lanosterol. Homologous genes are found in all three eukaryotic phyla, fungi, plants, and animals, suggesting that this is one of the oldest cytochrome P450 genes.
7423	
7424	
7425	
7426	
7427	
7428	
7429	
7430	This gene encodes an enzyme involved in the catabolism of choline, catalyzing the oxidative demethylation of dimethylglycine to form sarcosine. The enzyme is found as a monomer in the mitochondrial matrix, and uses flavin adenine dinucleotide and folate as cofactors. Mutation in this gene causes dimethylglycine dehydrogenase deficiency, characterized by a fishlike body odor, chronic muscle fatigue, and elevated levels of the muscle form of creatine kinase in serum.
7431	
7432	This gene encodes a molecular chaperone that is member of the chaperonin containing TCP1 complex (CCT), also known as the TCP1 ring complex (TRiC). This complex consists of two identical stacked rings, each containing eight different proteins. Unfolded polypeptides enter the central cavity of the complex and are folded in an ATP-dependent manner. The complex folds various proteins, including actin and tubulin. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
7433	DAB2 mRNA is expressed in normal ovarian epithelial cells but is down-regulated or absent from ovarian carcinoma cell lines. The 770-amino acid predicted protein has an overall 83% identity with the mouse p96 protein, a putative mitogen-responsive phosphoprotein; homology is strongest in the amino-terminal end of the protein in a region corresponding to the phosphotyrosine interaction domain.  The down-regulation of DAB2 may play an important role in ovarian carcinogenesis. This gene was initially named DOC2 (for Differentially expressed in Ovarian Cancer) and is distinct from the DOC2A and DOC2B genes (for double C2-like domains, alpha and beta).
7434	
7435	The protein encoded by this gene is a member of the MAP kinase family. MAP kinases, also known as extracellular signal-regulated kinases (ERKs), act as an integration point for multiple biochemical signals, and are involved in a wide variety of cellular processes such as proliferation, differentiation, transcription regulation and development. The activation of this kinase requires its phosphorylation by upstream kinases. Upon activation, this kinase translocates to the nucleus of the stimulated cells, where it phosphorylates nuclear targets. Two alternatively spliced transcript variants encoding the same protein, but differing in the UTRs, have been reported for this gene.
7436	
7437	This gene product is a highly conserved protein associated with the centromere. It is a DNA-binding protein containing a helix-loop-helix DNA binding motif at the N-terminus, and a dimerization domain at the C-terminus. The DNA binding domain recognizes and binds a 17-bp sequence (CENP-B box) in the centromeric satellite DNA. This protein is proposed to play an important role in the assembly of specific centromere structures in interphase nuclei and on mitotic chromosomes. It is also considered a major centromere autoantigen recognized by sera from patients with anti-centromere antibodies.
7438	The protein encoded by this gene is a deubiquitinating enzyme that is phosphorylated at the onset of mitosis and then dephosphorylated at the metaphase/anaphase transition. A mutant form of the encoded protein was shown to block cell division, and it may be that this enzyme acts on H2A and H2B, the major ubiquitinated proteins of chromatin. Two transcript variants encoding different isoforms have been found for this gene.
7439	
7440	
7441	
7442	
7443	
7444	This gene encodes a member of the aldo/keto reductase superfamily, which consists of more than 40 known enzymes and proteins. These enzymes catalyze the conversion of aldehydes and ketones to their corresponding alcohols by utilizing NADH and/or NADPH as cofactors. The enzymes display overlapping but distinct substrate specificity. This enzyme binds bile acid with high affinity, and shows minimal 3-alpha-hydroxysteroid dehydrogenase activity. This gene shares high sequence identity with three other gene members and is clustered with those three genes at chromosome 10p15-p14.
7445	SUMO1 (UBL1; MIM 601912) is a small ubiquitin-like protein that can be covalently conjugated to other proteins. SENP2 is one of a group of enzymes that process newly synthesized SUMO1 into the conjugatable form and catalyze the deconjugation of SUMO1-containing species.[supplied by OMIM]
7446	This gene encodes a member of the DNase family. The protein hydrolyzes DNA, is not inhibited by actin, and mediates the breakdown of DNA during apoptosis. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
7447	
7448	This gene encodes one of the myosin light chains, a component of the hexameric ATPase cellular motor protein myosin. Myosin is composed of two heavy chains, two nonphosphorylatable alkali light chains, and two phosphorylatable regulatory light chains. This gene product, one of the regulatory light chains, is expressed in fetal muscle and in adult retina, cerebellum, and basal ganglia.
7449	This gene encodes a zinc finger protein that contains a jumonji domain.
7450	
7451	
7452	
7453	This gene encodes an enzyme that converts lysophosphatidic acid (LPA) into phosphatidic acid (PA). LPA and PA are two phospholipids involved in signal transduction and in lipid biosynthesis in cells. This enzyme localizes to the endoplasmic reticulum. This gene is located in the class III region of the human major histocompatibility complex. Alternative splicing results in two transcript variants encoding the same protein.
7454	
7455	
7456	
7457	This gene was identified in the neuroblastoma tumour suppressor candidate region on chromosome 1p36. It contains a TFIID-31 domain, similar to that found in TATA box-binding protein-associated factor, TAF(II)31, which is required for p53-mediated transcription activation. This gene was expressed at very low levels in neuroblastoma tumours, and was shown to reduce cell growth in neuroblastoma cells, suggesting that it may have a role in a cell death pathway. Two alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
7458	This gene encodes a member of the aldo/keto reductase superfamily, which consists of more than 40 known enzymes and proteins. This member catalyzes the reduction of a number of aldehydes, including the aldehyde form of glucose, and is thereby implicated in the development of diabetic complications by catalyzing the reduction of glucose to sorbitol. There are a few putative pseudogenes for this gene, and one of them has been confirmed and mapped to chromosome 3.
7459	
7460	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene has sequence homology to members of the histone deacetylase family. Two alternatively spliced transcript variants have been observed for the gene. This gene is orthologous to mouse HDAC7 gene whose protein promotes repression mediated via transcriptional corepressor SMRT.
7461	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. This protein functions in the cellular export of its substrate, cyclic nucleotides. This export contributes to the degradation of phosphodiesterases and possibly an elimination pathway for cyclic nucleotides. Studies show that this protein provides resistance to thiopurine anticancer drugs, 6-mercatopurine and thioguanine, and the anti-HIV drug 9-(2-phosphonylmethoxyethyl)adenine. This protein may be involved in resistance to thiopurines in acute lymphoblastic leukemia and antiretroviral nucleoside analogs in HIV-infected patients. Alternative splicing of this gene has been detected; however, the complete sequence and translation initiation site is unclear.
7462	This gene encodes the large subunit of DNA damage-binding protein which is a heterodimer composed of a large and a small subunit. This protein functions in nucleotide-excision repair. Its defective activity causes the repair defect in the patients with xeroderma pigmentosum complementation group E (XPE). However, it remains for mutation analysis to demonstrate whether the defect in XPE patients is in this gene or the gene encoding the small subunit. In addition, Best vitelliform mascular dystrophy is mapped to the same region as this gene on 11q, but no sequence alternations of this gene are demonstrated in Best disease patients.
7463	The protein encoded by this gene is a growth-related cell surface protein. It was identifed because it reacts with the monoclonal antibody KI in cells, such as the ovarian carcinoma line OVCAR-3, also expressing the CAKI surface glycoprotein. The encoded protein has two enzymatic activities: catalysis of hydroquinone or NADH oxidation, and protein disulfide interchange. The two activities alternate with a period length of about 24 minutes. The encoded protein also displays prion-like properties. Two transcript variants encoding different isoforms have been found for this gene.
7464	This intronless gene belongs to a family of RAS-related genes. The proteins encoded by these genes share approximately 50% amino acid identity with the classical RAS proteins and have numerous structural features in common. The most striking difference between the RAP and RAS proteins resides in their 61st amino acid: glutamine in RAS is replaced by threonine in RAP proteins. Evidence suggests that this protein may be polyisoprenylated and palmitoylated.
7465	
7466	GPCR41 and GPCR42 (MIM 607883) act as receptors for porcine endogenous retrovirus subgroup A (PERV-A).[supplied by OMIM]
7467	
7468	
7469	
7470	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S17P family. The encoded protein is moderately conserved between human mitochondrial and prokaryotic ribosomal proteins. Pseudogenes corresponding to this gene are found on chromosomes 1p, 3p, 6q, 14p, 18q, and Xq.
7471	
7472	
7473	
7474	
7475	
7476	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It encodes a protein which shows 99% amino acid identity to the mouse Wnt7A protein. This gene not only guides the development of the anterior-posterior axis in the female reproductive tract, but also plays a critical role in uterine smooth muscle pattering and maintenance of adult uterine function. It is also responsive to changes in the levels of sex steriod hormone in the female reproductive tract. Decreased expression of this gene in human uterine leiomyoma is found to be inversely associated with the expression of estrogen receptor-alpha.
7477	
7478	This gene belongs to the amiloride-sensitive Na+ channel and degenerin (NaC/DEG) family, members of which have been identified in many animal species ranging from the nematode to human. The amiloride-sensitive Na(+) channel encoded by this gene is primarily expressed in the small intestine, however, its exact function is not known.
7479	The CD133 gene codes for a pentaspan transmembrane glycoprotein. The CD133 antigen appears to belong to a new molecular family of 5-TM proteins, as the characterization of the CD133 antigen and prominin in the mouse were the first descriptions of a 5-TM glycoprotein structure. This @#$%!family@#$%! includes members from several different species (which may be homologs) including human, mouse, rat, fly, and worm. The 5-TM structure includes an extracellular N-terminus, two short intracellular loops, two large extracellular loops and an intracellular C-terminus CD133 was initially shown to be expressed on primitive hematopoietic stem and progenitor cells and retinoblastoma. CD133 has since been shown to be expressed on hemangioblasts, and neural stem cells as well as on developing epithelium. Expression patterns for CD133 generally mimic those of the murine prominin molecule, although CD133 antigen has not yet been demonstrated on adult epithelial tissue. The CD133 positive fraction of human bone marrow, cord blood and peripheral blood have been shown to efficiently engraft in xenotransplantation models, and have been shown to contain the majority of the granulocyte/macrophage precursors, NOD/SCID repopulating cells and CD34 + dendritic cell precursors. Phenotypically, CD133 positive cells in blood and marrow are CD34 bright, with CD34 dim CD71 bright cells being negative for CD133 expression. Many leukemias express CD133 as well as CD34 , but some investigators have noted leukemic blasts which are CD133+ and CD34 negative. No natural ligand has yet been demonstrated for the CD133 molecule, and its function in hematopoietic tissue is unknown. However, Maw and colleagues have described a mutation in the CD133 gene, which is responsible for retinal degeneration in an Indian pedigree. [PROW]
7480	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily H. This member is a pore-forming (alpha) subunit of a voltage-gated non-inactivating delayed rectifier potassium channel. This gene is not expressed in differentiating myoblasts. Alternative splicing results in three transcript variants encoding distinct isoforms.
7481	The nuclear lamina consists of a two-dimensional matrix of proteins located next to the inner nuclear membrane. The lamin family of proteins make up the matrix and are highly conserved in evolution. During mitosis, the lamina matrix is reversibly disassembled as the lamin proteins are phosphorylated. Lamin proteins are thought to be involved in nuclear stability, chromatin structure and gene expression. Vertebrate lamins consist of two types, A and B. This gene encodes one of the two B type proteins, B1.
7482	
7483	This gene encodes a G-protein coupled receptor for melanocyte-stimulating hormone and adrenocorticotropic hormone that is expressed in tissues other than the adrenal cortex and melanocytes. This gene maps to the same region as the locus for benign neonatal epilepsy. Mice deficient for this gene have increased fat mass despite decreased food intake suggesting a role for this gene product in the regulation of energy homeostasis.
7484	
7485	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator. Two transcripts encoding different isoforms have been described. Pseudogenes have been identified on chromosomes 3 and 20.
7486	
7487	
7488	
7489	Tyrosine hydroxylase (EC 1.14.16.2) is involved in the conversion of phenylalanine to dopamine. As the rate-limiting enzyme in the synthesis of catecholamines, tyrosine hydroxylase has a key role in the physiology of adrenergic neurons.[supplied by OMIM]
7490	The protein encoded by this gene is one of the three catalytic subunits of protein phosphatase 1 (PP1). PP1 is a serine/threonine specific protein phosphatase known to be involved in the regulation of a variety of cellular processes, such as cell division, glycogen metabolism, muscle contractility, protein synthesis, and HIV-1 viral transcription. Mouse studies suggest that PP1 functions as a suppressor of learning and memory. At least three alternatively spliced transcript variants encoding two distinct isoforms have been observed.
7491	
7492	This gene belongs to subfamily P of the forkhead box (FOX) transcription factor family. Forkhead box transcription factors play important roles in the regulation of tissue- and cell type-specific gene transcription during both development and adulthood. Many members of the forkhead box gene family, including members of subfamily P, have roles in mammalian oncogenesis. This gene may play a role in the development of tumors of the kidney and larynx. Alternative splicing of this gene produces multiple transcript variants, some encoding different isoforms.
7493	Aminoacyl-tRNA synthetases catalyze the aminoacylation of tRNA by their cognate amino acid. Because of their central role in linking amino acids with nucleotide triplets contained in tRNAs, aminoacyl-tRNA synthetases are thought to be among the first proteins that appeared in evolution. VARS2 belongs to class-I aminoacyl-tRNA synthetase family and is located in the class III region of the major histocompatibility complex.
7494	The protein encoded by this gene contains multiple N-terminal WD40 domains and a C-terminal high mobility group (HMG) box. WD40 domains are found in a variety of eukaryotic proteins and may function as adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly. HMG boxes are found in many eukaryotic proteins involved in chromatin assembly, transcription and replication. Alternative splicing results in two transcript variants encoding different isoforms.
7495	
7496	HLA-DPB belongs to the HLA class II beta chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DPA) and a beta chain (DPB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa and its gene contains 6 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DP molecule both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to 4 different molecules.
7497	
7498	
7499	
7500	This gene encodes a protein localized to the nucleus in proliferating cells. Expression of this gene is modulated by cytokines in natural killer and T cells. The gene product is thought to play a role in apoptosis but the specific role has not yet been determined. Two transcripts encoding different isoforms have been identified.
7501	The protein encoded by this gene catalyzes the conversion of 10-formyltetrahydrofolate, NADP, and water to tetrahydrofolate, NADPH, and carbon dioxide. The encoded protein belongs to the aldehyde dehydrogenase family and is responsible for formate oxidation in vivo. Deficiencies in this gene can result in an accumulation of formate and subsequent methanol poisoning.
7502	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. The function of this protein has not been determined. This gene results in three transcript variants encoding two distinct isoforms.
7503	The giant protein titin, together with its associated proteins, interconnects the major structure of sarcomeres, the M bands and Z discs. The C-terminal end of the titin string extends into the M line, where it binds tightly to M-band constituents of apparent molecular masses of 190 kD and 165 kD. The predicted MYOM2 protein contains 1,465 amino acids. Like MYOM1, MYOM2 has a unique N-terminal domain followed by 12 repeat domains with strong homology to either fibronectin type III or immunoglobulin C2 domains. Protein sequence comparisons suggested that the MYOM2 protein and bovine M protein are identical.
7504	
7505	
7506	Aminoacyl-tRNA synthetases catalyze the aminoacylation of tRNA by their cognate amino acid. Because of their central role in linking amino acids with nucleotide triplets contained in tRNAS, aminoacyl-tRNA synthetases are thought to be among the first proteins that appeared in evolution. Isoleucine-tRNA synthetase belongs to the class-I aminoacyl-tRNA synthetase family and has been identified as a target of autoantibodies in the autoimmune disease polymyositis/dermatomyositis. Two alternatively spliced variants have been isolated that differ in the presence or absence of an unspliced intron in the 5@#$%& untranslated region.
7507	The Golgi apparatus, which participates in glycosylation and transport of proteins and lipids in the secretory pathway, consists of a series of stacked cisternae (flattened membrane sacs). Interactions between the Golgi and microtubules are thought to be important for the reorganization of the Golgi after it fragments during mitosis. The golgins are a family of proteins, of which the protein encoded by this gene is a member, that are localized to the Golgi. This encoded protein is associated with Sjogren@#$%&s syndrome.
7508	The Golgi apparatus, which participates in glycosylation and transport of proteins and lipids in the secretory pathway, consists of a series of stacked cisternae (flattened membrane sacs). Interactions between the Golgi and microtubules are thought to be important for the reorganization of the Golgi after it fragments during mitosis. The golgins are a family of proteins, of which the protein encoded by this gene is a member, that are localized to the Golgi. This encoded protein is associated with Sjogren@#$%&s syndrome.
7509	
7510	
7511	The protein encoded by this gene localizes to the nucleus and it interacts with the RING finger domain of the breast cancer 1, early onset protein (BRCA1). This gene is thought to be a tumor suppressor gene that functions in the BRCA1 growth control pathway. There are multiple polyadenylation sites found in this gene.
7512	The protein encoded by this gene encodes a melanocyte differentiation antigen that is expressed in a high percentage of melanoma cell lines. A similar sequence gene in medaka, @#$%&B,@#$%& encodes a transporter that mediates melanin synthesis. Mutations in this gene are a cause of oculocutaneous albinism type 4. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
7513	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. The encoded protein also contains a regulator of G protein signaling (RGS) domain. Regulator of G protein signaling family members are regulatory molecules that act as GTPase activating proteins for G alpha subunits of heterotrimeric G proteins. Two transcript variants encoding distinct isoforms have been identified for this gene.
7514	The protein encoded by this gene is a member of the epidermal growth factor family. It is an autocrine growth factor as well as a mitogen for astrocytes, Schwann cells, and fibroblasts. It is related to epidermal growth factor (EGF) and transforming growth factor alpha (TGF-alpha). This protein interacts with the EGF/TGF-alpha receptor to promote the growth of normal epithelial cells and inhibits the growth of certain aggressive carcinoma cell lines. This encoded protein is associated with a psoriasis-like skin phenotype.
7515	Nucleotides are involved in numerous biochemical reactions and pathways within the cell as substrates, cofactors, and effectors. Nudix hydrolases, such as NUDT12, regulate the concentrations of individual nucleotides and of nucleotide ratios in response to changing circumstances (Abdelraheim et al., 2003).[supplied by OMIM]
7516	
7517	
7518	
7519	
7520	The protein encoded by this gene is a transcriptional coactivator that can interact with nuclear hormone receptors to enhance their transcriptional activator functions. The encoded protein has been shown to be involved in the hormone-dependent coactivation of several receptors, including prostanoid, retinoid, vitamin D3, thyroid hormone, and steroid receptors. The encoded protein may also act as a general coactivator since it has been shown to interact with some basal transcription factors, histone acetyltransferases, and methyltransferases.
7521	DLM1 encodes a Z-DNA binding protein. Z-DNA formation is a dynamic process, largely controlled by the amount of supercoiling.[supplied by OMIM]
7522	This gene encodes a member of the transcription elongation factor A (SII)-like (TCEAL) gene family. Members of this family contain TFA domains and may function as nuclear phosphoproteins that modulate transcription in a promoter context-dependent manner. Multiple family members are located on the X chromosome. Alternative splicing occurs for this gene; however, the full-length nature of all transcript variants has not yet been described.
7523	SSR4, also called TRAPD, is assumed to be involved in protein secretion. It is located in the Xq28 region, arranged in a compact head-to-head manner with the IDH3G gene. These two genes are driven by a bidirectional promoter located between them, and encode proteins involved in unrelated biochemical pathways located in different compartments of the cell. The nontranscribed intergenic region represents only 133 bp and is embedded in a CpG island. The CpG island functions as a bidirectional promoter to initiate the transcription of both functionally unrelated genes with distinct expression patterns. SSR4 consists of six exons and is approximately 70 kb telomeric to the ALD gene. Although alternative splicing of exon 5 has not been detected in human SSR4, transcript variants missing the region homologous to human exon 5 have been detected in both Xenopus laevis and Mus musculus.
7524	Chloride channels are a diverse group of proteins that regulate fundamental cellular processes including stabilization of cell membrane potential, transepithelial transport, maintenance of intracellular pH, and regulation of cell volume. Chloride intracellular channel 4 (CLIC4) protein, encoded by the CLIC4 gene, is a member of the p64 family; the gene is expressed in many tissues and exhibits a intracellular vesicular pattern in Panc-1 cells (pancreatic cancer cells).
7525	
7526	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. Four sequence-related potassium channel genes - shaker, shaw, shab, and shal - have been identified in Drosophila, and each has been shown to have human homolog(s). This gene encodes a member of the potassium channel, voltage-gated, shal-related subfamily, members of which form voltage-activated A-type potassium ion channels and are prominent in the repolarization phase of the action potential. This member mediates a rapidly inactivating, A-type outward potassium current which is not under the control of the N terminus as it is in Shaker channels.
7527	This gene product is a member of a large family of CDC2-related serine/threonine protein kinases. It accumulates primarily in the cytoplasm, with a lower level in the nucleus.
7528	This gene encodes a member of the WD-protein subfamily. The encoded protein contains five WD-repeats spanning most of the protein and an SOCS box in the C-terminus. The SOCS box may act as a bridge between specific substrate-binding domains and E3 ubiquitin protein ligases.
7529	The CTRP5 protein is a member of the C1q (see MIM 120550)/tumor necrosis factor (MIM 191160) superfamily, which shows diverse functions including cell adhesion and basement membrane components (Shapiro and Scherer, 1998). C1QTNF5 is mutant in late-onset retinal degeneration (LORD; MIM 605670).[supplied by OMIM]
7530	
7531	
7532	
7533	
7534	
7535	This gene, a member of the histidine triad gene family, encodes a diadenosine 5@#$%&,5@#$%&@#$%&@#$%&-P1,P3-triphosphate hydrolase involved in purine metabolism. The gene encompasses the common fragile site FRA3B on chromosome 3, where carcinogen-induced damage can lead to translocations and aberrant transcripts of this gene. In fact, aberrant transcripts from this gene have been found in about half of all esophageal, stomach, and colon carcinomas.
7536	This gene shares high similarity with the Drosophila Scm (sex comb on midleg) gene. It encodes a protein which contains four malignant brain tumor repeat (mbt) domains and may be involved in antigen recognition. Several alternative splice variants have been characterized.
7537	
7538	Deoxycytidine kinase (DCK) is required for the phosphorylation of several deoxyribonucleosides and their nucleoside analogs.  Deficiency of DCK is associated with resistance to antiviral and anticancer chemotherapeutic agents.  Conversely, increased deoxycytidine kinase activity is associated with increased activation of these compounds to cytotoxic nucleoside triphosphate derivatives.  DCK is clinically important because of its relationship to drug resistance and sensitivity.
7539	Carboxypeptidase E cleaves C-terminal amino acid residues and is involved in neuropeptide processing. It is a peripheral membrane protein.  CPE specifically binds regulated secretory pathway proteins, including prohormones, but not constitutively secreted proteins.
7540	Sm-like proteins were identified in a variety of organisms based on sequence homology with the Sm protein family (see SNRPD2; MIM 601061). Sm-like proteins contain the Sm sequence motif, which consists of 2 regions separated by a linker of variable length that folds as a loop. The Sm-like proteins are thought to form a stable heteromer present in tri-snRNP particles, which are important for pre-mRNA splicing.[supplied by OMIM]
7541	The FRAS1 gene encodes a putative extracellular matrix (ECM) protein and is mutated in Fraser syndrome (MIM 219000) (McGregor et al., 2003).[supplied by OMIM]
7542	This gene is a member of the MAD gene family . The MAD genes encode basic helix-loop-helix-leucine zipper proteins that heterodimerize with MAX protein, forming a transcriptional repression complex. The MAD proteins compete for MAX binding with MYC, which heterodimerizes with MAX forming a transcriptional activation complex. Studies in rodents suggest that the MAD genes are tumor suppressors and contribute to the regulation of cell growth in differentiating tissues.
7543	
7544	
7545	
7546	This gene encodes a transmembrane glycoprotein which mediates interaction between newly assembled major histocompatibility complex (MHC) class I molecules and the transporter associated with antigen processing (TAP), which is required for the transport of antigenic peptides across the endoplasmic reticulum membrane. This interaction is essential for optimal peptide loading on the MHC class I molecule. Up to four complexes of MHC class I and this protein may be bound to a single TAP molecule. This protein contains a C-terminal double-lysine motif (KKKAE) known to maintain membrane proteins in the endoplasmic reticulum. This gene lies within the major histocompatibility complex on chromosome 6. Alternative splicing results in three transcript variants encoding different isoforms.
7547	
7548	TXNDC3 encodes a group II thioredoxin protein, composed of a thioredoxin domain and 3 NDP kinase domains, with testis-specific expression.[supplied by OMIM]
7549	
7550	
7551	Galactosylceramidase (GALC) is a lysosomal enzyme that hydrolyzes several galactolipids. GALC deficiency is associated with Krabbe disease (globoid cell leukodystrophy). The gene is about 60 kb in length and consists of 17 exons. This gene contains ten GC-box-like sequences within the promoter region but no typical TATA box. This feature is also characteristic of other lysosomal protein encoding genes.
7552	This gene encodes a member of the Polycomb-group (PcG) family. PcG family members form multimeric protein complexes, which are involved in maintaining the transcriptional repressive state of genes over successive cell generations. This protein associates with the embryonic ectoderm development protein, the VAV1 oncoprotein, and the X-linked nuclear protein. This protein may play a role in the hematopoietic and central nervous systems. Two transcript variants encoding distinct isoforms have been identified for this gene.
7553	This gene encodes a zinc finger protein of the Kruppel family. The protein contains a SCAN box and a KRAB A domain. A similar protein in mouse is differentially expressed in spermatogenesis. Alternative splicing of this gene results in two transcript variants differing only in the 5@#$%& UTR. Additional variants have been described, but their full length sequences have not been determined.
7554	The protein encoded by this gene is likely a phosphoserine aminotransferase, based on similarity to proteins in mouse, rabbit, and Drosophila. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
7555	This gene encodes a member of the plasticity-related gene (PRG) family. Members of the PRG family mediate lipid phosphate phosphatase activity in neurons and are known to be involved in neuronal plasticity. The protein encoded by this gene does not perform its function through enzymatic phospholipid degradation. This gene is strongly expressed in brain. It shows dynamic expression regulation during brain development and neuronal excitation. Alternatively spliced transcript variants encoding the same protein have been observed.
7556	
7557	
7558	
7559	
7560	This gene encodes an unknown protein. Its cytogenetic localization and function have not been determined.
7561	NB1, a glycosyl-phosphatidylinositol (GPI)-linked N-glycosylated cell surface glycoprotein, was first described in a case of neonatal alloimmune neutropenia (Lalezari et al., 1971 [PubMed 5552408]).[supplied by OMIM]
7562	NB1, a glycosyl-phosphatidylinositol (GPI)-linked N-glycosylated cell surface glycoprotein, was first described in a case of neonatal alloimmune neutropenia (Lalezari et al., 1971 [PubMed 5552408]).[supplied by OMIM]
7563	The protein encoded by this gene belongs to the member of the SEC22 family of vesicle trafficking proteins. This protein has similarity to rat SEC22 and may act in the early stages of the secretory pathway. There is evidence for the use of multiple poly A sites in this gene.
7564	
7565	
7566	
7567	This gene is a member of the DAZ gene family and is a candidate for the human Y-chromosomal azoospermia factor (AZF). Its expression is restricted to premeiotic germ cells, particularly in spermatogonia. It encodes an RNA-binding protein that is important for spermatogenesis. Four copies of this gene are found on chromosome Y within palindromic duplications; one pair of genes is part of the P2 palindrome and the second pair is part of the P1 palindrome. Each gene contains a 2.4 kb repeat including a 72-bp exon, called the DAZ repeat; the number of DAZ repeats is variable and there are several variations in the sequence of the DAZ repeat. Each copy of the gene also contains a 10.8 kb region that may be amplified; this region includes five exons that encode an RNA recognition motif (RRM) domain. This gene contains two copies of the 10.8 kb repeat. Alternative splicing results in multiple transcript variants encoding different isoforms.
7568	This gene encodes a cysteine protease that has a strict specificity for hydrolysis of asparaginyl bonds. This enzyme may be involved in the processing of bacterial peptides and endogenous proteins for MHC class II presentation in the lysosomal/endosomal systems. Enzyme activation is triggered by acidic pH and appears to be autocatalytic. Protein expression occurs after monocytes differentiate into dendritic cells. A fully mature, active enzyme is produced following lipopolysaccharide expression in mature dendritic cells. Overexpression of this gene may be associated with the majority of solid tumor types. This gene has a pseudogene on chromosome 13. Several alternatively spliced transcript variants have been described, but the biological validity of only two has been determined. These two variants encode the same isoform.
7569	
7570	
7571	
7572	This gene was identified by its low or completely missing expression in esophageal squamous cell carcinomas. Normal expression of the gene occurs in the esophagus, stomach, small intestine, colon and placenta. Alternatively spliced transcript variants that encode the same protein have been identified.
7573	
7574	
7575	
7576	
7577	
7578	
7579	
7580	
7581	The protein encoded by this gene coats lipid storage droplets in adipocytes, thereby protecting them until they can be broken down by hormone-sensitive lipase. The encoded protein is the major cAMP-dependent protein kinase substrate in adipocytes and, when unphosphorylated, may play a role in the inhibition of lipolysis.
7582	This gene encodes a member of the C-type lectin/C-type lectin-like domain (CTL/CTLD) superfamily. Members of this family share a common protein fold and have diverse functions, such as cell adhesion, cell-cell signalling, glycoprotein turnover, and roles in inflammation and immune response. The encoded type 2 transmembrane protein may play a role in dendritic cell function. Two transcript variants encoding distinct isoforms have been identified for this gene.
7583	
7584	
7585	Argininosuccinate lyase (EC 4.3.2.1) is a urea cycle enzyme that catalyzes the cleavage of argininosuccinate to fumarate and arginine, an essential step in the process of detoxification of ammonia via the urea cycle.[supplied by OMIM]
7586	This gene was identified by the interaction of its gene product with Grap2, a leukocyte-specific adaptor protein important for immune cell signaling. The protein encoded by this gene was shown to interact with cyclin D. Transfection of this gene in cells was reported to reduce the phosphorylation of Rb gene product by cyclin D-dependent protein kinase, and inhibit E2F1-mediated transcription activity. This protein was also found to interact with helix-loop-helix protein E12 and is thought to be a negative regulator of liver-specific gene expression. Two alternatively spliced variants, which encode distinct isoforms, have been reported.
7587	The CIA30 gene encodes a human homolog of a Neurospora crassa protein involved in the assembly of complex I.[supplied by OMIM]
7588	
7589	
7590	
7591	This gene is one of several Cys-Cys (CC) cytokine genes clustered on the q arm of chromosome 16. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene displays chemotactic activity for T lymphocytes, but not monocytes or granulocytes. The product of this gene binds to chemokine receptors CCR4 and CCR8. This chemokine plays important roles in T cell development in thymus as well as in trafficking and activation of mature T cells.
7592	
7593	
7594	
7595	
7596	
7597	
7598	
7599	This gene encodes a member of the copine family, which is composed of calcium-dependent membrane-binding proteins. The gene product contains two N-terminal C2 domains and one von Willebrand factor A domain. Sequence analysis identified two alternatively spliced transcript variants that encode different isoforms.
7600	
7601	
7602	This gene encodes one of three enzymes that are involved in metabolizing serine and glycine. L-serine dehydratase converts L-serine to pyruvate and ammonia and requires pyridoxal phosphate as a cofactor. The encoded protein can also metabolize threonine to NH4+ and 2-ketobutyrate. The encoded protein is found predominantly in the liver.
7603	The human alpha globin gene cluster located on chromosome 16 spans about 30 kb and includes the following five loci:  5@#$%&- zeta - pseudozeta - pseudoalpha-1 - alpha-2 - alpha-1 -3@#$%&. The alpha-2 (HBA2) and alpha-1 (HBA1) coding sequences are identical. These genes differ slightly over the 5@#$%& untranslated regions and the introns, but they differ significantly over the 3@#$%& untranslated regions. Two alpha chains plus two beta chains constitute HbA, which in normal adult life comprises about 97% of  the total hemoglobin; alpha chains combine with delta chains to constitute HbA-2, which with HbF (fetal hemoglobin) makes up the  remaining 3% of adult hemoglobin.  Alpha thalassemias result from deletions of each of the alpha genes as well as deletions of both HBA2 and HBA1 respectively; some nondeletion alpha thalassemias have also been reported.
7604	
7605	
7606	
7607	Aquaporin 8 (AQP8) is a water channel protein. Aquaporins are a family of small integral membrane proteins related to the major intrinsic protein (MIP or AQP0).  Aquaporin 8 mRNA is found in pancreas and colon but not other tissues.
7608	
7609	
7610	
7611	
7612	RAP1B and RAP1A (MIM 179520) belong to a superfamily of RAS (see MIM 190020)-like small GTP-binding proteins involved in cell signaling.[supplied by OMIM]
7613	This gene encodes an adaptor protein that is composed of two protein-protein interaction domains: a N-terminal PYRIN-PAAD-DAPIN domain (PYD) and a C-terminal caspase-recruitment domain (CARD). The PYD and CARD domains are members of the six-helix bundle death domain-fold superfamily that mediates assembly of large signaling complexes in the inflammatory and apoptotic signaling pathways via the activation of caspase. In normal cells, this protein is localized to the cytoplasm; however, in cells undergoing apoptosis, it forms ball-like aggregates near the nuclear periphery. Three transcript variants encoding different isoforms have been noted for this gene.
7614	
7615	
7616	
7617	
7618	
7619	
7620	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L14P family of ribosomal proteins. It is located in the cytoplasm. This gene has been referred to as rpL17 because the encoded protein shares amino acid identity with ribosomal protein L17 from Saccharomyces cerevisiae; however, its official symbol is RPL23. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
7621	
7622	
7623	The CMRF35 antigen, which was identified by reactivity with a monoclonal antibody, is present on monocytes, neutrophils, and some T and B lymphocytes (Jackson et al., 1992).[supplied by OMIM]
7624	Bidirectional transport of macromolecules between the cytoplasm and nucleus occurs through nuclear pore complexes (NPCs) embedded in the nuclear envelope. NPCs are composed of subcomplexes, and NUP85, or PCNT1, is part of one such subcomplex, Nup107-160 (Loiodice et al., 2004).[supplied by OMIM]
7625	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the non-ATPase subunits of the 19S regulator lid.
7626	
7627	The protein encoded by this gene is a scavenger receptor that is expressed in endothelial cells and mediates the binding and degradation of acetylated low density lipoprotein (Ac-LDL). Transcription factors ZNF444/EZF-2 and SP1 are found to be responsible for the regulation of this gene. Five alternatively spliced transcript variants encoding distinct isoforms have been described.
7628	
7629	
7630	
7631	The protein encoded by this gene recognizes the sequence CCTTGAG along with other members of the HMG-box class DNA-binding proteins. It acts during chondrocyte differentiation and, with steroidogenic factor 1, regulates transcription of the anti-Muellerian hormone (AMH) gene. Deficiencies lead to the skeletal malformation syndrome campomelic dysplasia, frequently with sex reversal.
7632	
7633	
7634	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix for both normal physiological processes, such as embryonic development, reproduction and tissue remodeling, and disease processes, such as asthma and metastasis. This gene encodes a secreted enzyme that degrades casein. Its expression pattern suggests that it plays a role in tissue homeostasis and in wound repair. Transcript variants encoding different isoforms have been described.
7635	
7636	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to the class II histone deacetylase/acuc/apha family. It possesses histone deacetylase activity and represses transcription when tethered to a promoter. It coimmunoprecipitates only with HDAC3 family member and might form multicomplex proteins. It also interacts with myocyte enhancer factor-2 (MEF2) proteins, resulting in repression of MEF2-dependent genes. This gene is thought to be associated with colon cancer. Two transcript variants encoding different isoforms have been found for this gene.
7637	The protein encoded by this gene has similarity to a yeast protein. It seems to be part of a peripheral membrane protein complex localized on cis/medial Golgi cisternae where it may participate in tethering intra-Golgi transport vesicles.
7638	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to the class II histone deacetylase/acuc/apha family. It possesses histone deacetylase activity and represses transcription when tethered to a promoter. It coimmunoprecipitates only with HDAC3 family member and might form multicomplex proteins. It also interacts with myocyte enhancer factor-2 (MEF2) proteins, resulting in repression of MEF2-dependent genes. This gene is thought to be associated with colon cancer. Two transcript variants encoding different isoforms have been found for this gene.
7639	
7640	
7641	Synaptic vesicles are responsible for regulating the storage and release of neurotransmitters in the nerve terminal. The protein encoded by this gene is an abundant integral membrane protein of cholinergic synaptic vesicles and is thought to be involved in vesicular transport. It belongs to the quinone oxidoreductase subfamily of zinc-containing alcohol dehydrogenase proteins.
7642	This gene is one of several Cys-Cys (CC) cytokine genes clustered on the q-arm of chromosome 17. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene displays chemotactic activity for monocytes, lymphocytes, basophils and eosinophils, but not neutrophils. This chemokine plays a role in accumulation of leukocytes during inflammation.  It may also be involved in the recruitment of monocytes into the arterial wall during artherosclerosis.
7643	Chondroadherin is a cartilage matrix protein thought to mediate adhesion of isolated chondrocytes.  The protein contains 11 leucine-rich repeats flanked by cysteine-rich regions. The chondroadherin messenger RNA is present in chondrocytes at all ages.
7644	Phototransduction in rod and cone photoreceptors is regulated by groups of signaling proteins. The encoded protein is thought to play a crucial role in cone phototransduction. It belongs to the G protein gamma family and localized specifically in cones. There is evidence for use of multiple polyadenylation sites by this gene.
7645	
7646	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXB genes located in a cluster on chromosome 17. The exact role of this gene has yet to be determined.
7647	
7648	
7649	
7650	
7651	The protein encoded by this gene belongs to the family of G-protein coupled receptors, that are preferentially activated by adenosine and uridine nucleotides. This gene aligns with an internal intron of the retinoblastoma susceptibility gene in the reverse orientation.
7652	Gamma-aminobutyric acid A receptors [GABA(A) receptors] are ligand-gated chloride channels that mediate inhibitory neurotransmission. This gene encodes GABA(A) receptor-associated protein, which is highly positively charged in its N-terminus and shares sequence similarity with light chain-3 of microtubule-associated proteins 1A and 1B. This protein clusters neurotransmitter receptors by mediating interaction with the cytoskeleton.
7653	The protein encoded by this gene is a ubiquitous actin monomer-binding protein belonging to the profilin family. It is thought to regulate actin polymerization in response to extracellular signals. Deletion of this gene is associated with Miller-Dieker syndrome.
7654	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein and is highly similar in sequence and structure to transmembrane 4 superfamily member 1. It may play a role in cell proliferation, and overexpression of this protein may be associated with the uncontrolled growth of tumour cells.
7655	
7656	KRT17 encodes the type I intermediate filament chain keratin 17, expressed in nail bed, hair follicle, sebaceous glands, and other epidermal appendages. Mutations in this gene lead to Jackson-Lawler type pachyonychia congenita and steatocystoma multiplex.
7657	
7658	
7659	
7660	
7661	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
7662	
7663	This gene encodes ubiquitin, one of the most conserved proteins known. Ubiquitin is required for ATP-dependent, nonlysosomal intracellular protein degradation of abnormal proteins and normal proteins with a rapid turnover. Ubiquitin is covalently bound to proteins to be degraded, and presumably labels these proteins for degradation. Ubiquitin also binds to histone H2A in actively transcribed regions but does not cause histone H2A degradation, suggesting that ubiquitin is also involved in regulation of gene expression. This gene consists of three direct repeats of the ubiquitin coding sequence with no spacer sequence. Consequently, the protein is expressed as a polyubiquitin precursor with a final amino acid after the last repeat. Aberrant form of this protein has been noticed in patients with Alzheimer@#$%&s and Down syndrome.
7664	This gene encodes an unconventional myosin. This protein differs from other myosins in that it has a long N-terminal extension preceding the conserved motor domain. Studies in mice suggest that this protein is necessary for actin organization in the hair cells of the cochlea. Mutations in this gene have been associated with profound, congenital, neurosensory, nonsyndromal deafness. This gene is located within the Smith-Magenis syndrome region on chromosome 17. Read-through transcripts containing an upstream gene and this gene have been identified, but they are not thought to encode a fusion protein. Several alternatively spliced transcript variants have been described, but their full length sequences have not been determined.
7665	
7666	
7667	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
7668	
7669	
7670	
7671	
7672	This gene encodes a member of the C-lectin family, proteins that possess collagen-like sequences and carbohydrate recognition domains. This protein is a scavenger receptor, a cell surface glycoprotein that can bind to carbohydrate antigens on microorganisms facilitating their recognition and removal. In addition, these receptors can recognize oxidized phospholipids so they may also participate in removing oxidatively damaged or apoptotic cells. Two transcripts have been identified for this gene.
7673	
7674	
7675	TCF4 encodes transcription factor 4, a basic helix-turn-helix transcription factor. The protein recognizes an Ephrussi-box (@#$%&E-box@#$%&) binding site (@#$%&CANNTG@#$%&) - a motif first identified in immunoglobulin enhancers. TCF4 is expressed predominantly in pre-B-cells, although it is found in other tissues as well. TCF4 is known to produce multiple transcripts; however as the complete structure is only known for the transcript that encodes the b isoform, that is the variant presented here.
7676	
7677	In mammalian cells, urea is the chief end-product of nitrogen catabolism and plays an important role in the urinary concentration mechanism. Thus, the plasma membrane of erythrocytes and some renal epithelial cells exhibit an elevated urea permeability that is mediated by highly selective urea transporters. In mammals, 2 urea transporters have been identified: the renal tubular urea transporter, UT2, and the erythrocyte urea transporter, UT11 (SLC14A1; MIM 111000).[supplied by OMIM]
7678	Follicular lymphoma variant translocation 1 is a secreted protein which is weakly expressed in hematopoietic tissue.  FVT1 shows a high rate of transcription in some T-cell malignancies and in phytohemagglutinin-stimulated lymphocytes.  The proximity of FVT1 to BCL2 suggests that it may participate in the tumoral process
7679	This gene encodes a colon cancer antigen which was defined by serological analysis of recombinant cDNA expression libraries. The function of this gene is unknown.
7680	
7681	
7682	
7683	The connexin gene family codes for the protein subunits of gap junction channels that mediate direct diffusion of ions and metabolites between the cytoplasm of adjacent cells. Connexins span the plasma membrane 4 times, with amino- and carboxy-terminal regions facing the cytoplasm. Connexin genes are expressed in a cell type-specific manner with overlapping specificity. The gap junction channels have unique properties depending on the type of connexins constituting the channel.[supplied by OMIM]
7684	
7685	
7686	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of Na+/K+ -ATPases. Na+/K+ -ATPase is an integral membrane protein responsible for establishing and maintaining the electrochemical gradients of Na and K ions across the plasma membrane. These gradients are essential for osmoregulation, for sodium-coupled transport of a variety of organic and inorganic molecules, and for electrical excitability of nerve and muscle. This enzyme is composed of two subunits, a large catalytic subunit (alpha) and a smaller glycoprotein subunit (beta). The catalytic subunit of Na+/K+ -ATPase is encoded by multiple genes. This gene encodes an alpha 3 subunit.
7687	
7688	The protein encoded by this gene is an evolutionarily conserved protein similar to yeast SSF1 as well as to the gene product of the Drosophila gene peter pan (ppan). SSF1 is known to be involved in the second step of mRNA splicing. Both SSF1 and ppan are essential for cell growth and proliferation. This gene was found to cotranscript with P2RY11/P2Y(11), an immediate downstream gene on the chromosome that encodes a ATP receptor. The chimeric transcripts of this gene and P2RY11 were found to be ubiquitously present and regulated during granulocytic differentiation. Exogenous expression of this gene was reported to reduce the anchorage-independent growth of some tumor cells.
7689	
7690	
7691	
7692	This gene encodes a protein that has a domain resembling seven transmembrane G protein-coupled hormone receptors (7TM receptors) at its C-terminus. The N-terminus of the encoded protein has six EGF-like modules, separated from the transmembrane segments by a serine/threonine-rich domain, a feature reminiscent of mucin-like, single-span, integral membrane glycoproteins with adhesive properties.
7693	
7694	
7695	
7696	In mice, CD4+ helper T-cells differentiate into type 1 (Th1) cells, which are critical for cell-mediated immunity, predominantly under the influence of IL12. Also, IL4 influences their differentiation into type 2 (Th2) cells, which are critical for most antibody responses. Mice deficient in these cytokines, their receptors, or associated transcription factors have impaired, but are not absent of, Th1 or Th2 immune responses. This gene encodes a protein which is similar to the mouse T-cell cytokine receptor Tccr at the amino acid level, and is predicted to be a glycosylated transmembrane protein.
7697	This gene encodes an enzyme that hydrolyzes terminal, non-reducing alpha-D-mannose residues in alpha-D-mannosides. Its activity is necessary for the catabolism of N-linked carbohydrates released during glycoprotein turnover and it is member of family 38 of glycosyl hydrolases. The full length protein is processed in two steps. First, a 49 aa leader sequence is cleaved off and the remainer of the protein is processed into 3 peptides of 70 kDa, 42 kDa (D) and 13/15 kDa (E). Next, the 70 kDa peptide is further processed into three peptides (A, B and C). The A, B and C peptides are disulfide-linked. Defects in this gene have been shown to be the cause of lysosomal alpha-mannosidosis (AM), a lysosomal storage disease characterized by the accumulation of unbranched oligosaccharide chains.
7698	ASNA1 is the human homolog of the bacterial arsA gene. In E. coli, ArsA ATPase is the catalytic component of a multisubunit oxyanion pump that is responsible for resistance to arsenicals and antimonials.[supplied by OMIM]
7699	
7700	
7701	
7702	
7703	SPI1 (MIM 165170) and SPIB are members of a subfamily of ETS (see ETS1; MIM 164720) transcription factors. ETS proteins share a conserved ETS domain that mediates specific DNA binding. SPIB and SPI1 bind to a purine-rich sequence, the PU box (5-prime-GAGGAA-3-prime).[supplied by OMIM]
7704	IRF3 encodes interferon regulatory factor 3, a member of the interferon regulatory transcription factor (IRF) family. IRF3 is found in an inactive cytoplasmic form that upon serine/threonine phosphorylation forms a complex with CREBBP. This complex translocates to the nucleus and activates the transcription of interferons alpha and beta, as well as other interferon-induced genes.
7705	
7706	
7707	Caspase recruitment domain (CARD)-containing proteins, such as CARD8, are involved in pathways leading to activation of caspases or nuclear factor kappa-B (NFKB; see MIM 164011) in the context of apoptosis or inflammation, respectively (Bouchier-Hayes et al., 2001).[supplied by OMIM]
7708	This gene encodes one of the selenoproteins containing selenocysteine at the active site. The selenocysteine is encoded by the nonsense (stop) codon TGA. This protein shows highest expression in skeletal muscle and hart, and may be involved in oxidation-reduction reactions. A retroprocessed pseudogene, SEPW1P, has been identified and mapped to chromosome 1p35-34.
7709	The protein encoded by this gene is a member of the basic helix-loop-helix (bHLH)-PAS family of transcription factors. Studies of a related mouse gene suggest that it functions in neurons. The exact function of this gene is unclear, but it may play protective or modulatory roles during late embryogenesis and postnatal development.
7710	Members of the F-box protein family, such as FBXO44, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
7711	
7712	
7713	
7714	
7715	
7716	
7717	
7718	
7719	
7720	This gene encodes subunit 2 of the splicing factor 3a protein complex. The splicing factor 3a heterotrimer includes subunits 1, 2 and 3 and is necessary for the in vitro conversion of 15S U2 snRNP into an active 17S particle that performs pre-mRNA splicing. Subunit 2 interacts with subunit 1 through its amino-terminus while the single zinc finger domain of subunit 2 plays a role in its binding to the 15S U2 snRNP. Subunit 2 may also function independently of its RNA splicing function as a microtubule-binding protein.
7721	This gene was originally cloned from human myeloblasts and found to be selectively expressed in inflammed colonic epithelium. The protein encoded is a member of the olfactomedin-related protein family. The exact function of this gene has not yet been determined.
7722	This gene encodes a protein which, with 5-lipoxygenase, is required for leukotriene synthesis. Leukotrienes are arachidonic acid metabolites which have been implicated in various types of inflammatory responses, including asthma, arthritis and psoriasis. This protein localizes to the plasma membrane. Inhibitors of its function impede translocation of 5-lipoxygenase from the cytoplasm to the cell membrane and inhibit 5-lipoxygenase activation.
7723	
7724	
7725	
7726	This gene encodes a transmembrane signaling polypeptide which contains an immunoreceptor tyrosine-based activation motif (ITAM) in its cytoplasmic domain. The encoded protein may associate with the killer-cell inhibitory receptor (KIR) family of membrane glycoproteins and may act as an activating signal transduction element. This protein may bind zeta-chain (TCR) associated protein kinase 70kDa (ZAP-70) and spleen tyrosine kinase (SYK) and play a role in signal transduction, bone modeling, brain myelination, and inflammation. Mutations within this gene have been associated with polycystic lipomembranous osteodysplasia with sclerosing leukoencephalopathy (PLOSL), also known as Nasu-Hakola disease. Its putative receptor, triggering receptor expressed on myeloid cells 2 (TREM2), also causes PLOSL. Two alternative transcript variants encoding distinct isoforms have been identified for this gene. Other alternative splice variants have been described, but their full-length nature has not been deterimined.
7727	
7728	
7729	
7730	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
7731	
7732	
7733	
7734	Glutathione peroxidase catalyzes the reduction of hydrogen peroxide, organic hydroperoxide, and lipid peroxides by reduced glutathione and functions in the protection of cells against oxidative damage. Human plasma glutathione peroxidase has been shown to be a selenium-containing enzyme.
7735	
7736	
7737	This gene encodes a DNA-binding protein which specifically recognizes conserved target sequences at the breakpoint junction of chromosomal translocations. Translin polypeptides form a multimeric structure that is responsible for its DNA-binding activity. Recombination-associated motifs and translin-binding sites are present at recombination hotspots and may serve as indicators of breakpoints in genes which are fused by translocations. These binding activities may play a crucial role in chromosomal translocation in lymphoid neoplasms.
7738	
7739	The protein encoded by this gene is a G protein-coupled receptor and belongs to the glucagon-VIP-secretin receptor family. It binds secretin which is the most potent regulator of pancreatic bicarbonate, electrolyte and volume secretion. Secretin and its receptor are suggested to be involved in pancreatic cancer and autism.
7740	
7741	
7742	
7743	Steroid 5-alpha reductase 2 is a microsomal protein expressed at high levels in androgen-sensitive tissues such as the prostate.  This isoform is active at acidic pH and is sensitive to the 4-azasteroid inhibitor finasteride. Deficiencies can result in male pseudohermaphroditism, specifically pseudovaginal perineoscrotal hypospadias (PPSH).
7744	This gene encodes a member of the cyclophilin family. Cyclophilins catalyze the cis-trans isomerization of peptidylprolyl imide bonds in oligopeptides. They have been proposed to act either as catalysts or as molecular chaperones in protein-folding events. Transcript variants derived from alternative splicing and/or alternative polyadenylation exist; some of these variants encode different isoforms.
7745	The vertebrate sodium channel is a voltage-gated ion channel essential for the generation and propagation of action potentials, chiefly in nerve and muscle. Voltage-sensitive sodium channels are heteromeric complexes consisting of a large central pore-forming glycosylated alpha subunit and 2 smaller auxiliary beta subunits. Functional studies have indicated that the transmembrane alpha subunit of the brain sodium channels is sufficient for expression of functional sodium channels (Goldin et al., 1986 [PubMed 2429308]; Isom, 2002). Brain sodium channel alpha subunits form a gene subfamily with several structurally distinct isoforms clustering on chromosome 2, types I, II (SCN2A1, MIM 182390; SCN2A2, MIM 601219); and III (SCN3A, MIM 182391). There are also several distinct sodium channel alpha subunit isoforms in skeletal and cardiac muscle (SCN4A, MIM 603967; SCN5A, MIM 600163; SCN7A, MIM 182392).[supplied by OMIM]
7746	
7747	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members, but contains a B41 domain. This protein interacts with the cytoplasmic domain of P-selectin, and may function in the intracellular trafficking of P-selectin.
7748	The protein encoded by this gene is a regulatory subunit of the AMP-activated protein kinase (AMPK). AMPK is a heterotrimer consisting of an alpha catalytic subunit, and non-catalytic beta and gamma subunits. AMPK is an important energy-sensing enzyme that monitors cellular energy status. In response to cellular metabolic stresses, AMPK is activated, and thus phosphorylates and inactivates acetyl-CoA carboxylase (ACC) and beta-hydroxy beta-methylglutaryl-CoA reductase (HMGCR), key enzymes involved in regulating de novo biosynthesis of fatty acid and cholesterol. This subunit is one of the gamma regulatory subunits of AMPK. It is dominantly expressed in skeletal muscle. Studies of the pig counterpart suggest that this subunit may play a key role in the regulation of energy metabolism in skeletal muscle.
7749	
7750	This gene encodes a 42.3 kD subunit of dynactin, a macromolecular complex consisting of 10 subunits ranging in size from 22 to 150 kD. Dynactin binds to both microtubules and cytoplasmic dynein. It is involved in a diverse array of cellular functions, including ER-to-Golgi transport, the centripetal movement of lysosomes and endosomes, spindle formation, chromosome movement, nuclear positioning, and axonogenesis. This subunit, like ACTR1A, is an actin-related protein. These two proteins are of equal length and share 90% amino acid identity. They are present in a constant ratio of approximately 1:15 in the dynactin complex.
7751	
7752	Endothelin receptor type B is a G protein-coupled receptor which activates a phosphatidylinoitol-calcium second messenger system. Its ligand, endothelin, consists of a family of three potent vasoactive peptides: ET1, ET2, and ET3. Studies suggest that the multigenic disorder, Hirschsprung disease type 2, is due to mutation in endothelin receptor type B gene.  A splice variant, named SVR, has been described; the sequence of the ETB-SVR receptor is identical to ETRB except for the intracellular C-terminal domain.  While both splice variants bind ET1, they exhibit different responses upon binding which suggests that they may be functionally distinct.
7753	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role. The protein encoded by this gene is one of the PKC family members. This kinase can be activated rapidly by the agonists of G protein-coupled receptors. It resides in both cytoplasm and nucleus, and its nuclear accumulation is found to be dramatically enhanced in response to its activation. This kinase can also be activated after B-cell antigen receptor (BCR) engagement, which requires intact phopholipase C gamma and the involvement of other PKC family members.
7754	The protein encoded by this gene is an accessory subunit of the heterodimeric receptor for IL18. This protein enhances the IL18 binding activity of IL18R1 (IL1RRP), a ligand binding subunit of IL18 receptor. The coexpression of IL18R1 and this protein is required for the activation of NF-kappaB and MAPK8 (JNK) in response to IL18.
7755	This gene encodes a nuclear envelope protein that appears to be involved in spermatogenesis, either directly or by influencing genes that play a more direct role in the process. This multi-exon locus is the homolog of the mouse and drosophila germ cell-less gene but the human genome also contains a single-exon locus on chromosome 5 that contains an open reading frame capable of encoding a highly-related protein.
7756	
7757	
7758	
7759	Synaptobrevins/VAMPs, syntaxins, and the 25-kD synaptosomal-associated protein SNAP25 are the main components of a protein complex involved in the docking and/or fusion of synaptic vesicles with the presynaptic membrane. The protein encoded by this gene is a member of the vesicle-associated membrane protein (VAMP)/synaptobrevin family. It is associated with the perinuclear vesicular structures of the early endocytic compartment. It has been found that VAMP8 interacts specifically with the soluble NSF-attachment protein (alpha-SNAP), most likely through an VAMP8-containing SNARE complex.
7760	
7761	Branching structures in complex N-glycans are synthesized on a commom core structure of Man3GlcNAc2Asn in the Golgi apparatus by the N-acetylglucosaminyltransferases. The mannosyl (alpha-1,3-)-glycoprotein beta-1,4-N-acetylglucosaminyltransferases, which include isoenzyme A (MGAT4A) and isoenzyme B (MGAT4B), are key N-acetylglucosaminyltransferases regulating formation of tri- and other multiantennary structures. MGAT4A and MGAT4B share 62% amino acid sequence identity. The human MGAT4A and bovine GnT-IV share 96% amino acid sequence identity. The expression levels of the MGAT4A mRNA are significantly different in various tissues and cell lines.
7762	
7763	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S5P family. Pseudogenes corresponding to this gene are found on chromosomes 4q, 5q, and 18q.
7764	Cytokine-mediated signal transduction in the JAK-STAT cascade requires the involvement of adaptor molecules. One such signal-transducing adaptor molecule contains an SH3 domain that is required for induction of MYC and cell growth. The protein encoded by this gene binds to the SH3 domain of the signal-transducing adaptor molecule, and plays a critical role in cytokine-mediated signaling for MYC induction and cell cycle progression. Multiple alternatively spliced transcript variants encoding the same protein isoform have been found for this gene.
7765	
7766	
7767	
7768	Many vertebrate homeo box-containing genes have been identified on the basis of their sequence similarity with Drosophila developmental genes. Members of the Dlx gene family contain a homeobox that is related to that of Distal-less (Dll), a gene expressed in the head and limbs of the developing fruit fly. The Distal-less (Dlx) family of genes comprises at least 6 different members, DLX1-DLX6. The DLX proteins are postulated to play a role in forebrain and craniofacial development
7769	Many vertebrate homeo box-containing genes have been identified on the basis of their sequence similarity with Drosophila developmental genes. Members of the Dlx gene family contain a homeobox that is related to that of Distal-less (Dll), a gene expressed in the head and limbs of the developing fruit fly. The Distal-less (Dlx) family of genes comprises at least 6 different members, DLX1-DLX6. The DLX proteins are postulated to play a role in forebrain and craniofacial development
7770	
7771	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein is highly similar to the mouse WD repeat domain 12 protein at the amino acid level. The function of this protein is not known.
7772	
7773	This gene is thought to be involved in apoptosis, and may also be involved in hematopoietic development and differentiation. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
7774	
7775	
7776	3-Hydroxyanthranilate 3,4-dioxygenase is a monomeric cytosolic protein belonging to the family of intramolecular dioxygenases containing nonheme ferrous iron. It is widely distributed in peripheral organs, such as liver and kidney, and is also present in low amounts in the central nervous system. HAAO catalyzes the synthesis of quinolinic acid (QUIN) from 3-hydroxyanthranilic acid. QUIN is an excitotoxin whose toxicity is mediated by its ability to activate glutamate N-methyl-D-aspartate receptors. Increased cerebral levels of QUIN may participate in the pathogenesis of neurologic and inflammatory disorders. HAAO has been suggested to play a role in disorders associated with altered tissue levels of QUIN.
7777	
7778	
7779	
7780	The protein encoded by this gene is similar in sequence and possibly function to the product of the retinoblastoma 1 (RB1) gene. The RB1 gene product is a tumor suppressor protein that appears to be involved in cell cycle regulation, as it is phosphorylated in the S to M phase transition and is dephosphorylated in the G1 phase of the cell cycle. Both the RB1 protein and the product of this gene can form a complex with adenovirus E1A protein and SV40 large T-antigen, with the SV40 large T-antigen binding only to the unphosphorylated form of each protein. In addition, both proteins can inhibit the transcription of cell cycle genes containing E2F binding sites in their promoters. Due to the sequence and biochemical similarities with the RB1 protein, it is thought that the protein encoded by this gene may also be a tumor suppressor. Two transcript variants encoding different isoforms have been found for this gene.
7781	
7782	The protein encoded by this gene is a nuclear transcription factor which binds DNA as a homodimer. The encoded protein controls the expression of several genes, including hepatocyte nuclear factor 1 alpha, a transcription factor which regulates the expression of several hepatic genes. This gene may play a role in development of the liver, kidney, and intestines. Mutations in this gene have been associated with monogenic autosomal dominant non-insulin-dependent diabetes mellitus type I. At least three different transcript variants encoding three different isoforms have been found for this gene.
7783	
7784	Molybdenum cofactor (MoCo) is necessary for the function of all molybdoenzymes. One of the enzymes required for the biosynthesis of MoCo is molybdopterin synthase (MPT synthase). The protein encoded by this gene adenylates and activates MPT synthase. This gene contains no introns. A pseudogene of this gene is present on chromosome 14.
7785	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This mitochondrial protein initiates the degradation of 1,25-dihydroxyvitamin D3, the physiologically active form of vitamin D3, by hydroxylation of the side chain. In regulating the level of vitamin D3, this enzyme plays a role in calcium homeostasis and the vitamin D endocrine system.
7786	The protein encoded by this gene is associated with the nuclear scaffold in proliferating cells. It was found to bind to the catalytic subunit of protein phosphatase-1 (PP1) and inhibit PP1 activity, suggesting that this protein may function as a regulatory subunit of PP1. Alternative splicing at this locus results in several transcript variants encoding different isoforms.
7787	This gene encodes an endoplasmic reticulum associated protein that is involved in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI anchor is a glycolipid found on many blood cells and which serves to anchor proteins to the cell surface. The protein encoded by this gene is a subunit of the GPI N-acetylglucosaminyl (GlcNAc) transferase that transfers GlcNAc to phosphatidylinositol (PI) on the cytoplasmic side of the endoplasmic reticulum.
7788	The protein encoded by this gene is a member of the signal-regulatory-protein (SIRP) family, and also belongs to the immunoglobulin superfamily. SIRP family members are receptor-type transmembrane glycoproteins known to be involved in the negative regulation of receptor tyrosine kinase-coupled signaling processes. The function of this protein has not yet been determined. Two alternatively spliced transcript variants of this gene, which encode distinct isoforms, have been reported.
7789	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains an extracellular domain, a single transmembrane segment and two tandem intracytoplasmic catalytic domains, and thus represents a receptor-type PTP. This PTP has been shown to dephosphorylate and activate Src family tyrosine kinases, and is implicated in the regulation of integrin signaling, cell adhesion and proliferation. Three alternatively spliced variants of this gene, which encode two distinct isoforms, have been reported.
7790	
7791	The protein encoded by this gene is found in the nucleus and is a cofactor of DNA polymerase delta. The encoded protein acts as a homotrimer and helps increase the processivity of leading strand synthesis during DNA replication. In response to DNA damage, this protein is ubiquitinated and is involved in the RAD6-dependent DNA repair pathway. Two transcript variants encoding the same protein have been found for this gene. Pseudogenes of this gene have been described on chromosome 4 and on the X chromosome.
7792	The protein encoded by this gene catalyzes the formation of inositol 1,4,5-trisphosphate and diacylglycerol from phosphatidylinositol 4,5-bisphosphate. This reaction uses calcium as a cofactor and plays an important role in the intracellular transduction of many extracellular signals in the retina. Two transcript variants encoding different isoforms have been found for this gene.
7793	The protein encoded by this gene is a GTPase and member of the RAS-like GTP-binding protein family. The encoded protein is expressed in endothelial cells, where it promotes reorganization of the actin cytoskeleton and morphological changes in the cells.
7794	
7795	
7796	
7797	The protein encoded by this gene shares similarity with Saccharomyces cerevisiae Cdc91, a predicted integral membrane protein that may function in cell division control. The protein encoded by this gene is the fifth subunit of GPI transamidase that attaches GPI-anchors to proteins.
7798	This gene is a classical cadherin from the cadherin superfamily. The encoded protein is a calcium-dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. Based on studies in chicken and mouse, this cadherin is thought to play an important role during brain segmentation and neuronal outgrowth. In addition, a role in kidney and muscle development is indicated. Of particular interest are studies showing stable cis-heterodimers of cadherins 2 and 4 in cotransfected cell lines. Previously thought to interact in an exclusively homophilic manner, this is the first evidence of cadherin heterodimerization.
7799	SAMSN1 is a member of a novel gene family of putative adaptors and scaffold proteins containing SH3 and SAM (sterile alpha motif) domains (Claudio et al., 2001).[supplied by OMIM]
7800	
7801	
7802	
7803	The protein encoded by this gene is a type I integral membrane protein and belongs to the type II cytokine family of receptors. The encoded protein forms part of the interferon gamma receptor and is thought to interact with GAF, JAK1, and/or JAK2 in the interferon gamma signal transduction pathway.
7804	
7805	Carbonyl reductase is one of several monomeric, NADPH-dependent oxidoreductases having wide specificity for carbonyl compounds. This enzyme is widely distributed in human tissues. Another carbonyl reductase gene, CRB3, lies close to this gene on chromosome 21q.
7806	This gene encodes a protein that belongs to the serine protease family. The encoded protein contains a type II transmembrane domain, a receptor class A domain, a scavenger receptor cysteine-rich domain and a protease domain. Serine proteases are known to be involved in many physiological and pathological processes. This gene was demonstrated to be up-regulated by androgenic hormones in prostate cancer cells and down-regulated in androgen-independent prostate cancer tissue. The protease domain of this protein is thought to be cleaved and secreted into cell media after autocleavage. The biological function of this gene is unknown.
7807	Phosphofructokinase (PFK) is a tetrameric enzyme that catalyzes a key step in glycolysis, namely the conversion of D-fructose 6-phosphate to D-fructose 1,6-bisphosphate. Separate genes encode a muscle subunit (M) and a liver subunit (L). PFK from muscle is a homotetramer of M subunits, PFK from liver is a homotetramer of L-subunits, while PFK from platelets can be composed of any tetrameric combination of M and L subunits. The protein encoded by this gene represents the L subunit. Two transcript variants encoding different isoforms have been found for this gene.
7808	Phosphofructokinase (PFK) is a tetrameric enzyme that catalyzes a key step in glycolysis, namely the conversion of D-fructose 6-phosphate to D-fructose 1,6-bisphosphate. Separate genes encode a muscle subunit (M) and a liver subunit (L). PFK from muscle is a homotetramer of M subunits, PFK from liver is a homotetramer of L-subunits, while PFK from platelets can be composed of any tetrameric combination of M and L subunits. The protein encoded by this gene represents the L subunit. Two transcript variants encoding different isoforms have been found for this gene.
7809	Transport of folate compounds into mammalian cells can occur via receptor-mediated (see MIM 136430) or carrier-mediated mechanisms. A functional coordination between these 2 mechanisms has been proposed to be the method of folate uptake in certain cell types. Methotrexate (MTX) is an antifolate chemotherapeutic agent that is actively transported by the carrier-mediated uptake system. RFC1 plays a role in maintaining intracellular concentrations of folate.[supplied by OMIM]
7810	The protein encoded by this gene catalyzes the hydrolysis of cAMP and cGMP to their corresponding monophosphates. The encoded protein plays a role in signal transduction by regulating the intracellular concentration of these cyclic nucleotides. Multiple transcript variants encoding several different isoforms have been found for this gene.
7811	
7812	
7813	This gene, a member of the SNAP25 gene family, encodes a protein involved in multiple membrane trafficking steps. Two other members of this gene family, SNAP23 and SNAP25, encode proteins that bind a syntaxin protein and mediate synaptic vesicle membrane docking and fusion to the plasma membrane. The protein encoded by this gene binds tightly to multiple syntaxins and is localized to intracellular membrane structures rather than to the plasma membrane. While the protein is mostly membrane-bound, a significant fraction of it is found free in the cytoplasm. Use of multiple polyadenylation sites has been noted for this gene.
7814	CD179a (VpreB) is a 126 aa-long polypeptide with apparent MW of 16-18 kDa. It is expressed selectively at the early stages of B cell development, namely, in proB and early preB cells. CD179a has an Ig V domain-like structure, but lacks the last beta-strand (beta7) of a typical V domain. Instead, it has a carboxyl terminal end that shows no sequence homologies to any other proteins. CD179a associates non-covalently with CD179b (lambda5 or lambda-like) carrying an Ig C domain-like structure to form an Ig light chain-like structure, which is called the surrogate light chain or pseudo light chain. In this complex, the incomplete V domain of CD179a appears to be complemented by the extra beta7 strand of CD179b. On the surface of early preB cells, CD179a/CD179b surrogate light chain is disulfide-linked to membrane-bound Ig mu heavy chain in association with a signal transducer CD79a/CD79b heterodimer to form a B cell receptor-like structure, so-called preB cell receptor (preBCR). Though no CD179a-related human disease or pathology has been reported yet, the deficiency of other components of preB cell receptor such as CD179b, Ig mu heavy chain and CD79a has been shown to result in severe impairment of B cell development and agammaglobulinemia in human. PreBCR transduces signals for: 1) cellular proliferation, differentiation from the proB cell to preB cell stage, 2) allelic exclusion at the Ig heavy chain gene locus, and 3) promotion of Ig light chain gene rearrangements. Thus, preBCR functions as a checkpoint in early B cell development to monitor the production of Ig mu heavy chain through a functional rearrangement of Ig heavy chain gene as well as the potency of Ig mu heavy chain to associate with Ig light chain [PROW]
7815	This gene encodes a protein with no known function but with slight similarity to a yeast vacuolar protein. The gene is located in a region deleted in pediatric rhabdoid tumors of the brain, kidney and soft tissues, but mutations in this gene have not been associated with the disease.
7816	
7817	
7818	
7819	The protein encoded by this gene is a plasma protease inhibitor and member of the serine protease inhibitor class.  Polymorphisms in this protein appear to be tissue specific and influence protease targeting.  Variation in this protein@#$%&s sequence have been implicated in Alzheimer@#$%&s disease, and deficiency of this protein has been associated with liver disease.  Mutations have been identified in patients with Parkinson disease and chronic obstructive pulmonary disease.
7820	This gene encodes a cytosolic protein which belongs to a family of lipid-binding proteins including Sec14p, alpha-tocopherol transfer protein, and cellular retinol-binding protein. The encoded protein stimulates squalene monooxygenase which is a downstream enzyme in the cholesterol biosynthetic pathway.
7821	
7822	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class and it may play a role in regulation of hematopoiesis. Alternatively spliced transcript variants of this gene have been reported, but the full length nature of the variants has not been defined.
7823	
7824	The product of this gene is a mitochondrial matrix enzyme that is encoded by the nucleus. It may play roles in cyanide detoxification, the formation of iron-sulfur proteins, and the modification of sulfur-containing enzymes. The gene product contains two highly conservative domains (rhodanese homology domains), suggesting these domains have a common evolutionary origin.
7825	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H1 family.
7826	The protein encoded by this gene is a serine/threonine protein kinase and a member of the casein kinase I protein family, whose members have been implicated in the control of cytoplasmic and nuclear processes, including DNA replication and repair. The encoded protein is found in the cytoplasm as a monomer and can phosphorylate a variety of proteins, including itself. This protein has been shown to phosphorylate period, a circadian rhythm protein. Two transcript variants encoding the same protein have been found for this gene.
7827	This gene encodes a protein similar to the rat neuronal pentraxin receptor. The rat protein pentraxin is thought to mediate neuronal uptake of synaptic material and the presynaptic snake venom toxin, taipoxin. Alternative splicing of this gene results in two transcript variants encoding two isoforms which differ by one amino acid. Studies in rat indicate that translation of this mRNA initiates at a non-AUG (CUG) codon. This may also be true for human and mouse, based on strong sequence conservation amongst these species.
7828	
7829	
7830	SMC1L2 belongs to a family of proteins required for chromatid cohesion and DNA recombination during meiosis and mitosis.[supplied by OMIM]
7831	Activation of members of the mitogen-activated protein kinase family is a major mechanism for transduction of extracellular signals. Stress-activated protein kinases are one subclass of MAP kinases. The protein encoded by this gene functions as a signal transducer during differentiation of myoblasts to myotubes.
7832	
7833	
7834	
7835	
7836	
7837	
7838	
7839	
7840	Beta-catenin is an adherens junction protein. Adherens junctions (AJs; also called the zonula adherens) are critical for the establishment and maintenance of epithelial layers, such as those lining organ surfaces. AJs mediate adhesion between cells, communicate a signal that neighboring cells are present, and anchor the actin cytoskeleton. In serving these roles, AJs regulate normal cell growth and behavior. At several stages of embryogenesis, wound healing, and tumor cell metastasis, cells form and leave epithelia. This process, which involves the disruption and reestablishment of epithelial cell-cell contacts, may be regulated by the disassembly and assembly of AJs. AJs may also function in the transmission of the @#$%&contact inhibition@#$%& signal, which instructs cells to stop dividing once an epithelial sheet is complete.[supplied by OMIM]
7841	Beta-catenin is an adherens junction protein. Adherens junctions (AJs; also called the zonula adherens) are critical for the establishment and maintenance of epithelial layers, such as those lining organ surfaces. AJs mediate adhesion between cells, communicate a signal that neighboring cells are present, and anchor the actin cytoskeleton. In serving these roles, AJs regulate normal cell growth and behavior. At several stages of embryogenesis, wound healing, and tumor cell metastasis, cells form and leave epithelia. This process, which involves the disruption and reestablishment of epithelial cell-cell contacts, may be regulated by the disassembly and assembly of AJs. AJs may also function in the transmission of the @#$%&contact inhibition@#$%& signal, which instructs cells to stop dividing once an epithelial sheet is complete.[supplied by OMIM]
7842	
7843	
7844	
7845	The protein encoded by this gene belongs to the SEC13 family of WD-repeat proteins. It has similarity to the yeast SEC13 protein, which is required for vesicle biogenesis from endoplasmic reticulum during the transport of proteins. Two transcript variants encoding different isoforms have been found for this gene.
7846	This gene encodes one of seven subunits of the human Arp2/3 protein complex. The Arp2/3 protein complex has been implicated in the control of actin polymerization in cells and has been conserved through evolution. The exact role of the protein encoded by this gene, the p20 subunit, has yet to be determined.
7847	
7848	This gene encodes a protein sharing high sequence similarity with ribosomal protein L39. Although the name of this gene has been referred to as @#$%&ribosomal protein L39@#$%& in the public databases, its official name is @#$%&ribosomal protein L39-like@#$%&. It is not currently known whether the encoded protein is a functional ribosomal protein or whether it has evolved a function that is independent of the ribosome.
7849	The serine-threonine protein kinase encoded by the AKT1 gene is catalytically inactive in serum-starved primary and immortalized fibroblasts. AKT1 and the related AKT2 are activated by platelet-derived growth factor. The activation is rapid and specific, and it is abrogated by mutations in the pleckstrin homology domain of AKT1. It was shown that the activation occurs through phosphatidylinositol 3-kinase. In the developing nervous system AKT is a critical mediator of growth factor-induced neuronal survival. Survival factors can suppress apoptosis in a transcription-independent manner by activating the serine/threonine kinase AKT1, which then phosphorylates and inactivates components of the apoptotic machinery. Multiple alternatively spliced transcript variants have been found for this gene.
7850	
7851	
7852	The protein encoded by this gene is localized exclusively in the cell nucleus. It plays a role in signal transduction, and may function in the maintenance of sarcomeres and in the assembly of myofibrils into sarcomeres. It also plays an important role in stress fiber formation. The gene is involved in therapy-related leukemia by a chromosomal translocation t(3;11)(p21;q23) that involves this gene and the myeloid/lymphoid leukemia gene. Alternative splicing occurs in this locus and two transcript variants encoding distinct isoforms have been identified.
7853	
7854	
7855	Adaptins are important components of clathrin-coated vesicles transporting ligand-receptor complexes from the plasma membrane or from the trans-Golgi network to lysosomes. The adaptin family of proteins is compsed of four classes of molecules named alpha, beta-, beta prime- and gamma- adaptins. Adaptins, together with medium and small subunits, form a heterotetrameric complex called an adaptor, whose role is to promote the formation of clathrin-coated pits and vesicles. The protein encoded by this gene is a gamma-adaptin protein and it belongs to the adaptor complexes large subunits family. This protein along with the complex is thought to function at some trafficking step in the complex pathways between the trans-Golgi network and the cell surface. There are two alternatively spliced transcript variants of this gene encoding the same protein.
7856	The function of this gene has not yet been determined; however, since it is related to the villin gene, it may play a role in actin-bundling.
7857	ENTPD3 is similar to E-type nucleotidases (NTPases). NTPases, such as CD39, mediate catabolism of extracellular nucleotides. ENTPD3 contains 4 apyrase-conserved regions which is characteristic of NTPases.
7858	The calcium-sensing receptor (CASR) functions as a sensor for parathyroid and kidney to determine the extracellular calcium concentration and thus helps to maintain a stable calcium concentration. Mutations that inactivate CASR cause familial hypocalciuric hypercalcemia, whereas mutations that activate CASR are the cause of autosomal dominant hypocalcemia. An altenatively spliced transcript variant encoding 1088 aa has been found for this gene, but its full-length nature has not been defined.
7859	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L23P family of ribosomal proteins. It is located in the cytoplasm. The protein may be one of the target molecules involved in mediating growth inhibition by interferon. In yeast, the corresponding protein binds to a specific site on the 26S rRNA. This gene is co-transcribed with the U42A, U42B, U101A, and U101B small nucleolar RNA genes, which are located in its third, first, second, and fourth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
7860	The protein encoded by this gene is a lipid-requiring mitochondrial membrane enzyme. This protein has a specific requirement of phosphatidylcholine for optimal enzymatic function and is a member of the short-chain alcohol dehydrogenase superfamily. Three alternatively spliced transcript variants encoding the same protein have been observed.
7861	
7862	
7863	
7864	The protein encoded by this gene belongs to the claudin family of tight junction associated proteins and is a major component of central nervous system myelin that is necessary for normal CNS function. There is growing evidence that the protein determines the permeability between layers of myelin sheaths via focal adhesion and, with its expression highly regulated during development, may play an important role in cellular proliferation and migration. In addition, the protein is a candidate autoantigen in the development of autoimmune demyelinating disease.
7865	This gene encodes one of the essential subunits of RNA polymerase II that is shared by the other two eukaryotic DNA-directed RNA polymerases, I and III.
7866	Phosphatidylserine-specific phospholipase A1-alpha (PLA1A) acts specifically on phosphatidylserine (PS) and 1-acyl-2-lysophosphatidylserine (lyso-PS) to hydrolyze fatty acids at the sn-1 position of these phospholipids.[supplied by OMIM]
7867	
7868	
7869	
7870	The protein encoded by this gene is a peripheral membrane protein which recycles between the cytosol and the Golgi apparatus during interphase. It is regulated by phosphorylation: dephosphorylated protein associates with the Golgi membrane and dissociates from the membrane upon phosphorylation. Ras-associated protein 1 recruits this protein to coat protein complex II (COPII) vesicles during budding from the endoplasmic reticulum, where it interacts with a set of COPII vesicle-associated SNAREs to form a cis-SNARE complex that promotes targeting to the Golgi apparatus. Transport from the ER to the cis/medial Golgi compartments requires the action of this gene product, GM130 and giantin in a sequential manner.
7871	
7872	This gene product mediates the attachment of erythroblasts to macrophages. This attachment promotes terminal maturation and enucleation of erythroblasts, presumably by suppressing apoptosis. This protein is an integral membrane protein with the N-terminus on the extracellular side and the C-terminus on the cytoplasmic side of the cell. Two immunologically related isoforms of erythroblast macrophage protein with apparent molecular weights of 33 kD and 36 kD were detected in macrophage membranes; this gene encodes the larger isoform. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants has not been determined.
7873	DISCONTINUED: The record for PBM1 has been withdrawn by HGNC.
7874	The protein encoded by this gene is a bZIP transcription factor which can bind as a homodimer to certain DNA regulatory regions. It can also form heterodimers with the related protein CEBP-delta. The encoded protein may be essential for terminal differentiation and functional maturation of committed granulocyte progenitor cells. Mutations in this gene have been associated with Specific Granule Deficiency, a rare congenital disorder. Multiple variants of this gene have been described, but the full-length nature of only one has been determined.
7875	The protein encoded by this gene is a member of the Toll-like receptor (TLR) family which plays a fundamental role in pathogen recognition and activation of innate immunity. TLRs are highly conserved from Drosophila to humans and share structural and functional similarities. They recognize pathogen-associated molecular patterns (PAMPs) that are expressed on infectious agents, and mediate the production of cytokines necessary for the development of effective immunity. The various TLRs exhibit different patterns of expression. This receptor is most abundantly expressed in placenta and pancreas, and is restricted to the dendritic subpopulation of the leukocytes. It recognizes dsRNA associated with viral infection, and induces the activation of NF-kappaB and the production of type I interferons. It may thus play a role in host defense against viruses. Use of alternative polyadenylation sites to generate different length transcripts has been noted for this gene.
7876	
7877	Alpha-synuclein is a member of the synuclein family, which also includes beta- and gamma-synuclein. Synucleins are abundantly expressed in the brain and alpha- and beta-synuclein inhibit phospholipase D2 selectively. SNCA may serve to integrate presynaptic signaling and membrane trafficking. Defects in SNCA have been implicated in the pathogenesis of Parkinson disease. SNCA peptides are a major component of amyloid plaques in the brains of patients with Alzheimer@#$%&s disease. Two alternatively spliced transcripts of SNCA have been identified. Additional splicing may be present but the full-length nature of these variants has not been determined.
7878	The protein encoded by this gene is a cytokine that regulates T and natural killer cell activation and proliferation. This cytokine and interleukine 2 share many biological activities. They are found to bind common hematopoietin receptor subunits, and may compete for the same receptor, and thus negatively regulate each other@#$%&s activity. The number of CD8+ memory cells is shown to be controlled by a balance between this cytokine and IL2. This cytokine induces the activation of JAK kinases, as well as the phosphorylation and activation of transcription activators STAT3, STAT5, and STAT6. Studies of the mouse counterpart suggested that this cytokine may increase the expression of apoptosis inhibitor BCL2L1/BCL-x(L), possibly through the transcription activation activity of STAT6, and thus prevent apoptosis. Three alternatively spliced transcript variants of this gene encoding two distinct isoforms have been reported.
7879	
7880	
7881	
7882	This gene encodes a cell surface tyrosine kinase receptor for members of the platelet-derived growth factor family. These growth factors are mitogens for cells of mesenchymal origin. The identity of the growth factor bound to a receptor monomer determines whether the functional receptor is a homodimer or a heterodimer, composed of both platelet-derived growth factor receptor alpha and beta polypeptides. Studies in knockout mice, where homozygosity is lethal, indicate that the alpha form of the platelet-derived growth factor receptor is particularly important for kidney development since mice heterozygous for the receptor exhibit defective kidney phenotypes.
7883	The protein encoded by this gene is a small GTPase of the Rho-subfamily, which regulates signaling pathways that control diverse cellular functions including cell morphology, migration, endocytosis and cell cycle progression. This protein is highly similar to Saccharomyces cerevisiae Cdc 42, and is able to complement the yeast cdc42-1 mutant. The product of oncogene Dbl was reported to specifically catalyze the dissociation of GDP from this protein. This protein could regulate actin polymerization through its direct binding to Neural Wiskott-Aldrich syndrome protein (N-WASP), which subsequently activates Arp2/3 complex. Alternative splicing of this gene results in at least two transcript variants.
7884	
7885	
7886	HLA-G belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-G is expressed on fetal derived placental cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domain, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exon 6 encodes the cytoplasmic tail.
7887	This gene encodes a protein of unknown function. It is located in the Smith-Magenis syndrome region on chromosome 17.
7888	
7889	
7890	
7891	
7892	This gene encodes a 105 kD protein which can undergo cotranslational processing by the 26S proteasome to produce a 50 kD protein. The 105 kD protein is a Rel protein-specific transcription inhibitor and the 50 kD protein is a DNA binding subunit of the NF-kappa-B (NFKB) protein complex. NFKB is a transcription regulator that is activated by various intra- and extra-cellular stimuli such as cytokines, oxidant-free radicals, ultraviolet irradiation, and bacterial or viral products. Activated NFKB translocates into the nucleus and stimulates the expression of genes involved in a wide variety of biological functions. Inappropriate activation of NFKB has been associated with a number of inflammatory diseases while persistent inhibition of NFKB leads to inappropriate immune cell development or delayed cell growth.
7893	
7894	
7895	
7896	
7897	
7898	
7899	
7900	
7901	
7902	Neuropeptide B (NPB) is an endogenous peptide ligand for G protein-coupled receptor-7 (GPR7; MIM 600730).[supplied by OMIM]
7903	The P-type adenosinetriphosphatases (P-type ATPases) are a family of proteins which use the free energy of ATP hydrolysis to drive uphill transport of ions across membranes. Several subfamilies of P-type ATPases have been identified. One subfamily catalyzes transport of heavy metal ions. Another subfamily transports non-heavy metal ions (NMHI). The protein encoded by this gene is a member of the third subfamily of P-type ATPases and acts to transport amphipaths, such as phosphatidylserine.
7904	
7905	
7906	
7907	
7908	Mitochondrial uncoupling proteins (UCP) are members of the family of mitochondrial anion carrier proteins (MACP). UCPs separate oxidative phosphorylation from ATP synthesis with energy dissipated as heat, also referred to as the mitochondrial proton leak. UCPs facilitate the transfer of anions from the inner to the outer mitochondrial membrane and the return transfer of protons from the outer to the inner mitochondrial membrane. They also reduce the mitochondrial membrane potential in mammalian cells. Tissue specificity occurs for the different UCPs and the exact methods of how UCPs transfer H+/OH- are not known. UCPs contain the three homologous protein domains of MACPs. This gene is expressed only in brown adipose tissue, a specialized tissue which functions to produce heat.
7909	This gene encodes class II alcohol dehydrogenase 4 pi subunit, which is a member of the alcohol dehydrogenase family. Members of this enzyme family metabolize a wide variety of substrates, including ethanol, retinol, other aliphatic alcohols, hydroxysteroids, and lipid peroxidation products. Class II alcohol dehydrogenase is a homodimer composed of 2 pi subunits. It exhibits a high activity for oxidation of long-chain aliphatic alcohols and aromatic alcohols and is less sensitive to pyrazole. This gene is localized to chromosome 4 in the cluster of alcohol dehydrogenase genes.
7910	
7911	
7912	
7913	
7914	This gene encodes a protein that belongs to the inositol phosphokinase (IPK) family. This protein is likely responsible for the conversion of inositol hexakisphosphate (InsP6) to diphosphoinositol pentakisphosphate (InsP7/PP-InsP5). It may also convert 1,3,4,5,6-pentakisphosphate (InsP5) to PP-InsP4 and affect the growth suppressive and apoptotic activities of interferon-beta in some ovarian cancers. Alternative splicing results in multiple transcript variants encoding different isoforms.
7915	The protein encoded by this gene is a secreted cytokine that is important for the proliferation of T and B lymphocytes. The receptor of this cytokine is a heterotrimeric protein complex whose gamma chain is also shared by interleukin 4 (IL4) and interleukin 7 (IL7). The expression of this gene in mature thymocytes is monoallelic, which represents an unusual regulatory mode for controlling the precise expression of a single gene. The targeted disruption of a similar gene in mice leads to ulcerative colitis-like disease, which suggests an essential role of this gene in the immune response to antigenic stimuli.
7916	The protein encoded by this gene is a secreted cytokine that is important for the proliferation of T and B lymphocytes. The receptor of this cytokine is a heterotrimeric protein complex whose gamma chain is also shared by interleukin 4 (IL4) and interleukin 7 (IL7). The expression of this gene in mature thymocytes is monoallelic, which represents an unusual regulatory mode for controlling the precise expression of a single gene. The targeted disruption of a similar gene in mice leads to ulcerative colitis-like disease, which suggests an essential role of this gene in the immune response to antigenic stimuli.
7917	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order      structures. This gene is intronless and encodes a member of the histone H2A family.
7918	FGFBP1, or HBP17, binds to both acidic (FGF1; MIM 131220) and basic (FGF2; MIM 134920) fibroblast growth factors in a reversible manner. It also binds to perlecan (HSPG2; MIM 142461).[supplied by OMIM]
7919	
7920	
7921	
7922	
7923	
7924	
7925	
7926	
7927	
7928	Ankyrins are a family of proteins that are believed to link the integral membrane proteins to the underlying spectrin-actin cytoskeleton and play key roles in activities such as cell motility, activation, proliferation, contact, and the maintenance of specialized membrane domains. Multiple isoforms of ankyrin with different affinities for various target proteins are expressed in a tissue-specific, developmentally regulated manner. Most ankyrins are typically composed of three structural domains: an amino-terminal domain containing multiple ankyrin repeats; a central region with a highly conserved spectrin binding domain; and a carboxy-terminal regulatory domain which is the least conserved and subject to variation. Ankyrin 3 is an immunologically distinct gene product from ankyrins 1 and 2, and was originally found at the axonal initial segment and nodes of Ranvier of neurons in the central and peripheral nervous systems. Alternatively spliced variants may be expressed in other tissues. Although multiple transcript variants encoding several different isoforms have been found for this gene, the full-length nature of only two have been characterized.
7929	
7930	
7931	This gene is a member of the leucine-rich repeat and PDZ domain (LAP) family. The encoded protein contains 17 leucine-rich repeats and one PDZ domain. It binds to the unphosphorylated form of the ERBB2 protein and regulates ERBB2 function and localization. It has also been shown to affect the Ras signaling pathway by disrupting Ras-Raf interaction. Alternate transcriptional splice variants encoding different isoforms have been found for this gene, but only two of them have been characterized to date.
7932	
7933	Calcitonin is a peptide hormone synthesized by the parafollicular cells of the thyroid. It causes reduction in serum calcium, an effect opposite to that of parathyroid hormone (PTH; MIM 168450).[supplied by OMIM]
7934	The protein encoded by this gene is a neuronal surface glycoprotein found in cortical and subcortical regions of the limbic system. During development of the limbic system, this encoded protein is found on the surface of axonal membranes and growth cones, where it acts as a selective homophilic adhesion molecule, and guides the development of specific patterns of neuronal connections.
7935	This tumor suppressor gene is a member of a nuclear histone/protein methyltransferase superfamily. It encodes a zinc finger protein that can bind to retinoblastoma protein, estrogen receptor, and the TPA-responsive element (MTE) of the heme-oxygenase-1 gene. Although the functions of this protein have not been fully characterized, it may (1) play a role in transcriptional regulation during neuronal differentiation and pathogenesis of retinoblastoma, (2) act as a transcriptional activator of the heme-oxygenase-1 gene, and (3) be a specific effector of estrogen action. Three transcript variants encoding different isoforms have been found for this gene.
7936	
7937	Natriuretic peptides comprise a family of 3 structurally related molecules: atrial natriuretic peptide (ANP; MIM 108780), brain natriuretic peptide (BNP; MIM 600295), and C-type natriuretic peptide, CNP, encoded by a gene symbolized NPPC. These peptides possess potent natriuretic, diuretic, and vasodilating activities and are implicated in body fluid homeostasis and blood pressure control.[supplied by OMIM]
7938	The Ra-reactive factor (RARF) is a complement-dependent bactericidal factor that binds to the Ra and R2 polysaccharides expressed by certain enterobacteria. Alternate splicing of this gene results in two transcript variants encoding two RARF components that are involved in the mannan-binding lectin pathway of complement activation. The longer isoform is cleaved into two chains which form a heterodimer linked by a disulfide bond. The encoded proteins are members of the trypsin family of peptidases.
7939	
7940	
7941	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
7942	Endocytosis of cell surface proteins is mediated by a complex molecular machinery that assembles on the inner surface of the plasma membrane. This gene encodes one of two human homologs of the Drosophila melanogaster stoned B protein. This protein is related to components of the endocytic machinery and exhibits a modular structure consisting of an N-terminal proline-rich domain, a central region of homology specific to the human stoned B-like proteins, and a C-terminal region homologous to the mu subunits of adaptor protein (AP) complexes. Co-transcription of this gene and the neighboring downstream gene generates a rare transcript (SALF), which encodes a fusion protein comprised of sequence sharing identity with each individual gene product.
7943	Roadblock/LC7 is one member of an ancient dynein light chain protein family conserved in nematode, fruit fly, mouse, and rat. The protein encoded by this gene is one of two human proteins that share sequence similarity with roadblock/LC7. Expression of this gene is up-regulated in hepatocellular carcinomas, suggesting that this gene may be involved in tumor progression. Multiple transcript variants encoding different isoforms have been found for this gene.
7944	This gene encodes one of the minor histocompatibility antigens, which play an important role in the induction of cytotoxic T lymphocyte (CTL) reactivity against leukemia after human histocompatibility leukocyte antigen (HLA)-identical allogeneic bone marrow transplantation (BMT). This gene is only expressed in B cell acute lymphoblastic leukemia cells and Epstein-Barr virus-transformed B cells. The translation of this mRNA initiates at a non-AUG (CUG) codon.
7945	
7946	
7947	The protein encoded by this gene is a member of the LAR protein-tyrosine phosphatase-interacting protein (liprin) family. Liprins interact with members of LAR family of transmembrane protein tyrosine phosphatases, which are known to be important for axon guidance and mammary gland development. This protein binds to the intracellular membrane-distal phosphatase domain of tyrosine phosphatase LAR, and appears to localize LAR to cell focal adhesions. This interaction may regulate the disassembly of focal adhesion and thus help orchestrate cell-matrix interactions. Alternatively spliced transcript variants encoding distinct isoforms have been described.
7948	
7949	
7950	
7951	
7952	Steroid 5-alpha-reductase (EC 1.3.99.5) catalyzes the conversion of testosterone into the more potent androgen, dihydrotestosterone (DHT). There are 2 isoforms of the enzyme: SRD5A1 and SRD5A2 (MIM 607306).[supplied by OMIM]
7953	
7954	
7955	HLA-DOA belongs to the HLA class II alpha chain paralogues. HLA-DOA forms a heterodimer with HLA-DOB. The heterodimer, HLA-DO, is found in lysosomes in B cells and regulates HLA-DM-mediated peptide loading on MHC class II molecules. In comparison with classical HLA class II molecules, this gene exhibits very little sequence variation, especially at the protein level.
7956	
7957	Sulfotransferase enzymes catalyze the sulfate conjugation of many hormones, neurotransmitters, drugs, and xenobiotic compounds. These cytosolic enzymes are different in their tissue distributions and substrate specificities. The gene structure (number and length of exons) is similar among family members. However, the total genomic length of this gene is greater than that of other SULT1 genes.
7958	Phosphorylation of serine and threonine residues in proteins is a crucial step in the regulation of many cellular functions ranging from hormonal regulation to cell division and even short-term memory. The level of phosphorylation is controlled by the opposing actions of protein kinases and protein phosphatases. Protein phosphatase 1 (PP1) is 1 of 4 major serine/threonine-specific protein phosphatases which have been identified in eukaryotic cells. PP1 associates with various regulatory subunits that dictate its subcellular localization and modulate its substrate specificity. Several subunits that target PP1 to glycogen have been identified. This gene encodes a glycogen-targeting subunit of PP1.
7959	Serum-inducible kinase is a member of the @#$%&polo@#$%& family of serine/threonine protein kinases that have a role in normal cell division.[supplied by OMIM]
7960	This gene encodes a member of the regulator of G-protein signaling family. This protein is contains a conserved, 120 amino acid motif called the RGS domain. The protein attenuates the signaling activity of G-proteins by binding to activated, GTP-bound G alpha subunits and acting as a GTPase activating protein (GAP), increasing the rate of conversion of the GTP to GDP. This hydrolysis allows the G alpha subunits to bind G beta/gamma subunit heterodimers, forming inactive G-protein heterotrimers, thereby terminating the signal. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
7961	
7962	
7963	
7964	
7965	
7966	
7967	
7968	
7969	
7970	
7971	
7972	
7973	The activation of gene transcription is a multistep process that is triggered by factors that recognize transcriptional enhancer sites in DNA. These factors work with co-activators to direct transcriptional initiation by the RNA polymerase II apparatus. The protein encoded by this gene is a subunit of the CRSP (cofactor required for SP1 activation) complex, which, along with TFIID, is required for efficient activation by SP1. This protein is also a component of other multisubunit complexes e.g. thyroid hormone receptor-(TR-) associated proteins which interact with TR and facilitate TR function on DNA templates in conjunction with initiation factors and cofactors.
7974	Glutamate receptors are the predominant excitatory neurotransmitter receptors in the mammalian brain and are activated in a variety of normal neurophysiologic processes. These receptors are heteromeric protein complexes with multiple subunits, each possessing transmembrane regions, and all arranged to form a ligand-gated ion channel. The classification of glutamate receptors is based on their activation by different pharmacologic agonists. The GRIA1 belongs to a family of alpha-amino-3-hydroxy-5-methyl-4-isoxazole propionate (AMPA) receptors. Each of the members (GRIA1-4) include flip and flop isoforms generated by alternative RNA splicing. The receptor subunits encoded by each isoform vary in their signal transduction properties. The isoform presented here is the flop isoform. In situ hybridization experiments showed that human GRIA1 mRNA is present in granule and pyramidal cells in the hippocampal formation.
7975	
7976	Fibulin 1 is a secreted glycoprotein that becomes incorporated into a fibrillar extracellular matrix. Calcium-binding is apparently required to mediate its binding to laminin and nidogen. It mediates platelet adhesion via binding fibrinogen. Four splice variants which differ in the 3@#$%& end have been identified. Each variant encodes a different isoform, but no functional distinctions have been identified among the four variants.
7977	The protein encoded by this gene is a member of the fibroblast growth factor receptor family, where amino acid sequence is highly conserved between members and throughout evolution. FGFR family members differ from one another in their ligand affinities and tissue distribution. A full-length representative protein consists of an extracellular region, composed of three immunoglobulin-like domains, a single hydrophobic membrane-spanning segment and a cytoplasmic tyrosine kinase domain. The extracellular portion of the protein interacts with fibroblast growth factors, setting in motion a cascade of downstream signals, ultimately influencing mitogenesis and differentiation. This particular family member is a high-affinity receptor for acidic, basic and/or keratinocyte growth factor, depending on the isoform. Mutations in this gene are associated with many craniosynostotic syndromes and bone malformations. The genomic organization of this gene encompasses 20 exons. Alternative splicing in multiple exons, including those encoding the Ig-like domains, the transmembrane region and the carboxyl terminus, results in varied isoforms which differ in structure and specificity.
7978	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1, a B-box type 2, and a coiled-coil region. The protein localizes to both the nucleus and the cytoplasm, and may represent a participant in the initiation of glycogen synthesis. Multiple transcript variants have been found for this gene, and some of them encode the same isoform.
7979	
7980	DBP is a member of the PAR bZIP (proline and acidic amino acid-rich basic leucine zipper) transcription factor family (Khatib et al., 1994).[supplied by OMIM]
7981	
7982	
7983	
7984	
7985	
7986	
7987	
7988	
7989	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin forms a complex with CDK7 kinase and ring finger protein MAT1. The kinase complex is able to phosphorylate CDK2 and CDC2 kinases, thus functions as a CDK-activating kinase (CAK). This cyclin and its kinase partner are components of TFIIH, as well as RNA polymerase II protein complexes. They participate in two different transcriptional regulation processes, suggesting an important link between basal transcription control and the cell cycle machinery.
7990	This gene is a member of the MYC family and encodes a protein with a basic helix-loop-helix (bHLH) domain. This protein is located in the nucleus and must dimerize with another bHLH protein in order to bind DNA. Amplification of this gene is associated with a variety of tumors, most notably neuroblastomas.
7991	
7992	
7993	
7994	This gene encodes a component of prolyl 4-hydroxylase, a key enzyme in collagen synthesis composed of two identical alpha subunits and two beta subunits. The encoded protein is one of several different types of alpha subunits and provides the major part of the catalytic site of the active enzyme. In collagen and related proteins, prolyl 4-hydroxylase catalyzes the formation of 4-hydroxyproline that is essential to the proper three-dimensional folding of newly synthesized procollagen chains. Alternatively spliced transcript variants encoding different isoforms have been described.
7995	ECM2 encodes extracellular matrix protein 2, so named because it shares extensive similarity with known extracelluar matrix protiens.
7996	This gene encodes a member of the plakin protein family of adhesion junction plaque proteins. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the full-length nature of some variants has not been defined. It has been known that some isoforms are expressed in neural and muscle tissue, anchoring neural intermediate filaments to the actin cytoskeleton, and some isoforms are expressed in epithelial tissue, anchoring keratin-containing intermediate filaments to hemidesmosomes. Consistent with the expression, mice defective for this gene show skin blistering and neurodegeneration.
7997	
7998	This gene encodes a member of the DEAD box protein family. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure, such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The protein encoded by this gene is a component of the 17S U2 snRNP complex; it plays an important role in pre-mRNA splicing.
7999	Lactate dehydrogenase A catalyzes the conversion of L-lactate and NAD to pyruvate and NADH in the final step of anaerobic glycolysis.  LDHA is found predominantly in muscle tissue and belongs to the lactate dehydrogenase family.  Mutations in LDHA have been linked to exertional myoglobinuria.
8000	This gene is a member of the glutathione peroxidase family and encodes a selenium-dependent glutathione peroxidase that is one of two isoenzymes responsible for the majority of the glutathione-dependent hydrogen peroxide-reducing activity in the epithelium of the gastrointestinal tract. Studies in knockout mice indicate that mRNA expression levels respond to luminal microflora, suggesting a role of the ileal glutathione peroxidases in preventing inflammation in the GI tract.
8001	
8002	
8003	
8004	Sodium channel beta subunits, such as SCN4B, are auxiliary subunits that alter the channel kinetics of voltage-gated alpha subunits (see MIM 182389).[supplied by OMIM]
8005	
8006	
8007	
8008	
8009	
8010	
8011	
8012	
8013	The product of this gene belongs to the phosphatase 2A regulatory subunit B family. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes an epsilon isoform of the regulatory subunit B56 subfamily.
8014	The protein encoded by this gene is a human guanine nucleotide releasing protein for Ras protein. It belongs to the adaptor-type Src homology (SH)2-containing molecules. Src homology 2 domains are globular protein modules present in a large variety of functionally distinct proteins. They mediate binding events that control the activity and localization of many proteins involved in the transmission of signals from the cell surface to the nucleus. The mRNAs of these proteins are expressed ubiquitously in human adult and fetal tissues. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some variants has not been determined.
8015	Acetylcholine receptors at mature mammalian neuromuscular junctions are pentameric protein complexes composed of four subunits in the ratio of two alpha subunits to one beta, one epsilon, and one delta subunit. The achetylcholine receptor changes subunit composition shortly after birth when the epsilon subunit replaces the gamma subunit seen in embryonic receptors. Mutations in the epsilon subunit are associated with congenital myasthenic syndrome.
8016	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the ATPase subunits, a member of the triple-A family of ATPases which have a chaperone-like activity. This subunit has been shown to interact with several of the basal transcription factors so, in addition to participation in proteasome functions, this subunit may participate in the regulation of transcription. This subunit may also compete with PSMC3 for binding to the HIV tat protein to regulate the interaction between the viral protein and the transcription complex.
8017	
8018	This gene is a member of the sulfatase family, and more specifically, the arylsulfatase subfamily. Members of the subfamily share similarity in sequence and splice sites, and are clustered together on chromosome X, suggesting that they are derived from recent gene duplication events. Sulfatases are essential for the correct composition of bone and cartilage matrix. The activity of this protein, unlike that of arylsulfatase E, is not inhibited by warfarin.
8019	This gene product shares extensive sequence similarity with the rat neuronal olfactomedin-related ER localized protein. While the exact function of the encoded protein is not known, its abundant expression in brain suggests that it may have an essential role in nerve tissue. Several alternatively spliced transcripts encoding different isoforms have been found for this gene.
8020	
8021	This gene encodes a member of the C-type lectin/C-type lectin-like domain (CTL/CTLD) superfamily. Members of this family share a common protein fold and have diverse functions, such as cell adhesion, cell-cell signalling, glycoprotein turnover, and roles in inflammation and immune response. The encoded type II transmembrane protein interacts with dnax-activation protein 12 and may play a role in cell activation. Alternative splice variants have been described but their full-length sequence has not been determined.
8022	
8023	
8024	
8025	Phosphorylase kinase (EC 2.7.1.38) is a crucial glycogenolytic regulatory enzyme. The holoenzymes are composed of 16 subunit proteins containing equimolar ratios of 4 subunit types known as alpha, beta, gamma, and delta. Skeletal muscle contains the highest amount of phosphorylase kinase enzymatic activity, although activity is also observed in liver, cardiac muscle, brain, and several other tissues. Kinetic differences exist between the cardiac and adult skeletal muscle enzymes, and the liver and skeletal muscle enzymes are known to be separate isozymes.[supplied by OMIM]
8026	
8027	This gene is a member of the TAFA family which is composed of five highly homologous genes that encode small secreted proteins. These proteins contain conserved cysteine residues at fixed positions, and are distantly related to MIP-1alpha, a member of the CC-chemokine family. The TAFA proteins are predominantly expressed in specific regions of the brain, and are postulated to function as brain-specific chemokines or neurokines, that act as regulators of immune and nervous cells.
8028	
8029	
8030	MHC class II gene expression is controlled primarily at the transcriptional level by transcription factors that bind to the X and Y boxes, two highly conserved elements in the proximal promoter of MHC class II genes. The protein encoded by this gene is a transcriptional repressor capable of binding to the conserved X box motif of HLA-DRA and other MHC class II genes in vitro. The protein may play a role in regulating the duration of an inflammatory response by limiting the period in which class II MHC molecules are induced by IFN-gamma. Three alternative splice variants, each of which encodes a different isoform, have been identified.
8031	
8032	
8033	
8034	
8035	
8036	This gene encodes a receptor for bradykinin. The 9 aa bradykinin peptide elicits many responses including vasodilation, edema, smooth muscle spasm and pain fiber stimulation. This receptor associates with G proteins that stimulate a phosphatidylinositol-calcium second messenger system. Alternate start codons result in two isoforms of the protein.
8037	Voltage-gated proton (hydrogen) channels play an important role in cellular defense against acidic stress. They are unique among ion channels with respect to their extremely high selectivity, marked temperature dependence, and unitary conductance, which is 3 orders of magnitude lower than that of most other ion channels. NOX1 is a homolog of the catalytic subunit of the superoxide-generating NADPH oxidase of phagocytes, gp91phox. Three splice variants of NOX1 have been identified, NOH-1L, NOH-1S and NOH-1Lv. The NOH-1S currents were reversibly blocked by zinc, a known H+ channel inhibitor. The NOH-1S variant does not contain an electron transport chain and it is thought that H+ conductance is its main physiologic function, whereas NOH-1L may conduct H+ ions as part of its electron transport mechanism.
8038	HMG-CoA reductase is the rate-limiting enzyme for cholesterol synthesis and is regulated via a negative feedback mechanism mediated by sterols and non-sterol metabolites derived from mevalonate, the product of the reaction catalyzed by reductase. Normally in mammalian cells this enzyme is suppressed by cholesterol derived from the internalization and degradation of low density lipoprotein (LDL) via the LDL receptor.  Competitive inhibitors of the reductase induce the expression of LDL receptors in the liver, which in turn increases the catabolism of plasma LDL and lowers the plasma concentration of cholesterol, an important determinant of atherosclerosis.
8039	Holocarboxylase synthetase (EC 6.3.4.10) covalently links biotin to propionyl-CoA carboxylase (PCCA; MIM 232000), pyruvate carboxylase (PC; MIM 608786), alpha-methylcrotonyl-CoA carboxylase (MCCC1; MIM 609010), and acetyl-CoA carboxylase (ACACA; MIM 200350).[supplied by OMIM]
8040	Many DNA-binding transcriptional activator proteins enhance the initiation rate of RNA polymerase II-mediated gene transcription by interacting functionally with the general transcription machinery bound at the basal promoter. Adaptor proteins are usually required for this activation, possibly to acetylate and destabilize nucleosomes, thereby relieving chromatin constraints at the promoter. The protein encoded by this gene is a transcriptional activator adaptor and has been found to be part of the PCAF histone acetylase complex. In addition, it associates with the tumor suppressor protein p53 and is required for full activity of p53 and p53-mediated apoptosis. At least four alternatively spliced variants have been found for this gene, but the full-length nature of some variants has not been determined.
8041	
8042	
8043	
8044	
8045	
8046	
8047	
8048	
8049	This gene encodes a protein with a conserved RNA-binding motif. The protein is found predominantly in the nucleus, although it is also present in the cytoplasm. It is preferentially associated with mRNAs produced by splicing, including both nuclear mRNAs and newly exported cytoplasmic mRNAs. It is thought that the protein remains associated with spliced mRNAs as a tag to indicate where introns had been present, thus coupling pre- and post-mRNA splicing events. Previously, it was thought that two genes encode this protein, RBM8A and RBM8B; it is now thought that the RBM8B locus is a pseudogene. Two alternative start codons result in two forms of the protein, and this gene also uses multiple polyadenylation sites.
8050	
8051	SLP-76 was originally identified as a substrate of the ZAP-70 protein tyrosine kinase following T cell receptor (TCR) ligation in the leukemic T cell line Jurkat. The SLP-76 locus has been localized to human chromosome 5q33 and the gene structure has been partially characterized in mice. The human and murine cDNAs both encode 533 amino acid proteins that are 72% identical and comprised of three modular domains. The NH2-terminus contains an acidic region that includes a PEST domain and several tyrosine residues which are phosphorylated following TCR ligation. SLP-76 also contains a central proline-rich domain and a COOH-terminal SH2 domain. A number of additional proteins have been identified that associate with SLP-76 both constitutively and inducibly following receptor ligation, supporting the notion that SLP-76 functions as an adaptor or scaffold protein. Studies using SLP-76 deficient T cell lines or mice have provided strong evidence that SLP-76 plays a positive role in promoting T cell development and activation as well as mast cell and platelet function [PROW].
8052	
8053	The protein encoded by this gene belongs to the aconitase/IPM isomerase family. It is an enzyme that catalyzes the interconversion of citrate to isocitrate via cis-aconitate in the second step of the TCA cycle. This protein is encoded in the nucleus and functions in the mitochondrion. It was found to be one of the mitochondrial matrix proteins that are preferentially degraded by the serine protease 15(PRSS15), also known as Lon protease, after oxidative modification.
8054	This gene encodes a DNA topoisomerase, an enzyme that controls and alters the topologic states of DNA during transcription. This enzyme catalyzes the transient breaking and rejoining of a single strand of DNA which allows the strands to pass through one another, thus altering the topology of DNA. This gene is localized to chromosome 20 and has pseudogenes which reside on chromosomes 1 and 22.
8055	This gene is considered a member of a gene family predicted to encode transmembrane proteins. The specific function of this gene is unknown; however, expression in the inner ear suggests that it may be crucial for normal auditory function. Mutations in this gene may underlie hereditary disorders of balance and hearing.
8056	Calpains are ubiquitous, well-conserved family of calcium-dependent, cysteine proteases. The calpain proteins are heterodimers consisting of an invariant small subunit and variable large subunits. The large catalytic subunit has four domains: domain I, the N-terminal regulatory domain that is processed upon calpain activation; domain II, the protease domain; domian III, a linker domain of unknown function; and domain IV, the calmodulin-like calcium-binding domain. This gene encodes a large subunit. It is an atypical calpain in that it lacks the calmodulin-like calcium-binding domain and instead has a divergent C-terminal domain. It is similar in organization to calpains 5 and 6. This gene is associated with type 2 or non-insulin-dependent diabetes mellitus (NIDDM) and located within the NIDDM1 region. Multiple alternative transcript variants encoding different isoforms have been described for this gene.
8057	
8058	
8059	Corticotropin-releasing hormone is a potent stimulator of synthesis and secretion of preopiomelanocortin-derived peptides. Although CRH concentrations in the human peripheral circulation are normally low, they increase throughout pregnancy and fall rapidly after parturition. Maternal plasma CRH probably originates from the placenta. Human plasma contains a CRH-binding protein which inactivates CRH and which may prevent inappropriate pituitary-adrenal stimulation in pregnancy.
8060	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme functions in the ubiquitination of the tumor-suppressor protein p53, which is induced by an E3 ubiquitin-protein ligase. Two alternatively spliced transcript variants have been found for this gene and they encode distinct isoforms.
8061	This receptor binds insulin-like growth factor with a high affinity. It has tyrosine kinase activity. The insulin-like growth factor I receptor plays a critical role in transformation events. Cleavage of the precursor generates alpha and beta subunits. It is highly overexpressed in most malignant tissues where it functions as an anti-apoptotic agent by enhancing cell survival.
8062	
8063	
8064	
8065	This gene encodes a protein containing an alpha/beta hydrolase fold, which is a catalytic domain found in a very wide range of enzymes. The function of this protein has not been determined.
8066	
8067	
8068	
8069	
8070	
8071	
8072	
8073	
8074	
8075	The protein encoded by this gene is found as a pentamer and is a major substrate for the cAMP-dependent protein kinase in cardiac muscle. The encoded protein is an inhibitor of cardiac muscle sarcoplasmic reticulum Ca(2+)-ATPase in the unphosphorylated state, but inhibition is relieved upon phosphorylation of the protein. The subsequent activation of the Ca(2+) pump leads to enhanced muscle relaxation rates, thereby contributing to the inotropic response elicited in heart by beta-agonists. The encoded protein is a key regulator of cardiac diastolic function. Mutations in this gene are a cause of inherited human dilated cardiomyopathy with refractory congestive heart failure.
8076	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This phosphoprotein localizes to nuclear bodies where it functions as a transcription factor and tumor suppressor. Its expression is cell-cycle related and it regulates the p53 response to oncogenic signals. The gene is often involved in the translocation with the retinoic acid receptor alpha gene associated with acute promyelocytic leukemia (APL). Extensive alternative splicing of this gene results in several variations of the protein@#$%&s central and C-terminal regions; all variants encode the same N-terminus. Fourteen transcript variants for this gene have been described; however, the full length nature of three of the variants has not been determined.
8077	
8078	
8079	
8080	
8081	
8082	
8083	
8084	
8085	
8086	
8087	
8088	The correlation of anti-sperm antibodies with cases of unexplained infertility implicates a role for these antibodies in blocking fertilization. Improved diagnosis and treatment of immunologic infertility, as well as identification of proteins for targeted contraception, are dependent on the identification and characterization of relevant sperm antigens. The protein expressed by this gene is recognized by anti-sperm antibodies from infertile males. Furthermore, antibodies generated against the recombinant protein block in vitro fertilization. This protein localizes to the acrosomal membrane of spermatids and mature spermatozoa where it is thought to play a role in acrosomal morphogenesis and in sperm-egg binding and fusion, respectively.
8089	
8090	This gene, a Kirsten ras oncogene homolog from the mammalian ras gene family, encodes a protein that is a member of the small GTPase superfamily. A single amino acid substitution is responsible for an activating mutation. The transforming protein that results is implicated in various malignancies, including lung adenocarcinoma, mucinous adenoma, ductal carcinoma of the pancreas and colorectal carcinoma. Alternative splicing leads to variants encoding two isoforms that differ in the C-terminal region.
8091	
8092	Membrane mucins, such as MUC17, function in epithelial cells to provide cytoprotection, maintain luminal structure, provide signal transduction, and confer antiadhesive properties upon cancer cells that lose their apical/basal polarization.[supplied by OMIM]
8093	
8094	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptor has been shown to activate NF-kappaB and MAPK8/JNK, and induce cell apoptosis. Through its death domain, this receptor interacts with TRADD protein, which is known to serve as an adaptor that mediates signal transduction of TNF-receptors. Knockout studies in mice suggested that this gene plays a role in T-helper cell activation, and may be involved in inflammation and immune regulation.
8095	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Alpha crystallins are composed of two gene products: alpha-A and alpha-B, for acidic and basic, respectively. Alpha crystallins can be induced by heat shock and are members of the small heat shock protein (sHSP also known as the HSP20) family. They act as molecular chaperones although they do not renature proteins and release them in the fashion of a true chaperone; instead they hold them in large soluble aggregates. Post-translational modifications decrease the ability to chaperone. These heterogeneous aggregates consist of 30-40 subunits; the alpha-A and alpha-B subunits have a 3:1 ratio, respectively. Two additional functions of alpha crystallins are an autokinase activity and participation in the intracellular architecture. Alpha-A and alpha-B gene products are differentially expressed; alpha-A is preferentially restricted to the lens and alpha-B is expressed widely in many tissues and organs. Defects in this gene cause autosomal dominant congenital cataract (ADCC).
8096	CITED4 belongs to a family of transcriptional coactivators that bind several proteins, including CREB-binding protein (MIM 600140) and p300 (MIM 602700).[supplied by OMIM]
8097	
8098	This gene encodes a member of the ZIC family of C2H2-type zinc finger proteins. Members of this family are important during development, and have been associated with X-linked visceral heterotaxy and holoprosencephaly type 5. This gene is closely linked to the gene encoding zinc finger protein of the cerebellum 1, a related family member on chromosome 3. This gene encodes a protein of unknown function.
8099	This gene encodes a member of the steroid-thyroid hormone-retinoid receptor superfamily. The encoded protein may act as a transcriptional activator. The protein can efficiently bind the NGFI-B Response Element (NBRE). Three different versions of extraskeletal myxoid chondrosarcomas (EMCs) are the result of reciprocal translocations between this gene and other genes. The translocation breakpoints are associated with Nuclear Receptor Subfamily 4, Group A, Member 3 (on chromosome 9) and either Ewing Sarcome Breakpoint Region 1 (on chromosome 22), RNA Polymerase II, TATA Box-Binding Protein-Associated Factor, 68-KD (on chromosome 17), or Transcription factor 12 (on chromosome 15). Four transcript variants encoding three distinct isoforms have been identified for this gene.
8100	
8101	
8102	
8103	
8104	
8105	
8106	
8107	
8108	
8109	
8110	
8111	
8112	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
8113	Aggrecan 1 is a member of the aggrecan/versican proteoglycan family.  As an integral part of the extracellular matrix in cartilagenous tissue, the function of aggrecan 1 is to withstand compression in cartilage.  Mutations in this gene may be involved in skeletal dysplasia and spinal degeneration.  Alternative splicing of two domains has been observed in the aggrecan 1 gene; however, it is unknown whether other transcripts exist or what effects they have on gene function.
8114	This gene encodes a protein that contains four domains present in other developmental proteins: a PWWP domain, an HMG box, a SET domain, and a PHD-type zinc finger. It is expressed ubiquitously in early development. Wolf-Hirschhorn syndrome (WHS) is a malformation syndrome associated with a hemizygous deletion of the distal short arm of chromosome 4. This gene maps to the 165 kb WHS critical region and has also been involved in the chromosomal translocation t(4;14)(p16.3;q32.3) in multiple myelomas. This gene contains 25 exons. Alternative splicing of this gene generates nine transcript variants, which encode four different isoforms.
8115	HLA-DOB belongs to the HLA class II beta chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DOA) and a beta chain (DOB), both anchored in the membrane. It is located in intracellular vesicles. DO suppresses peptide loading of MHC class II molecules by inhibiting HLA-DM. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa and its gene contains 6 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail.
8116	The protein encoded by this gene is an integral membrane protein involved in gamma-aminobutyric acid (GABA) and glycine uptake into synaptic vesicles. The encoded protein is a member of amino acid/polyamine transporter family II.
8117	
8118	
8119	
8120	
8121	
8122	
8123	
8124	
8125	In immature T cells the T-cell receptor beta-chain gene (TCRB; MIM 186930) is rearranged and expressed before the TCRA (MIM 186880) chain. At this early stage, TCRB can associate with the pre-T-cell receptor alpha chain (PTCRA). The PTCRA, together with TCRB and the CD3 complex (see MIM 186740), minimally make up the pre-T cell receptor (pre-TCR), which regulates early T cell development.[supplied by OMIM]
8126	Huntington@#$%&s disease (HD), a neurodegenerative disorder characterized by loss of striatal neurons, is caused by an expansion of a polyglutamine tract in the HD protein huntingtin. This gene encodes a protein that interacts with the huntingtin-associated protein 1, which is a huntingtin binding protein that may function in vesicle trafficking. An alternatively spliced variant has been described but its full-length sequence has not been determined.
8127	
8128	
8129	
8130	
8131	The protein encoded by this gene is a member of the F-actin capping protein alpha subunit family. It is the alpha subunit of the barbed-end actin binding protein Cap Z. By capping the barbed end of actin filaments, Cap Z regulates the growth of the actin filaments at the barbed end.
8132	This gene is a member of the cadherin superfamily, genes encoding calcium dependent cell-cell adhesion glycoproteins. The protein encoded by this gene is a large, single-pass transmembrane protein composed of an extracellular domain containing 27 repeats that show significant homology to the cadherin ectodomain. Expressed in the neurosensory epithelium, the protein is thought to be involved in stereocilia organization and hair bundle formation. The gene is located in a region containing the human deafness loci DFNB12 and USH1D. Usher syndrome 1D and nonsyndromic autosomal recessive deafness DFNB12 are caused by allelic mutations of this novel cadherin-like gene. Two alternative splice variants have been identified, encoding different isoforms. Additional variants have been observed but their full-length nature has not been determined.
8133	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. This structure consists of approximately 146 bp of DNA wrapped around a nucleosome, an octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H3 family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6p22-p21.3.
8134	
8135	Signal-mediated nuclear import and export proceed through the nuclear pore complex (NPC), which is comprised of approximately 50 unique proteins collectively known as nucleoporins. The 98 kD nucleoporin is generated through a biogenesis pathway that involves synthesis and proteolytic cleavage of a 186 kD precursor protein. This cleavage results in the 98 kD nucleoporin as well as a 96 kD nucleoporin, both of which are localized to the nucleoplasmic side of the NPC. Rat studies show that the 98 kD nucleoporin functions as one of several docking site nucleoporins of transport substrates. The human gene has been shown to fuse to several genes following chromsome translocatons in acute myelogenous leukemia (AML) and T-cell acute lymphocytic leukemia (T-ALL). This gene is one of several genes located in the imprinted gene domain of 11p15.5, an important tumor-suppressor gene region. Alterations in this region have been associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocortical carcinoma, and lung, ovarian, and breast cancer. Alternative splicing of this gene results in several transcript variants; however, not all variants have been fully described.
8136	
8137	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H4 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3.
8138	
8139	
8140	This gene encodes the phosphatase 2A catalytic subunit. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. This gene encodes a beta isoform of the catalytic subunit. Two transcript variants encoding the same protein have been identified for this gene.
8141	
8142	
8143	
8144	
8145	
8146	
8147	HLA-E belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-E binds a restricted subset of peptides derived from the leader peptides of other class I molecules. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail.
8148	
8149	Members of the ezrin (VIL2; MIM 123900)-radixin (RDX; MIM 179410)-moesin (MSN; MIM 309845) (ERM) protein family are highly concentrated in the apical aspect of polarized epithelial cells. These cells are studded with microvilli containing bundles of actin filaments, which must attach to the membrane to assemble and maintain the microvilli. The ERM proteins, together with merlin, the NF2 (MIM 607379) gene product, are thought to be linkers between integral membrane and cytoskeletal proteins, and they bind directly to actin in vitro. Actin cytoskeleton reorganization requires the activation of a sodium/hydrogen exchanger (SLC9A3; MIM 182307). SLC9A3R1 is an ERM-binding protein.[supplied by OMIM]
8150	Many vertebrate homeo box-containing genes have been identified on the basis of their sequence similarity with Drosophila developmental genes. Members of the Dlx gene family contain a homeobox that is related to that of Distal-less (Dll), a gene expressed in the head and limbs of the developing fruit fly.  The Distal-less (Dlx) family of genes comprises at least 6 different members, DLX1-DLX6. Trichodentoosseous syndrome (TDO), an autosomal dominant condition, has been correlated with DLX3 gene mutation.
8151	This gene is highly similar to the v-src gene of Rous sarcoma virus. This proto-oncogene may play a role in the regulation of embryonic development and cell growth. The protein encoded by this gene is a tyrosine-protein kinase whose activity can be inhibited by phosphorylation by c-SRC kinase. Mutations in this gene could be involved in the malignant progression of colon cancer. Two transcript variants encoding the same protein have been found for this gene.
8152	
8153	
8154	
8155	
8156	This gene was originally isolated through subtractive hybridization due to its increased expression in differentiated chondrocytes versus dedifferentiated chondrocytes. The resulting protein contains a predicted ezrin-like domain, a Dbl homology domain, and a pleckstrin homology domain. It is believed to be a member of the band 4.1 superfamily whose members link the cytoskeleton to the cell membrane. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
8157	Sahai (1983) demonstrated phosphate-activated glutaminase (EC 3.5.1.2) in human platelets. It is the major enzyme yielding glutamate from glutamine. Significance of the enzyme derives from its possible implication in behavior disturbances in which glutamate acts as a neurotransmitter (Prusiner, 1981). High heritability of platelet glutaminase was indicated by studies of Sahai and Vogel (1983) who found an intraclass correlation coefficient of 0.96 for monozygotic twins and 0.53 for dizygotic twins.[supplied by OMIM]
8158	
8159	Chloride channels are a diverse group of proteins that regulate fundamental cellular processes including stabilization of cell membrane potential, transepithelial transport, maintenance of intracellular pH, and regulation of cell volume. Chloride intracellular channel 1  is a member of the p64 family; the protein localizes principally to the cell nucleus and exhibits both nuclear and plasma membrane chloride ion channel activity.
8160	The activation of mitogen-activated protein kinase (MAPK) cascades transduces various extracellular signals to the nucleus to induce gene expression, cell proliferation, differentiation, cell cycle arrest, and apoptosis. For full activation of MAPKs, dual-specificity kinases phosphorylate both threonine and tyrosine residues in MAPK TXY motifs. MKPs are dual-specificity phosphatases that dephosphorylate the TXY motif, thereby negatively regulating MAPK activity.[supplied by OMIM]
8161	
8162	
8163	HLA-DQA1 belongs to the HLA class II alpha chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DQA) and a beta chain (DQB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B Lymphocytes, dendritic cells, macrophages). The alpha chain is approximately 33-35 kDa. It is encoded by 5 exons; exon 1 encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, and exon 4 encodes the transmembrane domain and the cytoplasmic tail. Within the DQ molecule both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to four different molecules. Typing for these polymorphisms is routinely done for bone marrow transplantation.
8164	
8165	
8166	
8167	
8168	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. Alternate splicing of this gene results in many isoforms that localize to the centrosome and the Golgi apparatus, and interact with numerous signaling proteins from multiple signal transduction pathways. These signaling proteins include type II protein kinase A, serine/threonine kinase protein kinase N, protein phosphatase 1, protein phosphatase 2a, protein kinase C-epsilon and phosphodiesterase 4D3.
8169	HLA-DOA belongs to the HLA class II alpha chain paralogues. HLA-DOA forms a heterodimer with HLA-DOB. The heterodimer, HLA-DO, is found in lysosomes in B cells and regulates HLA-DM-mediated peptide loading on MHC class II molecules. In comparison with classical HLA class II molecules, this gene exhibits very little sequence variation, especially at the protein level.
8170	This gene encodes a member of the steroid-thyroid hormone-retinoid receptor superfamily. Expression is induced by phytohemagglutinin in human lymphocytes and by serum stimulation of arrested fibroblasts. The encoded protein acts as a nuclear transcription factor. Translocation of the protein from the nucleus to mitochondria induces apoptosis. Three transcript variants encoding two distinct isoforms have been identified for this gene.
8171	The protein encoded by this gene is an insulin like hormone produced mainly in gonadal tissues in males and females. Studies of the mouse counterpart suggest that this gene may be involved in the development of urogenital tract and female fertility. It may also act as a hormone to regulate growth and differentiation of gubernaculum, and thus mediating intra-abdominal testicular descent. The mutations in this gene may lead to, but not a frequent cause of, cryptorchidism.
8172	
8173	
8174	
8175	
8176	This gene is one member of a family of sulfate/anion transporter genes. Family members are well conserved in their genomic (number and size of exons) and protein (aa length among species) structures yet have markedly different tissue expression patterns. This gene is expressed primarily in spermatocytes. Two transcript variants encoding different isoforms have been found.
8177	
8178	
8179	
8180	The protein encoded by this gene is involved in the import of precursor proteins into mitochondria. The encoded protein has a chaperone-like activity, binding the mature portion of unfolded proteins and aiding their import into mitochondria. This protein, which is found in the cytoplasm and sometimes associated with the outer mitochondrial membrane, has a weak ATPase activity and contains 6 TPR repeats.
8181	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. Members of the PTP family share a highly conserved catalytic motif, which is essential for the catalytic activity. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. Epidermal growth factor receptor and the adaptor protein Shc were reported to be substrates of this PTP, which suggested the roles in growth factor mediated cell signaling. Three alternatively spliced variants of this gene, which encode isoforms differing at their extreme C-termini, have been described. The different C-termini are thought to determine the substrate specificity, as well as the cellular localization of the isoforms. Two highly related but distinctly processed pseudogenes that localize to distinct chromosomes have been reported.
8182	
8183	Gamma-aminobutyric acid (GABA) is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA-A receptors, which are ligand-gated chloride channels. Chloride conductance of these channels can be modulated by agents such as benzodiazepines that bind to the GABA-A receptor. The GABA-A receptor is generally pentameric and there are five types of subunits: alpha, beta, gamma, delta, and rho. This gene encodes the delta subunit.
8184	The estrogen receptor (ESR) is a ligand-activated transcription factor composed of several domains important for hormone binding, DNA binding, and activation of transcription. Alternative splicing results in several ESR1 mRNA transcripts, which differ primarily in their 5-prime untranslated regions. The translated receptors show less variability.[supplied by OMIM]
8185	L-glutamate is the major excitatory neurotransmitter in the central nervous system and activates both ionotropic and metabotropic glutamate receptors.  Glutamatergic neurotransmission is involved in most aspects of normal brain function and can be perturbed in many neuropathologic conditions. The metabotropic glutamate receptors are a family of G protein-coupled receptors, that have been divided into 3 groups on the basis of sequence homology, putative signal transduction mechanisms, and pharmacologic properties.  Group I includes GRM1 and GRM5 and these receptors have been shown to activate phospholipase C.  Group II includes GRM2 and GRM3 while Group III includes GRM4, GRM6, GRM7 and GRM8.   Group II and III receptors are linked to the inhibition of the cyclic AMP cascade but differ in their agonist selectivities. Alternative splice variants of the GRM1 gene have been described but their full-length nature has not been determined.
8186	This gene encodes a protein containing a polynucleotide kinase domain (PNK) near the N-terminal region, and a Small MutS Related (Smr) domain near the C-terminal region. The encoded protein can bind to both B-cell leukemia/lymphoma 3 (BCL-3) and neural precursor cell expressed, developmentally downregulated 4, (Nedd4) proteins. This protein binds and hydrolyzes ATP, may function as a 5@#$%&-polynucleotide kinase, and has the capacity to be a ubiquitylation substrate. This protein may play a role in transcription-coupled DNA repair or genetic recombination.
8187	
8188	
8189	
8190	
8191	NCB5OR is a flavohemoprotein that contains functional domains found in both cytochrome b5 (CYB5; MIM 250790) and CYB5 reductase (DIA1; MIM 250800).[supplied by OMIM]
8192	
8193	
8194	
8195	
8196	
8197	
8198	HLA-DRB1 belongs to the HLA class II beta chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DRA) and a beta chain (DRB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa. It is encoded by 6 exons, exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DR molecule the beta chain contains all the polymorphisms specifying the peptide binding specificities. Hundreds of DRB1 alleles have been described and typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. DRB1 is expressed at a level five times higher than its paralogues DRB3, DRB4 and DRB5. DRB1 is present in all individuals. Allelic variants of DRB1 are linked with either none or one of the genes DRB3, DRB4 and DRB5. There are 4 related pseudogenes: DRB2, DRB6, DRB7, DRB8 and DRB9.
8199	
8200	
8201	
8202	
8203	
8204	
8205	This gene encodes a trypsinogen, which is a member of the trypsin family of serine proteases. This enzyme is secreted by the pancreas and cleaved to its active form in the small intestine. It is active on peptide linkages involving the carboxyl group of lysine or arginine. Mutations in this gene are associated with hereditary pancreatitis. This gene and several other trypsinogen genes are localized to the T cell receptor beta locus on chromosome 7.
8206	This gene encodes a nucleolar protein with a LIS1 homology domain. The protein is involved in ribosomal DNA gene transcription through its interaction with upstream binding factor (UBF). Mutations in this gene have been associated with Treacher Collins syndrome, a disorder which includes abnormal craniofacial development. Alternate transcriptional splice variants encoding different isoforms have been found for this gene, but only three of them have been characterized to date.
8207	The RAN GTPase plays an essential role in the transport of macromolecules between the cytoplasm and the nucleus. The nucleotide-bound state of RAN is regulated by RANGAP1, a RAN-specific GTPase activating protein, and by RCC1, a specific guanine nucleotide exchange factor. RANBP3 (RAN binding protein 3) contains FXFG motifs characteristic of a subgroup of nucleoporins, and a C-terminal domain showing similarity to RANBP1. RANBP3 exhibits several alternatively spliced transcript variants containing differences in the N-terminus, two internal exons, and the 3@#$%&UTR.
8208	Capsaicin, the main pungent ingredient in hot chili peppers, elicits a sensation of burning pain by selectively activating sensory neurons that convey information about noxious stimuli to the central nervous system. The protein encoded by this gene is a receptor for capsaicin and is a non-selective cation channel that is structurally related to members of the TRP family of ion channels. This receptor is also activated by increases in temperature in the noxious range, suggesting that it functions as a transducer of painful thermal stimuli in vivo. Four transcript variants encoding the same protein, but with different 5@#$%& UTR sequence, have been described for this gene.
8209	The ERM protein family members ezrin, radixin, and moesin are cytoskeletal effector proteins linking actin to membrane-bound proteins at the cell surface. Myosin regulatory light chain interacting protein (MIR) is a novel ERM-like protein that interacts with myosin regulatory light chain and inhibits neurite outgrowth.
8210	
8211	
8212	
8213	
8214	
8215	
8216	
8217	Bidirectional transport of macromolecules between the cytoplasm and nucleus occurs through nuclear pore complexes (NPCs) embedded in the nuclear envelope. NPCs are composed of subcomplexes, and NUP43 is part of one such subcomplex, Nup107-160 (Loiodice et al., 2004).[supplied by OMIM]
8218	
8219	
8220	This gene was identified by its similarity to the ST7 tumor suppressor gene found in the chromosome 7q31 region. This gene is clustered in a tail-to-tail manner with the WNT2B gene in a chromosomal region known to be deleted and rearranged in a variety of cancers. The function of this gene has not yet been determined. Several transcript variants encoding many different isoforms have been described, but some have not been fully characterized.
8221	
8222	
8223	
8224	
8225	This gene is located in a region close to the locus of the pseudogene of chemokine (C-C motif) receptor-like 1 on chromosome 6. The specific function of this gene has not yet been determined.
8226	Bleomycin hydrolase (BMH) is a cytoplasmic cysteine peptidase that is highly conserved through evolution; however, the only known activity of the enzyme is metabolic inactivation of the glycopeptide bleomycin (BLM), an essential component of combination chemotherapy regimens for cancer. The protein contains the signature active site residues of the cysteine protease papain superfamily.
8227	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. The gene encodes a protein with an extracellular domain containing 7 cadherin repeats. The gene product is an integral membrane protein that is thought to function in cell-cell recognition and adhesion. Alternative splicing yields isoforms with unique cytoplasmic tails.
8228	The cannabinoid delta-9-tetrahydrocannabinol is the principal psychoactive ingredient of marijuana. The proteins encoded by this gene and the cannabinoid receptor 1 (brain) (CNR1) gene have the characteristics of a guanine nucleotide-binding protein (G-protein)-coupled receptor for cannabinoids. They inhibit adenylate cyclase activity in a dose-dependent, stereoselective, and pertussis toxin-sensitive manner. These proteins have been found to be involved in the cannabinoid-induced CNS effects (including alterations in mood and cognition) experienced by users of marijuana. The cannabinoid receptors are members of family 1 of the G-protein-coupled receptors.
8229	This gene encodes one of the three alpha chains of type VI collagen, a beaded filament collagen found in most connective tissues. The product of this gene contains several domains similar to von Willebrand Factor type A domains. These domains have been shown to bind extracellular matrix proteins, an interaction that explains the importance of this collagen in organizing matrix components. Mutations in this gene are associated with Bethlem myopathy and Ullrich scleroatonic muscular dystrophy. Three transcript variants have been identified for this gene.
8230	This gene encodes a protein that may be involved in the degradation of G proteins via the ubiquitin-dependent proteasome pathway. The encoded protein binds to members of subfamily A of the regulator of the G-protein signaling (RGS) family through an N-terminal leucine-rich region. This protein also has a central RING finger-like domain and E3 ubiquitin ligase activity. This protein is highly conserved from flies to humans. Defects in this gene may cause the autosomal recessive, infantile malignant form of osteopetrosis.
8231	This gene is a member of the protocadherin gamma gene cluster, one of three related clusters tandemly linked on chromosome five. These gene clusters have an immunoglobulin-like organization, suggesting that a novel mechanism may be involved in their regulation and expression. The gamma gene cluster includes 22 genes divided into 3 subfamilies. Subfamily A contains 12 genes, subfamily B contains 7 genes and 2 pseudogenes, and the more distantly related subfamily C contains 3 genes. The tandem array of 22 large, variable region exons are followed by a constant region, containing 3 exons shared by all genes in the cluster. Each variable region exon encodes the extracellular region, which includes 6 cadherin ectodomains and a transmembrane region. The constant region exons encode the common cytoplasmic region. These neural cadherin-like cell adhesion proteins most likely play a critical role in the establishment and function of specific cell-cell connections in the brain. Alternative splicing has been described for the gamma cluster genes.
8232	
8233	
8234	
8235	
8236	
8237	
8238	
8239	The protein encoded by this gene is a member of the thioesterase superfamily.
8240	Serine palmitoyltransferase, which consists of two different subunits, is the key enzyme in sphingolipid biosynthesis. It converts L-serine and palmitoyl-CoA to 3-oxosphinganine with pyridoxal 5@#$%&-phosphate as a cofactor. The product of this gene is the long chain base subunit 1 of serine palmitoyltransferase. Mutations in this gene were identified in patients with hereditary sensory neuropathy type 1. Alternatively spliced variants encoding different isoforms have been identified.
8241	This gene product is a member of the serine-arginine (SR) family of proteins, which is involved in constitutive and regulated RNA splicing. Members of this family are characterized by N-terminal RNP1 and RNP2 motifs, which are required for binding to RNA, and multiple C-terminal SR/RS repeats, which are important in mediating association with other cellular proteins. This protein can influence splice site selection of adenovirus E1A pre-mRNA. It interacts with the oncoprotein TLS, and abrogates the influence of TLS on E1A pre-mRNA splicing. Alternative splicing of this gene results in at least two transcript variants encoding different isoforms. In addition, transcript variants utilizing alternative polyA sites exist.
8242	CLASPs, such as CLASP1, are nonmotor microtubule-associated proteins that interact with CLIPs (e.g., CLIP170; MIM 179838). CLASP1 is involved in the regulation of microtubule dynamics at the kinetochore and throughout the spindle (Maiato et al., 2003).[supplied by OMIM]
8243	This gene encodes the alpha chain of type XXI collagen, a member of the FACIT collagen family (fibril-associated collagens with interrupted helices). Type XXI collagen is localized to tissues containing type I collagen so, like other members of this collagen family, it may serve to maintain the integrity of the extracellular matrix. A smaller transcript has been detected in heart and skeletal muscle; it may represent an as yet uncharacterized transcript variant.
8244	
8245	Acetyl-Coenzyme A acetyltransferase 2 is an enzyme involved in lipid metabolism.  Reported patients with ACAT2 deficiency have shown severe mental retardation and hypotonus.  The ACAT2 gene shows complementary overlapping with the 3-prime region of the TCP1 gene in both mouse and human. These genes are encoded on opposite strands of DNA, as well as in opposite transcriptional orientation.
8246	This gene encodes a member of the Kruppel family of C2H2-type zinc-finger transcription factor proteins. The encoded protein acts as a transcriptional activator. Two transcript variants encoding distinct isoforms have been identified for this gene. Other transcript variants have been described, but their full length sequence has not been determined.
8247	This gene encodes a member of the tumor protein D52 (TPD52) family. The encoded protein contains a coiled-coil domain and may form homo- or hetero-dimer with TPD52 family members. The protein is reported to be involved in cell proliferation and calcium signaling. It also interacts with the mitogen-activated protein kinase kinase kinase 5 (MAP3K5/ASK1) and positively regulates MAP3K5-induced apoptosis. Multiple alternatively spliced transcript variants have been observed, but the full-length nature of some variants has not yet been determined.
8248	
8249	
8250	This gene encodes a trafficking membrane protein which transports proteins among the medial- and trans-Golgi compartments. Due to its chromosomal location and trafficking function, this gene may be involved in familial essential hypertension. Three transcript variants encoding three different isoforms have been found for this gene.
8251	Three different forms of human pancreatic procarboxypeptidase A have been isolated. The A1 and A2 forms are monomeric proteins with different biochemical properties. The A2 form of pancreatic procarboxypeptidase acts on aromatic C-terminal residues
8252	
8253	
8254	
8255	
8256	
8257	
8258	
8259	
8260	Ganglioside GM3 is known to participate in the induction of cell differentiation, modulation of cell proliferation, maintenance of fibroblast morphology, signal transduction, and integrin-mediated cell adhesion. The protein encoded by this gene is a type II membrane protein which catalyzes the formation of GM3 using lactosylceramide as the substrate. The encoded protein is a member of glycosyltransferase family 29 and may be localized to the Golgi apparatus.
8261	
8262	
8263	
8264	
8265	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
8266	
8267	This gene encodes the integrin alpha X chain protein. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. This protein combines with the beta 2 chain (ITGB2) to form a leukocyte-specific integrin referred to as inactivated-C3b (iC3b) receptor 4 (CR4). The alpha X beta 2 complex seems to overlap the properties of the alpha M beta 2 integrin in the adherence of neutrophils and monocytes to stimulated endothelium cells, and in the phagocytosis of complement coated particles.
8268	
8269	This gene encodes a protein containing a large tandem-repeat domain that contains no cysteine residues. Mutations in this gene have been associated with Alstrom syndrome. The encoded protein may play a role in hearing, sight, obesity, and liver function. Alternative splice variants have been described but their full length sequences have not been determined.
8270	This gene encodes a protein containing a large tandem-repeat domain that contains no cysteine residues. Mutations in this gene have been associated with Alstrom syndrome. The encoded protein may play a role in hearing, sight, obesity, and liver function. Alternative splice variants have been described but their full length sequences have not been determined.
8271	
8272	This gene encodes the protein elongin B, which is a subunit of the transcription factor B (SIII) complex. The SIII complex is composed of elongins A/A2, B and C. It activates elongation by RNA polymerase II by suppressing transient pausing of the polymerase at many sites within transcription units. Elongin A functions as the transcriptionally active component of the SIII complex, whereas elongins B and C are regulatory subunits. Elongin A2 is specifically expressed in the testis, and capable of forming a stable complex with elongins B and C. The von Hippel-Lindau tumor suppressor protein binds to elongins B and C, and thereby inhibits transcription elongation. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
8273	
8274	
8275	
8276	
8277	
8278	
8279	
8280	
8281	Sulfation is one of the most extensive modifications for glycan chains in glycoconjugates. GAL3ST4 belongs to a family of sulfotransferases that catalyze the C3 sulfation of galactose in O-linked glycoproteins.[supplied by OMIM]
8282	This gene encodes a secreted protein resembling members of the ADAMTS (a disintegrin and metalloproteinase with thrombospondin motif) family. This protein lacks the propeptide region and the metalloproteinase and disintegrin-like domains, which are typical of the ADAMTS family, but contains other ADAMTS domains, including the thrombospondin type 1 motif. This protein may have important functions in the extracellular matrix. Alternative splicing of this gene results in 3 transcript variants encoding different isoforms.
8283	
8284	
8285	
8286	
8287	
8288	This gene encodes a protein associated with the cytoplasmic surface of synaptic vesicles. A subset of patients with stiff-man syndrome who were also affected by breast cancer are positive for autoantibodies against this protein. Alternate splicing of this gene results in two transcript variants encoding different isoforms. Additional splice variants have been described, but their full length sequences have not been determined.
8289	
8290	
8291	
8292	
8293	
8294	
8295	
8296	
8297	
8298	
8299	
8300	
8301	
8302	At the nuclear envelope, the nuclear lamina and heterochromatin are adjacent to the inner nuclear membrane. The protein encoded by this gene binds DNA and is a component of heterochromatin. This protein also can bind lamin B receptor, an integral membrane protein found in the inner nuclear membrane. The dual binding functions of the encoded protein may explain the association of heterochromatin with the inner nuclear membrane. Two transcript variants encoding the same protein but differing in the 5@#$%& UTR, have been found for this gene.
8303	
8304	Beta actin is one of six different actin isoforms which have been identified.  ACTB is one of the two nonmuscle cytoskeletal actins.  Actins are highly conserved proteins that are involved in cell motility, structure and integrity.  Alpha actins are a major constituent of the contractile apparatus.
8305	
8306	Glycosphingolipids (GSLs) are a group of membrane components that contain lipid and sugar moieties. They are present in essentially all animal cells and are believed to have important roles in various cellular processes.  UDP-glucose ceramide glucosyltransferase catalyzes the first glycosylation step in glycosphingolipid biosynthesis.  The product, glucosylceramide, is the core structure of more than 300 GSLs. UGCG is widely expressed and transciption is upregulated during keratinocyte differentiation.
8307	
8308	
8309	
8310	
8311	
8312	The protein encoded by this gene is a nuclear receptor coactivator that interacts with nuclear hormone receptors to enhance their transcriptional activator functions. The encoded protein has histone acetyltransferase activity and recruits p300/CBP-associated factor and CREB binding protein as part of a multisubunit coactivation complex. This protein is initially found in the cytoplasm but is translocated into the nucleus upon phosphorylation. Two transcript variants encoding different isoforms have been found for this gene. In addition, a polymorphic repeat region is found in the C-terminus of the encoded protein.
8313	The protein encoded by this gene is a nuclear receptor coactivator that interacts with nuclear hormone receptors to enhance their transcriptional activator functions. The encoded protein has histone acetyltransferase activity and recruits p300/CBP-associated factor and CREB binding protein as part of a multisubunit coactivation complex. This protein is initially found in the cytoplasm but is translocated into the nucleus upon phosphorylation. Two transcript variants encoding different isoforms have been found for this gene. In addition, a polymorphic repeat region is found in the C-terminus of the encoded protein.
8314	The protein encoded by this gene is a type II transmembrane serine proteinase that is found attached to the cell surface. The encoded protein may be involved in matrix remodeling processes in the liver.
8315	
8316	
8317	
8318	The protein encoded by this gene is a member of serine/threonine protein kinase family. This kinase preferentially activates the kinases involved in the MAP kinase signaling pathway including MAPK7 and MAP2K4. This kinase has been shown to directly phosphorylate and activate Ikappa B kinases (IKKs), and thus plays a role in NF-kappa B signaling pathway. This kinase has also been found to bind and activate protein kinase C-related kinase 2 (PRKCL2/PRK2), which suggests its involvement in PRKCL2 regulated signaling process.
8319	The protein encoded by this gene is a member of serine/threonine protein kinase family. This kinase preferentially activates the kinases involved in the MAP kinase signaling pathway including MAPK7 and MAP2K4. This kinase has been shown to directly phosphorylate and activate Ikappa B kinases (IKKs), and thus plays a role in NF-kappa B signaling pathway. This kinase has also been found to bind and activate protein kinase C-related kinase 2 (PRKCL2/PRK2), which suggests its involvement in PRKCL2 regulated signaling process.
8320	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. The protein anchors PKA in ciliary axonemes and, in this way, may play a role in regulating ciliary beat frequency. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
8321	Malate dehydrogenase catalyzes the reversible oxidation of malate to oxaloacetate, utilizing the NAD/NADH cofactor system in the citric acid cycle. The protein encoded by this gene is localized to the mitochondria and may play pivotal roles in the malate-aspartate shuttle that operates in the metabolic coordination between cytosol and mitochondria.
8322	
8323	
8324	
8325	This gene encodes an enzyme that catalyzes the second step of glycosylphosphatidylinositol (GPI) biosynthesis, which is the de-N-acetylation of N-acetylglucosaminylphosphatidylinositol (GlcNAc-PI). Study of a similar rat enzyme suggests that this protein localizes to the endoplasmic reticulum.
8326	
8327	
8328	
8329	This gene encodes a member of the paraoxonase gene family, which includes three known members located adjacent to each other on the long arm of chromosome 7. The encoded protein is ubiquitously expressed in human tissues, membrane-bound, and may act as a cellular antioxidant, protecting cells from oxidative stress. Hydrolytic activity against acylhomoserine lactones, important bacterial quorum-sensing mediators, suggests the encoded protein may also play a role in defense responses to pathogenic bacteria. Mutations in this gene may be associated with vascular disease and a number of quantitative phenotypes related to diabetes. Alternatively spliced transcript variants encoding different isoforms have been described.
8330	PAX2 encodes paired box gene 2, one of many human homologues of the Drosophila melanogaster gene prd. The central feature of this transcription factor gene family is the conserved DNA-binding paired box domain. PAX2 is believed to be a target of transcriptional supression by the tumor supressor gene WT1. Mutations within PAX2 have been shown to result in optic nerve colobomas and renal hypoplasia. Alternative splicing of this gene results in multiple transcript variants.
8331	
8332	
8333	
8334	
8335	
8336	
8337	The protein encoded by this gene, a member of the carnitine/choline acetyltransferase family, is the rate-controlling enzyme of the long-chain fatty acid beta-oxidation pathway in muscle mitochondria. This enzyme is required for the net transport of long-chain fatty acyl-CoAs from the cytoplasm into the mitochondria. At least four transcript variants encoding two different isoforms have been found for this gene, and one of the transcripts is bicistronic. In addition, two of the transcripts utilize alternative polyadenylation signals.
8338	The origin recognition complex (ORC) is a highly conserved six subunit protein complex essential for the initiation of the DNA replication in eukaryotic cells. Studies in yeast demonstrated that ORC binds specifically to origins of replication and serves as a platform for the assembly of additional initiation factors such as Cdc6 and Mcm proteins. The protein encoded by this gene is a subunit of the ORC complex. It has been shown to form a core complex with ORC2L, -3L, and 4L. Alternatively spliced transcript variants encoding distinct isoforms have been described.
8339	
8340	
8341	
8342	
8343	Hook proteins are cytosolic coiled-coil proteins that contain conserved N-terminal domains, which attach to microtubules, and more divergent C-terminal domains, which mediate binding to organelles. The Drosophila Hook protein is a component of the endocytic compartment.[supplied by OMIM]
8344	
8345	
8346	
8347	
8348	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids.
8349	
8350	
8351	Caveolae are small domains on the inner cell membrane involved in vesicular trafficking and signal transduction.  FLOT1 encodes a caveolae-associated, integral membrane protein.  The function of flotillin 1 has not been determined.
8352	This gene is predominantly expressed in prostate tissue, and is found to be upregulated in multiple cancer cell lines. The gene product is predicted to be a six-transmembrane protein, and was shown to be a cell surface antigen significantly expressed at cell-cell junctions.
8353	This gene encodes a member of the Ser/Thr protein kinase family. This kinase functions as a mitogen-activated protein kinase (MAP kinase)- activated protein kinase. MAP kinases are also known as extracellular signal-regulated kinases (ERKs), act as an integration point for multiple biochemical signals. This kinase was shown to be activated by growth inducers and stress stimulation of cells. In vitro studies demonstrated that ERK, p38 MAP kinase and Jun N-terminal kinase were all able to phosphorylate and activate this kinase, which suggested the role of this kinase as an integrative element of signaling in both mitogen and stress responses. This kinase was reported to interact with, phosphorylate and repress the activity of E47, which is a basic helix-loop-helix transcription factor known to be involved in the regulation of tissue-specific gene expression and cell differentiation.
8354	The protein encoded by this gene is one subunit of a trimeric complex, forming a highly conserved transcription factor that binds with high specificity to CCAAT motifs in the promoter regions in a variety of genes. This gene product, subunit B, forms a tight dimer with the C subunit, a prerequisite for subunit A association. The resulting trimer binds to DNA with high specificity and affinity. Subunits B and C each contain a histone-like motif. Observation of the histone nature of these subunits is supported by two types of evidence; protein sequence alignments and experiments with mutants.
8355	Serine palmitoyltransferase (SPT; EC 2.3.1.50) is the key enzyme in sphingolipid biosynthesis. It catalyzes the pyridoxal-5-prime-phosphate-dependent condensation of L-serine and palmitoyl-CoA to 3-oxosphinganine.[supplied by OMIM]
8356	
8357	
8358	
8359	
8360	This gene encodes a secreted, homodimeric glycoprotein that is expressed in a wide variety of tissues and may have autocrine or paracrine functions. The gene contains a 5@#$%& UTR rich in CAG trinucleotide repeats. The encoded protein contains 11 conserved cysteine residues and is phosphorylated by protein kinase C exclusively on its serine residues. The protein may play a role in the regulation of renal and intestinal calcium and phosphate transport, cell metabolism, or cellular calcium/phosphate homeostasis. Overexpression of human stanniocalcin 1 in mice produces high serum phosphate levels, dwarfism, and increased metabolic rate. This gene has altered expression in hepatocellular, ovarian, and breast cancers.
8361	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
8362	
8363	
8364	
8365	
8366	
8367	
8368	
8369	
8370	
8371	
8372	The protein encoded by this gene belongs to the G protein-coupled receptor family that activate a phosphatidylinositol-calcium second messenger system. Binding and pharmacological studies demonstrate that this receptor binds neurotensin as well as several other ligands already described for neurotensin NT1 receptor. However, unlike NT1 receptor, this gene recognizes, with high affinity, levocabastine, a histamine H1 receptor antagonist previously shown to compete with neurotensin for low-affinity binding sites in brain. These activities suggest that this receptor may be of physiological importance and that a natural agonist for the receptor may exist.
8373	Alpha actinins belong to the spectrin gene superfamily which represents a diverse group of cytoskeletal proteins, including the alpha and beta spectrins and dystrophins.  Alpha actinin is an actin-binding protein with multiple roles in different cell types.  In nonmuscle cells, the cytoskeletal isoform is found along microfilament bundles and adherens-type junctions, where it is involved in binding actin to the membrane.  In contrast, skeletal, cardiac, and smooth muscle isoforms are localized to the Z-disc and analogous dense bodies, where they help anchor the myofibrillar actin filaments.  This gene encodes a nonmuscle, alpha actinin isoform which is concentrated in the cytoplasm, and thought to be involved in metastatic processes.  Mutations in this gene have been associated with focal and segmental glomerulosclerosis.
8374	
8375	
8376	
8377	
8378	
8379	The protein encoded by this gene is one of the subunits of a heterodimeric cytokine complex. This protein is related to interleukin 12A (IL12A). It interacts with Epstein-Barr virus induced gene 3 (EBI3), a protein similar to interleukin 12B (IL12B), and forms a complex that has been shown to drive rapid expansion of naive but not memory CD4(+) T cells. The complex is also found to synergize strongly with interleukin 12 to trigger interferon gamma (IFNG) production of naive CD4(+) T cells. The biological effects of this cytokine are mediated by the class I cytokine receptor (WSX1/TCRR).
8380	
8381	This gene encodes a member of a family of candidate taste receptors that are members of the G protein-coupled receptor superfamily and that are specifically expressed by taste receptor cells of the tongue and palate epithelia. Each of these apparently intronless taste receptor genes encodes a 7-transmembrane receptor protein, functioning as a bitter taste receptor. This gene is clustered with another 3 candidate taste receptor genes in chromosome 7 and is genetically linked to loci that influence bitter perception.
8382	
8383	
8384	
8385	
8386	
8387	
8388	
8389	
8390	Major histocompatibility (MHC) class II molecules are transmembrane proteins that have a central role in development and control of the immune system. The protein encoded by this gene, along with regulatory factor X-associated ankyrin-containing protein and regulatory factor-5, forms a complex that binds to the X box motif of certain MHC class II gene promoters and activates their transcription. Once bound to the promoter, this complex associates with the non-DNA-binding factor MHC class II transactivator, which controls the cell type specificity and inducibility of MHC class II gene expression. Mutations in this gene have been linked to bare lymphocyte syndrome type II, complementation group D. Transcript variants utilizing different polyA signals have been found for this gene.
8391	
8392	
8393	This gene encodes a coregulator for the alpha and beta estrogen receptors and the orphan nuclear receptor NR1D2. The protein localizes to the nucleus, and is thought to have both coactivator and corepressor functions. Its interaction with nuclear receptors is independent of the AF2 domain on the receptors, which is known to regulate interaction with other coreceptors. Two alternatively spliced transcript variants for this gene have been described. However, the full length nature of one of the variants has not been determined.
8394	
8395	
8396	
8397	
8398	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
8399	ARHJ belongs to the Rho family of small GTP-binding proteins. Rho proteins regulate the dynamic assembly of cytoskeletal components for several physiologic processes, such as cell proliferation and motility and the establishment of cell polarity. They are also involved in pathophysiologic process, such as cell transformation and metastasis.[supplied by OMIM]
8400	
8401	
8402	
8403	
8404	
8405	Acyl-Coenzyme A oxidase 3 also know as pristanoyl -CoA oxidase (ACOX3)is involved in the desaturation of 2-methyl branched fatty acids in peroxisomes.  Unlike the rat homolog, the human gene is expressed in very low amounts in liver such that its mRNA was undetectable by routine Northern-blot analysis or its product by immunoblotting or by enzyme activity measurements.  However the human cDNA encoding a 700 amino acid protein with a peroxisomal targeting C-terminal tripeptide S-K-L was isolated and is thought to be expressed under special conditions such as specific developmental stages or in a tissue specific manner in tissues that have not yet been examined.
8406	Voltage-gated proton (hydrogen) channels play an important role in cellular defense against acidic stress. They are unique among ion channels with respect to their extremely high selectivity, marked temperature dependence, and unitary conductance, which is 3 orders of magnitude lower than that of most other ion channels. NOX1 is a homolog of the catalytic subunit of the superoxide-generating NADPH oxidase of phagocytes, gp91phox. Three splice variants of NOX1 have been identified, NOH-1L, NOH-1S and NOH-1Lv. The NOH-1S currents were reversibly blocked by zinc, a known H+ channel inhibitor. The NOH-1S variant does not contain an electron transport chain and it is thought that H+ conductance is its main physiologic function, whereas NOH-1L may conduct H+ ions as part of its electron transport mechanism.
8407	The protein encoded by this gene is a member of the fibroblast growth factor receptor family, where amino acid sequence is highly conserved between members and throughout evolution. FGFR family members differ from one another in their ligand affinities and tissue distribution. A full-length representative protein consists of an extracellular region, composed of three immunoglobulin-like domains, a single hydrophobic membrane-spanning segment and a cytoplasmic tyrosine kinase domain. The extracellular portion of the protein interacts with fibroblast growth factors, setting in motion a cascade of downstream signals, ultimately influencing mitogenesis and differentiation. This particular family member binds both acidic and basic fibroblast growth factors and is involved in limb induction. Mutations in this gene can lead to Pfeiffer syndrome and Jackson-Weiss syndrome. The genomic organization of this gene is very similar to family members 2-4, encompassing 19 exons that are subject to complex alternative splicing, which allows for structural, tissue expression and ligand affinity variations among the isoforms.
8408	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
8409	
8410	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene encodes a member of the histone H2A family. Several transcripts have been identified for this gene.
8411	
8412	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXB genes located in a cluster on chromosome 17. The exact role of this gene has yet to be determined.
8413	
8414	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L2P family of ribosomal proteins. It is located in the cytoplasm. In rat, the protein associates with the 5.8S rRNA, very likely participates in the binding of aminoacyl-tRNA, and is a constituent of the elongation factor 2-binding site at the ribosomal subunit interface. Alternatively spliced transcript variants encoding the same protein exist. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
8415	Tryptophan hydroxylase (TPH; EC 1.14.16.4) is the rate-limiting enzyme in the synthesis of serotonin (5-hydroxytryptamine, or 5HT). 5HT is causally involved in numerous central nervous activities, and it has several functions in peripheral tissues, including the maintenance of vascular tone and gut motility.[supplied by OMIM]
8416	
8417	
8418	
8419	The ubiquinone-binding protein is a nucleus-encoded component of ubiquinol-cytochrome c oxidoreductase (Complex III) in the mitochondrial respiratory chain and plays an important role in electron transfer as a complex of ubiquinone and QP-C. Complex III consists of 11 subunits, only 1 (cytochrome b; MIM 516020) being encoded by the mitochondrial DNA.[supplied by OMIM]
8420	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein and is similar in sequence to its family member CD53 antigen. It is known to complex with integrins and other transmembrane 4 superfamily proteins.
8421	
8422	
8423	
8424	
8425	
8426	
8427	
8428	This gene is a member of the BCL2/adenovirus E1B 19 kd-interacting protein (BNIP) family.  Though the specific function is unknown, it interacts with the E1B 19 kDa protein which is responsible for the protection of virally-induced cell death, as well as E1B 19 kDa-like sequences of BCL2, also an apoptotic protector.
8429	
8430	
8431	
8432	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. The encoded protein interacts with both the type I and type II regulatory subunits of PKA; therefore, it is a dual-specific AKAP. This protein is highly enriched in mitochondria. It contains RGS (regulator of G protein signalling) domains, in addition to a PKA-RII subunit-binding domain. The mitochondrial localization and the presence of RGS domains may have important implications for the function of this protein in PKA and G protein signal transduction.
8433	
8434	
8435	
8436	The protein encoded by this gene is a member of the E2F family of transcription factors. The E2F family plays a crucial role in the control of cell cycle and action of tumor suppressor proteins and is also a target of the transforming proteins of small DNA tumor viruses. The E2F proteins contain several evolutionally conserved domains found in most members of the family. These domains include a DNA binding domain, a dimerization domain which determines interaction with the differentiation regulated transcription factor proteins (DP), a transactivation domain enriched in acidic amino acids, and a tumor suppressor protein association domain which is embedded within the transactivation domain. This protein is differentially phosphorylated and is expressed in a wide variety of human tissues. It is more homologous to E2F4, a family member, than to other members. Both this protein and E2F4 interact with tumor suppressor proteins p130 and p107, but not with pRB.
8437	
8438	
8439	
8440	
8441	
8442	
8443	The calpains, calcium-activated neutral proteases, are nonlysosomal, intracellular cysteine proteases. The mammalian calpains include ubiquitous, stomach-specific, and muscle-specific proteins. The ubiquitous enzymes consist of heterodimers with distinct large, catalytic subunits associated with a common small, regulatory subunit. This gene encodes the large subunit of the ubiquitous enzyme, calpain 1.
8444	
8445	Corticotropin-releasing hormone (CRH) is a 41-amino acid peptide derived from a 191-amino acid preprohormone. CRH is secreted by the paraventricular nucleus (PVN) of the hypothalamus in response to stress.  Marked reduction in CRH has been observed in association with Alzheimer disease and autosomal recessive hypothalamic corticotropin dificiency has multiple and potentially fatal metabolic consequences including hypoglycemia and hepatitis.  In addition to production in the hypothalamus, CRH is also synthesized in peripheral tissues, such as T lymphocytes and is highly expressed in the placenta.  In the placenta CRH is a marker that determines the length of gestation and the timing of parturition and delivery.   A rapid increase in circulating levels of CRH occurs at the onset of parturition, suggesting that, in addition to its metabolic functions, CRH may act as a trigger for parturition.
8446	
8447	ADP-ribosylation factor-like 7 is a member of the ADP-ribosylation factor family of GTP-binding proteins.  ARL7 is closely similar to ARL4 and ARL6 and each has a nuclear localization signal and an unusually high guanine nucleotide exchange rate.
8448	The protein encoded by this gene is a member of the cAMP-dependent protein kinase (PKA) inhibitor family. This protein was demonstrated to interact with and inhibit the activities of both C alpha and C beta catalytic subunits of the PKA. Alternatively spliced transcript variants encoding the same protein have been reported.
8449	
8450	
8451	The protein encoded by this gene is a transmembrane protein found in the cis-Golgi complex. The encoded protein is part of the multisubunit transport protein particle (TRAPP) complex and may be involved in vesicular transport from the endoplasmic reticulum to the Golgi. Mutations in this gene could be responsible for the Unverricht-Lundborg type of progressive myoclonus epilepsy, or for autoimmune polyglandular disease type 1. Two transcript variants encoding different isoforms have been found for this gene.
8452	
8453	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming domains. The product of this gene is an open channel that can be stimulated by arachidonic acid.
8454	
8455	
8456	
8457	A large protein complex, termed the anaphase-promoting complex (APC), or the cyclosome, promotes metaphase-anaphase transition by ubiquitinating its specific substrates such as mitotic cyclins and anaphase inhibitor, which are subsequently degraded by the 26S proteasome. Biochemical studies have shown that the vertebrate APC contains eight subunits. The composition of the APC is highly conserved in organisms from yeast to humans. The exact function of this gene product is not known.
8458	
8459	
8460	The activation of gene transcription is a multistep process that is triggered by factors that recognize transcriptional enhancer sites in DNA. These factors work with co-activators to direct transcriptional initiation by the RNA polymerase II apparatus. The protein encoded by this gene is a subunit of the CRSP (cofactor required for SP1 activation) complex, which, along with TFIID, is required for efficient activation by SP1. This protein is also a component of other multisubunit complexes e.g. thyroid hormone receptor-(TR-) associated proteins which interact with TR and facilitate TR function on DNA templates in conjunction with initiation factors and cofactors.
8461	
8462	The intronless NAT2 gene encodes N-acetyltransferase 2 (arylamine N-acetyltransferase 2). This enzyme functions to both activate and deactivate arylamine and hydrazine drugs and carcinogens. Polymorphisms in this gene are reponsible for the N-acetylation polymorphism in which human populations segregate into rapid,intermediate, and slow acetylator phenotypes. Polymorphisms in NAT2 are also associated with higher incidences of cancer and drug toxicity.A second arylamine N-acetyltransferase gene (NAT1) is located near NAT2.
8463	
8464	
8465	
8466	Defensins are a family of microbicidal and cytotoxic peptides thought to be involved in host defense. They are abundant in the granules of neutrophils and also found in the epithelia of mucosal surfaces such as those of the intestine, respiratory tract, urinary tract, and vagina. Members of the defensin family are highly similar in protein sequence and distinguished by a conserved cysteine motif. Several alpha defensin genes appear to be clustered on chromosome 8. The protein encoded by this gene, defensin, alpha 1, is found in the microbicidal granules of neutrophils and likely plays a role in phagocyte-mediated host defense. It differs from defensin, alpha 3 by only one amino acid.
8467	
8468	
8469	
8470	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit functions as guanine nucleotide exchange factor. It is reported that this subunit interacts with HIV-1 Tat, and thus it represses the translation of host-cell, but not HIV-1, mRNAs. Several alternatively spliced transcript variants have been found for this gene, however, the full length nature of only two variants has been determined.
8471	
8472	
8473	The CUGBP1 gene encodes a member of the CELF family of RNA processing factors that regulates pre-mRNA splicing.[supplied by OMIM]
8474	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit functions as guanine nucleotide exchange factor. It is reported that this subunit interacts with HIV-1 Tat, and thus it represses the translation of host-cell, but not HIV-1, mRNAs. Several alternatively spliced transcript variants have been found for this gene, however, the full length nature of only two variants has been determined.
8475	
8476	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is one of two V1 domain B subunit isoforms and is the only B isoform highly expressed in osteoclasts. A pseudogene has been found for this gene.
8477	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. The protein encoded by this gene functions as a receptor for ephrin A2, A3 and A5 and plays a role in short-range contact-mediated axonal guidance during development of the mammalian nervous system.
8478	
8479	
8480	
8481	The protein encoded by this gene is a member of the transcription cyclin family. These cyclins may regulate transcription through their association with and activation of cyclin-dependent kinases (CDK) that phosphorylate the C-terminal domain (CTD) of the large subunit of RNA polymerase II. This gene product may play a dual role in regulating CDK and RNA polymerase II activities.
8482	
8483	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
8484	
8485	Proteins of the large Rab GTPase family (see RAB1A; MIM 179508) have regulatory roles in the formation, targeting, and fusion of intracellular transport vesicles. RAB11FIP1 is one of many proteins that interact with and regulate Rab GTPases (Hales et al., 2001).[supplied by OMIM]
8486	Somatostatin acts at many sites to inhibit the release of many hormones and other secretory proteins. The biologic effects of somatostatin are probably mediated by a family of G protein-coupled receptors that are expressed in a tissue-specific manner. SSTR2 is a member of the superfamily of receptors having seven transmembrane segments and is expressed in highest levels in cerebrum and kidney.
8487	
8488	
8489	The protein encoded by this gene is a type II membrane protein that catalyzes the transfer of sialic acid from CMP-sialic acid to galactose-containing substrates. The encoded protein is normally found in the Golgi but can be proteolytically processed to a soluble form. Correct glycosylation of the encoded protein may be critical to its sialyltransferase activity. This protein, which is a member of glycosyltransferase family 29, can use the same acceptor substrates as does sialyltransferase 4B. Two transcript variants encoding the same protein have been found for this gene. Other transcript variants may exist, but have not been fully characterized yet.
8490	
8491	The protein encoded by this gene is a type II membrane protein that catalyzes the transfer of sialic acid from CMP-sialic acid to galactose-containing substrates. The encoded protein is normally found in the Golgi but can be proteolytically processed to a soluble form. Correct glycosylation of the encoded protein may be critical to its sialyltransferase activity. This protein, which is a member of glycosyltransferase family 29, can use the same acceptor substrates as does sialyltransferase 4B. Two transcript variants encoding the same protein have been found for this gene. Other transcript variants may exist, but have not been fully characterized yet.
8492	
8493	
8494	
8495	
8496	This gene encodes a protein involved in endosomal sorting  of cell surface receptors via a multivesicular body/late endosome pathway. The encoded protein is one of the three subunits of the ESCRT-I complex (endosomal complexes required for transport) involved in the sorting of ubiquitinated proteins. The two other subunits of ESCRT-I are vesicular protein sorting 23, also known as tumor susceptibility gene 101 (TSG101), and vesicular protein sorting 37. Two alternative transcripts encoding different isoforms have been described. Additional alternative transcripts may exist but the proteins encoded by these transcripts have not been verified experimentally.
8497	
8498	
8499	Nicotinamide adenine dinucleotide (NAD) is a coenzyme in metabolic redox reactions, a precursor for several cell signaling molecules, and a substrate for protein posttranslational modifications. NAD synthetase (EC 6.3.5.1) catalyzes the final step in the biosynthesis of NAD from nicotinic acid adenine dinucleotide (NaAD).[supplied by OMIM]
8500	The protein encoded by this gene catalyzes the penultimate step of the arginine biosynthetic pathway. There are approximately 10 to 14 copies of this gene including the pseudogenes scattered across the human genome, among which the one located on chromosome 9 appears to be the only functional gene for argininosuccinate synthetase. Mutations in the chromosome 9 copy of ASS cause citrullinemia. Two transcript variants encoding the same protein have been found for this gene.
8501	
8502	
8503	The Golgi apparatus, which participates in glycosylation and transport of proteins and lipids in the secretory pathway, consists of a series of stacked cisternae (flattened membrane sacs). Interactions between the Golgi and microtubules are thought to be important for the reorganization of the Golgi after it fragments during mitosis. This gene encodes a member of the golgin family of proteins, whose members localize to the Golgi. This protein is a coiled-coil membrane protein that has been postulated to play a role in vesicle tethering and docking. Translocations involving this gene and the ret proto-oncogene have been found in tumor tissues; the chimeric sequences have been designated RET-II and PTC5.
8504	
8505	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. The message for this gene is mainly expressed in the cortical distal tubules and collecting ducts of the kidney. The protein is highly sensitive to external pH and this, in combination with its expression pattern, suggests it may play an important role in renal potassium transport.
8506	The cysteinyl leukotrienes LTC4, LTD4, and LTE4 are important mediators of human bronchial asthma. Pharmacologic studies have determined that cysteinyl leukotrienes activate at least 2 receptors, the protein encoded by this gene and CYSLTR1. This encoded receptor is a member of the superfamily of G protein-coupled receptors. It seems to play a major role in endocrine and cardiovascular systems.
8507	
8508	
8509	
8510	
8511	
8512	The piggyBac family of proteins, found in diverse animals, are transposases related to the transposase of the canonical piggyBac transposon from the moth, Trichoplusia ni. This family also includes genes in several genomes, including human, that appear to have been derived from the piggyBac transposons. This gene belongs to the subfamily of piggyBac transposable element derived (PGBD) genes. The PGBD proteins appear to be novel, with no obvious relationship to other transposases, or other known protein families. This gene overlaps with the ERCC6 gene on chromosome 10, and pseudogenes of this locus have been found on chromosomes 4, 5 and 12.
8513	
8514	
8515	
8516	This gene belongs to the sulfotransferase gene family. Sulfotransferases generate sulfated glycosaminoglycan (GAG) moities during chondroitin sulfate biosynthesis. They create considerable structural diversity among chondroitin sulfates by transferring sulfate with remarkable specificity for the underlying oligosaccharide substrate. This gene product mainly transfers sulfate to N-acetylgalactosamine. The regulated expression of each member of this gene family may be an important determinant of sulfated GAGs expression and the associated function of chondroitin sulfates as regulators of many biologic processes. This gene is part of a gene cluster on chromosome Xp11.23.
8517	
8518	
8519	
8520	
8521	This gene is expressed in the kidney cortical epithelial cells and is upregulated by hyperglycemia. The encoded protein shares a high level of similarity to the rat homolog, and contains 3 C2 domains and a diacylglycerol-binding C1 domain. Hyperglycemia increases the levels of diacylglycerol, which has been shown to induce apoptosis in cells transfected with this gene and thus contribute to the renal cell complications of hyperglycemia. Studies in other species also indicate a role for this protein in the priming step of synaptic vesicle exocytosis.
8522	
8523	SEP1 localizes to cytoplasmic foci containing a complex of mRNA-degrading enzymes. In addition to mRNA metabolism, yeast Sep1 has been implicated in a variety of nuclear and cytoplasmic functions, including homologous recombination, meiosis, telomere maintenance, and microtubule assembly.[supplied by OMIM]
8524	
8525	The protein encoded by this gene is a member of a family of actin-related proteins (ARPs) which share significant amino acid sequence identity to conventional actins. Both actins and ARPs have an actin fold, which is an ATP-binding cleft, as a common feature. The ARPs are involved in diverse cellular processes, including vesicular transport, spindle orientation, nuclear migration and chromatin remodeling. This gene (ACTL7A), and related gene, ACTL7B, are intronless, and are located approximately 4 kb apart in a head-to-head orientation within the familial dysautonomia candidate region on 9q31. Based on mutational analysis of the ACTL7A gene in patients with this disorder, it was concluded that it is unlikely to be involved in the pathogenesis of dysautonomia. The ACTL7A gene is expressed in a wide variety of adult tissues, however, its exact function is not known.
8526	
8527	
8528	
8529	The protein encoded by this gene delays the onset of apoptosis and promotes homologous recombination repair of damaged DNA. Mutations in this gene result in Fanconi anemia.
8530	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It encodes a protein which shows 99% and 91% amino acid identity to the mouse and Xenopus Wnt7A proteins, respectively. Among members of the human WNT family, this protein is most similar to WNT7A protein (77.1% total amino acid identity). This gene may play important roles in the development and progression of gastric cancer, esophageal cancer, and pancreatic cancer.
8531	
8532	The protein encoded by this gene belongs to the membrane-associated guanylate kinase (MAGUK) family, a class of proteins that functions as molecular scaffolds for the assembly of multiprotein complexes at specialized regions of the plasma membrane. This protein is also a member of the CARD protein family, which is defined by carrying a characteristic caspase-associated recruitment domain (CARD). This protein shares a similar domain structure with CARD11 protein. The CARD domains of both proteins have been shown to specifically interact with BCL10, a protein known to function as a positive regulator of cell apoptosis and NF-kappaB activation. When expressed in cells, this protein activated NF-kappaB and induced the phosphorylation of BCL10. Two alternatively spliced variants of this gene encoding distinct isoforms have been reported.
8533	
8534	
8535	Defensins are a family of microbicidal and cytotoxic peptides thought to be involved in host defense. They are abundant in the granules of neutrophils and also found in the epithelia of mucosal surfaces such as those of the intestine, respiratory tract, urinary tract, and vagina. Members of the defensin family are highly similar in protein sequence and distinguished by a conserved cysteine motif. Several alpha defensin genes appear to be clustered on chromosome 8. The protein encoded by this gene, defensin, alpha 1, is found in the microbicidal granules of neutrophils and likely plays a role in phagocyte-mediated host defense. It differs from defensin, alpha 3 by only one amino acid.
8536	
8537	
8538	
8539	The protein encoded by this gene contains a RING finger, which is a motif known to be involved in protein-protein interactions. This protein is a membrane-bound ubiquitin ligase. It can regulate cell motility by targeting paxillin ubiquitination and altering the distribution and localization of paxillin in cytoplasm and cell focal adhesions.
8540	The protein encoded by this gene delays the onset of apoptosis and promotes homologous recombination repair of damaged DNA. Mutations in this gene result in Fanconi anemia.
8541	
8542	
8543	
8544	
8545	
8546	The protein encoded by this gene is highly similar to the protein encoded by the Saccharomyces cerevisiae SEC14 gene. The SEC14 protein is a phophatidylinositol transfer protein that is essential for biogenesis of Golgi-derived transport vesicles, and thus is required for the export of yeast secretory proteins from the Golgi complex. The specific function of this protein has not yet been determined.
8547	Nucleotide sugars, which are synthesized in the cytosol or the nucleus, are high-energy donor substrates for glycosyltransferases located in the lumen of the endoplasmic reticulum and Golgi apparatus. Translocation of nucleotide sugars from the cytosol into the lumen compartment is mediated by specific nucleotide sugar transporters, such as SLC35D2 (Suda et al., 2004).[supplied by OMIM]
8548	
8549	Aldo-keto reductases, such as AKR7A3, are involved in the detoxification of aldehydes and ketones.[supplied by OMIM]
8550	Nicotinamide adenine dinucleotide (NAD+) is essential for life in all organisms, both as a coenzyme for oxidoreductases and as a source of ADP-ribosyl groups used in various reactions. Nicotinic acid and nicotinamide, collectively known as niacin, are the vitamin precursors of NAD+. Nicotinamide riboside kinases, such as NRK1, function to synthesize NAD+ through nicotinamide mononucleotide using nicotinamide riboside as the precursor (Bieganowski and Brenner, 2004).[supplied by OMIM]
8551	This gene represents a candidate gene for human lipodystrophy, characterized by loss of body fat, fatty liver, hypertriglyceridemia, and insulin resistance. Mouse studies suggest that this gene functions during normal adipose tissue development and may also play a role in human triglyceride metabolism.
8552	The protein encoded by this gene is a member of the Ser/Thr protein kinase family, and is most closely related to mitogen-activated protein kinases (MAP kinases). MAP kinases also known as extracellular signal-regulated kinases (ERKs), are activated through protein phosphorylation cascades and act as integration points for multiple biochemical signals. This kinase is localized in the nucleus, and has been reported to be activated in fibroblasts upon treatment with serum or phorbol esters.
8553	
8554	
8555	This gene is located in the surfeit gene cluster, which is comprised of very tightly linked housekeeping genes that do not share sequence similarity. The encoded protein is a conserved integral membrane protein containing multiple putative transmembrane regions. In eukaryotic cells, protein transport between the endoplasmic reticulum and Golgi compartments is mediated in part by non-clathrin-coated vesicular coat proteins (COPs). The specific function of this protein has not been determined but its yeast homolog is directly required for packaging glycosylated pro-alpha-factor into COPII vesicles. This gene uses multiple polyadenylation sites, resulting in transcript length variation. The existence of alternatively spliced transcript variants has been suggested, but their validity has not been determined.
8556	The protein encoded by this gene belongs to the @#$%&regulator of G protein signaling@#$%& family. It inhibits signal transduction by increasing the GTPase activity of G protein alpha subunits. It also may play a role in regulating the kinetics of signaling in the phototransduction cascade.
8557	
8558	This gene encodes a protein involved in spindle checkpoint function. The encoded protein contains four WD repeat domains and has sequence similarity with the yeast BUB3 protein. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
8559	Vesicle mediated protein sorting plays an important role in segregation of intracellular molecules into distinct organelles. Genetic studies in yeast have identified more than 40 vacuolar protein sorting (VPS) genes involved in vesicle transport to vacuoles. This gene encodes the human ortholog of yeast Vps41 protein which is also conserved in Drosophila, tomato, and Arabidopsis. Expression studies in yeast and human indicate that this protein may be involved in the formation and fusion of transport vesicles from the Golgi. Several transcript variants encoding different isoforms have been described for this gene, however, the full-length nature of not all is known.
8560	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Gamma-crystallins are a homogeneous group of highly symmetrical, monomeric proteins typically lacking connecting peptides and terminal extensions. They are differentially regulated after early development. Four gamma-crystallin genes (gamma-A through gamma-D) and three pseudogenes (gamma-E, gamma-F, gamma-G) are tandemly organized in a genomic segment as a gene cluster. Whether due to aging or mutations in specific genes, gamma-crystallins have been involved in cataract formation.
8561	Multiprotein complexes are key determinants of Golgi apparatus structure and its capacity for intracellular transport and glycoprotein modification. Several complexes have been identified, including the Golgi transport complex (GTC), the LDLC complex, which is involved in glycosylation reactions, and the SEC34 complex, which is involved in vesicular transport. These 3 complexes are identical and have been termed the conserved oligomeric Golgi (COG) complex, which includes COG4 (Ungar et al., 2002).[supplied by OMIM]
8562	
8563	
8564	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium voltage-gated channel subfamily V. This member is identified as a @#$%&silent subunit@#$%&, and it does not form homomultimers, but forms heteromultimers with several other subfamily members. Through obligatory heteromerization, it exerts a function-altering effect on other potassium channel subunits. This protein is strongly expressed in pancreas and has a weaker expression in several other tissues.
8565	
8566	Defensins are cysteine-rich cationic polypeptides that are important in the host immunologic response to invading microorganisms. The protein encoded by this gene is secreted and is a member of the beta defensin protein family. Beta defensin genes are found in several clusters throughout the genome, with this gene mapping to a cluster at 20p13.
8567	This gene lies on the opposite strand from the SNRPN gene in the Prader-Willi/Angelman syndrome critical region on chromosome 15. This gene may be differentially expressed in neuropsychiatric disorders.
8568	
8569	
8570	
8571	
8572	
8573	
8574	
8575	
8576	
8577	
8578	This gene encodes a mitochondrial NAD-dependent malic enzyme, a homotetrameric protein, that catalyzes the oxidative decarboxylation of malate to pyruvate. It had previously been weakly linked to a syndrome known as Friedreich ataxia that has since been shown to be the result of mutation in a completely different gene.
8579	
8580	
8581	
8582	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this DEAD box protein family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a member of this family. The encoded protein has 97% sequence identity with the mouse HELG protein. Alternative splicing of this gene generates 3 transcript variants.
8583	
8584	
8585	Fructose-1,6-bisphosphate aldolase (EC 4.1.2.13 ) is a tetrameric glycolytic enzyme that catalyzes the reversible conversion of fructose-1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate. Vertebrates have 3 aldolase isozymes which are distinguished by their electrophoretic and catalytic properties. Differences indicate that aldolases A, B, and C are distinct proteins, the products of a family of related @#$%&housekeeping@#$%& genes exhibiting developmentally regulated expression of the different isozymes. The developing embryo produces aldolase A, which is produced in even greater amounts in adult muscle where it can be as much as 5% of total cellular protein. In adult liver, kidney and intestine, aldolase A expression is repressed and aldolase B is produced. In brain and other nervous tissue, aldolase A and C are expressed about equally. There is a high degree of homology between aldolase A and C. Defects in ALDOB cause hereditary fructose intolerance.
8586	
8587	
8588	
8589	
8590	
8591	
8592	
8593	
8594	
8595	This gene encodes a member of a large protein family that contains the LIM domain, a unique cysteine-rich zinc-binding domain. The encoded protein may function as a transcriptional regulator and may be involved in the control of differentiation and development of neural and lymphoid cells. Two alternatively spliced transcript variants encoding distinct isoforms have been described for this gene. Another alternatively spliced variant has been identified, but its biological nature has not been determined.
8596	This gene encodes a member of a large protein family that contains the LIM domain, a unique cysteine-rich zinc-binding domain. The encoded protein may function as a transcriptional regulator and may be involved in the control of differentiation and development of neural and lymphoid cells. Two alternatively spliced transcript variants encoding distinct isoforms have been described for this gene. Another alternatively spliced variant has been identified, but its biological nature has not been determined.
8597	Voltage-gated potassium channels form the largest and most diversified class of ion channels and are present in both excitable and nonexcitable cells. Their main functions are associated with the regulation of the resting membrane potential and the control of the shape and frequency of action potentials. The alpha subunits are of 2 types: those that are functional by themselves and those that are electrically silent but capable of modulating the activity of specific functional alpha subunits. The protein encoded by this gene is not functional by itself but can form heteromultimers with member 1 and with member 2 (and possibly other members) of the Shab-related subfamily of potassium voltage-gated channel proteins. This gene belongs to the S subfamily of the potassium channel family.
8598	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein belongs to a family of cytoplasmic proteins that activate the Ras-like family of Rho proteins by exchanging bound GDP for GTP. It may form a complex with G proteins and stimulate Rho-dependent signals. This protein can induce membrane ruffling. Multiple alternatively spliced transcript variants encoding different isoforms have been described for this gene, but some of their full-length sequences have not been determined.
8599	
8600	
8601	
8602	Oxysterols are byproducts of cholesterol that can have cytotoxic effects on many cell types. The protein encoded by this gene binds oxysterols such as 7-ketocholesterol and may inhibit their cytotoxicity. The encoded protein is membrane-bound. Two transcript variants encoding different isoforms have been found for this gene.
8603	
8604	
8605	DCAL1 is a type II transmembrane, C-type lectin-like protein expressed on dendritic cells (DCs) and B cells. It interacts with subsets of T cells as a costimulatory molecule that enhances interleukin-4 (IL4; MIM 147780) production.[supplied by OMIM]
8606	Carnitine acetyltransferase (CRAT) is a key enzyme in the metabolic pathway in mitochondria, peroxisomes and endoplasmic reticulum. CRAT catalyzes the reversible transfer of acyl groups from an acyl-CoA thioester to carnitine and regulates the ratio of acylCoA/CoA in the subcellular compartments. Different subcellular localizations of the CRAT mRNAs are thought to result from alternative splicing of the CRAT gene suggested by the divergent sequences in the 5@#$%& region of peroxisomal and mitochondrial CRAT cDNAs and the location of an intron where the sequences diverge. The alternatively splicing of this gene results in three distinct isoforms, one of which contains an N-terminial mitochondrial transit peptide, and has been shown to be located in mitochondria.
8607	This gene encodes a member of the ephrin (EPH) family. The ephrins and EPH-related receptors comprise the largest subfamily of receptor protein-tyrosine kinases and have been implicated in mediating developmental events, especially in the nervous system and in erythropoiesis. Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. This gene encodes an EFNA class ephrin. Three transcript variants that encode distinct proteins have been identified.
8608	
8609	
8610	
8611	
8612	
8613	
8614	
8615	
8616	
8617	
8618	
8619	
8620	
8621	
8622	The Fos gene family consists of 4 members: FOS, FOSB, FOSL1, and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation.
8623	
8624	
8625	
8626	
8627	
8628	
8629	This gene belongs to the Bar subclass of the homeobox gene family. The function of this gene has not yet been determined; however, studies in the mouse and chick homolog suggest a role in developing teeth and craniofacial mesenchyme of neural crest origin. The role of these homologs implicates the human gene as a candidate for unmapped disorders involving tooth and jaw development.
8630	
8631	
8632	The protein encoded by this gene belongs to a family of P-type cation-transporting ATPases. The gastric H+, K+-ATPase is a heterodimer consisting of a high molecular weight catalytic alpha subunit and a smaller but heavily glycosylated beta subunit. This enzyme is a proton pump that catalyzes the hydrolysis of ATP coupled with the exchange of H(+) and K(+) ions across the plasma membrane. It is also responsible for gastric acid secretion. This gene encodes the beta subunit of the gastric H+, K+-ATPase.
8633	This gene is a homolog of the Drosophila polyhomeotic gene, which is a member of the Polycomb group of genes. The gene product is a component of a multimeric protein complex that contains EDR2 and the vertebrate Polycomb protein BMH1. The gene product, the EDR2 protein, and the Drosophila polyhomeotic protein share 2 highly conserved domains, named homology domains I and II. These domains are involved in protein-protein interactions and may mediate heterodimerization of the protein encoded by this gene and the EDR2 protein.
8634	
8635	Members of the Kruppel-like zinc finger protein family, such as GLIS2, function as activators and/or repressors of gene transcription.[supplied by OMIM]
8636	
8637	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXD genes located at 2q31-2q37 chromosome regions. Deletions that removed the entire HOXD gene cluster or 5@#$%& end of this cluster have been associated with severe limb and genital abnormalities. The protein encoded by this gene may play a role in the regulation of cell adhesion processes.
8638	
8639	This gene is a candidate gene for alveolar soft part sarcoma (ASPS). It has been found that this gene is fused with transcription factor TFE3 gene in ASPS and also in renal cell carcinomas. Several alternatively spliced transcript variants of this gene have been described, but their full length nature has not been determined.
8640	The protein encoded by this gene is one of the three catalytic subunits of protein phosphatase 1 (PP1). PP1 is a serine/threonine specific protein phosphatase known to be involved in the regulation of a variety of cellular processes, such as cell division, glycogen metabolism, muscle contractility, protein synthesis, and HIV-1 viral transcription. Increased PP1 activity has been observed in the end stage of heart failure. Studies in both human and mice suggest that PP1 is an important regulator of cardiac function. Mouse studies also suggest that PP1 functions as a suppressor of learning and memory. Three alternatively spliced transcript variants encoding different isoforms have been found for this gene.
8641	
8642	
8643	
8644	
8645	
8646	
8647	
8648	
8649	
8650	
8651	
8652	
8653	
8654	The VMD2 gene (MIM 607854) underlying vitelliform macular dystrophy and the 3 VMD2-like genes, VMD2L1 (MIM 607335), VMD2L2 (MIM 607336), and VMD2L3, encode transmembrane (TM) spanning proteins that share a homology region with a high content of aromatic residues including an invariant arginine (R), phenylalanine (F), and proline (P) motif. VMD2 and the 3 VMD2-related genes share a conserved gene structure with almost identical sizes of the 8 RFP-TM domain encoding exons and highly conserved positions of their corresponding exon-intron boundaries. Each of the 4 paralogous genes contains a unique 3-prime end of variable length without significant homology to known proteins or motifs (Stohr et al., 2002).[supplied by OMIM]
8655	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a testis/sperm-specific member of the histone H2B family. Transcripts from this gene contain a palindromic termination element.
8656	
8657	
8658	
8659	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility.
8660	
8661	
8662	
8663	The protein encoded by this gene is a DNA-binding transcription factor and is a member of the NR1 subfamily of nuclear hormone receptors. The specific functions of this protein are not known; however, studies of a similar gene in mice have shown that this gene may be essential for lymphoid organogenesis and may play an important regulatory role in thymopoiesis. In addition, studies in mice suggest that the protein encoded by this gene may inhibit the expression of Fas ligand and IL2. Two transcript variants encoding different isoforms have been found for this gene.
8664	
8665	
8666	
8667	The 2;5 chromosomal translocation is frequently associated with anaplastic large cell lymphomas (ALCLs). The translocation creates a fusion gene consisting of the ALK (anaplastic lymphoma kinase) gene and the nucleophosmin (NPM) gene: the 3@#$%& half of ALK, derived from chromosome 2, is fused to the 5@#$%& portion of NPM from chromosome 5. A recent study shows that the product of the NPM-ALK fusion gene is oncogenic. The deduced amino acid sequences reveal that ALK is a novel receptor protein-tyrosine kinase having a putative transmembrane domain and an extracellular domain. These sequences are absent in the product of the transforming NPM-ALK gene. ALK shows the greatest sequence similarity to LTK (leukocyte tyrosine kinase). ALK plays an important role in the development of the brain and exerts its effects on specific neurons in the nervous system.
8668	
8669	The protein encoded by this gene is a member of the type 3 G protein-coupled receptor family. Members of this superfamily are characterized by a signature 7-transmembrane domain motif. The specific function of this protein is unknown; however, this protein may mediate the cellular effects of retinoic acid on the G protein signal transduction cascade.
8670	The pattern of cellular proliferation and differentiation that leads to normal development of embryonic structures often depends upon the localized production of secreted protein signals. Cells surrounding the source of a particular signal respond in a graded manner according to the effective concentration of the signal, and this response produces the pattern of cell types constituting the mature structure. A segment-polarity gene known as dispatched has been identified in Drosophila and its protein product is required for normal Hedgehog (Hh) signaling. This gene is one of two human homologs of Drosophila dispatched.
8671	
8672	
8673	
8674	
8675	
8676	
8677	
8678	
8679	
8680	
8681	
8682	
8683	This gene encodes a cytoplasmic protein tyrosine kinase which is found concentrated in the focal adhesions that form between cells growing in the presence of extracellular matrix constituents. The encoded protein is a member of the FAK subfamily of protein tyrosine kinases but lacks significant sequence similarity to kinases from other subfamilies. Activation of this gene may be an important early step in cell growth and intracellular signal transduction pathways triggered in response to certain neural peptides or to cell interactions with the extracellular matrix. At least four transcript variants encoding four different isoforms have been found for this gene, but the full-length natures of only two of them have been determined.
8684	
8685	The protein encoded by this gene is a transmembrane protein containing three extracellular CUB domains. This protein is found to be overexpressed in colon and lung cancers. Its expression level is correlated with the metastatic ability of carcinoma cells. This protein is located on the cell surface. It has been shown to be tyrosine phosphorylated in a cancer cell line. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
8686	
8687	
8688	
8689	
8690	
8691	
8692	
8693	This gene encodes a large, transmembrane receptor protein which may function in angiogenesis, lymphocyte homing, cell adhesion, or receptor scavenging. The protein contains 7 fasciclin, 16 epidermal growth factor (EGF)-like, and 2 laminin-type EGF-like domains as well as a C-type lectin-like hyaluronan-binding Link module. The protein is primarily expressed on sinusoidal endothelial cells of liver, spleen, and lymph node. The receptor has been shown to endocytose ligands such as low density lipoprotein, Gram-positive and Gram-negative bacteria, and advanced glycosylation end products. Supporting its possible role as a scavenger receptor, the protein rapidly cycles between the plasma membrane and early endosomes.
8694	
8695	
8696	
8697	
8698	
8699	
8700	
8701	Heterotrimeric G proteins play vital roles in cellular responses to external signals. The specificity of a G protein-receptor interaction is primarily mediated by the gamma subunit.[supplied by OMIM]
8702	
8703	Exportins, such as XPO6, recruit cargo in the nucleoplasm in the presence of RAN (MIM 601179)-GTP and form ternary export complexes. These complexes are transported through nuclear pore complexes to the cytoplasm, where GTP is hydrolyzed and the export complex is disassembled.[supplied by OMIM]
8704	
8705	This gene encodes a protein that localizes to the nuclear matrix. The protein also has RNA binding activity, and has a predicted coiled-coil domain.
8706	
8707	
8708	
8709	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein contains 7 WD repeats. Alternatively spliced transcript variants encoding the same protein have been identified.
8710	
8711	
8712	Protein disulfide isomerases (EC 5.3.4.1), such as PDIP, are endoplasmic reticulum (ER) resident proteins that catalyze protein folding and thiol-disulfide interchange reactions (Desilva et al., 1996).[supplied by OMIM]
8713	This gene encodes a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, namely a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region.
8714	
8715	
8716	
8717	This gene encodes a protein with acyl-CoA thioesterase activity towards medium (C12) and long-chain (C18) fatty acyl-CoA substrates. Expression of a similar murine protein in brown adipose tissue is induced by cold exposure and repressed by warmth. Expression of the mouse protein has been associated with obesity, with higher expression found in obesity-resistant mice compared with obesity-prone mice. Alternative splicing results in two transcript variants encoding different isoforms.
8718	
8719	This gene is similar to the Drosophila melanogaster dachshund gene, which encodes a nuclear factor essential for determining cell fates in the eye, leg, and nervous system of the fly. It is a member of the Ski gene family and is thought to be involved in eye development. Four alternatively spliced transcripts encoding different isoforms have been described for this gene.
8720	Ubiquitin (MIM 191339), a highly conserved protein involved in the regulation of intracellular protein breakdown, cell cycle regulation, and stress response, is released from degraded proteins by disassembly of the polyubiquitin chains. The disassembly process is mediated by ubiquitin-specific proteases (USPs). Also see USP1 (MIM 603478).[supplied by OMIM]
8721	
8722	
8723	
8724	
8725	
8726	In eukaryotic cells, protein transport between the endoplasmic reticulum and Golgi compartments is mediated in part by non-clathrin-coated vesicular coat proteins (COPs).  Seven coat proteins have been identified, and they represent subunits of a complex known as coatomer. The subunits are designated alpha-COP, beta-COP, beta-prime-COP, gamma-COP, delta-COP, epsilon-COP, and zeta-COP.  The alpha-COP, encoded by COPA, shares high sequence similarity with RET1P, the alpha subunit of the coatomer complex in yeast.  Also, the N-terminal 25 amino acids of alpha-COP encode the bioactive peptide, xenin, which stimulates exocrine pancreatic secretion and may act as a gastrointestinal hormone.
8727	
8728	
8729	
8730	Cyclin-dependent kinases (CDKs), which play an essential role in cell cycle control of eukaryotic cells, are phosphorylated and thus activated by the CDK-activating kinase (CAK). CAK is a multisubunit protein that includes CDK7 (MIM 601955), cyclin H (CCNH; MIM 601953), and MAT1. MAT1 (for @#$%&menage a trois-1@#$%&) is involved in the assembly of the CAK complex.[supplied by OMIM]
8731	Sulfated glycoconjugates are found in glycoproteins, proteoglycans, glycolipids, and polysaccharides. GP3ST transfers a sulfate group to the hydroxyl at C3 of nonreducing beta-galactosyl residues.[supplied by OMIM]
8732	
8733	
8734	
8735	
8736	
8737	
8738	
8739	
8740	This gene encodes a ubiquitously expressed nuclear protein which belongs to a highly conserved subfamily of WD-repeat proteins. It is present in protein complexes involved in histone acetylation and chromatin assembly. It is part of the Mi-2 complex which has been implicated in chromatin remodeling and transcriptional repression associated with histone deacetylation. This encoded protein is also part of co-repressor complexes, which is an integral component of transcriptional silencing. It is found among several cellular proteins that bind directly to retinoblastoma protein to regulate cell proliferation. This protein also seems to be involved in transcriptional repression of E2F-responsive genes.
8741	This gene encodes a protein that contains a SET domain. SET domains appear to be protein-protein interaction domains that mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). The function of this gene has not been determined. Two alternatively spliced transcript variants have been found for this gene.
8742	
8743	
8744	
8745	
8746	
8747	
8748	
8749	
8750	
8751	
8752	This gene product is a member of a family of proteins characterized by a specific cysteine-rich C-terminal domain, that is involved in transcriptional regulation of viral genome expression. Translation initiation from two different sites, including a non-AUG (GUG) codon, results in the production of two different isoforms (p32 and p40) which have different subcellular localization; p32 is mainly found in the cytoplasm, whereas p40 is targeted to the nucleolus. Both isoforms have transcriptional regulatory activity that is attributable to the cysteine-rich C-terminal domain.
8753	
8754	
8755	
8756	
8757	Coagulation factor II is proteolytically cleaved to form thrombin in the first step of the coagulation cascade which ultimately results in the stemming of blood loss.  F2 also plays a role in maintaining vascular integrity during development and postnatal life.  Mutations in F2 leads to various forms of thrombosis and dysprothrombinemia.
8758	
8759	
8760	
8761	
8762	The family of natriuretic peptides (see MIM 108780) elicit a number of vascular, renal, and endocrine effects that are important in the maintenance of blood pressure and extracellular fluid volume. These effects are mediated by specific binding of the peptides to cell surface receptors in the vasculature, kidney, adrenal, and brain.[supplied by OMIM]
8763	
8764	
8765	Cholesterol homeostasis is regulated, at least in part, by sterol regulatory element (SRE)-binding proteins (e.g., SREBP1; MIM 184756) and by liver X receptors (e.g., LXRA; MIM 602423). Upon sterol depletion, LXRs are inactive and SREBPs are cleaved, after which they bind promoter SREs and activate genes involved in cholesterol biosynthesis and uptake. Sterol transport is mediated by vesicles or by soluble protein carriers, such as steroidogenic acute regulatory protein (STAR; MIM 600617). STAR is homologous to a family of proteins containing a 200- to 210-amino acid STAR-related lipid transfer (START) domain, including STARD4 (Soccio et al., 2002).[supplied by OMIM]
8766	Calcium-dependent membrane-binding proteins may regulate molecular events at the interface of the cell membrane and cytoplasm. This gene is one of several genes that encode a calcium-dependent protein containing two N-terminal type II C2 domains and an integrin A domain-like sequence in the C-terminus.
8767	This gene encodes a member of the LIM homeobox gene family of developmentally expressed transcription factors. The encoded protein contains a homeodomain and two cysteine-rich zinc-binding LIM domains involved in protein-protein interactions. The protein is highly similar to a mouse protein that causes gonadal agenesis when inactivated, suggesting a role in gonadal development. Alternative splicing results in multiple transcript variants.
8768	
8769	
8770	
8771	
8772	
8773	
8774	
8775	
8776	
8777	
8778	This gene encodes a protein that may play a role in organelle biogenesis associated with melanosomes, platelet dense granules, and lysosomes. The encoded protein is a component of three different protein complexes termed biogenesis of lysosome-related organelles complex (BLOC)-3, BLOC4, and BLOC5. Mutations in this gene are associated with Hermansky-Pudlak syndrome type 1. Multiple transcript variants encoding distinct isoforms have been identified for this gene; the full-length sequences of some of these have not been determined yet.
8779	
8780	
8781	
8782	
8783	
8784	
8785	
8786	Human glutamate receptor delta-2 (GRID2) is a relatively new member of the family of ionotropic glutamate receptors which are the predominant excitatory neurotransmitter receptors in the mammalian brain.  GRID2 is a predicted 1,007 amino acid protein that shares 97% identity with the mouse homolog which is expressed selectively in cerebellar Purkinje cells.  A point mutation in mouse GRID2, associated with the phenotype named @#$%&lurcher@#$%&, in the heterozygous state leads to ataxia resulting from selective, cell-autonomous apoptosis of cerebellar Purkinje cells during postnatal development. Mice homozygous for this mutation die shortly after birth from massive loss of mid- and hindbrain neurons during late embryogenesis.  This strongly suggests a role for GRID2 in neuronal apoptotic death.
8787	The heat-shock response is elicited by exposure of cells to thermal and chemical stress and through the activation of HSFs (heat shock factors) results in the elevated expression of heat-shock induced genes.  Heat shock factor binding protein 1 (HSBP1), is a 76-amino-acid protein that binds to heat shock factor 1(HSF1), which is a transcription factor involved in the HS response.  During HS response, HSF1 undergoes conformational transition from an inert non-DNA-binding monomer to active functional trimers. HSBP1 is nuclear-localized and interacts  with the active trimeric state of HSF1 to negatively regulate HSF1 DNA-binding activity. Overexpression of HSBP1 in mammalian cells represses the transactivation activity of HSF1. When overexpressed in C.elegans HSBP1 has severe effects on survival of the animals after thermal and chemical stress consistent with a role of HSBP1 as a negative regulator of heat shock response.
8788	The gamma-aminobutyric acid (GABA) A receptor is a multisubunit chloride channel that mediates the fastest inhibitory synaptic transmission in the central nervous system. The subunit encoded by this gene is expressed in several non-neuronal tissues including the uterus and ovaries. This subunit can assemble with known GABA A receptor subunits, and the presence of this subunit alters the sensitivity of recombinant receptors to modulatory agents such as pregnanolone.
8789	
8790	
8791	
8792	
8793	
8794	This gene encodes a protein that contains a cytosolic N-terminus, multiple helical transmembrane domains, and an endoplasmic reticulum membrane retention signal, TKGH, in the C-terminus. The encoded protein may be important in development and homeostasis of the inner ear and retina. Mutations within this gene have been associated with Usher syndrome type IIIa. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
8795	
8796	
8797	LCOR is a transcriptional corepressor widely expressed in fetal and adult tissues that is recruited to agonist-bound nuclear receptors through a single LxxLL motif, also referred to as a nuclear receptor (NR) box.[supplied by OMIM]
8798	
8799	This gene encodes a phosphatase enzyme that is involved in actin polymerization and is found in the trans-Golgi network. Mutations in this gene cause oculocerebrorenal syndrome of Lowe and also Dent disease.
8800	
8801	
8802	Glutamic-oxaloacetic transaminase is a pyridoxal phosphate-dependent enzyme which exists in cytoplasmic and inner-membrane mitochondrial forms, GOT1 and GOT2, respectively.  GOT plays a role in amino acid metabolism and the urea and tricarboxylic acid cycles. The two enzymes are homodimeric and show close homology.
8803	
8804	
8805	
8806	
8807	Potassium channels are present in most mammalian cells, where they participate in a wide range of physiologic responses. The protein encoded by this gene is an integral membrane protein and inward-rectifier type potassium channel. The encoded protein, which has a greater tendency to allow potassium to flow into a cell rather than out of a cell, probably participates in establishing action potential waveform and excitability of neuronal and muscle tissues. Mutations in this gene have been associated with Andersen syndrome, which is characterized by periodic paralysis, cardiac arrhythmias, and dysmorphic features.
8808	This intronless gene encodes the receptor protein for melanocyte-stimulating hormone (MSH). The encoded protein, a seven pass transmembrane G protein coupled receptor, controls melanogenesis. Two types of melanin exist: red pheomelanin and black eumelanin. Gene mutations that lead to a loss in function are associated with increased pheomelanin production, which leads to lighter skin and hair color. Eumelanin is photoprotective but pheomelanin may contribute to UV-induced skin damage by generating free radicals upon UV radiation. Binding of MSH to its receptor activates the receptor and stimulates eumelanin synthesis. This receptor is a major determining factor in sun sensitivity and is a genetic risk factor for melanoma and non-melanoma skin cancer. Over 30 variant alleles have been identified which correlate with skin and hair color, providing evidence that this gene is an important component in determining normal human pigment variation.
8809	
8810	
8811	
8812	
8813	Multiprotein complexes are key determinants of Golgi apparatus structure and its capacity for intracellular transport and glycoprotein modification. Several complexes have been identified, including the Golgi transport complex (GTC), the LDLC complex, which is involved in glycosylation reactions, and the SEC34 complex, which is involved in vesicular transport. These 3 complexes are identical and have been termed the conserved oligomeric Golgi (COG) complex, which includes COG7 (Ungar et al., 2002).[supplied by OMIM]
8814	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, and it may be involved in ribosome assembly. Fusion of this gene and the nucleoporin gene, NUP98, by inversion 11 (p15q22) chromosome translocation is found in the patients with de novo or therapy-related myeloid malignancies.
8815	
8816	
8817	This gene is a member of the MER/AXL/TYRO3 receptor kinase family and encodes a transmembrane protein with two fibronectin type-III domains, two Ig-like C2-type (immunoglobulin-like) domains, and one tyrosine kinase domain. Mutations in this gene have been associated with disruption of the retinal pigment epithelium (RPE) phagocytosis pathway and onset of autosomal recessive retinitis pigmentosa (RP).
8818	The ITGB2 protein product is the integrin beta chain beta 2. Integrins are integral cell-surface proteins composed of an alpha chain and a beta chain.  A given chain may combine with multiple partners resulting in different integrins.  For example, beta 2 combines with the alpha L chain to form the integrin LFA-1, and combines with the alpha M chain to form the integrin Mac-1. Integrins are known to participate in cell adhesion as well as cell-surface mediated signalling.
8819	The protein encoded by this gene shares similarity with the product of Drosophila syd gene, required for the functional interaction of kinesin I with axonal cargo. Studies of the similar gene in mouse suggested that this protein may interact with, and regulate the activity of numerous protein kinases of the JNK signaling pathway, and thus function as a scaffold protein in neuronal cells. The C. elegans counterpart of this gene is found to regulate synaptic vesicle transport possibly by integrating JNK signaling and kinesin-1 transport. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
8820	
8821	The protein encoded by this gene is a post-proline cleaving aminopeptidase expressed in quiescent lymphocytes. The resting lymphocytes are maintained through suppression of apoptosis, a state which is disrupted by inhibition of this novel serine protease. The enzyme has strong sequence homology with prolylcarboxypeptidase and is active at both acidic and neutral pH.
8822	This gene is an estrogen-responsive gene that is an early response gene in the estrogen receptor-regulated pathway. It is thought to play an important role in hormone-responsive tissues and cancer. Three alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
8823	The protein encoded by this gene is a member of the AAA protein family (ATPases associated with diverse cellular activities), and is the homolog of the yeast Vps4 protein. In humans, two paralogs of the yeast protein have been identified. The former share a high degree of aa sequence similarity with each other, and also with yeast Vps4 and mouse Skd1 proteins. The mouse Skd1 (suppressor of K+ transport defect 1) has been shown to be really an yeast Vps4 ortholog. Functional studies indicate that both human paralogs associate with the endosomal compartments, and are involved in intracellular protein trafficking, similar to Vps4 protein in yeast. The gene encoding this paralog has been mapped to chromosome 16; the gene for the other resides on chromosome 18.
8824	This gene encodes a cytokine member of the cysteine knot superfamily, characterized by nine conserved cysteines and a cysteine knot region. The cerberus-related cytokines, together with Dan and DRM/Gremlin, represent a group of bone morphogenetic protein (BMP) antagonists that can bind directly to BMPs and inhibit their activity.
8825	Bone morphogenic proteins (see BMP2, MIM 112261) are important for body patterning and morphogenesis. PRDC is one of several BMP antagonists that regulate the functions of BMPs during embryonic development and tissue differentiation (Sudo et al., 2004).[supplied by OMIM]
8826	
8827	
8828	
8829	
8830	
8831	
8832	
8833	
8834	This gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. The product of this gene contains four conserved cysteines characteristic of an atypical zinc-binding domain. Like its counterpart in yeast, this subunit may be shared by the other two DNA-directed RNA polymerases.
8835	
8836	
8837	This gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. The product of this gene contains four conserved cysteines characteristic of an atypical zinc-binding domain. Like its counterpart in yeast, this subunit may be shared by the other two DNA-directed RNA polymerases.
8838	MONDOA forms heterodimers with MLX (MIM 602976) that can bind to and activate transcription from CACGTG E boxes (Billin et al., 2000 [PubMed 11073985]).[supplied by OMIM]
8839	
8840	
8841	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L24P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
8842	
8843	
8844	
8845	
8846	
8847	This gene encodes a protein that plays an important role in the organization of the actin cytoskeleton. The encoded protein binds to a region of Wiskott-Aldrich syndrome protein that is frequently mutated in Wiskott-Aldrich syndrome, an X-linked recessive disorder. Impairment of the interaction between these two proteins may contribute to the disease. Alternative transcript variants exist for this gene, but their full-length natures have not been described.
8848	
8849	
8850	
8851	
8852	NFKB1 (MIM 164011) or NFKB2 (MIM 164012) is bound to REL (MIM 164910), RELA (MIM 164014), or RELB (MIM 604758) to form the NFKB complex. The NFKB complex is inhibited by I-kappa-B proteins (NFKBIA, MIM 164008, or NFKBIB), which inactivate NF-kappa-B by trapping it in the cytoplasm. Phosphorylation of serine residues on the I-kappa-B proteins by kinases (IKBKA, MIM 600664 or IKBKB, MIM 603258) marks them for destruction via the ubiquitination pathway, thereby allowing activation of the NF-kappa-B complex. Activated NFKB complex translocates into the nucleus and binds DNA at kappa-B-binding motifs such as 5-prime GGGRNNYYCC 3-prime or 5-prime HGGARNYYCC 3-prime (where H is A, C, or T; R is an A or G purine; and Y is a C or T pyrimidine).[supplied by OMIM]
8853	
8854	
8855	
8856	
8857	
8858	
8859	
8860	
8861	
8862	
8863	
8864	
8865	
8866	
8867	Proteoglycans, which consist of a core protein and covalently linked glycosaminoglycans, are components of the extracellular matrix. SPOCK3 encodes a member of a novel Ca(2+)-binding proteoglycan family.[supplied by OMIM]
8868	
8869	X-prolyl aminopeptidase (EC 3.4.11.9) is a proline-specific metalloaminopeptidase that specifically catalyzes the removal of any unsubstituted N-terminal amino acid that is adjacent to a penultimate proline residue. Because of its specificity toward proline, it has been suggested that X-prolyl aminopeptidase is important in the maturation and degradation of peptide hormones, neuropeptides, and tachykinins, as well as in the digestion of otherwise resistant dietary protein fragments, thereby complementing the pancreatic peptidases. Deficiency of X-prolyl aminopeptidase results in excretion of large amounts of imino-oligopeptides in urine (Blau et al., 1988).[supplied by OMIM]
8870	
8871	
8872	
8873	
8874	
8875	Staufen homolog 2 is a member of the family of double-stranded RNA (dsRNA)-binding proteins involved in the transport and/or localization of mRNAs to different subcellular compartments and/or organelles.  These proteins are characterized by the presence of multiple dsRNA-binding domains which are required to bind RNAs having double-stranded secondary structures.  Staufen homolog 2 shares 48.5% and 59.9% similarity with drosophila and human staufen, respectively.  The exact function of Staufen homolog 2 is not known, but since it contains 3 copies of conserved dsRNA binding domain, it could be involved in double-stranded RNA binding events.
8876	Mutations in this gene are associated with Pendred syndrome, the most common form of syndromic deafness, an autosomal-recessive disease. It is highly homologous to the SLC26A3 gene; they have similar genomic structures and this gene is located 3@#$%& of the SLC26A3 gene. The encoded protein has homology to sulfate transporters.
8877	The importin-alpha/beta complex and the GTPase Ran mediate nuclear import of proteins with a classical nuclear localization signal. The protein encoded by this gene is a member of a class of approximately 20 potential Ran targets that share a sequence motif related to the Ran-binding site of importin-beta. Similar to importin-beta, this protein prevents the activation of Ran@#$%&s GTPase by RanGAP1 and inhibits nucleotide exchange on RanGTP, and also binds directly to nuclear pore complexes where it competes for binding sites with importin-beta and transportin. This protein has a Ran-dependent transport cycle and it can cross the nuclear envelope rapidly and in both directions. At least four importin beta-like transport receptors, namely importin beta itself, transportin, RanBP5 and RanBP7, directly bind and import ribosomal proteins.
8878	
8879	The protein encoded by this gene is a glycogen phosphorylase found predominantly in the brain. The encoded protein forms homodimers which can associate into homotetramers, the enzymatically active form of glycogen phosphorylase. The activity of this enzyme is positively regulated by AMP and negatively regulated by ATP, ADP, and glucose-6-phosphate. This enzyme catalyzes the rate-determining step in glycogen degradation.
8880	
8881	
8882	
8883	
8884	
8885	This gene encodes a member of the tweety family of proteins. Members of this family function as chloride anion channels. The encoded protein functions as a calcium(2+)-activated large conductance chloride(-) channel.
8886	
8887	CD109 is a GPI-linked cell surface antigen expressed by CD34+ acute myeloid leukemia cell lines, T-cell lines, activated T lymphoblasts, endothelial cells, and activated platelets (Lin et al., 2002). In addition, the platelet-specific Gov antigen system, implicated in refractoriness to platelet transfusion, neonatal alloimmune thrombocytopenia, and posttransfusion purpura, is carried by CD109 (Kelton et al., 1990; Lin et al., 2002).[supplied by OMIM]
8888	
8889	This gene encodes an alpha chain for one of the low abundance fibrillar collagens. Fibrillar collagen molecules are trimers that can be composed of one or more types of alpha chains. Type V collagen is found in tissues containing type I collagen and appears to regulate the assembly of heterotypic fibers composed of both type I and type V collagen. This gene product is closely related to type XI collagen and it is possible that the collagen chains of types V and XI constitute a single collagen type with tissue-specific chain combinations. Mutations in this gene are associated with Ehlers-Danlos syndrome, types I and II. Two transcripts that differ in the length of the 3@#$%&UTR due to the use of alternative polyadenylation signals have been identified for this gene.
8890	
8891	
8892	
8893	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel consists of three main subunits (a, b, c). This gene encodes the beta subunit of the catalytic core.
8894	L-glutamate is the major excitatory neurotransmitter in the central nervous system and activates both ionotropic and metabotropic glutamate receptors. Glutamatergic neurotransmission is involved in most aspects of normal brain function and can be perturbed in many neuropathologic conditions. The metabotropic glutamate receptors are a family of G protein-coupled receptors, that have been divided into 3 groups on the basis of sequence homology, putative signal transduction mechanisms, and pharmacologic properties. Group I includes GRM1 and GRM5 and these receptors have been shown to activate phospholipase C. Group II includes GRM2 and GRM3 while Group III includes GRM4, GRM6, GRM7 and GRM8. Group II and III receptors are linked to the inhibition of the cyclic AMP cascade but differ in their agonist selectivities.
8895	Interferon consensus sequence-binding protein (ICSBP) is a transcription factor of the interferon (IFN) regulatory factor (IRF) family. Proteins of this family are composed of a conserved DNA-binding domain in the N-terminal region and a divergent C-terminal region that serves as the regulatory domain. The IRF family proteins bind to the IFN-stimulated response element (ISRE) and regulate expression of genes stimulated by type I IFNs, namely IFN-alpha and IFN-beta. IRF family proteins also control expression of IFN-alpha and IFN-beta-regulated genes that are induced by viral infection.
8896	This protein belongs to the basic helix-loop-helix (BHLH) family of transcription factors. It activates E-box dependent transcription along with E47.
8897	
8898	
8899	
8900	
8901	SLY contains an Src homology-3 (SH3) domain and a sterile alpha motif (SAM), both of which are found in proteins involved in cell signaling (Beer et al., 2001).[supplied by OMIM]
8902	
8903	
8904	The prompt clearance of cells undergoing apoptosis is critical during embryonic development, normal tissue turnover, inflammation, and autoimmunity. CED6 is an evolutionarily conserved adaptor protein required for efficient engulfment of apoptotic cells by phagocytes.[supplied by OMIM]
8905	
8906	
8907	
8908	This gene encodes a cell surface tyrosine kinase receptor for members of the platelet-derived growth factor family. These growth factors are mitogens for cells of mesenchymal origin. The identity of the growth factor bound to a receptor monomer determines whether the functional receptor is a homodimer or a heterodimer, composed of both platelet-derived growth factor receptor alpha and beta polypeptides. Studies in knockout mice, where homozygosity is lethal, indicate that the alpha form of the platelet-derived growth factor receptor is particularly important for kidney development since mice heterozygous for the receptor exhibit defective kidney phenotypes.
8909	Platelet glycoprotein IX (GP9) is a small membrane glycoprotein found on the surface of human platelets. It forms a 1-to-1 noncovalent complex with glycoprotein Ib (GP Ib), a platelet surface membrane glycoprotein complex that functions as a receptor for von Willebrand factor (VWF; MIM 193400). The main portion of the receptor is a heterodimer composed of 2 polypeptide chains, an alpha chain (GP1BA; MIM 606672) and a beta chain (GP1BB; MIM 138720), that are linked by disulfide bonds. The complete receptor complex includes noncovalent association of the alpha and beta subunits with GP9 and platelet glycoprotein V (GP5; MIM 173511).[supplied by OMIM]
8910	
8911	
8912	The protein encoded by this gene is a proinflammatory cytokine produced by activated T cells. This cytokine regulates the activities of NF-kappaB and mitogen-activated protein kinases. This cytokine can stimulate the expression of IL6 and cyclooxygenase-2 (PTGS2/COX-2), as well as enhance the production of nitric oxide (NO). High levels of this cytokine are associated with several chronic inflammatory diseases including rheumatoid arthritis, psoriasis and multiple sclerosis.
8913	
8914	
8915	
8916	
8917	
8918	
8919	
8920	
8921	The protein encoded by this gene is part of the large ATP-dependent chromatin remodeling complex SWI/SNF, which is required for transcriptional activation of genes normally repressed by chromatin. The encoded protein, either alone or when in the SWI/SNF complex, can bind to 4-way junction DNA, which is thought to mimic the topology of DNA as it enters or exits the nucleosome. The protein contains a DNA-binding HMG domain, but disruption of this domain does not abolish the DNA-binding or nucleosome-displacement activities of the SWI/SNF complex. Unlike most of the SWI/SNF complex proteins, this protein has no yeast counterpart.
8922	
8923	
8924	
8925	
8926	
8927	
8928	
8929	
8930	
8931	This gene is a germ cell-specific gene that encodes proteins that interact with POG (proliferation of germ cells). Alternatively spliced transcript variants of a similar mouse gene encode at least three different proteins, namely gametogenetin protein 1a, gametogenetin protein 2, and gametogenetin protein 3, which show a perinuclear, cytoplasmic, and nucleolar localization, respectively. These proteins regulate the localization of POG and may play a role in spermatogenesis. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
8932	
8933	
8934	
8935	
8936	
8937	In Aspergillus nidulans, lack of the serine/threonine kinase NimA (never in mitosis A) results in cell cycle arrest in G2, while overexpression causes the premature onset of mitotic events. The protein encoded by this gene is similar in sequence to the Aspergillus nidulans protein and may therefore play a role in mitotic regulation. However, the encoded protein differs from other NimA family members in that it is not cell cycle regulated and is found primarily in the cytoplasm. Three transcript variants have been found for this gene, but the full-length nature of only two of them has been characterized.
8938	MAD1L1 is a component of the mitotic spindle-assembly checkpoint that prevents the onset of anaphase until all chromosome are properly aligned at the metaphase plate. MAD1L1 functions as a homodimer and interacts with MAD2L1. MAD1L1 may play a role in cell cycle control and tumor suppression. Three transcript variants encoding the same protein have been found for this gene.
8939	The protein encoded by this gene belongs to the cdc2/cdkx subfamily of the ser/thr family of protein kinases. It has similarity to rat protein which is thought to play a role in terminally differentiated neurons.
8940	The ADF (actin-depolymerizing factor)/cofilin family (see MIM 601442) is composed of stimulus-responsive mediators of actin dynamics. ADF/cofilin proteins are inactivated by kinases such as LIM domain kinase-1 (LIMK1; MIM 601329). The SSH family appears to play a role in actin dynamics by reactivating ADF/cofilin proteins in vivo (Niwa et al., 2002).[supplied by OMIM]
8941	MHC2TA encodes a non-DNA binding transactivator that functions both in constitutive and inducible MHC Class II expression. Mutations in MHC2TA result in the complementation group A of Bare Lymphocyte Syndrome.
8942	
8943	
8944	
8945	
8946	CD222 is a 250kDa transmembrane protein with a short cytoplasmic tail containing an internalization signal. CD222 was originally identified as a receptor for IGFII and M6P-containing proteins (e.g. lysosomal hydrolases). Lysosomal enzymes are sorted to lysosomes via CD222 either from the Golgi, where the enzymes acquire M6P, or from the extracellular space. The majority of CD222 molecules (approximately 90-95%) are located intracellularly, only 5-10% is present on the cell membrane. The internalization rate seems to be enhanced by ligand induced dimerization of CD222 as well as by phosphorylation of its cytoplasmic serine. CD222 is also a receptor for TGFbeta latency associated peptide (LAP), proliferin and may bind several molecules independently of M6P, including plasminogen, CD87 or retinoic acid. It is involved in activation of latent TGFbeta [PROW]
8947	
8948	
8949	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies and its function has not been determined. Four alternatively spliced transcript variants for this gene have been described.
8950	
8951	
8952	
8953	The protein encoded by this gene is a member of the Toll-like receptor (TLR) family which plays a fundamental role in pathogen recognition and activation of innate immunity. TLRs are highly conserved from Drosophila to humans and share structural and functional similarities. They recognize pathogen-associated molecular patterns (PAMPs) that are expressed on infectious agents, and mediate the production of cytokines necessary for the development of effective immunity. The various TLRs exhibit different patterns of expression. This gene is most highly expressed in lymphoid tissues such as spleen, lymph node, thymus, and tonsil. Its exact function is not known. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene.
8954	
8955	
8956	
8957	This gene is the human homolog of mouse BMP-2-inducible kinase. Bone morphogenic proteins (BMPs) play a key role in skeletal development and patterning. Expression of the mouse gene is increased during BMP-2 induced differentiation and the gene product is a putative serine/threonine protein kinase containing a nuclear localization signal. Therefore, the protein encoded by this human homolog is thought to be a protein kinase with a putative regulatory role in attenuating the program of osteoblast differentiation. Two transcript variants encoding different isoforms have been found for this gene.
8958	
8959	
8960	
8961	
8962	
8963	Dystrophin is a large, rod-like cytoskeletal protein found at the inner surface of muscle fibers. Dystrophin is missing in Duchenne Muscular Dystrophy patients and is present in reduced amounts in Becker Muscular Dystrophy patients. The protein encoded by this gene is a peripheral membrane protein found associated with dystrophin and dystrophin-related proteins. This gene is a member of the syntrophin gene family, which contains at least two other structurally-related genes. Two transcript variants encoding different protein isoforms have been described for this gene.
8964	
8965	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. Its protein product is a protease present in seminal plasma. It is thought to function normally in the liquefaction of seminal coagulum, presumably by hydrolysis of the high molecular mass seminal vesicle protein. Serum level of this protein, called PSA in the clinical setting, is useful in the diagnosis and monitoring of prostatic carcinoma. Alternate splicing of this gene generates two transcript variants encoding different isoforms. Additional transcript variants have been described, but it is unclear if these transcripts are normally expressed or if they are specific to benign or malignant tumors.
8966	This gene encodes a protein that is abnormally elevated in malignant tumors of astrocytic origin. It is a strongly conserved gene containing a BRCT domain that is essential for the activity of this gene product. The gene plays a crucial role in cell proliferation and may be necessary for oncogenic transformation and tumor progression.
8967	
8968	This gene is the human homolog of mouse jerky gene. The encoded protein has similarity to several nuclear regulatory proteins, including centromere protein B, suggesting that it might function as a DNA-binding protein. Insertional inactivation of this gene in transgenic mice resulted in epileptic seizures. Childhood Absence Epilepsy (CAE) has been mapped to the same chromosomal location (8q24.3) as this gene, making this gene a strong candidate for CAE.
8969	
8970	
8971	
8972	
8973	
8974	The piggyBac family of proteins, found in diverse animals, are transposases related to the transposase of the canonical piggyBac transposon from the moth, Trichoplusia ni. This family also includes genes in several genomes, including human, that appear to have been derived from the piggyBac transposons. This gene belongs to the subfamily of piggyBac transposable element derived (PGBD) genes. The PGBD proteins appear to be novel, with no obvious relationship to other transposases, or other known protein families. The exact function of this gene is not known. Two transcript variants encoding different isoforms have been found for this gene.
8975	
8976	
8977	
8978	
8979	This gene encodes a protein that is a member of the Shank family of synaptic proteins that may function as molecular scaffolds in the postsynaptic density (PSD). Shank proteins contain multiple domains for protein-protein interaction, including ankyrin repeats, an SH3 domain, a PSD-95/Dlg/ZO-1 domain, a sterile alpha motif domain, and a proline-rich region. This particular family member contains a PDZ domain, a consensus sequence for cortactin SH3 domain-binding peptides and a sterile alpha motif. The alternative splicing demonstrated in Shank genes has been suggested as a mechanism for regulating the molecular structure of Shank and the spectrum of Shank-interacting proteins in the PSDs of adult and developing brain. Two alternative splice variants, encoding distinct isoforms, are reported. Additional splice variants exist but their full-length nature has not been determined.
8980	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. The function of this gene has not yet been determined. The expression pattern of the mouse homolog implies a role in nervous system development.
8981	
8982	
8983	
8984	
8985	
8986	The protein encoded by this gene is one polypeptide of a small nuclear ribonucleoprotein complex and belongs to the snRNP SMB/SMN family. The protein plays a role in pre-mRNA processing, possibly tissue-specific alternative splicing events. Although individual snRNPs are believed to recognize specific nucleic acid sequences through RNA-RNA base pairing, the specific role of this family member is unknown. The protein arises from a bicistronic transcript that also encodes a protein identified as the SNRPN upstream reading frame (SNURF). Multiple transcription initiation sites have been identified and extensive alternative splicing occurs in the 5@#$%& untranslated region. Additional splice variants have been described but sequences for the complete transcripts have not been determined. The 5@#$%& UTR of this gene has been identified as an imprinting center. Alternative splicing or deletion caused by a translocation event in this paternally-expressed region is responsible for Angelman syndrome or Prader-Willi syndrome due to parental imprint switch failure.
8987	
8988	
8989	The protein encoded by this gene is a member of the IRS1-like multisubstrate docking protein family. The encoded protein is an important mediator of branching tubulogenesis and plays a central role in cellular growth response, transformation and apoptosis. Two transcript variants encoding different isoforms have been found for this gene.
8990	This gene is induced by androgen in prostate adenocarcinoma cells. Transcript variants utilizing alternative polyadenylation signals have been described in the literature.
8991	
8992	
8993	This gene encodes a member of the family of adenylate cyclases, which are membrane-associated enzymes that catalyze the formation of the secondary messenger cyclic adenosine monophosphate (cAMP). This enzyme is insensitive to Ca(2+)/calmodulin, and is stimulated by the G protein beta and gamma subunit complex. It is expressed in brain.
8994	
8995	3-hydroxyisobutyrate dehydrogenase (3-hydroxy-2-methylpropanoate:NAD(+) oxidoreductase, EC 1.1.1.31) is a dimeric mitochondrial enzyme that catalyzes the NAD(+)-dependent, reversible oxidation of 3-hydroxyisobutyrate, an intermediate of valine catabolism, to methylmalonate semialdehyde.[supplied by OMIM]
8996	
8997	This gene encodes a nuclear envelope protein that appears to be involved in spermatogenesis, either directly or by influencing genes that play a more direct role in the process. This multi-exon locus is the homolog of the mouse and drosophila germ cell-less gene but the human genome also contains a single-exon locus on chromosome 5 that contains an open reading frame capable of encoding a highly-related protein.
8998	
8999	
9000	
9001	
9002	
9003	TOB2 belongs to the TOB (see TOB1; MIM 605523)/BTG1 (MIM 109580) family of antiproliferative proteins, which are involved in the regulation of cell cycle progression.[supplied by OMIM]
9004	This gene encodes a member of the GIT protein family. GIT proteins interact with G protein-coupled receptor kinases and possess ADP-ribosylation factor (ARF) GTPase-activating protein (GAP) activity. This gene undergoes extensive alternative splicing; although ten transcript variants have been described, the full length sequence has been determined for only four variants. The various isoforms have functional differences, with respect to ARF GAP activity and to G protein-coupled receptor kinase 2 binding.
9005	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 non-identical kinase catalytic domains and phosphorylates various substrates, including members of the mitogen-activated kinase (MAPK) signalling pathway. The activity of this protein has been implicated in controlling cell growth and differentiation. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
9006	
9007	
9008	
9009	
9010	This gene encodes the third of three enzymes that posttranslationally modify isoprenylated C-terminal cysteine residues in certain proteins and target those proteins to the cell membrane. This enzyme localizes to the endoplasmic reticulum. Alternative splicing results in two transcript variants encoding different isoforms.
9011	
9012	
9013	
9014	PIST is a PDZ domain-containing Golgi protein. PDZ domains contain approximately 90 amino acids and bind the extreme C terminus of proteins in a sequence-specific manner.[supplied by OMIM]
9015	
9016	
9017	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
9018	
9019	
9020	
9021	
9022	
9023	Oxysterol binding protein is an intracellular protein that is believed to transport sterols from lysosomes to the nucleus where the sterol down-regulates the genes for the LDL receptor, HMG-CoA reductase, and HMG synthetase
9024	This gene encodes one of the DNA-unwinding enzymes which prefers partially unwound 3@#$%&-tailed substrates and can also unwind partial RNA/DNA and RNA/RNA duplexes in an ATP-dependent fashion. This enzyme is a member of the heterogeneous nuclear RNA-binding proteins and is also an element of the Ras signal transduction pathway. It binds specifically to the Ras-GTPase-activating protein by associating with its SH3 domain. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
9025	GEMIN5 is part of a large macromolecular complex localized to both the cytoplasm and the nucleus that plays a role in the cytoplasmic assembly of small nuclear ribonucleoproteins (snRNPs). Other members of this complex include SMN (MIM 600354), GEMIN2 (SIP1; MIM 602595), GEMIN3 (DDX20; MIM 606168), and GEMIN4 (MIM 606969).[supplied by OMIM]
9026	Interleukin 1 induces synthesis of acute phase and proinflammatory proteins during infection, tissue damage, or stress, by forming a complex at the cell membrane with an interleukin 1 receptor and an accessory protein. This gene encodes an interleukin 1 receptor accessory protein. Alternative splicing of this gene results in two transcript variants encoding two different isoforms, one membrane-bound and one soluble.
9027	
9028	
9029	The protein encoded by this gene belongs to the sperm-coating protein family of epididymal origin. This protein and its canine homolog are the first known examples of proteins with four tandemly arranged fibronectin type 2 (Fn2) domains in the Fn2-module protein family.
9030	
9031	CLDN18 belongs to the large claudin family of proteins, which form tight junction strands in epithelial cells (Niimi et al., 2001).[supplied by OMIM]
9032	
9033	
9034	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. Expression of this gene is up-regulated by interferon. This gene is mappped to chromosome 11p15, where it resides within a TRIM gene cluster. Alternate splicing of this gene generates four transcript variants. Additionally, a read-through transcript transcribed from this gene and TRIM6 has been observed.
9035	
9036	The protein encoded by this gene belongs to the subtilisin-like proprotein convertase family. The members of this family are proprotein convertases that process latent precursor proteins into their biologically active products. This encoded protein is a calcium-dependent serine endoprotease that can efficiently cleave precursor proteins at their paired basic amino acid processing sites. Some of its substrates are: proparathyroid hormone, transforming growth factor beta 1 precursor, proalbumin, pro-beta-secretase, membrane type-1 matrix metalloproteinase, beta subunit of pro-nerve growth factor and von Willebrand factor. It is also thought to be one of the proteases responsible for the activation of HIV envelope glycoproteins gp160 and gp140. This gene is thought to play a role in tumor progression. The use of alternate polyadenylation sites has been found for this gene.
9037	
9038	
9039	
9040	
9041	
9042	
9043	
9044	This gene is a member of KDWK gene family. The product of this gene associates with GMEB1 protein, and the complex is essential for parvovirus DNA replication. Study of rat homolog implicates the role of this gene in modulation of transactivation by the glucocorticoid receptor bound to glucocorticoid response elements. This gene appears to use multiple polyadenylation sites.
9045	The protein encoded by this gene is a guanine nucleotide exchange factor (GEF) similar to the Saccharomyces cerevisiae CDC25 gene product. Functional analysis has demonstrated that this protein stimulates the dissociation of GDP from RAS protein. The studies of the similar gene in mouse suggested that the Ras-GEF activity of this protein in brain can be activated by Ca2+ influx, muscarinic receptors, and G protein beta-gamma subunit. Mouse studies also indicated that the Ras-GEF signaling pathway mediated by this protein may be important for long-term memory. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
9046	
9047	
9048	Aminoacyl-tRNA synthetases are a class of enzymes that charge tRNAs with their cognate amino acids. FARS1 encodes a mitochondrially-located phenylalanine-tRNA synthetase (PheRS) which consists of a single polypeptide chain unlike the (alpha-beta)2 structure of the prokaryotic and eukaryotic cytoplasmic forms of PheRS. Structure analysis and catalytic properties indicate mitochondrial PheRSs may constitute a class of PheRS distinct from the enzymes found in prokaryotes and the eukaryotic cytoplasm.
9049	
9050	
9051	
9052	
9053	
9054	
9055	
9056	
9057	
9058	The protein encoded by this gene belongs to the family of P-type primary ion transport ATPases characterized by the formation of an aspartyl phosphate intermediate during the reaction cycle. These enzymes remove bivalent calcium ions from eukaryotic cells against very large concentration gradients and play a critical role in intracellular calcium homeostasis. The mammalian plasma membrane calcium ATPase isoforms are encoded by at least four separate genes and the diversity of these enzymes is further increased by alternative splicing of transcripts. The expression of different isoforms and splice variants is regulated in a developmental, tissue- and cell type-specific manner, suggesting that these pumps are functionally adapted to the physiological needs of particular cells and tissues. This gene encodes the plasma membrane calcium ATPase isoform 3. Alternatively spliced transcript variants encoding different isoforms have been identified.
9059	There are at least four distinct but related alkaline phosphatases: intestinal, placental, placental-like, and liver/bone/kidney (tissue non-specific). The first three are located together on chromosome 2 while the tissue non-specific form is located on chromosome 1. The exact physiological function of the alkaline phosphatases is not known. The product of this gene is a membrane bound glycosylated enzyme that is localized to fetal and adult intestines although it is closely related to the placental form of the enzyme. Messages of several sizes have been detected but these do not differ in the coding region and may be the result of several polyadenylation signals in the 3@#$%& untranslated region.
9060	This gene encodes a double zinc finger (FYVE domain) protein that interacts directly with SMAD2 and SMAD3, and is involved in Alzheimer@#$%&s disease. SMAD proteins transmit signals from transmembrane serine/threonine kinase receptors to the nucleus. The FYVE domain has been identified in a number of unrelated signaling molecules. This protein functions to recruit SMAD2 to the transforming growth factor-beta receptor. The FYVE domain is required to maintain the normal localization of this protein but is not involved in mediating interaction with SMADs. The C-terminal domain of this protein interacts with the TGFB receptor. This protein is a component of the TGFB pathway that brings the SMAD substrate to the receptor. Three alternatively spliced transcripts encoding distinct isoforms have been found for this gene.
9061	This intronless gene is located upstream of the WT1 gene; these two genes are bi-directionally transcribed from the same promoter region. The WT1 gene is a tumor suppressor gene that has been implicated in Wilms tumor, and it is expressed in cells of mesodermal origin. This gene and the WT1 gene have the same temporal and cell-restricted expression pattern, although the expression of this gene is less abundant than WT1 expression. WT1 is involved in the development of the urogenital system, whereas the role of this gene in embryogenesis is unclear. It has been found that methylation of this gene is implicated in chemoresistant acute myeloid leukemia.
9062	
9063	
9064	
9065	
9066	
9067	SHPRH is a ubiquitously expressed protein that contains motifs characteristics of several DNA repair proteins, transcription factors, and helicases.[supplied by OMIM]
9068	This gene encodes a cytokine distantly related to type I interferons and the IL-10 family. This gene, interleukin 28A (IL28A), and interleukin 28B (IL28B) are three closely related cytokine genes that form a cytokine gene cluster on a chromosomal region mapped to 19q13. Expression of the cytokines encoded by the three genes can be induced by viral infection. All three cytokines have been shown to interact with a heterodimeric class II cytokine receptor that consists of interleukin 10 receptor, beta (IL10RB) and interleukin 28 receptor, alpha (IL28RA).
9069	
9070	The protein encoded by this gene is a guanine nucleotide exchange factor (GEF) similar to the Saccharomyces cerevisiae CDC25 gene product. Functional analysis has demonstrated that this protein stimulates the dissociation of GDP from RAS protein. The studies of the similar gene in mouse suggested that the Ras-GEF activity of this protein in brain can be activated by Ca2+ influx, muscarinic receptors, and G protein beta-gamma subunit. Mouse studies also indicated that the Ras-GEF signaling pathway mediated by this protein may be important for long-term memory. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
9071	
9072	The protein encoded by this gene is a member of the LAR protein-tyrosine phosphatase-interacting protein (liprin) family. Liprins interact with members of LAR family of transmembrane protein tyrosine phosphatases, which are known to be important for axon guidance and mammary gland development. It has been proposed that liprins are multivalent proteins that form complex structures and act as scaffolds for the recruitment and anchoring of LAR family of tyrosine phosphatases. This protein was found to interact with S100A4, a calcium-binding protein related to tumor invasiveness and metastasis. In vitro experiment demonstrated that the interaction inhibited the phosphorylation of this protein by protein kinase C and protein kinase CK2. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
9073	
9074	This gene encodes a protein with an N-terminal filamin-binding domain, a central proline-rich domain, and, multiple C-terminal LIM domains. This protein localizes at cell junctions and may link cell adhesion structures to the actin cytoskeleton. This protein may be involved in the assembly and stabilization of actin-filaments and likely plays a role in modulating cell adhesion, cell morphology and cell motility. This protein also localizes to the nucleus and may affect cardiomyocyte differentiation after binding with the CSX/NKX2-5 transcription factor. Alternative splicing results in multiple transcript variants encoding different isoforms.
9075	This gene encodes the human homolog of the yeast Imp3 protein. The protein localizes to the nucleoli and interacts with the U3 snoRNP complex. The protein contains an S4 domain.
9076	
9077	
9078	
9079	Members of the F-box protein family, such as FBXL19, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
9080	
9081	The protein encoded by this gene is a member of the ETS-domain transcription factor family and the ternary complex factor (TCF) subfamily. Proteins in this subfamily regulate transcription when recruited by serum response factor to bind to serum response elements. This protein is activated by signal-induced phosphorylation; studies in rodents suggest that it is a transcriptional inhibitor in the absence of Ras, but activates transcription when Ras is present.
9082	
9083	
9084	This gene is a member of the synapsin gene family. Synapsins encode neuronal phosphoproteins which associate with the cytoplasmic surface of synaptic vesicles. Family members are characterized by common protein domains, and they are implicated in synaptogenesis and the modulation of neurotransmitter release, suggesting a potential role in several neuropsychiatric diseases. This member of the synapsin family plays a role in regulation of axonogenesis and synaptogenesis. The protein encoded serves as a substrate for several different protein kinases and phosphorylation may function in the regulation of this protein in the nerve terminal. The TIMP1 gene is located within an intron of this gene and is transcribed in the opposite direction. Mutations in this gene may be associated with X-linked disorders with primary neuronal degeneration such as Rett syndrome. Differential splicing of this gene involving alternative use of splice acceptor sites results in two transcripts.
9085	
9086	
9087	
9088	
9089	The protein encoded by this gene is the alpha-3 subunit of one of several alpha/beta-subunit heterodimeric transcription factors that regulate many adaptive responses to low oxygen tension (hypoxia). The alpha-3 subunit lacks the transactivation domain found in factors containing either the alpha-1 or alpha-2 subunits. It is thought that factors containing the alpha-3 subunit are negative regulators of hypoxia-inducible gene expression. At least four transcript variants encoding four different isoforms have been found for this gene.
9090	
9091	
9092	
9093	
9094	The protein encoded by this gene is the founding member of the protein tyrosine phosphatase (PTP) family, which was isolated and identified based on its enzymatic activity and amino acid sequence. PTPs catalyze the hydrolysis of the phosphate monoesters specifically on tyrosine residues. Members of the PTP family share a highly conserved catalytic motif, which is essential for the catalytic activity. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP has been shown to act as a negative regulator of insulin signaling by dephosphorylating the phosphotryosine residues of insulin receptor kinase. This PTP was also reported to dephosphorylate epidermal growth factor receptor kinase, as well as JAK2 and TYK2 kinases, which implicated the role of this PTP in cell growth control, and cell response to interferon stimulation.
9095	Pleckstrin homology domains bind polyphosphoinositides and are involved in platelet cytoskeletal changes. This gene contains a (CAG)n trinucleotide repeat which has been associated with several neurological disorders in which the repeats code for polyglutamine in the protein. This gene may play an important role in the anti-apoptotic effects of insulin-like growth factor I.
9096	Expression of this gene is thought to serve as a marker for differentiated vascular smooth muscle cells which may have a role in regulating growth and differentiation of this cell type. The encoded protein is highly similar to the corresponding rat and mouse proteins. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of only one variant has been defined.
9097	
9098	
9099	
9100	
9101	
9102	Spectrin is a tetrameric cytoskeletal protein essential for determination of cell shape, resilience of membranes to mechanical stress, positioning of transmembrane proteins, and organization of organelles and molecular traffic. Alpha- and beta-spectrin subunits form antiparallel dimers that self associate to give the spectrin tetramer. Beta subunits, such as SPTBN1, contain most of the spectrin binding activity (Hayes et al., 2000).[supplied by OMIM]
9103	
9104	
9105	The cbl oncogene was first identified as part of a transforming retrovirus which induces mouse pre-B and pro-B cell lymphomas. As an adaptor protein for receptor protein-tyrosine kinases, it positively regulates receptor protein-tyrosine kinase ubiquitination in a manner dependent upon its variant SH2 and RING finger domains. Ubiquitination of receptor protein-tyrosine kinases terminates signaling by marking active receptors for degradation.
9106	
9107	
9108	Histone acetylation, particularly of histone H4, has been proposed to play an important role in replication-dependent nucleosome assembly.  The HAT1 protein contains D, A, and B motifs, which are present in many N-acetyltransferases, including those that acetylate substrates other than histones. The HAT1 holoenzyme consists of 2 subunits: the catalytic 46-kD HAT1 and the accessory p46. The p46 subunit stimulated the activity of HAT1 and bound to core histones. The HAT1 holoenzyme acetylated newly synthesized but not nucleosomal histone H4 at lys5 and lys12, and, to a lesser extent, histone H2A at lys5.  HAT1 and p46 polypeptides are located in the nucleus of S-phase cells.
9109	
9110	
9111	
9112	
9113	
9114	
9115	
9116	
9117	
9118	
9119	
9120	
9121	
9122	
9123	The protein encoded by this gene belongs to the TRIM protein family. It has multiple zinc finger motifs and a leucine zipper motif. It has been proposed to form homo- or heterodimers which are involved in nucleic acid binding. Thus, it may act as a transcriptional regulatory factor involved in carcinogenesis and/or differentiation. It may also function in the suppression of radiosensitivity since it is associated with ataxia telangiectasia phenotype. Multiple alternatively spliced transcript variants have been detected for this gene but the full-length nature of all of them has not been described yet. One of the transcripts of 2.4 kb, appears to be untranslated and probably nonfunctional.
9124	This gene encodes a serine protease expressed exclusively in the thymus. It is thought to play a role in the alternative antigen presenting pathway used by cortical thymic epithelial cells during the positive selection of T cells. The gene is found in the large histone gene cluster on chromosome 6, near the major histocompatibility complex (MHC) class I region. A second transcript variant has been described, but its full length nature has not been determined.
9125	
9126	
9127	
9128	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the S39E family of ribosomal proteins. It is located in the cytoplasm. In rat, the protein is the smallest, and one of the most basic, proteins of the ribosome. This gene is co-transcribed with the U69 small nucleolar RNA gene, which is located in its second intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
9129	Members of the Rab protein family are nontransforming monomeric GTP-binding proteins of the Ras superfamily that contain 4 highly conserved regions involved in GTP binding and hydrolysis. Rabs are prenylated, membrane-bound proteins involved in vesicular fusion and trafficking. The mammalian RAB proteins show striking similarities to the S. cerevisiae YPT1 and SEC4 proteins, Ras-related GTP-binding proteins involved in the regulation of secretion.[supplied by OMIM]
9130	
9131	
9132	
9133	
9134	This gene is a member of the XAGE subfamily, which belongs to the GAGE family. The GAGE genes are expressed in a variety of tumors and in some fetal and reproductive tissues. This gene is strongly expressed in normal testis, and in Ewing@#$%&s sarcoma, rhabdomyosarcoma, a breast cancer and a germ cell tumor. The protein encoded by this gene shares a sequence similarity with other GAGE/PAGE proteins. Because of the expression pattern and the sequence similarity, this protein also belongs to a family of CT (cancer-testis) antigens.
9135	
9136	
9137	
9138	The protein encoded by this gene is a type II membrane protein that can be cleaved by furin to produce a secreted form. The encoded protein, which belongs to the tumor necrosis factor family, acts as a homotrimer and may be involved in cell-cell signaling during the development of ectodermal organs. Defects in this gene are a cause of ectodermal dysplasia, anhidrotic, which is also known as X-linked hypohidrotic ectodermal dysplasia. Several transcript variants encoding many different isoforms have been found for this gene.
9139	This gene encodes one of eight subunits composing COP9 signalosome, a highly conserved protein complex that functions as an important regulator in multiple signaling pathways. The structure and function of COP9 signalosome is similar to that of the 19S regulatory particle of 26S proteasome. COP9 signalosome has been shown to interact with SCF-type E3 ubiquitin ligases and act as a positive regulator of E3 ubiquitin ligases.
9140	The protein encoded by this gene is a cytokine receptor that specifically mediates the biological effects of interleukin 9 (IL9). The functional IL9 receptor complex requires this protein as well as the interleukin 2 receptor, gamma (IL2RG), a common gamma subunit shared by the receptors of many different cytokines. The ligand binding of this receptor leads to the activation of various JAK kinases and STAT proteins, which connect to different biologic responses. This gene is located at the pseudoautosomal regions of X and Y chromosomes. Genetic studies suggested an association of this gene with the development of asthma. Multiple pseudogenes on chromosome 9, 10, 16, and 18 have been described. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
9141	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development,reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. It is thought that the protein encoded by this gene is cleaved at both ends to yield the active enzyme, but this processing has not been fully described. The enzyme degrades soluble and insoluble elastin. It may play a role in aneurysm formation and studies in mice suggest a role in the development of emphysema. The gene is part of a cluster of MMP genes which localize to chromosome 11q22.3.
9142	
9143	This gene encodes a protein belonging to the RAN-GTPase exportin family that mediates export of tRNA from the nucleus to the cytoplasm. Translocation of tRNA to the cytoplasm occurs once exportin has bound both tRNA and GTP-bound RAN.
9144	The protein encoded by this gene is a member of the G-protein coupled receptor family 1 and has been shown to be a receptor for prostacyclin. Prostacyclin, the major product of cyclooxygenase in macrovascular endothelium, elicits a potent vasodilation and inhibition of platelet aggregation through binding to this receptor.
9145	
9146	
9147	
9148	This gene encodes glucose-6-phosphate dehydrogenase. This protein is a cytosolic enzyme encoded by a housekeeping X-linked gene whose main function is to produce NADPH, a key electron donor in the defense against oxidizing agents and in reductive biosynthetic reactions. G6PD is remarkable for its genetic diversity. Many variants of G6PD, mostly produced from missense mutations, have been described with wide ranging levels of enzyme activity and associated clinical symptoms. G6PD deficiency may cause neonatal jaundice, acute hemolysis, or severe chronic non-spherocytic hemolytic anemia.
9149	Selenoproteins, such as SELO, contain the rare twenty-first amino acid, selenocysteine (sec). These proteins lack common amino acid sequence motifs, but the 3-prime untranslated regions of selenoprotein genes have a common stem-loop structure, the sec insertion sequence (SECIS), that is necessary for the recognition of UGA as a sec codon rather than as a stop signal.[supplied by OMIM]
9150	
9151	Isocitrate dehydrogenases catalyze the oxidative decarboxylation of isocitrate to 2-oxoglutarate. These enzymes belong to two distinct subclasses, one of which utilizes NAD(+) as the electron acceptor and the other NADP(+). Five isocitrate dehydrogenases have been reported: three NAD(+)-dependent isocitrate dehydrogenases, which localize to the mitochondrial matrix, and two NADP(+)-dependent isocitrate dehydrogenases, one of which is mitochondrial and the other predominantly cytosolic. NAD(+)-dependent isocitrate dehydrogenases catalyze the allosterically regulated rate-limiting step of the tricarboxylic acid cycle. Each isozyme is a heterotetramer that is composed of two alpha subunits, one beta subunit, and one gamma subunit. The protein encoded by this gene is the gamma subunit of one isozyme of NAD(+)-dependent isocitrate dehydrogenase. This gene is a candidate gene for periventricular heterotopia. Several alternatively spliced transcript variants of this gene have been described, but only some of their full length natures have been determined.
9152	
9153	The protein encoded by this gene is a small cellular or pericellular matrix proteoglycan that is closely related in structure to two other small proteoglycans, decorin and fibromodulin. The encoded protein and decorin are thought to be the result of a gene duplication. Decorin contains one attached glycosaminoglycan chain, while this protein probably contains two chains. For this reason, this protein is called biglycan. This protein is thought to function in connective tissue metabolism by binding to collagen fibrils and transfering growth factor-beta. It may promote neuronal survival. This gene is a candidate gene for the Happle syndrome.
9154	
9155	
9156	The protein encoded by this gene is similar in sequence to the amino terminus of Drosophila enhancer of split groucho, a protein involved in neurogenesis during embryonic development. The encoded protein, which belongs to the groucho/TLE family of proteins, can function as a homooligomer or as a heteroologimer with other family members to dominantly repress the expression of other family member genes. Three transcript variants encoding different isoforms have been found for this gene.
9157	This gene encodes a protein sharing sequence similarity with ribosomal protein L10. It is not currently known whether the encoded protein is a functional ribosomal protein or whether it has evolved a function that is independent of the ribosome. This gene is intronless.
9158	
9159	
9160	
9161	The globular WW domain is composed of 38 to 40 semiconserved amino acids shared by proteins of diverse functions including structural, regulatory, and signaling proteins. The domain is involved in mediating protein-protein interactions through the binding of polyproline ligands. This gene encodes a WW domain binding protein. This gene also encodes a domain with similarity to the transcription elongation factor A, SII-related family. Alternative splicing results in multiple transcript variants encoding a single isoform.
9162	This gene is thought to be involved in apoptosis. Multiple polyadenylation sites have been found for this gene.
9163	The Myc/Max/Mad network comprises a group of transcription factors that co-interact to regulate gene-specific transcriptional activation or repression. This gene encodes a protein member of the Myc/Max/Mad network. This protein has a basic-Helix-Loop-Helix-zipper domain (bHLHzip) with which it binds the canonical DNA sequence CANNTG, known as the E box, following heterodimerization with Max proteins. This protein is likely a transcriptional repressor and an antagonist of Myc-dependent transcriptional activation and cell growth. This protein represses transcription by binding to DNA binding proteins at its N-terminal Sin3-interaction domain.
9164	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. Studies in mouse suggest that this protein plays a role in DNA postreplication repair.
9165	This gene encodes retinoic acid receptor beta, a member of the thyroid-steroid hormone receptor superfamily of nuclear transcriptional regulators. This receptor localizes to the cytoplasm and to subnuclear compartments. It binds retinoic acid, the biologically active form of vitamin A which mediates cellular signalling in embryonic morphogenesis, cell growth and differentiation. It is thought that this protein limits growth of many cell types by regulating gene expression. The gene was first identified in a hepatocellular carcinoma where it flanks a hepatitis B virus integration site. The gene expresses at least two transcript variants; one additional transcript has been described, but its full length nature has not been determined.
9166	
9167	
9168	Proteins that contain a SET domain, such as SETDB2, modulate gene expression epigenetically through histone H3 (see MIM 601128) methylation. SETDB2 is likely a histone H3 methyltransferase, as it contains both the active site and flanking cysteine residues required for catalytic activity (Zhang et al., 2003).[supplied by OMIM]
9169	Proteins that contain a SET domain, such as SETDB2, modulate gene expression epigenetically through histone H3 (see MIM 601128) methylation. SETDB2 is likely a histone H3 methyltransferase, as it contains both the active site and flanking cysteine residues required for catalytic activity (Zhang et al., 2003).[supplied by OMIM]
9170	
9171	
9172	
9173	This gene is located in the nonrecombining portion of the Y chromosome, and is expressed specifically in testis. It encodes a protein which is similar to XK (X-linked Kell blood group precursor), a putative membrane transport protein. This gene is present as two identical copies within a palindromic region; this record represents the more centromeric copy.
9174	The secreted polypeptide noggin, encoded by the NOG gene, binds and inactivates members of the transforming growth factor-beta (TGF-beta) superfamily signaling proteins, such as bone morphogenetic protein-4 (BMP4).  By diffusing through extracellular matrices more efficiently than members of the TGF-beta superfamily, noggin may have a principal role in creating morphogenic gradients. Noggin appears to have pleiotropic effect, both early in development as well as in later stages.  It was originally isolated from Xenopus based on its ability to restore normal dorsal-ventral body axis in embryos that had been artificially ventralized by UV treatment.  The results of the mouse knockout of noggin suggest that it is involved in numerous developmental processes, such as neural tube fusion and joint formation.  Recently, several dominant human NOG mutations in unrelated families with proximal symphalangism (SYM1) and multiple synostoses syndrome (SYNS1) were identified; both SYM1 and SYNS1 have multiple joint fusion as their principal feature, and map to the same region (17q22) as NOG.  All NOG mutations altered evolutionarily conserved amino acid residues. The amino acid sequence of human noggin is highly homologous to that of Xenopus, rat and mouse.
9175	
9176	
9177	
9178	The retinoblastoma tumor suppressor (pRB) protein binds with many other proteins. In various human cancers, pRB suppresses cellular proliferation and is inactivated. Cell cycle-dependent phosphorylation regulates the activity of pRB. This gene encodes a protein which binds to underphosphorylated but not phosphorylated pRB. Multiple alternatively spliced transcript variants that encode different isoforms have been found for this gene.
9179	
9180	There is a single copy of this gene on chromosome Y at the center of the P3 palindrome.
9181	
9182	
9183	
9184	
9185	
9186	TCF21 encodes a transcription factor of the basic helix-loop-helix family. The TCF21 product is mesoderm specific, and expressed in embryonic epicardium, mesenchyme-derived tissues of lung, gut, gonad, and both mesenchymal and glomerular epithelial cells in the kidney. Two transcript variants encoding the same protein have been found for this gene.
9187	
9188	This gene encodes a disintegrin and metalloproteinase with thrombospondin motifs-1 (ADAMTS1), which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The protein encoded by this gene contains 2 disintegrin loops and 3 C-terminal TS motifs and has anti-angiogenic activity. The expression of this gene may be associated with various inflammatory processes as well as development of cancer cachexia. This gene is likely to be necessary for normal growth, fertility, and organ morphology and function.
9189	This gene encodes a key enzyme in catabolism of quinolinate, an intermediate in the tryptophan-nicotinamide adenine dinucleotide pathway. Quinolinate acts as a most potent endogenous exitotoxin to neurons. Elevation of quinolinate levels in the brain has been linked to the pathogenesis of neurodegenerative disorders such as epilepsy, Alzheimer@#$%&s disease, and Huntington@#$%&s disease.
9190	
9191	
9192	The protein encoded by this gene is a kinase that is predominantly found in the nucleus. However, it can interact with PDLIM1/CLP-36 in the cytoplasm and localize to actin stress fibers. The encoded protein may be a regulator of actin stress fibers in nonmuscle cells.
9193	
9194	
9195	The PALM2-AKAP2 mRNA is a naturally occurring co-transcribed product of the neighboring PALM2 and AKAP2 genes. The significance of this co-transcribed mRNA and the function of its protein product have not yet been determined. Alternative splicing of this gene results in several transcript variants encoding different isoforms, but the full-length nature of some of these variants has not been defined.
9196	
9197	
9198	Members of the F-box protein family, such as FBXO15, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
9199	
9200	
9201	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
9202	This gene is a member of the F-box/WD-40 gene family, which recruit specific target proteins through their WD-40 protein-protein binding domains for ubiquitin mediated degradation. In mouse, a highly similar protein is thought to be responsible for maintaining the apical ectodermal ridge of developing limb buds; disruption of the mouse gene results in the absence of central digits, underdeveloped or absent metacarpal/metatarsal bones and syndactyly. This phenotype is remarkably similar to split hand-split foot malformation in humans, a clinically heterogeneous condition with a variety of modes of transmission. An autosomal recessive form has been mapped to the chromosomal region where this gene is located, and complex rearrangements involving duplications of this gene and others have been associated with the condition. A pseudogene of this locus has been mapped to one of the introns of the BCR gene on chromosome 22.
9203	Formin homology (FH) domain proteins (e.g., FMN; MIM 136535) play a role in cytoskeletal organization and/or establishment of cell polarity.[supplied by OMIM]
9204	
9205	
9206	
9207	This gene encodes a member of the cysteine protease family C13 that is involved in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is a glycolipid found on many blood cells and serves to anchor proteins to the cell surface. This protein is a member of the multisubunit enzyme, GPI transamidase and is thought to be its enzymatic component. GPI transamidase mediates GPI anchoring in the endoplasmic reticulum, by catalyzing the transfer of fully assembled GPI units to proteins.
9208	
9209	
9210	
9211	This gene encodes an apolipoprotein E receptor, a member of the low density lipoprotein receptor (LDLR) family. Apolipoprotein E is a small lipophilic plasma protein and a component of lipoproteins such as chylomicron remnants, very low density lipoprotein (VLDL), and high density lipoprotein (HDL). The apolipoprotein E receptor is involved in cellular recognition and internalization of these lipoproteins. Alternative splicing generates multiple transcript variants encoding distinct isoforms for this gene.
9212	
9213	
9214	
9215	
9216	
9217	The protein encoded by this gene is a member of the G protein-coupled receptor family. This receptor was identified as a gene induced by the Epstein-Barr virus (EBV), and is thought to be a mediator of EBV effects on B lymphocytes. This receptor is expressed in various lymphoid tissues and activates B and T lymphocytes. It has been shown to control the migration of memory T cells to inflamed tissues, as well as stimulate dendritic cell maturation. The chemokine (C-C motif) ligand 19 (CCL19/ECL) has been reported to be a specific ligand of this receptor.
9218	Agrin is a neuronal aggregating factor that induces the aggregation of acetylcholine receptors and other postsynaptic proteins on muscle fibers and is crucial for the formation of the neuromuscular junction.[supplied by OMIM]
9219	This gene is a member of the Ig superfamily and encodes a cell surface sialoglycoprotein expressed by cytokine-activated endothelium. This type I membrane protein mediates leukocyte-endothelial cell adhesion and signal transduction, and may play a role in the development of artherosclerosis and rheumatoid arthritis. Two alternatively spliced transcripts encoding different isoforms have been described for this gene.
9220	
9221	The Fos gene family consists of 4 members: FOS, FOSB, FOSL1, and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation.
9222	
9223	
9224	
9225	
9226	This gene is a member of the gamma-aminobutyric acid (GABA) neurotransmitter gene family and encodes a high-affinity mammalian brain L-proline transporter protein. This transporter protein differs from other sodium-dependent plasma membrane carriers by its pharmacological specificity, kinetic properties, and ionic requirements.
9227	RAS genes (e.g., MIM 190020) encode membrane-bound guanine nucleotide-binding proteins that function in the transduction of signals that control cell growth and differentiation. Binding of GTP activates RAS proteins, and subsequent hydrolysis of the bound GTP to GDP and phosphate inactivates signaling by these proteins. GTP binding can be catalyzed by guanine nucleotide exchange factors for RAS, and GTP hydrolysis can be accelerated by GTPase-activating proteins (GAPs). The first exchange factor to be identified for RAS was the S. cerevisiae CDC25 gene product. Genetic analysis indicated that CDC25 is essential for activation of RAS proteins. In Drosophila, the protein encoded by the @#$%&son of sevenless@#$%& gene (Sos) contains a domain that shows sequence similarity with the catalytic domain of CDC25. Sos may act as a positive regulator of RAS by promoting guanine nucleotide exchange.[supplied by OMIM]
9228	
9229	
9230	
9231	
9232	
9233	
9234	
9235	This gene encodes a member of the IGF-II mRNA-binding protein (IMP) family. The protein encoded by this gene contains four K homology domains and two RNA recognition motifs. It functions by binding to the 5@#$%& UTR of the insulin-like growth factor 2 (IGF2) mRNA and regulating IGF2 translation.
9236	
9237	
9238	
9239	
9240	This gene encodes a Type I transmembrane glycoprotein that is an integral component of the multimeric gamma-secretase complex. The encoded protein cleaves integral membrane proteins, including Notch receptors and beta-amyloid precursor protein, and may be a stabilizing cofactor required for gamma-secretase complex assembly. The cleavage of beta-amyloid precursor protein yields amyloid beta peptide, the main component of the neuritic plaque and the hallmark lesion in the brains of patients with Alzheimer@#$%&s disease; however, the nature of the encoded protein@#$%&s role in Alzheimer@#$%&s disease is not known for certain. Alternatively spliced transcript variants have been described, but their full-length nature has not been determined.
9241	
9242	The protein encoded by this gene contains a type 1 thrombospondin domain, which is present in thrombospondin, a number of proteins involved in the complement pathway, as well as in extracellular matrix proteins. Three alternatively spliced transcript variants encoding distinct isoforms have been observed.
9243	The protein encoded by this gene forms a potassium channel that is thought to play a critical role in the regulation of neuronal excitability, particularly in sensory cells of the cochlea. The current generated by this channel is inhibited by M1 muscarinic acetylcholine receptors and activated by retigabine, a novel anti-convulsant drug. The encoded protein can form a homomultimeric potassium channel or possibly a heteromultimeric channel in association with the protein encoded by the KCNQ3 gene. Defects in this gene are a cause of nonsyndromic sensorineural deafness type 2 (DFNA2), an autosomal dominant form of progressive hearing loss. Two transcript variants encoding different isoforms have been found for this gene.
9244	
9245	
9246	This gene encodes a calcium-binding protein belonging to the penta-EF-hand protein family. Calcium binding is important for homodimerization and for conformational changes required for binding to other protein partners. This gene product participates in T cell receptor-,  Fas-, and glucocorticoid-induced programmed cell death. In mice deficient for this gene product, however, apoptosis was not blocked suggesting this gene product is functionally redundant.
9247	
9248	
9249	
9250	
9251	This gene includes 11 exons spanning 25 kb and maps to a region of chromosome 16 that is sometimes deleted in breast and prostrate cancer. The second intron contains an apparently intronless gene, C16orf3, that is transcribed in the opposite orientation. This gene is a putative tumor suppressor gene.
9252	LOR encodes loricrin, a major protein component of the cornified cell envelope found in terminally differentiated epidermal cells. Mutations in LOR may be the cause of both Vohwinkel@#$%&s syndrome and progressive symmetric erythrokeratoderma, both inherited skin diseases.
9253	Synaptopodin is an actin-associated protein that may play a role in actin-based cell shape and motility. The name synaptopodin derives from the protein@#$%&s associations with postsynaptic densities and dendritic spines and with renal podocytes (Mundel et al., 1997).[supplied by OMIM]
9254	
9255	
9256	
9257	
9258	
9259	
9260	
9261	
9262	
9263	
9264	Eukaryotic protein kinases are enzymes that belong to a very extensive family of proteins which share a conserved catalytic core common with both serine/threonine and tyrosine protein kinases. This gene encodes an intestinal serine/threonine kinase harboring a dual phosphorylation site found in mitogen-activating protein (MAP) kinases. The protein localizes to the intestinal crypt region and is thought to be important in intestinal epithelial cell proliferation and differentiation. Alternative splicing has been observed at this locus and two variants, encoding the same isoform, have been identified.
9265	
9266	
9267	
9268	In Drosophila melanogaster, the @#$%&Polycomb@#$%& group (PcG) of genes are part of a cellular memory system that is responsible for the stable inheritance of gene activity. PcG proteins form a large multimeric, chromatin-associated protein complex. The protein encoded by this gene has homology to the Drosophila PcG protein @#$%&polyhomeotic@#$%& (Ph) and is known to heterodimerize with EDR1 and colocalize with BMI1 in interphase nuclei of human cells. The specific function in human cells has not yet been determined. Two transcript variants encoding different isoforms have been found for this gene.
9269	The protooncogene PIM1 encodes a protein kinase upregulated in prostate cancer.[supplied by OMIM]
9270	
9271	
9272	
9273	
9274	
9275	
9276	
9277	Epoxide hydrolase plays an important role in both the activation and detoxification of exogenous chemicals such as polycyclic aromatic hydrocarbons.
9278	
9279	
9280	
9281	
9282	This gene encodes an alpha-1 subunit of a voltage-dependent calcium channel. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization. The alpha-1 subunit consists of 24 transmembrane segments and forms the pore through which ions pass into the cell. The calcium channel consists of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. There are multiple isoforms of each of these proteins, either encoded by different genes or the result of alternative splicing of transcripts. Alternate transcriptional splice variants of the gene described here have been observed but have not been thoroughly characterized.
9283	This gene belongs to the forkhead family of transcription factors which are characterized by a distinct forkhead domain. This gene likely functions as a trigger for apoptosis through expression of genes necessary for cell death. Translocation of this gene with the MLL gene is associated with secondary acute leukemia. Alternatively spliced transcript variants encoding the same protein have been observed.
9284	The Fos gene family consists of 4 members: FOS, FOSB, FOSL1, and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation.
9285	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the peptidase T1A family, that is a 20S core alpha subunit.
9286	
9287	
9288	The protein encoded by this gene is a cytokine that shares the sequence similarity with IL17. The treatment of endothelial cells with this cytokine has been shown to stimulate the production of other cytokines including IL6, IL8 and CSF2/ GM-CSF. The increased expression of IL8 induced by this cytokine was found to be NF-kappa B-dependent.
9289	
9290	
9291	
9292	
9293	
9294	
9295	
9296	
9297	
9298	
9299	
9300	
9301	
9302	
9303	
9304	
9305	
9306	
9307	
9308	This gene encodes the general transcription factor IIB, one of the ubiquitous factors required for transcription initiation by RNA polymerase II. The protein localizes to the nucleus where it forms a complex (the DAB complex) with transcription factors IID and IIA. Transcription factor IIB serves as a bridge between IID, the factor which initially recognizes the promoter sequence, and RNA polymerase II.
9309	
9310	
9311	
9312	
9313	
9314	
9315	
9316	Renin catalyzes the first step in the activation pathway of angiotensinogen--a cascade that can result in aldosterone release,vasoconstriction, and increase in blood pressure. Renin, an aspartyl protease, cleaves angiotensinogen to form angiotensin I, which is converted to angiotensin II by angiotensin I converting enzyme, an important regulator of blood pressure and electrolyte balance. Transcript variants that encode different protein isoforms and that arise from alternative splicing and the use of alternative promoters have been described, but their full-length nature has not been determined. Mutations in this gene have been shown to cause familial hyperproreninemia.
9317	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. This gene and the gene for a semaphorin class 4 protein (SEMA4G) overlap at map location 10q24.31 and are transcribed in opposite directions. Sequence analysis identified multiple transcript variants encoding at least four different protein isoforms.
9318	
9319	
9320	
9321	
9322	The protein encoded by this gene is a member of the type 3 G protein-coupled receptor family. Members of this superfamily are characterized by a signature 7-transmembrane domain motif. The specific function of this protein is unknown; however, this protein may mediate the cellular effects of retinoic acid on the G protein signal transduction cascade. Two transcript variants encoding different isoforms have been found for this gene.
9323	T-cell receptor zeta, together with T-cell receptor alpha/beta and gamma/delta heterodimers and CD3-gamma, -delta, and -epsilon, forms the T-cell receptor-CD3 complex. The zeta chain plays an important role in coupling antigen recognition to several intracellular signal-transduction pathways. Low expression of the antigen results in impaired immune response. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
9324	
9325	In mouse, recombining binding protein L (RBP-L) is a transcription factor that binds to DNA sequences almost identical to that bound by the Notch receptor signalling pathway transcription factor RBP-J. However, unlike RBP-J, RBP-L does not interact with Notch receptors. RBP-L has been shown to activate transcription in concert with Epstein-Barr virus nuclear antigen-2 (EBNA2). The protein encoded by this gene is similar in sequence to the mouse RPB-L protein and Drosophila suppressor of hairless protein.
9326	This gene encodes a member of the alpha-2/delta subunit family, a protein in the voltage-dependent calcium channel complex. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization and consist of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. Various versions of each of these subunits exist, either expressed from similar genes or the result of alternative splicing. Research on a highly similar protein in rabbit suggests the protein described in this record is cleaved into alpha-2 and delta subunits. Alternate transcriptional splice variants of this gene, encoding different isoforms, have been characterized.
9327	
9328	
9329	The protein encoded by this gene is the tropomyosin-binding subunit of the troponin complex, which is located on the thin filament of striated muscles and regulates muscle contraction in response to alterations in intracellular calcium ion concentration. Mutations in this gene have been associated with familial hypertrophic cardiomyopathy as well as with dilated cardiomyopathy. Transcripts for this gene undergo alternative splicing that results in many tissue-specific isoforms, however, the full length nature of some of these variants has not yet been determined.
9330	
9331	
9332	
9333	
9334	The protein encoded by this gene is similar to the yeast Mob1 protein. Yeast Mob1 binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation.
9335	The protein encoded by this gene is a type III transmembrane adaptor protein that binds to the tyrosine kinase csk protein. It is thought to be involved in the regulation of T cell activation.
9336	
9337	
9338	
9339	
9340	This gene belongs to a family of Sp1 related genes that encode transcription factors that regulate transcription by binding to consensus GC- and GT-box regulatory elements in target genes. This protein contains a zinc finger DNA-binding domain and several transactivation domains, and has been reported to function as a bifunctional transcription factor that either stimulates or represses the transcription of numerous genes. Transcript variants encoding different isoforms have been described for this gene, and one has been reported to initiate translation from a non-AUG (AUA) start codon. Additional isoforms, resulting from the use of alternate downstream translation initiation sites, have also been noted.
9341	The protein encoded by this gene is an enzymatic component of the tricarboxylic acid (TCA) cycle, or Krebs cycle, and catalyzes the formation of L-malate from fumarate. It exists in both a cytosolic form and an N-terminal extended form, differing only in the translation start site used. The N-terminal extended form is targeted to the mitochondrion, where the removal of the extension generates the same form as in the cytoplasm. It is similar to some thermostable class II fumarases and functions as a homotetramer. Mutations in this gene can cause fumarase deficiency and lead to progressive encephalopathy.
9342	
9343	DVL1, the human homolog of the Drosophila dishevelled gene (dsh) encodes a cytoplasmic phosphoprotein that regulates cell proliferation, acting as a transducer molecule for developmental processes, including segmentation and neuroblast specification. DVL1 is a candidate gene for neuroblastomatous transformation. The Schwartz-Jampel syndrome and Charcot-Marie-Tooth disease type 2A have been mapped to the same region as DVL1. The phenotypes of these diseases may be consistent with defects which might be expected from aberrant expression of a DVL gene during development. Three transcript variants encoding three different isoforms have been found for this gene.
9344	The protein encoded by this gene is a lysosomal cysteine proteinase involved in bone remodeling and resorption. This protein, which is a member of the peptidase C1 protein family, is predominantly expressed in osteoclasts. However, the encoded protein is also expressed in a significant fraction of human breast cancers, where it could contribute to tumor invasiveness. Mutations in this gene are the cause of pycnodysostosis, an autosomal recessive disease characterized by osteosclerosis and short stature. This gene may be subject to RNA editing.
9345	This gene is a member of the frizzled gene family. Members of this family encode seven-transmembrane domain proteins that are receptors for the Wingless type MMTV integration site family of signaling proteins. Most frizzled receptors are coupled to the beta-catenin canonical signaling pathway. The function of this protein is unknown, although it may play a role in mammalian hair follicle development.
9346	
9347	
9348	
9349	This gene encodes a subunit of the N-methyl-D-aspartate (NMDA) receptors, which belong to the superfamily of glutamate-regulated ion channels, and function in physiological and pathological processes in the central nervous system. This subunit shows greater than 90% identity to the corresponding subunit in rat. Studies in the knockout mouse deficient in this subunit suggest that this gene may be involved in the development of synaptic elements by modulating NMDA receptor activity.
9350	
9351	
9352	
9353	
9354	
9355	
9356	
9357	
9358	The protein encoded by this gene is a member of kinesin-like protein family. Proteins of this family are microtubule-dependent molecular motors that transport organelles within cells and move chromosomes during cell division. This protein is important for anaphase chromosome segregation and may be required to coordinate the onset of sister centromere separation.
9359	This gene encodes a H1 histone binding protein that is involved in transporting histones into the nucleus of dividing cells. Multiple isoforms are encoded by transcript variants of this gene. The somatic form is expressed in all mitotic cells, is localized to the nucleus, and is coupled to the cell cycle. The testicular form is expressed in embryonic tissues, tumor cells, and the testis. In male germ cells, this protein is localized to the cytoplasm of primary spermatocytes, the nucleus of spermatids, and the periacrosomal region of mature spermatozoa.
9360	
9361	This gene encodes a member of the neurotrophic tyrosine kinase receptor (NTKR) family. This kinase is a membrane-bound receptor that, upon neurotrophin binding, phosphorylates itself and members of the MAPK pathway. The presence of this kinase leads to cell differentiation and may play a role in specifying sensory neuron subtypes. Mutations in this gene have been associated with congenital insensitivity to pain, anhidrosis, self-mutilating behavior, mental retardation and cancer. Alternate transcriptional splice variants of this gene have been found, but only three have been characterized to date.
9362	This gene encodes an integral membrane protein associated with presynaptic vesicles in neuronal cells. The exact function of this protein is unclear, but studies of a similar murine protein suggest that it functions in synaptic plasticity without being required for synaptic transmission. The gene product belongs to the synaptogyrin gene family. Three alternatively spliced variants encoding three different isoforms have been identified.
9363	The Fos gene family consists of 4 members: FOS, FOSB, FOSL1, and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation.
9364	
9365	
9366	This gene encodes a member of the plakin protein family of adhesion junction plaque proteins. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the full-length nature of some variants has not been defined. It has been known that some isoforms are expressed in neural and muscle tissue, anchoring neural intermediate filaments to the actin cytoskeleton, and some isoforms are expressed in epithelial tissue, anchoring keratin-containing intermediate filaments to hemidesmosomes. Consistent with the expression, mice defective for this gene show skin blistering and neurodegeneration.
9367	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role in cells. The protein encoded by this gene is one of the PKC family members. This kinase has been shown to be involved in many different cellular functions, such as neuron channel activation, apoptosis, cardioprotection from ischemia, heat shock response, as well as insulin exocytosis. Knockout studies in mice suggest that this kinase is important for lipopolysaccharide (LPS)-mediated signaling in activated macrophages and may also play a role in controlling anxiety-like behavior.
9368	
9369	
9370	
9371	
9372	
9373	
9374	
9375	This gene encodes a T-type member of the alpha-1 subunit family, a protein in the voltage-dependent calcium channel complex. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization and consist of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. The alpha-1 subunit has 24 transmembrane segments and forms the pore through which ions pass into the cell. There are multiple isoforms of each of the proteins in the complex, either encoded by different genes or the result of alternative splicing of transcripts. Alternate transcriptional splice variants, encoding different isoforms, have been characterized for the gene described here. Studies suggest certain mutations in this gene lead to childhood absence epilepsy (CAE).
9376	This gene encodes a secreted protein resembling members of the ADAMTS (a disintegrin and metalloproteinase with thrombospondin motif) family. This protein lacks the propeptide region and the metalloproteinase and disintegrin-like domains, which are typical of the ADAMTS family, but contains other ADAMTS domains, including the thrombospondin type 1 motif. This protein may have important functions in the extracellular matrix. Alternative splicing of this gene results in 3 transcript variants encoding different isoforms.
9377	
9378	The nuclear lamina consists of a two-dimensional matrix of proteins located next to the inner nuclear membrane. The lamin family of proteins make up the matrix and are highly conserved in evolution. During mitosis, the lamina matrix is reversibly disassembled as the lamin proteins are phosphorylated. Lamin proteins are thought to be involved in nuclear stability, chromatin structure and gene expression. Vertebrate lamins consist of two types, A and B. Through alternate splicing, this gene encodes three type A lamin isoforms. Mutations in this gene lead to several diseases: Emery-Dreifuss muscular dystrophy, familial partial lipodystrophy, limb girdle muscular dystrophy, dilated cardiomyopathy, Charcot-Marie-Tooth disease, and Hutchinson-Gilford progeria syndrome.
9379	
9380	This gene encodes a member of the peptidase S1 family. The encoded protein is a serum calcium-decreasing factor that has chymotrypsin-like protease activity. Alternatively spliced transcript variants have been observed, but their full-length nature has not been determined.
9381	
9382	
9383	FOXJ1 is a member of the forkhead gene family, which was originally identified in Drosophila. The forkhead family is composed of transcription factors with a conserved 100-amino acid DNA-binding motif.[supplied by OMIM]
9384	
9385	
9386	
9387	The protein encoded by this gene is a potent growth promoting cytokine. This cytokine is capable of supporting the proliferation of a broad range of hematopoietic cell types. It is involved in a variety of cell activities such as cell growth, differentiation and apoptosis. This cytokine has been shown to also possess neurotrophic activity, and it may be associated with neurologic disorders.
9388	
9389	
9390	The protein encoded by this gene is a member of the SEC23 subfamily of the SEC23/SEC24 family. It is part of a protein complex and found in the ribosome-free transitional face of the endoplasmic reticulum (ER) and associated vesicles. This protein has similarity to yeast Sec23p component of COPII. COPII is the coat protein complex responsible for vesicle budding from the ER. The encoded protein is suggested to play a role in the ER-Golgi protein trafficking.
9391	
9392	
9393	
9394	
9395	
9396	
9397	
9398	
9399	Activating transcription factor 3 (ATF3)is a member of the mammalian activation transcription factor/cAMP responsive element-binding (CREB) protein family of transcription factors.  It encodes a protein with a calculated molecular mass of 22 kD. ATF3 represses rather than activates transcription from promoters with ATF binding elements. An alternatively spliced form of ATF3 (ATF3 delta Zip) encodes a truncated form ATF3 protein lacking the leucine zipper protein-dimerization motif and does not bind to DNA. In contrast to ATF3, ATF3 delta Zip stimulates transcription presumably by sequestering inhibitory co-factors away from the promoter.  It is possible that alternative splicing of the ATF3 gene may be physiologically important in the regulation of target genes.
9400	
9401	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a membrane protein which has sequence similarity to the KCNE1 gene product, a member of the potassium channel, voltage-gated, isk-related subfamily. This intronless gene is deleted in AMME contiguous gene syndrome and may be involved in the cardiac and neurologic abnormalities found in the AMME contiguous gene syndrome.
9402	
9403	
9404	Xylosyltransferase (XT; EC 2.4.2.26) catalyzes the transfer of UDP-xylose to serine residues within XT recognition sequences of target proteins. Addition of this xylose to the core protein is required for the biosynthesis of the glycosaminoglycan chains characteristic of proteoglycans.[supplied by OMIM]
9405	
9406	
9407	The female gamete, the oocyte, serves the distinct purpose of transmitting the maternal genome and other maternal factors critical for postovulation events. Oocytes have diverse functions in ovarian folliculogenesis, fertilization, and embryogenesis. ZAR1 is an oocyte-specific gene that appears to function at the oocyte-to-gamete transition (Wu et al., 2003).[supplied by OMIM]
9408	
9409	This gene encodes a member of the epidermal growth factor receptor (EGFR) family of receptor tyrosine kinases. This membrane-bound protein has a neuregulin binding domain but not an active kinase domain. It therefore can bind this ligand but not convey the signal into the cell through protein phosphorylation. However, it does form heterodimers with other EGF receptor family members which do have kinase activity. Heterodimerization leads to the activation of pathways which lead to cell proliferation or differentiation. Amplification of this gene and/or overexpression of its protein have been reported in numerous cancers, including prostate, bladder, and breast tumors. Alternate transcriptional splice variants encoding different isoforms have been characterized. One isoform lacks the intermembrane region and is secreted outside the cell. This form acts to modulate the activity of the membrane-bound form. Additional splice variants have also been reported, but they have not been thoroughly characterized.
9410	
9411	
9412	
9413	
9414	
9415	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. A pseudogene corresponding to this gene is found at 8q21.11.
9416	
9417	
9418	
9419	Members of the F-box protein family, such as FBXO41, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
9420	
9421	
9422	
9423	
9424	Phosducin-like protein is a putative modulator of heterotrimeric G proteins. The protein shares extensive amino acid sequence homology with phosducin, a phosphoprotein expressed in retina and pineal gland. Both phosducin-like protein and phosphoducin have been shown to regulate G-protein signaling by binding to the beta-gamma subunits of G proteins.
9425	
9426	
9427	
9428	
9429	
9430	
9431	
9432	
9433	
9434	
9435	
9436	
9437	
9438	
9439	HLA-A belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-A alleles have been described.
9440	
9441	
9442	
9443	
9444	This gene encodes a protein that acts as a repressor of beta-interferon gene expression. The protein binds specifically to the PRDI (positive regulatory domain I element) of the beta-IFN gene promoter. Transcription of this gene increases upon virus induction. Two alternatively spliced transcript variants that encode different isoforms have been reported.
9445	This gene encodes a member of the Ser/Thr protein kinase family. This kinase is regulated through direct phosphorylation by p38 MAP kinase. In conjunction with p38 MAP kinase, this kinase is known to be involved in many cellular processes including stress and inflammatory responses, nuclear export, gene expression regulation and cell proliferation. Heat shock protein HSP27 was shown to be one of the substrates of this kinase in vivo. Two transcript variants encoding two different isoforms have been found for this gene.
9446	This gene encodes a member of the serine/threonine protein kinase family, that is highly similar to yeast SPS1/STE20 kinase. Yeast SPS1/STE20 functions near the beginning of the MAP kinase signal cascades that is essential for yeast pheromone response. This kinase was shown to activate Jun kinase in mammalian cells, which suggested a role in stress response. Two alternatively spliced transcript variants encoding the same protein have been described for this gene.
9447	This gene encodes a disintegrin and metalloprotease (ADAM) domain 12, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving  cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. This gene has 2 alternative splicing transcript forms: a shorter secreted form and a longer membrane-bound form, that diverge at their 3@#$%& ends. The shorter form is found to stimulate myogenesis.
9448	
9449	
9450	
9451	FCRL2 belongs to the Fc receptor family. Fc receptors are involved in phagocytosis, antibody-dependent cell cytotoxicity, immediate hypersensitivity, and transcytosis of immunoglobulins via their ability to bind immunoglobulin (Ig) constant regions (Chikaev et al., 2005).[supplied by OMIM]
9452	
9453	
9454	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region like some family members, and its function is unknown. This gene results in two transcript variants encoding two distinct isoforms.
9455	
9456	
9457	
9458	The protein encoded by this gene is a regulatory subunit of the AMP-activated protein kinase (AMPK). AMPK is a heterotrimer consisting of an alpha catalytic subunit, and non-catalytic beta and gamma subunits. AMPK is an important energy-sensing enzyme that monitors cellular energy status. In response to cellular metabolic stresses, AMPK is activated, and thus phosphorylates and inactivates acetyl-CoA carboxylase (ACC) and beta-hydroxy beta-methylglutaryl-CoA reductase (HMGCR), key enzymes involved in regulating de novo biosynthesis of fatty acid and cholesterol. This subunit may be a positive regulator of AMPK activity. It is highly expressed in skeletal muscle and thus may have tissue-specific roles.
9459	
9460	
9461	This gene encodes a member of the sestrin family of PA26-related proteins. The encoded protein may function in the regulation of cell growth and survival. This protein may be involved in cellular response to different stress conditions.
9462	
9463	This gene is a member of the UBA domain family, whose members include proteins having connections to ubiquitin and the ubiquitination pathway. The ubiquitin associated domain is thought to be a non-covalent ubiquitin binding domain consisting of a compact three helix bundle. This particular protein originates from a gene locus in a refined region on chromosome 9 undergoing loss of heterozygosity in nasopharyngeal carcinoma (NPC). Taking into account its cytogenetic location, this UBA domain family member is being studies as a putative target for mutation in nasopharyngeal carcinomas.
9464	
9465	The protein encoded by this gene is a cytokine that belongs to the IL10 cytokine subfamily. This cytokine is found to be preferentially expressed in monocytes. It can bind the IL20 receptor complex and lead to the activation of the signal transducer and activator of transcription 3 (STAT3). A similar cytokine in mouse is reported to up-regulate the expression of IL6 and TNF-alpha and induce apoptosis, which suggests a role of this cytokine in inflammatory responses. Alternatively spliced transcript variants encoding the distinct isoforms have been described.
9466	This gene encodes a protein sharing a low level of sequence similarity with ribosomal protein P0. The function of the encoded protein is currently unknown. This gene uses alternative polyA sites.
9467	
9468	This gene encodes a member of the STAT-induced STAT inhibitor (SSI), also known as suppressor of cytokine signaling (SOCS), family. SSI family members are cytokine-inducible negative regulators of cytokine signaling. The expression of this gene is induced by various cytokines, including IL6, IL10, and interferon (IFN)-gamma. The protein encoded by this gene can bind to JAK2 kinase, and inhibit the activity of JAK2 kinase. Studies of the mouse counterpart of this gene suggested the roles of this gene in the negative regulation of fetal liver hematopoiesis, and placental development.
9469	
9470	
9471	
9472	The M channel is a slowly activating and deactivating potassium channel that plays a critical role in the regulation of neuronal excitability.  The M channel is formed by the association of the protein encoded by this gene and a related protein encoded by the KCNQ3 gene, both integral membrane proteins. M channel currents are inhibited by M1 muscarinic acetylcholine receptors and activated by retigabine, a novel anti-convulsant drug. Defects in this gene are a cause of benign familial neonatal convulsions type 1 (BFNC), also known as epilepsy, benign neonatal type 1 (EBN1). At least five transcript variants encoding five different isoforms have been found for this gene.
9473	
9474	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
9475	
9476	
9477	
9478	
9479	
9480	Pre-mRNA splicing occurs in two sequential transesterification steps. The protein encoded by this gene is found to be essential for the catalytic step II in pre-mRNA splicing process. It is found in the spliceosome, and contains seven WD repeats, which function in protein-protein interactions. This protein has a sequence similarity to yeast Prp17 protein, which functions in two different cellular processes: pre-mRNA splicing and cell cycle progression. It suggests that this protein may play a role in cell cycle progression.
9481	
9482	
9483	
9484	
9485	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes a small subunit associated with a subset of TFIID complexes. This subunit interacts with TBP and with two other small subunits of TFIID, TAF10 and TAF11. There is a pseudogene located on chromosome 6.
9486	
9487	This gene encodes disintegrin and metalloprotease (ADAM) domain 23, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving  cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. Like ADAM22, this gene is highly expressed in the brain and may function as an integrin ligand in the brain. However, this gene and ADAM22 have different chromosomal localizations.
9488	The mitochondrial deoxyribonucleotide (dNTP) pool is separated from the cytosolic pool because the mitochondria inner membrane is impermeable to charged molecules. The mitochondrial pool is maintained by either import of cytosolic dNTPs through dedicated transporters or by salvaging deoxynucleosides within the mitochondria; apparently, enzymes of the de novo dNTP synthesis pathway are not present in the mitochondria. In nonreplicating cells, where cytosolic dNTP synthesis is downregulated, mtDNA synthesis depends solely on the mitochondrial salvage pathway enzymes, the deoxyribonucleoside kinases. Two of the 4 human deoxyribonucleoside kinases, deoxyguanosine kinase (DGK) and thymidine kinase-2, are expressed in mitochondria. Human DGK, encoded by the DGUOK gene (MIM 601465), efficiently phosphorylates deoxyguanosine and deoxyadenosine, whereas TK2 phosphorylates deoxythymidine, deoxycytidine, and deoxyuridine. Thymidine kinase-2 (TK2) is a deoxyribonucleoside kinase that phosphorylates thymidine, deoxycytidine, and deoxyuridine, and also phosphorylates antiviral and anticancer nucleoside analogs.[supplied by OMIM]
9489	This gene encodes the enzyme Man(9)GlcNAc(2)-PP-Dol alpha-1,3-glucosyltransferase. This enzyme participates in the glucosylation of the oligomannose core. The addition of glucose residues to the oligomannose core is necessary to ensure substrate recognition, and therefore, effectual transfer of the oligomannose core to the nascent glycoproteins. Mutations in this gene result in an accumulation of glucosylated lipid-linked oligosaccharides which leads to carbohydrate-deficient glycoprotein syndrome.
9490	This gene encodes a protein that interacts with the gamma subunit of AP1 clathrin-adaptor complex. The AP1 complex is located at the trans-Golgi network and associates specific proteins with clathrin-coated vesicles. This encoded protein may act to connect the AP1 complex to other proteins. Alternatively spliced transcript variants that encode different isoforms have been described for this gene.
9491	The protein encoded by this gene localizes to focal adhesions, regions of the plasma membrane where the cell attaches to the extracellular matrix. This protein crosslinks actin filaments and contains a Src homology 2 (SH2) domain, which is often found in molecules involved in signal transduction. This protein is a substrate of calpain II. A second transcript from this gene has been described, but its full length nature has not been determined.
9492	This gene encodes a protein which contains a proline-rich region. The encoded protein may function in renal tubular development and function. This protein interacts with nephrocystin. Mutations in this gene are associated with nephronophthisis type 4. Multiple alternative transcript variants have been described but their full-length nature has not been determined.
9493	
9494	PLCD3 belongs to a large family of enzymes that catalyze the hydrolysis of phosphatidylinositol 4,5-bisphosphate (PtdIns(4,5)P2) to generate the second messengers Ins(1,4,5)P3 and diacylglycerol, which mediate a wide variety of cellular stimuli.[supplied by OMIM]
9495	
9496	
9497	CD1B is a group-1 member of the CD1 family of major histocompatibility (MHC)-like glycoproteins. See CD1A (MIM 188370) for background information on CD1 molecules.[supplied by OMIM]
9498	
9499	
9500	
9501	
9502	This gene encodes an essential eukaryotic translation initiation factor. The protein is required for the binding of the 43S complex (a 40S subunit, eIF2/GTP/Met-tRNAi and eIF3) to the 5@#$%& end of capped RNA.
9503	
9504	
9505	
9506	
9507	This gene encodes an integral outer mitochondrial membrane protein that blocks the apoptotic death of some cells such as lymphocytes. Constitutive expression of BCL2, such as in the case of translocation of BCL2 to Ig heavy chain locus, is thought to be the cause of follicular lymphoma. Two transcript variants, produced by alternate splicing, differ in their C-terminal ends.
9508	This gene product is a member of the serine-arginine (SR) family of proteins, which is involved in constitutive and regulated RNA splicing. Members of this family are characterized by N-terminal RNP1 and RNP2 motifs, which are required for binding to RNA, and multiple C-terminal SR/RS repeats, which are important in mediating association with other cellular proteins. This protein can influence splice site selection of adenovirus E1A pre-mRNA. It interacts with the oncoprotein TLS, and abrogates the influence of TLS on E1A pre-mRNA splicing. Alternative splicing of this gene results in at least two transcript variants encoding different isoforms. In addition, transcript variants utilizing alternative polyA sites exist.
9509	
9510	
9511	
9512	
9513	
9514	This gene encodes a member of the @#$%&regulator of G protein signaling@#$%& (RGS) gene family. The encoded protein may function as a guanosine triphosphatase (GTPase)-activating protein as well as a transcriptional repressor. This protein may play a role in tumorigenesis. Multiple transcript variants encoding distinct isoforms have been identified for this gene. Other alternative splice variants have been described but their biological nature has not been determined.
9515	
9516	
9517	C8 beta is one of the three subunits that comprise the component 8 (C8) of the complement system.  C8 participates in the formation of Membrane Attack Complex that results in the lysis of cells.  Patients with C8B deficiency are prone to bacteria infection.
9518	The protein encoded by this gene is membrane-associated and contains five ankyrin repeats, a protein phosphatase-1-interacting domain, and a carboxy-terminal CAAX box domain. Synthesis of the encoded protein is inhibited by transforming growth factor beta-1. The protein may bind to the membrane through its CAAX box domain and may act as a signaling molecule through interaction with protein phosphatase-1.
9519	
9520	
9521	
9522	This gene encodes a trypsinogen, which is a member of the trypsin family of serine proteases. This enzyme is secreted by the pancreas and cleaved to its active form in the small intestine. It is active on peptide linkages involving the carboxyl group of lysine or arginine. Mutations in this gene are associated with hereditary pancreatitis. This gene and several other trypsinogen genes are localized to the T cell receptor beta locus on chromosome 7.
9523	
9524	Glutamate-cysteine ligase, also known as gamma-glutamylcysteine synthetase is the first rate limiting enzyme of glutathione synthesis. The enzyme consists of two subunits, a heavy catalytic subunit and a light regulatory subunit. The gene encoding the catalytic subunit encodes a protein of 367 amino acids with a calculated molecular weight of 72.773 kDa and maps to chromosome 6. The regulatory subunit is derived from a different gene located on chromosome 1p22-p21. Deficiency of gamma-glutamylcysteine synthetase in human is associated with enzymopathic hemolytic anemia.
9525	
9526	
9527	
9528	
9529	
9530	
9531	This gene encodes a member of the alpha-2/delta subunit family, a protein in the voltage-dependent calcium channel complex. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization and consist of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. Various versions of each of these subunits exist, either expressed from similar genes or the result of alternative splicing. Research on a highly similar protein in rabbit suggests the protein described in this record is cleaved into alpha-2 and delta subunits. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
9532	
9533	
9534	
9535	
9536	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of  evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex  assembly and transcription initiation. This gene encodes a protein that is a component of the PCAF histone acetylase complex and structurally similar to one of  the histone-like TAFs, TAF5. The PCAF histone acetylase complex, which is composed of more than 20 polypeptides some of which are TAFs, is required for myogenic transcription and differentiation.
9537	
9538	
9539	
9540	
9541	
9542	This gene encodes a ubiquitin ligase that is specific for receptor-regulated SMAD proteins in the bone morphogenetic protein (BMP) pathway. A similar protein in Xenopus is involved in embryonic pattern formation. Alternative splicing results in multiple transcript variants encoding different isoforms. An additional transcript variant has been identified, but its full length sequence has not been determined.
9543	
9544	MHC2TA encodes a non-DNA binding transactivator that functions both in constitutive and inducible MHC Class II expression. Mutations in MHC2TA result in the complementation group A of Bare Lymphocyte Syndrome.
9545	This gene encodes a bipartite protein with distinct amino- and carboxy-terminal domains. The amino-terminus contains nuclear localization signals and the carboxy-terminus contains numerous consecutive sequences with extensive similarity to proteins in the gelsolin family of actin-binding proteins, which cap, nucleate, and/or sever actin filaments. The gene product is tightly associated with both actin filaments and plasma membranes, suggesting a role as a high-affinity link between the actin cytoskeleton and the membrane. Its function may include recruitment of actin and other cytoskeletal proteins into specialized structures at the plasma membrane and in the nuclei of growing cells. Two transcript variants encoding different isoforms of supervillin have been described.
9546	This gene encodes a nuclear protein and has similarity to drosophila male-specific lethal-3 gene. The drosophila protein plays a critical role in a dosage-compensation pathway, which equalizes X-linked gene expression in males and females. Thus this encoded protein is thought to play a similar function in chromatin remodeling and transcriptional regulation. This gene has been found to undergo X inactivation. There are four alternatively spliced transcript variants of this gene encoding different isoforms.
9547	Involucrin, a component of the keratinocyte crosslinked envelope, is found in the cytoplasm and crosslinked to membrane proteins by transglutaminase.  This gene is mapped to 1q21, among calpactin I light chain, trichohyalin, profillaggrin, loricrin, and calcyclin.
9548	
9549	
9550	NIMA-related kinases share high amino acid sequence identity with the gene product of the Aspergillus nidulans @#$%&never in mitosis A@#$%& gene, which controls initiation of mitosis.[supplied by OMIM]
9551	
9552	The process of differentiation from mesodermal precursor cells to myoblasts has led to the discovery of a variety of tissue-specific factors that regulate muscle gene expression. The myogenic basic helix-loop-helix proteins, including myoD (MIM 159970), myogenin (MIM 159980), MYF5 (MIM 159990), and MRF4 (MIM 159991) are 1 class of identified factors. A second family of DNA binding regulatory proteins is the myocyte-specific enhancer factor-2 (MEF2) family. Each of these proteins binds to the MEF2 target DNA sequence present in the regulatory regions of many, if not all, muscle-specific genes. The MEF2 genes are members of the MADS gene family (named for the yeast mating type-specific transcription factor MCM1, the plant homeotic genes @#$%&agamous@#$%& and @#$%&deficiens@#$%& and the human serum response factor SRF (MIM 600589)), a family that also includes several homeotic genes and other transcription factors, all of which share a conserved DNA-binding domain.[supplied by OMIM]
9553	
9554	
9555	
9556	
9557	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
9558	This gene encodes a protein containing domains found in the La related protein of Drosophila melanogaster. La motif-containing proteins are thought to be RNA-binding proteins, where the La motif and adjacent amino acids fold into an RNA recognition motif. The La motif is also found in proteins unrelated to the La protein. Alternative splicing has been observed at this locus and three variants, encoding distinct isoforms, are described. Additional splice variation has been identified but the full-length nature of these transcripts has not been determined.
9559	The activation of mitogen-activated protein kinase (MAPK) cascades transduces various extracellular signals to the nucleus to induce gene expression, cell proliferation, differentiation, cell cycle arrest, and apoptosis. For full activation of MAPKs, dual-specificity kinases phosphorylate both threonine and tyrosine residues in MAPK TXY motifs. MKPs are dual-specificity phosphatases that dephosphorylate the TXY motif, thereby negatively regulating MAPK activity.[supplied by OMIM]
9560	This gene encodes a member of the annexin family, a group of calcium-dependent phospholipid-binding proteins. Annexins have unique N-terminal domains and conserved C-terminal domains, which contain the calcium-dependent phospholipid-binding sites. The encoded protein is a 56-kD antigen recognized by sera from patients with various autoimmune diseases. Transcript variants encoding the same isoform have been identified.
9561	
9562	This gene is thought to encode a protein that regulates intracellular protein trafficking to and from the lysosome. Mutations in this gene cause Chediak-Higashi syndrome, a rare lysosomal storage disorder. Alternative splicing results in two transcript variants encoding different isoforms; additional splice variants have been described but their full-length nature has not been determined.
9563	
9564	Protein phosphatase-1 (PP1; see MIM 176875) participates in the regulation of a wide variety of cellular functions by reversible protein phosphorylation. The ability of PP1 to regulate diverse functions resides in its capacity to interact with a variety of regulatory subunits that may target PP1 to specific subcellular locations, modulate its substrate specificity, and allow its activity to be responsive to extracellular signals. Several targeting subunits of PP1 have been identified, including PPP1R5, the glycogen-binding subunits PPP1R3 (MIM 600917) and PPP1R4, and the nuclear inhibitor of PP1 (PPP1R8; MIM 602636).[supplied by OMIM]
9565	
9566	
9567	
9568	
9569	
9570	
9571	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbws class; in addition to an F-box, this protein contains multiple WD-40 repeats. This protein is homologous to Xenopus bTrCP1, yeast Met30, Neurospora Scon2 and Drosophila Slimb proteins. It interacts with HIV-1 Vpu and connects CD4 to the proteolytic machinery. It also associates specifically with phosphorylated IkappaBalpha and beta-catenin destruction motifs, probably functioning in multiple transcriptional programs by activating the NF-kappaB pathway and inhibiting the beta-catenin pathway.
9572	
9573	
9574	
9575	
9576	
9577	
9578	
9579	CD160 is an 27 kDa glycoprotein which was initially identified with the monoclonal antibody BY55. Its expression is tightly associated with peripheral blood NK cells and CD8 T lymphocytes with cytolytic effector activity. The cDNA sequence of CD160 predicts a cysteine-rich, glycosylphosphatidylinositol-anchored protein of 181 amino acids with a single Ig-like domain weakly homologous to KIR2DL4 molecule. CD160 is expressed at the cell surface as a tightly disulfide-linked multimer. RNA blot analysis revealed CD160 mRNAs of 1.5 and 1.6 kb whose expression was highly restricted to circulating NK and T cells, spleen and small intestine. Within NK cells CD160 is expressed by CD56dimCD16+ cells whereas among circulating T cells its expression is mainly restricted to TCRgd bearing cells and to TCRab+CD8brightCD95+CD56+CD28-CD27-cells. In tissues, CD160 is expressed on all intestinal intraepithelial lymphocytes. CD160 shows a broad specificity for binding to both classical and nonclassical MHC class I molecules.
9580	Mutations in dysferlin, a protein associated with the plasma membrane, can cause muscle weakness that affects both proximal and distal muscles. The protein encoded by this gene is a type II membrane protein that is structurally similar to dysferlin. It is a member of the ferlin family and associates with both plasma and nuclear membranes. The protein contains C2 domains that play a role in calcium-mediated membrane fusion events, suggesting that it may be involved in membrane regeneration and repair. Two transcript variants encoding different isoforms have been found for this gene. Other possible variants have been detected, but their full-length natures have not been determined.
9581	
9582	
9583	
9584	This gene encodes a member of the WNT1 inducible signaling pathway (WISP) protein subfamily, which belongs to the connective tissue growth factor (CTGF) family. WNT1 is a member of a family of cysteine-rich, glycosylated signaling proteins that mediate diverse developmental processes. The CTGF family members are characterized by four conserved cysteine-rich domains: insulin-like growth factor-binding domain, von Willebrand factor type C module, thrombospondin domain and C-terminal cystine knot-like domain. This gene may be downstream in the WNT1 signaling pathway that is relevant to malignant transformation. It is expressed at a high level in fibroblast cells, and overexpressed in colon tumors. The encoded protein binds to decorin and biglycan, two members of a family of small leucine-rich proteoglycans present in the extracellular matrix of connective tissue, and possibly prevents the inhibitory activity of decorin and biglycan in tumor cell proliferation. It also attenuates p53-mediated apoptosis in response to DNA damage through activation of the Akt kinase. It is 83% identical to the mouse protein at the amino acid level. Alternative splicing of this gene generates 2 transcript variants.
9585	Gangliosides are membrane-bound glycosphingolipids containing sialic acid. Ganglioside GD3 is known to be important for cell adhesion and growth of cultured malignant cells. The protein encoded by this gene is a type II membrane protein that catalyzes the transfer of sialic acid from CMP-sialic acid to GM3 to produce gangliosides GD3 and GT3. The encoded protein may be found in the Golgi apparatus and is a member of glycosyltransferase family 29.
9586	
9587	The vitamin A metabolite, all-trans retinoic acid (atRA), plays an important role in neuronal development, including neurite outgrowth. RAINB1 is an atRA-responsive gene.[supplied by OMIM]
9588	
9589	Activator protein-2 alpha (AP-2 alpha) is a developmentally-regulated transcription factor and important regulator of gene expression during vertebrate development and carcinogenesis. The protein encoded by this gene is a member of the Kruppel-like zinc finger protein family and can repress expression of the AP-2 alpha gene by binding to a specific site in the AP-2 alpha gene promoter. Repression by the encoded protein requires binding with a corepressor, CtBP1. Two transcript variants encoding different isoforms have been found for this gene.
9590	
9591	
9592	
9593	
9594	
9595	
9596	
9597	
9598	Stearoyl-CoA desaturases catalyze the committed step in the biosynthesis of monounsaturated fatty acids from saturated fatty acids. This reaction involves the introduction of a cis-double bond between carbons 9 and 10 in a spectrum of methylene-interrupted fatty acyl-CoAs.[supplied by OMIM]
9599	
9600	
9601	
9602	
9603	
9604	MBL2 encodes the soluble mannose-binding protein found in serum.  MBL2 recognizes mannose and N-acetylglucosamine on bacterial pathogens, and is capable of activating the classical complement pathway.
9605	The flavin-containing monooxygenases are NADPH-dependent enzymes that catalyze the oxidation of many drugs and xenobiotics. In most mammals, there is a flavin-containing monooxygenase that catalyzes the N-oxidation of some primary alkylamines through an N-hydroxylamine intermediate. However, in humans, this enzyme is truncated and is probably rapidly degraded. The protein encoded by this gene represents the truncated form and apparently has no catalytic activity. A functional allele found in African Americans has been reported, but no sequence evidence has been deposited to support the finding. This gene is found in a cluster with the FMO1, FMO3, and FMO4 genes on chromosome 1.
9606	
9607	
9608	
9609	
9610	This gene product belongs to the neurexin family, members of which function in the vertebrate nervous system as cell adhesion molecules and receptors. This protein, like other neurexin proteins, contains epidermal growth factor repeats and laminin G domains. In addition, it includes an F5/8 type C domain, discoidin/neuropilin- and fibrinogen-like domains, and thrombospondin N-terminal-like domains. Alternative splicing of this gene results in 2 transcript variants encoding different isoforms.
9611	
9612	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene contain a palindromic termination element.
9613	
9614	
9615	
9616	
9617	
9618	
9619	
9620	
9621	
9622	
9623	
9624	Mahogunin (MGRN1) is a C3HC4 RING-containing protein with E3 ubiquitin ligase activity in vitro.[supplied by OMIM]
9625	
9626	The expression of this gene in adipose tissue is induced by fasting. The function of protein encoded by this gene is currently unknown.
9627	The protein encoded by this gene belongs to the family of P2X receptors, which are ATP-gated ion channels and mediate rapid and selective permeability to cations. This gene is predominantly expressed in skeletal muscle, and regulated by p53. The encoded protein is associated with VE-cadherin at the adherens junctions of human umbilical vein endothelial cells.
9628	
9629	This gene encodes a member of the family of voltage-gated potassium (Kv) channel-interacting proteins (KCNIPs), which belong to the recoverin branch of the EF-hand superfamily. Members of the KCNIP family are small calcium binding proteins. They all have EF-hand-like domains, and differ from each other in the N-terminus. They are integral subunit components of native Kv4 channel complexes. They may regulate A-type currents, and hence neuronal excitability, in response to changes in intracellular calcium. The protein encoded by this gene is also shown to function as a calcium-regulated transcriptional repressor, and to interact with presenilins. Mutations in the presenilin genes have been implicated in Alzheimer@#$%&s disease. Due to utilization of an alternate in-frame translation start codon, this gene encodes two isoforms with different sizes.
9630	The Fanconi anemia complementation group (FANC) currently includes FANCA, FANCB, FANCC, FANCD1 (also called BRCA2), FANCD2, FANCE, FANCF, FANCG, and FANCL. Fanconi anemia is a genetically heterogeneous recessive disorder characterized by cytogenetic instability, hypersensitivity to DNA crosslinking agents, increased chromosomal breakage, and defective DNA repair. The members of the Fanconi anemia complementation group do not share sequence similarity; they are related by their assembly into a common nuclear protein complex. This gene encodes the protein for complementation group B. Alternative splicing results in two transcript variants encoding the same protein.
9631	Extracellular signals are transduced into cells through mitogen-activated protein kinases. The structural organization of these kinases into specific signaling domains is facilitated by scaffolding proteins involved in closely tethering different kinases so that successive phosphorylation events can occur. The protein encoded by this gene is a scaffolding protein that brings together mitogen-activated protein kinases and their transcription factor targets for the activation of specific signaling pathways. This gene which is abundantly expressed in testicular haploid germ cells encodes a protein that is recognized by sperm-agglutinating antibodies and implicated in infertility. Alternatively spliced variants that encode different protein isoforms have been described but the full-length sequences of only two have been determined.
9632	
9633	The protein encoded by this gene catalyzes the fourth step of porphyrin biosynthesis in the heme biosynthetic pathway. Defects in this gene cause congenital erythropoietic porphyria (Gunther@#$%&s disease).
9634	
9635	
9636	
9637	
9638	
9639	
9640	PLCE1 belongs to the phospholipase family that catalyzes the hydrolysis of polyphosphoinositides such as phosphatidylinositol-4,5-bisphosphate (PtdIns(4,5)P2) to generate the second messengers Ins(1,4,5)P3 and diacylglycerol. These products initiate a cascade of intracellular responses that result in cell growth and differentiation and gene expression.[supplied by OMIM]
9641	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form complex with G proteins and stimulate Rho-dependent signals.
9642	
9643	
9644	
9645	The protein encoded by this gene is an acyltransferase that converts lysophosphatidic acid into phosphatidic acid, which is the second step in the de novo phospholipid biosynthetic pathway. The encoded protein may be an integral membrane protein.
9646	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
9647	This gene encodes a multidomain serine protease inhibitor that contains 15 potential inhibitory domains. The inhibitor may play a role in skin and hair morphogenesis and anti-inflammatory and/or antimicrobial protection of mucous epithelia. Mutations may result in Netherton syndrome, a disorder characterized by ichthyosis, defective cornification, and atopy.
9648	
9649	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies. Its function has not been identified. Five alternatively spliced transcript variants for this gene have been described. However, the full length nature of two of the variants has not been determined.
9650	BAG proteins compete with Hip for binding to the Hsc70/Hsp70 ATPase domain and promote substrate release. All the BAG proteins have an approximately 45-amino acid BAG domain near the C terminus but differ markedly in their N-terminal regions. The predicted BAG2 protein contains 211 amino acids. The BAG domains of BAG1, BAG2, and BAG3 interact specifically with the Hsc70 ATPase domain in vitro and in mammalian cells. All 3 proteins bind with high affinity to the ATPase domain of Hsc70 and inhibit its chaperone activity in a Hip-repressible manner.
9651	
9652	
9653	
9654	
9655	V7-3 shows structural characteristics of an Na(+) and Cl(-)-dependent neurotransmitter transporter, including 12 transmembrane (TM) domains, intracellular N and C termini, and large extracellular loops containing multiple N-glycosylation sites.[supplied by OMIM]
9656	
9657	
9658	
9659	Members of the Ikaros (ZNFN1A1; MIM 603023) family of transcription factors, which includes Pegasus, are expressed in lymphocytes and are implicated in the control of lymphoid development.[supplied by OMIM]
9660	
9661	Defensins are cysteine-rich cationic polypeptides that are important in the host immunologic response to invading microorganisms. The protein encoded by this gene is secreted and is a member of the beta defensin protein family. Beta defensin genes are found in several clusters throughout the genome, with this gene mapping to a cluster at 20q11.1.
9662	
9663	
9664	The C/EBP family of transcription factors regulates viral and cellular CCAAT/enhancer element-mediated transcription.  C/EBP proteins  contain the bZIP region, which is characterized by two motifs in the C-terminal half of the protein: a basic region involved in DNA binding and a leucine zipper motif involved in dimerization. The C/EBP family consist of several related proteins, C/EBP alpha, C/EBP beta, C/EBP gamma, and C/EBP delta, that form homodimers and that form heterodimers with each other.  CCAAT/enhancer binding protein gamma may cooperate with Fos to bind PRE-I enhancer elements.
9665	The protein encoded by this gene is an inositol 1,4,5-trisphosphate (InsP3) 5-phosphatase and contains a Sac domain. The activity of this protein is specific for phosphatidylinositol 4,5-bisphosphate and phosphatidylinositol 3,4,5-trisphosphate. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
9666	
9667	This gene is a member of the cytidine deaminase gene family. It is one of seven related genes or pseudogenes found in a cluster, thought to result from gene duplication, on chromosome 22. Members of the cluster encode proteins that are structurally and functionally related to the C to U RNA-editing cytidine deaminase APOBEC1. It is thought that the proteins may be RNA editing enzymes and have roles in growth or cell cycle control. Alternatively spliced transcript variants encoding different isoforms have been identified.
9668	Annexin VI belongs to a family of calcium-dependent membrane and phospholipid binding proteins. Although their functions are still not clearly defined, several members of the annexin family have been implicated in membrane-related events along exocytotic and endocytotic pathways. The annexin VI gene is approximately 60 kbp long and contains 26 exons.  It encodes a protein of about 68 kDa that consists of eight 68-amino acid repeats separated by linking sequences of variable lengths. It is highly similar to human annexins I and II sequences, each of which contain four such repeats.  Exon 21 of annexin VI is alternatively spliced, giving rise to two isoforms that differ by a 6-amino acid insertion at the start of the seventh repeat.  Annexin VI has been implicated in mediating the endosome aggregation and vesicle fusion in secreting epithelia during exocytosis.
9669	This gene encodes a member of the NEDD4-like protein family. The family of proteins is known to possess ubiquitin-protein ligase activity. The encoded protein contains 4 tandem WW domains. The WW domain is a protein motif consisting of 35 to 40 amino acids and is characterized by 4 conserved aromatic residues. The WW domain may mediate specific protein-protein interactions. Three alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
9670	
9671	
9672	The protein encoded by this gene is involved in the synthesis of the unbranched glycosaminoglycan hyaluronan, or hyaluronic acid, which is a major constituent of the extracellular matrix. This gene is a member of the NODC/HAS gene family. Compared to the proteins encoded by other members of this gene family, this protein appears to be more of a regulator of hyaluronan synthesis. Two transcript variants encoding different isoforms have been found for this gene.
9673	
9674	
9675	
9676	
9677	
9678	This gene is a member of the SCC3 family and is expressed in the nucleus. It encodes a component of cohesin, a multisubunit protein complex that provides sister chromatid cohesion along the length of a chromosome from DNA replication through prophase and prometaphase, after which it is dissociated in preparation for segregation during anaphase.
9679	TIP39 is related to parathyroid hormone (PTH; MIM 168450) and PTH-related protein (PTHRP; MIM 168470) and is a ligand for PTH receptor-2 (PTHR2; MIM 601469) (John et al., 2002 [PubMed 11861531]).[supplied by OMIM]
9680	
9681	
9682	
9683	
9684	Glutathione S-transferase (GSTs) theta 2 (GSTT2) is a member of a superfamily of proteins that catalyze the conjugation of reduced glutathione to a variety of electrophilic and hydrophobic compounds. Human GSTs can be divided into five main classes: Alpha, Mu, Pi, Theta, and Zeta. The theta class includes GSTT1 and GSTT2. The GSTT1 and GSTT2 share 55% amino acid sequence identity and both of them were claimed to have an important role in human carcinogenesis. GSTT1 gene is located approximately 50kb away from the GSTT2 gene. GSTT1 and GSTT2 genes have a similar structure, being composed of five exons with identical exon/intron boundaries.
9685	
9686	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. The gene product is an outwardly rectifying channel that is sensitive to changes in extracellular pH and is inhibited by extracellular acidification. Also referred to as an acid-sensitive potassium channel, it is activated by the anesthetics halothane and isoflurane. Although three transcripts are detected in northern blots, there is currently no sequence available to confirm transcript variants for this gene.
9687	The protein encoded by this gene is a member of the sulfatase family. Sulfatases are essential for the correct composition of bone and cartilage matrix. This encoded protein is postranslationally glycosylated and localized to the lysosome. Two splice variants of this gene have been identified.
9688	The product of this gene is a small iron-sulfur protein that transfers electrons from NADPH through ferredoxin reductase to a terminal cytochrome P450. This particular oxidation/reduction system is found in steroidogenic tissues, and is involved with the synthesis of bile acid and vitamin D. In addition to the expressed gene at this chromosomal locus (11q22), there are pseudogenes located on chromosomes 20 and 21. This gene product has been identified in a number of different tissues but all forms have been shown to be identical and are not tissue specific.
9689	
9690	
9691	
9692	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. The protein encoded by this gene and SEC63 protein are found to be associated with ribosome-free SEC61 complex. It is speculated that Sec61-Sec62-Sec63 may perform post-translational protein translocation into the ER. The Sec61-Sec62-Sec63 complex might also perform the backward transport of ER proteins that are subject to the ubiquitin-proteasome-dependent degradation pathway. The encoded protein is an integral membrane protein located in the rough ER.
9693	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. The protein encoded by this gene and SEC63 protein are found to be associated with ribosome-free SEC61 complex. It is speculated that Sec61-Sec62-Sec63 may perform post-translational protein translocation into the ER. The Sec61-Sec62-Sec63 complex might also perform the backward transport of ER proteins that are subject to the ubiquitin-proteasome-dependent degradation pathway. The encoded protein is an integral membrane protein located in the rough ER.
9694	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 non-identical kinase catalytic domains and phosphorylates various substrates, including CREB1 and c-fos. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
9695	
9696	The product of this gene belongs to the phosphatase 2A regulatory subunit B family. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes a beta isoform of the regulatory subunit B56 subfamily.
9697	
9698	
9699	
9700	The protein encoded by this gene is a urate transporter and urate-anion exchanger which regulates the level of urate in the blood. This protein is an integral membrane protein primarily found in kidney. Two transcript variants encoding different isoforms have been found for this gene.
9701	
9702	Cofilin is a widely distributed intracellular actin-modulating protein that binds and depolymerizes filamentous F-actin and inhibits the polymerization of monomeric G-actin in a pH-dependent manner. It is involved in the translocation of actin-cofilin complex from cytoplasm to nucleus.[supplied by OMIM]
9703	
9704	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L11P family of ribosomal proteins. It is located in the cytoplasm. The protein binds directly to the 26S rRNA. This gene is co-transcribed with the U65 snoRNA, which is located in its fourth intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
9705	CEACAM3 is a member of the CEACAM family (see CEACAM1; MIM 109770) that is expressed specifically in granulocytes. Members of the CEACAM family, including CEACAM3, are used by several human bacterial pathogens, including Neisseria, Moraxella, and Hemophilus species, to anchor themselves to and/or invade host cells (Schmitter et al., 2004 [PubMed 14707113]).[supplied by OMIM]
9706	
9707	The protein encoded by this gene stimulates the formation of a non-mitotic multinucleate syncytium from proliferative cytotrophoblasts during trophoblast differentiation. An alternatively spliced transcript variant of this gene may exist.
9708	Lysophospholipases are enzymes that act on biological membranes to regulate the multifunctional lysophospholipids. The protein encoded by this gene hydrolyzes lysophosphatidylcholine to glycerophosphorylcholine and a free fatty acid. This enzyme is present in the plasma and thought to be associated with high-density lipoprotein. A later paper contradicts the function of this gene. It demonstrates that this gene encodes a lysosomal enzyme instead of a lysophospholipase and has both calcium-independent phospholipase A2 and transacylase activities.
9709	
9710	This translocase has similarity to yeast mitochondrial proteins that are involved in the import of metabolite transporters from the cytoplasm into the mitochondrial inner membrane. The gene is mutated in Deafness Dystonia Syndrome (MTS/DFN-1) and it is postulated that MTS/DFN-1 is a mitochondrial disease caused by a defective mitochondrial protein import system.
9711	
9712	
9713	Nicotinamide adenine dinucleotide (NAD) is a coenzyme in metabolic redox reactions, a precursor for several cell signaling molecules, and a substrate for protein posttranslational modifications. NAD synthetase (EC 6.3.5.1) catalyzes the final step in the biosynthesis of NAD from nicotinic acid adenine dinucleotide (NaAD).[supplied by OMIM]
9714	
9715	
9716	
9717	
9718	
9719	
9720	This gene encodes a member of the POP family of proteins containing three putative transmembrane domains. This gene is expressed in cardiac and skeletal muscle and may play an important role in these tissues during development.
9721	
9722	
9723	This gene encodes a calcium-binding protein belonging to the penta-EF-hand protein family. Calcium binding is important for homodimerization and for conformational changes required for binding to other protein partners. This gene product participates in T cell receptor-,  Fas-, and glucocorticoid-induced programmed cell death. In mice deficient for this gene product, however, apoptosis was not blocked suggesting this gene product is functionally redundant.
9724	
9725	
9726	
9727	Members of the Ikaros (ZNFN1A1; MIM 603023) family of transcription factors, which includes Eos, are expressed in lymphocytes and are implicated in the control of lymphoid development.[supplied by OMIM]
9728	This gene, CYP3A7, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This enzyme hydroxylates testosterone and dehydroepiandrosterone 3-sulphate, which is involved in the formation of estriol during pregnancy. The enzyme also metabolizes some drugs such as aflatoxin B1. This gene is part of a cluster of cytochrome P450 genes on chromosome 7q21.1. Transcript variants have been described, but it is not known whether these transcripts are normally produced.
9729	
9730	
9731	
9732	The protein encoded by this gene binds to Bruton@#$%&s tyrosine kinase (BTK) and downregulates BTK@#$%&s kinase activity. In addition, the encoded protein disrupts BTK-mediated calcium mobilization and negatively regulates the activation of nuclear factor-kappa-B-driven transcription.
9733	BHC80 is a component of a BRAF35 (MIM 605535)/histone deacetylase (HDAC; see MIM 601241) complex (BHC) that mediates repression of neuron-specific genes through the cis-regulatory element known as repressor element-1 (RE1) or neural restrictive silencer (NRS) (Hakimi et al., 2002).[supplied by OMIM]
9734	The PALM2-AKAP2 mRNA is a naturally occurring co-transcribed product of the neighboring PALM2 and AKAP2 genes. The significance of this co-transcribed mRNA and the function of its protein product have not yet been determined. Alternative splicing of this gene results in several transcript variants encoding different isoforms, but the full-length nature of some of these variants has not been defined.
9735	
9736	This gene encodes a protein that is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes composed of a prodomain and a large and small protease subunit. Activation of caspases requires proteolytic processing at conserved internal aspartic residues to generate a heterodimeric enzyme consisting of the large and small subunits. This caspase is able to cleave and activate its own, as well as caspase 1, precursor proteins. When overexpressed, this gene has been shown to induce cell apoptosis. Alternative splicing results in 3 transcript variants encoding distinct isoforms.
9737	
9738	
9739	This gene encodes a protein that is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes composed of a prodomain and a large and small protease subunit. Activation of caspases requires proteolytic processing at conserved internal aspartic residues to generate a heterodimeric enzyme consisting of the large and small subunits. This caspase is able to cleave and activate its own, as well as caspase 1, precursor proteins. When overexpressed, this gene has been shown to induce cell apoptosis. Alternative splicing results in 3 transcript variants encoding distinct isoforms.
9740	This gene encodes a potential transmembrane protein which may function as a receptor or transporter molecule. This protein may play a role in nervous system development and maintenance. Alternative splice variants have been discribed, but their biological nature has not been determined. Mutations in this gene have been associated with autosomal dominant spastic paraplegia 6.
9741	The protein encoded by this gene belongs to the PI3/PI4-kinase family. This protein is an important cell cycle checkpoint kinase that phosphorylates; thus, it functions as a regulator of a wide variety of downstream proteins, including tumor suppressor proteins p53 and BRCA1, checkpoint kinase CHK2, checkpoint proteins RAD17 and RAD9, and DNA repair protein NBS1. This protein and the closely related kinase ATR are thought to be master controllers of cell cycle checkpoint signaling pathways that are required for cell response to DNA damage and for genome stability. Mutations in this gene are associated with ataxia telangiectasia, an autosomal recessive disorder. At least three alternatively spliced transcript variants, which encode distinct isoforms, have been identified.
9742	
9743	Heat-shock transcription factors (HSFs) activate heat-shock response genes under conditions of heat or other stresses. HSF4 lacks the carboxyl-terminal hydrophobic repeat which is shared among all vertebrate HSFs and has been suggested to be involved in the negative regulation of DNA binding activity
9744	
9745	The proto-oncogene MET product is the hepatocyte growth factor receptor and encodes tyrosine-kinase activity. The primary single chain precursor protein is post-translationally cleaved to produce the alpha and beta subunits, which are disulfide linked to form the mature receptor. Various mutations in the MET gene are associated with papillary renal carcinoma.
9746	
9747	HLA-A belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-A alleles have been described.
9748	
9749	
9750	
9751	
9752	
9753	
9754	This gene is a member of the protocadherin gamma gene cluster, one of three related clusters tandemly linked on chromosome five. These gene clusters have an immunoglobulin-like organization, suggesting that a novel mechanism may be involved in their regulation and expression. The gamma gene cluster includes 22 genes divided into 3 subfamilies. Subfamily A contains 12 genes, subfamily B contains 7 genes and 2 pseudogenes, and the more distantly related subfamily C contains 3 genes. The tandem array of 22 large, variable region exons are followed by a constant region, containing 3 exons shared by all genes in the cluster. Each variable region exon encodes the extracellular region, which includes 6 cadherin ectodomains and a transmembrane region. The constant region exons encode the common cytoplasmic region. These neural cadherin-like cell adhesion proteins most likely play a critical role in the establishment and function of specific cell-cell connections in the brain. Alternative splicing has been described for the gamma cluster genes.
9755	
9756	
9757	
9758	
9759	
9760	
9761	
9762	
9763	
9764	
9765	
9766	
9767	
9768	
9769	This gene encodes a 50-kD subunit of dynactin, a macromolecular complex consisting of 10-11 subunits ranging in size from 22 to 150 kD. Dynactin binds to both microtubules and cytoplasmic dynein. It is involved in a diverse array of cellular functions, including ER-to-Golgi transport, the centripetal movement of lysosomes and endosomes, spindle formation, chromosome movement, nuclear positioning, and axonogenesis. This subunit is present in 4-5 copies per dynactin molecule. It contains three short alpha-helical coiled-coil domains that may mediate association with self or other dynactin subunits. It may interact directly with the largest subunit (p150) of dynactin and may affix p150 in place.
9770	
9771	
9772	
9773	
9774	
9775	
9776	
9777	
9778	
9779	
9780	
9781	
9782	
9783	
9784	
9785	
9786	The linked genes RAG1 and RAG2 act together to activate immunoglobulin V-D-J recombination.  RAG1 is involved in recognition of the DNA substrate.
9787	This gene encodes a disintegrin and metalloproteinase with thrombospondin motifs-4, which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The enzyme encoded by this gene lacks a C-terminal TS motif. It is responsible for the degradation of aggrecan, a major proteoglycan of cartilage, and brevican, a brain-specific extracellular matrix protein. The cleavage of aggrecan and brevican suggests key roles of this enzyme in arthritic disease and in the central nervous system, potentially, in the progression of glioma.
9788	The protein encoded by this gene is a subunit of a heterodimer that, along with SUPT16H, forms chromatin transcriptional elongation factor FACT. FACT interacts specifically with histones H2A/H2B to effect nucleosome disassembly and transcription elongation. FACT and cisplatin-damaged DNA may be crucial to the anticancer mechanism of cisplatin. This encoded protein contains a high mobility group box which most likely constitutes the structure recognition element for cisplatin-modified DNA. This protein also functions as a co-activator of the transcriptional activator p63.
9789	This gene encodes a protein which is a member of family 38 of the glycosyl hydrolases. The protein is located in the Golgi and catalyzes the final hydrolytic step in the asparagine-linked oligosaccharide (N-glycan) maturation pathway. Mutations in the mouse homolog of this gene have been shown to cause a systemic autoimmune disease similar to human systemic lupus erythematosus.
9790	
9791	The protein encoded by this gene is a member of the neurotrophin family, that controls survival and differentiation of mammalian neurons. This protein is closely related to both nerve growth factor and brain-derived neurotrophic factor. It may be involved in the maintenance of the adult nervous system, and may affect development of neurons in the embryo when it is expressed in human placenta. NTF3-deficient mice generated by gene targeting display severe movement defects of the limbs. The mature peptide of this protein is identical in all mammals examined including human, pig, rat and mouse.
9792	Calcium-dependent membrane-binding proteins may regulate molecular events at the interface of the cell membrane and cytoplasm. This gene is one of several genes that encode a calcium-dependent protein containing two N-terminal type II C2 domains and an integrin A domain-like sequence in the C-terminus. Sequence analysis identified multiple alternatively spliced transcript variants but their full-length natures could not be determined.
9793	
9794	
9795	
9796	
9797	The protein encoded by this gene is a member of the zinc finger family of proteins. The gene product contains C2H2 type domains, which are the classical zinc finger domains found in numerous nucleic acid-binding proteins. In addition to the monocistronic transcript originating from this locus, a co-transcribed variant composed of ZFP91 and CNTF sequence has been identified. The monocistronic and co-transcribed variants encode distinct isoforms. The co-transcription of ZFP91 and CNTF has also been observed in mouse.
9798	
9799	
9800	
9801	
9802	This gene encodes a member of the membrane-spanning 4A gene family. Members of this nascent protein family are characterized by common structural features and similar intron/exon splice boundaries and display unique expression patterns among hematopoietic cells and nonlymphoid tissues. The gene encoding this protein is localized to 11q12.1, among a cluster of family members. Alternative splicing of this gene results in several transcript variants.
9803	
9804	
9805	This gene encodes the enzyme responsible for pre-mRNA editing of the glutamate receptor subunit B by site-specific deamination of adenosines.  Alternative splicing of this gene results in several transcript variants, three which have been characterized by the presence or absence of an ALU cassette insert and a short or long C-terminal region.
9806	The protein encoded by this gene is essential for meiotic homologous recombination. Genetic recombination in meiosis plays an important role in generating diversity of genetic information. The product of this gene is structurally and evolutionary related to the products of the yeast RAD51 and E. coli RecA genes. Alternative splice variants of this gene have been described but their full-length nature has not been determined.
9807	The protein encoded by this gene is a member of the glucuronyltransferase gene family, enzymes that exhibit strict acceptor specificity, recognizing nonreducing terminal sugars and their anomeric linkages. This gene product catalyzes the formation of the glycosaminoglycan-protein linkage by way of a glucuronyl transfer reaction in the final step of the biosynthesis of the linkage region of proteoglycans.
9808	This gene encodes a lysosomal hyaluronidase. Hyaluronidases intracellularly degrade hyaluronan, one of the major glycosaminoglycans of the extracellular matrix. Hyaluronan is thought to be involved in cell proliferation, migration and differentiation. This enzyme is active at an acidic pH and is the major hyaluronidase in plasma. Mutations in this gene are associated with mucopolysaccharidosis type IX, or hyaluronidase deficiency. The gene is one of several related genes in a region of chromosome 3p21.3 associated with tumor suppression. Eight alternatively spliced transcript variants of this gene encoding six distinct isoforms have been described.
9809	
9810	
9811	This gene encodes the seventh largest subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. In yeast, the association of this subunit with the polymerase under suboptimal growth conditions indicates it may play a role in regulating polymerase function.
9812	TTL is a cytosolic enzyme involved in the posttranslational modification of alpha-tubulin (see MIM 602529). Alpha-tubulin within assembled microtubules is detyrosinated over time at the C terminus. After microtubule disassembly, TTL restores the tyrosine residues and consequently participates in a cycle of tubulin detyrosination and tyrosination.[supplied by OMIM]
9813	
9814	
9815	
9816	
9817	
9818	
9819	HLA-G belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-G is expressed on fetal derived placental cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domain, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exon 6 encodes the cytoplasmic tail.
9820	
9821	
9822	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin forms a complex with and functions as a regulatory subunit of CDK4 or CDK6, whose activtiy is required for cell cycle G1/S transition. This protein has been shown to interact with and be involved in the phosphorylation of tumor suppressor protein Rb. The CDK4 activity associated with this cyclin was reported to be necessary for cell cycle progression through G2 phase into mitosis after UV radiation.
9823	
9824	
9825	
9826	Members of the CD2 (see MIM 186990) subgroup of the Ig superfamily, such as CD84, have similar patterns of conserved disulfide bonds and function in adhesion interactions between T lymphocytes and accessory cells.[supplied by OMIM]
9827	
9828	
9829	
9830	
9831	
9832	The protein encoded by this gene catalyzes the phosphorylation of phosphatidylinositol-4-phosphate on the fifth hydroxyl of the myo-inositol ring to form phosphatidylinositol-4,5-bisphosphate. This gene is a member of the phosphatidylinositol-4-phosphate 5-kinase family. The encoded protein sequence does not show similarity to other kinases, but the protein does exhibit kinase activity. Additionally, the encoded protein interacts with p55 TNF receptor. Two transcript variants encoding different isoforms have been found for this gene.
9833	
9834	
9835	
9836	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme is required for post-replicative DNA damage repair. Its protein sequence is 100% identical to the mouse, rat, and rabbit homologs, which indicates that this enzyme is highly conserved in eukaryotic evolution.
9837	This gene encodes a protein that is a member of the dickkopf family. The secreted protein contains two cysteine rich regions and is involved in embryonic development through its interactions with the Wnt signaling pathway. The expression of this gene is decreased in a variety of cancer cell lines and it may function as a tumor suppressor gene. Alternative splicing results in multiple transcript variants encoding the same protein.
9838	Proteolysis by the ubiquitin-proteasome system controls the concentration of many regulatory proteins. The selectivity of ubiquitylation is determined by ubiquitin E3 ligases, which recognize the substrate@#$%&s destabilization signal, or degron. The E3 ligase UBR2 participates in the N-end rule pathway, which targets proteins bearing an N-terminal degron, or N-degron (Kwon et al., 2003).[supplied by OMIM]
9839	
9840	The PDHX gene encodes component X of the pyruvate dehydrogenase (PDH) complex. For a detailed description of the pyruvate dehydrogenase complex, see MIM 300502. The mammalian PDH complex differs from that in E. coli and from the other mammalian alpha-keto acid dehydrogenases by the presence of a 53-kD protein called protein X. Component X binds to the E3 (MIM 246900) component of the PDH complex (Robinson et al., 1990; Aral et al., 1997).[supplied by OMIM]
9841	FDCSP is expressed by follicular dendritic cells (FDCs) and activated leukocytes during ongoing immune responses.[supplied by OMIM]
9842	This gene encodes a nuclear-encoded mitochondrial metalloprotease protein that is a member of the AAA (ATPases associated with a variety of cellular activities) protein family. Members of this protein family share an ATPase domain and have roles in diverse cellular processes including membrane trafficking, intracellular motility, organelle biogenesis, protein folding, and proteolysis. Two transcript variants encoding distinct isoforms have been identified for this gene. Mutations associated with this gene cause autosomal recessive spastic paraplegia 7.
9843	
9844	
9845	
9846	
9847	This gene encodes one of three subunits which combine to form cleavage stimulation factor (CSTF). CSTF is involved in the polyadenylation and 3@#$%&end cleavage of pre-mRNAs. Similar to mammalian G protein beta subunits, this protein contains transducin-like repeats.
9848	
9849	
9850	
9851	
9852	CGGBP1 influences expression of the fragile X mental retardation gene, FMR1 (MIM 309550), by specifically interacting with the 5-prime (CGG)n-3-prime repeat in its 5-prime UTR.[supplied by OMIM]
9853	
9854	The function of this protein has not yet been defined; however, this protein may play a role which is similar to or complementary to other T-cell leukemia/lymphoma proteins during early embrogenesis. The association of this gene with T-cell leukemia chromosome translocations implicates this gene as a candidate for leukemogenesis. Complex alternative splicing of this gene results in multiple transcript variants, six of which have been fully described. Among these described variants, four different open reading frames have been identified; three of the six variants code for an identical product.
9855	
9856	
9857	The retinoblastoma tumor suppressor (pRB) protein binds with many other proteins. In various human cancers, pRB suppresses cellular proliferation and is inactivated. Cell cycle-dependent phosphorylation regulates the activity of pRB. This gene encodes a protein which binds to underphosphorylated but not phosphorylated pRB. Multiple alternatively spliced transcript variants that encode different isoforms have been found for this gene.
9858	INPPL1 encodes inositol polyphosphate-5 phosphatase-like 1, a protein that in addition to the phosphatase domain contains an SH2 (src-homology domain 2) motif.
9859	
9860	
9861	
9862	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXB genes located in a cluster on chromosome 17. The exact role of this gene has yet to be determined.
9863	This gene is a member of the tropomyosin family of highly conserved, widely distributed actin-binding proteins involved in the contractile system of striated and smooth muscles and the cytoskeleton of non-muscle cells. Tropomyosin is composed of two alpha-helical chains arranged as a coiled-coil. It is polymerized end to end along the two grooves of actin filaments and provides stability to the filaments. The encoded protein is one type of alpha helical chain that forms the predominant tropomyosin of striated muscle, where it also functions in association with the troponin complex to regulate the calcium-dependent interaction of actin and myosin during muscle contraction. In smooth muscle and non-muscle cells, alternatively spliced transcript variants encoding a range of isoforms have been described. Mutations in this gene are associated with type 3 familial hypertrophic cardiomyopathy.
9864	
9865	Amino acid transporters play essential roles in the uptake of nutrients, production of energy, chemical metabolism, detoxification, and neurotransmitter cycling. SLC38A1 is an important transporter of glutamine, an intermediate in the detoxification of ammonia and the production of urea. Glutamine serves as a precursor for the synaptic transmitter, glutamate (Gu et al., 2001).[supplied by OMIM]
9866	The SR (serine/arginine-rich) family contains a number of phosphoproteins that function as essential and alternative splicing factors. The SR family of proteins is characterized by the presence of a ribonucleoprotein (RNP)-type RNA binding motif and a carboxyl-terminal arginine-serine-rich (RS) domain. The protein encoded by this gene is a member of the SR family and functions as an essential splicing factor in vitro. This gene is thought to be an expressed PR264/SC35 retropseudogene.
9867	
9868	
9869	
9870	
9871	The protein encoded by this gene is a member of the keratin gene family. As a type II hair keratin, it is a basic protein which heterodimerizes with type I keratins to form hair and nails. The type II hair keratins are clustered in a region of chromosome 12q13 and are grouped into two distinct subfamilies based on structure similarity. One subfamily, consisting of KRTHB1, KRTHB3, and KRTHB6, is highly related. The other less-related subfamily includes KRTHB2, KRTHB4, and KRTHB5. All hair keratins are expressed in the hair follicle; this hair keratin, as well as KRTHB1 and KRTHB3, is found primarily in the hair cortex. Mutations in this gene and KRTHB1 have been observed in patients with a rare dominant hair disease, monilethrix.
9872	
9873	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Alternative splicing of this gene generates 2 transcript variants diverging at the 3@#$%& end.
9874	
9875	
9876	
9877	
9878	
9879	The protein encoded by this gene is similar to the gene product of S14, a rat gene whose expression is limited to liver and adipose tissue and is controlled by nutritional and hormonal factors. This gene has been shown to be expressed in liver and adipocytes, particularly in lipomatous modules. It is also found to be expressed in lipogenic breast cancers, which suggests a role in controlling tumor lipid metabolism.
9880	
9881	
9882	
9883	Vesicle mediated protein sorting plays an important role in segregation of intracellular molecules into distinct organelles. Genetic studies in yeast have identified more than 40 vacuolar protein sorting (VPS) genes involved in vesicle transport to vacuoles. This gene is a member of the Sec-1 domain family, and it encodes a protein similar to the yeast class C Vps33 protein. The mammalian class C VPS proteins are predominantly associated with late endosomes/lysosomes, and like their yeast counterparts, may mediate vesicle trafficking steps in the endosome/lysosome pathway.
9884	
9885	
9886	Members of the ATP-binding cassette (ABC) transporter superfamily carry out energy-dependent transport of substrate molecules. ABCA13 belongs to a subfamily of ABC transporters that contain 2 transmembrane (TM) domains, each with about 6 membrane-spanning segments, and 2 nucleotide-binding domains (NBDs), which are located in the cytoplasm.[supplied by OMIM]
9887	
9888	This gene is a member of the protocadherin gamma gene cluster, one of three related clusters tandemly linked on chromosome five. These gene clusters have an immunoglobulin-like organization, suggesting that a novel mechanism may be involved in their regulation and expression. The gamma gene cluster includes 22 genes divided into 3 subfamilies. Subfamily A contains 12 genes, subfamily B contains 7 genes and 2 pseudogenes, and the more distantly related subfamily C contains 3 genes. The tandem array of 22 large, variable region exons are followed by a constant region, containing 3 exons shared by all genes in the cluster. Each variable region exon encodes the extracellular region, which includes 6 cadherin ectodomains and a transmembrane region. The constant region exons encode the common cytoplasmic region. These neural cadherin-like cell adhesion proteins most likely play a critical role in the establishment and function of specific cell-cell connections in the brain. Alternative splicing has been described for the gamma cluster genes.
9889	This gene encodes a protein similar to calcium/calmodulin dependent protein kinase, however, its exact function is not known.
9890	This gene is a member of the cadherin superfamily, genes encoding calcium dependent cell-cell adhesion glycoproteins. The protein encoded by this gene is a large, single-pass transmembrane protein composed of an extracellular domain containing 27 repeats that show significant homology to the cadherin ectodomain. Expressed in the neurosensory epithelium, the protein is thought to be involved in stereocilia organization and hair bundle formation. The gene is located in a region containing the human deafness loci DFNB12 and USH1D. Usher syndrome 1D and nonsyndromic autosomal recessive deafness DFNB12 are caused by allelic mutations of this novel cadherin-like gene. Two alternative splice variants have been identified, encoding different isoforms. Additional variants have been observed but their full-length nature has not been determined.
9891	
9892	
9893	
9894	
9895	
9896	
9897	Selenoproteins, such as SELK, contain the rare twenty-first amino acid, selenocysteine (sec). These proteins lack common amino acid sequence motifs, but the 3-prime untranslated regions of selenoprotein genes have a common stem-loop structure, the sec insertion sequence (SECIS), that is necessary for the recognition of UGA as a sec codon rather than as a stop signal.[supplied by OMIM]
9898	
9899	
9900	
9901	
9902	This gene encodes a member of a family of neuronal cell surface proteins. Members of this family may act as splice site-specific ligands for beta-neurexins and may be involved in the formation and remodeling of central nervous system synapses. The encoded protein interacts with discs, large (Drosophila) homolog 4 (DLG4). Mutations in this gene have been associated with autism and Asperger syndrome. Two transcript variants encoding the same protein have been identified for this gene.
9903	
9904	
9905	Thyroid hormone receptors are hormone-dependent transcription factors that regulate expression of a variety of specific target genes. The protein encoded by this gene binds thyroid hormone receptor beta, but only in the presence of thyroid hormone. The encoded protein, a member of the HMGN protein family, is thought to reduce the compactness of the chromatin fiber in nucleosomes, thereby enhancing transcription from chromatin templates. Two transcript variants encoding different isoforms have been found for this gene.
9906	
9907	
9908	
9909	
9910	The protein encoded by this gene contains a RING finger, a motif known to be involved in protein-protein and protein-DNA interactions. This protein was reported to interact with the tumor suppressor protein RB1. Studies of the rat counterpart suggested that this protein may function as an E3 ubiquitin-protein ligase, and facilitate the ubiquitination and  degradation of syntaxin 1, which is an essential component of the neurotransmitter release machinery. Alternatively spliced transcript variants encoding distinct isoforms have been described.
9911	
9912	
9913	Zeta-globin (HBZ ) is an alpha-like hemoglobin. The zeta-globin polypeptide is synthesized in the yolk sac of the early embryo, while alpha-globin is produced throughout fetal and adult like. The zeta-globin gene is a member of the human alpha-globin gene cluster that involves 4 functional genes and 3 nonfunctional pseudogenes. The order of genes is: 5@#$%&-zeta -- pseudozeta -- pseudoalpha2 -- pseudoalpha1 -- alpha2 -- alph1 -- theta1-3@#$%&.
9914	The protein encoded by this gene is a regulatory subunit of the AMP-activated protein kinase (AMPK). AMPK is a heterotrimer consisting of an alpha catalytic subunit, and non-catalytic beta and gamma subunits. AMPK is an important energy-sensing enzyme that monitors cellular energy status. In response to cellular metabolic stresses, AMPK is activated, and thus phosphorylates and inactivates acetyl-CoA carboxylase (ACC) and beta-hydroxy beta-methylglutaryl-CoA reductase (HMGCR), key enzymes involved in regulating de novo biosynthesis of fatty acid and cholesterol. This subunit is one of the gamma regulatory subunits of AMPK. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
9915	
9916	The protein encoded by this gene is a member of the calmodulin-binding transcription activator (CAMTA) protein family. CAMTA family members share a common domain structure that consists of a transcription activation domain, a DNA-binding domain, and a calmodulin-binding domain. CAMTA proteins may function as transcription factors that respond to calcium signaling by directly binding to calmodulin. This protein was shown to possess transcription activation activity in a transcription reporter system in yeast. Alternatively spliced transcript variants have been identified, but their full-length natures have not been determined.
9917	
9918	ADP-ribosylation factor 3 (ARF3) is a member of the human ARF gene family. These genes encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking and as activators of phospholipase D.  The gene products include 6 ARF proteins and 11 ARF-like proteins and constitute 1 family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2,and ARF3), class II (ARF4 and ARF5) and class III (ARF6) and members of each class share a common gene organization. The ARF3 gene contains five exons and four introns.
9919	
9920	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily G. This member functions as a modulatory subunit. The gene has strong expression in brain. Alternative splicing results in two transcript variants encoding distinct isoforms.
9921	
9922	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein that is known to complex with integrins. This gene is expressed in different carcinomas. The use of alternate polyadenylation sites has been found for this gene.
9923	
9924	
9925	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXC genes located in a cluster on chromosome 12. The protein level is controlled during cell differentiation and proliferation, which may indicate this protein role in origin activation. The exact role of this gene has to be determined.
9926	
9927	
9928	This gene encodes a member of the Notch family. Members of this Type 1 transmembrane protein family share structural characteristics including an extracellular domain consisting of multiple epidermal growth factor-like (EGF) repeats, and an intracellular domain consisting of multiple, different domain types. Notch family members play a role in a variety of developmental processes by controlling cell fate decisions. The Notch signaling network is an evolutionarily conserved intercellular signaling pathway which regulates interactions between physically adjacent cells. In Drosophilia, notch interaction with its cell-bound ligands (delta, serrate) establishes an intercellular signaling pathway that plays a key role in development. Homologues of the notch-ligands have also been identified in human, but precise interactions between these ligands and the human notch homologues remain to be determined. This protein is cleaved in the trans-Golgi network, and presented on the cell surface as a heterodimer. This protein functions as a receptor for membrane bound ligands, and may play a role in vascular, renal and hepatic development. This gene may be associated with susceptibility to schizophrenia in a small portion of cases. An alternative splice variant has been described but its biological nature has not been determined.
9929	The product of this gene belongs to the integrin alpha chain family. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. This gene encodes the integrin alpha 5 chain. Alpha chain 5 undergoes post-translational cleavage in the extracellular domain to yield disulfide-linked light and heavy chains that join with beta 1 to form a fibronectin receptor. In addition to adhesion, integrins are known to participate in cell-surface mediated signalling.
9930	Studies in Drosophila have identified this gene as encoding a positive regulator of the Notch-signaling pathway. The human gene encodes a protein of unknown function; however, it may play a role in basic helix-loop-helix transcription factor activity.
9931	
9932	
9933	This gene encodes a transmembrane glycoprotein which mediates interaction between newly assembled major histocompatibility complex (MHC) class I molecules and the transporter associated with antigen processing (TAP), which is required for the transport of antigenic peptides across the endoplasmic reticulum membrane. This interaction is essential for optimal peptide loading on the MHC class I molecule. Up to four complexes of MHC class I and this protein may be bound to a single TAP molecule. This protein contains a C-terminal double-lysine motif (KKKAE) known to maintain membrane proteins in the endoplasmic reticulum. This gene lies within the major histocompatibility complex on chromosome 6. Alternative splicing results in three transcript variants encoding different isoforms.
9934	
9935	
9936	
9937	This gene encodes a member of the grainyhead family of transcription factors. The encoded protein interacts with sister of mammalian grainyhead (SOM) and may function as a transcription factor during development. Two transcript variants encoding distinct isoforms have been identified for this gene.
9938	
9939	
9940	
9941	
9942	
9943	
9944	
9945	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order   structures. This gene is intronless and encodes a member of the histone H2B family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6.
9946	This gene product belongs to the family of candidate taste receptors that are members of the G-protein-coupled receptor superfamily. These proteins are specifically expressed in the taste receptor cells of the tongue and palate epithelia. They are organized in the genome in clusters and are genetically linked to loci that influence bitter perception in mice and humans. In functional expression studies, they respond to bitter tastants. This gene maps to the taste receptor gene cluster on chromosome 12p13.
9947	The protein encoded by this gene contains an A-T hook DNA binding motif which usually binds to other DNA binding structures to play an important role in chromatin modeling and transcription regulation. Its Poz domain is thought to function as a site for protein-protein interaction and is required for transcriptional repression, and the zinc-fingers comprise the DNA binding domain. Since the encoded protein has typical features of a transcription factor, it is postulated to be a repressor of gene expression. In small round cell sarcoma, this gene is fused to EWS by a small inversion of 22q, then the hybrid is thought to be translocated (t(1;22)(p36.1;q12). The rearrangement of chromosome 22 involves intron 8 of EWS and exon 1 of this gene creating a chimeric sequence containing the transactivation domain of EWS fused to zinc finger domain of this protein. This is a distinct example of an intra-chromosomal rearrangement of chromosome 22. There are four alternatively spliced transcript variants described for this gene.
9948	This gene maps to a GC-rich region of the X chromosome and was identified by its proximity to a CpG island. It is thought to be a housekeeping gene.
9949	
9950	
9951	
9952	
9953	
9954	
9955	
9956	
9957	This gene encodes a WASP interacting protein (WIP)-related protein. It has been shown that this protein has a role in the WASP-mediated organization of the actin cytoskeleton and that this protein is a potential link between the activated platelet-derived growth factor receptor and the actin polymerization machinery.
9958	
9959	
9960	
9961	
9962	
9963	
9964	Aquaporin 5 (AQP5) is a water channel protein.  Aquaporins are a family of small integral membrane proteins related to the major intrinsic protein (MIP or AQP0). Aquaporin 5 plays a role in the generation of saliva, tears and pulmonary secretions. AQP0, AQP2, AQP5, and AQP6 are closely related and all map to 12q13.
9965	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. This protein functions as a modulator of ATP-sensitive potassium channels and insulin release. Mutations and deficiencies in this protein have been observed in patients with hyperinsulinemic hypoglycemia of infancy, an autosomal recessive disorder of unregulated and high insulin secretion. Mutations have also been associated with non-insulin-dependent diabetes mellitus type II, an autosomal dominant disease of defective insulin secretion. Alternative splicing of this gene has been observed; however, the transcript variants have not yet been fully described.
9966	
9967	
9968	The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is specifically expressed in the basal layer of the epidermis with family member KRT14. Mutations in these genes have been associated with a complex of diseases termed epidermolysis bullosa simplex. The type II cytokeratins are clustered in a region of chromosome 12q12-q13.
9969	Thyrotropin-releasing hormone (TRH; MIM 275120) acts as a stimulator of hormone secretion from adenohypophyseal cells, and its signals are inactivated by TRH-degrading ectoenzyme.[supplied by OMIM]
9970	
9971	Tapasin, or TAPBP (MIM 601962), is a member of the variable-constant Ig superfamily that links major histocompatibility complex (MHC) class I molecules to the transporter associated with antigen processing (TAP; see MIM 170260) in the endoplasmic reticulum (ER). The TAPBP gene is located near the MHC complex on chromosome 6p21.3. TAPBPR is a member of the Ig superfamily that is localized on chromosome 12p13.3, a region somewhat paralogous to the MHC.[supplied by OMIM]
9972	
9973	
9974	
9975	
9976	Members of the F-box protein family, such as FBXL14, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
9977	This gene encodes pterin-4 alpha-carbinolamine dehydratase, an enzyme involved in phenylalanine hydroxylation. A deficiency of this enzyme leads to hyperphenylalaninemia. The enzyme regulates the homodimerization of the transcription factor hepatocyte nuclear factor 1 (HNF1). Alternative splicing results in two transcript variants.
9978	This protein belongs to the aldehyde dehydrogenase family of proteins. Aldehyde dehydrogenase is the second enzyme of the major oxidative pathway of alcohol metabolism. Two major liver isoforms of this enzyme, cytosolic and mitochondrial, can be distinguished by their electrophoretic mobilities, kinetic properties, and subcellular localizations. Most Caucasians have two major isozymes, while approximately 50% of Orientals have only the cytosolic isozyme, missing the mitochondrial isozyme. A remarkably higher frequency of acute alcohol intoxication among Orientals than among Caucasians could be related to the absence of the mitochondrial isozyme. This gene encodes a mitochondrial isoform, which has a low Km for acetaldehydes, and is localized in mitochondrial matrix.
9979	This protein belongs to the aldehyde dehydrogenase family of proteins. Aldehyde dehydrogenase is the second enzyme of the major oxidative pathway of alcohol metabolism. Two major liver isoforms of this enzyme, cytosolic and mitochondrial, can be distinguished by their electrophoretic mobilities, kinetic properties, and subcellular localizations. Most Caucasians have two major isozymes, while approximately 50% of Orientals have only the cytosolic isozyme, missing the mitochondrial isozyme. A remarkably higher frequency of acute alcohol intoxication among Orientals than among Caucasians could be related to the absence of the mitochondrial isozyme. This gene encodes a mitochondrial isoform, which has a low Km for acetaldehydes, and is localized in mitochondrial matrix.
9980	
9981	
9982	
9983	
9984	
9985	
9986	The protein encoded by this gene is a transcriptional regulatory protein. It contains paired amphipathic helix (PAH) domains, which are important for protein-protein interactions and may mediate repression by the Mad-Max complex.
9987	
9988	The erythropoietin receptor is a member of the cytokine receptor family. Upon erythropoietin binding, the erythropoietin receptor activates Jak2 tyrosine kinase which activates different intracellular pathways including: Ras/MAP kinase, phosphatidylinositol 3-kinase and STAT transcription factors. The stimulated erythropoietin receptor appears to have a role in erythroid cell survival. Defects in the erythropoietin receptor may produce erythroleukemia and familial erythrocytosis.
9989	The synaptotagmins are integral membrane proteins of synaptic vesicles thought to serve as Ca(2+) sensors in the process of vesicular trafficking and exocytosis. Calcium binding to synaptotagmin I participates in triggering neurotransmitter release at the synapse (Fernandez-Chacon et al., 2001).[supplied by OMIM]
9990	
9991	
9992	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intra-cellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the MDR/TAP subfamily.  Members of the MDR/TAP subfamily are involved in multidrug resistance.  This gene is located 7 kb telomeric to gene family member ABCB2.  The protein encoded by this gene is involved in antigen presentation.  This protein forms a heterodimer with ABCB2 in order to transport peptides from the cytoplasm to the endoplasmic reticulum.  Mutations in this gene may be associated with ankylosing spondylitis, insulin-dependent diabetes mellitus, and celiac disease.  Alternative splicing of this gene produces two products which differ in peptide selectivity and level of restoration of surface expression of MHC class I molecules.
9993	This gene encodes a predicted transmembrane protein containing two extracellular CUB domains followed by a low-density lipoprotein class A (LDLa) domain. It also has an intracellular FXNPXY-like motif, which has been shown in other proteins to be essential for the internalization of clathrin coated pits during endocytosis. Alternatively spliced transcript variants have been observed, but they have not been fully characterized.
9994	
9995	
9996	
9997	
9998	
9999	
10000	
10001	
10002	
10003	
10004	
10005	
10006	This gene encodes a member of the type 3 G protein-coupling receptor family, characterized by the signature 7-transmembrane domain motif. The encoded protein may be involved in interaction between retinoid acid and G protein signalling pathways. Retinoic acid plays a critical role in development, cellular growth, and differentiation. This gene may play a role in embryonic development and epithelial cell differentiation.
10007	
10008	This gene encodes subunit 1 of the splicing factor 3b protein complex. Splicing factor 3b, together with splicing factor 3a and a 12S RNA unit, forms the U2 small nuclear ribonucleoproteins complex (U2 snRNP). The splicing factor 3b/3a complex binds pre-mRNA upstream of the intron@#$%&s branch site in a sequence independent manner and may anchor the U2 snRNP to the pre-mRNA. Splicing factor 3b is also a component of the minor U12-type spliceosome. The carboxy-terminal two-thirds of subunit 1 have 22 non-identical, tandem HEAT repeats that form rod-like, helical structures. Alternative splicing results in multiple transcript variants encoding different isoforms.
10009	
10010	
10011	
10012	
10013	Alkylglycerone phosphate synthase (AGPS) belongs to a group of peroxisomal enzymes that catalyze the first step in ether phospholipid biosynthesis which involves the acylation of dihydroxyacetonephosphate followed by the introduction of the ether linkage by alkyl-DHAP synthase.  Human AGPS shares a 92% nucleotide and amino acid sequence similarity with the guinea pig sequences, and the human sequence was identified by virtue of this homology.  Both proteins contain indentical peroxisomal targeting signal 2 sequences in their N-terminal presequences. Deficiency of AGPS in peroxisomes has been found in patients with the following peroxisomal disorders: Zellweger syndrome, rhizomelic chondrodysplasia punctate and adrenoleukodystrophy.
10014	
10015	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
10016	
10017	
10018	
10019	Acetylation of histone core particles modulates chromatin structure and gene expression. The opposing enzymatic activities of histone acetyltransferases and histone deacetylases, such as HDAC10, determine the acetylation status of histone tails (Kao et al., 2002).[supplied by OMIM]
10020	
10021	
10022	
10023	This gene encodes a common precursor for two peptides, neuromedin N and neurotensin. Neurotensin is a secreted tridecapeptide, which is widely distributed throughout the central nervous system, and may function as a neurotransmitter or a neuromodulator. It may be involved in dopamine-associated pathophysiological events, in the maintenance of gut structure and function, and in the regulation of fat metabolism. Tissue-specific processing may lead to the formation in some tissues of larger forms of neuromedin N and neurotensin. The large forms may represent more stable peptides that are also biologically active.
10024	This gene encodes a highly efficient N-acetylgalactosamine (GalNAc) kinase, which has galactokinase activity when galactose is present at high concentrations. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
10025	
10026	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members. This gene encodes a protein of unknown function.
10027	
10028	
10029	Carboxypeptidases are enzymes that hydrolyze C-terminal peptide bonds. The carboxypeptidase family includes metallo-, serine, and cysteine carboxypeptidases. According to their substrate specificity, these enzymes are referred to as carboxypeptidase A (cleaving aliphatic residues) or carboxypeptidase B (cleaving basic amino residues). The protein encoded by this gene is activated by trypsin and acts on carboxypeptidase B substrates. After thrombin activation, the mature protein downregulates fibrinolysis. Polymorphisms have been described for this gene and its promoter region. Available sequence data analyses indicate splice variants that encode different isoforms.
10030	
10031	
10032	
10033	
10034	CLCN5 is a voltage-gated chloride channel.  Mutation of this gene results in Dent disease and renal tubular disorders complicated by nephrolithiasis.
10035	This gene is expressed in adipose tissue exclusively. It encodes a protein with similarity to collagens X and VIII and complement factor C1q. The encoded protein circulates in the plasma and is involved with metabolic and hormonal processes.
10036	
10037	
10038	
10039	
10040	
10041	
10042	
10043	
10044	This gene encodes a protein containing two conserved tandem RNA recognition motifs. Similar proteins in other species function as RNA-binding proteins and play central roles in posttranscriptional gene regulation. Two transcript variants encoding distinct isoforms have been identified for this gene.
10045	
10046	This gene is a member of a gene family which encodes enzymes responsible for the conversion of arginine residues to citrulline residues. This gene may play a role in granulocyte and macrophage development leading to inflammation and immune response.
10047	
10048	The protein encoded by this gene contains multiple tetratricopeptide repeat (TPR) motifs as well as the highly conserved J domain found in DNAJ chaperone family members. It is a member of the tetratricopeptide repeat family of proteins and acts as an inhibitor of the interferon-induced, dsRNA-activated protein kinase (PKR).
10049	
10050	
10051	
10052	The protein encoded by this gene is a receptor for activated C-kinase (RACK) protein. The encoded protein has been shown to bind in vitro to activated protein kinase C beta I. In addition, this protein is a cutaneous T-cell lymphoma-associated antigen. Finally, the protein contains a bromodomain and two zinc fingers, and is thought to be a transcriptional regulator. Multiple transcript variants encoding several different isoforms have been found for this gene.
10053	
10054	
10055	
10056	
10057	The protein encoded by this gene is the ligand for FAS. Both are transmembrane proteins. Interaction of FAS with this ligand is critical in triggering apoptosis of some types of cells such as lymphocytes. Defects in this gene may be related to some cases of systemic lupus erythematosus (SLE).
10058	
10059	
10060	
10061	
10062	
10063	
10064	
10065	
10066	
10067	
10068	This gene encodes a WD repeat-containing protein. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
10069	The protein encoded by this gene is a mitochondria protein highly similar to the peptide chain release factors (RFs) discovered in bacteria and yeast. The peptide chain release factors direct the termination of translation in response to the peptide chain termination codons. This protein contains the signature sequences of class I peptide chain release factors, suggesting its role in the termination of mitochondria protein synthesis. Multiple alternatively spliced transcript variants have been suggested by mRNA and EST data. Their full length natures, however, are not clear.
10070	
10071	
10072	
10073	
10074	
10075	
10076	
10077	The human complement subcomponent C1q associates with C1r and C1s in order to yield the first component of the serum complement system. The protein encoded by this gene is known to bind to the globular heads of C1q molecules and inhibit C1 activation. This protein has also been identified as the p32 subunit of pre-mRNA splicing factor SF2, as well as a hyaluronic acid-binding protein.
10078	
10079	This gene encodes a member of the glycosyltransferase O-Fuc family. This enzyme adds O-fucose through an O-glycosidic linkage to conserved serine or threonine residues in the epidermal growth factor-like repeats of a number of cell surface and secreted proteins. O-fucose glycans are involved in ligand-induced receptor signaling. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
10080	
10081	
10082	
10083	
10084	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, which are located on different chromosomes and consist of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXB genes located in a cluster on chromosome 17. Hoxb8 knockout mice exhibit an excessive pathologic grooming behavior,leading to hair removal and self-inflicted wounds at overgroomed sites. This behavior is similar to the behavior of humans suffering from the obsessive-compulsive spectrum disorder (OCD) trichotillomania.
10085	This gene encodes a selenoprotein containing selenocysteine at the active site. The selenocysteine is encoded by the usual stop codon UGA. Mutations in this gene cause the classical phenotype of multiminicore disease and congenital muscular dystrophy with spinal rigidity and restrictive respiratory syndrome. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
10086	
10087	
10088	
10089	
10090	
10091	
10092	
10093	The protein encoded by this gene is a member of the BAG1-related protein family. BAG1 is an anti-apoptotic protein that functions through interactions with a variety of cell apoptosis and growth related proteins including BCL-2, Raf-protein kinase, steroid hormone receptors, growth factor receptors and members of the heat shock protein 70 kDa family. This protein contains a BAG domain near the C-terminus, which could bind and inhibit the chaperone activity of Hsc70/Hsp70. Three transcript variants encoding two different isoforms have been found for this gene.
10094	
10095	
10096	Retinoids are indispensable light-sensitive elements of vision and also serve as essential modulators of cellular differentiation and proliferation in diverse cell types. RDH12 belongs to a family of dual-specificity retinol dehydrogenases that metabolize both all-trans- and cis-retinols (Haeseleer et al., 2002).[supplied by OMIM]
10097	
10098	The protein encoded by this gene is one of the highly conserved mini-chromosome maintenance proteins (MCM) that are essential for the initiation of eukaryotic genome replication. The hexameric protein complex formed by MCM proteins is a key component of the pre-replication complex (pre_RC) and may be involved in the formation of replication forks and in the recruitment of other DNA replication related proteins. The MCM complex consisting of this protein and MCM2, 6 and 7 proteins possesses DNA helicase activity, and may act as a DNA unwinding enzyme. The phosphorylation of this protein by CDC2 kinase reduces the DNA helicase activity and chromatin binding of the MCM complex. This gene is mapped to a region on the chromosome 8 head-to-head next to the PRKDC/DNA-PK, a DNA-activated protein kinase involved in the repair of DNA double-strand breaks. Alternatively spliced transcript variants encoding the same protein have been reported.
10099	
10100	Neurotensin receptor 1 belongs to the large superfamily of G-protein coupled receptors. NTSR1 mediates the multiple functions of neurotensin, such as hypotension, hyperglycemia, hypothermia, antinociception, and regulation of intestinal motility and secretion.
10101	
10102	This gene encodes a protein possessing an RNA helicase motif containing a DEXH box in its amino terminus and an RNA motif in the carboxy terminus. The encoded protein functions as a ribonuclease and is required by the RNA interference and small temporal RNA (stRNA) pathways to produce the active small RNA component that represses gene expression. Two transcript variants encoding the same protein have been identified for this gene.
10103	
10104	
10105	The Wilms tumor suppressor gene WT1 appears to play a role in both transcriptional and posttranscriptional regulation of certain cellular genes. This gene encodes a WT1-associating protein, which is a ubiquitously expressed nuclear protein. Like WT1 protein, this protein is localized throughout the nucleoplasm as well as in speckles and partially colocalizes with splicing factors. Alternative splicing of this gene results in three transcript variants, two of which encode the same isoform.
10106	The protein encoded by this gene contains a RING finger domain, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions.
10107	The protein encoded by this gene belongs to the CD28 and CTLA-4 cell-surface receptor family. It forms homodimers and plays an important role in cell-cell signaling, immune responses, and regulation of cell proliferation.
10108	
10109	This gene encodes a member of the forkhead class of DNA-binding proteins. These hepatocyte nuclear factors are transcriptional activators for liver-specific transcripts such as albumin and transthyretin, and they also interact with chromatin. Similar family members in mice have roles in the regulation of metabolism and in the differentiation of the pancreas and liver.
10110	
10111	
10112	
10113	cAMP is a signaling molecule important for a variety of cellular functions. cAMP exerts its effects by activating the cAMP-dependent protein kinase (AMPK), which transduces the signal through phosphorylation of different target proteins. The inactive holoenzyme of AMPK is a tetramer composed of two regulatory and two catalytic subunits. cAMP causes the dissociation of the inactive holoenzyme into a dimer of regulatory subunits bound to four cAMP and two free monomeric catalytic subunits. Four different regulatory subunits and three catalytic subunits of AMPK have been identified in humans. The protein encoded by this gene is a member of the Ser/Thr protein kinase family and is a catalytic subunit of AMPK. Three alternatively spliced transcript variants encoding distinct isoforms have been observed.
10114	
10115	
10116	
10117	
10118	MYH7 encodes the cardiac muscle beta (or slow) isoform of myosin. Changes in the relative abundance of MYH7 and MYH6 (the alpha, or fast, isoform of cardiac myosin heavy chain) correlate with the contractile velocity of cardiac muscle. Mutations in MYH7 are associated with familial hypertrophic cardiomyopathy.
10119	
10120	The glypicans comprise a family of glycosylphosphatidylinositol-anchored heparan sulfate proteoglycans. The glypicans have been implicated in the control of cell growth and division. Glypican 6 is a putative cell surface coreceptor for growth factors, extracellular matrix proteins, proteases and anti-proteases.
10121	The protein encoded by this gene belongs to the connexin family, alpha-type subfamily. It is an integral membrane protein that forms transmembrane channels, and may function in small molecule transport, muscle contraction and cell-cell communication. Alternative splicing occurs at this locus and two transcript variants encoding the same protein have been identified.
10122	
10123	This gene encodes an enzyme involved in catalyzing the conversion of angiotensin I into a physiologically active peptide angiotensin II. Angiotensin II is a potent vasopressor and aldosterone-stimulating peptide that controls blood pressure and fluid-electrolyte balance. This enzyme plays a key role in the renin-angiotensin system. Many studies have associated the presence or absence of a 287 bp Alu repeat element in this gene with the levels of circulating enzyme or cardiovascular pathophysiologies. Two most abundant alternatively spliced variants of this gene encode two isozymes - the somatic form and the testicular form that are equally active. Multiple additional alternatively spliced variants have been identified but their full length nature has not been determined.
10124	
10125	
10126	
10127	The protein encoded by this gene is a transcription factor that contains two GATA-type zinc fingers. The encoded protein is known to bind to hepatocyte nuclear factor-1alpha (HNF-1alpha), and this interaction is essential for cooperative activation of the intestinal lactase-phlorizin hydrolase promoter. In other organisms, similar proteins may be involved in the establishment of cardiac smooth muscle cell diversity.
10128	
10129	
10130	
10131	
10132	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit in the proteasome. This catalytic subunit is not present in the immunoproteasome and is replaced by catalytic subunit 3i (proteasome beta 8 subunit).
10133	
10134	
10135	
10136	
10137	
10138	
10139	Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide. They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid. This gene product is a type I membrane protein that is highly expressed in normal tissues, such as kidney, colon and pancreas, and has been found to be overexpressed in 10% of clear cell renal carcinomas. Two transcript variants encoding different isoforms have been identified for this gene.
10140	The protein encoded by this protein regulates inositol phosphate metabolism by phosphorylation of second messenger inositol 1,4,5-trisphosphate to Ins(1,3,4,5)P4. The activity of this encoded protein is responsible for regulating the levels of a large number of inositol polyphosphates that are important in cellular signaling. Both calcium/calmodulin and protein phosphorylation mechanisms control its activity.
10141	
10142	
10143	
10144	
10145	
10146	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein contains 2 bromodomains and multiple WD repeats, and the function of this protein is not known. This gene is located within the Down syndrome region-2 on chromosome 21. Alternative splicing of this gene generates 3 transcript variants diverging at the 3@#$%& ends.
10147	
10148	
10149	
10150	
10151	
10152	
10153	
10154	
10155	
10156	
10157	
10158	
10159	
10160	Aquaporins are a family of small integral membrane proteins related to the major intrinsic protein (MIP or AQP0). This gene encodes an aquaporin which functions as a molecular water channel protein. It is a homotetramer with 6 bilayer spanning domains and N-glycosylation sites. The protein physically resembles channel proteins and is abundant in erythrocytes and renal tubes. The gene encoding this aquaporin is a possible candidate for disorders involving imbalance in ocular fluid movement. Alternative splicing results in two transcript variants which encode the same protein.
10161	The product of this gene belongs to the family of purinoceptors for ATP. This receptor functions as a ligand-gated ion channel with relatively high calcium permeability. Binding to ATP mediates synaptic transmission between neurons and from neurons to smooth muscle, being responsible, for example, for sympathetic vasoconstriction in small arteries, arterioles and vas deferens. Mouse studies suggest that this receptor is essential for normal male reproductive function. It is possible that the development of selective antagonists for this receptor may provide an effective non-hormonal male contraceptive pill.
10162	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein is widely expressed in various types of tissues with a high expression level in thyroid gland. In smooth muscle cells, this protein co-expresses with other family members in the nucleus and in stress fibers, suggesting diverse functions in signal transduction. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene.
10163	This gene encodes an enzyme that is involved in cholesterol biosynthesis. It is thought to be an integral membrane protein.
10164	The protein encoded by this gene is a DNA-binding transcription factor and is a member of the NR1 subfamily of nuclear hormone receptors. The specific functions of this protein are not known; however, studies of a similar gene in mice have shown that this gene may be essential for lymphoid organogenesis and may play an important regulatory role in thymopoiesis. In addition, studies in mice suggest that the protein encoded by this gene may inhibit the expression of Fas ligand and IL2. Two transcript variants encoding different isoforms have been found for this gene.
10165	The protein encoded by this gene is a member of the fibroblast growth factor receptor family, where amino acid sequence is highly conserved between members and throughout evolution. FGFR family members differ from one another in their ligand affinities and tissue distribution. A full-length representative protein consists of an extracellular region, composed of three immunoglobulin-like domains, a single hydrophobic membrane-spanning segment and a cytoplasmic tyrosine kinase domain. The extracellular portion of the protein interacts with fibroblast growth factors, setting in motion a cascade of downstream signals, ultimately influencing mitogenesis and differentiation. This particular family member binds both acidic and basic fibroblast growth factors and is involved in limb induction. Mutations in this gene can lead to Pfeiffer syndrome and Jackson-Weiss syndrome. The genomic organization of this gene is very similar to family members 2-4, encompassing 19 exons that are subject to complex alternative splicing, which allows for structural, tissue expression and ligand affinity variations among the isoforms.
10166	
10167	
10168	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. This gene encodes the subunit s, also known as factor B, of the proton channel. This subunit is necessary for the energy transduction activity of the ATP synthase complexes. Alternatively spliced transcript variants encoding different isoforms have been identified.
10169	
10170	
10171	
10172	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. The protein encoded is a member of the histone H1 family. This gene contains introns, unlike most histone genes. The protein encoded is a member of the histone H1 family. The related mouse gene is expressed only in oocytes.
10173	
10174	This gene encodes an alpha-1 subunit of a voltage-dependent calcium channel. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization. The alpha-1 subunit consists of 24 transmembrane segments and forms the pore through which ions pass into the cell. The calcium channel consists of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. There are multiple isoforms of each of these proteins, either encoded by different genes or the result of alternative splicing of transcripts. The protein encoded by the gene described in this record binds to and is inhibited by dihydropyridine. Many alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
10175	This gene encodes an alpha-1 subunit of a voltage-dependent calcium channel. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization. The alpha-1 subunit consists of 24 transmembrane segments and forms the pore through which ions pass into the cell. The calcium channel consists of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. There are multiple isoforms of each of these proteins, either encoded by different genes or the result of alternative splicing of transcripts. The protein encoded by the gene described in this record binds to and is inhibited by dihydropyridine. Many alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
10176	
10177	
10178	
10179	This gene product belongs to the neurexin family, members of which function in the vertebrate nervous system as cell adhesion molecules and receptors. This protein, like other neurexin proteins, contains epidermal growth factor repeats and laminin G domains. In addition, it includes an F5/8 type C domain, discoidin/neuropilin- and fibrinogen-like domains, and thrombospondin N-terminal-like domains. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
10180	
10181	
10182	HLA-A belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-A alleles have been described.
10183	
10184	Lymphoid nuclear protein related to AF4 (LAF4) is related to MLLT2 which is also referred to as AF4. LAF4 is a tissue-restricted nuclear transcriptional activator that is preferentially expressed in lymphoid tissue.  Isolation of LAF4 has defined a highly conserved LAF4/MLLT2 gene family of nuclear transcription factors that may function in lymphoid development and oncogenesis.
10185	This gene, a member of the SNW gene family, encodes a coactivator that enhances transcription from some Pol II promoters. This coactivator can bind to the ligand-binding domain of the vitamin D receptor and to retinoid receptors to enhance vitamin D-, retinoic acid-, estrogen-, and glucocorticoid-mediated gene expression. It can also interact with poly(A)-binding protein 2 to directly control the expression of muscle-specific genes at the transcriptional level. Finally, the protein may be involved in oncogenesis since it interacts with a region of SKI oncoproteins that is required for transforming activity.
10186	AKT2 is a putative oncogene encoding a protein belonging to a subfamily of serine/threonine kinases containing SH2-like (Src homology 2-like) domains. Furthermore, AKT2 was shown to be amplified and overexpressed in 2 of 8 ovarian carcinoma cell lines and 2 of 15 primary ovarian tumors. Overexpression of AKT2 contributes to the malignant phenotype of a subset of human ductal pancreatic cancers. AKT2 is a general protein kinase capable of phophorylating several known proteins.
10187	The protein encoded by this gene belongs to the basic helix-loop-helix family of transcription factors. This gene product is one of two closely related family members, the HAND proteins, which are asymmetrically expressed in the developing ventricular chambers and play an essential role in cardiac morphogenesis. Working in a complementary fashion, they function in the formation of the right ventricle and aortic arch arteries, implicating them as mediators of congenital heart disease. In addition, this transcription factor plays an important role in limb and branchial arch development.
10188	
10189	The zeta-sarcoglycan gene measures over 465 kb and localizes to 8p22. This protein is part of the sarcoglycan complex, a group of 6 proteins. The sarcoglycans are all N-glycosylated transmembrane proteins with a short intra-cellular domain, a single transmembrane region and a large extra-cellular domain containing a carboxyl-terminal cluster with several conserved cysteine residues. The sarcoglycan complex is part of the dystrophin-associated glycoprotein complex (DGC), which bridges the inner cytoskeleton and the extra-cellular matrix.
10190	
10191	This gene encodes a cytokine distantly related to type I interferons and the IL-10 family. This gene, interleukin 28A (IL28A), and interleukin 29 (IL29) are three closely related cytokine genes that form a cytokine gene cluster on a chromosomal region mapped to 19q13. Expression of the cytokines encoded by the three genes can be induced by viral infection. All three cytokines have been shown to interact with a heterodimeric class II cytokine receptor that consists of interleukin 10 receptor, beta (IL10RB) and interleukin 28 receptor, alpha (IL28RA).
10192	
10193	
10194	
10195	
10196	
10197	The protein encoded by this gene is a member of the LAR protein-tyrosine phosphatase-interacting protein (liprin) family. Liprins interact with members of LAR family of transmembrane protein tyrosine phosphatases, which are known to be important for axon guidance and mammary gland development. It has been proposed that liprins are multivalent proteins that form complex structures and act as scaffolds for the recruitment and anchoring of LAR family of tyrosine phosphatases. This protein was found to interact with S100A4, a calcium-binding protein related to tumor invasiveness and metastasis. In vitro experiment demonstrated that the interaction inhibited the phosphorylation of this protein by protein kinase C and protein kinase CK2. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
10198	
10199	
10200	
10201	This gene is located within the region of chromosome 9p that harbors tumor suppressor genes critical in carcinogenesis. It is an intronless gene which is downregulated in non-small-cell lung cancer and small-cell lung cancer cell lines, suggesting that it may play a role in lung tumorigenesis.
10202	
10203	
10204	
10205	
10206	This gene encodes a disintegrin and metalloprotease (ADAM) domain 19, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. This member is a type I transmembrane protein and serves as a marker for dendritic cell differentiation. It has also been demonstrated to be an active metalloproteinase, which may be involved in normal physiological and pathological processes such as cells migration, cell adhesion, cell-cell and cell-matrix interactions, and signal transduction. Alternative splicing generates 2 transcript variants for this gene.
10207	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and catalyzes the last steps of estrogen biosynthesis, three successive hydroxylations of the A ring of androgens. Mutations in this gene can result in either increased or decreased aromatase activity; the associated phenotypes suggest that estrogen functions both as a sex steroid hormone and in growth or differentiation. The gene expresses two transcript variants.
10208	Angiopoietins are members of the vascular endothelial growth factor family and the only known growth factors largely specific for vascular endothelium. Angiopoietin-1, angiopoietin-2, and angiopoietin-4 participate in the formation of blood vessels. ANGPTL2 protein is a secreted glycoprotein with homology to the angiopoietins and may exert a function on endothelial cells through autocrine or paracrine action.
10209	
10210	This gene encodes a member of the WAP-type four-disulfide core (WFDC) domain family. The WFDC domain, or WAP signature motif, contains eight cysteines forming four disulfide bonds at the core of the protein, and functions as a protease inhibitor. Most WFDC gene members are localized to chromosome 20q12-q13 in two clusters: centromeric and telomeric. This gene belongs to the telomeric cluster.
10211	
10212	
10213	
10214	
10215	
10216	
10217	
10218	
10219	
10220	
10221	
10222	
10223	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal phosphoprotein that is a component of the 60S subunit. The protein, which is a functional equivalent of the E. coli L7/L12 ribosomal protein, belongs to the L12P family of ribosomal proteins. It plays an important role in the elongation step of protein synthesis. Unlike most ribosomal proteins, which are basic, the encoded protein is acidic. Its C-terminal end is nearly identical to the C-terminal ends of the ribosomal phosphoproteins P0 and P2. The P1 protein can interact with P0 and P2 to form a pentameric complex consisting of P1 and P2 dimers, and a P0 monomer. The protein is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome. Through sequence analysis, two alternatively spliced transcript variants that encode different proteins were found.
10224	
10225	
10226	This gene encodes one of several forms of glutamic acid decarboxylase, identified as a major autoantigen in insulin-dependent diabetes. The enzyme encoded is responsible for catalyzing the production of gamma-aminobutyric acid from L-glutamic acid. A pathogenic role for this enzyme has been identified in the human pancreas since it has been identified as an autoantigen and an autoreactive T cell target in insulin-dependent diabetes. This gene may also play a role in the stiff man syndrome. Deficiency in this enzyme has been shown to lead to pyridoxine dependency with seizures. Alternative splicing of this gene results in two products, the predominant 67-kD form and a less-frequent 25-kD form.
10227	GJA9, also called connexin-36 (CX36), is a member of the connexin gene family that is expressed predominantly in mammalian neurons. Connexins associate in groups of 6 and are organized radially around a central pore to form connexons. Each gap junction intercellular channel is formed by the conjunction of 2 connexons. See GJB2 (MIM 121011) for additional background information on connexins.[supplied by OMIM]
10228	The TEK receptor tyrosine kinase is expressed almost exclusively in endothelial cells in mice, rats, and humans. This receptor possesses a unique extracellular domain containing 2 immunoglobulin-like loops separated by 3 epidermal growth factor-like repeats that are connected to 3 fibronectin type III-like repeats. The ligand for the receptor is angiopoietin-1.   Defects in TEK are associated with inherited venous malformations; the TEK signaling pathway appears to be critical for endothelial cell-smooth muscle cell communication in venous morphogenesis. TEK is closely related to the TIE receptor tyrosine kinase.
10229	
10230	
10231	CBL (MIM 165360) constitutively interacts with SH3 domain-containing proteins and, upon tyrosine phosphorylation, with SH2 domain-containing proteins. The SH3KBP1 gene encodes an 85-kD CBL-interacting protein that enhances tumor necrosis factor (MIM 191160)-mediated apoptotic cell death (Narita et al., 2001).[supplied by OMIM]
10232	CHSY1 synthesizes chondroitin sulfate, a glycosaminoglycan expressed on the surface of most cells and in extracellular matrices. Glycosaminoglycan chains are covalently linked to a wide range of core protein families and regulate many biologic processes, including cell proliferation and recognition, extracellular matrix deposition, and morphogenesis.[supplied by OMIM]
10233	
10234	
10235	
10236	
10237	
10238	
10239	
10240	
10241	
10242	
10243	
10244	
10245	
10246	
10247	
10248	
10249	
10250	
10251	
10252	The protein encoded by this intronless gene is a bZIP transcription factor which can bind as a homodimer to certain promoters and enhancers. It can also form heterodimers with the related proteins CEBP-beta and CEBP-gamma. The encoded protein has been shown to bind to the promoter and modulate the expression of the gene encoding leptin, a protein that plays an important role in body weight homeostasis. Also, the encoded protein can interact with CDK2 and CDK4, thereby inhibiting these kinases and causing growth arrest in cultured cells.
10253	
10254	
10255	
10256	
10257	
10258	
10259	
10260	
10261	
10262	Mitogen-activated protein kinase 4 is a member of the mitogen-activated protein kinase family. Tyrosine kinase growth factor receptors activate mitogen-activated protein kinases which then translocate into the nucleus where it phosphorylates nuclear targets.
10263	
10264	Stromal cell derived receptor 1 is a type I transmembrane protein belonging to the Ig superfamily.  The protein is believed to be involved in cell-cell interactions or cell-substrate interactions.  The alpha and beta transcripts show differential localization within the brain.
10265	
10266	
10267	
10268	
10269	TRIF, or TICAM1, is a Toll/IL1R (MIM 147810) (TIR) domain-containing adaptor molecule, like MYD88 (MIM 602170) and TIRAP (MIM 606252), that induces interferon-beta (IFNB; MIM 147640), specifically interacts with TLR3 (MIM 603029), and activates nuclear factor kappa-B (NFKB; see MIM 164011).[supplied by OMIM]
10270	
10271	
10272	
10273	
10274	
10275	The Golgi apparatus, which participates in glycosylation and transport of proteins and lipids in the secretory pathway, consists of a series of stacked, flattened membrane sacs referred to as cisternae. Interactions between the Golgi and microtubules are thought to be important for the reorganization of the Golgi after it fragments during mitosis. The golgins constitute a family of proteins which are localized to the Golgi. This gene encodes a golgin which structurally resembles its family member GOLGA2, suggesting that they may share a similar function. Several alternatively spliced transcript variants of this gene have been described, but the full-length natures of only three have been determined.
10276	
10277	The protein encoded by this gene is a member of the ADAMTS family. Members of this family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. This gene product contains 2 C-terminal TS motifs, like those found in ADAMTS5 and ADAMTS6 proteins.
10278	The protein encoded by this gene is a member of the ADAMTS family. Members of this family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. This gene product contains 2 C-terminal TS motifs, like those found in ADAMTS5 and ADAMTS6 proteins.
10279	
10280	
10281	
10282	This gene encodes a multifunctional proinflammatory cytokine that belongs to the tumor necrosis factor (TNF) superfamily. This cytokine is mainly secreted by macrophages. It can bind to, and thus functions through its receptors TNFRSF1A/TNFR1 and TNFRSF1B/TNFBR. This cytokine is involved in the regulation of a wide spectrum of biological processes including cell proliferation, differentiation, apoptosis, lipid metabolism, and coagulation. This cytokine has been implicated in a variety of diseases, including autoimmune diseases, insulin resistance, and cancer. Knockout studies in mice also suggested the neuroprotective function of this cytokine.
10283	
10284	
10285	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMPs are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. However, the protein encoded by this gene is a member of the membrane-type MMP (MT-MMP) subfamily; each member of this subfamily contains a potential transmembrane domain suggesting that these proteins are expressed at the cell surface rather than secreted. In addition, it is thought that this protein is activated intracellularly by furin-like enzymes. This protein activates MMP2 by cleavage. The gene has previously been referred to as MMP20 but has been renamed matrix metalloproteinase 25 (MMP25). This gene produces two transcript variants encoding the same protein.
10286	
10287	
10288	
10289	
10290	
10291	The product of this gene is a member of the nuclear factors of activated T cells family of transcription factors. Proteins belonging to this family play a central role in inducible gene transcription during the immune response. This protein regulates gene expression induced by osmotic stress in mammalian cells. Unlike monomeric members of this protein family, this protein exists as a homodimer and forms stable dimers with DNA elements. Five transcript variants encoding three different isoforms have been identified for this gene.
10292	This gene encodes the homolog of the mouse protein Cidea that has been shown to activate apoptosis. This activation of apoptosis is inhibited by the DNA fragmentation factor DFF45 but not by caspase inhibitors. Mice that lack functional Cidea have higher metabolic rates, higher lipolysis in brown adipose tissue and higher core body temperatures when subjected to cold. These mice are also resistant to diet-induced obesity and diabetes. This suggests that in mice this gene product plays a role in thermogenesis and lipolysis. Two alternative transcripts encoding different isoforms have been identified.
10293	The neuronal excitotoxin quinolinate is an intermediate in the de novo synthesis pathway of NAD from tryptophan, and has been implicated in the pathogenesis of several neurodegenerative disorders. Quinolinate is derived from alpha-amino-beta-carboxy-muconate-epsilon-semialdehyde (ACMS). ACMSD (ACMS decarboxylase; EC 4.1.1.45) can divert ACMS to a benign catabolite and thus prevent the accumulation of quinolinate from ACMS.[supplied by OMIM]
10294	
10295	
10296	The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which is associated with cellular proliferation and differentiation. Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. This gene product is localized to the nucleus, and is novel in that it binds directly to RNA and splicing factors, and thus suggested to participate in nuclear mRNA metabolism.
10297	
10298	
10299	
10300	
10301	The protein encoded by this gene belongs to the SEC14 cytosolic factor family. It has similarity to yeast SEC14 and to Japanese flying squid RALBP which suggests a possible role of the gene product in an intracellular transport system.
10302	Tryptases comprise a family of trypsin-like serine proteases, the peptidase family S1. Tryptases are enzymatically active only as heparin-stabilized tetramers, and they are resistant to all known endogenous proteinase inhibitors. Several tryptase genes are clustered on chromosome 16p13.3. These genes are characterized by several distinct features. They have a highly conserved 3@#$%& UTR and contain tandem repeat sequences at the 5@#$%& flank and 3@#$%& UTR which are thought to play a role in regulation of the mRNA stability. These genes have an intron immediately upstream of the initiator Met codon, which separates the site of transcription initiation from protein coding sequence. This feature is characteristic of tryptases but is unusual in other genes. The alleles of this gene exhibit an unusual amount of sequence variation, such that the alleles were once thought to represent two separate genes, alpha and beta 1. Beta tryptases appear to be the main isoenzymes expressed in mast cells; whereas in basophils, alpha tryptases predominate. Tryptases have been implicated as mediators in the pathogenesis of asthma and other allergic and inflammatory disorders.
10303	
10304	
10305	MCC is a candidate for the putative colorectal tumor suppressor gene.  The MCC gene product may be involved in early stages of colorectal neoplasia in both sporadic and familial tumors
10306	The protein encoded by this gene may control steps in the cycling of proteins through the trans-Golgi network to endosomes, lysosomes and the plasma membrane. Mutations in this gene cause the autosomal recessive disorder, chorea-acanthocytosis. Alternative splicing of this gene results in multiple transcript variants.
10307	
10308	Members of the RAS superfamily of GTP-binding proteins, which includes MRAS, are membrane-anchored, intracellular signal transducers responsible for a variety of normal cellular functions. They are oncogenically activated in a significant fraction of tumors.[supplied by OMIM]
10309	
10310	
10311	
10312	
10313	
10314	Transforming growth factors (TGFs) are biologically active polypeptides that reversibly confer the transformed phenotype on cultured cells. TGF-alpha shows about 40% sequence homology with epidermal growth factor (EGF; MIM 131530) and competes with EGF for binding to the EGF receptor (MIM 131550), stimulating its phosphorylation and producing a mitogenic response.[supplied by OMIM]
10315	
10316	
10317	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptor is preferentially expressed in mature B lymphocytes, and may be important for B cell development and autoimmune response. This receptor has been shown to specifically bind to the tumor necrosis factor (ligand) superfamily, member 13b (TNFSF13B/TALL-1/BAFF), and to lead to NF-kappaB and MAPK8/JNK activation. This receptor also binds to various TRAF family members, and thus may transduce signals for cell survival and proliferation.
10318	The specific function of this gene has not yet been determined; however, the protein it encodes is known to be a major constituent of the ARP2/3 complex. This complex is located at the cell surface and is essential to cell shape and motility through lamellipodial actin assembly and protrusion. Two transcript variants encoding different isoforms have been found for this gene.
10319	
10320	
10321	
10322	
10323	
10324	
10325	
10326	
10327	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This protein exhibits mitogenic activity for keratinizing epidermal cells, but essentially no activity for fibroblasts, which is similar to the biological activity of FGF7. Studies of the mouse homolog of suggested that this gene is required for embryonic epidermal morphogenesis including brain development, lung morphogenesis, and initiation of lim bud formation. This gene is also implicated to be a primary factor in the process of wound healing.
10328	This gene encodes a protein which is a member of the Formin/Diaphanous family of proteins. The gene is ubiquitously expressed but is found in abundance in the spleen. The encoded protein has sequence homology to Diaphanous and Formin proteins within the Formin Homology (FH)1 and FH2 domains. It also contains a coiled-coil domain, a collagen-like domain, two nuclear localization signals, and several potential PKC and PKA phosphorylation sites. It is a predominantly cytoplasmic protein and is expressed in a variety of human cell lines. The function of this protein has not been known.
10329	
10330	The anaphase-promoting complex (APC) consists of at least 8 protein subunits, including APC5, CDC27 (APC3; MIM 116946), CDC16 (APC6; MIM 603461), and CDC23 (APC8; MIM 603462).[supplied by OMIM]
10331	
10332	
10333	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
10334	
10335	
10336	
10337	
10338	
10339	
10340	
10341	
10342	Members of the Ikaros (ZNFN1A1; MIM 603023) family of transcription factors, which includes Eos, are expressed in lymphocytes and are implicated in the control of lymphoid development.[supplied by OMIM]
10343	The WNT gene family consists of structurally related genes that encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It is expressed in gastric cancer cell lines. The protein encoded by this gene shows 75% amino acid identity to chicken Wnt14, which has been shown to play a central role in initiating synovial joint formation in the chick limb. This gene is clustered with another family member, WNT3A, in the chromosome 1q42 region.
10344	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. The gene encodes a protein with an extracellular domain containing 7 cadherin repeats. The gene product is an integral membrane protein that is thought to function in cell-cell recognition and adhesion. Alternative splicing yields isoforms with unique cytoplasmic tails.
10345	
10346	
10347	Cholesterol homeostasis is regulated, at least in part, by sterol regulatory element (SRE)-binding proteins (e.g., SREBP1; MIM 184756) and by liver X receptors (e.g., LXRA; MIM 602423). Upon sterol depletion, LXRs are inactive and SREBPs are cleaved, after which they bind promoter SREs and activate genes involved in cholesterol biosynthesis and uptake. Sterol transport is mediated by vesicles or by soluble protein carriers, such as steroidogenic acute regulatory protein (STAR; MIM 600617). STAR is homologous to a family of proteins containing a 200- to 210-amino acid STAR-related lipid transfer (START) domain, including STARD6 (Soccio et al., 2002).[supplied by OMIM]
10348	
10349	
10350	
10351	
10352	The human alpha globin gene cluster located on chromosome 16 spans about 30 kb and includes the following five loci:  5@#$%&- zeta - pseudozeta - pseudoalpha-1 - alpha-2 - alpha-1 -3@#$%&. The alpha-2 (HBA2) and alpha-1 (HBA1) coding sequences are identical. These genes differ slightly over the 5@#$%& untranslated regions and the introns, but they differ significantly over the 3@#$%& untranslated regions. Two alpha chains plus two beta chains constitute HbA, which in normal adult life comprises about 97% of  the total hemoglobin; alpha chains combine with delta chains to constitute HbA-2, which with HbF (fetal hemoglobin) makes up the  remaining 3% of adult hemoglobin.  Alpha thalassemias result from deletions of each of the alpha genes as well as deletions of both HBA2 and HBA1 respectively; some nondeletion alpha thalassemias have also been reported.
10353	This gene is a member of the MAGEB gene family. The members of this family have their entire coding sequences located in the last exon, and the encoded proteins show 50 to 68% sequence identity to each other. The promoters and first exons of the MAGEB genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. This gene is expressed in testis, and in a significant fraction of tumors of various histological types. The MAGEB genes are clustered on chromosome Xp22-p21.
10354	ARL8 belongs to a family of proteins that are structurally similar to ADP-ribosylation factors (ARFs; see MIM 103180). ARLs and ARFs are part of the RAS superfamily of regulatory GTPases.[supplied by OMIM]
10355	
10356	
10357	
10358	Spermidine/spermine N(1)-acetyltransferase (SPD/SPM acetyltransferase) is a rate-limiting enzyme in the catabolic pathway of polyamine metabolism. It catalyzes the N(1)-acetylation of spermidine and spermine and, by the successive activity of polyamine oxidase, spermine can be converted to spermidine and spermidine to putrescine.[supplied by OMIM]
10359	
10360	
10361	GPT (MIM 138200) and GPT2 (EC 2.6.1.2), also known as alanine transaminases, are pyridoxal enzymes that catalyze the reversible transamination between alanine and 2-oxoglutarate to form pyruvate and glutamate. By mediating the conversion of these 4 major intermediate metabolites, these transaminases have roles in gluconeogenesis and in amino acid metabolism.[supplied by OMIM]
10362	
10363	The protein encoded by this gene belongs to a small class of the protein tyrosine phosphatase (PTP) family. PTPs are cell signaling molecules that play regulatory roles in a variety of cellular processes. PTPs in this class contain a protein tyrosine phosphatase catalytic domain and a characteristic C-terminal prenylation motif. This PTP has been shown to primarily associate with plasmic and endosomal membrane through its C-terminal prenylation. This PTP was found to interact with the beta-subunit of Rab geranylgeranyltransferase II (beta GGT II), and thus may function as a regulator of GGT II activity. Overexpression of this gene in mammalian cells conferred a transformed phenotype, which suggested its role in tumorigenesis. Three alternatively spliced transcript variants, which encode two distinct isoforms, have been described.
10364	
10365	This gene encodes a member of the family 1 of G-protein coupled receptors. This receptor mediates its action by association with G proteins that activate a phosphatidylinositol-calcium second messenger system. This receptor is a selenoprotein containing selenocysteine at the active site. The selenocysteine is encoded by the usual nonsense (stop) codon UGA. This gene has a pseudogene located on chromosome 14. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
10366	GTP-binding proteins, or G proteins, constitute a superfamily capable of binding GTP or GDP. G proteins are activated by binding GTP and are inactivated by hydrolyzing GTP to GDP. This general mechanism enables G proteins to perform a wide range of biologic activities.[supplied by OMIM]
10367	
10368	
10369	
10370	
10371	
10372	
10373	The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is specifically expressed in the basal layer of the epidermis with family member KRT14. Mutations in these genes have been associated with a complex of diseases termed epidermolysis bullosa simplex. The type II cytokeratins are clustered in a region of chromosome 12q12-q13.
10374	
10375	
10376	
10377	This gene encodes a flavin adenine dinucleotide (FAD)-dependent oxidoreductase which catalyzes the reduction of the delta-24 double bond of sterol intermediates during cholesterol biosynthesis. The protein contains a leader sequence that directs it to the endoplasmic reticulum membrane. Missense mutations in this gene have been associated with desmosterolosis. Also, reduced expression of the gene occurs in the temporal cortex of Alzheimer disease patients and overexpression has been observed in adrenal gland cancer cells.
10378	This gene encodes a member of the cyclin family. Cyclins are important regulators of cell cycle transitions through their ability to bind and activate cyclin-dependent protein kinases. This member also belongs to the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class and it was one of the first proteins in which the F-box motif was identified.
10379	Aminoacyl-tRNA synthetases catalyze the aminoacylation of tRNA by their cognate amino acid. Because of their central role in linking amino acids with nucleotide triplets contained in tRNAs, aminoacyl-tRNA synthetases are thought to be among the first proteins that appeared in evolution. Tyrosyl-tRNA synthetase belongs to the class I tRNA synthetase family. Cytokine activities have also been observed for the human tyrosyl-tRNA synthetase, after it is split into two parts, an N-terminal fragment that harbors the catalytic site and a C-terminal fragment found only in the mammalian enzyme. The N-terminal fragment is an interleukin-8-like cytokine, whereas the released C-terminal fragment is an EMAP II-like cytokine.
10380	Potassium channels are present in most mammalian cells, where they participate in a wide range of physiologic responses. The protein encoded by this gene is an integral membrane protein and inward-rectifier type potassium channel. The encoded protein, which has a greater tendency to allow potassium to flow into a cell rather than out of a cell, is controlled by G-proteins. It associates with another G-protein-activated potassium channel to form a heteromultimeric pore-forming complex.
10381	
10382	
10383	The protein encoded by this gene belongs to the BCL2 protein family. BCL2 family members form hetero- or homodimers and act as anti- or pro-apoptotic regulators that are involved in a wide variety of cellular activities. This protein contains a single BCL2 homology domain 3 (BH3), and has been shown to bind BCL2 proteins and function as an apoptotic activator. This protein is found to be sequestered to myosin V motors by its association with dynein light chain 2, which may be important for sensing intracellular damage and triggering apoptosis. Alternatively spliced transcript variants encoding different isoforms have been identified.
10384	
10385	
10386	
10387	
10388	The protein encoded by this gene is a type II membrane protein that may be present in the Golgi apparatus. The encoded protein, which is a member of glycosyltransferase family 29, may be involved in the synthesis of gangliosides GD1c, GT1a, GQ1b, and GT3 from GD1a, GT1b, GM1b, and GD3, respectively.
10389	
10390	
10391	
10392	
10393	
10394	
10395	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The protein encoded by this gene is a member of the DEAD/H box family of splicing factors. This protein resembles yeast Prp16 more closely than other DEAD/H family members. It is an ATPase and essential for the catalytic step II in pre-mRNA splicing process.
10396	Fibrosin is a lymphokine secreted by activated lymphocytes that induces fibroblast proliferation (Prakash and Robbins, 1998).[supplied by OMIM]
10397	LIM domain only 4 is a cysteine-rich, two LIM domain-containing protein that may play a role as a transcriptional regulator or possibly an oncogene.  Its mRNA is characterized by a GC-rich 5@#$%& region and by multiple ATTT motifs in the 3@#$%& region.  A variant transcript missing a portion of the 5@#$%& region has been identified but cannot be confirmed because of the GC-rich nature of the region.
10398	
10399	
10400	This gene encodes a protein that contains domains of thioredoxin and ERV1, members of two long-standing gene families. The gene expression is induced as fibroblasts begin to exit the proliferative cycle and enter quiescence, suggesting that this gene plays an important role in growth regulation. Two transcript variants encoding two different isoforms have been found for this gene.
10401	This gene encodes a protein that contains domains of thioredoxin and ERV1, members of two long-standing gene families. The gene expression is induced as fibroblasts begin to exit the proliferative cycle and enter quiescence, suggesting that this gene plays an important role in growth regulation. Two transcript variants encoding two different isoforms have been found for this gene.
10402	
10403	
10404	Calcyphosine-2 is a calcium-binding protein with 2 EF-hand motifs (Wang et al., 2002).[supplied by OMIM]
10405	Hydrolases are transported to lysosomes after binding to mannose 6-phosphate receptors in the trans-Golgi network. This gene encodes the enzyme that catalyzes the second step in the formation of the mannose 6-phosphate recognition marker on lysosomal hydrolases. Commonly known as @#$%&uncovering enzyme@#$%& or UCE, this enzyme removes N-acetyl-D-glucosamine (GlcNAc) residues from GlcNAc-alpha-P-mannose moieties and thereby produces the recognition marker. This reaction most likely occurs in the trans-Golgi network. This enzyme functions as a homotetramer of two disulfide-linked homodimers. In addition to having an N-terminal signal peptide, the protein@#$%&s C-terminus contains multiple signals for trafficking it between lysosomes, the plasma membrane, and trans-Golgi network.
10406	
10407	
10408	
10409	
10410	
10411	This gene encodes a trypsinogen, which is a member of the trypsin family of serine proteases. This enzyme is secreted by the pancreas and cleaved to its active form in the small intestine. It is active on peptide linkages involving the carboxyl group of lysine or arginine. Mutations in this gene are associated with hereditary pancreatitis. This gene and several other trypsinogen genes are localized to the T cell receptor beta locus on chromosome 7.
10412	The protein encoded by this gene is a member of the Ser/Thr portein kinase family. This protein contains a Cdc42/Rac-binding p21 binding domain resembling that of PAK kinase. The kinase domain of this protein is most closely related to that of myotonic dystrophy kinase-related ROK. Studies of the similar gene in rat suggested that this kinase may act as a downstream effector of Cdc42 in cytoskeletal reorganization.
10413	
10414	
10415	
10416	
10417	
10418	The protein encoded by this gene is a member of the apolipoprotein L family and may play a role in lipid exchange and transport throughout the body, as well as in reverse cholesterol transport from peripheral cells to the liver. Two transcript variants encoding two different isoforms have been found for this gene. Only one of the isoforms appears to be a secreted protein.
10419	
10420	
10421	
10422	This gene encodes one of seven subunits of the human Arp2/3 protein complex. The Arp2/3 protein complex has been implicated in the control of actin polymerization in cells and has been conserved through evolution. The exact role of the protein encoded by this gene, the p20 subunit, has yet to be determined.
10423	
10424	
10425	
10426	
10427	
10428	
10429	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
10430	Sphingosine-1-phosphate (SPP) is a novel lipid messenger with both intracellular and extracellular functions. Intracellularly, it regulates proliferation and survival, and extracellularly, it is a ligand for EDG1 (MIM 601974). Various stimuli increase cellular levels of SPP by activation of sphingosine kinase (SPHK), the enzyme that catalyzes the phosphorylation of sphingosine. Competitive inhibitors of SPHK block formation of SPP and selectively inhibit cellular proliferation induced by a variety of factors, including platelet-derived growth factor (e.g., MIM 173430) and serum.[supplied by OMIM]
10431	
10432	
10433	
10434	
10435	This gene encodes a member of the family of adenylate cyclases, which are membrane-associated enzymes that catalyze the formation of the secondary messenger cyclic adenosine monophosphate (cAMP). Mouse studies show that adenylate cyclase 4, along with adenylate cyclases 2 and 3, is expressed in olfactory cilia, suggesting that several different adenylate cyclases may couple to olfactory receptors and that there may be multiple receptor-mediated mechanisms for the generation of cAMP signals.
10436	
10437	
10438	
10439	
10440	
10441	
10442	
10443	
10444	This gene encodes a mitochondrial flavoprotein that initiates electron transport for cytochromes P450 receiving electrons from NADPH. Two alternatively spliced transcripts for this gene exist.
10445	
10446	
10447	The protein encoded by this gene is a member of the type 3 G protein-coupled receptor family. Members of this superfamily are characterized by a signature 7-transmembrane domain motif. The specific function of this protein is unknown; however, this protein may mediate the cellular effects of retinoic acid on the G protein signal transduction cascade. Two transcript variants encoding different isoforms have been found for this gene.
10448	
10449	
10450	This gene encodes a member of the Golgi-localized, gamma adaptin ear-containing, ARF-binding (GGA) family. This family includes ubiquitous coat proteins that regulate the trafficking of proteins between the trans-Golgi network and the lysosome. These proteins share an amino-terminal VHS domain which mediates sorting of the mannose 6-phosphate receptors at the trans-Golgi network. They also contain a carboxy-terminal region with homology to the ear domain of gamma-adaptins. Alternative splicing of this gene results in two transcript variants.
10451	
10452	
10453	The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is specifically expressed in the simple epithelia lining the cavities of the internal organs and in the gland ducts and blood vessels. The genes encoding the type II cytokeratins are clustered in a region of chromosome 12q12-q13. Alternative splicing may result in several transcript variants; however, not all variants have been fully described.
10454	The erythropoietin receptor is a member of the cytokine receptor family. Upon erythropoietin binding, the erythropoietin receptor activates Jak2 tyrosine kinase which activates different intracellular pathways including: Ras/MAP kinase, phosphatidylinositol 3-kinase and STAT transcription factors. The stimulated erythropoietin receptor appears to have a role in erythroid cell survival. Defects in the erythropoietin receptor may produce erythroleukemia and familial erythrocytosis.
10455	O-linked N-acetylglucosamine (O-GlcNAc) transferase (OGT) catalyzes the addition of a single N-acetylglucosamine in O-glycosidic linkage to serine or threonine residues. Since both phosphorylation and glycosylation compete for similar serine or threonine residues, the two processes may compete for sites, or they may alter the substrate specificity of nearby sites by steric or electrostatic effects. The protein contains nine tetratricopeptide repeats and a putative bipartite nuclear localization signal. Three alternatively spliced transcript variants encoding distinct isoforms have been reported.
10456	O-linked N-acetylglucosamine (O-GlcNAc) transferase (OGT) catalyzes the addition of a single N-acetylglucosamine in O-glycosidic linkage to serine or threonine residues. Since both phosphorylation and glycosylation compete for similar serine or threonine residues, the two processes may compete for sites, or they may alter the substrate specificity of nearby sites by steric or electrostatic effects. The protein contains nine tetratricopeptide repeats and a putative bipartite nuclear localization signal. Three alternatively spliced transcript variants encoding distinct isoforms have been reported.
10457	ITAGV encodes integrin alpha chain V. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. The I-domain containing integrin alpha V undergoes post-translational cleavage to yield disulfide-linked heavy and light chains, that combine with multiple integrin beta chains to form different integrins. Among the known associating beta chains (beta chains 1,3,5,6, and 8; @#$%&ITGB1@#$%&, @#$%&ITGB3@#$%&, @#$%&ITGB5@#$%&, @#$%&ITGB6@#$%&, and @#$%&ITGB8@#$%&), each can interact with extracellular matrix ligands; the alpha V beta 3 integrin, perhaps the most studied of these, is referred to as the Vitronectin receptor (VNR). In addition to adhesion, many integrins are known to facilitate signal transduction.
10458	
10459	
10460	
10461	EMCN is a mucin-like sialoglycoprotein that interferes with the assembly of focal adhesion complexes and inhibits interaction between cells and the extracellular matrix (Kinoshita et al., 2001).[supplied by OMIM]
10462	
10463	
10464	
10465	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. This gene and the gene for a semaphorin class 4 protein (SEMA4G) overlap at map location 10q24.31 and are transcribed in opposite directions. Sequence analysis identified multiple transcript variants encoding at least four different protein isoforms.
10466	This gene encodes the substrate of breast tumor kinase, an Src-type non-receptor tyrosine kinase. The encoded protein possesses domains and several tyrosine phosphorylation sites characteristic of adaptor proteins that mediate the interactions linking proteins involved in signal transduction pathways. Alternative splicing results in multiple transcript variants.
10467	The protein encoded by this gene is a type IV membrane protein. It is present in the plasma membrane and intracellular vesicles. It may also be associated with the cytoskeleton. This protein may function in vesicle trafficking, membrane fusion, protein complex assembly and cell motility. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
10468	
10469	
10470	This gene encodes an enzyme involved in catalyzing the conversion of angiotensin I into a physiologically active peptide angiotensin II. Angiotensin II is a potent vasopressor and aldosterone-stimulating peptide that controls blood pressure and fluid-electrolyte balance. This enzyme plays a key role in the renin-angiotensin system. Many studies have associated the presence or absence of a 287 bp Alu repeat element in this gene with the levels of circulating enzyme or cardiovascular pathophysiologies. Two most abundant alternatively spliced variants of this gene encode two isozymes - the somatic form and the testicular form that are equally active. Multiple additional alternatively spliced variants have been identified but their full length nature has not been determined.
10471	Apolipoprotein H has been implicated in a variety of physiologic pathways including lipoprotein metabolism, coagulation, and the production of antiphospholipid autoantibodies.  APOH may be a required cofactor for anionic phospholipid binding by the antiphospholipid autoantibodies found in sera of many patients with lupus and primary antiphospholipid syndrome, but it does not seem to be required for the reactivity of antiphospholipid autoantibodies associated with infections.
10472	
10473	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
10474	
10475	Chromogranin B is a tyrosine-sulfated secretory protein found in a wide variety of peptidergic endocrine cells. Chromogranin functions as a neuroendocrine secretory granule protein which may be the precursor for other biologically active peptides.
10476	BCL9 is associated with B-cell acute lymphoblastic leukemia. It may be a target of translocation in B-cell malignancies with abnormalities of 1q21. Its function is unknown. The overexpression of BCL9 may be of pathogenic significance in B-cell malignancies.
10477	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family, and is the first signaling molecule shown to influence the sex-determination cascade. It encodes a protein which shows 98% amino acid identity to the Wnt4 protein of mouse and rat. This gene and DAX1, a gene known to antagonize the testis-determining factor, play a concerted role in both the control of female development and the prevention of testes formation. This gene and another two family members, WNT2 and WNT7B, may be associated with abnormal proliferation in breast tissue.
10478	
10479	
10480	
10481	
10482	
10483	
10484	
10485	
10486	The product of this gene belongs to the family of G-protein coupled receptors. This family has several receptor subtypes with different pharmacological selectivity, which overlaps in some cases, for various adenosine and uridine nucleotides. This receptor functions as a receptor for extracellular ATP and ADP. In platelets binding to ADP leads to mobilization of intracellular calcium ions via activation of phospholipase C, a change in platelet shape, and probably to platelet aggregation.
10487	
10488	
10489	
10490	
10491	
10492	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in stimulation of Ca2+-induced Ca2+ release, inhibition of microtubule assembly, and inhibition of protein kinase C-mediated phosphorylation. Reduced expression of this protein has been implicated in cardiomyopathies.
10493	
10494	
10495	
10496	The WNK4 gene encodes a serine-threonine kinase expressed in distal nephron.[supplied by OMIM]
10497	
10498	POLE4 is a histone-fold protein that interacts with other histone-fold proteins to bind DNA in a sequence-independent manner. These histone-fold protein dimers combine within larger enzymatic complexes for DNA transcription, replication, and packaging.[supplied by OMIM]
10499	
10500	
10501	
10502	
10503	The Tousled-like kinases, first described in Arabidopsis, are nuclear serine/threonine kinases that are potentially involved in the regulation of chromatin assembly.[supplied by OMIM]
10504	
10505	
10506	
10507	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
10508	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. This protein is a transcriptional repressor and is thought to play a role in the anterior/posterior axis of the tetrapod forelimb. Mutations in this gene cause ulnar-mammary syndrome, affecting limb, apocrine gland, tooth, hair, and genital development. Alternative splicing of this gene results in three transcript variants encoding different isoforms; however, the full length nature of one variant has not been determined.
10509	Acetyl-Coenzyme A acetyltransferase 2 is an enzyme involved in lipid metabolism.  Reported patients with ACAT2 deficiency have shown severe mental retardation and hypotonus.  The ACAT2 gene shows complementary overlapping with the 3-prime region of the TCP1 gene in both mouse and human. These genes are encoded on opposite strands of DNA, as well as in opposite transcriptional orientation.
10510	
10511	
10512	
10513	
10514	
10515	
10516	
10517	
10518	EPN1 is an endocytic accessory protein that interacts with EPS15 (MIM 600051), the alpha subunit of the clathrin adaptor AP2 (AP2A1; MIM 601026), and clathrin (see MIM 118960), as well as with other accessory proteins for the endocytosis of clathrin-coated vesicles.[supplied by OMIM]
10519	
10520	The protein encoded by this gene is a peripheral membrane protein localized to the trans-Golgi network. It is sensitive to brefeldin A. This encoded protein contains a GRIP domain which is thought to be used in targeting. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
10521	
10522	
10523	
10524	
10525	
10526	
10527	
10528	This gene was identified as a gene whose expression can be induced by the tumor necrosis factor alpha (TNF) in umbilical vein endothelial cells. Studies of a similar gene in mouse suggest that the expression of this gene is developmentally regulated in a tissue-specific manner.
10529	
10530	
10531	
10532	
10533	
10534	
10535	
10536	
10537	
10538	The Fanconi anemia complementation group (FANC) currently includes FANCA, FANCB, FANCC, FANCD1 (also called BRCA2), FANCD2, FANCE, FANCF, FANCG, and FANCL. Fanconi anemia is a genetically heterogeneous recessive disorder characterized by cytogenetic instability, hypersensitivity to DNA crosslinking agents, increased chromosomal breakage, and defective DNA repair. The members of the Fanconi anemia complementation group do not share sequence similarity; they are related by their assembly into a common nuclear protein complex. This gene encodes the protein for complementation group D2. This protein is monoubiquinated in response to DNA damage, resulting in its localization to nuclear foci with other proteins (BRCA1 AND BRCA2) involved in homology-directed DNA repair. Alternative splicing results in two transcript variants encoding different isoforms.
10539	
10540	This gene encodes a member of the GATA family of zinc-finger transcription factors. Members of this family recognize the GATA motif which is present in the promoters of many genes. This protein is thought to regulate genes involved in embryogenesis and in myocardial differentiation and function. Mutations in this gene have been associated with cardiac septal defects.
10541	
10542	Gamma-aminobutyric acid A receptors [GABA(A) receptors] are ligand-gated chloride channels that mediate inhibitory neurotransmission. This gene encodes GABA(A) receptor-associated protein, which is highly positively charged in its N-terminus and shares sequence similarity with light chain-3 of microtubule-associated proteins 1A and 1B. This protein clusters neurotransmitter receptors by mediating interaction with the cytoskeleton.
10543	
10544	
10545	
10546	The protein encoded by this gene is a member of the E2F family of transcription factors. The E2F family plays a crucial role in the control of cell cycle and action of tumor suppressor proteins and is also a target of the transforming proteins of small DNA tumor viruses. The E2F proteins contain several evolutionally conserved domains found in most members of the family. These domains include a DNA binding domain, a dimerization domain which determines interaction with the differentiation regulated transcription factor proteins (DP), a transactivation domain enriched in acidic amino acids, and a tumor suppressor protein association domain which is embedded within the transactivation domain. This protein and another 2 members, E2F1 and E2F2, have an additional cyclin binding domain. This protein binds specifically to retinoblastoma protein pRB in a cell-cycle dependent manner. Alternative gene splicing is found in the mouse homolog, but has not reported in human yet.
10547	
10548	This gene encodes a member of the DNAJ protein family. DNAJ family members are characterized by a highly conserved amino acid stretch called the @#$%&J-domain@#$%& and function as one of the two major classes of molecular chaperones involved in a wide range of cellular events, such as protein folding and oligomeric protein complex assembly. This family member may also play a role in polyglutamine aggregation in specific neurons. Alternative splicing of this gene results in multiple transcript variants; however, not all variants have been fully described.
10549	This gene encodes a member of the chloride intracellular channel family of proteins. The gene is part of a large triplicated region found on chromosomes 1, 6, and 21. An alternatively spliced transcript variant has been described, but its biological validity has not been determined.
10550	This gene product belongs to the SCAMP family of proteins which are secretory carrier membrane proteins. They function as carriers to the cell surface in post-golgi recycling pathways. Different family members are highly related products of distinct genes, and are usually expressed together. These findings suggest that the SCAMPs may function at the same site during vesicular transport rather than in separate pathways.
10551	
10552	
10553	
10554	The ITGB3 protein product is the integrin beta chain beta 3. Integrins are integral cell-surface proteins composed of an alpha chain and a beta chain. A given chain may combine with multiple partners resulting in different integrins. Integrin beta 3 is found along with the alpha IIb chain in platelets. Integrins are known to participate in cell adhesion as well as cell-surface mediated signalling.
10555	
10556	
10557	
10558	
10559	
10560	
10561	
10562	KRT17 encodes the type I intermediate filament chain keratin 17, expressed in nail bed, hair follicle, sebaceous glands, and other epidermal appendages. Mutations in this gene lead to Jackson-Lawler type pachyonychia congenita and steatocystoma multiplex.
10563	
10564	
10565	
10566	
10567	The protein encoded by this gene is member of the GAP1 family of GTPase-activating proteins. The gene product stimulates the GTPase activity of normal RAS p21 but not its oncogenic counterpart. Acting as a suppressor of RAS function, the protein enhances the weak intrinsic GTPase activity of RAS proteins resulting in the inactive GDP-bound form of RAS, thereby allowing control of cellular proliferation and differentiation. This family member is an inositol 1,3,4,5-tetrakisphosphate-binding protein, like the closely related RAS p21 protein activator 2. The two family members have distinct pleckstrin-homology domains, with this particular member having a domain consistent with its localization to the plasma membrane.
10568	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It encodes a protein which shows 96% amino acid identity to mouse Wnt3A protein, and 84% to human WNT3 protein, another WNT gene product. This gene is clustered with WNT14 gene, another family member, in chromosome 1q42 region.
10569	
10570	
10571	
10572	
10573	
10574	Defensins are cysteine-rich cationic polypeptides that are important in the immunologic response to invading microorganisms. The protein encoded by this gene is secreted and is a member of the beta defensin protein family. Beta defensin genes are found in several clusters throughout the genome, with this gene mapping to a cluster at 20p13.
10575	
10576	
10577	
10578	
10579	
10580	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
10581	Dystrophin is a large, rod-like cytoskeletal protein found at the inner surface of muscle fibers. Dystrophin is missing in Duchenne Muscular Dystrophy patients and is present in reduced amounts in Becker Muscular Dystrophy patients. The protein encoded by this gene is a peripheral membrane protein found associated with dystrophin and dystrophin-related proteins. This gene is a member of the syntrophin gene family, which contains at least two other structurally-related genes.
10582	
10583	This gene encodes an enzyme that catalyzes the second step of glycosylphosphatidylinositol (GPI) biosynthesis, which is the de-N-acetylation of N-acetylglucosaminylphosphatidylinositol (GlcNAc-PI). Study of a similar rat enzyme suggests that this protein localizes to the endoplasmic reticulum.
10584	
10585	This gene encodes a member of the NK family of homeobox-containing proteins. The encoded protein may play a role in skeletal development.
10586	
10587	Histamine is a ubiquitous messenger molecule released from mast cells, enterochromaffin-like cells, and neurons. Its various actions are mediated by histamine receptors H1, H2, H3 and H4. Histamine receptor H4 belongs to the family 1 of G protein-coupled receptors and has the highest homology to histamine receptor H3 among known G protein-coupled receptors.
10588	The protein encoded by this gene is a cell-surface glycoprotein found on melanoma cells. The protein shares sequence similarity and iron-binding properties with members of the transferrin superfamily. The importance of the iron binding function has not yet been identified. This gene resides in the same region of chromosome 3 as members of the transferrin superfamily. Alternative splicing results in two transcript variants.
10589	
10590	
10591	
10592	
10593	
10594	
10595	The protein encoded by this gene is a member of the peptidyl-prolyl cis-trans isomerase (PPIase) family. PPIases catalyze the cis-trans isomerization of proline imidic peptide bonds in oligopeptides and accelerate the folding of proteins. This protein has been shown to possess PPIase activity and, similar to other family members, can bind to the immunosuppressant cyclosporin A.
10596	
10597	
10598	
10599	MC2R encodes one member of the five-member G-protein associated melanocortin receptor family. Melanocortins (melanocyte-stimulating hormones and adrenocorticotropic hormone) are peptides derived from pro-opiomelanocortin (POMC). MC2R is selectively activated by adrenocorticotropic hormone, whereas the other four melanocortin receptors recognize a variety of melanocortin ligands. Mutations in MC2R can result in familial glucocorticoid deficiency.
10600	
10601	The protein encoded by this gene is a type III glycoprotein that is located primarily in limiting membranes of lysosomes and endosomes. Studies of the similar protein in mice and rat suggested that this protein may participate in membrane transportation and the reorganization of endosomal/lysosomal compartment. Deficiency of the similar protein in mice was reported to impair cell membrane transport processes and cause pelvic junction obstruction, deafness, and peripheral neuropathy.
10602	
10603	
10604	
10605	
10606	
10607	
10608	DIRAS1 belongs to a distinct branch of the functionally diverse Ras (see HRAS; MIM 190020) superfamily of monomeric GTPases.[supplied by OMIM]
10609	The protein encoded by this gene is a calcium-dependent glycoprotein that is a member of the desmocollin subfamily of the cadherin superfamily. These desmosomal family members, along with the desmogleins, are found primarily in epithelial cells where they constitute the adhesive proteins of the desmosome cell-cell junction and are required for cell adhesion and desmosome formation. The desmosomal family members are arranged in two clusters on chromosome 18, occupying less than 650 kb combined. Alternative splicing results in two transcript variants encoding distinct isoforms.
10610	
10611	
10612	
10613	
10614	
10615	
10616	
10617	
10618	
10619	
10620	
10621	This gene is a classical cadherin from the cadherin superfamily. The encoded protein is a calcium dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. The protein functions during gastrulation and is required for establishment of left-right asymmetry. At certain central nervous system synapses, presynaptic to postsynaptic adhesion is mediated at least in part by this gene product.
10622	This gene is a type II classical cadherin from the cadherin superfamily and one of three cadherin 7-like genes located in a cluster on chromosome 18. The encoded membrane protein is a calcium dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. Type II (atypical) cadherins are defined based on their lack of a HAV cell adhesion recognition sequence specific to type I cadherins. Since disturbance of intracellular adhesion is a prerequisite for invasion and metastasis of tumor cells, cadherins are considered prime candidates for tumor suppressor genes.
10623	
10624	This gene encodes a heavily glycosylated protein which is a ligand for the NKG2D type II receptor. Binding of the ligand activates the cytolytic response of natural killer (NK) cells, CD8 alphabeta T cells, and gammadelta T cells which express the receptor. This protein is stress-induced and is similar to MHC class I molecules; however, it does not associate with beta-2-microglobulin or bind peptides.
10625	
10626	
10627	
10628	
10629	
10630	
10631	Follicular lymphoma variant translocation 1 is a secreted protein which is weakly expressed in hematopoietic tissue.  FVT1 shows a high rate of transcription in some T-cell malignancies and in phytohemagglutinin-stimulated lymphocytes.  The proximity of FVT1 to BCL2 suggests that it may participate in the tumoral process
10632	
10633	
10634	
10635	
10636	
10637	
10638	This gene encodes the D1 subtype of the dopamine receptor.The D1 subtype is the most abundant dopamine receptor in the central nervous system. This G-protein coupled receptor stimulates adenylyl cyclase and activates cyclic AMP-dependent protein kinases. D1 receptors regulate neuronal growth and development, mediate some behavioral responses, and modulate dopamine receptor D2-mediated events. Alternate transcription initiation sites result in two transcript variants of this gene.
10639	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the ALD subfamily, which is involved in peroxisomal import of fatty acids and/or fatty acyl-CoAs in the organelle. All known peroxisomal ABC transporters are half transporters which require a partner half transporter molecule to form a functional homodimeric or heterodimeric transporter. The function of this peroxisomal membrane protein is unknown; however this protein is speculated to function as a dimerization partner of ABCD1 and/or other peroxisomal ABC transporters. Mutations in this gene have been observed in patients with adrenoleukodystrophy, a severe demyelinating disease. This gene has been identified as a candidate for a modifier gene, accounting for the extreme variation among adrenoleukodystrophy phenotypes. This gene is also a candidate for a complement group of Zellweger syndrome, a genetically heterogeneous disorder of peroxisomal biogenesis.
10640	
10641	
10642	
10643	The protein encoded by this gene is a plasma glycoprotein of unknown function. The protein shows sequence similarity to the variable regions of some immunoglobulin supergene family member proteins.
10644	
10645	
10646	
10647	
10648	The specific function of this gene has yet to be determined; however, it has been speculated to play a role in transcription regulation and/or chromatin and histone metabolism.  It is considered the primary candidate gene in some haploinsufficiency syndromes such as DiGeorge syndrome, and insufficient production of the gene may disrupt normal embryonic development.  This gene is mapped to 22q11.21, centromeric to COMT.
10649	This gene encodes a tumor antigen recognized by autologous cytolytic lymphocytes (CTL).
10650	
10651	
10652	
10653	
10654	
10655	The enzyme mevalonate pyrophosphate decarboxylase catalyzes the conversion of mevalonate pyrophosphate into isopentenyl pyrophosphate in one of the early steps in cholesterol biosynthesis. It decarboxylates and dehydrates its substrate while hydrolyzing ATP.
10656	
10657	
10658	The protein encoded by this gene is a member of the ELAVL protein family. This encoded protein contains 3 RNA-binding domains and binds cis-acting AU-rich elements. It destabilizes mRNAs and thereby regulates gene expression.
10659	The protein encoded by this gene has a long and a short form, generated by use of alternative translational start codons. The long form is expressed in steroidogenic tissues such as testis, where it converts cholesteryl esters to free cholesterol for steroid hormone production. The short form is expressed in adipose tissue, among others, where it hydrolyzes stored triglycerides to free fatty acids.
10660	
10661	
10662	
10663	
10664	
10665	
10666	
10667	
10668	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H4 family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element.
10669	Organic ion transporters, such as SLC22A15, transport various medically and physiologically important compounds, including pharmaceuticals, toxins, hormones, neurotransmitters, and cellular metabolites. These transporters are also referred to as amphiphilic solute facilitators (ASFs).[supplied by OMIM]
10670	
10671	The protein encoded by this gene is an evolutionarily conserved protein similar to yeast SSF1 as well as to the gene product of the Drosophila gene peter pan (ppan). SSF1 is known to be involved in the second step of mRNA splicing. Both SSF1 and ppan are essential for cell growth and proliferation. This gene was found to cotranscript with P2RY11/P2Y(11), an immediate downstream gene on the chromosome that encodes a ATP receptor. The chimeric transcripts of this gene and P2RY11 were found to be ubiquitously present and regulated during granulocytic differentiation. Exogenous expression of this gene was reported to reduce the anchorage-independent growth of some tumor cells.
10672	
10673	
10674	
10675	
10676	
10677	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. Expansion of a polyalanine tract in the encoded protein can cause hand-foot-uterus syndrome, also known as hand-foot-genital syndrome.
10678	
10679	
10680	
10681	The protein encoded by this gene belongs to the ligand-gated ionic channel family. It is an integral membrane protein and plays an important role in inhibiting neurotransmission by binding to the benzodiazepine receptor and opening an integral chloride channel. This gene is clustered with three other family members on chromosome 4.
10682	This gene is a type I subclass member of the Reg gene family. The Reg gene family is a multigene family grouped into four subclasses, types I, II, III and IV based on the primary structures of the encoded proteins. This gene encodes a protein secreted by the exocrine pancreas that is highly similar to the REG1A protein. The related REG1A protein is associated with islet cell regeneration and diabetogenesis, and may be involved in pancreatic lithogenesis. Reg family members REG1A, REGL, PAP and this gene are tandomly clustered on chromosome 2p12 and may have arisen from the same ancestral gene by gene duplication.
10683	
10684	
10685	
10686	
10687	The protein encoded by this proto-oncogene is a member of the Dbl family of guanine nucleotide exchange factors (GEF) for the Rho family of GTP binding proteins. The protein is important in hematopoiesis, playing a role in T-cell and B-cell development and activation. This particular GEF has been identified as the specific binding partner of Nef proteins from HIV-1. Coexpression and binding of these partners initiates profound morphological changes, cytoskeletal rearrangements and the JNK/SAPK signaling cascade, leading to increased levels of viral transcription and replication.
10688	The protein encoded by this gene is a component of the exocyst complex, a multiple protein complex essential for targeting exocytic vesicles to specific docking sites on the plasma membrane. Though best characterized in yeast, the component proteins and functions of exocyst complex have been demonstrated to be highly conserved in higher eukaryotes. At least eight components of the exocyst complex, including this protein, are found to interact with the v-ral simian leukemia viral oncogene homolog A (RALA), a Ras-related small GTPase involved in controlling actin cytoskeletal remodeling and vesicle transport. The protein complex containing the similar protein in rat was reported to be involved in synaptic vesicle docking and fusion at the presynaptic terminal.
10689	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit functions as guanine nucleotide exchange factor. It is reported that this subunit interacts with HIV-1 Tat, and thus it represses the translation of host-cell, but not HIV-1, mRNAs. Several alternatively spliced transcript variants have been found for this gene, however, the full length nature of only two variants has been determined.
10690	
10691	
10692	
10693	Nerve growth factor receptor contains an extracellular domain containing four 40-amino acid repeats with 6 cysteine residues at conserved positions followed by a serine/threonine-rich region, a single transmembrane domain, and a 155-amino acid cytoplasmic domain. The cysteine-rich region contains the nerve growth factor binding domain.
10694	The autosomal dominant cerebellar ataxias (ADCA) are a heterogeneous group of neurodegenerative disorders characterized by progressive degeneration of the cerebellum, brain stem and spinal cord. Clinically, ADCA has been divided into three groups: ADCA types I-III. ADCAI is genetically heterogeneous, with five genetic loci, designated spinocerebellar ataxia (SCA) 1, 2, 3, 4 and 6, being assigned to five different chromosomes. ADCAII, which always presents with retinal degeneration (SCA7), and ADCAIII often referred to as the `pure@#$%& cerebellar syndrome (SCA5), are most likely homogeneous disorders. Several SCA genes have been cloned and shown to contain CAG repeats in their coding regions. ADCA is caused by the expansion of the CAG repeats, producing an elongated polyglutamine tract in the corresponding protein. The expanded repeats are variable in size and unstable, usually increasing in size when transmitted to successive generations. The function of the ataxins is not known. This locus has been mapped to chromosome 12, and it has been determined that the diseased allele contains 36-52 CAG repeats, compared to 22-23 in the normal allele. A potential transcript variant, missing an internal coding exon, has been described; however, its full-length nature is not certain.
10695	
10696	
10697	The globular WW domain is composed of 38 to 40 semiconserved amino acids shared by proteins of diverse functions including structural, regulatory, and signaling proteins. The domain is involved in mediating protein-protein interactions through the binding of polyproline ligands. This gene encodes a WW domain binding protein, which binds to the WW domain of Yes kinase-associated protein by its PY motifs. The function of this protein has not been determined.
10698	
10699	
10700	
10701	
10702	Relaxins are known endocrine and autocrine/paracrine hormones, belonging to the insulin gene superfamily. In the human there are three non-allelic relaxin genes, RLN1, RLN2 and RLN3. RLN1 and RLN2 share high sequence homology. Relaxin is produced by the ovary, and targets the mammalian reproductive system to ripen the cervix, elongate the pubic symphysis and inhibit uterine contraction. It may have additional roles in enhancing sperm motility, regulating blood pressure, controlling heart rate and releasing oxytocin and vasopressin. The protein encoded by this gene is a member of the relaxin family. The active form of the encoded protein consists of an A chain and a B chain but their cleavage sites are not definitely described yet. It may play a role in neuropeptide signaling processes.
10703	
10704	
10705	
10706	The protein encoded by this gene is a member of the gelsolin/villin family of actin regulatory proteins. This protein has structural similarity to villin. It binds actin and may play a role in the development of neuronal cells that form ganglia.
10707	
10708	
10709	This gene encodes a member of the nucleoporin family that shares 87% sequence identity with rat nucleoporin p58. The protein is localized to the nuclear rim and is a component of the nuclear pore complex (NPC). All molecules entering or leaving the nucleus either diffuse through or are actively transported by the NPC. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
10710	
10711	This gene encodes an enzyme that hydrolyzes terminal, non-reducing alpha-D-mannose residues in alpha-D-mannosides. Its activity is necessary for the catabolism of N-linked carbohydrates released during glycoprotein turnover and it is member of family 38 of glycosyl hydrolases. The full length protein is processed in two steps. First, a 49 aa leader sequence is cleaved off and the remainer of the protein is processed into 3 peptides of 70 kDa, 42 kDa (D) and 13/15 kDa (E). Next, the 70 kDa peptide is further processed into three peptides (A, B and C). The A, B and C peptides are disulfide-linked. Defects in this gene have been shown to be the cause of lysosomal alpha-mannosidosis (AM), a lysosomal storage disease characterized by the accumulation of unbranched oligosaccharide chains.
10712	
10713	DDX40 is a member of the DExH/D box family of ATP-dependent RNA helicases. RNA helicases catalyze the unwinding of double-stranded RNA and play a role in RNA metabolism, including pre-mRNA splicing, ribosome biogenesis, and organellar gene expression.[supplied by OMIM]
10714	
10715	The protein encoded by this gene is a member of the SWI/SNF family of proteins and is similar to the brahma protein of Drosophila. Members of this family have helicase and ATPase activities and are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI, which is required for transcriptional activation of genes normally repressed by chromatin. In addition, this protein can bind BRCA1, as well as regulate the expression of the tumorigenic protein CD44. Alternatively spliced transcripts have been found for this gene, but their full-length natures have not been determined.
10716	
10717	
10718	
10719	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptors can interact with various TRAF family proteins, through which this receptor induces the activation of NF-kappa B and MAPK8/JNK. This receptor and its ligand are important regulators of the interaction between T cells and dendritic cells. This receptor is also an essential mediator for osteoclast and lymph node development. Studies of the mouse counterpart suggest that this receptor directly mediates the osteoprotegerin ligand (OPGL)-induced osteoclastogenesis in osteoclast precursor cells.
10720	The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which is associated with cellular proliferation and differentiation. Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. This gene product inactivates SAPK/JNK and p38, is expressed predominantly in the adult brain, heart, and skeletal muscle, is localized in the cytoplasm, and is induced by nerve growth factor and insulin. An intronless pseudogene for DUSP8 is present on chromosome 10q11.2.
10721	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. Its encoded enzyme is thought to be involved in the proteolysis of intercellular cohesive structures preceding desquamation, which is the shedding of the outermost layer of the epidermis. Alternative splicing of this gene results in two transcript variants encoding the same protein.
10722	
10723	
10724	
10725	SPI1 (MIM 165170) and SPIB are members of a subfamily of ETS (see ETS1; MIM 164720) transcription factors. ETS proteins share a conserved ETS domain that mediates specific DNA binding. SPIB and SPI1 bind to a purine-rich sequence, the PU box (5-prime-GAGGAA-3-prime).[supplied by OMIM]
10726	
10727	
10728	
10729	
10730	
10731	
10732	The HRMT1L2 gene encodes a protein arginine methyltransferase that functions as a histone methyltransferase specific for H4 (see MIM 602822).[supplied by OMIM]
10733	
10734	
10735	
10736	This gene encodes the second human homologue of the bacterial RuvB gene. Bacterial RuvB protein is a DNA helicase essential for homologous recombination and DNA double-strand break repair. Functional analysis showed that this gene product has both ATPase and DNA helicase activities. This gene is physically linked to the CGB/LHB gene cluster on chromosome 19q13.3, and is very close (55 nt) to the LHB gene, in the opposite orientation.
10737	LIG1 encodes DNA ligase I, with functions in DNA replication and the base excision repair process.  Mutations in LIG1 that lead to DNA ligase I deficiency result in immunodeficiency and increased sensitivity to DNA-damaging agents.
10738	
10739	Proteins of the large Rab GTPase family (see RAB1A; MIM 179508) have regulatory roles in the formation, targeting, and fusion of intracellular transport vesicles. RAB11FIP1 is one of many proteins that interact with and regulate Rab GTPases (Hales et al., 2001).[supplied by OMIM]
10740	
10741	
10742	
10743	
10744	
10745	
10746	
10747	
10748	The human glucocorticoid receptor DNA binding factor, which associates with the promoter region of the glucocorticoid receptor gene (hGR gene), is a repressor of glucocorticoid receptor transcription. The amino acid sequence deduced from the cDNA sequences show the presence of three sequence motifs characteristic of a zinc finger and one motif suggestive of a leucine zipper in which 1 cysteine is found instead of all leucines. The GRLF1 enhances the homologous down-regulation of wild-type hGR gene expression. Biochemical analysis suggests that GRLF1 interaction is sequence specific and that transcriptional efficacy of GRLF1 is regulated through its interaction with specific sequence motif. The level of expression is regulated by glucocorticoids. Two transcript variants encoding different isoforms have been found for this gene.
10749	
10750	
10751	
10752	
10753	
10754	
10755	This gene is a member of the tropomyosin family of highly conserved, widely distributed actin-binding proteins involved in the contractile system of striated and smooth muscles and the cytoskeleton of non-muscle cells. Tropomyosin is composed of two alpha-helical chains arranged as a coiled-coil. It is polymerized end to end along the two grooves of actin filaments and provides stability to the filaments. The encoded protein is one type of alpha helical chain that forms the predominant tropomyosin of striated muscle, where it also functions in association with the troponin complex to regulate the calcium-dependent interaction of actin and myosin during muscle contraction. In smooth muscle and non-muscle cells, alternatively spliced transcript variants encoding a range of isoforms have been described. Mutations in this gene are associated with type 3 familial hypertrophic cardiomyopathy.
10756	
10757	
10758	
10759	
10760	The intracellular fatty acid-binding proteins (FABPs) belong to a multigene family with nearly twenty identified members. FABPs are divided into at least three distinct types, namely the hepatic-, intestinal- and cardiac-type. They form 14-15 kDa proteins and are thought to participate in the uptake, intracellular metabolism and/or transport of long-chain fatty acids. They may also be responsible in the modulation of cell growth and proliferation. Intestinal fatty acid-binding protein 2 gene contains four exons and is an abundant cytosolic protein in small intestine epithelial cells. This gene has a polymorphism at codon 54 that identified an alanine-encoding allele and a threonine-encoding allele. Thr-54 protein is associated with increased fat oxidation and insulin resistance.
10761	This gene encodes a TBP- (TATA box-binding protein) associated phosphoprotein that represses both basal and activated levels of transcription. The encoded protein is phosphorylated in vivo and this phosphorylation affects its interaction with TBP. This protein contains a histone fold motif at the amino terminus, a TBP-binding domain, and a glutamine- and alanine-rich region. The binding of DR1 repressor complexes to TBP-promoter complexes may establish a mechanism in which an altered DNA conformation, together with the formation of higher order complexes, inhibits the assembly of the preinitiation complex and controls the rate of RNA polymerase II transcription.
10762	
10763	
10764	
10765	
10766	
10767	This gene encodes a member of the latrophilin subfamily of G-protein coupled receptors (GPCR). Latrophilins may function in both cell adhesion and signal transduction. In experiments with non-human species, endogenous proteolytic cleavage within a cysteine-rich GPS (G-protein-coupled-receptor proteolysis site) domain resulted in two subunits (a large extracellular N-terminal cell adhesion subunit and a subunit with substantial similarity to the secretin/calcitonin family of GPCRs) being non-covalently bound at the cell membrane. Latrophilin-1 has been shown to recruit the neurotoxin from black widow spider venom, alpha-latrotoxin, to the synapse plasma membrane.
10768	
10769	
10770	This gene belongs to the chemokine-like factor gene superfamily, a novel superfamily that links the chemokine gene family and the transmembrane 4 superfamily. This gene is one of several chemokine-like factor genes located in a cluster on chromosome 16. The protein encoded by this gene is widely expressed in many tissues, but its exact function is unknown. Alternatively spliced transcript variants encoding different isoforms have been identified.
10771	
10772	
10773	This gene encodes a protein containing two conserved tandem RNA recognition motifs. Similar proteins in other species function as RNA-binding proteins and play central roles in posttranscriptional gene regulation. Two transcript variants encoding distinct isoforms have been identified for this gene.
10774	
10775	
10776	This gene encodes a protein containing two conserved tandem RNA recognition motifs. Similar proteins in other species function as RNA-binding proteins and play central roles in posttranscriptional gene regulation. Two transcript variants encoding distinct isoforms have been identified for this gene.
10777	
10778	
10779	
10780	
10781	
10782	
10783	
10784	
10785	
10786	
10787	
10788	
10789	
10790	
10791	This gene encodes a protein that contains a MIT (Microtubule Interacting and Trafficking molecule) domain. This protein may be involved in endosomal trafficking, microtubule dynamics, or both functions. Mutations associated with this gene cause autosomal recessive spastic paraplegia 20 (Troyer syndrome).
10792	
10793	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, isk-related subfamily. This member is a type I membrane protein, and a beta subunit that assembles with a potassium channel alpha-subunit to modulate the gating kinetics and enhance stability of the multimeric complex. This gene is prominently expressed in the embryo and in adult uterus.
10794	
10795	
10796	This gene is a member of the DAZ gene family and is a candidate for the human Y-chromosomal azoospermia factor (AZF). Its expression is restricted to premeiotic germ cells, particularly in spermatogonia. It encodes an RNA-binding protein that is important for spermatogenesis. Four copies of this gene are found on chromosome Y within palindromic duplications; one pair of genes is part of the P2 palindrome and the second pair is part of the P1 palindrome. Each gene contains a 2.4 kb repeat including a 72-bp exon, called the DAZ repeat; the number of DAZ repeats is variable and there are several variations in the sequence of the DAZ repeat. Each copy of the gene also contains a 10.8 kb region that may be amplified; this region includes five exons that encode an RNA recognition motif (RRM) domain. This gene contains one copy of the 10.8 kb repeat. Alternative splicing results in multiple transcript variants encoding different isoforms.
10797	
10798	
10799	
10800	
10801	
10802	
10803	The protein encoded by this gene is involved in the synthesis of the unbranched glycosaminoglycan hyaluronan, or hyaluronic acid, which is a major constituent of the extracellular matrix. This gene is a member of the NODC/HAS gene family. Compared to the proteins encoded by other members of this gene family, this protein appears to be more of a regulator of hyaluronan synthesis. Two transcript variants encoding different isoforms have been found for this gene.
10804	
10805	
10806	
10807	
10808	Members of the trefoil family are characterized by having at least one copy of the trefoil motif, a 40-amino acid domain that contains three conserved disulfides. They are stable secretory proteins expressed in gastrointestinal mucosa. Their functions are not defined, but they may protect the mucosa from insults, stabilize the mucus layer and affect healing of the epithelium. This gene is expressed in goblet cells of the intestines and colon. This gene and two other related trefoil family member genes are found in a cluster on chromosome 21.
10809	
10810	
10811	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. Unlike the other members of the family, this encoded protein does not seem to have PPIase/rotamase activity. It may have a role in neurons associated with memory function.
10812	
10813	
10814	
10815	
10816	
10817	Costimulatory B7 molecules (e.g., B7-1, or CD80; MIM 112203) signal through CD28 (MIM 186760) family molecules such as CD28, CTLA4 (MIM 123890), and ICOS (MIM 604558).[supplied by OMIM]
10818	
10819	
10820	
10821	This gene encodes a member of the homer family of dendritic proteins. Members of this family regulate group 1 metabotrophic glutamate receptor function. The encoded protein may be involved in cell growth.
10822	This gene encodes a disintegrin and metalloproteinase with thrombospondin (ADAMTS) motifs-13, which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The enzyme encoded by this gene is the von Willebrand Factor (vWF)-cleaving protease, which is responsible for cleaving at the site of Tyr842-Met843 of the vWF molecule. A deficiency of this enzyme is associated with thrombotic thrombocytopenic purpura. Alternative splicing of this gene generates at least 4 transcript variants encoding different isoforms.
10823	
10824	
10825	
10826	
10827	This gene encodes a member of the neuropilin family of receptor proteins. The encoded transmembrane protein binds to SEMA3C protein {sema domain, immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin) 3C} and SEMA3F protein {sema domain, immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin) 3F}, and interacts with vascular endothelial growth factor (VEGF). This protein may play a role in cardiovascular development, axon guidance, and tumorigenesis. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
10828	
10829	
10830	
10831	The protein encoded by this gene is a G-protein-coupled receptor. It has been shown to function as a prostanoid DP receptor. The activity of this receptor is mainly mediated by G-S proteins that stimulate adenylate cyclase resulting in an elevation of intracellular cAMP and Ca2+.  Knockout studies in mice suggest that the ligand of this receptor,  prostaglandin D2 (PGD2), functions as a mast cell-derived mediator to trigger asthmatic responses.
10832	
10833	This gene encodes a transmembrane signaling adaptor that contains a YxxM motif in its cytoplasmic domain. The encoded protein may form part of the immune recognition receptor complex with the C-type lectin-like receptor NKG2D. As part of this receptor complex, this protein may activate phosphatidylinositol 3-kinase dependent signaling pathways through its intracytoplasmic YxxM motif. This receptor complex may have a role in cell survival and proliferation by activation of NK and T cell responses. Alternative splicing results in two transcript variants encoding different isoforms.
10834	
10835	
10836	
10837	
10838	
10839	
10840	The protein encoded by this gene binds to an anti-proliferative protein, B-cell translocation protein 1, which negatively regulates cell proliferation. Binding of the two proteins, which is driven by phosphorylation of the anti-proliferative protein, causes signaling events in cell division that lead to changes in cell proliferation associated with cell-cell contact. The protein has both mouse and yeast orthologs. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
10841	
10842	Palmitoylated membrane protein 2 is a member of a family of membrane-associated proteins termed MAGUKs (membrane-associated guanylate kinase homologs).  MAGUKs interact with the cytoskeleton and regulate cell proliferation, signaling pathways, and intracellular junctions.  Palmitoylated membrane protein 2 contains a conserved sequence, called the SH3 (src homology 3) motif, found in several other proteins that associate with the cytoskeleton and are suspected to play important roles in signal transduction.
10843	
10844	
10845	
10846	Junctional complexes between the plasma membrane and endoplasmic/sarcoplasmic reticulum are a common feature of all excitable cell types and mediate cross talk between cell surface and intracellular ion channels. The protein encoded by this gene is a component of junctional complexes and is composed of a C-terminal hydrophobic segment spanning the endoplasmic/sarcoplasmic reticulum membrane and a remaining cytoplasmic domain that shows specific affinity for the plasma membrane. This gene is a member of the junctophilin gene family. Alternative splicing has been observed at this locus and two variants encoding distinct isoforms are described.
10847	
10848	
10849	
10850	
10851	
10852	
10853	
10854	This gene encodes a 5@#$%& nucleotidase that localizes to the mitochondrial matrix. This enzyme dephosphorylates the 5@#$%&- and 2@#$%&(3@#$%&)-phosphates of uracil and thymine deoxyribonucleotides. The gene is located within the Smith-Magenis syndrome region on chromosome 17.
10855	In yeast, the Sec23-Sec24 complex is a component of coat protein II (COPII; see MIM 601924)-coated vesicles that mediate protein transport from the endoplasmic reticulum. SEC24A is 1 of several mammalian proteins that show structural and functional homology to yeast Sec24.[supplied by OMIM]
10856	Bone marrow stromal cells are involved in the growth and development of B-cells. The specific function of the protein encoded by the bone marrow stromal cell antigen 2 is undetermined; however, this protein may play a role in pre-B-cell growth and in rheumatoid arthritis.
10857	
10858	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. This full transporter has been detected predominantly in myelo-lymphatic tissues with the highest expression in peripheral leukocytes, thymus, spleen, and bone marrow. The function of this protein is not yet known; however, the expression pattern suggests a role in lipid homeostasis in cells of the immune system. Alternative splicing of this gene results in two transcript variants.
10859	
10860	
10861	Members of the claudin protein family, such as CLDN2, are expressed in an organ-specific manner and regulate the tissue-specific physiologic properties of tight junctions (Sakaguchi et al., 2002).[supplied by OMIM]
10862	
10863	
10864	
10865	This gene encodes a zinc finger protein that contains a jumonji domain.
10866	
10867	
10868	This gene encodes a protein containing an alpha/beta hydrolase fold, which is a catalytic domain found in a very wide range of enzymes. The function of this protein has not been determined. Alternative splicing of this gene results in two transcript variants encoding the same protein.
10869	
10870	
10871	The protein encoded by this gene is the type 2 enzyme of the ecto-nucleoside triphosphate diphosphohydrolase family (E-NTPDase). E-NTPDases are a family of ecto-nucleosidases that hydrolyze 5@#$%&-triphosphates. This ecto-ATPase is an integral membrane protein. Alternative splicing of this gene results in multiple transcript variants.
10872	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H4 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6.
10873	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes including embryonic development cell growth, morphogenesis, tissue repair, tumor growth and invasion. This gene was shown to be expressed in normal brain, particularly the cerebellum. The rat homolog is preferentially expressed in the brain and able to enhance the survival of midbrain dopaminergic neurons in vitro.
10874	This gene is a member of the protocadherin gamma gene cluster, one of three related clusters tandemly linked on chromosome five. These gene clusters have an immunoglobulin-like organization, suggesting that a novel mechanism may be involved in their regulation and expression. The gamma gene cluster includes 22 genes divided into 3 subfamilies. Subfamily A contains 12 genes, subfamily B contains 7 genes and 2 pseudogenes, and the more distantly related subfamily C contains 3 genes. The tandem array of 22 large, variable region exons are followed by a constant region, containing 3 exons shared by all genes in the cluster. Each variable region exon encodes the extracellular region, which includes 6 cadherin ectodomains and a transmembrane region. The constant region exons encode the common cytoplasmic region. These neural cadherin-like cell adhesion proteins most likely play a critical role in the establishment and function of specific cell-cell connections in the brain. Alternative splicing has been described for the gamma cluster genes.
10875	
10876	
10877	The protein encoded by this gene is a member of the kinesin family. This protein is highly similar to mouse heavy chain kinesin member 1A protein which is an anterograde motor protein that transports membranous organelles along axonal microtubules. It is thought that this protein may play a critical role in the development of axonal neuropathies resulting from impaired axonal transport. There are multiple polyadenylation sites found in this gene.
10878	
10879	
10880	
10881	
10882	
10883	Members of the F-box protein family, such as FBXO31, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
10884	
10885	
10886	
10887	
10888	
10889	
10890	
10891	This gene encodes an N-acetylgalactosaminyltransferase, which has 97% sequence identity to the mouse protein. This gene is deleted in Williams syndrome, a multisystem developmental disorder caused by the deletion of contiguous genes at 7q11.23.
10892	HBXAP is involved in transcription repression, transcription coactivation when associated with hepatitis B virus X protein (HBX), and chromatin remodeling and spacing when associated with SNF2H (MIM 603375).[supplied by OMIM]
10893	
10894	
10895	Xanthine dehydrogenase belongs to the group of molybdenum-containing hydroxylases involved in the oxidative metabolism of purines. The enzyme is a homodimer.  Xanthine dehydrogenase can be converted to xanthine oxidase by reversible sulfhydryl oxidation or by irreversible proteolytic modification.  Defects in xanthine dehydrogenase cause xanthinuria, may contribute to adult respiratory stress syndrome, and may potentiate influenza infection through an oxygen metabolite-dependent mechanism.
10896	
10897	The ADF (actin-depolymerizing factor)/cofilin family (see MIM 601442) is composed of stimulus-responsive mediators of actin dynamics. ADF/cofilin proteins are inactivated by kinases such as LIM domain kinase-1 (LIMK1; MIM 601329). The SSH family appears to play a role in actin dynamics by reactivating ADF/cofilin proteins in vivo (Niwa et al., 2002).[supplied by OMIM]
10898	The protein encoded by this gene acts as a transcriptional coactivator for steroid and nuclear hormone receptors. It is a member of the p160/steroid receptor coactivator (SRC) family and like other family members has histone acetyltransferase activity and contains a nuclear localization signal, as well as bHLH and PAS domains. The product of this gene binds nuclear receptors directly and stimulates the transcriptional activities in a hormone-dependent fashion. Alternatively spliced transcript variants encoding different isoforms have been identified.
10899	
10900	
10901	Homeobox-containing genes play critical roles in regulating tissue-specific gene expression essential for tissue differentiation, as well as determining the temporal and spatial patterns of development (Shiojima et al., 1995). It has been demonstrated that a Drosophila homeobox-containing gene called @#$%&tinman@#$%& is expressed in the developing dorsal vessel and in the equivalent of the vertebrate heart. Mutations in tinman result in loss of heart formation in the embryo, suggesting that tinman is essential for Drosophila heart formation. Furthermore, abundant expression of Csx, the presumptive mouse homolog of tinman, is observed only in the heart from the time of cardiac differentiation. CSX, the human homolog of murine Csx, has a homeodomain sequence identical to that of Csx and is expressed only in the heart, again suggesting that CSX plays an important role in human heart formation.[supplied by OMIM]
10902	The origin recognition complex (ORC) is a highly conserved six subunits protein complex essential for the initiation of the DNA replication in eukaryotic cells. Studies in yeast demonstrated that ORC binds specifically to origins of replication and serves as a platform for the assembly of additional initiation factors such as Cdc6 and Mcm proteins. The protein encoded by this gene is a subunit of the ORC complex. This protein forms a core complex with ORC3L, -4L, and -5L. It also interacts with CDC45L and MCM10, which are proteins known to be important for the initiation of DNA replication. This protein has been demonstrated to specifically associate with the origin of replication of Epstein-Barr virus in human cells, and is thought to be required for DNA replication from viral origin of replication.
10903	
10904	This gene encodes a type I integral membrane glycoprotein. The encoded protein acts as a receptor and binds to both soluble and immobilized hyaluronan. This protein may function in lymphatic hyaluronan transport and have a role in tumor metastasis.
10905	
10906	Members of the F-box protein family, such as FBXL17, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
10907	
10908	This gene encodes one of the proteins important for centrosomal function. This protein is important for positioning and anchoring the microtubules minus-ends in epithelial cells. Localization of this protein to the centrosome requires three leucine zippers in the central coiled-coil domain. Multiple alternatively spliced transcript variants that encode different isoforms have been reported.
10909	
10910	
10911	The protein encoded by this gene contains a RING finger motif. The mouse counterpart of this protein has been shown to interact with Rela, the p65 subunit of NF-kappaB (NFKB), and modulate NFKB-mediated transcription activity. The mouse protein also binds ubiquitin-conjugating enzymes (E2s) and is a substrate for E2-dependent ubiquitination.
10912	
10913	This gene encodes a protein that interacts with the gamma subunit of AP1 clathrin-adaptor complex. The AP1 complex is located at the trans-Golgi network and associates specific proteins with clathrin-coated vesicles. This encoded protein may act to connect the AP1 complex to other proteins. Alternatively spliced transcript variants that encode different isoforms have been described for this gene.
10914	
10915	
10916	
10917	
10918	The protein encoded by this gene contains a C3HC4 type RING finger, which is a motif known to be involved in protein-protein interactions. The specific function of this protein has not yet been determined.
10919	
10920	The protein encoded by this gene is a Kruppel-related zinc finger protein which is characterized by the presence of an N-terminal repressor domain, the Kruppel-associated box (KRAB). The KRAB domain is a potent repressor of transcription; thus this protein may function in transcription regulation. Two alternative transcripts encoding the same isoform have been described.
10921	
10922	
10923	
10924	
10925	This intronless gene is a member of the frizzled gene family. Members of this family encode seven-transmembrane domain proteins that are receptors for the Wingless type MMTV integration site family of signaling proteins. Most frizzled receptors are coupled to the beta-catenin canonical signaling pathway. This gene is highly expressed in two human cancer cell lines, indicating that it may play a role in several types of cancer. The crystal structure of the extracellular cysteine-rich domain of a similar mouse protein has been determined.
10926	The protein encoded by this gene, which localizes to the endoplasmic reticulum, catalyzes intramembrane proteolysis of some signal peptides after they have been cleaved from a preprotein. This activity is required to generate signal sequence-derived human lymphocyte antigen-E epitopes that are recognized by the immune system, and to process hepatitis C virus core protein. The encoded protein is an integral membrane protein with sequence motifs characteristic of the presenilin-type aspartic proteases. Multiple transcript variants encoding several different isoforms have been found for this gene.
10927	
10928	
10929	
10930	
10931	
10932	This gene is an ortholog of the C. elegans unc-76 gene, which is necessary for normal axonal bundling and elongation within axon bundles. Other orthologs include the rat gene that encodes zygin II, which can bind to synaptotagmin.
10933	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds the immunosuppressants FK506 and rapamycin. It interacts with several intracellular signal transduction proteins including type I TGF-beta receptor. It also interacts with multiple intracellular calcium release channels including the tetrameric skeletal muscle ryanodine receptor. In mouse, deletion of this homologous gene causes congenital heart disorder known as noncompaction of left ventricular myocardium. There is evidence of multiple alternatively spliced transcript variants for this gene, but the full length nature of some variants has not been determined.
10934	The protein encoded by this gene belongs to the calcium sensitive chloride conductance protein family. To date, all members of this gene family map to the same site on chromosome 1p31-p22 and share high degrees of homology in size, sequence and predicted structure, but differ significantly in their tissue distributions. Since this protein is expressed predominantly in trachea and lung, it is suggested to play a role in the complex pathogenesis of cystic fibrosis. It may also serve as adhesion molecule for lung metastatic cancer cells, mediating vascular arrest and colonization, and furthermore, it has been implicated to act as a tumor suppressor gene for breast cancer.
10935	
10936	
10937	
10938	The sarcoplasmic reticulum (SR) is an intracellular membrane compartment that controls intracellular calcium concentration and therefore plays a role in excitation-contraction coupling. In mouse skeletal muscle, Jp45 interacts with key proteins involved in excitation-contraction coupling at the SR (Anderson et al., 2003 [PubMed 12871958]).[supplied by OMIM]
10939	This gene encodes a high-affinity dickkopf homolog 1 (DKK1) transmembrane receptor that functionally cooperates with DKK1 to block wingless (WNT)/beta-catenin signaling. The encoded protein forms a ternary membrane complex with DKK1 and the WNT receptor lipoprotein receptor-related protein 6 (LRP6), and induces rapid endocytosis and removal of LRP6 from the plasma membrane. It contains extracellular kringle, WSC, and CUB domains. Alternatively spliced transcript variants encoding distinct isoforms have been observed for this gene.
10940	
10941	
10942	
10943	ACVR1C is a type I receptor for the TGFB (see MIM 190180) family of signaling molecules. Upon ligand binding, type I receptors phosphorylate cytoplasmic SMAD transcription factors, which then translocate to the nucleus and interact directly with DNA or in complex with other transcription factors (Bondestam et al., 2001).[supplied by OMIM]
10944	
10945	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
10946	
10947	
10948	Sahai (1983) demonstrated phosphate-activated glutaminase (EC 3.5.1.2) in human platelets. It is the major enzyme yielding glutamate from glutamine. Significance of the enzyme derives from its possible implication in behavior disturbances in which glutamate acts as a neurotransmitter (Prusiner, 1981). High heritability of platelet glutaminase was indicated by studies of Sahai and Vogel (1983) who found an intraclass correlation coefficient of 0.96 for monozygotic twins and 0.53 for dizygotic twins.[supplied by OMIM]
10949	
10950	The innate immune system recognizes microbial pathogens through Toll-like receptors (TLRs), which identify pathogen-associated molecular patterns. Different TLRs recognize different pathogen-associated molecular patterns and all TLRs have a Toll-interleukin 1 receptor (TIR) domain, which is responsible for signal transduction. The protein encoded by this gene is a TIR adaptor protein involved in the TLR4 signaling pathway of the immune system. It activates NF-kappa-B, MAPK1, MAPK3 and JNK, which then results in cytokine secretion and the inflammatory response. Alternative splicing of this gene results in several transcript variants; however, not all variants have been fully described.
10951	
10952	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein. In the prokaryotic ribosome, the comparable protein is thought to play an essential role in organizing the 3@#$%& domain of the 16 S rRNA in the vicinity of the P- and A-sites. Pseudogenes corresponding to this gene are found on chromosomes 8p and 12p.
10953	The protein encoded by this gene is expressed only in Langerhans cells which are immature dendritic cells of the epidermis and mucosa. It is localized in the Birbeck granules, organelles present in the cytoplasm of Langerhans cells and consisting of superimposed and zippered membranes. It is a C-type lectin with mannose binding specificity, and it has been proposed that mannose binding by this protein leads to internalization of antigen into Birbeck granules and providing access to a nonclassical antigen-processing pathway.
10954	CBL proteins, such as CBLC, are phosphorylated upon activation of a variety of receptors that signal via protein tyrosine kinases. Through interactions with proteins containing SRC (MIM 190090) homology-2 (SH2) and SH3 domains, CBL proteins modulate downstream cell signaling (Keane et al., 1999).[supplied by OMIM]
10955	
10956	
10957	The protein encoded by this gene contains a fork head DNA binding domain. This protein can bind to the purine-rich motifs of the HIV long terminal repeat (LTR), and to the similar purine-rich motif in the interleukin 2 (IL2) promoter. It may be involved in the regulation of viral and cellular promoter elements. Three alternatively spliced transcript variants encoding distinct isoforms have been described.
10958	
10959	This gene encodes a serine protease. The protein has been localized in the endoplasmic reticulum and interacts with an alternatively spliced form of mitogen-activated protein kinase 14. The protein has also been localized to the mitochondria with release to the cytosol following apoptotic stimulus. The protein is thought to induce apoptosis by binding the apoptosis inhibitory protein baculoviral IAP repeat-containing 4. Nuclear localization of this protein has also been observed. Alternate splicing of this gene results in two transcript variants encoding different isoforms. Additional transcript variants have been described, but their full-length sequences have not been determined.
10960	Members of the @#$%&frizzled@#$%& gene family encode 7-transmembrane domain proteins that are receptors for Wnt signaling proteins.  The FZD5 protein is believed to be the receptor for the Wnt5A ligand.
10961	
10962	
10963	
10964	The protein encoded by this gene is a member of the PP2C family of Ser/Thr protein phosphatases. PP2C family members are known to be negative regulators of cell stress response pathways. This phosphatase has been shown to dephosphorylate cyclin-dependent kinases (CDKs), and thus may be involved in cell cycle control. Overexpression of this phosphatase is reported to cause cell-growth arrest or cell death. Three alternatively spliced transcript variants encoding distinct isoforms have been described.
10965	
10966	
10967	
10968	
10969	This gene encodes a ubiquitin ligase that is specific for receptor-regulated SMAD proteins in the bone morphogenetic protein (BMP) pathway. A similar protein in Xenopus is involved in embryonic pattern formation. Alternative splicing results in multiple transcript variants encoding different isoforms. An additional transcript variant has been identified, but its full length sequence has not been determined.
10970	
10971	
10972	
10973	Synaptobrevins/VAMPs, syntaxins, and the 25-kD synaptosomal-associated protein are the main components of a protein complex involved in the docking and/or fusion of vesicles and cell membranes. The VAMP5 gene is a member of the vesicle-associated membrane protein (VAMP)/synaptobrevin family and the SNARE superfamily. This VAMP family member may participate in vesicle trafficking events that are associated with myogenesis.
10974	
10975	
10976	
10977	This gene belongs to the homeobox family, members of which encode a highly conserved family of transcription factors that play important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, consisting of 9 to 11 genes arranged in tandem, and located on different chromosomes. This gene is one of several HOXC genes located in a cluster on chromosome 12. Three genes, HOXC5, HOXC4 and HOXC6, are co-transcribed from a primary transcript. Subsequent processing results in gene-specific transcripts, which sometimes share the 5@#$%& non-coding exon. The product of this gene may contribute to the breast cell phenotype through co-operative interactions with other HOX-derived proteins and repression of their target genes. Two alternatively spliced transcripts encoding different isoforms have been characterized to date.
10978	
10979	This gene encodes a protein that contains four domains present in other developmental proteins: a PWWP domain, an HMG box, a SET domain, and a PHD-type zinc finger. It is expressed ubiquitously in early development. Wolf-Hirschhorn syndrome (WHS) is a malformation syndrome associated with a hemizygous deletion of the distal short arm of chromosome 4. This gene maps to the 165 kb WHS critical region and has also been involved in the chromosomal translocation t(4;14)(p16.3;q32.3) in multiple myelomas. This gene contains 25 exons. Alternative splicing of this gene generates nine transcript variants, which encode four different isoforms.
10980	The protein encoded by this gene is a member of the interleukin 1 cytokine family. This cytokine was shown to specifically inhibit the activation of NF-kappaB induced by interleukin 1 family, member 6 (IL1F6). This gene and eight other interleukin 1 family genes form a cytokine gene cluster on chromosome 2. Two alternatively spliced transcript variants encoding the same protein have been reported.
10981	
10982	
10983	
10984	
10985	
10986	
10987	
10988	
10989	
10990	
10991	Synaptotagmins, such as SYT6, share a common domain structure that includes a transmembrane domain and a cytoplasmic region composed of 2 C2 domains. Some synaptotagmins are involved in synaptic membrane fusion, while others have a more general function in endocytosis. For further information on synaptotagmins, see MIM 185605.[supplied by OMIM]
10992	
10993	
10994	This gene belongs to the TIMP gene family. The proteins encoded by this gene family are natural inhibitors of the matrix metalloproteinases, a group of peptidases involved in degradation of the extracellular matrix. The expression of this gene is induced in response to mitogenic stimulation and subject to cell cycle regulation. Mutations in this gene cause the autosomal dominant disorder Sorsby@#$%&s fundus dystrophy.
10995	
10996	
10997	
10998	The protein encoded by this gene is a type I transmembrane protein that belongs to the hemopoietin receptor superfamily. This protein binds to interleukine 12 (IL12) with a low affinity, and is thought to be a part of IL12 receptor complex. This protein forms a disulfide-linked oligomer, which is required for its IL12 binding activity. The coexpression of this and IL12RB2 proteins was shown to lead to the formation of high-affinity IL12 binding sites and reconstitution of IL12 dependent signaling. The lack of expression of this gene was found to result in the immunodeficiency of patients with severe mycobacterial and Salmonella infections. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
10999	
11000	
11001	
11002	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
11003	
11004	
11005	
11006	
11007	
11008	The protein encoded by this gene belongs to the ferlin family and is a skeletal muscle protein found associated with the sarcolemma. It is involved in muscle contraction and contains C2 domains that play a role in calcium-mediated membrane fusion events, suggesting that it may be involved in membrane regeneration and repair. In addition, the protein encoded by this gene binds caveolin-3, a skeletal muscle membrane protein which is important in the formation of caveolae. Specific mutations in this gene have been shown to cause autosomal recessive limb girdle muscular dystrophy type 2B (LGMD2B) as well as Miyoshi myopathy.
11009	
11010	The protein encoded by this gene is a pyruvate kinase that catalyzes the production of phosphoenolpyruvate from pyruvate and ATP. This protein has been shown to interact with thyroid hormone, and thus may mediate cellular metabolic effects induced by thyroid hormones. This protein has been found to bind Opa protein, a bacterial outer membrane protein involved in gonococcal adherence to and invasion of human cells, suggesting a role of this protein in bacterial pathogenesis. Three alternatively spliced transcript variants encoding two distinct isoforms have been reported.
11011	Actins are highly conserved proteins that are involved in various types of cell motility, and maintenance of the cytoskeleton.  In vertebrates, three main groups of actin isoforms, alpha, beta and gamma have been identified.  The alpha actins are found in muscle tissues and are a major constituent of the contractile apparatus.  The beta and gamma actins co-exist in most cell types as components of the cytoskeleton, and as mediators of internal cell motility.  Actin, gamma 2, encoded by this gene, is a smooth muscle actin found in enteric tissues.
11012	
11013	
11014	This gene encodes a kinase expressed exclusively in the testis that is thought to play a role in either germ cell differentiation or mature sperm function.
11015	
11016	This gene encodes a G-protein beta-subunit-like polypeptide which is a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein contains 6 WD repeats and is highly expressed in the heart. The gene maps to the region on chromosome 22q11, which is deleted in DiGeorge syndrome, trisomic in derivative 22 syndrome and tetrasomic in cat-eye syndrome. Therefore, this gene may contribute to the etiology of those disorders.
11017	
11018	
11019	
11020	
11021	
11022	Methylenetetrahydrofolate reductase (EC 1.5.1.20) catalyzes the conversion of 5,10-methylenetetrahydrofolate to 5-methyltetrahydrofolate, a cosubstrate for homocysteine remethylation to methionine.[supplied by OMIM]
11023	Methylenetetrahydrofolate reductase (EC 1.5.1.20) catalyzes the conversion of 5,10-methylenetetrahydrofolate to 5-methyltetrahydrofolate, a cosubstrate for homocysteine remethylation to methionine.[supplied by OMIM]
11024	
11025	
11026	
11027	MUT encodes the mitochondrial enzymne methylmalonyl Coenzyme A mutase. In humans, MUT is a vitamin B12-dependent enzyme which catalyzes the isomerization of methylmalonyl-CoA to succinyl-CoA, while in other species this enzyme may have different functions. Mutations in MUT may lead to various types of methylmalonic aciduria.
11028	In mammals, the Y chromosome directs the development of the testess and plays an important role in spermatogenesis. A high percentage of infertile men have deletions that map to regions of the Y chromosome. The DAZ (deleted in azoospermia) gene cluster maps to the AZFc region and is deleted in many azoospermic and severely oligospermic men. It is thought that the Y chromosomal DAZ gene cluster arose from the transposition, amplification, and pruning of the ancestral autosomal gene DAZL. This gene encodes a RNA-binding protein with two RNP motifs that was originally identified by its interaction with the infertility factors DAZ and DAZL.
11029	This gene encodes a member of the trypsin family of serine proteases. The enzyme is expressed in the airways in a developmentally regulated manner. The gene is part of a cluster of serine protease genes on chromosome 16.
11030	
11031	
11032	
11033	
11034	
11035	
11036	
11037	
11038	
11039	The de novo synthesis of pyrimidine nucleotides is required for mammalian cells to proliferate. This gene encodes a trifunctional protein which is associated with the enzymatic activities of the first 3 enzymes in the 6-step pathway of pyrimidine biosynthesis: carbamoylphosphate synthetase (CPS II), aspartate transcarbamoylase, and dihydroorotase. This protein is regulated by the mitogen-activated protein kinase (MAPK) cascade, which indicates a direct link between activation of the MAPK cascade and de novo biosynthesis of pyrimidine nucleotides.
11040	
11041	
11042	
11043	
11044	The protein encoded by this gene is a RNA binding protein containing two KH domains and one RCG box, which is similar to FMRP and FXR1. It associates with polyribosomes, predominantly with 60S large ribosomal subunits. This encoded protein may self-associate or interact with FMRP and FXR1. It may have a role in the development of fragile X mental retardation syndrome.
11045	
11046	During the initiation of protein biosynthesis, initiation factor-2 (IF-2) promotes the binding of the initiator tRNA to the small subunit of the ribosome in a GTP-dependent manner. Prokaryotic IF-2 is a single polypeptide, while eukaryotic cytoplasmic IF-2 (eIF-2) is a trimeric protein. Bovine liver mitochondria contain IF-2(mt), an 85-kD monomeric protein that is equivalent to prokaryotic IF-2. The predicted 727-amino acid human protein contains a 29-amino acid presequence. Human IF-2(mt) shares 32 to 38% amino acid sequence identity with yeast IF-2(mt) and several prokaryotic IF-2s, with the greatest degree of conservation in the G domains of the proteins. Two transcript variants encoding the same protein have been found for this gene.
11047	The protein encoded by this gene mediates leucine-rich nuclear export signal (NES)-dependent protein transport. Exportin 1 specifically inhibits the nuclear export of Rev and U snRNAs. It is involved in the control of several cellular processes by controlling the localization of cyclin B, MPAK, and MAPKAP kinase 2. This protein also regulates NFAT and AP-1.
11048	
11049	Iduronate-2-sulfatase is required for the lysosomal degradation of heparan sulfate and dermatan sulfate. Mutations in this X-chromosome gene that result in enzymatic deficiency lead to the sex-linked Mucopolysaccharidosis Type II, also known as Hunter Syndrome. Iduronate-2-sulfatase has a strong sequence homology with human arylsulfatases A, B, and C, and human glucosamine-6-sulfatase. A splice variant of this gene has been described.
11050	This gene encodes a multidomain protein containing an N-terminal alpha-helical region with a coiled-coil motif, followed by a pleckstrin homology (PH) domain, an Arf-GAP domain, an ankyrin homology region, a proline-rich region, and a C-terminal Src homology 3 (SH3) domain. The protein localizes in the Golgi apparatus and at the plasma membrane, where it colocalizes with protein tyrosine kinase 2-beta (PYK2). The encoded protein forms a stable complex with PYK2 in vivo. This interaction appears to be mediated by binding of its SH3 domain to the C-terminal proline-rich domain of PYK2. The encoded protein is tyrosine phosphorylated by activated PYK2. In vitro it shows strong GTPase-activating protein (GAP) activity towards the small GTPases ADP-ribosylation factor (ARF) 1 and ARF5 and weak activity towards ARF6. The encoded protein is believed to function as an ARF GAP that controls ARF-mediated vesicle budding when recruited to Golgi membranes. In addition, it functions as a substrate and downstream target for PYK2 and SRC, a pathway that may be involved in the regulation of vesicular transport.
11051	The protein encoded by this gene is a nucleolar, zinc finger protein that preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids, rather than DNA alone. Mutational studies indicate that the zinc finger domains are not only essential for dsRNA binding, but are also required for its nucleolar localization. The encoded protein may be involved in cell growth and survival.
11052	The protein encoded by this gene is a nucleolar, zinc finger protein that preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids, rather than DNA alone. Mutational studies indicate that the zinc finger domains are not only essential for dsRNA binding, but are also required for its nucleolar localization. The encoded protein may be involved in cell growth and survival.
11053	
11054	
11055	This gene encodes a protein that associates with the centromere-kinetochore complex. The protein is a component of the nuclear matrix during the G2 phase of interphase. In late G2 the protein associates with the kinetochore and maintains this association through early anaphase. It localizes to the spindle midzone and the intracellular bridge in late anaphase and telophase, respectively, and is thought to be subsequently degraded. The localization of this protein suggests that it may play a role in chromosome segregation during mitotis. It is thought to form either a homodimer or heterodimer. Autoantibodies against this protein have been found in patients with cancer or graft versus host disease.
11056	
11057	
11058	This gene encodes one of the three alpha chains of type IX collagen, the major collagen component of hyaline cartilage. Type IX collagen, a heterotrimeric molecule, is usually found in tissues containing type II collagen, a fibrillar collagen. Mutations in this gene are associated with multiple epiphyseal dysplasia.
11059	
11060	
11061	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes.  They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis.  Laminins are composed of 3 non identical chains: laminin alpha, beta and gamma (formerly A, B1, and B2, respectively) and they form a cruciform structure consisting of 3 short arms, each formed by a different chain, and a long arm composed of all 3 chains.  Each laminin chain is a multidomain protein encoded by a distinct gene.  Several isoforms of each chain have been described.  Different alpha, beta and gamma chain isomers combine to give rise to different heterotrimeric laminin isoforms which are designated by Arabic numerals in the order of their discovery, i.e. alpha1beta1gamma1 heterotrimer is laminin 1.  The biological functions of the different chains and trimer molecules are largely unknown, but some of the chains have been shown to differ with respect to their tissue distribution, presumably reflecting diverse functions in vivo.  This gene encodes the alpha chain isoform laminin, alpha 4.  The domain structure of alpha 4 is similar to that of alpha 3, both of which resemble truncated versions of alpha 1 and alpha 2, in that approximately 1,200 residues at the N-terminus (domains IV, V and VI) have been lost. Laminin, alpha 4 contains the C-terminal G domain which distinguishes all alpha chains from the beta and gamma chains.  The RNA analysis from adult and fetal tissues revealed developmental regulation of expression, however, the exact function of laminin, alpha 4 is not known.  Tissue-specific utilization of alternative polyA-signal has been described in literature.  Also, alternative splicing involving the first intron in the 5@#$%& UTR, and laminin alpha 4 like isoforms have been noted, however, the full-length nature of these products is not known.
11062	
11063	
11064	
11065	
11066	The muscarinic cholinergic receptors belong to a larger family of G protein-coupled receptors. The functional diversity of these receptors is defined by the binding of acetylcholine and includes cellular responses such as adenylate cyclase inhibition, phosphoinositide degeneration, and potassium channel mediation. Muscarinic receptors influence many effects of acetylcholine in the central and peripheral nervous system. The muscarinic cholinergic receptor 3 controls smooth muscle contraction and its stimulation causes secretion of glandular tissue.
11067	Cytochrome c oxidase (COX), the terminal enzyme of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. It is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may be involved in the regulation and assembly of the complex. This nuclear gene encodes polypeptide 2 (heart/muscle isoform) of subunit VIa, and polypeptide 2 is present only in striated muscles. Polypeptide 1 (liver isoform) of subunit VIa is encoded by a different gene, and is found in all non-muscle tissues. These two polypeptides share 66% amino acid sequence identity.
11068	Nectins (e.g., PVRL1; MIM 600644) are immunoglobulin-like adhesion molecules that interact with afadin (AF6; MIM 159559). Afadin is an actin filament-binding protein that connects nectins to the actin cytoskeleton. The nectin-afadin system organizes adherens junctions cooperatively with the cadherin (see MIM 192090)-catenin (see MIM 116805) system in epithelial cells.[supplied by OMIM]
11069	
11070	
11071	The protein encoded by this gene is an integrin ligand. It plays an important role in mediating angiogenesis and may be important in vessel wall remodeling and development. It also influences endothelial cell behavior.
11072	
11073	
11074	
11075	
11076	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. The enzyme encoded by this gene degrades type IV and V collagens. Studies in rhesus monkeys suggest that the enzyme is involved in IL-8-induced mobilization of hematopoietic progenitor cells from bone marrow, and murine studies suggest a role in tumor-associated tissue remodeling.
11077	
11078	This gene is a member of the cadherin superfamily. The gene product is composed of five cadherin repeat domains and a cytoplasmic tail similar to the highly conserved cytoplasmic region of classical cadherins. Expressed predominantly in the brain, this putative calcium-dependent cell adhesion protein may play an important role in morphogenesis and tissue formation in neural and non-neural cells during development and maintenance of the brain and neuroendocrine organs.
11079	
11080	This gene is a member of the cadherin superfamily. The gene product is composed of five cadherin repeat domains and a cytoplasmic tail similar to the highly conserved cytoplasmic region of classical cadherins. Expressed predominantly in the brain, this putative calcium-dependent cell adhesion protein may play an important role in morphogenesis and tissue formation in neural and non-neural cells during development and maintenance of the brain and neuroendocrine organs.
11081	
11082	This gene was identified by homology to other species. Its encoded protein is approximately 78-88% identical to a predicted sheep protein of unknown function. The protein is also approximately 25% identical to the Drosophila protein suppressor of hairy wing, which is a leucine zipper protein that represses the function of transcriptional enhancers of the gypsy retrotransposon.
11083	
11084	
11085	
11086	
11087	
11088	Many proteins are tethered to the extracellular face of eukaryotic plasma membranes by a glycosylphosphatidylinositol (GPI) anchor. The GPI-anchor is a glycolipid found on many blood cells. The protein encoded by this gene is a GPI degrading enzyme. Glycosylphosphatidylinositol specific phospholipase D1 hydrolyzes the inositol phosphate linkage in proteins anchored by phosphatidylinositol glycans, thereby releasing the attached protein from the plasma membrane.
11089	
11090	
11091	
11092	
11093	This gene encodes a member of the WAP-type four-disulfide core (WFDC) domain family. Most WFDC proteins contain only one WFDC domain, and this encoded protein contains two WFDC domains. The WFDC domain, or WAP signature motif, contains eight cysteines forming four disulfide bonds at the core of the protein, and functions as a protease inhibitor. Most WFDC gene members are localized to chromosome 20q12-q13 in two clusters: centromeric and telomeric. This gene belongs to the centromeric cluster.
11094	The protein encoded by this gene is a lysosomal cysteine proteinase and member of the peptidase C1 family. It exhibits both carboxy-monopeptidase and carboxy-dipeptidase activities. The encoded protein has also been known as cathepsin X and cathepsin P. This gene is expressed ubiquitously in cancer cell lines and primary tumors and, like other members of this family, may be involved in tumorigenesis. At least two transcript variants of this gene have been found, but the full-length nature of only one of them has been determined.
11095	Purines are necessary for many cellular processes, including DNA replication, transcription, and energy metabolism. Ten enzymatic steps are required to synthesize inosine monophosphate (IMP) in the de novo pathway of purine biosynthesis. The enzyme encoded by this gene catalyzes the fourth step of IMP biosynthesis.
11096	The protein encoded by this gene is a preproprotein that is proteolytically processed to form the secreted opioid peptides beta-neoendorphin, dynorphin, leu-enkephalin, rimorphin, and leumorphin. These peptides are ligands for the kappa-type of opioid receptor. Dynorphin is involved in modulating responses to several psychoactive substances, including cocaine.
11097	
11098	The protein encoded by this gene is a preproprotein that is cleaved to form a secreted 10 aa peptide hormone. The secreted decapeptide regulates reproduction in females by stimulating the secretion of both luteinizing- and follicle-stimulating hormones. Three transcript variants that encode unique proproteins but the same peptide hormone have been found for this gene.
11099	The protein encoded by this gene binds to a specific DNA sequence upstream of the upstream regulatory sequence of the core promoter and second enhancer of human hepatitis B virus (HBV). Through this binding, it represses HBV core promoter activity, transcription of HBV genes, and production of HBV virions. The protein shows sequence similarities with other DNA-binding structural proteins such as gallin, oncoproteins of the MYC family, and the oncoprotein MOS. It may also be involved in protecting cells from apoptosis and in pre-mRNA splicing. Several transcript variants encoding different isoforms have been described for this gene, but the full-length nature of only six of them has been determined.
11100	
11101	
11102	
11103	
11104	
11105	
11106	
11107	
11108	
11109	
11110	
11111	The protein encoded by this gene is a member of the LAR protein-tyrosine phosphatase-interacting protein (liprin) family. Liprins interact with members of LAR family of transmembrane protein tyrosine phosphatases, which are known to be important for axon guidance and mammary gland development. It has been proposed that liprins are multivalent proteins that form complex structures and act as scaffolds for the recruitment and anchoring of LAR family of tyrosine phosphatases. This protein is most closely related to PPFIA1, a liprin family member known to interact with the protein phosphatase LAR. The expression of this gene is found to be downregulated by androgens in a prostate cancer cell line.
11112	
11113	
11114	The protein encoded by this gene associates with stem loop IV of U2 small nuclear ribonucleoprotein (U2 snRNP) in the presence of snRNP-A@#$%&. The encoded protein may play a role in pre-mRNA splicing. Autoantibodies from patients with systemic lupus erythematosus frequently recognize epitopes on the encoded protein. Two transcript variants encoding the same protein have been found for this gene.
11115	
11116	
11117	This gene encodes a member of the bicoid sub-family of homeodomain-containing transcription factors. The encoded protein acts as a transcription factor and may play a role in brain and sensory organ development. A similar protein in mice is required for proper brain and sensory organ development and can cause epilepsy.
11118	
11119	The protein encoded by this gene is an integral membrane protein and G protein-coupled receptor. The encoded protein is similar in sequence to another G protein-coupled receptor (GPR7), and it is structurally similar to opioid and somatostatin receptors. This gene is intronless and is expressed primarily in the frontal cortex of the brain.
11120	
11121	
11122	
11123	
11124	
11125	The protein encoded by this gene, which localizes to the endoplasmic reticulum, catalyzes intramembrane proteolysis of some signal peptides after they have been cleaved from a preprotein. This activity is required to generate signal sequence-derived human lymphocyte antigen-E epitopes that are recognized by the immune system, and to process hepatitis C virus core protein. The encoded protein is an integral membrane protein with sequence motifs characteristic of the presenilin-type aspartic proteases. Multiple transcript variants encoding several different isoforms have been found for this gene.
11126	The protein encoded by this gene belongs to the NALP protein family. NALPs are cytoplasmic proteins that may form large, signal-induced multiprotein complexes called inflammasomes. Inflammasomes activate the pro-inflammatory caspases, caspase-1 and caspase-5.
11127	
11128	The protein encoded by this gene was first identified by its binding to the APC protein which is often mutated in familial and sporadic forms of colorectal cancer. This protein localizes to microtubules, especially the growing ends, in interphase cells. During mitosis, the protein is associated with the centrosomes and spindle microtubules. The protein also associates with components of the dynactin complex and the intermediate chain of cytoplasmic dynein. Because of these associations, it is thought that this protein is involved in the regulation of microtubule structures and chromosome stability. This gene is a member of the RP/EB family.
11129	
11130	
11131	
11132	
11133	
11134	The glycine-N-acyltransferase protein conjugates glycine with acyl-CoA substrates in the mitochondria. The protein is thought to be important in the detoxification of endogenous and xenobiotic acyl-CoA@#$%&s. Two transcript variants encoding different isoforms have been found for this gene.
11135	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This protein contains a C-terminal PTP domain and an N-terminal domain homologous to the band 4.1 superfamily of cytoskeletal-associated proteins. P97, a cell cycle regulator involved in a variety of membrane related functions, has been shown to be a substrate of this PTP. This PTP was also found to interact with, and be regulated by adaptor protein 14-3-3 beta.
11136	HIV-1, the causative agent of acquired immunodeficiency syndrome (AIDS), contains an RNA genome that produces a chromosomally integrated DNA during the replicative cycle. Activation of HIV-1 gene expression by the transactivator Tat is dependent on an RNA regulatory element (TAR) located downstream of the transcription initiation site. The protein encoded by this gene is a transcriptional repressor that binds to chromosomally integrated TAR DNA and represses HIV-1 transcription. In addition, this protein regulates alternate splicing of the CFTR gene. A similar pseudogene is present on chromosome 20.
11137	
11138	
11139	
11140	
11141	
11142	Methylenetetrahydrofolate reductase (EC 1.5.1.20) catalyzes the conversion of 5,10-methylenetetrahydrofolate to 5-methyltetrahydrofolate, a cosubstrate for homocysteine remethylation to methionine.[supplied by OMIM]
11143	
11144	Transcriptional regulatory proteins containing tandemly repeated zinc finger domains are thought to be involved in both normal and abnormal cellular proliferation and differentiation. ZNF161 is a C2H2-type zinc finger protein. See MIM 603971 for general information on zinc finger proteins.[supplied by OMIM]
11145	
11146	
11147	
11148	
11149	This gene encodes a disintegrin and metalloproteinase with thrombospondin motifs-5 (ADAMTS5), which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The enzyme encoded by this gene contains 2 C-terminal TS motifs and functions as aggrecanase to cleave aggrecan, a major proteoglycan of cartilage. This gene is composed of 8 exons.
11150	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene has sequence homology to members of the histone deacetylase family. This gene is orthologous to the Xenopus and mouse MITR genes. The MITR protein lacks the histone deacetylase catalytic domain. It represses MEF2 activity through recruitment of multicomponent corepressor complexes that include CtBP and HDACs. This encoded protein may play a role in hematopoiesis. Multiple alternatively spliced transcripts have been described for this gene but the full-length nature of some of them has not been determined.
11151	
11152	
11153	The protein encoded by this gene is a receptor for interleukine 7 (IL7). The function of this receptor requires the interleukin 2 receptor, gamma chain (IL2RG), which is a common gamma chain shared by the receptors of various cytokines, including interleukine 2, 4, 7, 9, and 15. This protein has been shown to play a critical role in the V(D)J recombination during lymphocyte development. This protein is also found to control the accessibility of the TCR gamma locus by STAT5 and histone acetylation. Knockout studies in mice suggested that blocking apoptosis is an essential function of this protein during differentiation and activation of T lymphocytes. The functional defects in this protein may be associated with the pathogenesis of the severe combined immunodeficiency (SCID).
11154	Carbonyl reductase 3 catalyzes the reduction of a large number of biologically and pharmacologically active carbonyl compounds to their corresponding alcohols.  The enzyme is classified as a monomeric NADPH-dependent oxidoreductase.  CBR3 contains three exons spanning 11.2 kilobases and is closely linked to another carbonyl reductase gene - CBR1.
11155	The protein encoded by this gene belongs to a series of ectoenzymes that are involved in hydrolysis of extracellular nucleotides. These ectoenzymes possess ATPase and ATP pyrophosphatase activities and are type II transmembrane proteins. Expression of the related rat mRNA has been found in a subset of immature glial cells and in the alimentary tract. The corresponding rat protein has been detected in the pancreas, small intestine, colon, and liver. The human mRNA is expressed in glioma cells, prostate, and uterus. Expression of the human protein has been detected in uterus, basophils, and mast cells.
11156	The region of chromosome 21 between genes CBR and ERG (CBR-ERG region), which spans 2.5 Mb on 21q22.2, has been defined by analysis of patients with partial trisomy 21. It contributes significantly to the pathogenesis of many characteristics of Down syndrome, including morphological features, hypotonia, and mental retardation. This gene is found in this region and multiple transcripts may exist. It is mainly expressed in the placenta.
11157	
11158	
11159	
11160	
11161	
11162	
11163	
11164	
11165	
11166	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. The specific function of this protein has not been elucidated. Alternative splice variants have been described but their full length nature has not been determined.
11167	This gene encodes an astacin-like zinc-dependent metalloprotease and is a subfamily member of the metzincin family. Unlike other family members, a similar protein in mice does not cleave procollagen C-propeptides or chordin.
11168	This gene encodes a protein that may be involved in the degradation of G proteins via the ubiquitin-dependent proteasome pathway. The encoded protein binds to members of subfamily A of the regulator of the G-protein signaling (RGS) family through an N-terminal leucine-rich region. This protein also has a central RING finger-like domain and E3 ubiquitin ligase activity. This protein is highly conserved from flies to humans. Defects in this gene may cause the autosomal recessive, infantile malignant form of osteopetrosis.
11169	
11170	
11171	
11172	HLA-E belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-E binds a restricted subset of peptides derived from the leader peptides of other class I molecules. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail.
11173	
11174	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region containing five fibronectin type III repeats, a single transmembrane region, and a single intracytoplasmic catalytic domain, and thus represents a receptor-type PTP. This PTP is present in all hematopoietic lineages, and was shown to negatively regulate T cell receptor signaling possibly through interfering with the phosphorylation of Phospholipase C Gamma 1 (PLCG1) and Linker for Activation of T Cells (LAT). This PTP was also found to dephosphorylate PDGF beta receptor, and may be involved in UV-induced signal transduction.
11175	The protein encoded by this gene belongs to the claudin family of tight junction associated proteins and is a major component of central nervous system myelin that is necessary for normal CNS function. There is growing evidence that the protein determines the permeability between layers of myelin sheaths via focal adhesion and, with its expression highly regulated during development, may play an important role in cellular proliferation and migration. In addition, the protein is a candidate autoantigen in the development of autoimmune demyelinating disease.
11176	
11177	
11178	
11179	
11180	This gene encodes a protein which interacts with clathrin and adaptor-related protein complex 2, alpha 1 subunit. The protein is found in a brain-derived clathrin-coated vesicle fraction and localizes to the peri-Golgi region and the cell periphery. The protein is thought to be involved in clathrin-mediated endocytosis. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
11181	
11182	
11183	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. This gene product shares 98% amino acid sequence identity with the mouse ortholog. DiGeorge syndrome (DGS)/velocardiofacial syndrome (VCFS), a common congenital disorder characterized by neural-crest-related developmental defects, has been associated with deletions of chromosome 22q11.2, where this gene has been mapped. Studies using mouse models of DiGeorge syndrome suggest a major role for this gene in the molecular etiology of DGS/VCFS. Several alternatively spliced transcript variants encoding different isoforms have been described for this gene.
11184	
11185	
11186	
11187	v-CRK avian sarcoma virus CT10-homolog-like contains one SH2 domain and two SH3 domains.  CRKL has been shown to activate the RAS and JUN kinase signaling pathways and transform fibroblasts in a RAS-dependent fashion.  It is a substrate of the BCR-ABL tyrosine kinase and plays a role in fibroblast transformation by BCR-ABL.  In addition, CRKL has oncogenic potential.
11188	This gene encodes a potential zinc-binding protein and is located in the region associated with DiGeorge syndrome on chromosome 22. The encoded protein may play a role in the regulation of cellular morphology and may function during craniofacial development.
11189	
11190	
11191	The protein encoded by this gene was identified as an interacting corepressor of BCL6, a POZ/zinc finger transcription repressor that is required for germinal center formation and may influence apoptosis. This protein selectively interacts with the POZ domain of BCL6, but not with eight other POZ proteins. Specific class I and II histone deacetylases (HDACs) have been shown to interact with this protein, which suggests a possible link between the two classes of HDACs. At least two alternatively spliced transcript variants, which encode different isoforms, have been reported for this gene.
11192	
11193	
11194	The N-acetylglutamate synthase gene encodes a mitochondrial enzyme that catalyzes the formation of N-acetylglutamate (NAG) from glutamate and acetyl coenzyme-A. NAG is a cofactor of carbamyl phosphate synthetase I (CPSI), the first enzyme of the urea cycle in mammals. This gene may regulate ureagenesis by altering NAG availability and, thereby, CPSI activity. Deficiencies in N-acetylglutamate synthase have been associated with hyperammonemia.
11195	Glycogen phosphorylase (EC 2.4.1.1) catalyzes and regulates the breakdown of glycogen to glucose-1-phosphate.[supplied by OMIM]
11196	
11197	
11198	
11199	
11200	
11201	This gene encodes a protein which interacts with clathrin and adaptor-related protein complex 2, alpha 1 subunit. The protein is found in a brain-derived clathrin-coated vesicle fraction and localizes to the peri-Golgi region and the cell periphery. The protein is thought to be involved in clathrin-mediated endocytosis. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
11202	
11203	
11204	This gene is a member of the TIS11 family of early response genes. Family members are induced by various agonists such as the phorbol ester TPA and the polypeptide mitogen EGF. The gene is well conserved across species and has a promoter that contains motifs seen in other early-response genes. The encoded protein contains a distinguishing putative zinc finger domain with a repeating cys-his motif. This putative nuclear transcription factor most likely functions in regulating the response to growth factors.
11205	
11206	This gene encodes a member of the vitamin B12-binding protein family. This family of proteins, alternatively referred to as R binders, is expressed in various tissues and secretions. This plasma protein binds cobalamin and mediates the transport of cobalamin into cells. This protein and other mammalian cobalamin-binding proteins, such as transcobalamin I and gastric intrisic factor, may have evolved by duplication of a common ancestral gene.
11207	The function of this gene has not been determined.
11208	
11209	
11210	
11211	
11212	This gene is transcribed but does not appear to encode a protein. Two identical copies of this gene are present in a palindromic region (P4) on chromosome Y. This record represents the more telomeric copy.
11213	
11214	
11215	
11216	
11217	Members of the F-box protein family, such as FBXL16, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
11218	
11219	
11220	
11221	
11222	
11223	
11224	
11225	The protein encoded by this gene is a member of the signal-regulatory-protein (SIRP) family, and also belongs to the immunoglobulin superfamily. SIRP family members are receptor-type transmembrane glycoproteins known to be involved in the negative regulation of receptor tyrosine kinase-coupled signaling processes. This protein was found to interact with TYROBP/DAP12, a protein bearing immunoreceptor tyrosine-based activation motifs. This protein was also reported to participate in the recruitment of tyrosine kinase SYK.
11226	The degradation of L-threonine to glycine consists of a two-step biochemical pathway involving the enzymes L-threonine dehydrogenase and 2-amino-3-ketobutyrate coenzyme A ligase. L-Threonine is first converted into 2-amino-3-ketobutyrate by L-threonine dehydrogenase. This gene encodes the second enzyme in this pathway, which then catalyzes the reaction between 2-amino-3-ketobutyrate and coenzyme A to form glycine and acetyl-CoA. The encoded enzyme is considered a class II pyridoxal-phosphate-dependent aminotransferase.
11227	
11228	The protein encoded by this gene is a serine/threonine protein kinase and a member of the casein kinase I protein family, whose members have been implicated in the control of cytoplasmic and nuclear processes, including DNA replication and repair. The encoded protein is found in the cytoplasm as a monomer and can phosphorylate a variety of proteins, including itself. This protein has been shown to phosphorylate period, a circadian rhythm protein. Two transcript variants encoding the same protein have been found for this gene.
11229	
11230	Hepsin is a cell surface serine protease.[supplied by OMIM]
11231	
11232	The protein encoded by this gene may play a role in intracellular vesicle trafficking. It interacts with Syntaxin 13 which mediates intracellular membrane fusion. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
11233	
11234	
11235	EP300 encodes the adenovirus E1A-associated cellular p300 transcriptional co-activator protein. p300 is related by sequence to CBP (CREB-binding protein [CREB: cyclic-AMP responsive element binding protein]), and like CBP can stimulate transcription through activation of CREB. This EP300 activity is specifically inhibited by the adenovirus oncoprotein E1A. EP300 has also been identified as a co-activator of HIF1A (hypoxia-inducible factor 1 alpha), and thus plays a role in the stimulation of hypoxia-induced genes such as VEGF.
11236	
11237	
11238	
11239	Dynamins represent one of the subfamilies of GTP-binding proteins. These proteins share considerable sequence similarity over the N-terminal portion of the molecule, which contains the GTPase domain. Dynamins are associated with microtubules. They have been implicated in cell processes such as endocytosis and cell motility, and in alterations of the membrane that accompany certain activities such as bone resorption by osteoclasts. Dynamins bind many proteins that bind actin and other cytoskeletal proteins. Dynamins can also self-assemble, a process that stimulates GTPase activity. Four alternatively spliced transcripts encoding different proteins have been described. Additional alternatively spliced transcripts may exist, but their full-length nature has not been determined.
11240	
11241	NEPH2 is a member of the nephrin-like protein family, which includes NEPH1 (KIRREL; MIM 607428) and NEPH3 (MIM 607762). The cytoplasmic domains of these proteins interact with the C terminus of podocin (NPHS2; MIM 604766), and the genes are expressed in kidney podocytes, cells involved in ensuring size- and charge-selective ultrafiltration (Sellin et al., 2003).[supplied by OMIM]
11242	Members of the parvin family, including PARVB, are actin-binding proteins associated with focal contacts.[supplied by OMIM]
11243	The SMADIP1 gene (also known as SIP1) is a member of the delta-EF1 (TCF8; MIM 189909)/Zfh1 family of 2-handed zinc finger/homeodomain proteins. SMADIP1 interacts with receptor-mediated, activated full-length SMADs (see MIM 605568).[supplied by OMIM]
11244	
11245	Acrosin is the major proteinase present in the acrosome of mature spermatozoa. It is a typical serine proteinase with trypsin-like specificity. It is stored in the acrosome in its precursor form, proacrosin. The active enzyme functions in the lysis of the zona pellucida, thus facilitating penetration of the sperm through the innermost glycoprotein layers of the ovum. The mRNA for proacrosin is synthesized only in the postmeiotic stages of spermatogenesis. In humans proacrosin first appears in the haploid spermatids.
11246	Retinitis pigmentosa is an inherited progressive disease which is a major cause of blindness in western communities. It can be inherited as an autosomal dominant, autosomal recessive, or X-linked recessive disorder. In the autosomal dominant form,which comprises about 25% of total cases, approximately 30% of families have mutations in the gene encoding the rod photoreceptor-specific protein rhodopsin. This is the transmembrane protein which, when photoexcited, initiates the visual transduction cascade. Defects in this gene are also one of the causes of congenital stationary night blindness.
11247	Neuregulin 1 (NRG1) was originally identified as a 44-kD glycoprotein that interacts with the NEU/ERBB2 receptor tyrosine kinase to increase its phosphorylation on tyrosine residues. It is known that an extraordinary variety of different isoforms are produced from the NRG1 gene by alternative splicing. These isoforms include heregulins (HRGs), glial growth factors (GGFs) and sensory and motor neuron-derived factor (SMDF). They are tissue-specifically expressed and differ significantly in their structure. The HRG isoforms all contain immunoglobulin (Ig) and epidermal growth factor-like (EGF-like) domains. GGF and GGF2 isoforms contain a kringle-like sequence plus Ig and EGF-like domains; and the SMDF isoform shares only the EGF-like domain with other isoforms. The receptors for all NRG1 isoforms are the ERBB family of tyrosine kinase transmembrane receptors. Through interaction with ERBB receptors, NRG1 isoforms induce the growth and differentialtion of epithelial, neuronal, glial, and other types of cells.
11248	This gene is one of two human homologs of Drosophila bicaudal-D and a member of the Bicoid family. It has been implicated in dynein-mediated, minus end-directed motility along microtubules. It has also been reported to be a phosphorylation target of NIMA related kinase 8. Two alternative splice variants have been described.
11249	
11250	
11251	
11252	
11253	
11254	
11255	
11256	
11257	
11258	
11259	
11260	CA6 is one of several isozymes of carbonic anhydrase. CA6 is found only in salivary glands and saliva.  The protein may play a role in reversible hydratation of carbon dioxide though its function in saliva is unknown.
11261	
11262	The enzyme encoded by this gene catalyzes removal of mono-ADP-ribose from arginine residues of proteins in the ADP-ribosylation cycle. Unlike the rat and mouse enzymes, which require DTT for maximal activity, the human enzyme is DTT-independent.
11263	MAX dimerization protein belongs to a subfamily of MAX-interacting proteins. This protein competes with MYC for binding to MAX to form a sequence-specific DNA-binding complex, acts as a transcriptional repressor (while MYC appears to function as an activator) and is a candidate tumor suppressor.
11264	The product of this gene belongs to the actin-binding proteins ADF family. This family of proteins is responsible for enhancing the turnover rate of actin in vivo. This gene encodes the actin depolymerizing protein that severs actin filaments (F-actin) and binds to actin monomers (G-actin). Two transcript variants encoding distinct isoforms have been identified for this gene.
11265	This gene encodes a mitochondrial transcription termination factor. This protein participates in attenuating transcription from the mitochondrial genome; this attenuation allows higher levels of expression of 16S ribosomal RNA relative to the tRNA gene downstream. The product of this gene has three leucine zipper motifs bracketed by two basic domains that are all required for DNA binding. There is evidence that, for this protein, the zippers participate in intramolecular interactions that establish the three-dimensional structure required for DNA binding.
11266	
11267	This gene encodes a protein containing a potential clathrin-binding motif, consensus dileucine signals, and tyrosine-based sorting signals for targeting to vesicles of lysosomal lineage. The encoded protein may play a role in organelle biogenesis associated with melanosomes, platelet dense granules, and lysosomes. Mutations in this gene are associated with Hermansky-Pudlak syndrome type 3. Alternate splice variants exist, but their full length sequence has not been determined.
11268	
11269	
11270	This gene encodes a molecular chaperone that is a member of the chaperonin containing TCP1 complex (CCT), also known as the TCP1 ring complex (TRiC). This complex consists of two identical stacked rings, each containing eight different proteins. Unfolded polypeptides enter the central cavity of the complex and are folded in an ATP-dependent manner. The complex folds various proteins, including actin and tubulin. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
11271	
11272	This gene encodes a protein containing a potential clathrin-binding motif, consensus dileucine signals, and tyrosine-based sorting signals for targeting to vesicles of lysosomal lineage. The encoded protein may play a role in organelle biogenesis associated with melanosomes, platelet dense granules, and lysosomes. Mutations in this gene are associated with Hermansky-Pudlak syndrome type 3. Alternate splice variants exist, but their full length sequence has not been determined.
11273	
11274	
11275	
11276	This gene is a main control point for the regulation of gluconeogenesis. The cytosolic enzyme encoded by this gene, along with GTP, catalyzes the formation of phosphoenolpyruvate from oxaloacetate, with the release of carbon dioxide and GDP. The expression of this gene can be regulated by insulin, glucocorticoids, glucagon, cAMP, and diet. A mitochondrial isozyme of the encoded protein also has been characterized.
11277	
11278	
11279	
11280	
11281	
11282	HLA-A belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-A alleles have been described.
11283	
11284	
11285	
11286	
11287	
11288	
11289	
11290	Initiation of transcription by RNA polymerase II requires the assistance of TATA box-binding protein (TBP; MIM 600075) and TBP-associated factors, or TAFs (e.g., TAF2B; MIM 604912), in 2 distinct complexes, TFIID and B-TFIID. The TFIID complex is composed of TBP and more than 8 TAFs. However, the majority of TBP is present in the B-TFIID complex, which is composed of TBP and TAFII170, also called TAF172, and has DNA-dependent ATPase activity.[supplied by OMIM]
11291	This gene is a member of the BCL2/adenovirus E1B 19 kd-interacting protein (BNIP) family.  It interacts with the E1B 19 kDa protein which is responsible for the protection of virally-induced cell death, as well as E1B 19 kDa-like sequences of BCL2, also an apoptotic protector.  The protein encoded by this gene is a functional homolog of BNIP3, a proapoptotic protein.  This protein may function simultaneously with BNIP3 and may play a role in tumor suppression.
11292	
11293	This gene belongs to the chemokine-like factor gene superfamily, a novel superfamily that links the chemokine gene family and the transmembrane 4 superfamily. This gene is one of several chemokine-like factor genes located in a cluster on chromosome 3. This gene is widely expressed in many tissues, but the exact function of the encoded protein is unknown.
11294	This gene encodes a regulator of the CDC7-like 1 protein, a serine-threonine kinase which links cell cycle regulation to genome duplication. This protein localizes to the nucleus and its expression is cell cycle-regulated. Alternative splicing of this gene results in two transcript variants encoding different isoforms. Additional transcript variants have been described, but their full length sequences have not been determined.
11295	The product of this gene belongs to the Serine/Threonine protein kinase family, and to the Ca(2+)/calmodulin-dependent protein kinase subfamily. This protein plays a role in the calcium/calmodulin-dependent (CaM) kinase cascade by phosphorylating the downstream kinases CaMK1 and CaMK4. Seven transcript variants encoding six distinct isoforms have been identified for this gene. Additional splice variants have been described but their full-length nature has not been determined. The identified isoforms exhibit a distinct ability to  undergo autophosphorylation and to phosphorylate the downstream kinases.
11296	This gene encodes a member of the hepatoma-derived growth factor (HDGF) family. Two alternatively spliced transcript variants encoding slightly different isoforms have been found for this gene.
11297	
11298	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
11299	The protein encoded by this gene is a member of the E2F family of transcription factors. The E2F family plays a crucial role in the control of cell cycle and action of tumor suppressor proteins and is also a target of the transforming proteins of small DNA tumor viruses. The E2F proteins contain several evolutionally conserved domains found in most members of the family. These domains include a DNA binding domain, a dimerization domain which determines interaction with the differentiation regulated transcription factor proteins (DP), a transactivation domain enriched in acidic amino acids, and a tumor suppressor protein association domain which is embedded within the transactivation domain. This protein and another 2 members, E2F1 and E2F3, have an additional cyclin binding domain. This protein binds specifically to retinoblastoma protein pRB in a cell-cycle dependent manner, and it exhibits overall 46% amino acid identity to E2F1.
11300	
11301	
11302	
11303	
11304	
11305	
11306	
11307	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
11308	This gene encodes a cytokine distantly related to type I interferons and the IL-10 family. This gene, interleukin 28B (IL28B), and interleukin 29 (IL29) are three closely related cytokine genes that form a cytokine gene cluster on a chromosomal region mapped to 19q13. Expression of the cytokines encoded by the three genes can be induced by viral infection. All three cytokines have been shown to interact with a heterodimeric class II cytokine receptor that consists of interleukin 10 receptor, beta (IL10RB) and interleukin 28 receptor, alpha (IL28RA).
11309	
11310	
11311	Raf-1 is a MAP kinase kinase kinase (MAP3K) which functions downstream of the Ras family of membrane associated GTPases to which it binds directly. Once activated Raf-1 can phosphorylate to activate the dual specificity protein kinases MEK1 and MEK2 which in turn phosphorylate to activate the serine/threonine specific protein kinases ERK1 and ERK2. Activated ERKs are pleiotropic effectors of cell physiology and play an important role in the control of gene expression involved in the cell division cycle, apoptosis, cell differentiation and cell migration. [Contributed text]
11312	
11313	The protein encoded by this gene is a member of the PP2C family of Ser/Thr protein phosphatases. PP2C family members are known to be negative regulators of cell stress response pathways. This phosphatase dephosphorylates, and negatively regulates the activities of, MAP kinases and MAP kinase kinases. It has been shown to inhibit the activation of p38 and JNK kinase cascades induced by environmental stresses. This phosphatase can also dephosphorylate cyclin-dependent kinases, and thus may be involved in cell cycle control. Overexpression of this phosphatase is reported to activate the expression of the tumor suppressor gene TP53/p53, which leads to G2/M cell cycle arrest and apoptosis. Three alternatively spliced transcript variants encoding two distinct isoforms have been described.
11314	Ghrelin is an endogenous ligand for the growth hormone secretagogue receptor (GHSR; MIM 601898) and is involved in regulating GH release.[supplied by OMIM]
11315	Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide.  They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid.  They show extensive diversity in tissue distribution and in their subcellular localization.  CA IV is a glycosylphosphatidyl-inositol-anchored membrane isozyme expressed on the luminal surfaces of pulmonary (and certain other) capillaries and of proximal renal tubules.  Its exact function is not known, however, it may have a role in inherited renal abnormalities of bicarbonate transport.
11316	
11317	
11318	
11319	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptor contains an extracellular TRAIL-binding domain and a transmembrane domain, but no cytoplasmic death domain. This receptor is not capable of inducing apoptosis, and is thought to function as an antagonistic receptor that protects cells from TRAIL-induced apoptosis. This gene was found to be a p53-regulated DNA damage-inducible gene. The expression of this gene was detected in many normal tissues but not in most cancer cell lines, which may explain the specific sensitivity of cancer cells to the apoptosis-inducing activity of TRAIL.
11320	
11321	
11322	
11323	Ornithine decarboxylase catalyzes the conversion of ornithine to putrescine in the first and apparently rate-limiting step in polyamine biosynthesis. The ornithine decarboxylase antizymes play a role in the regulation of polyamine synthesis by binding to and inhibiting ornithine decarboxylase. Antizyme expression is auto-regulated by polyamine-enhanced translational frameshifting. The antizyme encoded by this gene inhibits ornithine decarboxylase and accelerates its degradation.
11324	Ornithine decarboxylase catalyzes the conversion of ornithine to putrescine in the first and apparently rate-limiting step in polyamine biosynthesis. The ornithine decarboxylase antizymes play a role in the regulation of polyamine synthesis by binding to and inhibiting ornithine decarboxylase. Antizyme expression is auto-regulated by polyamine-enhanced translational frameshifting. The antizyme encoded by this gene inhibits ornithine decarboxylase and accelerates its degradation.
11325	
11326	
11327	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily H. This member is a pore-forming (alpha) subunit. There are at least two alternatively spliced transcript variants derived from this gene and encoding distinct isoforms.
11328	
11329	
11330	
11331	
11332	
11333	
11334	
11335	
11336	
11337	
11338	
11339	
11340	
11341	Gamma-aminobutyric acid (GABA), the major inhibitory neurotransmitter in the brain, mediates neuronal inhibition by binding to GABA receptors. The type A GABA receptors are pentameric chloride channels assembled from among many genetic variants of GABA(A) subunits. This gene encodes the gamma 2 subunit of GABA(A) receptor. Mutations in this gene have been associated with epilepsy and febrile seizures. Alternative splicing of this gene results in transcript variants encoding different isoforms.
11342	The protein encoded by this gene is a neurotrophic factor, belonging to the GDNF family. Neurotrophic factors are important for the proper development and maintenance of the nervous system. These factors promote neuronal survival and can prevent the neuronal degeneration associated with injury, toxin exposure, or neurodegenerative disease. The encoded protein has amino acid similarity to its other family members, glial cell line-derived neurotrophic factor and neurturin. This gene product promotes the survival of ventral midbrain dopaminergic neurons in culture and prevents their degeneration after 6-hydroxydopamine treatment in vivo.
11343	
11344	
11345	
11346	
11347	
11348	
11349	
11350	
11351	
11352	
11353	
11354	
11355	
11356	
11357	The product of this gene is a component of the nuclear factor of activated T cells DNA-binding transcription complex. This complex consists of at least two components: a preexisting cytosolic component that translocates to the nucleus upon T cell receptor (TCR) stimulation, and an inducible nuclear component. Proteins belonging to this family of transcription factors play a central role in inducible gene transcription during immune response. The product of this gene is an inducible nuclear component. It functions as a major molecular target for the immunosuppressive drugs such as cyclosporin A. Five transcript variants encoding distinct isoforms have been identified for this gene. Different isoforms of this protein may regulate inducible expression of different cytokine genes.
11358	
11359	
11360	This gene encodes a member of the membrane-associated guanylate kinase (MAGUK) family. It heteromultimerizes with DLG2. With DLG2 it is recruited into the same NMDA receptor and potassium channel clusters. These two MAGUK proteins may interact at postsynaptic sites to form a multimeric scaffold for the clustering of receptors, ion channels, and associated signaling proteins.
11361	The gene product was initially identified as a 190-kD protein associated with the contactin-PTPRZ1 complex. The 1,381-amino acid protein, also designated p190 or CASPR for @#$%&contactin-associated protein,@#$%& includes an extracellular domain with several putative protein-protein interaction domains, a putative transmembrane domain, and a 74-amino acid cytoplasmic domain. Northern blot analysis showed that the gene is transcribed predominantly in brain as a transcript of 6.2 kb, with weak expression in several other tissues tested. The architecture of its extracellular domain is similar to that of neurexins, and this protein may be the signaling subunit of contactin, enabling recruitment and activation of intracellular signaling pathways in neurons.
11362	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins, and kininogens. This gene encodes a stefin that functions as a cysteine protease inhibitor, forming tight complexes with papain and the cathepsins B, H, and L. The protein is one of the precursor proteins of cornified cell envelope in keratinocytes and plays a role in epidermal development and maintenance. Stefins have been proposed as prognostic and diagnostic tools for cancer.
11363	This gene is a member of the Period family of genes and is expressed in a circadian pattern in the suprachiasmatic nucleus, the primary circadian pacemaker in the mammalian brain. Genes in this family encode components of the circadian rhythms of locomotor activity, metabolism, and behavior. Circadian expression in the suprachiasmatic nucleus continues in constant darkness, and a shift in the light/dark cycle evokes a proportional shift of gene expression in the suprachiasmatic nucleus. The specific function of this gene is not yet known.
11364	
11365	
11366	
11367	
11368	
11369	
11370	CBP and p300 are large nuclear proteins that bind to many sequence-specific factors involved in cell growth and/or differentiation, including c-jun and the adenoviral oncoprotein E1A. The protein encoded by this gene associates with p300/CBP. It has in vitro and in vivo binding activity with CBP and p300, and competes with E1A for binding sites in p300/CBP. It has histone acetyl transferase activity with core histones and nucleosome core particles, indicating that this protein plays a direct role in transcriptional regulation.
11371	The protein encoded by this intronless gene is a bZIP transcription factor which can bind as a homodimer to certain DNA regulatory regions. It can also form heterodimers with the related proteins CEBP-alpha, CEBP-delta, and CEBP-gamma. The encoded protein is important in the regulation of genes involved in immune and inflammatory responses and has been shown to bind to the IL-1 response element in the IL-6 gene, as well as to regulatory regions of several acute-phase and cytokine genes. In addition, the encoded protein can bind the promoter and upstream element and stimulate the expression of the collagen type I gene.
11372	
11373	
11374	
11375	
11376	This gene encodes the enzyme acylpeptide hydrolase, which catalyzes the hydrolysis of the terminal acetylated amino acid preferentially from small acetylated peptides. The acetyl amino acid formed by this hydrolase is further processed to acetate and a free amino acid by an aminoacylase. This gene is located within the same region of chromosome 3 (3p21) as the aminoacylase gene, and deletions at this locus are also associated with a decrease in aminoacylase activity. The acylpeptide hydrolase is a homotetrameric protein of 300 kDa with each subunit consisting of 732 amino acid residues. It can play an important role in destroying oxidatively damaged proteins in living cells. Deletions of this gene locus are found in various types of carcinomas, including small cell lung carcinoma and renal cell carcinoma.
11377	
11378	
11379	
11380	
11381	
11382	This gene is a member of the beta-1,3-galactosyltransferase (beta3GalT) gene family. This family encodes type II membrane-bound glycoproteins with diverse enzymatic functions using different donor substrates (UDP-galactose and UDP-N-acetylglucosamine) and different acceptor sugars (N-acetylglucosamine, galactose, N-acetylgalactosamine). The beta3GalT genes are distantly related to the Drosophila Brainiac gene and have the protein coding sequence contained in a single exon. The beta3GalT proteins also contain conserved sequences not found in the beta4GalT or alpha3GalT proteins. The carbohydrate chains synthesized by these enzymes are designated as type 1, whereas beta4GalT enzymes synthesize type 2 carbohydrate chains. The ratio of type 1:type 2 chains changes during embryogenesis. By sequence similarity, the beta3GalT genes fall into at least two groups: beta3GalT4 and 4 other beta3GalT genes (beta3GalT1-3, beta3GalT5). The encoded protein of this gene does not use N-acetylglucosamine as an acceptor sugar at all. Four transcript variants have been described which differ in both the 5@#$%& and 3@#$%& UTR sequences. All transcript variants encode an identical protein.
11383	Pleomorphic adenoma gene 1 encodes a zinc finger protein with 2 putative nuclear localization signals.  PLAG1, which is developmentally regulated, has been shown to be consistently rearranged in pleomorphic adenomas of the salivary glands.  PLAG1 is activated by the reciprocal chromosomal translocations involving 8q12 in a subset of salivary gland pleomorphic adenomas.
11384	This gene is a member of the protocadherin gamma gene cluster, one of three related clusters tandemly linked on chromosome five. These gene clusters have an immunoglobulin-like organization, suggesting that a novel mechanism may be involved in their regulation and expression. The gamma gene cluster includes 22 genes divided into 3 subfamilies. Subfamily A contains 12 genes, subfamily B contains 7 genes and 2 pseudogenes, and the more distantly related subfamily C contains 3 genes. The tandem array of 22 large, variable region exons are followed by a constant region, containing 3 exons shared by all genes in the cluster. Each variable region exon encodes the extracellular region, which includes 6 cadherin ectodomains and a transmembrane region. The constant region exons encode the common cytoplasmic region. These neural cadherin-like cell adhesion proteins most likely play a critical role in the establishment and function of specific cell-cell connections in the brain. Alternative splicing has been described for the gamma cluster genes.
11385	
11386	
11387	
11388	
11389	
11390	
11391	
11392	
11393	
11394	
11395	
11396	
11397	
11398	This gene encodes a receptor for the OX-2 membrane glycoprotein. Both the receptor and substrate are cell surface glycoproteins containing two immunoglobulin-like domains. This receptor is restricted to the surfaces of myeloid lineage cells and the receptor-substrate interaction may function as a myeloid downregulatory signal. Mouse studies of a related gene suggest that this interaction may control myeloid function in a tissue-specific manner. Alternative splicing of this gene results in multiple transcript variants.
11399	
11400	
11401	
11402	This gene encodes a member of the beta chemokine receptor family, which is predicted to be a seven transmembrane protein similar to G protein-coupled receptors. This protein is expressed by T cells and macrophages, and is known to be an important co-receptor for macrophage-tropic virus, including HIV, to enter host cells. Defective alleles of this gene have been associated with the HIV infection resistance. The ligands of this receptor include monocyte chemoattractant protein 2 (MCP-2), macrophage inflammatory protein 1 alpha (MIP-1 alpha), macrophage inflammatory protein 1 beta (MIP-1 beta) and regulated on activation normal T expressed and secreted protein (RANTES). Expression of this gene was also detected in a promyeloblastic cell line, suggesting that this protein may play a role in granulocyte lineage proliferation and differentiation. This gene is located at the chemokine receptor gene cluster region.
11403	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth, and invasion. It has been shown in vitro that this protein is able to induce neurite outgrowth in PC12 cells. Studies of the similar proteins in mouse and chick suggested that this protein is a pleiotropic growth factor that stimulates proliferation in a number of tissues, most notably the liver and small intestine. Knockout studies of the similar gene in mice implied the role of this protein in regulating proliferation and differentiation of midline cerebellar structures. Two alternatively spliced variants, which encode the identical isoform, are reported.
11404	The protein encoded by this gene is a member of the SEC22 family of vesicle trafficking proteins. It seems to complex with SNARE and it is thought to play a role in the ER-Golgi protein trafficking. This protein has strong similarity to Mus musculus and Cricetulus griseus proteins. There is evidence for use of multiple polyadenylation sites for the transcript.
11405	
11406	
11407	
11408	
11409	
11410	This gene is an estrogen-responsive gene that is an early response gene in the estrogen receptor-regulated pathway. It is thought to play an important role in hormone-responsive tissues and cancer. Three alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
11411	
11412	
11413	
11414	
11415	
11416	
11417	
11418	Prokineticins are secreted proteins that can promote angiogenesis and induce strong gastrointestinal smooth muscle contraction. The protein encoded by this gene is an integral membrane protein and G protein-coupled receptor for prokineticins. The encoded protein is similar in sequence to GPR73, another G protein-coupled receptor for prokineticins.
11419	
11420	
11421	
11422	
11423	This gene encodes a protein that binds to the C-terminus of adenovirus E1A proteins. This phosphoprotein is a transcriptional repressor and may play a role during cellular proliferation. This protein and the product of a second closely related gene, CTBP2, can dimerize. Both proteins can also interact with a polycomb group protein complex which participates in regulation of gene expression during development. Alternative splicing of transcripts from this gene results in multiple transcript variants.
11424	This gene encodes a protein that binds to the C-terminus of adenovirus E1A proteins. This phosphoprotein is a transcriptional repressor and may play a role during cellular proliferation. This protein and the product of a second closely related gene, CTBP2, can dimerize. Both proteins can also interact with a polycomb group protein complex which participates in regulation of gene expression during development. Alternative splicing of transcripts from this gene results in multiple transcript variants.
11425	
11426	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. This gene is the human ortholog of mouse Tbx19/Tpit gene. Studies in mouse show that Tpit protein is present only in the two pituitary pro-opiomelanocortin (POMC)-expressing lineages, the corticotrophs and melanotrophs. Mutations in the human ortholog were found in patients with isolated deficiency of pituitary POMC-derived ACTH, suggesting an essential role for this gene in differentiation of the pituitary POMC lineage.
11427	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein also contains a regulator of G protein signaling (RGS) domain. Regulator of G protein signaling family members are regulatory molecules that act as GTPase activating proteins for G alpha subunits of heterotrimeric G proteins. The RGS domain of this protein interacts with G alpha(s), accelerates its GTP hydrolysis, and attenuates G alpha(s)-mediated signaling. Overexpression of this protein delayes lysosomal degradation of the epidermal growth factor receptor. Because of its bifunctional role, this protein may link heterotrimeric G protein signaling and vesicular trafficking.
11428	Histamine is a ubiquitous messenger molecule released from mast cells, enterochromaffin-like cells, and neurons. Its various actions are mediated by histamine receptors H1, H2, H3 and H4. Histamine receptor H3 belongs to the family 1 of G protein-coupled receptors. It is an integral membrane protein and can regulate neurotransmitter release. This receptor can also increase voltage-dependent calcium current in smooth muscles and innervates the blood vessels and the heart in cardiovascular system.
11429	
11430	
11431	
11432	This gene encodes coagulation factor XI of the blood coagulation cascade. Alternative splicing of the gene results in two transcripts. The full length transcript encodes the circulating plasma factor XI and an alternate transcript lacking exon 5 encodes the platelet factor XI. The plasma factor XI is present in plasma as a zymogen, which is an unique plasma coagulation enzyme because it exists as a homodimer consisting of two identical polypeptide chains linked by disulfide bonds. During activation of the plasma factor XI, an internal peptide bond is cleaved by factor XIIa (or XII) in each of the two chains, resulting in activated factor XIa, a serine protease composed of two heavy and two light chains held together by disulfide bonds. This activated plasma factor XI triggers the middle phase of the intrisic pathway of blood coagulation by activating factor IX. Defects in this factor lead to Rosenthal syndrome, a blood coagulation abnormality. The platelet factor XI is localized to platelets and megakaryocytes but absent from other blood cells. The nature of this platelet factor is not known, it may play an important role both in the maintenance of normal hemostasis and as a substitute for plasma factor XI.
11433	KLF13 belongs to a family of transcription factors that contain 3 classical zinc finger DNA-binding domains consisting of a zinc atom tetrahedrally coordinated by 2 cysteines and 2 histidines (C2H2 motif). These transcription factors bind to GC-rich sequences and related GT and CACCC boxes (Scohy et al., 2000).[supplied by OMIM]
11434	
11435	
11436	
11437	
11438	
11439	
11440	
11441	Copper amine oxidases catalyze the oxidative conversion of amines to aldehydes and ammonia in the presence of copper and quinone cofactor. This gene shows high sequence similarity to copper amine oxidases from various species ranging from bacteria to mammals. The protein contains several conserved motifs including the active site of amine oxidases and the histidine residues that likely bind copper. It may be a critical modulator of signal transmission in retina, possibly by degrading the biogenic amines dopamine, histamine, and putrescine. This gene may be a candidate gene for hereditary ocular diseases. Alternate splicing results in multiple transcript variants.
11442	Epiregulin is a member of the epidermal growth factor family. Epiregulin can function as a ligand of EGFR (epidermal growth factor receptor), as well as a ligand of most members of the ERBB (v-erb-b2 oncogene homolog) family of tyrosine-kinase receptors.
11443	
11444	This gene belongs to the forkhead family of transcription factors which is characterized by a distinct forkhead domain.  The specific function of this gene has not yet been determined.
11445	
11446	
11447	The protein encoded by this gene acts as a guanine nucleotide exchange factor for the RHO family of small GTP-binding proteins (RACs).  It has been shown to bind to and activate RAC1 by exchanging bound GDP for free GTP. The encoded protein, which is found mainly in the cytoplasm, is activated by phosphatidylinositol-3,4,5-trisphosphate and the beta-gamma subunits of heterotrimeric G proteins.
11448	
11449	
11450	
11451	The product of this gene belongs to the family of purinoceptors for ATP. This receptor functions as a ligand-gated ion channel. Several characteristic motifs of ATP-gated channels are present in its primary structure, but, unlike other members of the purinoceptors family, this receptor has only a single transmembrane domain. Three transcript variants encoding distinct isoforms have been identified for this gene.
11452	
11453	
11454	
11455	
11456	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a cytoplasmic ribosomal protein that is a component of the 60S subunit. The protein belongs to the L29E family of ribosomal proteins. The protein is also a peripheral membrane protein expressed on the cell surface that directly binds heparin. Although this gene was previously reported to map to 3q29-qter, it is believed that it is located at 3p21.3-p21.2. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
11457	
11458	
11459	
11460	This gene is a member of the paired box (PAX) family of transcription factors.  Members of this gene family typically contain a paired box domain, an octapeptide, and a paired-type homeodomain.  These genes play critical roles during fetal development and cancer growth.  The paired box gene 4 is involved in pancreatic islet development and mouse studies have demonstrated a role for this gene in differentiation of insulin-producing beta cells.
11461	
11462	
11463	
11464	
11465	
11466	This gene is located within the Smith-Magenis syndrome region on chromosome 17. Mutations in this gene are associated with Birt-Hogg-Dube syndrome, which is characterized by fibrofolliculomas, renal tumors, lung cysts, and pneumothorax. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
11467	
11468	L-type calcium channels are composed of five subunits. The protein encoded by this gene represents one of these subunits, gamma, and is one of several gamma subunit proteins. It is an integral membrane protein that is thought to stabilize the calcium channel in an inactive (closed) state. This gene is a member of the neuronal calcium channel gamma subunit gene subfamily of the PMP-22/EMP/MP20 family and is located in a cluster with two similar gamma subunit-encoding genes. Two transcript variants encoding different isoforms have been found for this gene.
11469	
11470	
11471	
11472	This gene is expressed in postmitotic-differentiating neurons of the developmental nervous system and neuronal structures associated with plasticity in the adult. The expression of this gene can be induced by neural activity and neurotrophins. The encoded protein contains a consensus cleavage signal found in glycosylphoshatidylinositol (GPI)-anchored proteins. In vitro assays demonstrated that this protein promotes neurite outgrowth and arborization, suggesting its role in promoting neuritogenesis.
11473	
11474	
11475	
11476	
11477	
11478	Ceruloplasmin is a plasma metalloprotein that that binds most of the copper in plasma. Human ceruloplasmin is composed of a single polypeptide chain. Ceruloplasmin deficiency leads to iron accumulation and cuases damages to a varieties of tissues and organs.
11479	This gene product is a sequence-specific, single-stranded DNA-binding protein. It binds preferentially to the single strand of the purine-rich element termed PUR, which is present at origins of replication and in gene flanking regions in a variety of eukaryotes from yeasts through humans. Thus, it is implicated in the control of both DNA replication and transcription. Deletion of this gene has been associated with myelodysplastic syndrome and acute myelogenous leukemia.
11480	
11481	The corticotropin-releasing hormone receptor binds to corticotropin-releasing hormone (MIM 122560), a potent mediator of endocrine, autonomic, behavioral, and immune responses to stress.[supplied by OMIM]
11482	
11483	
11484	
11485	
11486	UDP-glucose (UDPG) acts as the sugar donor in numerous glycosylation reactions, including those involved in the production of glycogen. NUDT14 is a UDPG pyrophosphatase (EC 3.6.1.45) that hydrolyzes UDPG to produce glucose 1-phosphate and UMP (Yagi et al., 2003).[supplied by OMIM]
11487	
11488	
11489	
11490	
11491	
11492	
11493	
11494	KSP37 is constitutively secreted by Th1-type CD4 (MIM 186940)-positive T lymphocytes and by lymphocytes with cytotoxic potential that usually coexpress perforin (PRF1; MIM 170280).[supplied by OMIM]
11495	The protein encoded by this gene was originally identified as an ovarian tumor antigen monitored in ovarian cancer. The encoded protein contains a B-box/coiled coil motif, which is present in many genes with transformation potential, but the function of this protein is unknown. This gene is located on a region of chromosome 17q21.1 that is in close proximity to tumor suppressor gene BRCA1. Three alternatively spliced variants encoding the same protein have been identified for this gene.
11496	This gene encodes a protein that is related to methyl-CpG-binding proteins but lacks the methyl-CpG binding domain. The protein is localized to discrete areas in the nucleus, and expression appears to be restricted to round spermatids, suggesting that the protein plays a role in the postmeiotic stages of male germ cell development.
11497	The protein encoded by this gene is a transcription factor that binds to GC box elements located in the promoter. Binding of the encoded protein to a single GC box inhibits mRNA expression while binding to tandemly repeated GC box elements activates transcription.
11498	
11499	
11500	This gene encodes a member of the barr gene family. A similar protein in Drosophila melanogaster is necessary for sister chromatid separation and modulates topoisomerase II activity.
11501	The protein encoded by this gene may be an anchoring filament that is a component of basement membranes. It may contribute to the stability of the association of the epithelial layers with the underlying mesenchyme.
11502	
11503	
11504	
11505	This gene encodes a member of the Kruppel family of C2H2-type zinc-finger transcription factor proteins. The encoded protein acts as a transcriptional activator. Two transcript variants encoding distinct isoforms have been identified for this gene. Other transcript variants have been described, but their full length sequence has not been determined.
11506	
11507	
11508	
11509	
11510	The protein encoded by this gene belongs to the tigger subfamily of the pogo superfamily of DNA-mediated transposons in humans. These proteins are related to DNA transposons found in fungi and nematodes, and more distantly to the Tc1 and mariner transposases. They are also very similar to the major mammalian centromere protein B. The exact function of this gene is not known.
11511	
11512	
11513	
11514	SEPT8 is a member of the highly conserved septin family. Septins are 40- to 60-kD GTPases that assemble as filamentous scaffolds. They are involved in the organization of submembranous structures, in neuronal polarity, and in vesicle trafficking (Blaser et al., 2003).[supplied by OMIM]
11515	Aminoacyl-tRNA synthetases are a class of enzymes that charge tRNAs with their cognate amino acids. The protein encoded by this gene is an enzyme belonging to the class II family of aminoacyl-tRNA synthetases. Functioning in the synthesis of histidyl-transfer RNA, the enzyme plays an accessory role in the regulation of protein biosynthesis. The gene is located in a head-to-head orientation with HARS on chromosome five, where the homologous genes share a bidirectional promoter.
11516	
11517	Fructose-1,6-bisphosphate aldolase (EC 4.1.2.13 ) is a tetrameric glycolytic enzyme that catalyzes the reversible conversion of fructose-1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate. Vertebrates have 3 aldolase isozymes which are distinguished by their electrophoretic and catalytic properties. Differences indicate that aldolases A, B, and C are distinct proteins, the products of a family of related @#$%&housekeeping@#$%& genes exhibiting developmentally regulated expression of the different isozymes. The developing embryo produces aldolase A, which is produced in even greater amounts in adult muscle where it can be as much as 5% of total cellular protein. In adult liver, kidney and intestine, aldolase A expression is repressed and aldolase B is produced. In brain and other nervous tissue, aldolase A and C are expressed about equally. There is a high degree of homology between aldolase A and C. Defects in ALDOB cause hereditary fructose intolerance.
11518	The protein encoded by this gene is a member of the basic helix-loop-helix (bHLH)-PAS family of transcription factors. A similar mouse protein may play a regulatory role in the acquisition of specific types of memory. It also may function as a part of a molecular clock operative in the mammalian forebrain.
11519	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
11520	This gene encodes several androgen-dependent, epididymis-specific secretory proteins. The specific functions of these proteins have not been determined, but they are thought to be involved in sperm maturation. Some of the isoforms contain regions of similarity to beta-defensins, a family of antimicrobial peptides. The gene is located on chromosome 8p23 near the defensin gene cluster. Alternative splicing of this gene results in seven transcript variants encoding different isoforms. Two different N-terminal and five different C-terminal protein sequences are encoded by the splice variants. Two additional variants have been described, but their full length sequences have not been determined.
11521	This gene encodes a protein with 3@#$%& exonuclease activity. Enzymes with this activity are involved in DNA replication, repair, and recombination. Similarity to an E. coli protien suggests that this enzyme may be a subunit of DNA polymerase III, which does not have intrinsic exonuclease activity.
11522	This gene encodes a protein with 3@#$%& exonuclease activity. Enzymes with this activity are involved in DNA replication, repair, and recombination. Similarity to an E. coli protien suggests that this enzyme may be a subunit of DNA polymerase III, which does not have intrinsic exonuclease activity.
11523	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes one of the smaller subunits of TFIID that binds to the basal transcription factor GTF2B as well as to several transcriptional activators such as p53 and VP16. A similar but distinct gene (TAF9L) has been found on the X chromosome and a pseudogene has been identified on chromosome 19. Alternative splicing results in multiple transcript variants encoding different isoforms.
11524	The protein encoded by this gene shares similarity with the product of Drosophila syd gene, required for the functional interaction of kinesin I with axonal cargo. Studies of the similar gene in mouse suggested that this protein may interact with, and regulate the activity of numerous protein kinases of the JNK signaling pathway, and thus function as a scaffold protein in neuronal cells. The C. elegans counterpart of this gene is found to regulate synaptic vesicle transport possibly by integrating JNK signaling and kinesin-1 transport. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
11525	
11526	
11527	
11528	
11529	
11530	
11531	This gene encodes a member of the membrane-spanning 4A gene family, members of which are characterized by common structural features and similar intron/exon splice boundaries and display unique expression patterns in hematopoietic cells and nonlymphoid tissues. This family member is associated with mature cellular function in the monocytic lineage, and it may be a component of a receptor complex involved in signal transduction. This gene is localized to 11q12, in a cluster of other family members. At least four alternatively spliced transcript variants encoding two distinct isoforms have been observed.
11532	
11533	
11534	This gene encodes a member of the ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) protein family. ADAMTS family members share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The protein encoded by this gene has a high sequence similarity to the protein encoded by ADAMTS18, another family member. The function of this protein has not been determined.
11535	This gene was identified by its ability to suppress the tumorigenicity of Hela cells in nude mice. The protein encoded by this gene contains a C-terminal region that shares similarity with the Rab 3 family of small GTP binding proteins. This protein preferentially binds to the SH3 domain of c-Abl kinase, and acts as a regulator of MAPK1/ERK2 kinase, which may contribute to its ability to reduce the tumorigenic phenotype in cells. Three alternatively spliced transcript variants of this gene encoding distinct isoforms are identified.
11536	
11537	
11538	This gene encodes a secreted, homodimeric glycoprotein that is expressed in a wide variety of tissues and may have autocrine or paracrine functions. The encoded protein has 10 of its 15 cysteine residues conserved among stanniocalcin family members and is phosphorylated by casein kinase 2 exclusively on its serine residues. Its C-terminus contains a cluster of histidine residues which may interact with metal ions. The protein may play a role in the regulation of renal and intestinal calcium and phosphate transport, cell metabolism, or cellular calcium/phosphate homeostasis. Constitutive overexpression of human stanniocalcin 2 in mice resulted in pre- and postnatal growth restriction, reduced bone and skeletal muscle growth, and organomegaly. Expression of this gene is induced by estrogen and altered in some breast cancers.
11539	
11540	
11541	
11542	
11543	The protein encoded by this gene is a member of a G protein subfamily that mediates signal transduction in pertussis toxin-insensitive systms. This encoded protein may play a role in maintaining the ionic balance of perilymphatic and endolymphatic cochlear fluids.
11544	The protein encoded by this gene is a member of a family of membrane glycoproteins.  This glycoprotein provides selectins with carbohydrate ligands.  It may play a role in tumor cell metastasis.  It may also function in the protection, maintenance, and adhesion of the lysosome.  Alternative splicing of the gene produces two known products of unknown function.
11545	
11546	
11547	
11548	
11549	
11550	
11551	Chloride intracellular channels are involved in chloride ion transport within various subcellular compartments. CLIC5 specifically associates with the cytoskeleton of placenta microvilli.[supplied by OMIM]
11552	
11553	
11554	
11555	
11556	
11557	
11558	
11559	This gene is located within the Smith-Magenis syndrome region on chromosome 17. It is highly similar to its mouse counterpart and is expressed at high levels mainly in neuronal tissues. The protein encoded by this gene includes a polymorphic polyglutamine tract in the N-terminal domain. Expression of the mouse counterpart in neurons is induced by retinoic acid. This gene is associated with both the severity of the phenotype and the response to medication in schizophrenic patients.
11560	This gene is located within the Smith-Magenis syndrome region on chromosome 17. It is highly similar to its mouse counterpart and is expressed at high levels mainly in neuronal tissues. The protein encoded by this gene includes a polymorphic polyglutamine tract in the N-terminal domain. Expression of the mouse counterpart in neurons is induced by retinoic acid. This gene is associated with both the severity of the phenotype and the response to medication in schizophrenic patients.
11561	
11562	Potassium channels represent the most complex class of voltage-gated ino channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. Four sequence-related potassium channel genes - shaker, shaw, shab, and shal - have been identified in Drosophila, and each has been shown to have human homolog(s). This gene encodes a member of the potassium channel, voltage-gated, shaker-related subfamily. This member contains six membrane-spanning domains with a shaker-type repeat in the fourth segment. It belongs to the delayed rectifier class, the function of which could restore the resting membrane potential of beta cells after depolarization and thereby contribute to the regulation of insulin secretion. This gene is intronless, and the gene is clustered with genes KCNA1 and KCNA6 on chromosome 12.
11563	This gene encodes a telomere specific protein, TERF2, which is a component of the telomere nucleoprotein complex. This protein is present at telomeres in metaphase of the cell cycle, is a second negative regulator of telomere length and plays a key role in the protective activity of telomeres. While having similar telomere binding activity and domain organization, TERF2 differs from TERF1 in that its N terminus is basic rather than acidic.
11564	
11565	
11566	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 non-identical kinase catalytic domains and phosphorylates various substrates, including members of the mitogen-activated kinase (MAPK) signalling pathway. The activity of this protein has been implicated in controlling cell growth and differentiation. Mutations in this gene have been associated with Coffin-Lowry syndrome (CLS).
11567	The protein encoded by this gene contains a RING finger motif and is similar to g1, a Drosophila zinc-finger protein that is expressed in mesoderm and involved in embryonic development. The expression of the mouse counterpart was found to be upregulated in myeloblastic cells following IL3 deprivation, suggesting that this gene may regulate growth factor withdrawal-induced apoptosis of myeloid precursor cells.
11568	
11569	The protein encoded by this gene contains several copies of zinc finger motif, which is commonly found in transcriptional regulatory proteins. Studies in mice show that this gene is expressed during embryonic development, and specifically in the testis in adult mice, suggesting that it may play a role in regulating genes in germ cells. Alternative splicing of this gene results in several transcript variants encoding different isoforms.
11570	
11571	
11572	This gene encodes a protein with acyl-CoA thioesterase activity towards medium (C12) and long-chain (C18) fatty acyl-CoA substrates. Expression of a similar murine protein in brown adipose tissue is induced by cold exposure and repressed by warmth. Expression of the mouse protein has been associated with obesity, with higher expression found in obesity-resistant mice compared with obesity-prone mice. Alternative splicing results in two transcript variants encoding different isoforms.
11573	This gene is similar to a mouse gene that encodes a putative protein kinase with a tudor domain, and shows testis-specific expression. Two splice variants have been found for this gene.
11574	This gene product is a sequence-specific, single-stranded DNA-binding protein. It binds preferentially to the single strand of the purine-rich element termed PUR, which is present at origins of replication and in gene flanking regions in a variety of eukaryotes from yeasts through humans. Thus, it is implicated in the control of both DNA replication and transcription. Deletion of this gene has been associated with myelodysplastic syndrome and acute myelogenous leukemia.
11575	
11576	
11577	
11578	IRF1 encodes interferon regulatory factor 1, a member of the interferon regulatory transcription factor (IRF) family. IRF1 serves as an activator of interferons alpha and beta transcription, and in mouse it has been shown to be required for double-stranded RNA induction of these genes. IRF1 also functions as a transcription activator of genes induced by interferons alpha, beta, and gamma. Further, IRF1 has been shown to play roles in regulating apoptosis and tumor-suppressoion.
11579	
11580	Oligophrenin 1 has 25 exons and encodes a Rho-GTPase-activating protein. The Rho proteins are important mediators of intracellular signal transduction, which affects cell migration and cell morphogenesis. Mutations in this gene are responsible for non-specific X-linked mental retardation.
11581	
11582	This gene encodes a member of the fatty acid transport protein family (FATP). FATPs are involved in the uptake of long-chain fatty acids and have unique expression patterns. Alternatively spliced transcript variants encoding the same protein have been found for this gene.
11583	
11584	
11585	
11586	
11587	
11588	MAP3K, or MEK kinase, is a serine/threonine kinase that occupies a pivotal role in a network of phosphorylating enzymes integrating cellular responses to a number of mitogenic and metabolic stimuli, including insulin (MIM 176730) and many growth factors.[supplied by OMIM]
11589	
11590	
11591	
11592	
11593	The protein encoded by this gene shares significant homology to the adenomatous polyposis coli (APC) protein-binding EB1 gene family. The function of this protein is unknown; however, its homology suggests involvement in tumorigenesis of colorectal cancers and proliferative control of normal cells. This gene may belong to the intermediate/early gene family, involved in the signal transduction cascade downstream of the TCR.
11594	
11595	In the mouse, Nkd is a Dishevelled (see DVL1; MIM 601365)-binding protein that functions as a negative regulator of the Wnt (see WNT1; MIM 164820)-beta-catenin (see MIM 116806)-Tcf (see MIM 602272) signaling pathway.[supplied by OMIM]
11596	Pre-mRNA splicing occurs in 2 sequential transesterification steps. The protein encoded by this gene is found to be essential for the catalytic step II in pre-mRNA splicing process. It is found in the spliceosome, and contains seven WD repeats, which function in protein-protein interactions. This protein has a sequence similarity to the yeast splicing factor Prp18.
11597	
11598	
11599	
11600	This gene is one of several CC cytokine genes clustered on the p-arm of chromosome 9. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The protein encoded by this gene is chemotactic for skin-associated memory T lymphocytes. This cytokine may also play a role in mediating homing of lymphocytes to cutaneous sites. It specifically binds to chemokine receptor 10 (CCR10). Studies of a similar murine protein indicate that these protein-receptor interactions have a pivotal role in T cell-mediated skin inflammation.
11601	
11602	
11603	
11604	
11605	This gene encodes a member of the BCL-2 protein family. The proteins of this family form hetero- or homodimers and act as anti- and pro-apoptotic regulators. Expression of this gene in cells has been shown to contribute to reduced cell apoptosis under cytotoxic conditions. Studies of the related gene in mice indicated a role in the survival of NGF- and BDNF-dependent neurons. Mutation and knockout studies of the mouse gene demonstrated an essential role in adult spermatogenesis.
11606	Oxidative-stress responsive 1 gene is located in the vicinity of three others genes - GOLGA4, ITGA9 and HYA22 on chromosome 3. These four genes are considered to be candidate tumor suppressors. Oxidative-stress responsive 1 protein has similarity to human Ste20/oxidant stress response kinase-1 and is thought to be involved in the response to oxidative stress.
11607	
11608	
11609	
11610	GABA is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA-A receptors, which are ligand-gated chloride channels. Chloride conductance of these channels can be modulated by agents such as benzodiazepines that bind to the GABA-A receptor. At least 16 distinct subunits of GABA-A receptors have been identified.
11611	
11612	This gene encodes an assembly factor for the F(1) component of the mitochondrial ATP synthase. This protein binds specifically to the F1 alpha subunit and is thought to prevent this subunit from forming nonproductive homooligomers during enzyme assembly. This gene is located within the Smith-Magenis syndrome region on chromosome 17. An alternatively spliced transcript variant has been described, but its biological validity has not been determined.
11613	The protein encoded by this gene is a member of the CARD protein family, which is defined by the presence of a characteristic caspase-associated recruitment domain (CARD). CARD is a protein interaction domain known to participate in activation or suppression of CARD containing members of the caspase family, and thus plays an important regulatory role in cell apoptosis. The function of this CARD protein has not yet been determined.
11614	
11615	Gamma-aminobutyric acid (GABA) is a major inhibitory neurotransmitter. GABAergic neurotransmission is terminated by the uptake of GABA into the presynaptic terminal and the surrounding astroglial cells by sodium-dependent transporters, such as SLC6A11.[supplied by OMIM]
11616	
11617	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. The enzyme encoded by this gene attaches the first galactose in the common carbohydrate-protein (GlcA-beta1,3-Gal-beta1,3-Gal-beta1,4-Xyl-beta1-O-Ser) linkage found in proteoglycans. Manganese is required as a cofactor. This enzyme differs from the other six beta4GalTs because it lacks the conserved beta4GalT1-beta4GalT6 Cys residues and it is located in cis-Golgi instead of trans-Golgi. Two single-nucleotide mutations were identified from a patient with the progeroid type of Ehlers-Danlos syndrome.
11618	
11619	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are produced by RNA polymerase II and they form a component of the heterogeneous nuclear RNA (hnRNA) complexes. They are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It acts as the catalytic component of the multiprotein editosome complex which mediates apolipoprotein B mRNA editing. Two alternatively spliced transcript variants has been described for this gene which encode different isoforms.
11620	
11621	
11622	This gene encodes a protein that belongs to the basic helix-loop-helix (bHLH) family of transcription factors. Polymorphisms within the encoded protein have been associated with circadian rhythm sleep disorders. A similar protein in mice is a circadian regulator that acts as a transcription factor and forms a heterodimer with aryl hydrocarbon receptor nuclear translocator-like to activate transcription of mouse period 1.
11623	
11624	
11625	This gene encodes a member of the Vent family of homeodomain proteins. The encoded protein may function as a transcriptional repressor and be involved in mesodermal patterning and hemopoietic stem cell maintenance. Multiple pseudogenes exist for this gene. A transcribed pseudogene located on chromosome X may lead to antigen production in certain melanomas.
11626	
11627	This gene encodes a member of the homer family of dendritic proteins. Members of this family regulate group 1 metabotrophic glutamate receptor function.
11628	
11629	
11630	
11631	
11632	
11633	
11634	
11635	
11636	Somatostatin acts at many sites to inhibit the release of many hormones and other secretory proteins.  The biologic effects of somatostatin are probably mediated by a family of G protein-coupled receptors that are expressed in a tissue-specific manner.  SSTR4 is a member of the superfamily of receptors having seven transmembrane segments and is expressed in highest levels in fetal and adult brain and lung.
11637	This gene encodes a ubiquitously expressed transcription factor that controls cholesterol homeostasis by stimulating transcription of sterol-regulated genes. The encoded protein contains a basic helix-loop-helix-leucine zipper (bHLH-Zip) domain.
11638	The protein encoded by this gene was identified using a differential display procedure with tumor biopsies obtained from a noninvasive and an invasive bladder transitional cell carcinoma. Although database searches revealed no homology to any human gene at the time of identification, mouse, rat and zebrafish orthologs have since been identified. The function of this differentially expressed protein is not yet known but it appears to be down-regulated during bladder cancer progression.
11639	This gene is a homolog of the Drosophila delta gene. The delta gene family encodes Notch ligands that are characterized by a DSL domain, EGF repeats, and a transmembrane domain.
11640	
11641	The VG5Q gene encodes a potent angiogenic factor that contains a forkhead-associated domain and a G-patch domain.[supplied by OMIM]
11642	
11643	
11644	
11645	
11646	The family of myotubularin-related proteins includes lipid phosphatases, such as MTM1 (MIM 600415), and pseudophosphatases, such as SBF1 (MIM 603560) and SBF2. Pseudophosphatases contain inactivating substitutions at the catalytic cysteine.[supplied by OMIM]
11647	
11648	
11649	The protein encoded by this gene is a glycosylphosphatidylinositol(GPI)-linked cell surface receptor and a member of the GDNF receptor family. It forms a signaling receptor complex with RET tyrosine kinase receptor and binds the ligand, artemin.
11650	The protein encoded by this gene is a glycosylphosphatidylinositol(GPI)-linked cell surface receptor and a member of the GDNF receptor family. It forms a signaling receptor complex with RET tyrosine kinase receptor and binds the ligand, artemin.
11651	
11652	The protein encoded by this gene is similar to the mouse stromal membrane-associated protein-1. This similarity suggests that this human gene product is also a type II membrane glycoprotein involved in the erythropoietic stimulatory activity of stromal cells.
11653	
11654	Organic ion transporters, such as SLC22A16, transport various medically and physiologically important compounds, including pharmaceuticals, toxins, hormones, neurotransmitters, and cellular metabolites. These transporters are also referred to as amphiphilic solute facilitators (ASFs).[supplied by OMIM]
11655	This gene encodes a member of the STAT-induced STAT inhibitor (SSI), also known as suppressor of cytokine signaling (SOCS), family. SSI family members are cytokine-inducible negative regulators of cytokine signaling. The expression of this gene can be induced by a subset of cytokines, including IL2, IL3 erythropoietin (EPO), CSF2/GM-CSF, and interferon (IFN)-gamma. The protein encoded by this gene functions downstream of cytokine receptors, and takes part in a negative feedback loop to attenuate cytokine signaling. Knockout studies in mice suggested the role of this gene as a modulator of IFN-gamma action, which is required for normal postnatal growth and survival.
11656	
11657	This enzyme catalyzes O-glycan branch synthesis of the core 2 and core 4 type in mucins and controls expression of core 2 branched oligosaccharides and I antigens on the cell surface.
11658	L-type calcium channels are composed of five subunits. The protein encoded by this gene represents one of these subunits, gamma, and is one of several gamma subunit proteins. It is an integral membrane protein that is thought to stabilize the calcium channel in an inactive (closed) state. This gene is a member of the neuronal calcium channel gamma subunit gene subfamily of the PMP-22/EMP/MP20 family and is located in a cluster with two similar gamma subunit-encoding genes. The gene has previously been referred to as CACNG6 but has been renamed calcium channel, voltage-dependent, gamma subunit 8 (CACNG8). It is believed that non-AUG translation initiation occurs for this transcript.
11659	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. The mRNA encodes a cadherin-related neuronal receptor that localizes to synaptic junctions and is putatively involved in specific neuronal connections and signal transduction. Sharing a characteristic with other protocadherin genes, this gene has a notably large exon that encodes 6 cadherin domains and a transmembrane region. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
11660	
11661	
11662	
11663	
11664	The piggyBac family of proteins, found in diverse animals, are transposases related to the transposase of the canonical piggyBac transposon from the moth, Trichoplusia ni. This family also includes genes in several genomes, including human, that appear to have been derived from the piggyBac transposons. This gene belongs to the subfamily of piggyBac transposable element derived (PGBD) genes. The PGBD proteins appear to be novel, with no obvious relationship to other transposases, or other known protein families.
11665	
11666	
11667	
11668	
11669	CpG methylation is an epigenetic modification that is important for embryonic development, imprinting, and X-chromosome inactivation. Studies in mice have demonstrated that DNA methylation is required for mammalian development. This gene encodes a DNA methyltransferase that is thought to function in de novo methylation, rather than maintenance methylation. The protein localizes to the cytoplasm and nucleus and its expression is developmentally regulated. Alternative splicing results in multiple transcript variants encoding different isoforms.
11670	
11671	
11672	
11673	
11674	
11675	The origin recognition complex (ORC) is a highly conserved six subunits protein complex essential for the initiation of the DNA replication in eukaryotic cells. Studies in yeast demonstrated that ORC binds specifically to origins of replication and serves as a platform for the assembly of additional initiation factors such as Cdc6 and Mcm proteins. The protein encoded by this gene is a subunit of the ORC complex. Studies of a similar gene in Drosophila suggested a possible role of this protein in neuronal proliferation and olfactory memory. Alternatively spliced transcript variants encoding distinct isoforms have been reported for this gene.
11676	
11677	
11678	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptor has been shown to activate NF-kappaB and MAPK8/JNK, and induce cell apoptosis. Through its death domain, this receptor interacts with TRADD protein, which is known to serve as an adaptor that mediates signal transduction of TNF-receptors. Knockout studies in mice suggested that this gene plays a role in T-helper cell activation, and may be involved in inflammation and immune regulation.
11679	
11680	
11681	
11682	This gene is a member of the Nod1/Apaf-1 family and encodes a protein with two caspase recruitment (CARD) domains and six leucine-rich repeats (LRRs). The protein is primarily expressed in the peripheral blood leukocytes. It plays a role in the immune response to intracellular bacterial lipopolysaccharides (LPS) by recognizing the muramyl dipeptide (MDP) derived from them and activating the NFKB protein. Mutations in this gene have been associated with Crohn disease and Blau syndrome.
11683	
11684	
11685	
11686	The central core of each mitogen-activated protein kinase (MAPK) pathway is a conserved cascade of 3 protein kinases: an activated MAPK kinase kinase (MAPKKK) phosphorylates and activates a specific MAPK kinase (MAPKK), which then activates a specific MAPK. While the ERK MAPKs are activated by mitogenic stimulation, the CSBP2 and JNK MAPKs are activated by environmental stresses such as osmotic shock, UV irradiation, wound stress, and inflammatory factors. This gene encodes a MAPKKK, the MEKK4 protein, also called MTK1. This protein contains a protein kinase catalytic domain at the C terminus. The N-terminal nonkinase domain may contain a regulatory domain. Expression of MEKK4 in mammalian cells activated the CSBP2 and JNK MAPK pathways, but not the ERK pathway. In vitro kinase studies indicated that recombinant MEKK4 can specifically phosphorylate and activate PRKMK6 and SERK1, MAPKKs that activate CSBP2 and JNK, respectively but cannot phosphorylate PRKMK1, an MAPKK that activates ERKs. MEKK4 is a major mediator of environmental stresses that activate the CSBP2 MAPK pathway, and a minor mediator of the JNK pathway. Two alternatively spliced transcripts encoding distinct isoforms have been described.
11687	
11688	
11689	The protein encoded by this gene is highly similar to the protein encoded by the Saccharomyces cerevisiae SEC14 gene. The SEC14 protein is a phophatidylinositol transfer protein that is essential for biogenesis of Golgi-derived transport vesicles, and thus is required for the export of yeast secretory proteins from the Golgi complex. The specific function of this protein has not yet been determined.
11690	
11691	
11692	
11693	
11694	This gene encodes a transcription factor that belongs to the homeobox gene family. Genes of this family are highly conserved among vertebrates and essential for vertebrate embryonic development. This gene has been implicated to play a role in fetal skin development and cutaneous regeneration. In mice the homeo box B13 gene was shown to exhibit temporal and spatial colinearity in the main body axis of the embryo, but was not expressed in the secondary axes, which suggested its functions in body patterning along the axis. This gene and other HOXB genes form a gene cluster at chromosome the 17q21-22 region.
11695	
11696	The SRG1 gene encodes a protein that is a member of a family of proteins involved in the regulation of transmitter release in the nervous system (Fernandez-Chacon et al., 2001). The membrane-associated synaptic SRG1 protein is under thyroid hormone control during brain development (Thompson, 1996).[supplied by OMIM]
11697	
11698	
11699	This gene encodes a cell surface tyrosine kinase receptor for members of the platelet-derived growth factor family. These growth factors are mitogens for cells of mesenchymal origin. The identity of the growth factor bound to a receptor monomer determines whether the functional receptor is a homodimer or a heterodimer, composed of both platelet-derived growth factor receptor alpha and beta polypeptides. This gene is flanked on chromosome 5 by the genes for granulocyte-macrophage colony-stimulating factor and macrophage-colony stimulating factor receptor; all three genes may be implicated in the 5-q syndrome. A translocation between chromosomes 5 and 12, that fuses this gene to that of the translocation, ETS, leukemia gene, results in chronic myelomonocytic leukemia.
11700	This gene was identified as a gene whose expression can be induced by the tumor necrosis factor alpha (TNF) in umbilical vein endothelial cells. The expression of this gene was shown to be induced by retinoic acid in a cell line expressing a oncogenic version of the retinoic acid receptor alpha fusion protein, which suggested that this gene may be a retinoic acid target gene in acute promyelocytic leukemia.
11701	
11702	
11703	
11704	
11705	
11706	
11707	Sm-like proteins were identified in a variety of organisms based on sequence homology with the Sm protein family (see SNRPD2; MIM 601061). Sm-like proteins contain the Sm sequence motif, which consists of 2 regions separated by a linker of variable length that folds as a loop. The Sm-like proteins are thought to form a stable heteromer present in tri-snRNP particles, which are important for pre-mRNA splicing.[supplied by OMIM]
11708	
11709	
11710	
11711	This gene was identified by its similarity to the gene for human adenylyl cyclase-associated protein. The function of the protein encoded by this gene is unknown. However, the protein appears to be able to interact with adenylyl cyclase-associated protein and actin.
11712	This protein is a transcription factor that negatively regulates other myogenic family proteins. Studies of the mouse homolog, I-mf, show that it interfers with myogenic factor function by masking nuclear localization signals and preventing DNA binding. Knockout mouse studies show defects in the formation of vertebrae and ribs that also involve cartilage formation in these structures.
11713	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3,  and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H2B family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6p22-p21.3.
11714	
11715	
11716	
11717	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order   structures. This gene is intronless and encodes a member of the histone H3 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6.
11718	
11719	This gene encodes a cytosolic, NADP-dependent enzyme that generates NADPH for fatty acid biosynthesis. The activity of this enzyme, the reversible oxidative decarboxylation of malate, links the glycolytic and citric acid cycles. The regulation of expression for this gene is complex. Increased expression can result from elevated levels of thyroid hormones or by higher proportions of carbohydrates in the diet.
11720	
11721	This gene is a member of the beta-1,3-galactosyltransferase (beta3GalT) gene family. This family encodes type II membrane-bound glycoproteins with diverse enzymatic functions using different donor substrates (UDP-galactose and UDP-N-acetylglucosamine) and different acceptor sugars (N-acetylglucosamine, galactose, N-acetylgalactosamine). The beta3GalT genes are distantly related to the Drosophila Brainiac gene and have the protein coding sequence contained in a single exon. The beta3GalT proteins also contain conserved sequences not found in the beta4GalT or alpha3GalT proteins. The carbohydrate chains synthesized by these enzymes are designated as type 1, whereas beta4GalT enzymes synthesize type 2 carbohydrate chains. The ratio of type 1:type 2 chains changes during embryogenesis. By sequence similarity, the beta3GalT genes fall into at least two groups: beta3GalT4 and 4 other beta3GalT genes (beta3GalT1-3, beta3GalT5). This gene is oriented telomere to centromere in close proximity to the ribosomal protein S18 gene. The functionality of the encoded protein is limited to ganglioseries glycolipid biosynthesis.
11722	
11723	
11724	
11725	
11726	
11727	
11728	
11729	The protein encoded by this gene is a subunit of nuclear domain 10 (ND10) bodies. ND10 bodies are nuclear domains appearing immunohistochemically as ten dots per nucleus. They are believed to be associated with the nuclear matrix on the basis of their resistance to nuclease digestion and salt extraction. ND10 proteins are removed from the nucleus by herpes simplex virus-1 infection and may have a role in viral life cycles.
11730	
11731	
11732	HLA-E belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-E binds a restricted subset of peptides derived from the leader peptides of other class I molecules. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail.
11733	
11734	
11735	This gene functions in the protection of cells from Fas- or tumor necrosis factor type alpha-induced apoptosis. Alternative splicing of this gene results in two transcript variants.
11736	The protein encoded by this gene is an apolipoprotein and member of the lipocalin protein family. It is found associated with high density lipoproteins and to a lesser extent with low density lipoproteins and triglyceride-rich lipoproteins. The encoded protein is secreted through the plasma membrane but remains membrane-bound, where it is involved in lipid transport. Two transcript variants encoding two different isoforms have been found for this gene, but only one of them has been fully characterized.
11737	The protein encoded by this gene is a member of the peptidyl-prolyl cis-trans isomerase (PPIase)) family. PPIases catalyze the cis-trans isomerization of proline imidic peptide bonds in oligopeptides and accelerate the folding of proteins. Similar to other PPIases, this protein can bind immunosuppressant cyclosporin A.
11738	
11739	
11740	
11741	
11742	
11743	This gene encodes the acidic form of complement factor 4, part of the classical activation pathway. The protein is expressed as a single chain precursor which is proteolytically cleaved into a trimer of alpha, beta, and gamma chains prior to secretion. The trimer provides a surface for interaction between the antigen-antibody complex and other complement components. The alpha chain may be cleaved to release C4 anaphylatoxin, a mediator of local inflammation. Deficiency of this protein is associated with systemic lupus erythematosus and type I diabetes mellitus. This gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6. Varying haplotypes of this gene cluster exist, such that individuals may have 1, 2, or 3 copies of this gene.
11744	
11745	
11746	CABLES1 is a cyclin-dependent kinase (CDK)-binding protein that plays a role in proliferation and/or differentiation (Zukerberg et al., 2004).[supplied by OMIM]
11747	This gene encodes a member of the membrane-associated guanylate kinase (MAGUK) family. It heteromultimerizes with DLG4. With DLG4 it is recruited into the same NMDA receptor and potassium channel clusters. These two MAGUK proteins may interact at postsynaptic sites to form a multimeric scaffold for the clustering of receptors, ion channels, and associated signaling proteins.
11748	
11749	
11750	
11751	Members of the F-box protein family, such as FBXO36, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
11752	
11753	
11754	HLA-DMB belongs to the HLA class II beta chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DMA) and a beta (DMB) chain, both anchored in the membrane. It is located in intracellular vesicles. DM plays a central role in the peptide loading of MHC class II molecules by helping to release the CLIP (class II-associated invariant chain peptide) molecule from the peptide binding site. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa and its gene contains 6 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail.
11755	
11756	Retinoid X receptors (RXRs) and retinoic acid receptors (RARs), are nuclear receptors that mediate the biological effects of retinoids by their involvement in retinoic acid-mediated gene activation.  These receptors exert their action by binding, as homodimers or heterodimers, to specific sequences in the promoters of target genes and regulating their transcription. The protein encoded by this gene is a member of the steroid and thyroid hormone receptor superfamily of transcriptional regulators.
11757	This gene encodes one of the two alpha chains of type XI collagen, a minor fibrillar collagen. It is located on chromosome 6 very close to but separate from the gene for retinoid X receptor beta. Type XI collagen is a heterotrimer but the third alpha chain is a post-translationally modified alpha 1 type II chain. Proteolytic processing of this type XI chain produces PARP, a proline/arginine-rich protein that is an amino terminal domain. Mutations in this gene are associated with type III Stickler syndrome, otospondylomegaepiphyseal dysplasia (OSMED syndrome), Weissenbacher-Zweymuller syndrome, and autosomal dominant nonsyndromic sensorineural 13 deafness. Three transcript variants encoding different isoforms have been identified for this gene.
11758	Gamma-sarcoglycan is one of several sarcolemmal transmembrane glycoproteins that interact with dystrophin, probably to provide a link between the membrane associated cytoskeleton and the extracellular matrix.  Defects in the protein can lead to early onset autosomal recessive muscular dystrophy, in particular limb-girdle muscular dystrophy, type 2C (LGMD2C).
11759	
11760	
11761	
11762	
11763	The Fanconi anemia complementation group (FANC) currently includes FANCA, FANCB, FANCC, FANCD1 (also called BRCA2), FANCD2, FANCE, FANCF, FANCG, and FANCL. Fanconi anemia is a genetically heterogeneous recessive disorder characterized by cytogenetic instability, hypersensitivity to DNA crosslinking agents, increased chromosomal breakage, and defective DNA repair. The members of the Fanconi anemia complementation group do not share sequence similarity; they are related by their assembly into a common nuclear protein complex. This gene encodes the protein for complementation group E.
11764	This gene encodes a transcription factor that is a member of the leucine zipper family of DNA binding proteins. This protein binds to the cAMP-responsive element, an octameric palindrome. The protein interacts with host cell factor C1, which also associates with the herpes simplex virus (HSV) protein VP16 that induces transcription of HSV immediate-early genes. This protein and VP16 both bind to the same site on host cell factor C1. It is thought that the interaction between this protein and host cell factor C1 plays a role in the establishment of latency during HSV infection. An additional transcript variant has been identified, but its biological validity has not been determined.
11765	
11766	LMO2 encodes a cysteine-rich, two LIM-domain protein that is required for yolk sac erythropoiesis.  The LMO2 protein has a central and crucial role in hematopoietic development and is highly conserved.  The LMO2 transcription start site is located approximately 25 kb downstream from the 11p13 T-cell translocation cluster (11p13 ttc), where a number T-cell acute lymphoblastic leukemia-specific translocations occur.
11767	The protein encoded by this gene has significant homology to NADH oxidoreductases and the apoptosis-inducing factor PDCD8/AIF. Overexpression of this gene has been shown to induce apoptosis. The expression of this gene is found to be induced by tumor suppressor protein p53 in colon caner cells.
11768	
11769	
11770	
11771	The protein encoded by this gene is coordinately expressed with activator of cAMP-responsive element modulator (CREM). It is associated with CREM and confers a powerful transcriptional activation function. CREM acts as a transcription factor essential for the differentiation of spermatids into mature spermatozoa. There are multiple polyadenylation sites found in this gene.
11772	
11773	This gene is a member of the MAD gene family . The MAD genes encode basic helix-loop-helix-leucine zipper proteins that heterodimerize with MAX protein, forming a transcriptional repression complex. The MAD proteins compete for MAX binding with MYC, which heterodimerizes with MAX forming a transcriptional activation complex. Studies in rodents suggest that the MAD genes are tumor suppressors and contribute to the regulation of cell growth in differentiating tissues.
11774	
11775	The four human glycoprotein hormones chorionic gonadotropin (CG), luteinizing hormone (LH), follicle stimulating hormone (FSH), and thyroid stimulating hormone (TSH) are dimers consisting of alpha and beta subunits that are associated noncovalently. The alpha subunits of these hormones are identical, however, their beta chains are unique and confer biological specificity. The protein encoded by this gene is the alpha subunit and belongs to the glycoprotein hormones alpha chain family.
11776	In Drosophila melanogaster, the @#$%&Polycomb@#$%& group (PcG) of genes are part of a cellular memory system that is responsible for the stable inheritance of gene activity. PcG proteins form a large multimeric, chromatin-associated protein complex. The protein encoded by this gene has homology to the Drosophila PcG protein @#$%&polyhomeotic@#$%& (Ph) and is known to heterodimerize with EDR1 and colocalize with BMI1 in interphase nuclei of human cells. The specific function in human cells has not yet been determined. Two transcript variants encoding different isoforms have been found for this gene.
11777	
11778	Synovial sarcomas occur most frequently in the extremities around large joints. More than 90% of cases have a recurrent and specific chromosomal translocation, t(X;18)(p11.2;q11.2), in which the 5-prime end of the SS18 gene (MIM 600192) is fused in-frame to the 3-prime end of the SSX1 (MIM 312820), SSX2 (MIM 300192), or SSX4 (MIM 300326) gene. The SS18L1 gene is homologous to SS18.[supplied by OMIM]
11779	Members of the F-box protein family, such as FBXL18, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
11780	
11781	The protein encoded by this gene is a component of the exocyst complex, a multiple protein complex essential for targeting exocytic vesicles to specific docking sites on the plasma membrane. Though best characterized in yeast, the component proteins and the functions of the exocyst complex have been demonstrated to be highly conserved in higher eukaryotes. At least eight components of the exocyst complex, including this protein, are found to interact with the v-ral simian leukemia viral oncogene homolog A (RALA), a Ras-related small GTPase involved in controlling actin cytoskeletal remodeling and vesicle transport. The binding of this protein to RALA has been shown to mediate filopodia formation in fibroblasts. Mutation studies of the similar gene in Drosophila suggested that this protein is required for neuronal membrane traffic.
11782	
11783	
11784	
11785	
11786	
11787	
11788	
11789	The protein encoded by this gene binds a platelet-derived growth factor-responsive element in the matrix metalloproteinase 3 (stromelysin 1) promoter. The protein localizes to the nucleus and displays DNA-binding and transactivation activities. It is thought to be a transcriptional coactivator, enhancing the activity of transcription factors such as JUN and SP1. Alternative splicing results in two transcript variants encoding different isoforms.
11790	This gene encodes a protein that may play a role in organelle biogenesis associated with melanosomes, platelet dense granules, and lysosomes. A similar protein in mouse is a component of a protein complex termed biogenesis of lysosome-related organelles complex 1 (BLOC-1), and binds to alpha- and beta-dystrobrevins, which are components of the dystrophin-associated protein complex (DPC). Mutations in this gene are associated with Hermansky-Pudlak syndrome type 7. This gene may also be associated with schizophrenia. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
11791	
11792	The protein encoded by this gene is a member of the dual specificity protein tyrosine phosphatase family. This protein is highly similar to Saccharomyces cerevisiae Cdc14, a protein tyrosine phosphatase involved in the exit of cell mitosis and initiation of DNA replication, which suggests the role in cell cycle control. This protein has been shown to interact with and dephosphorylates tumor suppressor protein p53, and is thought to regulate the function of p53. Alternative splice of this gene results in 3 transcript variants encoding distinct isoforms.
11793	
11794	
11795	
11796	
11797	
11798	The SRG1 gene encodes a protein that is a member of a family of proteins involved in the regulation of transmitter release in the nervous system (Fernandez-Chacon et al., 2001). The membrane-associated synaptic SRG1 protein is under thyroid hormone control during brain development (Thompson, 1996).[supplied by OMIM]
11799	This gene encodes a protein containing an RNA-binding motif in the N-terminus and four SRGY (serine, arginine, glycine, tyrosine) boxes in the C-terminus. Multiple copies of this gene are found in the AZFb azoospermia factor region of chromosome Y and the encoded protein is thought to be involved in spermatogenesis. Most copies of this locus are pseudogenes, although six highly similar copies have full-length ORFs and are considered functional. Four functional copies of this gene are found within inverted repeat IR2; two functional copies of this gene are found in palindrome P3, along with two copies of PTPN13-like, Y-linked.
11800	
11801	
11802	This gene encodes a protein that is a member of the Shank family of synaptic proteins that may function as molecular scaffolds in the postsynaptic density (PSD). Shank proteins contain multiple domains for protein-protein interaction, including ankyrin repeats, an SH3 domain, a PSD-95/Dlg/ZO-1 domain, a sterile alpha motif domain, and a proline-rich region. This particular family member contains a PDZ domain, a consensus sequence for cortactin SH3 domain-binding peptides and a sterile alpha motif. The alternative splicing demonstrated in Shank genes has been suggested as a mechanism for regulating the molecular structure of Shank and the spectrum of Shank-interacting proteins in the PSDs of adult and developing brain. Two alternative splice variants, encoding distinct isoforms, are reported. Additional splice variants exist but their full-length nature has not been determined.
11803	
11804	
11805	
11806	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like most family members. This protein interacts with phosphatidylinositol-3-phosphate, and is involved in protein trafficking. This gene results in three transcript variants encoding three distinct isoforms.
11807	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Beta-crystallins, the most heterogeneous, differ by the presence of the C-terminal extension (present in the basic group, none in the acidic group). Beta-crystallins form aggregates of different sizes and are able to self-associate to form dimers or to form heterodimers with other beta-crystallins. This gene, a beta basic group member, is part of a gene cluster with beta-A4, beta-B1, and beta-B3. A chain-terminating mutation was found to cause type 2 cerulean cataracts.
11808	This gene is a member of the cyclophilin family of peptidylprolyl isomerases. The cyclophilins are a highly conserved family, members of which play an important role in protein folding, immunosuppression by cyclosporin A, and infection of HIV-1 virions.
11809	
11810	The protein encoded by this gene is homologous to the murine protein MCAP, which associates with chromosomes during mitosis, and to the human RING3 protein, a serine/threonine kinase. Each of these proteins contains two bromodomains, a conserved sequence motif which may be involved in chromatin targeting. This gene has been implicated as the chromosome 19 target of translocation t(15;19)(q13;p13.1), which defines an upper respiratory tract carcinoma in young people. Two alternatively spliced transcript variants have been described.
11811	This gene encodes several androgen-dependent, epididymis-specific secretory proteins. The specific functions of these proteins have not been determined, but they are thought to be involved in sperm maturation. Some of the isoforms contain regions of similarity to beta-defensins, a family of antimicrobial peptides. The gene is located on chromosome 8p23 near the defensin gene cluster. Alternative splicing of this gene results in seven transcript variants encoding different isoforms. Two different N-terminal and five different C-terminal protein sequences are encoded by the splice variants. Two additional variants have been described, but their full length sequences have not been determined.
11812	
11813	The protein encoded by this gene is a subunit of the receptor for IL23A/IL23. This protein pairs with the receptor molecule IL12RB1/IL12Rbeta1, and both are required for IL23A signaling. This protein associates constitutively with Janus kinase 2 (JAK2), and also binds to transcription activator STAT3 in a ligand-dependent manner.
11814	This gene encodes a CCCH-type zinc finger protein that is thought to prevent infection by retroviruses. Studies of the rat homolog indicate that the protein may primarily function to inhibit viral gene expression and induce an innate immunity to viral infection. Alternative splicing occurs at this locus and two variants, each encoding distinct isoforms, are described.
11815	
11816	SUFU encodes a component of the sonic hedgehog (SHH; MIM 600725)/patched (PTCH; MIM 601309) signaling pathway. Mutations in genes encoding components of this pathway are deleterious for normal development and are associated with cancer-predisposing syndromes (e.g., HPE3, MIM 142945; BCNS, MIM 109400; and GCPS, MIM 175700).[supplied by OMIM]
11817	
11818	
11819	
11820	
11821	
11822	
11823	
11824	
11825	The activation of gene transcription is a multistep process that is triggered by factors that recognize transcriptional enhancer sites in DNA. These factors work with co-activators to direct transcriptional initiation by the RNA polymerase II apparatus. The protein encoded by this gene is a subunit of the CRSP (cofactor required for SP1 activation) complex, which, along with TFIID, is required for efficient activation by SP1. This protein is also a component of other multisubunit complexes e.g. thyroid hormone receptor-(TR-) associated proteins which interact with TR and facilitate TR function on DNA templates in conjunction with initiation factors and cofactors. It also regulates p53-dependent apoptosis and it is essential for adipogenesis. This protein is known to have the ability to self-oligomerize.
11826	
11827	The function of this gene@#$%&s protein product has not been determined. A related protein in mouse suggests that this protein has a conserved function.
11828	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies and its function has not been determined. Four alternatively spliced transcript variants for this gene have been described.
11829	
11830	
11831	
11832	
11833	
11834	
11835	
11836	
11837	This gene is located at chromosome 7p13 which is close to the Williams Beuren syndrome chromosome region 7q11.23.
11838	
11839	
11840	
11841	
11842	
11843	
11844	The protein encoded by this gene is a member of the MAP kinase family. MAP kinases act as an integration point for multiple biochemical signals, and are involved in a wide variety of cellular processes such as proliferation, differentiation, transcription regulation and development. This kinase is activated by various environmental stresses and proinflammatory cytokines. The activation requires its phosphorylation by MAP kinase kinases (MKKs), or its autophosphorylation triggered by the interaction of MAP3K7IP1/TAB1 protein with this kinase. The substrates of this kinase include transcription regulator ATF2, MEF2C, and MAX, cell cycle regulator CDC25B, and tumor suppressor p53, which suggest the roles of this kinase in stress related transcription and cell cycle regulation, as well as in genotoxic stress response. Four alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
11845	Copper amine oxidases catalyze the oxidative conversion of amines to aldehydes in the presence of copper and quinone cofactor. The product is a major protein on the adipocyte plasma membrane. It has adhesive properties and also has functional monoamine oxidase activity. A pseudogene for this gene has been discribed and is located approximately 9-kb downstream.
11846	
11847	
11848	Similarity to a related locus suggests that this gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. This subunit exists as a heterodimer with another polymerase subunit; together they form a core subassembly unit of the polymerase. Two similar genes are located downstream of this locus. Alternative splicing of this gene results in three transcript variants encoding different isoforms.
11849	
11850	
11851	
11852	
11853	
11854	The product of this gene has been localized within the split hand/split foot malformation locus SHFM1 at chromosome 7. It has been proposed to be a candidate gene for the autosomal dominant form of the heterogeneous limb developmental disorder split hand/split foot malformation type 1. In addition, it has been shown to directly interact with BRCA2. It also may play a role in the completion of the cell cycle.
11855	
11856	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form a complex with G proteins and stimulate Rho-dependent signals. A similar protein in rat interacts with glutamate transporter EAAT4 and modulates its glutamate transport activity. Expression of the rat protein induces the reorganization of the actin cytoskeleton and its overexpression induces the formation of membrane ruffling and filopodia. Two alternative transcripts encoding different isoforms have been described.
11857	
11858	This gene is a member of the paired box (PAX) family of transcription factors.  Members of this gene family typically contain a paired box domain, an octapeptide, and a paired-type homeodomain.  These genes play critical roles during fetal development and cancer growth.  The specific function of the paired box gene 9 is unknown but it may involve development of stratified squamous epithelia as well as various organs and skeletal elements.
11859	
11860	
11861	
11862	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
11863	
11864	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. This gene is involved in the regulation of uterine development and is required for female fertility. Mutations in this gene can cause radio-ulnar synostosis with amegakaryocytic thrombocytopenia.
11865	
11866	
11867	The protein encoded by this gene is predicted to be a five-pass transmembrane protein. This gene may be predominately expressed in brain.
11868	The Golgi complex plays a key role in the sorting and modification of proteins exported from the endoplasmic reticulum. The protein encoded by this gene is a membrane protein involved in establishing the stacked structure of the Golgi apparatus. It is a caspase-3 substrate, and cleavage of this encoded protein contributes to Golgi fragmentation in apoptosis. This encoded protein can form a complex with the Golgi matrix protein GOLGA2, and this complex binds to the vesicle docking protein p115. Several alternatively spliced transcript variants of this gene have been identified, but their full-length natures have not been determined.
11869	The protein encoded by this gene belongs to the ARG-specific ADP-ribosyltransferase family. Proteins in this family regulate the function of target proteins by attaching ADP-ribose to specific amino acid residues in their target proteins. The mouse homolog lacks a glycosylphosphatidylinositol-anchor signal sequence and is predicted to be a secretory enzyme.
11870	
11871	
11872	
11873	This gene encodes a cytoplasmic protein which specifically colocalizes and interacts with the serine/threonine protein kinase, receptor-interacting protein (RIP). Zinc finger domains of the encoded protein are required for its interaction with RIP and for inhibition of TNF- and IL1-induced NF-kappa B activation pathways. The encoded protein may also function as an E3 ubiquitin-protein ligase which accepts ubiquitin from E2 ubiquitin-conjugating enzymes and transfers it to substrates. Several alternatively spliced transcript variants have been described for this locus but the full-length natures of only some are known.
11874	The protein encoded by this gene is a putative zinc finger transcription factor and oncoprotein. In acute myeloid leukemia, especially in the M2 subtype, the t(8;21)(q22;q22) translocation is one of the most frequent karyotypic abnormalities. The translocation produces a chimeric gene made up of the 5@#$%&-region of the RUNX1 gene fused to the 3@#$%&-region of this gene. The chimeric protein is thought to associate with the nuclear corepressor/histone deacetylase complex to block hematopoietic differentiation. Several transcript variants encoding multiple isoforms have been found for this gene.
11875	
11876	
11877	
11878	
11879	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
11880	
11881	
11882	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes including embryonic development cell growth, morphogenesis, tissue repair, tumor growth and invasion. This growth factor is a high affinity, heparin dependent ligand for FGFR4. Expression of this gene was detected only in fetal but not adult brain tissue. Synergistic interaction of the chick homolog and Wnt-8c has been shown to be required for initiation of inner ear development.
11883	
11884	
11885	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
11886	This gene encodes an astacin-like zinc-dependent metalloprotease and is a subfamily member of the metzincin family. A similar protein in mice is required during heart development and specifically processes procollagen C-propeptides and chordin at similar cleavage sites.
11887	
11888	
11889	CX31.9 is a member of the large family of connexins that are required for the formation of gap junctions. Six connexin monomers form a hemichannel, or connexon, on the cell surface. This connexon can interact with a connexon from a neighboring cell, thus forming a channel linking the cytoplasm of the 2 cells.[supplied by OMIM]
11890	FUT4 catalyzes the synthesis of the non-sialylated antigen, Lewis x (CD15). It has been detected in human embryos (5-10 weeks) suggesting a role in development.
11891	
11892	This gene is similar to a mouse gene that encodes a putative protein kinase with a tudor domain, and shows testis-specific expression. Two splice variants have been found for this gene.
11893	
11894	
11895	
11896	Cyclin-dependent kinase inhibitor 1C is a tight-binding inhibitor of several G1 cyclin/Cdk complexes and a negative regulator of cell proliferation.  Mutations of CDKN1C are implicated in sporadic cancers and Beckwith-Wiedemann syndorome suggesting that it is a tumor suppressor candidate.
11897	This gene encodes a member of the ephrin (EPH) family. The ephrins and EPH-related receptors comprise the largest subfamily of receptor protein-tyrosine kinases and have been implicated in mediating developmental events, especially in the nervous system and in erythropoiesis. Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. This gene encodes an EFNB class ephrin which binds to the EPHB4 and EPHA3 receptors.
11898	
11899	
11900	
11901	
11902	
11903	
11904	
11905	X-linked inhibitor of apoptosis (XIAP; MIM 300079) is a potent member of the IAP family. All members of this family possess baculoviral IAP (BIR) repeats, cysteine-rich domains of approximately 80 amino acids that bind and inhibit caspases (e.g., CASP3; MIM 600636). XIAP has 3 BIR domains and a C-terminal RING zinc finger that possesses E3 ubiquitin ligase (see MIM 601623) activity. XAF1 antagonizes the anticaspase activity of XIAP and may be important in mediating apoptosis resistance in cancer cells (Liston et al., 2001 [PubMed 11175744]).[supplied by OMIM]
11906	
11907	
11908	
11909	
11910	
11911	
11912	
11913	
11914	
11915	
11916	X-linked inhibitor of apoptosis (XIAP; MIM 300079) is a potent member of the IAP family. All members of this family possess baculoviral IAP (BIR) repeats, cysteine-rich domains of approximately 80 amino acids that bind and inhibit caspases (e.g., CASP3; MIM 600636). XIAP has 3 BIR domains and a C-terminal RING zinc finger that possesses E3 ubiquitin ligase (see MIM 601623) activity. XAF1 antagonizes the anticaspase activity of XIAP and may be important in mediating apoptosis resistance in cancer cells (Liston et al., 2001 [PubMed 11175744]).[supplied by OMIM]
11917	
11918	This gene is a member of the insulin-like growth factor binding protein (IGFBP) family and encodes a protein with an IGFBP domain and a thyroglobulin type-I domain. The protein binds both insulin-like growth factors (IGFs) I and II and circulates in the plasma. Binding of this protein prolongs the half-life of the IGFs and alters their interaction with cell surface receptors. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
11919	The protein encoded by this gene belongs to the pancreatic ribonuclease family. It plays an important role in mRNA cleavage and has marked specificity towards the 3@#$%& side of uridine nucleotides. Alternative splicing occurs at this locus and three transcript variants encoding the same protein have been identified. This gene and ANG share two exons in their 5@#$%& UTRs. The downstream exons are unique and contain the coding regions.
11920	
11921	
11922	
11923	
11924	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein. A pseudogene corresponding to this gene is found on chromosome 11.
11925	
11926	
11927	
11928	
11929	
11930	The product of this gene belongs to the family of transient receptor potential (TRP) channels. TRP channels are cation-selective channels important for cellular calcium signaling and homeostasis. The protein encoded by this gene mediates calcium entry, and this entry is potentiated by calcium store depletion. Alternatively spliced transcript variants encoding different isoforms have been identified.
11931	
11932	
11933	
11934	
11935	
11936	
11937	
11938	
11939	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes including embryonic development cell growth, morphogenesis, tissue repair, tumor growth and invasion. This gene was shown to be prominently expressed in the cerebellum and cortex. The mouse homolog of this gene was localized to specific sites in the midline structures of the forebrain, the midbrain-hindbrain junction, developing skeleton and developing arteries, which suggests a role in central nervous system, bone and vascular development.  This gene was referred to as FGF-13 in reference 2, however, its amino acid sequence and chromosomal localization are identical to FGF17.
11940	
11941	Sialyltransferases such as beta-galactoside alpha-2,6-sialyltransferase II (ST6GalII; EC 2.4.99.1) are type 2 transmembrane proteins that catalyze the transfer of sialic acid from CMP-sialic acid to an acceptor carbohydrate, usually to the terminal ends of carbohydrate chains.[supplied by OMIM]
11942	In the mouse, Nkd is a Dishevelled (see DVL1; MIM 601365)-binding protein that functions as a negative regulator of the Wnt (see WNT1; MIM 164820)-beta-catenin (see MIM 116806)-Tcf (see MIM 602272) signaling pathway.[supplied by OMIM]
11943	
11944	
11945	
11946	The WFIKKN1 protein contains a WAP domain, follistatin domain, immunoglobulin domain, two tandem Kunitz domains, and an NTR domain. This gene encodes a WFIKKN1-related protein which has the same domain organization as the WFIKKN1 protein. The WAP-type, follistatin type, Kunitz-type, and NTR-type protease inhibitory domains may control the action of multiple types of proteases.
11947	
11948	
11949	The ADF (actin-depolymerizing factor)/cofilin family (see MIM 601442) is composed of stimulus-responsive mediators of actin dynamics. ADF/cofilin proteins are inactivated by kinases such as LIM domain kinase-1 (LIMK1; MIM 601329). The SSH family appears to play a role in actin dynamics by reactivating ADF/cofilin proteins in vivo (Niwa et al., 2002).[supplied by OMIM]
11950	
11951	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
11952	
11953	LCOR is a transcriptional corepressor widely expressed in fetal and adult tissues that is recruited to agonist-bound nuclear receptors through a single LxxLL motif, also referred to as a nuclear receptor (NR) box.[supplied by OMIM]
11954	
11955	Fibrinogen-like 2 is a fibrinogen-like protein similar to the beta- and gamma-chains of fibrinogen.  FGL2 carboxyl-terminus consists of the fibrinogen-related domains (FRED).  FGL2 forms a tetrameric complex which is stabilized by interchain disulfide bonds.  FLG2 may play a role in physiologic functions at mucosal sites.
11956	cAMP is a signaling molecule important for a variety of cellular functions. cAMP exerts its effects by activating the cAMP-dependent protein kinase (AMPK), which transduces the signal through phosphorylation of different target proteins. The inactive holoenzyme of AMPK is a tetramer composed of two regulatory and two catalytic subunits. cAMP causes the dissociation of the inactive holoenzyme into a dimer of regulatory subunits bound to four cAMP and two free monomeric catalytic subunits. Four different regulatory subunits and three catalytic subunits of AMPK have been identified in humans. The protein encoded by this gene is one of the regulatory subunits. This subunit can be phosphorylated by the activated catalytic subunit. This subunit has been shown to interact with and suppress the transcriptional activity of the cAMP responsive element binding protein 1 (CREB1) in activated T cells. Knockout studies in mice suggest that this subunit may play an important role in regulating energy balance and adiposity. The studies also suggest that this subunit may mediate the gene induction and cataleptic behavior induced by haloperidol.
11957	Mutations in the KAL1 gene cause the X-linked Kallmann syndrome.  The predictated KAL1 protein sequence is similar to proteins known to function in neural cell adhesion and axonal migration.
11958	The PRX gene encodes L- and S-periaxin, proteins of myelinating Schwann cells, and is mutated in Dejerine-Sottas syndrome (MIM 145900) and Charcot-Marie-Tooth disease type 4F (MIM 145900).[supplied by OMIM]
11959	Inhibition of the nuclear export of poly(A)-containing mRNAs caused by the influenza A virus NS1 protein requires its effector domain. The NS1 effector domain functionally interacts with the cellular 30 kDa subunit of cleavage and polyadenylation specific factor 4, an essential component of the 3@#$%& end processing machinery of cellular pre-mRNAs. In influenza virus-infected cells, the NS1 protein is physically associated with cleavage and polyadenylation specific factor 4, 30kD subunit. Binding of the NS1 protein to the 30 kDa protein in vitro prevents CPSF binding to the RNA substrate and inhibits 3@#$%& end cleavage and polyadenylation of host pre-mRNAs. Thus the NS1 protein selectively inhibits the nuclear export of cellular, and not viral, mRNAs.
11960	This gene encodes a protein for a voltage-gated potassium channel required for the repolarization phase of the cardiac action potential. The gene product can form heteromultimers with two other potassium channel proteins, KCNE1 and KCNE3. Mutations in this gene are associated with hereditary long QT syndrome, Romano-Ward syndrome, Jervell and Lange-Nielsen syndrome and familial atrial fibrillation. The gene is located in a region of chromosome 11 that contains a large number of contiguous genes that are abnormally imprinted in cancer and the Beckwith-Wiedemann syndrome. Three alternative transcripts encoding distinct isoforms have been described.
11961	The Fos gene family consists of 4 members: FOS, FOSB, FOSL1, and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of the JUN family, thereby forming the transcription factor complex AP-1. As such, the FOS proteins have been implicated as regulators of cell proliferation, differentiation, and transformation.
11962	
11963	This gene encodes a type-I integral membrane glycoprotein with diverse distribution in human tissues. The physiological function of this protein may be related to its mucin-type character. The homologous protein in other species has been described as a differentiation antigen and influenza-virus receptor. The specific function of this protein has not been determined but it has been proposed as a marker of lung injury. Alternatively spliced transcript variants encoding different isoforms have been identified.
11964	
11965	
11966	
11967	
11968	
11969	
11970	
11971	
11972	The ITGB2 protein product is the integrin beta chain beta 2. Integrins are integral cell-surface proteins composed of an alpha chain and a beta chain.  A given chain may combine with multiple partners resulting in different integrins.  For example, beta 2 combines with the alpha L chain to form the integrin LFA-1, and combines with the alpha M chain to form the integrin Mac-1. Integrins are known to participate in cell adhesion as well as cell-surface mediated signalling.
11973	
11974	The protein encoded by this gene contains an RGD (Arg-Gly-Asp) motif in the N-terminal region, which confers adhesive properties to macromolecular proteins like fibronectin. It shares a high degree of sequence similarity with the mouse homolog, which has been suggested to play a role in mitochondrial biogenesis. The exact biological function of this gene is not known.
11975	The acetylcholine receptor of muscle has 5 subunits of 4 different types: 2 alpha and 1 each of beta, gamma and delta subunits.  After acetylcholine binding, the receptor undergoes an extensive conformation change that affects all subunits and leads to opening of an ion-conducting channel across the plasma membrane.
11976	
11977	
11978	
11979	
11980	The protein encoded by this gene belongs to a family of P-type cation-transporting ATPases. The gastric H+, K+-ATPase is a heterodimer consisting of a high molecular weight catalytic alpha subunit and a smaller but heavily glycosylated beta subunit. This enzyme is a proton pump that catalyzes the hydrolysis of ATP coupled with the exchange of H(+) and K(+) ions across the plasma membrane. It is also responsible for gastric acid secretion. This gene encodes a catalytic alpha subunit of the gastric H+, K+-ATPase.
11981	
11982	
11983	
11984	
11985	Phosphatidylcholine (PC)-specific phospholipases D (PLDs) catalyze the hydrolysis of PC to produce phosphatidic acid and choline. Activation of PC-specific PLDs occurs as a consequence of agonist stimulation of both tyrosine kinase and G protein-coupled receptors. PC-specific PLDs have been proposed to function in regulated secretion, cytoskeletal reorganization, transcriptional regulation, and cell cycle control.[supplied by OMIM]
11986	
11987	
11988	
11989	
11990	Bin/amphiphysin/Rvs (BAR) domain adaptor proteins, such as BIN3, are named for their unique N-terminal region and appear to integrate signal transduction pathways that regulate membrane dynamics, F-actin cytoskeleton, and nuclear processes.[supplied by OMIM]
11991	
11992	
11993	
11994	Dendritic cells are unique in their ability to present antigen to naive T cells, and therefore play a central role in the initiation of immune responses. The protein encoded by this gene is a transmembrane molecule that is preferentially expressed by dendritic cells. Its expression is down-regulated by ligation of the CD40 molecule. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
11995	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order   structures. This gene is intronless and encodes a member of the histone H4 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6.
11996	
11997	
11998	
11999	
12000	
12001	
12002	
12003	
12004	
12005	
12006	
12007	
12008	
12009	
12010	
12011	U2 auxiliary factor (U2AF), comprised of a large and a small subunit, is a non-snRNP protein required for the binding of U2 snRNP to the pre-mRNA branch site. This gene encodes the U2AF large subunit which contains a sequence-specific RNA-binding region with 3 RNA recognition motifs and an Arg/Ser-rich domain necessary for splicing. The large subunit binds to the polypyrimidine tract of introns early during spliceosome assembly. Multiple transcript variants have been detected for this gene, but the full-length natures of only two have been determined to date.
12012	The members of the zinc fingers and homeoboxes gene family are nuclear homodimeric transcriptional repressors that interact with the A subunit of nuclear factor-Y (NF-YA) and contain two C2H2-type zinc fingers and five homeobox DNA-binding domains. This gene encodes member 1 of this gene family. In addition to forming homodimers, this protein heterodimerizes with members 2 and 3 of the zinc fingers and homeoboxes family. Alternative splicing results in multiple transcript variants encoding the same protein.
12013	
12014	
12015	
12016	
12017	
12018	
12019	
12020	
12021	
12022	
12023	This gene encodes a member of the septin family of cytoskeletal proteins with GTPase activity. This protein localizes to the cytoplasm and nucleus and displays GTP-binding and GTPase activity. Alternate splicing results in two transcript variants encoding different isoforms.
12024	
12025	ADP-ribosylation factor 4 (ARF4) is a member of the human ARF gene family. These genes encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking and as activators of phospholipase D.  The gene products include 6 ARF proteins and 11 ARF-like proteins and constitute 1 family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2,and ARF3), class II (ARF4 and ARF5) and class III (ARF6). The members of each class share a common gene organization. The ARF4 gene spans approximately 12kb and contains six exons and five introns. The ARF4 is the most divergent member of the human ARFs. Conflicting Map positions at 3p14 or 3p21 have been reported for this gene.
12026	This gene encodes a transmembrane protein which is a member of the immunoglobulin superfamily. This protein is found on thymocytes and mature T cells. It plays an essential role in T-cell interactions and also in T-cell/B-cell interaction during early lymphoid development.
12027	
12028	
12029	Angiopoietins are proteins with important roles in vascular development and angiogenesis. All angiopoietins bind with similar affinity to an endothelial cell-specific tyrosine-protein kinase receptor. The protein encoded by this gene is a secreted glycoprotein that activates the receptor by inducing its tyrosine phosphorylation. It plays a critical role in mediating reciprocal interactions between the endothelium and surrounding matrix and mesenchyme. The protein also contributes to blood vessel maturation and stability, and may be involved in early development of the heart. Two transcript variants encoding different isoforms have been found for this gene.
12030	
12031	
12032	The product of this gene belongs to the Serine/Threonine protein kinase family, and to the Ca(2+)/calmodulin-dependent protein kinase subfamily. This protein plays a role in the calcium/calmodulin-dependent (CaM) kinase cascade. Three transcript variants encoding two distinct isoforms have been identified for this gene.
12033	
12034	Calbindin is a calcium-binding protein belonging to the troponin C superfamily. It was originally described as a 27-kD protein induced by vitamin D in the duodenum of the chick. In the brain, its synthesis is independent of vitamin-D-derived hormones. Calbindin contains 4 active calcium-binding domains, and 2 modified domains that presumably have lost their calcium-binding capacity. The neurons in brains of patients with Huntington disease are calbindin-depleted.
12035	
12036	
12037	
12038	
12039	
12040	
12041	
12042	
12043	
12044	
12045	Phosphomannomutase 2 catalyzes the isomerization of mannose 6-phosphate to mannose 1-phosphate.  Mannose 1-phosphate is a precursor to GDP-mannose necessary for the synthesis of dolichol-P-oligosaccharides.  Mutations in the gene have been shown to cause defects in the protein glycosylation pathway manifest as carbohydrate-deficient glycoprotein syndrome type I.
12046	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is located on chromosome 12 and encodes a variant H2A histone. The protein is divergent at the C-terminus compared to the consensus H2A histone family member. Two transcript variants encoding different isoforms have been found for this gene.
12047	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes one of the larger subunits of TFIID that is stably associated with the TFIID complex. It contributes to interactions at and downstream of the transcription initiation site, interactions that help determine transcription complex response to activators.
12048	This gene is a member of the cyclophilin family of peptidylprolyl isomerases. The cyclophilins are a highly conserved ubiquitous family, members of which play an important role in protein folding, immunosuppression by cyclosporin A, and infection of HIV-1 virions. This protein interacts with the proteinase inhibitor eglin c and is localized in the nucleus. This gene encodes three transcript variants encoding two distinct isoforms.
12049	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein and is similar in sequence to its family member CD53 antigen. It is known to complex with integrins and other transmembrane 4 superfamily proteins.
12050	
12051	
12052	Ornithine decarboxylase catalyzes the conversion of ornithine to putrescine in the first and apparently rate-limiting step in polyamine biosynthesis. The ornithine decarboxylase antizymes play a role in the regulation of polyamine synthesis by binding to and inhibiting ornithine decarboxylase. Antizyme expression is auto-regulated by polyamine-enhanced translational frameshifting. In contrast to antizymes 1 and 2, which are widely expressed throughout the body, antizyme 3 transcription is restricted to testis germ cells. Thus antizyme 3, encoded by this gene, is a possible candidate for heritable forms of human male infertility.
12053	
12054	
12055	
12056	
12057	This gene encodes an endoplasmic reticulum-resident type II transmembrane glycosyltransferase involved in the chain elongation step of heparan sulfate biosynthesis. Mutations in this gene cause the type I form of multiple exostoses.
12058	
12059	
12060	
12061	
12062	
12063	
12064	This gene encodes a neuron-restricted protein that contains a CRAL-TRIO motif common to proteins that bind small lipophilic molecules. Mutations in this gene are associated with cerebellar ataxia, Cayman type.
12065	
12066	
12067	
12068	
12069	
12070	
12071	
12072	Serine palmitoyltransferase, which consists of two different subunits, is the key enzyme in sphingolipid biosynthesis. It converts L-serine and palmitoyl-CoA to 3-oxosphinganine with pyridoxal 5@#$%&-phosphate as a cofactor. The product of this gene is the long chain base subunit 1 of serine palmitoyltransferase. Mutations in this gene were identified in patients with hereditary sensory neuropathy type 1. Alternatively spliced variants encoding different isoforms have been identified.
12073	
12074	
12075	KRT17 encodes the type I intermediate filament chain keratin 17, expressed in nail bed, hair follicle, sebaceous glands, and other epidermal appendages. Mutations in this gene lead to Jackson-Lawler type pachyonychia congenita and steatocystoma multiplex.
12076	
12077	
12078	
12079	
12080	
12081	Zinc finger proteins have been shown to interact with nucleic acids and to have diverse functions. The zinc finger domain is a conserved amino acid sequence motif containing 2 specifically positioned cysteines and 2 histidines that are involved in coordinating zinc. Kruppel-related proteins form one family of zinc finger proteins. See ZFP93 (MIM 604749) for additional information on zinc finger proteins.[supplied by OMIM]
12082	cAMP is a signaling molecule important for a variety of cellular functions. cAMP exerts its effects by activating the cAMP-dependent protein kinase (AMPK), which transduces the signal through phosphorylation of different target proteins. The inactive holoenzyme of AMPK is a tetramer composed of two regulatory and two catalytic subunits. cAMP causes the dissociation of the inactive holoenzyme into a dimer of regulatory subunits bound to four cAMP and two free monomeric catalytic subunits. Four different regulatory subunits and three catalytic subunits of AMPK have been identified in humans. The protein encoded by this gene is one of the regulatory subunits. This protein was found to be a tissue-specific extinguisher that down-regulates the expression of seven liver genes in hepatoma x fibroblast hybrids. Functional null mutations in this gene cause Carney complex (CNC), an autosomal dominant multiple neoplasia syndrome. This gene can fuse to the RET protooncogene by gene rearrangement and form the thyroid tumor-specific chimeric oncogene known as PTC2. Three alternatively spliced transcript variants encoding the same protein have been observed.
12083	Sialophorin (leukosialin) is a major sialoglycoprotein on the surface of human T lymphocytes, monocytes, granulocytes, and some B lymphocytes, which appears to be important for immune function and may be part of a physiologic ligand-receptor complex involved in T-cell activation.[supplied by OMIM]
12084	
12085	The product of this gene belongs to the family of purinoceptors for ATP. This receptor functions as a ligand-gated ion channel. Several characteristic motifs of ATP-gated channels are present in its primary structure, but, unlike other members of the purinoceptors family, this receptor has only a single transmembrane domain. Three transcript variants encoding distinct isoforms have been identified for this gene.
12086	
12087	
12088	
12089	
12090	
12091	Component C2 is part of the classical pathway of complement system.  Activated C1 cleaves C2 into C2a and C2b.  C2a leads to activation of C3.  Deficiency of C2 has been reported to associated with certain autoimmune diseases.
12092	
12093	
12094	
12095	
12096	
12097	
12098	
12099	
12100	G protein-coupled receptors (GPCRs, or GPRs) contain 7 transmembrane domains and transduce extracellular signals through heterotrimeric G proteins.[supplied by OMIM]
12101	
12102	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator base.
12103	This gene encodes a cytoplasmic protein which contains SH3 domains. This protein is a member of a family of proteins involved in clathrin-mediated endocytosis. Intersectin 2 is thought to regulate the formation of clathrin-coated vesicles and also may function in the induction of T cell antigen receptor (TCR) endocytosis. Alternatively spliced transcript variants have been found for this gene that encode three distinct isoforms. Additional variants have been found but their full length nature has not been determined.
12104	The protein encoded by this gene is an endogenous calpain (calcium-dependent cysteine protease) inhibitor. It consists of an N-terminal domain L and four repetitive calpain-inhibition domains (domains 1-4), and it is involved in the proteolysis of amyloid precursor protein. The calpain/calpastatin system is involved in numerous membrane fusion events, such as neural vesicle exocytosis and platelet and red-cell aggregation. The encoded protein is also thought to affect the expression levels of genes encoding structural or regulatory proteins. Several alternatively spliced transcript variants of this gene have been described, but only some of their full length natures have been determined.
12105	
12106	
12107	
12108	This gene encodes a member of the transcription elongation factor A (SII)-like (TCEAL) gene family. Members of this family contain TFA domains and may function as nuclear phosphoproteins that modulate transcription in a promoter context-dependent manner. Multiple family members are located on the X chromosome. Alternative splicing results in multiple transcript variants encoding a single isoform.
12109	The SIMP protein contains a highly immunogenic minor histocompatibility antigen epitope of 9 amino acids, B6(dom1). Like ITM1 (MIM 601134), SIMP is homologous to yeast STT3, an oligosaccharyltransferase essential for cell proliferation (McBride et al., 2002 [PubMed 12439619]).[supplied by OMIM]
12110	
12111	
12112	
12113	The protein encoded by this gene is one of the three catalytic subunits of protein phosphatase 1 (PP1). PP1 is a serine/threonine specific protein phosphatase known to be involved in the regulation of a variety of cellular processes, such as cell division, glycogen metabolism, muscle contractility, protein synthesis, and HIV-1 viral transcription. Increased PP1 activity has been observed in the end stage of heart failure. Studies in both human and mice suggest that PP1 is an important regulator of cardiac function. Mouse studies also suggest that PP1 functions as a suppressor of learning and memory. Three alternatively spliced transcript variants encoding different isoforms have been found for this gene.
12114	
12115	The nuclear pore complex is a massive structure that extends across the nuclear envelope, forming a gateway that regulates the flow of macromolecules between the nucleus and the cytoplasm. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells. The protein encoded by this gene is a member of the FG-repeat containing nucleoporins that functions as a soluble cofactor in importin-alpha:beta-mediated nuclear protein import. Pseudogenes of this gene are found on chromosomes 5, 6, and 14. Three transcript variants encoding two different isoforms have been found for this gene.
12116	
12117	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in the inhibition of casein kinase and as a cytokine. Altered expression of this protein is associated with the disease cystic fibrosis.
12118	This gene generates several transcript variants which differ in their first exons. At least three alternatively spliced variants encoding distinct proteins have been reported, two of which encode structurally related isoforms known to function as inhibitors of CDK4 kinase. The remaining transcript includes an alternate first exon located 20 Kb upstream of the remainder of the gene; this transcript contains an alternate open reading frame (ARF) that specifies a protein which is structurally unrelated to the products of the other variants. This ARF product functions as a stabilizer of the tumor suppressor protein p53 as it can interact with, and sequester, MDM1, a protein responsible for the degradation of p53. In spite of the structural and functional differences, the CDK inhibitor isoforms and the ARF product encoded by this gene, through the regulatory roles of CDK4 and p53 in cell cycle G1 progression, share a common functionality in cell cycle G1 control. This gene is frequently mutated or deleted in a wide variety of tumors, and is known to be an important tumor suppressor gene.
12119	The caspase recruitment domain (CARD) is a protein module that consists of 6 or 7 antiparallel alpha helices. It participates in apoptosis signaling through highly specific protein-protein homophilic interactions. CARDs induce nuclear factor kappa-B (NFKB; MIM 164011) activity through the IKK (e.g., IKBKB; MIM 603258) complex. CARD9 (MIM 607212), CARD10, CARD11 (MIM 607210), and CARD14 (MIM 607211) interact with BCL10 (MIM 603517) and are involved in NFKB signaling complexes. Except for CARD9, these CARD proteins are members of the membrane-associated guanylate kinase (MAGUK) family.[supplied by OMIM]
12120	This gene generates several transcript variants which differ in their first exons. At least three alternatively spliced variants encoding distinct proteins have been reported, two of which encode structurally related isoforms known to function as inhibitors of CDK4 kinase. The remaining transcript includes an alternate first exon located 20 Kb upstream of the remainder of the gene; this transcript contains an alternate open reading frame (ARF) that specifies a protein which is structurally unrelated to the products of the other variants. This ARF product functions as a stabilizer of the tumor suppressor protein p53 as it can interact with, and sequester, MDM1, a protein responsible for the degradation of p53. In spite of the structural and functional differences, the CDK inhibitor isoforms and the ARF product encoded by this gene, through the regulatory roles of CDK4 and p53 in cell cycle G1 progression, share a common functionality in cell cycle G1 control. This gene is frequently mutated or deleted in a wide variety of tumors, and is known to be an important tumor suppressor gene.
12121	
12122	
12123	
12124	Nuclear factor of activated T-cells (NFAT) is a transcription factor required for T-cell expression of interleukin 2. NFAT binds to a sequence in the IL2 enhancer known as the antigen receptor response element 2. In addition, NFAT can bind RNA and is an essential component for encapsidation and protein priming of hepatitis B viral polymerase. NFAT is a heterodimer of 45 kDa and 90 kDa proteins, the larger of which is the product of this gene. The encoded protein, which is primarily localized to ribosomes, probably regulates transcription at the level of mRNA elongation. At least three transcript variants encoding three different isoforms have been found for this gene.
12125	
12126	
12127	This gene encodes a small nuclear ribonucleoprotein that belongs to the SNRNP core protein family. The protein may act as a charged protein scaffold to promote SNRNP assembly or strengthen SNRNP-SNRNP interactions through nonspecific electrostatic contacts with RNA.
12128	
12129	
12130	
12131	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. This protein is highly expressed in brain tissue and may play a role in macrophage lipid metabolism and neural development. Two transcript variants encoding different isoforms have been found for this gene.
12132	
12133	
12134	This gene encodes a member of the WD-protein subfamily. This protein shares a high sequence identity to mouse and chick proteins. It contains several WD-repeats spanning most of the protein and an SOCS box in the C-terminus. Alternative splicing of this gene generates three transcript variants that encode distinct isoforms.
12135	
12136	The protein encoded by this gene is a member of the G-protein coupled receptor family. This protein is one of four receptors identified for prostaglandin E2 (PGE2). This receptor can activate T-cell factor signaling. It has been shown to mediate PGE2 induced expression of early growth response 1 (EGR1), regulate the level and stability of cyclooxygenase-2 mRNA, and lead to the phosphorylation of glycogen synthase kinase-3. Knockout studies in mice suggest that this receptor may be involved in the neonatal adaptation of circulatory system, osteoporosis, as well as initiation of skin immune responses.
12137	Cell surface heparan sulfate proteoglycans are composed of a membrane-associated protein core substituted with a variable number of heparan sulfate chains. Members of the glypican-related integral membrane proteoglycan family (GRIPS) contain a core protein anchored to the cytoplasmic membrane via a glycosyl phosphatidylinositol linkage.  These proteins may play a role in the control of cell division and growth regulation.  The GPC4 gene is adjacent to the 3@#$%& end of GPC3 and may also play a role in Simpson-Golabi-Behmel syndrome.
12138	
12139	
12140	
12141	
12142	
12143	
12144	
12145	The protein encoded by this gene belongs to the complex I 19 kDA subunit family. Mammalian complex I is composed of 45 different subunits. This protein has NADH dehydrogenase activity and oxidoreductase activity. It plays an important role in transfering electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone.
12146	
12147	
12148	
12149	
12150	This gene encodes a member of the G protein-coupled receptor family, as well as the EDG family of proteins. This protein functions as a cellular receptor for lysophosphatidic acid and mediates lysophosphatidic acid-evoked calcium mobilization. This receptor couples predominantly to G(q/11) alpha proteins.
12151	
12152	The protein encoded by this gene is a subunit of the multisubunit NADH:ubiquinone oxidoreductase (complex I). Mammalian complex I is composed of 45 different subunits. It locates at the mitochondrial inner membrane. This protein has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
12153	The protein encoded by this gene is a member of the basic helix-loop-helix leucine zipper (bHLHZ) family of transcription factors. It is able to form homodimers and heterodimers with other family members, which include Mad, Mxi1 and Myc. Myc is an oncoprotein implicated in cell proliferation, differentiation and apoptosis. The homodimers and heterodimers compete for a common DNA target site (the E box) and rearrangement among these dimer forms provides a complex system of transcriptional regulation. Multiple alternatively spliced transcript variants have been described for this gene but the full length nature for some of them is unknown.
12154	
12155	This gene encodes an astacin-like zinc-dependent metalloprotease and is a subfamily member of the metzincin family. Unlike other family members, a similar protein in mice does not cleave procollagen C-propeptides or chordin.
12156	
12157	
12158	
12159	
12160	
12161	
12162	
12163	
12164	
12165	
12166	GABA is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA-A receptors, which are ligand-gated chloride channels. Chloride conductance of these channels can be modulated by agents such as benzodiazepines that bind to the GABA-A receptor. At least 16 distinct subunits of GABA-A receptors have been identified. Transcript variants utilizing three different alternative non-coding first exons have been described.
12167	
12168	
12169	
12170	This gene encodes one of the members of the two-pore-domain background potassium channel protein family. This type of potassium channel is formed by two homodimers that create a channel that leaks potassium out of the cell to control resting membrane potential. The channel can be opened, however, by certain anesthetics, membrane stretching, intracellular acidosis, and heat. Three transcript variants encoding different isoforms have been found for this gene.
12171	NUP160 is 1 of up to 60 proteins that make up the 120-MD nuclear pore complex, which mediates nucleoplasmic transport.[supplied by OMIM]
12172	
12173	
12174	The protein encoded by this gene is a mediator of mitochondrial fusion. This protein and mitofusin 2 are homologs of the Drosophila protein fuzzy onion (Fzo). They are mitochondrial membrane proteins that interact with each other to facilitate mitochondrial targeting. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
12175	The protein encoded by this gene is a mediator of mitochondrial fusion. This protein and mitofusin 2 are homologs of the Drosophila protein fuzzy onion (Fzo). They are mitochondrial membrane proteins that interact with each other to facilitate mitochondrial targeting. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
12176	This gene encodes one of eight subunits composing COP9 signalosome, a highly conserved protein complex that functions as an important regulator in multiple signaling pathways. The structure and function of COP9 signalosome is similar to that of the 19S regulatory particle of 26S proteasome. COP9 signalosome has been shown to interact with SCF-type E3 ubiquitin ligases and act as a positive regulator of E3 ubiquitin ligases.
12177	
12178	
12179	
12180	The protein encoded by this gene belongs to the family of latent TGF-beta binding proteins (LTBPs). The secretion and activation of TGF-betas is regulated by their association with latency-associated proteins and with latent TGF-beta binding proteins. The product of this gene targets latent complexes of transforming growth factor beta to the extracellular matrix, where the latent cytokine is subsequently activated by several different mechanisms. Alternatively spliced transcript variants encoding different isoforms have been identified.
12181	FABP6 encodes the ileal fatty acid binding protein. Fatty acid binding proteins are a family of small, highly conserved, cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. FABP6 and FABP1 (the liver fatty acid binding protein) are also able to bind bile acids. It is thought that FABPs roles include fatty acid uptake, transport, and metabolism.
12182	
12183	
12184	
12185	
12186	
12187	
12188	
12189	Immunoglobulins recognize foreign antigens and initiate immune responses such as phagocytosis and the complement system. Each immunoglobulin molecule consists of two identical heavy chains and two identical light chains. There are two classes of light chains, kappa and lambda. This region represents the germline organization of the lambda light chain locus. The locus includes V (variable), J (joining), and C (constant) segments. During B cell development, a recombination event at the DNA level joins a single V segment with a J segment; the C segment is later joined by splicing at the RNA level. Recombination of many different V segments with several J segments provides a wide range of antigen recognition. Additional diversity is attained by junctional diversity, resulting from the random additional of nucleotides by terminal deoxynucleotidyltransferase, and by somatic hypermutation, which occurs during B cell maturation in the spleen and lymph nodes. Several V segments and three C segments are known to be incapable of encoding a protein and are considered pseudogenes. The locus also includes several non-immunoglobulin genes, many of which are pseudogenes or are predicted by automated computational analysis or homology to other species.
12190	Dihydropyrimidine dehydrogenase  (DPYD) is a pyrimidine  catabolic enzyme and the initial and rate-limiting step in the pathway of  uracil and thymidine catabolism and also in the pathway leading to the  formation of beta-alanine. The DPYD gene is a large gene of approximately 150  kb consisting of 23 exons encoding a protein of approximately 111-kDa. Genetic  deficiency of DPYD enzyme results in an error in pyrimidine metabolism  associated with thymine-uraciluria and an increased risk of toxicity in  cancer patients receiving 5-flourouracil chemotherapy.
12191	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
12192	Actins are highly conserved proteins that are involved in various types of cell motility, and maintenance of the cytoskeleton.  In vertebrates, three main groups of actin isoforms, alpha, beta and gamma have been identified.  The alpha actins are found in muscle tissues and are a major constituent of the contractile apparatus.  The beta and gamma actins co-exist in most cell types as components of the cytoskeleton, and as mediators of internal cell motility.  Actin, gamma 1, encoded by this gene, is a cytoplasmic actin found in nonmuscle cells.
12193	
12194	
12195	The protein encoded by this gene is phosphorylated by ZAP-70/Syk protein tyrosine kinases following activation of the T-cell antigen receptor (TCR) signal transduction pathway. This transmembrane protein localizes to lipid rafts and acts as a docking site for SH2 domain-containing proteins. Upon phosphorylation, this protein recruits multiple adaptor proteins and downstream signaling molecules into multimolecular signaling complexes located near the site of TCR engagement. Alternative splicing results in multiple transcript variants encoding different isoforms.
12196	The protein encoded by this gene is phosphorylated by ZAP-70/Syk protein tyrosine kinases following activation of the T-cell antigen receptor (TCR) signal transduction pathway. This transmembrane protein localizes to lipid rafts and acts as a docking site for SH2 domain-containing proteins. Upon phosphorylation, this protein recruits multiple adaptor proteins and downstream signaling molecules into multimolecular signaling complexes located near the site of TCR engagement. Alternative splicing results in multiple transcript variants encoding different isoforms.
12197	
12198	The replication of DNA in eukaryotic cells is carried out by a complex chromosomal replication apparatus, in which DNA polymerase alpha and primase are two key enzymatic components. Primase, which is a heterodimer of a small subunit and a large subunit, synthesizes small RNA primers for the Okazaki fragments made during discontinuous DNA replication. The protein encoded by this gene is the large, 58 kDa primase subunit.
12199	
12200	
12201	
12202	DCXR is an enzyme that has both diacetyl reductase (EC 1.1.1.5) and L-xylulose reductase (EC 1.1.1.10) activities (Nakagawa et al., 2002).[supplied by OMIM]
12203	
12204	This gene encodes a protein containing two conserved tandem RNA recognition motifs. Similar proteins in other species function as RNA-binding proteins and play central roles in posttranscriptional gene regulation. Two transcript variants encoding distinct isoforms have been identified for this gene.
12205	
12206	
12207	The protein encoded by this gene is a member of the SWI/SNF family of proteins. Members of this family have helicase and ATPase activities and are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. Two transcript variants encoding different isoforms have been found for this gene.
12208	This gene encodes a protein with a gelsolin-like actin binding domain and an N-terminal leucine-rich repeat-protein protein interaction domain. The protein is similar to a Drosophila protein involved in early embryogenesis and the structural organization of indirect flight muscle. The gene is located within the Smith-Magenis syndrome region on chromosome 17.
12209	
12210	The homeodomain transcription factor EMX2 is critical for central nervous system and urogenital development. EMX1 (MIM 600034) and EMX2 are related to the @#$%&empty spiracles@#$%& gene expressed in the developing Drosophila head.[supplied by OMIM]
12211	
12212	
12213	
12214	
12215	
12216	
12217	
12218	
12219	
12220	
12221	Apoptosis is a cell death process that removes toxic and/or useless cells during mammalian development. The apoptotic process is accompanied by shrinkage and fragmentation of the cells and nuclei and degradation of the chromosomal DNA into nucleosomal units. DNA fragmentation factor (DFF) is a heterodimeric protein of 40-kD (DFFB) and 45-kD (DFFA) subunits. DFFA is the substrate for caspase-3 and triggers DNA fragmentation during apoptosis. DFF becomes activated when DFFA is cleaved by caspase-3. The cleaved fragments of DFFA dissociate from DFFB, the active component of DFF. DFFB has been found to trigger both DNA fragmentation and chromatin condensation during apoptosis. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants has not been determined.
12222	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. This protein is expressed in the canalicular (apical) part of the hepatocyte and functions in biliary transport. Substrates include anticancer drugs such as vinblastine; therefore, this protein appears to contribute to drug resistance in mammalian cells. Several different mutations in this gene have been observed in patients with Dubin-Johnson syndrome (DJS), an autosomal recessive disorder characterized by conjugated hyperbilirubinemia.
12223	This gene encodes a protein originally thought to be related to the collagenase gene family. This gene is one of three highly similar genes in a region of duplication located on the p arm of chromosome 16. These three genes encode closely related proteins that may have the same function. The protein encoded by one of these genes has been identified as part of a protein complex that participates in the Nodal signaling pathway during vertebrate development. Mutations in ABCC6, which is located nearby, rather than mutations in this gene are associated with pseudoxanthoma elasticum (PXE).
12224	Cytochrome c oxidase (COX) is the terminal enzyme of the mitochondrial respiratory chain. It is a multi-subunit enzyme complex that couples the transfer of electrons from cytochrome c to molecular oxygen and contributes to a proton electrochemical gradient across the inner mitochondrial membrane. The complex consists of 13 mitochondrial- and nuclear-encoded subunits. The mitochondrially-encoded subunits perform the electron transfer of proton pumping activities. The functions of the nuclear-encoded subunits are unknown but they may play a role in the regulation and assembly of the complex. This gene encodes the nuclear-encoded subunit Va of the human mitochondrial respiratory chain enzyme. A pseudogene COX5AP1 has been found in chromosome 14q22.
12225	
12226	
12227	
12228	The anaphase-promoting complex (APC) consists of at least 8 protein subunits, including APC5, CDC27 (APC3; MIM 116946), CDC16 (APC6; MIM 603461), and CDC23 (APC8; MIM 603462).[supplied by OMIM]
12229	The anaphase-promoting complex (APC) consists of at least 8 protein subunits, including APC5, CDC27 (APC3; MIM 116946), CDC16 (APC6; MIM 603461), and CDC23 (APC8; MIM 603462).[supplied by OMIM]
12230	This gene encodes a disintegrin and metalloproteinase with thrombospondin motifs-12 (ADAMTS12), which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS-1) motif. Individual members of this family differ in the number of C-terminal TS-1 motifs, and some have unique C-terminal domains. The enzyme encoded by this gene contains 8 TS-1 motifs. It may play roles in pulmonary cells during fetal development or in tumor processes through its proteolytic activity or as a molecule potentially involved in regulation of cell adhesion.
12231	Alpha-1-antitrypsin is a protease inhibitor, deficiency of which is associated with emphysema and liver disease. The protein is encoded by a gene (PI) located on the distal long arm of chromosome 14.[supplied by OMIM]
12232	
12233	
12234	The protein encoded by this gene is a transforming protein that is related to Rho-specific exchange factors and yeast cell cycle regulators. The expression of this gene is elevated with the onset of DNA synthesis and remains elevated during G2 and M phases. In situ hybridization analysis showed that expression is at a high level in cells undergoing mitosis in regenerating liver. Thus, this protein is expressed in a cell cycle-dependent manner during liver regeneration, and is thought to have an important role in the regulation of cytokinesis.
12235	CSRP2 is a member of the CSRP family of genes, encoding a group of LIM domain proteins, which may be involved in regulatory processes important for development and cellular differentiation.  CRP2 contains two copies of the cysteine-rich amino acid sequence motif (LIM) with putative zinc-binding activity, and may be involved in regulating ordered cell growth.  Other genes in the family include CSRP1 and CSRP3.
12236	
12237	
12238	Huntington@#$%&s disease (HD), a neurodegenerative disorder characterized by loss of striatal neurons, is caused by an expansion of a polyglutamine tract in the HD protein huntingtin. This gene encodes a protein belonging to a class of huntingtin interacting proteins characterized by WW motifs. Two transcripts encoding different isoforms have been completely described.
12239	The NF-kappa-B (NFKB) complex of proteins is inhibited by I-kappa-B (IKB) proteins, which inactivate NFKB by trapping it in the cytoplasm. Phosphorylation of serine residues on the IKB proteins by IKB kinases marks them for destruction via the ubiquitination pathway, thereby allowing activation and nuclear translocation of the NFKB complex. The protein encoded by this gene is similar to IKB kinases and can mediate NFKB activation in response to certain growth factors. For example, the protein can form a complex with the IKB protein TANK and TRAF2 and release the NFKB inhibition caused by TANK.
12240	
12241	
12242	
12243	
12244	
12245	
12246	
12247	Ameloblastin is thought to represent an unique ameloblast-specific gene product that may be important in enamel matrix formation and mineralization. The gene is located on chromosome 4 near other genes associated with mineralized tissues: osteopontin, bone sialoprotein, and bone morphogenetic protein 3. Based on its cytogenetic location, this gene is a candidate gene for one form of the disorder, dentinogenesis imperfecta, and/or the disorder, autosomal dominant amylogenesis imperfecta.
12248	This gene encodes a predicted transmembrane protein containing two extracellular CUB domains followed by a low-density lipoprotein class A (LDLa) domain. It also has an intracellular FXNPXY-like motif, which has been shown in other proteins to be essential for the internalization of clathrin coated pits during endocytosis. Alternatively spliced transcript variants encoding distinct proteins have been described.
12249	
12250	
12251	EIF2A is a 65-kD protein that catalyzes the formation of puromycin-sensitive 80S preinitiation complexes (Zoll et al., 2002).[supplied by OMIM]
12252	The protein encoded by this gene is a calcium-dependent glycoprotein that is a member of the desmocollin subfamily of the cadherin superfamily. These desmosomal family members, along with the desmogleins, are found primarily in epithelial cells where they constitute the adhesive proteins of the desmosome cell-cell junction and are required for cell adhesion and desmosome formation. The desmosomal family members are arranged in two clusters on chromosome 18, occupying less than 650 kb combined. Alternative splicing results in two transcript variants encoding distinct isoforms.
12253	
12254	
12255	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein.
12256	Transaldolase 1 is a key enzyme of the nonoxidative pentose phosphate pathway providing ribose-5-phosphate for nucleic acid synthesis and NADPH for lipid biosynthesis. This pathway can also maintain glutathione at a reduced state and thus protect sulfhydryl groups and cellular integrity from oxygen radicals. The functional gene of transaldolase 1 is located on chromosome 11 and a pseudogene is identified on chromosome 1 but there are conflicting map locations. The second and third exon of this gene were developed by insertion of a retrotransposable element. This gene is thought to be involved in multiple sclerosis.
12257	Calbindin D9K, the vitamin D-dependent calcium-binding protein (CALB3) is a cytosolic protein of molecular weight 9,000 and belongs to a family of calcium-binding proteins that includes calmodulin, parvalbumin, troponin C, and S100 protein. CALB3 is found in the mammalian intestine, placenta, uterus, and kidney. Its exact function is unknown. The intestinal protein is vitamin D-dependent and its expression correlates with calcium transport activity. CALB3 may increase Ca2+ absorption by buffering Ca2+ in the cytoplasm and increase ATP-dependent Ca2+ transport in duodenal basolateral membrane vesicles. In keeping with its role in calcium absorption, concentrations of CALB3 are highest in duodenal villus enterocytes. Expression levels in adult human tissue were substantially lower than in child intestine. Located immediately proximal to DXS43 on chromosome X, CALB3 is a small (5.5 kb) gene comprising 3 exons. There is a possible relationship of CALB3 to hereditary hypophosphatemia.
12258	
12259	The protein encoded by this gene is a member of the peptidyl-prolyl cis-trans isomerase (PPIase) family. PPIases catalyze the cis-trans isomerization of proline imidic peptide bonds in oligopeptides and accelerate the folding of proteins. This protein has been shown to possess PPIase activity and, similar to other family members, can bind to the immunosuppressant cyclosporin A.
12260	
12261	This gene product binds to the SH3 domain of CRK protein. It may regulate cell surface extension and may have a role in the cell surface extension of an engulfing cell around a dying cell during apoptosis.
12262	
12263	
12264	HLA-DRB1 belongs to the HLA class II beta chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DRA) and a beta chain (DRB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa. It is encoded by 6 exons, exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DR molecule the beta chain contains all the polymorphisms specifying the peptide binding specificities. Hundreds of DRB1 alleles have been described and typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. DRB1 is expressed at a level five times higher than its paralogues DRB3, DRB4 and DRB5. DRB1 is present in all individuals. Allelic variants of DRB1 are linked with either none or one of the genes DRB3, DRB4 and DRB5. There are 4 related pseudogenes: DRB2, DRB6, DRB7, DRB8 and DRB9.
12265	
12266	
12267	The protein encoded by this gene is a cytokine that controls the production, differentiation, and function of macrophages. The active form of the protein is found extracellularly as a disulfide-linked homodimer, and is thought to be produced by proteolytic cleavage of membrane-bound precursors. The encoded protein may be involved in development of the placenta. Four transcript variants encoding three different isoforms have been found for this gene.
12268	
12269	
12270	Glycophorins A (GYPA) and B (GYPB) are major sialoglycoproteins of the human erythrocyte membrane which bear the antigenic determinants for the MN and Ss blood groups. GYPA gene consists of 7 exons and has 97% sequence homology with GYPB from the 5@#$%& UTR to the coding sequence encoding the first 45 amino acids. In addition to the M or N and S or s antigens, that commonly occur in all populations, about 40 related variant phenotypes have been identified. These variants include all the variants of the Miltenberger complex and several isoforms of Sta; also, Dantu, Sat, He, Mg, and deletion variants Ena, S-s-U- and Mk. Most of the variants are resulted from gene recombinations between GYPA and GYPB.
12271	The protein encoded by this gene is a member of the platelet-derived growth factor/vascular endothelial growth factor (PDGF/VEGF) family and is active in angiogenesis, lymphangiogenesis, and endothelial cell growth. This secreted protein undergoes a complex proteolytic maturation, generating multiple processed forms which bind and activate VEGFR-2 and VEGFR-3 receptors. This protein is structurally and functionally similar to vascular endothelial growth factor C.
12272	Receptor tyrosine kinases (RTKs) play a key role in the communication of cells with their microenvironment. These molecules are involved in the regulation of cell growth, differentiation, and metabolism. In several cases the biochemical mechanism by which RTKs transduce signals across the membrane has been shown to be ligand induced receptor oligomerization and subsequent intracellular phosphorylation. This autophosphorylation leads to phosphorylation of cytosolic targets as well as association with other molecules, which are involved in pleiotropic effects of signal transduction. RTKs have a tripartite structure with extracellular, transmembrane, and cytoplasmic regions. This gene encodes a member of a novel subclass of RTKs and contains a distinct extracellular region encompassing a factor VIII-like domain. Alternative splicing in the 5@#$%& UTR results in multiple transcript variants encoding the same protein.
12273	
12274	
12275	
12276	This locus controls the synthesis of the Kell blood group @#$%&precursor substance@#$%& (Kx). Mutations in this gene have been associated with McLeod syndrome, an X-linked, recessive disorder characterized by abnormalities in the neuromuscular and hematopoietic systems. The encoded protein has structural characteristics of prokaryotic and eukaryotic membrane transport proteins.
12277	
12278	
12279	Whereas most DNA sequence-specific transcription factors increase the rate of initiation and interact with enhancer or promoter DNA, human immunodeficiency virus-1 (HIV-1) Tat predominantly stimulates elongation and interacts with the trans-acting responsive (TAR) RNA element. Tat is essential for HIV replication.[supplied by OMIM]
12280	
12281	
12282	
12283	
12284	
12285	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. The encoded protein is one of two functional homologs to yeast Upf3p. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein binds to the mRNA and remains bound after nuclear export, acting as a nucleocytoplasmic shuttling protein. It forms with Y14 a complex that binds specifically 20 nt upstream of exon-exon junctions. This gene is located on the long arm of chromosome X. Two splice variants encoding different isoforms have been found for this gene.
12286	
12287	
12288	Chloride channels are a diverse group of proteins that regulate fundamental cellular processes including stabilization of cell membrane potential, transepithelial transport, maintenance of intracellular pH, and regulation of cell volume. Chloride intracellular channel 2 is a member of the p64 family; the protein is detected in fetal liver and adult skeletal muscle tissue.  This gene maps to the candidate region on chromosome X for incontinentia pigmenti.
12289	
12290	
12291	GDP dissociation inhibitors are proteins that regulate the GDP-GTP exchange reaction of members of the rab family, small GTP-binding proteins of the ras superfamily, that are involved in vesicular trafficking of molecules between cellular organelles.  GDIs slow the rate of dissociation of GDP from rab proteins and release GDP from membrane-bound rabs.   GDI1 is expressed primarily in neural and sensory tissues.  Mutations in GDI1 have been linked to X-linked nonspecific mental retardation.
12292	
12293	
12294	MCF2 is a member of a large family of GDP-GTP exchange factors that modulate the activity of small GTPases of the Rho family. Five-prime recombinations result in the loss of N-terminal codons, producing MCF2 variants with oncogenic potential.[supplied by OMIM]
12295	Ocular albinism type 1 protein is a conserved integral membrane protein with seven transmembrane domains.  It is expressed in the eye and epidermal melanocytes.
12296	
12297	This gene is located in the nonrecombining portion of the Y chromosome, and expressed specifically in testis. The encoded protein interacts with ubiquitin protein ligase E3A and may be involved in male germ cell development and male infertility. Three nearly identical copies of this gene exist on chromosome Y; two copies are part of a palindromic region. This record represents the copy outside of the palidromic region.
12298	
12299	This gene encodes a ssDNA binding protein that activates the far upstream element (FUSE) of c-myc and stimulates expression of c-myc in undifferentiated cells. Regulation of FUSE by FUBP occurs through single-strand binding of FUBP to the non-coding strand. This protein has been shown to function as an ATP-dependent DNA helicase.
12300	
12301	
12302	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has three repeats of quasi-RRM domains that bind to RNAs and also contains a nuclear localization motif.
12303	This gene product is a member of the serine-arginine (SR) family of proteins, which is involved in constitutive and regulated RNA splicing. Members of this family are characterized by N-terminal RNP1 and RNP2 motifs, which are required for binding to RNA, and multiple C-terminal SR/RS repeats, which are important in mediating association with other cellular proteins. This protein can influence splice site selection of adenovirus E1A pre-mRNA. It interacts with the oncoprotein TLS, and abrogates the influence of TLS on E1A pre-mRNA splicing. Alternative splicing of this gene results in at least two transcript variants encoding different isoforms. In addition, transcript variants utilizing alternative polyA sites exist.
12304	The protein encoded by this gene belongs to the calcium sensitive chloride conductance protein family. To date, all members of this gene family map to the same site on chromosome 1p31-p22 and share high degrees of homology in size, sequence and predicted structure, but differ significantly in their tissue distributions. The exact function of this protein is not known.
12305	This gene encodes a nuclear protein that interacts with cyclin D-type binding-protein 1, which is thought to be a cell cycle regulator at the G1/S transition. Two alternatively spliced variants have been identified.
12306	This gene encodes one of the three alpha chains of type IX collagen, the major collagen component of hyaline cartilage. Type IX collagen, a heterotrimeric molecule, is usually found in tissues containing type II collagen, a fibrillar collagen. This chain is unusual in that, unlike the other two type IX alpha chains, it contains a covalently attached glycosaminoglycan side chain. Mutations in this gene are associated with multiple epiphyseal dysplasia.
12307	The origin recognition complex (ORC) is a highly conserved six subunits protein complex essential for the initiation of the DNA replication in eukaryotic cells. Studies in yeast demonstrated that ORC binds specifically to origins of replication and serves as a platform for the assembly of additional initiation factors such as Cdc6 and Mcm proteins. The protein encoded by this gene is the largest subunit of the ORC complex. While other ORC subunits are stable throughout the cell cycle, the levels of this protein vary during the cell cycle, which has been shown to be controlled by ubiquitin-mediated proteolysis after initiation of DNA replication. This protein is found to be selectively phosphorylated during mitosis. It is also reported to interact with MYST histone acetyltransferase 2 (MyST2/HBO1), a protein involved in control of transcription silencing.
12308	Type I iodothyronine deiodinase is a thiol-requiring propylthiouracil-sensitive oxidoreductase. It activates thyroid hormone by converting the prohormone thyroxine (T4) by outer ring deiodination (ORD) to bioactive 3,3@#$%&,5-triiodothyronine (T3). It also degrades both hormones by inner ring deiodination (IRD). Additional transcript variants have been described but are not supported by experimental evidence.
12309	
12310	
12311	
12312	Tenascin-R (TNR) is an extracellular matix protein expressed primarily in the central nervous system. It is a member of the tenascin (TN) gene family, which includes at least 3 genes in mammals: TNC (or hexabrachion; MIM 187380), TNX (TNXB; MIM 600985), and TNR (Erickson, 1993). The genes are expressed in distinct tissues at different times during embryonic development and are present in adult tissues.[supplied by OMIM]
12313	
12314	
12315	
12316	CKS1B protein binds to the catalytic subunit of the cyclin dependent kinases and is essential for their biological function. The CKS1B mRNA is found to be expressed in different patterns through the cell cycle in HeLa cells, which reflects a specialized role for the encoded protein.
12317	
12318	
12319	
12320	
12321	This gene product belongs to the tektin family of proteins. Tektins comprise a family of filament-forming proteins that are coassembled with tubulins to form ciliary and flagellar microtubules. This gene is expressed in the testis and its protein is localized to the flagella of the sperms, indicating that it may play a role in spermatogenesis.
12322	
12323	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. This protein is highly similar to the rat NFB42 (neural F Box 42 kDa) protein which is enriched in the nervous system and may play a role in maintaining neurons in a postmitotic state.
12324	
12325	
12326	Receptors for the Fc fragment of IgG, or FCGRs (see MIM 146790), are cell surface glycoproteins of the Ig superfamily (IgSF). These receptors mediate phagocytosis of IgG-coated pathogens and promote activation of effector cells, leading to inflammatory responses and antibody-mediated cellular cytotoxicity. All FCGR genes map to human chromosome 1. Additional genes in this region, including FREB, encode FCGR homologs that are selectively expressed in B cells and may be implicated in B-cell development and lymphomagenesis.[supplied by OMIM]
12327	
12328	The regulator of G protein signaling (RGS) proteins are signal transduction molecules that have structural homology to SST2 of Saccharomyces cerevisiae and EGL-10 of Caenorhabditis elegans. Multiple genes homologous to SST2 are present in higher eukaryotes. RGS proteins are involved in the regulation of heterotrimeric G proteins by acting as GTPase activators.[supplied by OMIM]
12329	
12330	
12331	
12332	This gene is one of three related genes that have 2-hydroxyacid oxidase activity yet differ in encoded protein amino acid sequence, tissue expression and substrate preference. Subcellular location of the encoded protein is the peroxisome. Specifically, this gene is expressed predominantly in liver and kidney and has the highest activity toward the substrate 2-hydroxypalmitate. Two alternatively spliced variants encoding the same isoform have been described.
12333	The protein encoded by this gene is an isozyme of very long-chain acyl-CoA synthetase (VLCS). It is capable of activating very long-chain fatty-acids containing 24- and 26-carbons. It is expressed in liver and associated with endoplasmic reticulum but not with peroxisomes. Its primary role is in fatty acid elongation or complex lipid synthesis rather than in degradation. This gene has a mouse ortholog.
12334	The product of the CHML gene supports geranylgeranylation of most Rab proteins and may substitute for REP-1 in tissues other than retina. CHML is localized close to the gene for Usher syndrome type II.
12335	
12336	The protein encoded by this gene, a member of the peptidase C1 family, is a lysosomal cysteine proteinase that may participate in the degradation of antigenic proteins to peptides for presentation on MHC class II molecules. The encoded protein can function as an elastase over a broad pH range in alveolar macrophages. Transcript variants utilizing alternative polyadenylation signals exist for this gene.
12337	
12338	
12339	
12340	
12341	Elastases form a subfamily of serine proteases that hydrolyze many proteins in addition to elastin. Humans have six elastase genes which encode the structurally similar proteins elastase 1, 2, 2A, 2B, 3A, and 3B. Like most of the human elastases, elastase 2A is secreted from the pancreas as a zymogen. In other species, elastase 2A has been shown to preferentially cleave proteins after leucine, methionine, and phenylalanine residues. Clinical literature that describes human elastase 1 activity in the pancreas is actually referring to elastase 2A.
12342	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the subunit b of the proton channel. Alternatively spliced transcript variants encoding different isoforms have been identified.
12343	
12344	
12345	
12346	
12347	
12348	
12349	
12350	
12351	
12352	The protein encoded by this gene binds to the intronic cluster of RNA regulatory elements, downstream control sequence (DCS). It is implicated in controlling the assembly of other splicing-regulatory proteins. This protein is very similar to the polypyrimidine tract binding protein but it is expressed primarily in the brain.
12353	
12354	CYR61 is a secreted, cysteine-rich, heparin-binding protein encoded by a growth factor-inducible immediate-early gene. Acting as an extracellular, matrix-associated signaling molecule, CYR61 promotes the adhesion of endothelial cells through interaction with integrin and augments growth factor-induced DNA synthesis in the same cell type.[supplied by OMIM]
12355	CYR61 is a secreted, cysteine-rich, heparin-binding protein encoded by a growth factor-inducible immediate-early gene. Acting as an extracellular, matrix-associated signaling molecule, CYR61 promotes the adhesion of endothelial cells through interaction with integrin and augments growth factor-induced DNA synthesis in the same cell type.[supplied by OMIM]
12356	
12357	
12358	
12359	MIG6 is a cytoplasmic protein whose expression is upregulated with cell growth (Wick et al., 1995). It shares significant homology with the protein product of rat gene-33, which is induced during cell stress and mediates cell signaling (Makkinje et al., 2000; Fiorentino et al., 2000).[supplied by OMIM]
12360	
12361	
12362	
12363	
12364	This gene encodes a proline-rich protein. Studies of the related mouse gene suggest that this gene is a p53 regulated gene and may participate in p53-mediated growth suppression. Two transcript variants encoding different isoforms have been found for this gene.
12365	
12366	
12367	
12368	
12369	
12370	
12371	This gene is a proto-oncogene candidate. It is identified by its translocation into the immunoglobulin alpha-locus in some cases of B-cell leukemia. The protein encoded by this gene contains seven ankyrin repeats, which are most closely related to those found in I kappa B proteins. This protein functions as a transcriptional co-activator that activates through its association with NF-kappa B homodimers. The expression of this gene can be induced by NF-kappa B, which forms a part of the autoregulatory loop that controls the nuclear residence of p50 NF-kappa B.
12372	The adiponectin receptors, ADIPOR1 and ADIPOR2 (MIM 607946), serve as receptors for globular and full-length adiponectin (MIM 605441) and mediate increased AMPK (see MIM 602739) and PPAR-alpha (PPARA; MIM 170998) ligand activities, as well as fatty acid oxidation and glucose uptake by adiponectin (Yamauchi et al., 2003).[supplied by OMIM]
12373	
12374	
12375	
12376	
12377	The protein encoded by this gene is a member of the bone morphogenetic protein (BMP) family and the TGF-beta superfamily. This group of proteins is characterized by a polybasic proteolytic processing site which is cleaved to produce a mature protein containing seven conserved cysteine residues. The members of this family are regulators of cell growth and differentiation in both embryonic and adult tissues. Studies in rodents suggest that this protein plays a role in the adult liver and in differentiation of cholinergic central nervous system neurons.
12378	
12379	
12380	
12381	
12382	
12383	
12384	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbws class; in addition to an F-box, this protein contains multiple WD-40 repeats. This protein is homologous to Xenopus bTrCP1, yeast Met30, Neurospora Scon2 and Drosophila Slimb proteins. It interacts with HIV-1 Vpu and connects CD4 to the proteolytic machinery. It also associates specifically with phosphorylated IkappaBalpha and beta-catenin destruction motifs, probably functioning in multiple transcriptional programs by activating the NF-kappaB pathway and inhibiting the beta-catenin pathway.
12385	
12386	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein localizes to nuclear bodies. Its structure is similar to some tumor suppressor proteins and its gene maps to a locus thought to contain tumor suppressor genes.
12387	
12388	
12389	
12390	
12391	
12392	This gene encodes a protein involved in spindle checkpoint function. The encoded protein contains four WD repeat domains and has sequence similarity with the yeast BUB3 protein. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
12393	
12394	The early growth response protein 2 is a transcription factor with three tandem C2H2-type zinc fingers.  Mutations in this gene are associated with the autosomal dominant Charcot-Marie-Tooth disease, type 1.
12395	
12396	
12397	
12398	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this DEAD box protein family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a member of this family. The function of this member has not been determined. Alternative splicing of this gene generates 2 transcript variants, but the full length nature of one of the variants has not been defined.
12399	
12400	This gene encodes a member of the homeobox family of transcription factors, many of which are involved in developmental processes. Expression in specific hematopoietic lineages suggests that this protein may play a role in hematopoietic differentiation. A second transcript variant has been described, but its full length nature has not been determined.
12401	
12402	BAG proteins compete with Hip for binding to the Hsc70/Hsp70 ATPase domain and promote substrate release. All the BAG proteins have an approximately 45-amino acid BAG domain near the C terminus but differ markedly in their N-terminal regions. The protein encoded by this gene contains a WW domain in the N-terminal region and a BAG domain in the C-terminal region. The BAG domains of BAG1, BAG2, and BAG3 interact specifically with the Hsc70 ATPase domain in vitro and in mammalian cells. All 3 proteins bind with high affinity to the ATPase domain of Hsc70 and inhibit its chaperone activity in a Hip-repressible manner.
12403	
12404	
12405	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
12406	
12407	
12408	Oxygen sensing is essential for homeostasis in all aerobic organisms. A phagocyte-type oxidase, similar to that responsible for the production of large amounts of reactive oxygen species (ROS) in neutrophil granulocytes, with resultant antimicrobial activity, has been postulated to function in the kidney as an oxygen sensor that regulates the synthesis of erythropoietin (EPO; MIM 133170) in the renal cortex.[supplied by OMIM]
12409	The protein encoded by this gene belongs to the innexin family. Innexin family members are the structural components of gap junctions. This protein and pannexin 2 are abundantly expressed in central nerve system (CNS) and are coexpressed in various neuronal populations. Studies in Xenopus oocytes suggest that this protein alone and in combination with pannexin 2 may form cell type-specific gap junctions with distinct properties.
12410	This gene encodes a member of the fibronectin leucine rich transmembrane protein (FLRT) family. The family members may function in cell adhesion and/or receptor signalling. Their protein structures resemble small leucine-rich proteoglycans found in the extracellular matrix. The encoded protein shares sequence similarity with two other family members, FLRT2 and FLRT3. This gene is expressed in kidney and brain.
12411	
12412	The protein encoded by this gene is involved in the sodium-independent transport and excretion of organic anions, some of which are potentially toxic. The encoded protein is an integral membrane protein and is found mainly in the kidney and in the placenta, where it may act to prevent potentially harmful organic anions from reaching the fetus.
12413	The protein encoded by this gene was identified by its ability to protect retroviruses from intramolecular integration and therefore promote intermolecular integration into the host cell genome. The endogenous function of the protein is unknown. The protein forms a homodimer which localizes to the nucleus and is specifically associated with chromosomes during mitosis. This protein binds to DNA in a non-specific manner and studies in rodents suggest that it also binds to lamina-associated polypeptide 2, a component of the nuclear lamina.
12414	
12415	
12416	
12417	The protein encoded by this gene is an epithelial-derived, integral membrane serine protease. This protease forms a complex with the Kunitz-type serine protease inhibitor, HAI-1, and is found to be activated by sphingosine 1-phosphate. This protease has been shown to cleave and activate hepatocyte growth factor/scattering factor, and urokinase plasminogen activator, which suggest the function of this protease as an epithelial membrane activator for other proteases and latent growth factors. The expression of this protease has been associated with breast, colon, prostate, and ovarian tumors, which implicates its role in cancer invasion, and metastasis.
12418	This gene encodes a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes which undergo proteolytic processing at conserved aspartic residues to produce 2 subunits, large and small, that dimerize to form the active enzyme. Overexpression of the active form of this enzyme has been shown to induce apoptosis in fibroblasts. Max, a central component of the Myc/Max/Mad transcription regulation network important for cell growth, differentiation, and apoptosis, was reported to be cleaved by this protein, which required Fas-mediated dephosphorylation of Max. The expression of this gene was found to be regulated by interferon-gamma and lipopolysaccharide.
12419	
12420	This gene encodes a member of a family of small membrane proteins that share a 35-amino acid signature sequence domain, beginning with the sequence PFXYD and containing 7 invariant and 6 highly conserved amino acids. The approved human gene nomenclature for the family is FXYD-domain containing ion transport regulator. Mouse FXYD5 has been termed RIC (Related to Ion Channel). FXYD2, also known as the gamma subunit of the Na,K-ATPase, regulates the properties of that enzyme. FXYD1 (phospholemman), FXYD2 (gamma), FXYD3 (MAT-8), FXYD4 (CHIF), and FXYD5 (RIC) have been shown to induce channel activity in experimental expression systems. Transmembrane topology has been established for two family members (FXYD1 and FXYD2), with the N-terminus extracellular and the C-terminus on the cytoplasmic side of the membrane. The Type III integral membrane protein encoded by this gene is the gamma subunit of the Na,K-ATPase present on the plasma membrane. Although the Na,K-ATPase does not depend on the gamma subunit to be functional, it is thought that the gamma subunit modulates the enzyme@#$%&s activity by inducing ion channel activity. Mutations in this gene have been associated with renal hypomagnesaemia.
12421	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S25E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
12422	
12423	
12424	
12425	
12426	
12427	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein.
12428	
12429	
12430	The protein encoded by this gene belongs to the peptidase family S14 and hydrolyzes proteins into small peptides in the presence of ATP and magnesium. The protein is transported into mitochondrial matrix and is associated with the inner mitochondrial membrane.
12431	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes a protein that is a component of the PCAF histone acetylase complex and structurally similar to one of the histone-like TAFs, TAF6. The PCAF histone acetylase complex, which is composed of more than 20 polypeptides some of which are TAFs, is required for myogenic transcription and differentiation.
12432	
12433	
12434	Lactate dehydrogenase A catalyzes the conversion of L-lactate and NAD to pyruvate and NADH in the final step of anaerobic glycolysis.  LDHA is found predominantly in muscle tissue and belongs to the lactate dehydrogenase family.  Mutations in LDHA have been linked to exertional myoglobinuria.
12435	The protein encoded by this gene belongs to the NALP protein family. NALPs are cytoplasmic proteins that may form large, signal-induced multiprotein complexes called inflammasomes. Inflammasomes activate the pro-inflammatory caspases, caspase-1 and caspase-5.
12436	
12437	This gene is one of several genes in the imprinted gene domain of 11p15.5 which is considered to be an important tumor suppressor gene region. Alterations in this region may be associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocortical carcinoma, and lung, ovarian, and breast cancer. Studies of the mouse gene, however, which is also located in an imprinted gene domain, have shown that the product of this gene regulates placental growth.
12438	This gene encodes a type II transmembrane glycoprotein belonging to the M28 peptidase family. Three functionally distinct proteins are encoded, including folylpoly-gamma-glutamate carboxypeptidase in the intestine, N-acetylated alpha-linked acidic dipeptidase 1 in the brain, and prostate-specific membrane antigen in the prostate. A mutation in the intestinal form may be associated with impaired intestinal absorption of dietary folates, resulting in low blood folate levels and consequent hyperhomocysteinemia. The form expressed in the brain may be involved in a number of pathological conditions associated with glutamate excitotoxicity. The prostate form is up-regulated in cancerous cells and is used as an effective diagnostic and prognostic indicator of prostate cancer. This gene likely arose from a duplication event of a nearby chromosomal region. Alternative splicing gives rise to multiple transcript variants.
12439	After removal of the precursor signal peptide, the insulin receptor precursor is post-translationally cleaved into two chains (alpha and beta) that are covalently linked.  Binding of insulin to the insulin receptor (INSR) stimulates glucose uptake.
12440	Members of the parvin family, including PARVA, are actin-binding proteins associated with focal contacts.[supplied by OMIM]
12441	The epsilon globin gene (HBE) is normally expressed in the embryonic yolk sac: two epsilon chains together with two zeta chains (an alpha-like globin) constitute the embryonic hemoglobin Hb Gower I; two epsilon chains together with two alpha chains form the embryonic Hb Gower II. Both of these embryonic hemoglobins are normally supplanted by fetal, and later, adult hemoglobin. The five beta-like globin genes are found within a 45 kb cluster on chromosome 11 in the following order:  5@#$%&-epsilon - G-gamma - A-gamma - delta - beta-3@#$%&
12442	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein is part of the RoSSA ribonucleoprotein which includes a single polypeptide and one of four small RNA molecules. The RoSSA particle localizes to both the cytoplasm and the nucleus. RoSSA interacts with autoantigens in patients with Sjogren syndrome and systemic lupus erythematosus. The function of the RoSSA particle has not been determined. Two alternatively spliced transcript variants for this gene have been described; however, the full length nature of one variant has not been determined.
12443	The protein encoded by this gene is a member of the folate receptor (FOLR) family. Members of this gene family have a high affinity for folic acid and for several reduced folic acid derivatives, and mediate delivery of 5-methyltetrahydrofolate to the interior of cells.  This protein has a 68% and 79% sequence homology with the FOLR1 and FOLR3 proteins, respectively. The FOLR2 protein was originally thought to exist only in placenta, but is also detected in spleen, bone marrow, and thymus.
12444	This gene encodes a microtubule-associated protein. The encoded protein is a calmodulin-binding and calmodulin-regulated protein that is involved in microtubule stabilization. Two transcript variants encoding distinct isoforms have been identified for this gene.
12445	Leukocyte Ig-like receptors (LIRs) are a family of immunoreceptors expressed predominantly on monocytes and B cells and at lower levels on dendritic cells and natural killer (NK) cells. Activation of various immune cell types can be prevented by negative signaling receptors through interactions with specific ligands, such as MHC class I molecules by NK cells. All of the LIR inhibitory receptors, members of subfamily B, contain a cytoplasmic immunoreceptor tyrosine-based inhibitory motif (ITIM). Upon MHC class I (or other ligand) engagement and tyrosine phosphorylation of the ITIM, intracellular protein-tyrosine phosphatases such as SHP1 (PTPN6; MIM 176883) are recruited, and an inhibitory signal cascade ensues. Other LIR receptors, members of subfamily A, with short cytoplasmic regions containing no ITIMs and with transmembrane regions containing a charged arginine residue, may initiate stimulatory cascades (see, e.g., MIM 604810). One member of subfamily A (LILRA3; MIM 604818) lacks a transmembrane region and is presumed to be a soluble receptor.[supplied by OMIM]
12446	
12447	
12448	This gene is an ortholog of the S. cerevisiae YDL201w gene, which is predicted to encode a methyltransferase. The gene product contains a conserved S-adenosylmethionine-binding motif, which is typical of a methyltransferase. Alternative splice variants encoding different protein isoforms and transcript variants utilizing alternative polyA sites have been described in the literature.
12449	This gene is an ortholog of the S. cerevisiae YDL201w gene, which is predicted to encode a methyltransferase. The gene product contains a conserved S-adenosylmethionine-binding motif, which is typical of a methyltransferase. Alternative splice variants encoding different protein isoforms and transcript variants utilizing alternative polyA sites have been described in the literature.
12450	This gene encodes an inhibitor of apoptosis protein (IAP)-binding protein. The encoded mitochondrial protein enters the cytosol when cells undergo apoptosis, and it moderates the caspase inhibition of IAPs. Multiple polyadenylation sites have been found for this gene. Several alternatively spliced transcript variants that encode distinct isoforms have been described for this gene but the validity of some transcripts, and their predicted ORFs, has not been determined conclusively.
12451	
12452	The protein encoded by this gene is a Krebs tricarboxylic acid cycle enzyme that catalyzes the synthesis of citrate from oxaloacetate and acetyl coenzyme A. The enzyme is found in nearly all cells capable of oxidative metablism. This protein is nuclear encoded and transported into the mitochondrial matrix, where the mature form is found.
12453	The mouse protein stargazin is one of five subunits comprising neuronal voltage-gated calcium channels. This subunit, gamma, is thought to stabilize the calcium channel in an inactive (closed) state. Mutations in the gene encoding stargazin have been associated with absence seizures, also known as petit-mal or spike-wave seizures. The protein encoded by this gene is structurally similar to the mouse stargazin protein and is a member of the neuronal calcium channel gamma subunit protein family. However, it appears unlikely that the encoded protein is part of a functional calcium channel. Rather, it appears to inhibit the expression of a specific calcium channel subunit.
12454	This gene encodes a member of the DEAD box protein family. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure, such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of only one transcript has been confirmed.
12455	
12456	
12457	The protein encoded by this gene is a member of the LAR protein-tyrosine phosphatase-interacting protein (liprin) family. Liprins interact with members of LAR family of transmembrane protein tyrosine phosphatases, which are known to be important for axon guidance and mammary gland development. It has been proposed that liprins are multivalent proteins that form complex structures and act as scaffolds for the recruitment and anchoring of LAR family of tyrosine phosphatases. This protein was found to interact with S100A4, a calcium-binding protein related to tumor invasiveness and metastasis. In vitro experiment demonstrated that the interaction inhibited the phosphorylation of this protein by protein kinase C and protein kinase CK2. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
12458	The protein encoded by this gene is a member of the STAT family of transcription factors. In response to cytokines and growth factors, STAT family members are phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. This protein plays a central role in exerting IL4 mediated biological responses. It is found to induce the expression of BCL2L1/BCL-X(L), which is responsible for the anti-apoptotic activity of IL4. Knockout studies in mice suggested the roles of this gene in differentiation of T helper 2 (Th2) cells, expression of cell surface markers, and class switch of immunoglobulins.
12459	
12460	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and a single intracellular catalytic domains, and thus represents a receptor-type PTP. The similar gene predominately expressed in mouse brain was found to associate with, and thus regulate the activity and cellular localization of MAP kinases. The rat counterpart of this gene was reported to be regulated by the nerve growth factor, which suggested the function of this gene in neuronal growth and differentiation.
12461	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXC genes located in a cluster on chromosome 12. The product of this gene binds to a promoter element of the lactase-phlorizin hydrolase. It also may play a role in early intestinal development. An alternatively spliced variant encoding a shorter isoform has been described but its full-length nature has not been determined.
12462	
12463	This gene encodes an enzyme included in the 2@#$%&, 5@#$%& oligoadenylate synthase family.  This enzyme is induced by interferons and catalyzes the 2@#$%&, 5@#$%& oligomers of adenosine in order to bind and activate RNase L.  This enzyme family plays a significant role in the inhibition of cellular protein synthesis and viral infection resistance.
12464	
12465	
12466	The evolutionarily conserved THRAP genes encode a family of proteins that regulate embryonic development. THRAP2 is involved in early development of the heart and brain (Muncke et al., 2003).[supplied by OMIM]
12467	
12468	
12469	
12470	The protein encoded by this gene is similar to ING1, a tumor suppressor protein that can interact with TP53, inhibit cell growth, and induce apoptosis. This protein contains a PHD-finger, which is a common motif in proteins involved in chromatin remodeling. This protein can bind TP53 and EP300/p300, a component of the histone acetyl transferase complex, suggesting its involvement in the TP53-dependent regulatory pathway. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
12471	The protein encoded by this gene is a member of the G protein-coupled receptor family. This protein is one of four receptors identified for prostaglandin E2 (PGE2). Through a phosphatidylinositol-calcium second messenger system, G-Q proteins mediate this receptor@#$%&s activity. Knockout studies in mice suggested a role of this receptor in mediating algesia and in regulation of blood pressure. Studies in mice also suggested that this gene may mediate adrenocorticotropic hormone response to bacterial endotoxin.
12472	
12473	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptor is required for generation and long-term maintenance of T cell immunity. It binds to ligand CD70, and plays a key role in regulating B-cell activation and immunoglobulin synthesis. This receptor transduces signals that lead to the activation of NF-kappaB and MAPK8/JNK. Adaptor proteins TRAF2 and TRAF5 have been shown to mediate the signaling process of this receptor. CD27-binding protein (SIVA), a proapoptotic protein, can bind to this receptor and is thought to play an important role in the apoptosis induced by this receptor.
12474	The adiponectin receptors, ADIPOR1 (MIM 607945) and ADIPOR2, serve as receptors for globular and full-length adiponectin (MIM 605441) and mediate increased AMPK (see MIM 602739) and PPAR-alpha (PPARA; MIM 170998) ligand activities, as well as fatty acid oxidation and glucose uptake by adiponectin (Yamauchi et al., 2003).[supplied by OMIM]
12475	
12476	This gene encodes twinfilin, an actin monomer-binding protein conserved from yeast to mammals. Studies of the mouse counterpart suggest that this protein may be an actin monomer-binding protein, and its localization to cortical G-actin-rich structures may be regulated by the small GTPase RAC1. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
12477	
12478	
12479	
12480	
12481	KCNA1/Kv1.1 encodes a voltage-gated delayed potassium channel that is phylogenetically related to the Drosophila Shaker channel (accession number: M17211). The KCNA1/Kv1.1 product has six putative transmembrane segments (S1-S6), and the loop between S5 and S6 forms the pore and contains the conserved selectivity filter motif (GYGD). The functional channel is a homotetramer. The predominant KCNA1 Kv1.1 transcript is 8.0 kb. The coding region and the entire known 5@#$%& and 3@#$%& UTR is contained in a single uninterrupted exon (AC006063, M36456). The N-terminus of the channel is associated with beta subunits that can modify the inactivation properties of the channel as well as affect expression levels. The C-terminal is complexed to a PDZ domain protein that is responsible for channel targeting.
12482	
12483	
12484	
12485	
12486	Messenger RNA stability and translation initiation are extensively under the control of poly(A)-binding proteins (PABP). See PABPC1 (MIM 604679) for background information.[supplied by OMIM]
12487	
12488	
12489	This gene encodes a member of the tetratrico peptide repeat (TPR) family. Mutations of a similar gene in mouse can cause polycystic kidney disease. Two transcript variants encoding distinct isoforms have been identified for this gene.
12490	This gene encodes a member of the claudin family. Claudins are integral membrane proteins and components of tight junction strands. Tight junction strands serve as a physical barrier to prevent solutes and water from passing freely through the paracellular space between epithelial or endothelial cell sheets. Two alternatively spliced transcript variants that encode different isoforms have been identified for this gene.
12491	PCCA encodes the alpha subunit of the heterodimeric mitochondrial enzyme Propionyl-CoA carboxylase. PCCA encodes the biotin-binding region of this enzyme. Mutations in either PCCA or PCCB (encoding the beta subunit) lead to an enzyme deficiency result in propionic acidemia.
12492	
12493	
12494	
12495	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. The cyclin encoded by this gene was shown to be expressed in testis and brain, as well as in several leukemic cell lines, and is thought to primarily function in the control of the germline meiotic cell cycle. This cyclin binds both CDK2 and CDC2 kinases, which give two distinct kinase activities, one appearing in S phase, the other in G2, and thus regulate separate functions in cell cycle. This cyclin was found to bind to important cell cycle regulators, such as Rb family proteins, transcription factor E2F-1, and the p21 family proteins.
12496	
12497	
12498	The protein encoded by this gene is a type I transmembrane protein. It is thought to modulate bone morphogenetic protein (BMP) receptor function by serving as an accessory or coreceptor, and thus facilitates or hinders BMP binding. It is known that the mouse AMN gene is expressed in the extraembryonic visceral endoderm layer during gastrulation, but it is found to be mutated in amnionless mouse. The encoded protein has sequence similarity to short gastrulation (Sog) and procollagen IIA proteins in Drosophila.
12499	This gene encodes a member of the RecA/Rad51-related protein family that participates in homologous recombination to maintain chromosome stability and repair DNA damage. This gene functionally complements Chinese hamster irs1SF, a repair-deficient mutant that exhibits hypersensitivity to a number of different DNA-damaging agents and is chromosomally unstable. A rare microsatellite polymorphism in this gene is associated with cancer in patients of varying radiosensitivity.
12500	The protein encoded by this gene is a member of the RAD51 protein family. RAD51 family members are evolutionarily conserved proteins essential for DNA repair by homologous recombination. This protein has been shown to form a stable heterodimer with the family member RAD51C, which further interacts with the other family members, such as RAD51, XRCC2, and XRCC3. Overexpression of this gene was found to cause cell cycle G1 delay and cell apoptosis, which suggested a role of this protein in sensing DNA damage. At least three alternatively spliced transcript variants encoding distinct isoforms have been observed.
12501	
12502	
12503	
12504	Rho GTPase activating protein 5 negatively regulates RHO GTPases, a family which may mediate cytoskeleton changes by stimulating the hydrolysis of bound GTP.
12505	
12506	
12507	The protein encoded by this gene contains a SH3 domain, which is known to be important in intracellular signal transduction. The protein encoded by a similiar gene in mice was shown to be able to bind to SH3 domain of protein-tyrosine kinases. The function of this gene is unknown. Two alternatively spliced variants have been described.
12508	
12509	
12510	
12511	
12512	
12513	This gene encodes a member of the bicoid sub-family of homeodomain-containing transcription factors. The encoded protein acts as a transcription factor and may play a role in brain and sensory organ development. A similar protein in mice is required for proper forebrain development. Two transcript variants encoding distinct isoforms have been identified for this gene. Other alternative splice variants may exist, but their full length sequences have not been determined.
12514	The protein encoded by this gene belongs to the dual specificity protein phosphatase family. It was identified as a cyclin-dependent kinase inhibitor, and has been shown to interact with, and dephosphorylate CDK2 kinase, thus prevent the activation of CDK2 kinase. This gene was reported to be deleted, mutated, or overexpressed in several kinds of cancers.
12515	
12516	ADP-ribosylation factor 6 (ARF6) is a member of the human ARF gene family. These genes encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking and as activators of phospholipase D.  The gene products include 6 ARF proteins and 11 ARF-like proteins and constitute 1 family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2,and ARF3), class II (ARF4 and ARF5) and class III (ARF6). The ARF6 protein is uniquely localized to the plasma membrane.
12517	This gene encodes disintegrin and metalloprotease (ADAM) domain 20, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. The expression of this gene is testis-specific.
12518	The protein encoded by this gene catalyzes the phosphorylation of uridine monophosphate to uridine diphosphate. This is the first step in the production of the pyrimidine nucleoside triphosphates required for RNA and DNA synthesis. In addition, an allele of this gene may play a role in mediating nonhumoral immunity to Hemophilus influenzae type B.
12519	
12520	This gene encodes a protein containing a lipid recognition domain. The encoded protein may function in regulating the transport of cholesterol through the late endosomal/lysosomal system. Mutations in this gene have been associated with Niemann-Pick disease, type C2 and frontal lobe atrophy.
12521	
12522	Type II iodothyronine deiodinase activates thyroid hormone by converting the prohormone thyroxine (T4) by outer ring deiodination (ORD) to bioactive 3,3@#$%&,5-triiodothyronine (T3). Thyroid hormone appears to have important regulatory effects in some mammalian tissues, such as the developing brain, the anterior pituitary gland, and brown adipose tissue. A relatively high proportion of the receptor-bound triiodothyronine is found within the tissue itself rather than in plasma. The expression in these tissues of type II iodothyronine deiodinase, suggests that deiodinase II is responsible for this @#$%&local@#$%& production of T3 and is thus important in influencing thyroid hormone action in these tissues. In addition, activity of this protein is markedly elevated in the hypothyroid state and appears to be responsible for catalyzing the production of a large proportion of the circulating T3 under such conditions. Dioiodinase 2 may contribute significantly to the relative increase in thyroidal T3 production in patients with Graves disease, toxic adenomas, and, perhaps, iodine deficiency. Alternative splicing results in multiple transcript variants encoding different isoforms.
12523	
12524	
12525	
12526	This gene encodes a highly efficient N-acetylgalactosamine (GalNAc) kinase, which has galactokinase activity when galactose is present at high concentrations. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
12527	
12528	
12529	The protein encoded by this gene is a member of kinesin-like protein family. This family includes microtubule-dependent molecular motors that transport organelles within cells and move chromosomes during cell division. This protein has been shown to cross-bridge antiparallel microtubules and drive microtubule movement in vitro. Alternate splicing of this gene results in two transcript variants encoding two different isoforms.
12530	This gene encodes the alpha 1 adaptin subunit of the adaptor protein 2 (AP-2) complex found in clathrin coated vesicles. The AP-2 complex is a heterotetramer consisting of two large adaptins (alpha or beta), a medium adaptin (mu), and a small adaptin (sigma). The complex is part of the protein coat on the cytoplasmic face of coated vesicles which links clathrin to receptors in vesicles. Alternative splicing of this gene results in two transcript variants encoding two different isoforms. A third transcript variant has been described, but its full length nature has not been determined.
12531	
12532	Members of ADAM family are cell surface proteins with a unique structure possessing both potential adhesion and protease domains. ADAM10 possesses many of the properties of a TNF-processing enzyme
12533	
12534	
12535	
12536	
12537	Vesicle mediated protein sorting plays an important role in segregation of intracellular molecules into distinct organelles. Genetic studies in yeast have identified more than 40 vacuolar protein sorting (VPS) genes involved in vesicle transport to vacuoles. This gene is a member of the Sec-1 domain family, and encodes the human ortholog of rat Vps33b which is homologous to the yeast class C Vps33 protein. The mammalian class C Vps proteins are predominantly associated with late endosomes/lysosomes, and like their yeast counterparts, may mediate vesicle trafficking steps in the endosome/lysosome pathway.
12538	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that contains a high level of sequence similarity with ribosomal protein S11P family members. A pseudogene corresponding to this gene is found on chromosome 20. Sequence analysis identified two transcript variants that encode different protein isoforms.
12539	
12540	This gene is a member of a family of neurotrophic factors, neurotrophins, that control survival and differentiation of mammalian neurons. The expression of this gene is ubiquitous and less influenced by environmental signals. While knock-outs of other neurotrophins including nerve growth factor, brain-derived neurotrophic factor, and neurotrophin 3  prove lethal during early postnatal development, NTF5-deficient mice only show minor cellular deficits and develop normally to adulthood.
12541	
12542	
12543	
12544	
12545	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This endosomal protein regulates the cell-surface expression of epidermal growth factor receptor. This protein also has a role in sorting protease-activated receptor-1 from early endosomes to lysosomes. This protein may form oligomeric complexes with family members. This gene results in three transcript variants encoding distinct isoforms.
12546	
12547	
12548	The protein encoded by this gene is a member of the Kunitz family of serine protease inhibitors. Hepatocyte growth factor activator inhibitor is a potent inhibitor specific for HGF activator and is thought to be involved in regulation of proteolytic activation of HGF in injured tissues.
12549	
12550	
12551	The protein encoded by this gene is a T cell-derived cytokine that shares the sequence similarity with IL17. This cytokine was reported to stimulate the release of tumor necrosis factor alpha and interleukin 1 beta from a monocytic cell line. The expression of this cytokine was found to be restricted to activated T cells.
12552	Retention of resident soluble proteins in the lumen of the endoplasmic reticulum (ER) is achieved in both yeast and animal cells by their continual retrieval from the cis-Golgi, or a pre-Golgi compartment. Sorting of these proteins is dependent on a C-terminal tetrapeptide signal, usually lys-asp-glu-leu (KDEL) in animal cells, and his-asp-glu-leu (HDEL) in S. cerevisiae. This process is mediated by a receptor that recognizes, and binds the tetrapeptide-containing protein, and returns it to the ER. In yeast, the sorting receptor encoded by a single gene, ERD2, which is a seven-transmembrane protein. Unlike yeast, several human homologs of the ERD2 gene, constituting the KDEL receptor gene family, have been described. The protein encoded by this gene was the first member of the family to be identified, and it encodes a protein structurally and functionally similar to the yeast ERD2 gene product.
12553	
12554	
12555	This gene is a classical cadherin from the cadherin superfamily. The encoded protein is a calcium-dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. This gene is located in a six-cadherin cluster in a region on the long arm of chromosome 16 that is involved in loss of heterozygosity events in breast and prostate cancer. In addition, aberrant expression of this protein is observed in cervical adenocarcinomas. Mutations in this gene have been associated with congential hypotrichosis with juvenile macular dystrophy.
12556	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains 9 tandem leucine-rich repeats. Two transcript variants encoding the same protein have been found for this gene. Other variants may occur, but their full-length natures have not been characterized.
12557	Retention of resident soluble proteins in the lumen of the endoplasmic reticulum (ER) is achieved in both yeast and animal cells by their continual retrieval from the cis-Golgi, or a pre-Golgi compartment. Sorting of these proteins is dependent on a C-terminal tetrapeptide signal, usually lys-asp-glu-leu (KDEL) in animal cells, and his-asp-glu-leu (HDEL) in S. cerevisiae. This process is mediated by a receptor that recognizes, and binds the tetrapeptide-containing protein, and returns it to the ER. In yeast, the sorting receptor encoded by a single gene, ERD2, which is a seven-transmembrane protein. Unlike yeast, several human homologs of the ERD2 gene, constituting the KDEL receptor gene family, have been described. The protein encoded by this gene was the first member of the family to be identified, and it encodes a protein structurally and functionally similar to the yeast ERD2 gene product.
12558	
12559	
12560	The protein encoded by this gene is a transcription factor of the PR-domain protein family. It contains a PR-domain and multiple zinc finger motifs. Ttranscription factors of the PR-domain family are known to be involved in cell differentiation and tumorigenesis.
12561	Theta-globin mRNA is found in human fetal erythroid tissue but not in adult erythroid or other nonerythroid tissue. The theta-1 gene may be expressed very early in embryonic life, perhaps sometime before 5 weeks. Theta-1 is a member of the human alpha-globin gene cluster that involves 4 functional genes and 3 nonfunctional pseudogenes. The order of genes is: 5@#$%&-zeta-pseudozeta-pseudoalpha2-pseudoalpha1-alpha2-alph1-theta1-3@#$%&. Research supports a transcriptionally active role for the gene and a functional role for the peptide in specific cells, possibly those of early erythroid tissue.
12562	
12563	
12564	
12565	
12566	
12567	Cholestery ester transfer protein (CETP) transfers cholesteryl esters between lipoproteins.  CETP may effect susceptibility to atherosclerosis.
12568	
12569	
12570	
12571	
12572	
12573	
12574	
12575	
12576	
12577	Alternative mRNA splicing plays an important role in development and differentiation; many transcripts are spliced differently in distinct cell types and tissues. Both constitutive and alternative splicing occurs on spliceosomes, which are complex particles composed of small nuclear ribonucleoproteins (snRNPs) and non-snRNP proteins. The SR family of non-snRNP splicing factors is characterized by the presence of an RNA recognition motif and a serine- and arginine-rich (SR) domain. SR proteins are required at early stages of spliceosome assembly, have distinct but overlapping specificities for different pre-mRNAs, and can alter splice site choice, suggesting that they may be involved in the regulation of alternative splicing in vivo. Two of the SR proteins, ASF/SF2 (SFRS1) and SC35 (SFRS2; MIM 600813), have been extensively characterized (Bermingham et al., 1995).[supplied by OMIM]
12578	Alternative mRNA splicing plays an important role in development and differentiation; many transcripts are spliced differently in distinct cell types and tissues. Both constitutive and alternative splicing occurs on spliceosomes, which are complex particles composed of small nuclear ribonucleoproteins (snRNPs) and non-snRNP proteins. The SR family of non-snRNP splicing factors is characterized by the presence of an RNA recognition motif and a serine- and arginine-rich (SR) domain. SR proteins are required at early stages of spliceosome assembly, have distinct but overlapping specificities for different pre-mRNAs, and can alter splice site choice, suggesting that they may be involved in the regulation of alternative splicing in vivo. Two of the SR proteins, ASF/SF2 (SFRS1) and SC35 (SFRS2; MIM 600813), have been extensively characterized (Bermingham et al., 1995).[supplied by OMIM]
12579	
12580	The protein encoded by this gene is a member of the PP2C family of Ser/Thr protein phosphatases. PP2C family members are known to be negative regulators of cell stress response pathways. This phosphatase was identified as an interacting protein of Rho guanine nucleotide exchange factors (PIX). PIX proteins are regulators of p21/Cdc42/Rac1-activated kinase 1 (PAK1), a protein kinase mediating biological effects downstream of Rho GTPases. This phosphatase has been shown to block the effects of PAK, and thus inhibit actin stress fiber breakdown and morphological changes driven by cell division cycle 42 (CDC42).
12581	
12582	N-methyl-D-aspartate (NMDA) receptors are a class of ionotropic glutamate receptors.  NMDA channel has been shown to be involved in long-term potentiation, an activity-dependent increase in the efficiency of synaptic transmission thought to underlie certain kinds of memory and learning.  NMDA receptor channels are heteromers composed of the key receptor subunit NMDAR1 (GRIN1) and 1 or more of the 4 NMDAR2 subunits: NMDAR2A (GRIN2A), NMDAR2B (GRIN2B), NMDAR2C (GRIN2C), and NMDAR2D (GRIN2D).
12583	GPR142 is a member of the rhodopsin family of G protein-coupled receptors (GPRs) (Fredriksson et al., 2003).[supplied by OMIM]
12584	
12585	
12586	
12587	
12588	This gene encodes a protein that belongs to the NALP family of cytoplasmic proteins. A mouse gene that encodes a maternal oocyte protein, similar to this encoded protein, is required for normal early embryogenesis.
12589	
12590	This gene encodes a member of the GRB2/Sem5/Drk family. This member functions as a cytoplasmic signaling protein which contains an SH2 domain flanked by two SH3 domains. The SH2 domain interacts with ligand-activated receptors for stem cell factor and erythropoietin, and facilitates the formation of a stable complex with the BCR-ABL oncoprotein. This protein also associates with the Ras guanine nucleotide exchange factor SOS1 (son of sevenless homolog 1) through its N-terminal SH3 domain.
12591	This gene encodes a member of the GRB2/Sem5/Drk family. This member functions as a cytoplasmic signaling protein which contains an SH2 domain flanked by two SH3 domains. The SH2 domain interacts with ligand-activated receptors for stem cell factor and erythropoietin, and facilitates the formation of a stable complex with the BCR-ABL oncoprotein. This protein also associates with the Ras guanine nucleotide exchange factor SOS1 (son of sevenless homolog 1) through its N-terminal SH3 domain.
12592	The enzyme phosphoribosylpyrophosphate synthetase (PRS) catalyzes the formation of phosphoribosylpyrophosphate which is a substrate for synthesis of purine and pyrimidine nucleotides, histidine, tryptophan and NAD. PRS exists as a complex with two catalytic subunits and two associated subunits. This gene encodes a non-catalytic associated subunit of PRS.
12593	
12594	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein-coupled receptors. This gene encodes a protein that functions as a specific guanine nucleotide exchange factor for RhoA. It also interacts with ephrin A4 in vascular smooth muscle cells.
12595	
12596	Granulins are a family of secreted, glycosylated peptides that are cleaved from a single precursor protein with 7.5 repeats of a highly conserved 12-cysteine granulin/epithelin motif. The 88 kDa precursor protein, progranulin, is also called proepithelin and PC cell-derived growth factor. Cleavage of the signal peptide produces mature granulin which can be further cleaved into a variety of active, 6 kDa peptides. These smaller cleavage products are named granulin A, granulin B, granulin C, etc. Epithelins 1 and 2 are synonymous with granulins A and B, respectively. Both the peptides and intact granulin protein regulate cell growth. However, different members of the granulin protein family may act as inhibitors, stimulators, or have dual actions on cell growth. Granulin family members are important in normal development, wound healing, and tumorigenesis. Alternative splicing results in multiple transcript variants encoding different isoforms.
12597	
12598	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Sequence analysis identified three alternatively spliced variants that encode distinct isoforms.
12599	
12600	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, which are located on different chromosomes and consist of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXB genes located in a cluster on chromosome 17.
12601	
12602	This gene is specifically expressed in the photoreceptors in the retina. The encoded product shares strong homology with the C. elegans unc119 protein and it can functionally complement the C. elegans unc119 mutation. It has been localized to the photoreceptor synapses in the outer plexiform layer of the retina, and suggested to play a role in the mechanism of photoreceptor neurotransmitter release through the synaptic vesicle cycle. Two transcript variants encoding different isoforms have been described for this gene.
12603	
12604	This gene product belongs to a family of nonselective cation channels that function in a variety of processes, including temperature sensation and vasoregulation. The thermosensitive members of this family are expressed in subsets of sensory neurons that terminate in the skin, and are activated at distinct physiological temperatures. This channel is activated at temperatures between 22 and 40 degrees C. This gene lies in close proximity to another family member (TRPV1) gene on chromosome 17, and the two encoded proteins are thought to associate with each other to form heteromeric channels.
12605	
12606	
12607	This gene is a member of the beta-1,3-galactosyltransferase (beta3GalT) gene family. This family encodes type II membrane-bound glycoproteins with diverse enzymatic functions using different donor substrates (UDP-galactose and UDP-N-acetylglucosamine) and different acceptor sugars (N-acetylglucosamine, galactose, N-acetylgalactosamine). The beta3GalT genes are distantly related to the Drosophila Brainiac gene and have the protein coding sequence contained in a single exon. The beta3GalT proteins also contain conserved sequences not found in the beta4GalT or alpha3GalT proteins. The carbohydrate chains synthesized by these enzymes are designated as type 1, whereas beta4GalT enzymes synthesize type 2 carbohydrate chains. The ratio of type 1:type 2 chains changes during embryogenesis. By sequence similarity, the beta3GalT genes fall into at least two groups: beta3GalT4 and 4 other beta3GalT genes (beta3GalT1-3, beta3GalT5). This gene encodes a protein that functions in N-linked glycoprotein glycosylation and shows strict donor substrate specificity for UDP-galactose.
12608	
12609	
12610	The protein encoded by this gene is highly similar to Saccharomyces cerevisiae Cdc6, a protein essential for the initiation of DNA replication. This protein functions as a regulator at the early steps of DNA replication. It localizes in cell nucleus during cell cyle G1, but translocates to the cytoplasm at the start of S phase. The subcellular translocation of this protein during cell cyle is regulated through its phosphorylation by Cdks. Transcription of this protein was reported to be regulated in response to mitogenic signals through transcriptional control mechanism involving E2F proteins.
12611	
12612	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
12613	
12614	The protein encoded by this gene is a precursor to the cornified envelope of terminally differentiated keratinocytes. This protein localizes to the periphery of cells and may function in the assembly or regulation of proteins in the cornified envelope. Transcript variants encoding different isoforms exist. A transcript variant utilizing an alternative polyA signal has been described in the literature, but its full-length nature has not been determined.
12615	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. Alternate splicing of this gene results in three transcript variants encoding different isoforms.
12616	
12617	This gene encodes a protein of unknown function. Alternatively spliced transcript variants have been identified, but their biological validity has not been determined.
12618	The protein encoded by this gene is a member of the regulator of G protein signaling (RGS) family. RGS family members share similarity with S. cerevisiae SST2 and C. elegans egl-10 proteins, which contain a characteristic conserved RGS domain. RGS proteins accelerate GTPase activity of G protein alpha-subunits, thereby driving G protein into their inactive GDP-bound form, thus negatively regulating G protein signaling. RGS proteins have been implicated in the fine tuning of a variety of cellular events in response to G protein-coupled receptor activation. The biological function of this gene, however, is unknown. Two transcript variants encoding the same isoform exist.
12619	This gene encodes a protein containing four ankyrin repeat domains in its C-terminus. The suggested role for this protein is in tumorigenesis of renal cell carcinoma. Two alternatively spliced transcript variants encoding different isoforms have been identified.
12620	
12621	The protein encoded by this gene is one of eight proteins (Cog1-8) which form a Golgi-localized complex (COG) required for normal Golgi morphology and function. It is thought that this protein is required for steps in the normal medial and trans Golgi-associated processing of glycoconjugates and plays a role in the organization of the Golgi-localized complex.
12622	
12623	
12624	This gene encodes a protein belonging to the 14-3-3 family of proteins, members of which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals. The encoded protein has been shown to interact with RAF1 and CDC25 phosphatases, suggesting that it may play a role in linking mitogenic signaling and the cell cycle machinery. Two transcript variants, which encode the same protein, have been identified for this gene.
12625	This gene encodes a member of the interferon regulatory factor (IRF) family, a group of transcription factors with diverse roles, including virus-mediated activation of interferon, and modulation of cell growth, differentiation, apoptosis, and immune system activity. Members of the IRF family are characterized by a conserved N-terminal DNA-binding domain containing tryptophan (W) repeats. Alternative splice variants encoding different isoforms exist.
12626	This gene encodes a protein with a putative role in the localization of trans-Golgi network (TGN) membrane proteins. Mouse and rat homologs have been identified and studies of the homologous rat protein indicate a role in directing TGN localization of furin by binding to the protease@#$%&s phosphorylated cytosolic domain. In addition, the human protein plays a role in HIV-1 Nef-mediated downregulation of cell surface MHC-I molecules to the TGN, thereby enabling HIV-1 to escape immune surveillance.
12627	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to the cytoplasm. Its function has not been identified. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
12628	
12629	
12630	Action potentials in vertebrate neurons are followed by an afterhyperpolarization (AHP) that may persist for several seconds and may have profound consequences for the firing pattern of the neuron. Each component of the AHP is kinetically distinct and is mediated by different calcium-activated potassium channels. The protein encoded by this gene is activated before membrane hyperpolarization and is thought to regulate neuronal excitability by contributing to the slow component of synaptic AHP. The encoded protein is an integral membrane protein that forms a voltage-independent calcium-activated channel with three other calmodulin-binding subunits. This gene is a member of the KCNN family of potassium channel genes. Two transcript variants encoding different isoforms have been found for this gene.
12631	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
12632	The neurotransmitter, serotonin, is thought to play a role in various cognitive and behavioral functions. The serotonin receptor encoded by this gene belongs to the superfamily of G protein-coupled receptors and the gene is a candidate locus for involvement in autistic disorder and other neuropsychiatric disorders. Three splice variants have been identified which encode proteins that differ in the length of their carboxy terminal ends.
12633	
12634	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, and White). This encoded protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. This gene is clustered among 4 other ABC1 family members on 17q24 and may play a role in macrophage lipid homeostasis. Alternative splicing of this gene results in two transcript variants.
12635	
12636	
12637	The protein encoded by this gene is a member of the CARD protein family, which is defined by the presence of a characteristic caspase-associated recruitment domain (CARD). CARD is a protein interaction domain known to participate in activation or suppression of CARD containing members of the caspase family, and thus plays an important regulatory role in cell apoptosis. This protein was identified by its selective association with the CARD domain of BCL10, a postive regulator of apoptosis and NF-kappaB activation, and is thought to function as a molecular scaffold for the assembly of a BCL10 signaling complex that activates NF-kappaB. Several alternatively spliced transcript variants have been observed, but their full-length natures are not clearly defined.
12638	
12639	The protein encoded by this gene is a member of the fibroblast growth factor receptor family, where amino acid sequence is highly conserved between members and throughout evolution. FGFR family members differ from one another in their ligand affinities and tissue distribution. A full-length representative protein would consist of an extracellular region, composed of three immunoglobulin-like domains, a single hydrophobic membrane-spanning segment and a cytoplasmic tyrosine kinase domain. The extracellular portion of the protein interacts with fibroblast growth factors, setting in motion a cascade of downstream signals, ultimately influencing mitogenesis and differentiation. This particular family member binds acidic and basic fibroblast growth hormone and plays a role in bone development and maintenance. Mutations in this gene lead to craniosynostosis and multiple types of skeletal dysplasia. Alternative splicing occurs and additional variants have been described, including those utilizing alternate exon 8 rather than 9, but their full-length nature has not been determined.
12640	
12641	This gene encodes a member of the bicoid sub-family of homeodomain-containing transcription factors. The encoded protein acts as a transcription factor and may play a role in brain and sensory organ development. Two transcript variants encoding distinct isoforms have been identified for this gene.
12642	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies. Its function has not been identified. Alternative splicing of this gene generates two transcript variants.
12643	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptor is expressed by activated, but not by resting, T and B cells. TRAF2 and TRAF5 can interact with this receptor, and mediate the signal transduction that leads to the activation of NF-kappaB. This receptor is a positive regulator of apoptosis, and also has been shown to limit the proliferative potential of autoreactive CD8 effector T cells and protect the body against autoimmunity. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
12644	
12645	The protein encoded by this gene appears to be a zinc finger protein containing a transposase domain at the C-terminus. This protein was found to interact with the transcription factor SP1 in a yeast two-hybrid system. At least three alternatively spliced transcript variants encoding distinct isoforms have been observed.
12646	The t(16;21)(q24;q22) translocation is a rare but recurrent chromosomal abnormality associated with therapy-related myeloid malignancies. The translocation produces a chimeric gene made up of the 5@#$%&-region of the AML1 gene fused to the 3@#$%&-region of this gene. In addition, this gene is a putative breast tumor suppressor. Two transcript variants encoding different isoforms have been found for this gene, and a brefeldin A-sensitive association of RII-alpha protein with one of the isoforms has been demonstrated in the Golgi apparatus.
12647	
12648	
12649	
12650	
12651	
12652	
12653	
12654	
12655	
12656	
12657	This gene encodes an enzyme that is a member of the beta-1,3-N-acetylglucosaminyltransferase family. This enzyme is a type II transmembrane protein. It prefers the substrate of lacto-N-neotetraose, and is involved in the biosynthesis of poly-N-acetyllactosamine chains. Two alternatively splicing transcript variants are identified from this gene and encode the same protein product.
12658	Thymidylate synthase (TYMS, or TS; EC 2.1.1.45) uses the 5,10-methylenetetrahydrofolate (methylene-THF) as a cofactor to maintain the dTMP (thymidine-5-prime monophosphate) pool critical for DNA replication and repair. The enzyme has been of interest as a target for cancer chemotherapeutic agents. It is considered to be the primary site of action for 5-fluorouracil, 5-fluoro-2-prime-deoxyuridine, and some folate analogs.[supplied by OMIM]
12659	This gene is a member of the paired box (PAX) family of transcription factors.  Members of this gene family typically contain a paired box domain, an octapeptide, and a paired-type homeodomain.  These genes play critical roles during fetal development and cancer growth.  The specific function of the paired box gene 7 is unknown but speculated to involve tumor suppression since fusion of this gene with a forkhead domain family member has been associated with alveolar rhabdomyosarcoma.  Alternative splicing in this gene has produced two known products but the biological significance of the variants is unknown.
12660	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. This gene is highly similar to the abdominal-B (Abd-B) gene of Drosophila. A specific translocation event which causes a fusion between this gene and the NUP98 gene has been associated with myeloid leukemogenesis. Two transcript variants encoding different isoforms have been found for this gene.
12661	This gene spans approximately 18 kb of genomic DNA and consists of 12 exons. Two transcripts with distinct 5@#$%& UTR have been described; the resulting proteins have distinct N-terminal amino acid sequences.  Translation initiation from internal methionine residues was observed with in vitro translation. A signal peptide sequence is predicted for translation initiation sites 1, 2, and 4.   The protein isoforms contain 5 or 6 calcium-binding EGF2 domains and 5 or 6 EGF2 domains. Mutations in this gene cause the retinal disease Malattia Leventinese.
12662	This gene encodes the alpha chain of type XVII collagen. Unlike most collagens, collagen XVII is a transmembrane protein. Collagen XVII is a structural component of hemidesmosomes, multiprotein complexes at the dermal-epidermal basement membrane zone that mediate adhesion of keratinocytes to the underlying membrane. Mutations in this gene are associated with both generalized atrophic benign and junctional epidermolysis bullosa. Two homotrimeric forms of type XVII collagen exist. The full length form is the transmembrane protein. A soluble form, referred to as either ectodomain or LAD-1, is generated by proteolytic processing of the full length form. Two transcript variants, one resulting from alternative splicing in the 3@#$%& UTR, have been identified for this gene.
12663	HEC is one of several proteins involved in spindle checkpoint signaling. This surveillance mechanism assures correct segregation of chromosomes during cell division by detecting unaligned chromosomes and causing prometaphase arrest until the proper bipolar attachment of chromosomes is achieved.[supplied by OMIM]
12664	
12665	Eukaryotes, in contrast to prokaryotes, contain more than one DNA ligase, and these enzymes have distinct roles in DNA metabolism.  Five DNA ligase activities, I-V, have been purified from mammalian cell extracts.  Ligase III is more closely related to the DNA ligase encoded by pox viruses, than to replicative DNA ligases, such as mammalian DNA ligase 1, and thus may be involved in DNA repair and recombination.  Two biochemically distinct isoforms of ligase III, alpha and beta, encoding polypeptides with different C-terminal amino acids have been identified, and result from alternative splicing of precursor mRNA.
12666	
12667	Potassium channels are present in most mammalian cells, where they participate in a wide range of physiologic responses. The protein encoded by this gene is an integral membrane protein and inward-rectifier type potassium channel. The encoded protein, which has a greater tendency to allow potassium to flow into a cell rather than out of a cell, can form heterodimers with two other inward-rectifier type potassium channels. It may be involved in the regulation of fluid and pH balance. Three transcript variants encoding the same protein have been found for this gene.
12668	
12669	
12670	
12671	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation. Multiple alternatively spliced transcript variants have been described for this gene but some of the full length sequences are not known.
12672	Phosphatidylinositol breakdown products are ubiquitous second messengers that function downstream of many G protein-coupled receptors and tyrosine kinases regulating cell growth, calcium metabolism, and protein kinase C activity. Two enzymes, CDP-diacylglycerol synthase and phosphatidylinositol synthase, are involved in the biosynthesis of phosphatidylinositol. Phosphatidylinositol synthase, a member of the CDP-alcohol phosphatidyl transferase class-I family, is an integral membrane protein found on the cytoplasmic side of the endoplasmic reticulum and the Golgi apparatus. Alternative splicing of this gene results in multiple transcript variants and protein isoforms.
12673	
12674	This gene encodes a protein of unknown function. The gene is located within the Smith-Magenis syndrome region on chromosome 17. Alternative splicing of this gene results in two transcript variants encoding the same protein.
12675	CSRP2 is a protein containing two LIM domains, which are double zinc finger motifs found in proteins of diverse function. CSRP2 and some related proteins are thought to act as protein adapters, bridging two or more proteins to form a larger protein complex. The protein encoded by this gene binds to one of the LIM domains of CSRP2 and contains an acetyltransferase domain. Although the encoded protein has been detected in the cytoplasm, it is predominantly a nuclear protein. Two transcript variants encoding different isoforms have been found for this gene.
12676	Sulfotransferase enzymes catalyze the sulfate conjugation of many hormones, neurotransmitters, drugs, and xenobiotic compounds. These cytosolic enzymes are different in their tissue distributions and substrate specificities. The gene structure (number and length of exons) is similar among family members. This gene encodes a phenol sulfotransferase with thermolabile enzyme activity. Four sulfotransferase genes are located on the p arm of chromosome 16, this gene and SULT1A3 arose from a segmental duplication. Exons of this gene overlap with exons of a gene that encodes a protein containing GIY-YIG domains (GIYD2). Three alternatively spliced variants that encode the same protein have been described.
12677	
12678	
12679	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene contains a RNA binding domain and scaffold-associated region (SAR)-specific bipartite DNA-binding domain. This protein is also thought to be involved in the packaging of hnRNA into large ribonucleoprotein complexes. During apoptosis, this protein is cleaved in a caspase-dependent way. Cleavage occurs at the SALD site, resulting in a loss of DNA-binding activity and a concomitant detachment of this protein from nuclear structural sites. But this cleavage does not affect the function of the encoded protein in RNA metabolism. Two alternatively spliced transcript variants have been described for this gene.
12680	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. Methylation of this gene may result in the loss of its expression and, since the encoded protein upregulates the tumor suppressor p53, this protein may play an important role in tumorigenesis.
12681	This gene encodes adenylate cyclase 6, which is a membrane-associated enzyme and catalyzes the formation of the secondary messenger cyclic adenosine monophosphate (cAMP). The expression of this gene is found in normal thyroid and brain tissues, as well as some tumors; and its expression is significantly higher in one hyperfunctioning thyroid tumor than in normal thyroid tissue. Alternative splicing generates 2 transcript variants.
12682	The protein encoded by this gene is a member of the AAA protein family (ATPases associated with diverse cellular activities), and is the homolog of the yeast Vps4 protein. In humans, two paralogs of the yeast protein have been identified. The former share a high degree of aa sequence similarity with each other, and also with yeast Vps4 and mouse Skd1 proteins. The mouse Skd1 (suppressor of K+ transport defect 1) has been shown to be really an yeast Vps4 ortholog. Functional studies indicate that both human paralogs associate with the endosomal compartments, and are involved in intracellular protein trafficking, similar to Vps4 protein in yeast. The gene encoding this paralog has been mapped to chromosome 18; the gene for the other resides on chromosome 16. Usage of multiple polyadenylation sites have been noted for this gene.
12683	The protein encoded by this gene is one subunit of a trimeric complex, forming a highly conserved transcription factor that binds to CCAAT motifs in the promoter regions in a variety of genes. Subunit A associates with a tight dimer composed of the B and C subunits, resulting in a trimer that binds to DNA with high specificity and affinity. The sequence specific interactions of the complex are made by the A subunit, suggesting a role as the regulatory subunit. In addition, there is evidence of post-transcriptional regulation in this gene product, either by protein degradation or control of translation. Further regulation is represented by alternative splicing in the glutamine-rich activation domain, with clear tissue-specific preferences for the two isoforms.
12684	UGT2B17 belongs to the family of UDP-glucuronosyltransferases (UGTs; EC 2.4.1.17), enzymes that catalyze the transfer of glucuronic acid from uridine diphosphoglucuronic acid to a variety of substrates, including steroid hormones.[supplied by OMIM]
12685	
12686	Zinc is an essential cofactor for hundreds of enzymes. It is involved in protein, nucleic acid, carbohydrate, and lipid metabolism, as well as in the control of gene transcription, growth, development, and differentiation. SLC39A6 belongs to a subfamily of proteins that show structural characteristics of zinc transporters (Taylor and Nicholson, 2003).[supplied by OMIM]
12687	The protein encoded by this gene is a member of the interleukin 1 cytokine family. This gene and eight other interleukin 1 family genes form a cytokine gene cluster on chromosome 2. This cytokine is thought to participate in a network of interleukin 1 family members to regulate adapted and innate immune responses. Two alternatively spliced transcript variants encoding the same protein have been reported.
12688	This gene is one of several tumor-suppressing subtransferable fragments located in the imprinted gene domain of 11p15.5, an important tumor-suppressor gene region. Alterations in this region have been associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocortical carcinoma, and lung, ovarian, and breast cancer. This gene is located among several imprinted genes; however, this gene, as well as the tumor-suppressing subchromosomal transferable fragment 4 (TSSC4), escapes imprinting. This gene may play a role in malignancies and disease that involve this region as well as hematopoietic cell function. This gene is described as a member of the tetraspanin superfamily. Alternative splicing of this gene results in several transcript variants; however, not all variants have been fully described.
12689	
12690	This gene encodes a protein involved in import of peroxisomal matrix proteins. This protein localizes to the peroxisomal membrane. Mutations in this gene result in phenotypes within the Zellweger spectrum of peroxisomal biogenesis disorders, ranging from neonatal adrenoleukodystrophy to Zellweger syndrome. Alternative splicing results in two transcript variants encoding different isoforms.
12691	
12692	The protein encoded by this gene is a cytokine that controls the production, differentiation, and function of granulocytes. The active protein is found extracellularly. Three transcript variants encoding three different isoforms have been found for this gene.
12693	This gene is located in the surfeit gene cluster, a group of very tightly linked housekeeping genes that do not share sequence similarity. The gene is oriented in a head-to-head fashion with RPL7A (SURF3) and the two genes share a bidirectional promoter. The encoded proteins are localized to the cytoplasm. Multiple alternative transcript variants encoding multiple isoforms have been identified for this gene.
12694	This gene is located in the surfeit gene cluster, a group of very tightly linked housekeeping genes that do not share sequence similarity. The gene is oriented in a head-to-head fashion with RPL7A (SURF3) and the two genes share a bidirectional promoter. The encoded proteins are localized to the cytoplasm. Multiple alternative transcript variants encoding multiple isoforms have been identified for this gene.
12695	
12696	
12697	This gene encodes a major cytoplasmic protein which is the only known constituent common to submembranous plaques of both desmosomes and intermediate junctions. This protein forms distinct complexes with cadherins and desmosomal cadherins and is a member of the catenin family since it contains a distinct repeating amino acid motif called the armadillo repeat. Mutation in this gene has been associated with Naxos disease. Alternative splicing occurs in this gene; however, not all transcripts have been fully described.
12698	
12699	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L13E family of ribosomal proteins. It is located in the cytoplasm. This gene is expressed at significantly higher levels in benign breast lesions than in breast carcinomas. Transcript variants derived from alternative splicing and/or alternative polyadenylation exist; these variants encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
12700	
12701	The unusual amino acid hypusine is formed posttranslationally and is only found in a single cellular protein, eukaryotic translation initiation factor 5A. In the first step of hypusine biosynthesis, deoxyhypusine synthase catalyzes the NAD-dependent transfer of the butylamine moiety of spermidine to the epsilon-amino group of a specific lysine residue of the EIF5A precursor protein to form the intermediate deoxyhypusine residue. This gene consists of nine exons spanning 6.6 kb. Three transcript variants have been isolated. However, only transcript variant 1 encodes an active protein. The shorter variants may act as modulating factors of DHPS activity.
12702	L-type calcium channels are composed of five subunits. The protein encoded by this gene represents one of these subunits, gamma, and is one of several gamma subunit proteins. It is an integral membrane protein that is thought to stabilize the calcium channel in an inactive (closed) state. This gene is a member of the neuronal calcium channel gamma subunit gene subfamily of the PMP-22/EMP/MP20 family and is located in a cluster with two similar gamma subunit-encoding genes. Three transcript variants encoding different isoforms have been found for this gene.
12703	
12704	This gene encodes the homolog of a protein identified in Drosophila as a suppressor of malignant transformation of neuroblasts and ganglion-mother cells in the optic centers of the brain. This gene product is localized to condensed chromosomes in mitotic cells. Overexpression of this gene in a glioma cell line results in improper nuclear segregation and cytokinesis producing multinucleated cells. Two transcripts have been identified for this gene.
12705	
12706	Galactose-1-phosphate uridyl transferase (GALT) catalyzes the second step of the Leloir pathway of galactose metabolism, namely the conversion of UDP-glucose + galactose-1-phosphate to glucose-1-phosphate + UDP-galactose. The absence of this enzyme results in classic galactosemia in humans and can be fatal in the newborn period if lactose is not removed from the diet. The pathophysiology of galactosemia has not been clearly defined. Alternative splicing has been observed at this locus and three transcript variants, each encoding a distinct isoform, have been identified.
12707	This gene encodes a secreted protein that dorsalizes early vertebrate embryonic tissues by binding to ventralizing TGF-beta-like bone morphogenetic proteins and sequestering them in latent complexes. The encoded protein may also have roles in organogenesis and during adulthood. Multiple transcript variants encoding distinct isoforms have been identified for this gene. Other alternative splice variants have been described but their full length sequence has not been determined.
12708	Multiprotein complexes are key determinants of Golgi apparatus structure and its capacity for intracellular transport and glycoprotein modification. Several complexes have been identified, including the Golgi transport complex (GTC), the LDLC complex, which is involved in glycosylation reactions, and the SEC34 complex, which is involved in vesicular transport. These 3 complexes are identical and have been termed the conserved oligomeric Golgi (COG) complex, which includes COG8 (Ungar et al., 2002).[supplied by OMIM]
12709	
12710	The protein encoded by this gene is closely related to MAPK8IP1/IB1/JIP-1, a scaffold protein that is involved in the c-Jun amino-terminal kinase signaling pathway. This protein is expressed in brain and pancreatic cells. It has been shown to interact with, and regulate the activity of MAPK8/JNK1, and MAP2K7/MKK7 kinases. This protein thus is thought to function as a regulator of signal transduction by protein kinase cascade in brain and pancreatic beta-cells. Three alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
12711	This gene is a member of the MAGEC gene family. The members of this family are not expressed in normal tissues, except for testis, and are expressed in tumors of various histological types. The MAGEC genes are clustered on chromosome Xq26-q27. Two transcript variants encoding distinct isoforms have been found for this gene.
12712	This gene is a member of the protein-tyrosine kinase oncogene family. It encodes a membrane-associated tyrosine kinase that has been implicated in the control of cell growth. The protein associates with the p85 subunit of phosphatidylinositol 3-kinase and interacts with the fyn-binding protein. Alternatively spliced transcript variants encoding distinct isoforms exist.
12713	This gene encodes disintegrin and metalloprotease (ADAM) domain 11, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving  cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. This gene represents a candidate tumor supressor gene for human breast cancer based on its location within a minimal region of chromosome 17q21 previously defined by tumor deletion mapping. This gene consists of 28 exons. Alternative splicing generates 2 transcripts; the short version is less abundant than long version in human tissues.
12714	
12715	
12716	
12717	Two forms of NADH-cytochrome b5 reductase are known, a membrane-bound form in somatic cells (anchored in the endoplasmic reticulum, mitochondrial and other membranes) and a soluble form in erythrocytes. The former exists mainly on the cytoplasmic side of the endoplasmic reticulum and functions in desaturation and elongation of fatty acids, in cholesterol biosynthesis, and in drug metabolism. The erythrocyte form is located in a soluble fraction of circulating erythrocytes and is involved in methemoglobin reduction. Membrane-bound enzyme consists of 300 amino acid residues having both membrane-binding and catalytic domains. The soluble type consists of 275 amino acids having only the catalytic domain. The 2 forms of the enzyme are coded by the same gene, DIA1. The two forms are thought to be generated from tissue-specific alternative transcripts differing in the first exon. Deficiency of DIA1 has been associated with methemoglobinemias.
12718	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
12719	The protein encoded by this gene is a member of the MAP kinase family. MAP kinases act as an integration point for multiple biochemical signals, and are involved in a wide variety of cellular processes such as proliferation, differentiation, transcription regulation and development. This kinase is most closely related to p38 MAP kinase, both of which can be activated by proinflammatory cytokines and environmental stress. This kinase is activated through its phosphorylation by MAP kinase kinases (MKKs), preferably by MKK6. Transcription factor ATF2/CREB2 has been shown to be a substrate of this kinase. Alternatively spliced transcript variants encoding the same protein have been observed.
12720	This gene is a founding member of the EGF-TM7 family of class II seven-span transmembrane (7-TM) molecules, likely encoded by a gene cluster on the short arm of chromosome 19. The encoded product is a glycoprotein that is present on the surface of most activated leukocytes and spans the membrane seven times, which is a defining feature of G protein-coupled receptors. The protein has an extended extracellular region with several N-terminal epidermal growth factor (EGF)-like domains, which mediate binding to its cellular ligand, decay accelerating factor (DAF, CD55), a regulatory protein of the complement cascade. The presence of structural features characteristic of extracellular matrix proteins and transmembrane proteins suggests that this protein is a receptor involved in both cell adhesion and signaling processes early after leukocyte activation. Alternative splicing has been observed for this gene and two variants have been described.
12721	This gene is a founding member of the EGF-TM7 family of class II seven-span transmembrane (7-TM) molecules, likely encoded by a gene cluster on the short arm of chromosome 19. The encoded product is a glycoprotein that is present on the surface of most activated leukocytes and spans the membrane seven times, which is a defining feature of G protein-coupled receptors. The protein has an extended extracellular region with several N-terminal epidermal growth factor (EGF)-like domains, which mediate binding to its cellular ligand, decay accelerating factor (DAF, CD55), a regulatory protein of the complement cascade. The presence of structural features characteristic of extracellular matrix proteins and transmembrane proteins suggests that this protein is a receptor involved in both cell adhesion and signaling processes early after leukocyte activation. Alternative splicing has been observed for this gene and two variants have been described.
12722	This gene encodes a member of the Aurora subfamily of serine/threonine protein kinases. The encoded protein is a chromosomal passenger protein that forms complexes with Aurora-B and inner centromere proteins and may play a role in organizing microtubules in relation to centrosome/spindle function during mitosis. This gene is overexpressed in several cancer cell lines, suggesting an involvement in oncogenic signal transduction. Alternative splicing results in multiple transcript variants.
12723	Agouti-related protein is an antagonist of the melanocortin-3 and melanocortin-4 receptor. It appears to regulate hypothalamic control of feeding behavior via melanocortin receptor and/or intracellular calcium regulation. Agouti-related protein is alternatively spliced into 2 variants which differ in 5@#$%& untranslated sequence length.
12724	
12725	This gene encodes a disintegrin and metalloprotease (ADAM) domain 33, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. This protein is a type I transmembrane protein implicated in asthma and bronchial hyperresponsiveness. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
12726	
12727	
12728	This gene encodes a protein which contains a zinc finger-like PHD (plant homeodomain) finger, distinct from other classes of zinc finger motifs, and a hydrophobic and highly conserved domain. The PHD finger shows the typical Cys4-His-Cys3 arrangement. PHD finger genes are thought to belong to a diverse group of transcriptional regulators possibly affecting eukaryotic gene expression by influencing chromatin structure.
12729	
12730	
12731	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. This gene is immediately adjacent to the gene for BAF complex 53 kDa subunit protein a (BAF53a), in a tail-to-tail orientation. Two transcript variants encoding different protein isoforms have been identified.
12732	Nitric oxide is a reactive free radical which acts as a biologic mediator in several processes, including neurotransmission and antimicrobial and antitumoral activities. This gene encodes a nitric oxide synthase which is expressed in liver and is inducible by a combination of lipopolysaccharide and certain cytokines. Three related pseudogenes are located within the Smith-Magenis syndrome region on chromosome 17. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
12733	
12734	Autophagy is the process by which endogenous proteins and damaged organelles are destroyed intracellularly. Autophagy is postulated to be essential for cell homeostasis and cell remodeling during differentiation, metamorphosis, non-apoptotic cell death, and aging. Reduced levels of autophagy have been described in some malignant tumors, and a role for autophagy in controlling the unregulated cell growth linked to cancer has been proposed. This gene encodes a member of the autophagin protein family. The encoded protein is also designated as a member of the C-54 family of cysteine proteases. Two transcript variants that encode different isoforms were identified through sequence analysis.
12735	The protein encoded by this gene is a cytoplasmic enzyme involved in energy homeostasis and is an important serum marker for myocardial infarction. The encoded protein reversibly catalyzes the transfer of phosphate between ATP and various phosphogens such as creatine phosphate. It acts as a homodimer in striated muscle as well as in other tissues, and as a heterodimer with a similar brain isozyme in heart. The encoded protein is a member of the ATP:guanido phosphotransferase protein family.
12736	The nuclear pore complex is a massive structure that extends across the nuclear envelope, forming a gateway that regulates the flow of macromolecules between the nucleus and the cytoplasm. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells. The protein encoded by this gene is a member of the FG-repeat containing nucleoporins and is localized to the nuclear pore central plug. This protein associates with the importin alpha/beta complex which is involved in the import of proteins containing nuclear localization signals. Multiple transcript variants of this gene encode a single protein isoform.
12737	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The function of this protein has not been identified. Alternate splicing results in two transcript variants encoding different isoforms.
12738	
12739	This gene is involved in the first step in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is a glycolipid found on many blood cells and serves to anchor proteins to the cell surface. This gene encodes a N-acetylglucosaminyl transferase component that is part of the complex that catalyzes transfer of N-acetylglucosamine (GlcNAc) from UDP-GlcNAc to phosphatidylinositol (PI).
12740	
12741	
12742	The protein encoded by this gene is a member of the SEC22 family of vesicle trafficking proteins. It is localized at the endoplasmic reticulum and it is thought to play a role in the early stages of the ER-Golgi protein trafficking. There are two alternatively spliced transcript variants encoding different isoforms described for this gene.
12743	This gene encodes the cellular form of serine hydroxymethyltransferase, a pyridoxal phosphate-containing enzyme that catalyzes the reversible conversion of serine and tetrahydrofolate to glycine and 5,10-methylene tetrahydrofolate. This reaction provides one carbon units for synthesis of methionine, thymidylate, and purines in the cytoplasm. This gene is located within the Smith-Magenis syndrome region on chromosome 17. Alternative splicing of this gene results in 2 transcript variants encoding 2 different isoforms. Additional transcript variants have been described, but their biological validity has not been determined.
12744	The protein encoded by this gene contains a structural module related to that of the seven transmembrane domain G protein-coupled receptor superfamily. This protein has sequence and structural similarities to the beta-amyloid binding protein (BBP), but, unlike BBP, it does not regulate a response to beta-amyloid peptide. This protein may have regulatory roles in cell death or proliferation signal cascades. This gene has multiple alternatively spliced transcript variants which encode two different isoforms.
12745	
12746	This gene is a member of the paired box (PAX) family of transcription factors. Members of the PAX family typically contain a paired box domain and a paired-type homeodomain. These genes play critical roles during fetal development. Mutations in paired box gene 3 are associated with Waardenburg syndrome, craniofacial-deafness-hand syndrome, and alveolar rhabdomyosarcoma. The translocation t(2;13)(q35;q14), which represents a fusion between PAX3 and the forkhead gene, is a frequent finding in alveolar rhabdomyosarcoma. Alternative splicing results in transcripts encoding isoforms with different C-termini.
12747	The protein encoded by this gene, a member of the GAS2 family, is similar in sequence to the mouse protein Gas2, an actin-associated protein expressed at high levels in growth-arrested cells. Expression of the mouse Gas2 gene is negatively regulated by serum and growth factors. Three transcript variants encoding two different isoforms have been found for this gene.
12748	
12749	
12750	
12751	This gene encodes a member of the POP family of proteins containing three putative transmembrane domains. This gene is expressed in cardiac and skeletal muscle and may play an important role in development of these tissues. The mouse ortholog may be involved in the regeneration of adult skeletal muscle and may act as a cell adhesion molecule in coronary vasculogenesis. Two transcript variants encoding the same protein have been found for this gene.
12752	The FYVE domain mediates the recruitment of proteins involved in membrane trafficking and cell signaling to phosphatidylinositol 3-phosphate (PtdIns(3)P)-containing membranes. This gene encodes a protein which contains a single FYVE domain and multiple WD40 repeats. This protein is localized to endosomes.
12753	
12754	
12755	The integral membrane protein encoded by this gene is a lysophosphatidic acid (LPA) receptor from a group known as EDG receptors. These receptors are members of the G protein-coupled receptor superfamily. Utilized by LPA for cell signaling, EDG receptors mediate diverse biologic functions, including proliferation, platelet aggregation, smooth muscle contraction, inhibition of neuroblastoma cell differentiation, chemotaxis, and tumor cell invasion. Alternative splicing of this gene has been observed and two transcript variants have been described, each encoding identical proteins. An alternate translation start codon has been identified, which results in isoforms differing in the N-terminal extracellular tail. In addition, an alternate polyadenylation site has been reported.
12756	In acute myeloid leukemia, especially in the M2 subtype, the t(8;21)(q22;q22) translocation is one of the most frequent karyotypic abnormalities. The translocation produces a chimeric gene made up of the 5@#$%&-region of the RUNX1 (AML1) gene fused to the 3@#$%&-region of the CBFA2T1 (MTG8) gene. The chimeric protein is thought to associate with the nuclear corepressor/histone deacetylase complex to block hematopoietic differentiation. The protein encoded by this gene binds to the AML1-MTG8 complex and may be important in promoting leukemogenesis. Several transcript variants are thought to exist for this gene, but the full-length natures of only two have been described.
12757	This gene encodes a protein containing 9 WD repeats. WD repeats are approximately 30- to 40-amino acid domains containing several conserved residues, mostly including a trp-asp at the C-terminal end. WD domains are involved in protein-protein interactions. The encoded protein may help induce the disassembly of actin filaments. Two transcript variants encoding different isoforms have been found for this gene.
12758	This gene encodes a protein containing 9 WD repeats. WD repeats are approximately 30- to 40-amino acid domains containing several conserved residues, mostly including a trp-asp at the C-terminal end. WD domains are involved in protein-protein interactions. The encoded protein may help induce the disassembly of actin filaments. Two transcript variants encoding different isoforms have been found for this gene.
12759	
12760	Cadherins are a family of adhesion molecules that mediate Ca2+-dependent cell-cell adhesion in all solid tissues and modulate a wide variety of processes, including cell polarization and migration. Cadherin domains occur as repeats in the extracellular region and are thought to contribute to the sorting of heterogeneous cell types and the maintenance of orderly structures such as epithelium. This gene encodes a cadherin domain-containing protein whose specific function has not yet been determined. Alternative splicing occurs at this locus and two transcript variants, encoding distinct proteins, have been identified.
12761	This gene encodes a member of the diacylglycerol kinase (DGK) enzyme family of proteins, specifically the type II DGK subfamily. Members of this family are involved in regulating the intracellular concentrations of diacylglycerol and phosphatidic acid. Two transcript variants encoding distinct isoforms have been identified for this gene.
12762	
12763	The protein encoded by this gene is a member of the immunoglobulin superfamily. It is a glycosylphosphatidylinositol (GPI)-anchored neuronal membrane protein that functions as a cell adhesion molecule. It may play a role in the formation of axon connections in the developing nervous system. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
12764	This gene encodes a type-I membrane protein that is related to beta-glucosidases. Reduced production of this protein has been observed in patients with chronic renal failure (CRF), and this may be one of the factors underlying the degenerative processes (e.g., arteriosclerosis, osteoporosis, and skin atrophy) seen in CRF. Also, mutations within this protein have been associated with ageing and bone loss. Two transcript variants encoding distinct isoforms have been identified for this gene.
12765	
12766	A reciprocal translocation between chromosomes 22 and 9 produces the Philadelphia chromosome, which is often found in patients with chronic myelogenous leukemia. The chromosome 22 breakpoint for this translocation is located within the BCR gene. The translocation produces a fusion protein which is encoded by sequence from both BCR and ABL, the gene at the chromosome 9 breakpoint. Although the BCR-ABL fusion protein has been extensively studied, the function of the normal BCR gene product is not clear. The protein has serine/threonine kinase activity and is a GTPase-activating protein for p21rac.
12767	This gene belongs to the septin family of GTPases. Members of this family are required for cytokinesis. Expression is upregulated by retinoic acid in a human teratocarcinoma cell line. The specific function of this gene has not been determined. This gene encodes three transcript variants resulting in three distinct isoforms.
12768	This gene encodes a protein that is similar to the protein encoded by the breakpoint cluster region gene located on chromosome 22. The protein encoded by this gene contains a GTPase-activating protein domain, a domain found in members of the Rho family of GTP-binding proteins. Functional studies in mice determined that this protein plays a role in vestibular morphogenesis, suggesting that Rho-related GTPases help coordinate motor skills and balance. Alternatively spliced transcript variants that encode different isoforms have been reported for this gene.
12769	
12770	This gene contains a predicted ORF that encodes a protein with two zinc finger domains. The function of the encoded protein is not known. Sequence analysis suggests that multiple alternatively spliced transcript variants are derived from this gene but the full-length nature of only two of them is known. These two splice variants encode different isoforms. A pseudogene for this gene is located on Xq28.
12771	
12772	
12773	
12774	
12775	Rho GTPases are GTP binding proteins that regulate a wide spectrum of cellular functions. These cellular processes include cytoskeletal rearrangements, gene transcription, cell growth and motility. Activation of Rho GTPases is under the direct control of guanine nucleotide exchange factors (GEFs). The protein encoded by this gene is a guanine nucleotide exchange factor and belongs to the Rho GTPase GFE family. Family members share a common feature, a Dbl (DH) homology domain followed by a pleckstrin (PH) homology domain.
12776	
12777	Osteoclasts are multinucleated cells that resorb bone and are essential for bone homeostasis. This gene encodes an osteoclast-associated receptor (OSCAR), which is a member of the leukocyte receptor complex (LRC) protein family that plays critical roles in the regulation of both innate and adaptive immune responses. Different from the other LRC members, OSCAR expression is detected specifically in preosteoclasts or mature osteoclasts. OSCAR may be an important bone-specific regulator of osteoclast differentiation. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
12778	
12779	
12780	
12781	
12782	
12783	
12784	The protein encoded by this gene belongs to the BCL-2 protein family. BCL-2 family members form hetero- or homodimers and act as anti- or pro-apoptotic regulators that are involved in a wide variety of cellular activities. This protein contains a Bcl-2 homology domain 2 (BH2). The function of this gene has not yet been determined. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
12785	This gene encodes the light subunit of the ferritin protein. Ferritin is the major intracellular iron storage protein in prokaryotes and eukaryotes. It is composed of 24 subunits of the heavy and light ferritin chains. Variation in ferritin subunit composition may affect the rates of iron uptake and release in different tissues. A major function of ferritin is the storage of iron in a soluble and nontoxic state. Defects in this light chain ferritin gene are associated with several neurodegenerative diseases and hyperferritinemia-cataract syndrome. This gene has multiple pseudogenes.
12786	This gene encodes the light subunit of the ferritin protein. Ferritin is the major intracellular iron storage protein in prokaryotes and eukaryotes. It is composed of 24 subunits of the heavy and light ferritin chains. Variation in ferritin subunit composition may affect the rates of iron uptake and release in different tissues. A major function of ferritin is the storage of iron in a soluble and nontoxic state. Defects in this light chain ferritin gene are associated with several neurodegenerative diseases and hyperferritinemia-cataract syndrome. This gene has multiple pseudogenes.
12787	
12788	
12789	
12790	
12791	
12792	
12793	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. This channel protein, considered an open rectifier, is widely expressed. It is stimulated by arachidonic acid, and inhibited by internal acidification and volatile anaesthetics.
12794	
12795	
12796	
12797	
12798	
12799	The protein encoded by this gene has amino acid sequence similarity to Csk tyrosine kinase and has the structural features of the CSK subfamily: SRC homology SH2 and SH3 domains, a catalytic domain, a unique N terminus, lack of myristylation signals, lack of a negative regulatory phosphorylation site, and lack of an autophosphorylation site. This protein is thought to play a significant role in the signal transduction of hematopoietic cells. It is able to phosphorylate and inactivate Src family kinases, and may play an inhibitory role in the control of T-cell proliferation. This protein might be involved in signaling in some cases of breast cancer. There are three alternatively spliced transcript variants described for this gene encoding different isoforms.
12800	
12801	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum. The enzyme starts the process of inactivating and degrading leukotriene B4, a potent mediator of inflammation. This gene is part of a cluster of cytochrome P450 genes on chromosome 19. Another member of this family, CYP4F11, is approximately 16 kb away.
12802	
12803	
12804	
12805	
12806	
12807	
12808	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
12809	
12810	This gene encodes a member of the tweety family of proteins. Members of this family function as chloride anion channels. The encoded protein functions as a calcium(2+)-independent, volume-sensitive large conductance chloride(-) channel. Two transcript variants encoding distinct isoforms have been identified for this gene.
12811	This gene encodes a member of the tweety family of proteins. Members of this family function as chloride anion channels. The encoded protein functions as a calcium(2+)-independent, volume-sensitive large conductance chloride(-) channel. Two transcript variants encoding distinct isoforms have been identified for this gene.
12812	
12813	
12814	DNA methylation is the major modification of eukaryotic genomes and plays an essential role in mammalian development. Human proteins MECP2, MBD1, MBD2, MBD3, and MBD4 comprise a family of nuclear proteins related by the presence in each of a methyl-CpG binding domain (MBD). However, unlike the other family members, MBD3 is not capable of binding to methylated DNA. The predicted MBD3 protein shares 71% and 94% identity with MBD2 (isoform 1) and mouse Mbd3.  MBD3 is a subunit of the NuRD, a multisubunit complex containing nucleosome remodeling and histone deacetylase activities. MBD3 mediates the association of metastasis-associated protein 2 (MTA2) with the core histone deacetylase complex.
12815	This gene is upstream of, and in a head-to-head orientation with the gene for the mitochondrial ribosomal protein L34. The predicted protein contains alpha/beta hydrolase fold and secretory lipase domains.
12816	The protein encoded by this gene is a subunit of the multisubunit NADH:ubiquinone oxidoreductase (complex I). Mammalian complex I is composed of 45 different subunits. It is located at the mitochondrial inner membrane. This protein has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone.
12817	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel consists of three main subunits (a, b, c). This gene encodes the delta subunit of the catalytic core. Alternatively spliced transcript variants encoding the same isoform have been identified.
12818	
12819	ITAGV encodes integrin alpha chain V. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. The I-domain containing integrin alpha V undergoes post-translational cleavage to yield disulfide-linked heavy and light chains, that combine with multiple integrin beta chains to form different integrins. Among the known associating beta chains (beta chains 1,3,5,6, and 8; @#$%&ITGB1@#$%&, @#$%&ITGB3@#$%&, @#$%&ITGB5@#$%&, @#$%&ITGB6@#$%&, and @#$%&ITGB8@#$%&), each can interact with extracellular matrix ligands; the alpha V beta 3 integrin, perhaps the most studied of these, is referred to as the Vitronectin receptor (VNR). In addition to adhesion, many integrins are known to facilitate signal transduction.
12820	
12821	The alpha-1,3-fucosyltransferases constitute a large family of glycosyltransferases with a high degree of homology. The enzymes of this family comprise 3 main activity patterns called myeloid, plasma, and Lewis, based on their capacity to transfer alpha-L-fucose to distinct oligosaccharide acceptors, their sensitivity to N-ethylmaleimide inhibition, their cation requirements, and their tissue-specific expression patterns. The different categories of alpha-1,3-fucosyltransferases are sequentially expressed during embryo-fetal development.[supplied by OMIM]
12822	
12823	Members of the RAB3 protein family (see RAB3A; MIM 179490) are implicated in regulated exocytosis of neurotransmitters and hormones. RAB3GAP, which is involved in regulation of RAB3 activity, is a heterodimeric complex consisting a 130-kD catalytic subunit and a 150-kD noncatalytic subunit (MIM 609275). RAB3GAP specifically converts active RAB3-GTP to the inactive form RAB3-GDP (Aligianis et al., 2005).[supplied by OMIM]
12824	The protein encoded by this gene is a member of the RAMP family of single-transmembrane-domain proteins, called receptor (calcitonin) activity modifying proteins (RAMPs). RAMPs are type I transmembrane proteins with an extracellular N terminus and a cytoplasmic C terminus. RAMPs are required to transport calcitonin-receptor-like receptor (CRLR) to the plasma membrane. CRLR, a receptor with seven transmembrane domains, can function as either a calcitonin-gene-related peptide (CGRP) receptor or an adrenomedullin receptor, depending on which members of the RAMP family are expressed. In the presence of this (RAMP1) protein, CRLR functions as a CGRP receptor. The RAMP1 protein is involved in the terminal glycosylation, maturation, and presentation of the CGRP receptor to the cell surface.
12825	This gene encodes a member of the NCK family of adaptor proteins. The protein contains three SH3 domains and one SH2 domain. The protein has no known catalytic function but has been shown to bind and recruit various proteins involved in the regulation of receptor protein tyrosine kinases. It is through these regulatory activities that this protein is believed to be involved in cytoskeletal reorganization. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
12826	
12827	This gene encodes adenylyl cyclase 3 which is a membrane-associated enzyme and catalyzes the formation of the secondary messenger cyclic adenosine monophosphate (cAMP). This protein appears to be widely expressed in various human tissues and may be involved in a number of physiological and pathophysiological metabolic processes.
12828	
12829	Sulfotransferase enzymes catalyze the sulfate conjugation of many hormones, neurotransmitters, drugs, and xenobiotic compounds. These cytosolic enzymes are different in their tissue distributions and substrate specificities. The gene structure (number and length of exons) is similar among family members. This gene encodes a protein that belongs to the SULT1 subfamily, responsible for transferring a sulfo moiety from PAPS to phenol-containing compounds. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
12830	The multisubunit NADH:ubiquinone oxidoreductase (complex I) is the first enzyme complex in the electron transport chain of mitochondria. See NDUFA2 (MIM 602137).[supplied by OMIM]
12831	
12832	
12833	The inhibin alpha subunit joins either the beta A or beta B subunit to form a pituitary FSH secretion inhibitor.  Inhibin has been shown to regulate gonadal stromal cell proliferation negatively and to have tumour-suppressor activity.  In addition, serum levels of inhibin have been shown to reflect the size of granulosa-cell tumors and can therefore be used as a marker for primary as well as recurrent disease.  However, in prostate cancer, expression of the inhibin alpha-subunit gene was suppressed and was not detectable in poorly differentiated tumor cells.  Furthermore, because expression in gonadal and various extragonadal tissues may vary severalfold in a tissue-specific fashion, it is proposed that inhibin may be both a growth/differentiation factor and a hormone.
12834	
12835	Cytochrome C oxidase (COX) is the terminal enzyme of the mitochondrial respiratory chain. It is a multi-subunit enzyme complex that couples the transfer of electrons from cytochrome c to molecular oxygen and contributes to a proton electrochemical gradient across the inner mitochondrial membrane. The complex consists of 13 mitochondrial- and nuclear-encoded subunits. The mitochondrially-encoded subunits perform the electron transfer and proton pumping activities. The functions of the nuclear-encoded subunits are unknown but they may play a role in the regulation and assembly of the complex. This gene encodes the nuclear-encoded subunit Vb of the human mitochondrial respiratory chain enzyme.
12836	
12837	This gene encodes a protein that may be involved in kinetochore-microtubule interaction and spindle checkpoint activity.
12838	Motor neurons are among the earliest neurons to appear after the commencement of cell patterning and the beginning of cell differentiation. Differentiation occurs in a ventral-to-dorsal gradient and is mediated, at least in part, by the concentration of ventrally expressed sonic hedgehog protein (SHH; MIM 600725). Dorsally expressed factors, such as members of the bone morphogenic protein (e.g., BMP4; MIM 112262) and transforming growth factor-beta (e.g., TGFB1; MIM 190180) families, can repress the induction of these neurons. CRIM1 may interact with growth factors implicated in motor neuron differentiation and survival (Kolle et al., 2000).[supplied by OMIM]
12839	
12840	The protein encoded by this gene is a member of the interleukin 1 receptor family. Studies of the similar gene in mouse suggested that this receptor can be induced by proinflammatory stimuli, and may be involved in the function of helper T cells. This gene, interleukin 1 receptor, type I (IL1R1), interleukin 1 receptor, type II (IL1R2) and interleukin 1 receptor-like 2 (IL1RL2) form a cytokine receptor gene cluster in a region mapped to chromosome 2q12. Three alternatively spliced transcript variants encoding distinct isoforms have been reported.
12841	
12842	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
12843	Protein kinase C (PKC) zeta is a member of the PKC family of serine/threonine kinases which are involved in a variety of cellular processes such as proliferation, differentiation and secretion.  Unlike the classical PKC isoenzymes which are calcium-dependent, PKC zeta exhibits a constitutive kinase activity which is independent of calcium, and  PKC activators, phosphatidylserine and diacylglycerol.  Furthermore, it is insensitive to PKC inhibitors and cannot be activated by phorbol ester.  The structural and biochemical properties indicate that the zeta subspecies is related to, but distinct from other isoenzymes of PKC.
12844	
12845	The protein encoded by this gene belongs to the calcium sensitive chloride conductance protein family. To date, all members of this gene family map to the same site on chromosome 1p31-p22 and share high degrees of homology in size, sequence and predicted structure, but differ significantly in their tissue distributions. This gene product is expressed predominantly in the small intestine and colon mucosa, and may be involved in secretory or absorptive processes.
12846	
12847	
12848	
12849	
12850	
12851	
12852	
12853	
12854	
12855	
12856	
12857	SMYD3 is a histone methyltransferase that plays a role in transcriptional regulation as a member of an RNA polymerase complex.[supplied by OMIM]
12858	
12859	The protein encoded by this gene contains a classical signature of the insulin superfamily and is highly similar to relaxin 3 (RLN3/INSL7).
12860	Guanylate binding protein expression is induced by interferon.  Guanylate binding proteins are characterized by their ability to specifically bind guanine nucleotides (GMP, GDP, and GTP) and are distinguished from the GTP-binding proteins by the presence of 2 binding motifs rather than 3
12861	The protein encoded by this gene is a leucine-rich repeat protein present in connective tissue extracellular matrix. This protein functions as a molecule anchoring basement membranes to the underlying connective tissue. This protein has been shown to bind type I collagen to basement membranes and type II collagen to cartilage. It also binds the basement membrane heparan sulfate proteoglycan perlecan. This protein is suggested to be involved in the pathogenesis of Hutchinson-Gilford progeria (HGP), which is reported to lack the binding of collagen in basement membranes and cartilage. Alternatively spliced transcript variants encoding the same protein have been observed.
12862	The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which is associated with cellular proliferation and differentiation. Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. This gene product is the human ortholog of the Saccharomyces cerevisiae YVH1 protein tyrosine phosphatase. It is localized predominantly in the nucleus, and is novel in that it contains, and is regulated by a zinc finger domain.
12863	
12864	The glycoprotein encoded by this gene is a cell surface antigen that is expressed in greater than 95% of human colon cancers. The open reading frame encodes a 319-amino acid polypeptide having a putative secretory signal sequence and 3 potential glycosylation sites. The predicted mature protein has a 213-amino acid extracellular region, a single transmembrane domain, and a 62-amino acid intracellular tail. The sequence of the extracellular region contains 2 domains characteristic of the CD2 subgroup of the immunoglobulin (Ig) superfamily.
12865	The MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) family consists of six human proteins, several of which are involved the production of leukotrienes and prostaglandin E, important mediators of inflammation. This gene encodes an enzyme which catalyzes the conjugation of leukotriene A4 and reduced glutathione to produce leukotriene C4. This enzyme also demonstrates glutathione-dependent peroxidase activity towards lipid hydroperoxides.
12866	
12867	This gene encodes a protein which specifically interacts with translin, a DNA-binding protein that binds consensus sequences at breakpoint junctions of chromosomal translocations. The encoded protein contains bipartite nuclear targeting sequences that may provide nuclear transport for translin, which lacks any nuclear targeting motifs.
12868	
12869	
12870	
12871	
12872	Cytokine-inducible kinase is a putative serine/threonine kinase.  CNK contains both a catalytic domain and a putative regulatory domain.  It may play a role in regulation of cell cycle progression and tumorigenesis.
12873	
12874	RHBG and RHCG are non-erythroid members of the Rhesus (Rh) protein family that are mainly expressed in the kidney and belong to the methylammonium-ammonium permease/ammonia transporters superfamily. Rh family proteins are all predicted to be transmembrane proteins with 12 membrane spanning domains and intracytoplasmic N- and C-termini.
12875	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form complex with G proteins and stimulate rho-dependent signals.
12876	
12877	Elastases form a subfamily of serine proteases that hydrolyze many proteins in addition to elastin. Humans have six elastase genes which encode the structurally similar proteins elastase 1, 2, 2A, 2B, 3A, and 3B. Like most of the human elastases, elastase 2B is secreted from the pancreas as a zymogen. In other species, elastase 2B has been shown to preferentially cleave proteins after leucine, methionine, and phenylalanine residues.
12878	Poly(A)-binding proteins (PABPs) bind to the poly(A) tail present at the 3-prime ends of most eukaryotic mRNAs. PABPC4 or IPABP (inducible PABP) was isolated as an activation-induced T-cell mRNA encoding a protein. Activation of T cells increased PABPC4 mRNA levels in T cells approximately 5-fold. PABPC4 contains 4 RNA-binding domains and proline-rich C terminus. PABPC4 is localized primarily to the cytoplasm. It is suggested that PABPC4 might be necessary for regulation of stability of labile mRNA species in activated T cells. PABPC4 was also identified as an antigen, APP1 (activated-platelet protein-1), expressed on thrombin-activated rabbit platelets. PABPC4 may also be involved in the regulation of protein translation in platelets and megakaryocytes or may participate in the binding or stabilization of polyadenylates in platelet dense granules.
12879	
12880	Endocrine gland-derived vascular endothelial growth factor (EG-VEGF) induces proliferation, migration, and fenestration in capillary endothelial cells derived from endocrine glands. Its expression is induced by hypoxia and is restricted to the steroidogenic glands (ovary, testis, adrenal, and placenta). Its expression is often complementary to the expression of VEGF (MIM 192240), suggesting that these molecules function in a coordinated manner.[supplied by OMIM]
12881	The product of this gene belongs to the family of RAS-related proteins. These proteins share approximately 50% amino acid identity with the classical RAS proteins and have numerous structural features in common. The most striking difference between RAP proteins and RAS proteins resides in their 61st amino acid: glutamine in RAS is replaced by threonine in RAP proteins. The product of this gene counteracts the mitogenic function of RAS because it can interact with RAS GAPs and RAF in a competitive manner. Two transcript variants encoding the same protein have been identified for this gene.
12882	
12883	
12884	Adenosine monophosphate deaminase 1 catalyzes the deamination of AMP to IMP in skeletal muscle and plays an important role in the purine nucleotide cycle. Two other genes have been identified, AMPD2 and AMPD3, for the liver- and erythocyte-specific isoforms, respectively. Deficiency of the muscle-specific enzyme is apparently a common cause of exercise-induced myopathy and probably the most common cause of metabolic myopathy in the human.
12885	
12886	
12887	Dual specificity protein phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the MAPK superfamily (MAPK/ERK, SAPK/JNK, p38), which is associated with cellular proliferation and differentiation. Different members of this family of dual specificity phosphatases show distinct substrate specificities for MAPKs, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. This gene product binds to and inactivates p38 and SAPK/JNK, but not MAPK/ERK. Its subcellular localization is unique; it is evenly distributed in both the cytoplasm and the nucleus. This gene is widely expressed in various tissues and organs, and its expression is elevated by stress stimuli. Three transcript variants encoding two different isoforms have been found for this gene.
12888	
12889	Prefoldin 2 is a heterohexameric chaperone protein which assists in the correct folding of other proteins. It binds specifically to cytosolic chaperonin and transfers target proteins. Prefoldin may function by selectively targeting nascent actin and tubulin chains pending their transfer to cytosolic chaperonin for final folding and/or assembly. It promotes folding in an environment in which there are many competing pathways for nonnative proteins.
12890	
12891	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptor contributes to the clonal expansion, survival, and development of T cells. It can also induce proliferation in peripheral monocytes, enhance T cell apoptosis induced by TCR/CD3 triggered activation, and regulate CD28 co-stimulation to promote Th1 cell responses. The expression of this receptor is induced by lymphocyte activation. TRAF adaptor proteins have been shown to bind to this receptor and transduce the signals leading to activation of NF-kappaB.
12892	This gene encodes a member of the IL10 family of cytokines. It was identified as a gene induced during terminal differentiation in melanoma cells. The protein encoded by this gene can induce apoptosis selectively in various cancer cells. Overexpression of this gene leads to elevated expression of several GADD family genes, which correlates with the induction of apoptosis. The phosphorylation of mitogen-activated protein kinase 14 (MAPK7/P38), and heat shock 27kDa protein 1 (HSPB2/HSP27) are found to be induced by this gene in melanoma cells, but not in normal immortal melanocytes. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
12893	The protein encoded by this gene is a transcription factor that may function as an inhibitor or repressor in cell growth and/or maintenance, and plays a role in the negative regulation of the cell cycle. It has a possible tumor suppressive activity when over expressed. Alternative splicing occurs at this locus and two transcript variants encoding the same protein have been identified.
12894	
12895	
12896	
12897	Nephrin is a kidney glomerular filtration barrier protein that is an essential component of the interpodocyte-spanning slit diaphragm. Mutations in the nephrin gene are associated with congenital nephrotic syndrome (NPHS1; MIM 256300).[supplied by OMIM]
12898	The ASPM gene is the human ortholog of the Drosophila melanogaster @#$%&abnormal spindle@#$%& gene (asp), which is essential for normal mitotic spindle function in embryonic neuroblasts (Bond et al., 2002). The mouse gene Aspm is expressed specifically in the primary sites of prenatal cerebral cortical neurogenesis.[supplied by OMIM]
12899	
12900	HIV-1, the causative agent of acquired immunodeficiency syndrome (AIDS), contains an RNA genome that produces a chromosomally integrated DNA during the replicative cycle. Activation of HIV-1 gene expression by the transactivator Tat is dependent on an RNA regulatory element (TAR) located downstream of the transcription initiation site. This element forms a stable stem-loop structure and can be bound by either the protein encoded by this gene or by RNA polymerase II. This protein may act to disengage RNA polymerase II from TAR during transcriptional elongation. Alternatively spliced transcripts of this gene may exist, but their full-length natures have not been determined.
12901	This gene encodes a member of the grainyhead family of transcription factors. The encoded protein interacts with leader-binding protein 32 (LBP-32) and brother of mammalian grainyhead (BOM), and may function as a transcription factor during development. Multiple transcript variants encoding distinct isoforms have been identified for this gene. Additional transcript variants have been described, but their biological nature has not been determined.
12902	
12903	
12904	The protein encoded by this gene is a catalytic subunit of the AMP-activated protein kinase (AMPK). AMPK is a heterotrimer consisting of an alpha catalytic subunit, and non-catalytic beta and gamma subunits. AMPK is an important energy-sensing enzyme that monitors cellular energy status. In response to cellular metabolic stresses, AMPK is activated, and thus phosphorylates and inactivates acetyl-CoA carboxylase (ACC) and beta-hydroxy beta-methylglutaryl-CoA reductase (HMGCR), key enzymes involved in regulating de novo biosynthesis of fatty acid and cholesterol. Studies of the mouse counterpart suggest that this catalytic subunit may control whole-body insulin sensitivity and is necessary for maintaining myocardial energy homeostasis during ischemia.
12905	
12906	
12907	
12908	Geranylgeranyl diphosphate (GGPP) synthase (GGPS) catalyzes the synthesis of GGPP, an important molecule responsible for the C20-prenylation of protein and for the regulation of a nuclear hormone receptor.[supplied by OMIM]
12909	
12910	Geranylgeranyl diphosphate (GGPP) synthase (GGPS) catalyzes the synthesis of GGPP, an important molecule responsible for the C20-prenylation of protein and for the regulation of a nuclear hormone receptor.[supplied by OMIM]
12911	
12912	This gene encodes a transmembrane glycoprotein related to the type I receptors of the transforming growth factor-beta (TGF-beta) family, whose members play important roles in signal transduction in many developmental and pathological processes. The encoded protein however is a pseudoreceptor, lacking an intracellular serine/threonine kinase domain required for signaling. Similar proteins in frog, mouse and zebrafish function as negative regulators of TGF-beta, which has led to the suggestion that the encoded protein may function to limit the signaling range of the TGF-beta family during early embryogenesis.
12913	
12914	
12915	The protein encoded by this gene is a member of the bone morphogenetic protein (BMP) family and the TGF-beta superfamily. This group of proteins is characterized by a polybasic proteolytic processing site which is cleaved to produce a mature protein containing seven conserved cysteine residues. The members of this family are regulators of cell growth and differentiation in both embryonic and adult tissues. Studies in mice suggest that the protein encoded by this gene plays a role in skeletal morphogenesis.
12916	The adrenergic receptors (subtypes alpha 1, alpha 2, beta 1, and beta 2) are a prototypic family of guanine nucleotide binding regulatory protein-coupled receptors that mediate the physiological effects of the hormone epinephrine and the neurotransmitter norepinephrine.
12917	
12918	This gene encodes a motor protein that belongs to the kinesin-like protein family. Members of this protein family are known to be involved in various kinds of spindle dynamics. The function of this gene product includes chromosome positioning, centrosome separation and establishing a bipolar spindle during cell mitosis.
12919	
12920	The protein encoded by this gene is related to the nuclear chaperone phosphoproteins, nucleoplasmin and nucleophosmin.  It is highly homologous to the murine Npm3 gene.  Based on the structural similarity of the human NPM3 gene product to nucleoplasmin and nucleophosmin, NPM3 may represent a new member of this gene family, and may share basic functions with the molecular chaperones.
12921	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes an integral subunit of TFIID associated with all transcriptionally competent forms of that complex. This subunit interacts strongly with two TFIID subunits that show similarity to histones H3 and H4, and it may participate in forming a nucleosome-like core in the TFIID complex.
12922	This gene encodes the alpha chain of type XVII collagen. Unlike most collagens, collagen XVII is a transmembrane protein. Collagen XVII is a structural component of hemidesmosomes, multiprotein complexes at the dermal-epidermal basement membrane zone that mediate adhesion of keratinocytes to the underlying membrane. Mutations in this gene are associated with both generalized atrophic benign and junctional epidermolysis bullosa. Two homotrimeric forms of type XVII collagen exist. The full length form is the transmembrane protein. A soluble form, referred to as either ectodomain or LAD-1, is generated by proteolytic processing of the full length form. Two transcript variants, one resulting from alternative splicing in the 3@#$%& UTR, have been identified for this gene.
12923	UNC5B belongs to a family of netrin-1 (MIM 601614) receptors thought to mediate the chemorepulsive effect of netrin-1 on specific axons. For more information on UNC5 proteins, see UNC5C (MIM 603610).[supplied by OMIM]
12924	Zinc finger proteins (ZNFs), which bind nucleic acids, perform many key functions, the most important of which is regulating transcription. See ZNF91 (MIM 603971) for general information on ZNFs.[supplied by OMIM]
12925	Cholinergic systems are implicated in numerous neurologic functions. Alteration in some cholinergic neurons may account for the disturbances of Alzheimer disease. The protein encoded by this gene synthesizes the neurotransmitter acetylcholine. Alternative splice variants have been found that contain alternative 5@#$%& untranslated exons. Three of the four described splice variants encode identical 69 kDa proteins while one variant encodes both the 69 kDa and a larger 82 kDa protein.
12926	
12927	DNAJB12 belongs to the evolutionarily conserved DNAJ/HSP40 family of proteins, which regulate molecular chaperone activity by stimulating ATPase activity. DNAJ proteins may have up to 3 distinct domains: a conserved 70-amino acid J domain, usually at the N terminus, a glycine/phenylalanine (G/F)-rich region, and a cysteine-rich domain containing 4 motifs resembling a zinc-finger domain (Ohtsuka and Hata, 2000).[supplied by OMIM]
12928	
12929	
12930	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It encodes a protein which shows 95%, 86% and 71% amino acid identity to the mouse, zebrafish and Xenopus Wnt8B proteins, respectively. The expression patterns of the human and mouse genes appear identical and are restricted to the developing brain. The chromosomal location of this gene to 10q24 suggests it as a candidate gene for partial epilepsy.
12931	
12932	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum but its specific substrate has not yet been determined. The gene is located within a cluster of cytochrome P450 genes on chromosome 10q24. An additional gene, CYP2C17, was once thought to exist; however, CYP4217 is now considered an artefact based on a chimera of CYP4218 and CYP2C19.
12933	
12934	
12935	BAG proteins compete with Hip for binding to the Hsc70/Hsp70 ATPase domain and promote substrate release. All the BAG proteins have an approximately 45-amino acid BAG domain near the C terminus but differ markedly in their N-terminal regions. The protein encoded by this gene contains a WW domain in the N-terminal region and a BAG domain in the C-terminal region. The BAG domains of BAG1, BAG2, and BAG3 interact specifically with the Hsc70 ATPase domain in vitro and in mammalian cells. All 3 proteins bind with high affinity to the ATPase domain of Hsc70 and inhibit its chaperone activity in a Hip-repressible manner.
12936	The protein encoded by this gene is a lysosomal aspartyl protease composed of a dimer of disulfide-linked heavy and light chains, both produced from a single protein precursor. This proteinase, which is a member of the peptidase C1 family, has a specificity similar to but narrower than that of pepsin A. Transcription of this gene is initiated from several sites, including one which is a start site for an estrogen-regulated transcript. Mutations in this gene are involved in the pathogenesis of several diseases, including breast cancer and possibly Alzheimer disease.
12937	
12938	
12939	
12940	DDB2 is the smaller subunit of a heterodimeric protein implicated in the etiology of xeroderma pigmentosum group E. This subunit appears to be required for DNA binding.
12941	The protein encoded by this gene belongs to the innexin family. Innexin family members are the structural components of gap junctions. This protein and pannexin 2 are abundantly expressed in central nerve system (CNS) and are coexpressed in various neuronal populations. Studies in Xenopus oocytes suggest that this protein alone and in combination with pannexin 2 may form cell type-specific gap junctions with distinct properties.
12942	The protein encoded by this gene is the terminal enzyme of the respiratory chain, coupling the transfer of electrons from cytochrome c to molecular oxygen, with the concomitant production of a proton electrochemical gradient across the inner mitochondrial membrane. In addition to 3 mitochondrially encoded subunits, which perform the catalytic function, the eukaryotic enzyme contains nuclear-encoded smaller subunits, ranging in number from 4 in some organisms to 10 in mammals. It has been proposed that nuclear-encoded subunits may be involved in the modulation of the catalytic function. This gene encodes one of the nuclear-encoded subunits.
12943	This gene belongs to a highly conserved gene family encoding EPS15 homology (EH) domain-containing proteins. The protein-binding EH domain was first noted in EPS15, a substrate for the epidermal growth factor receptor. The EH domain has been shown to be an important motif in proteins involved in protein-protein interactions and in intracellular sorting. The protein encoded by this gene is thought to play a role in the endocytosis of IGF1 receptors.
12944	Glycogen phosphorylase (EC 2.4.1.1) catalyzes and regulates the breakdown of glycogen to glucose-1-phosphate.[supplied by OMIM]
12945	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein.
12946	
12947	Calcium ions play a primary role in the regulation of sperm motility. This gene belongs to a family of putative cation channels that are specific to spermatozoa and localize to the flagellum. The protein family features a single repeat with six membrane-spanning segments and a predicted calcium-selective pore region.
12948	
12949	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes including embryonic development cell growth, morphogenesis, tissue repair, tumor growth and invasion. This growth factor is a high affinity, heparin dependent ligand for FGFR4. Expression of this gene was detected only in fetal but not adult brain tissue. Synergistic interaction of the chick homolog and Wnt-8c has been shown to be required for initiation of inner ear development.
12950	The vitamin A metabolite, all-trans retinoic acid (atRA), plays an important role in neuronal development, including neurite outgrowth. RAINB1 is an atRA-responsive gene.[supplied by OMIM]
12951	
12952	This gene encodes a member of the ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) protein family. ADAMTS family members share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The protein encoded by this gene has a high sequence similarity to the proteins encoded by ADAMTS1 and ADAMTS8. The function of this protein has not been determined.
12953	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. The enzyme encoded by this gene degrades proteoglycans, fibronectin, elastin and casein and differs from most MMP family members in that it lacks a conserved C-terminal protein domain. The enzyme is involved in wound healing, and studies in mice suggest that it regulates the activity of defensins in intestinal mucosa. The gene is part of a cluster of MMP genes which localize to chromosome 11q22.3.
12954	This gene encodes a protein that belongs to the serine protease family. Serine proteases are known to be involved in many physiological and pathological processes. The function of this gene has not yet been determined.
12955	
12956	
12957	
12958	
12959	
12960	
12961	
12962	
12963	
12964	
12965	
12966	In S. cerevisiae, Pso4 has pleiotropic functions in DNA recombination and in error-prone nonhomologous end-joining DNA repair.[supplied by OMIM]
12967	
12968	Carbamoyl phosphate synthetase I (EC 6.3.4.16) is the rate-limiting enzyme that catalyzes the first committed step of the hepatic urea cycle. The mitochondrial isozyme is designated CPS I and the cytoplasmic enzyme CPS II. CPS II is part of a multifunctional enzyme, called the CAD trifunctional protein of pyrimidine biosynthesis (CAD; MIM 114010), which has been mapped to 2p21.[supplied by OMIM]
12969	
12970	
12971	
12972	Lactate dehydrogenase A catalyzes the conversion of L-lactate and NAD to pyruvate and NADH in the final step of anaerobic glycolysis.  LDHA is found predominantly in muscle tissue and belongs to the lactate dehydrogenase family.  Mutations in LDHA have been linked to exertional myoglobinuria.
12973	
12974	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Most members contain an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain. Transcript variants encoding different isoforms have been identified.
12975	This gene encodes a member of the Tubby family of bipartite transcription factors. The encoded protein may play a role in obesity and sensorineural degradation. The crystal structure has been determined for a similar protein in mouse, and it functions as a membrane-bound transcription regulator that translocates to the nucleus in response to phosphoinositide hydrolysis. Two transcript variants encoding distinct isoforms have been identified for this gene.
12976	LMO2 encodes a cysteine-rich, two LIM-domain protein that is required for yolk sac erythropoiesis.  The LMO2 protein has a central and crucial role in hematopoietic development and is highly conserved.  The LMO2 transcription start site is located approximately 25 kb downstream from the 11p13 T-cell translocation cluster (11p13 ttc), where a number T-cell acute lymphoblastic leukemia-specific translocations occur.
12977	The gamma globin genes (HBG1 and HBG2) are normally expressed in the fetal liver, spleen and bone marrow. Two gamma chains together with two alpha chains constitute fetal hemoglobin (HbF) which is normally replaced by adult hemoglobin (HbA) at birth. In some beta-thalassemias and related conditions, gamma chain production continues into adulthood. The two types of gamma chains differ at residue 136 where glycine is found in the G-gamma product (HBG2) and alanine is found in the A-gamma product (HBG1). The former is predominant at birth. The order of the genes in the beta-globin cluster is: 5@#$%&-epsilon -- gamma-G -- gamma-A -- delta -- beta--3@#$%&
12978	
12979	This gene encodes a type 1 transmembrane protein of unknown function. It is one of several genes located in the imprinted gene domain of 11p15.5, an important tumor-suppressor gene region. Alterations in this region have been associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocrotical carcinoma, and lung, ovarian, and breast cancer. This gene may play a role in malignancies and disease that involve this region, as well as early hematopoiesis, by mediating attachment to stromal cells. This gene is oriented in a head-to-tail configuration with the ribonucleotide reductase 1 gene (RRM1), with the 3@#$%& end of this gene situated 1.6 kb from the 5@#$%& end of the RRM1 gene.
12980	
12981	
12982	Acyl-CoA:diacylglycerol acyltransferase, or DGAT (EC 2.3.1.20), is responsible for the synthesis of triglycerides. It catalyzes a reaction in which diacylglycerol is covalently joined to long chain fatty acyl-CoAs.[supplied by OMIM]
12983	
12984	This gene encodes a member of the Polycomb-group (PcG) family. PcG family members form multimeric protein complexes, which are involved in maintaining the transcriptional repressive state of genes over successive cell generations. This protein interacts with enhancer of zeste 2, the cytoplasmic tail of integrin beta7, immunodeficiency virus type 1 (HIV-1) MA protein, and histone deacetylase proteins. This protein mediates repression of gene activity through histone deacetylation, and may act as a specific regulator of integrin function. Two transcript variants encoding distinct isoforms have been identified for this gene.
12985	The cbl oncogene was first identified as part of a transforming retrovirus which induces mouse pre-B and pro-B cell lymphomas. As an adaptor protein for receptor protein-tyrosine kinases, it positively regulates receptor protein-tyrosine kinase ubiquitination in a manner dependent upon its variant SH2 and RING finger domains. Ubiquitination of receptor protein-tyrosine kinases terminates signaling by marking active receptors for degradation.
12986	
12987	ERCC3 is an ATP-dependent DNA helicase that functions in nucleotide excision repair and complements xeroderma pigmentosum group B mutations. It also is the 89 kDa subunit of basal transcription factor 2 (TFIIH) and thus functions in class II transcription.
12988	The protein encoded by this gene is a member of the gelsolin/villin family of actin regulatory proteins. This protein has structural similarity to villin. It binds actin and may play a role in the development of neuronal cells that form ganglia.
12989	
12990	
12991	
12992	
12993	
12994	
12995	
12996	Members of the Rho GTPase family, such as RND1, regulate the organization of the actin cytoskeleton in response to extracellular growth factors (Nobes et al., 1998).[supplied by OMIM]
12997	
12998	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains an extracellular domain, a single transmembrane segment and one intracytoplasmic catalytic domain, thus belongs to receptor type PTP. The extracellular region of this PTP is composed of multiple fibronectin type_III repeats, which was shown to interact with neuronal receptor and cell adhesion molecules, such as contactin and tenascin C. This protein was also found to interact with sodium channels, and thus may regulate sodium channels by altering tyrosine phosphorylation status. The functions of the interaction partners of this protein implicate the roles of this PTP in cell adhesion, neurite growth, and neuronal differentiation.
12999	
13000	
13001	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
13002	
13003	
13004	
13005	The product of this gene belongs to the SNF2/RAD54 helicase family. It represents the main component of the nucleosome remodeling and deacetylase complex and plays an important role in epigenetic transcriptional repression. Patients with dermatomyositis develop antibodies against this protein.
13006	
13007	The protein encoded by this gene is a member of the tumor necrosis factor (TNF) family of receptors. It is expressed on the surface of most cell types, including cells of epithelial and myeloid lineages, but not on T and B lymphocytes. The protein specifically binds the lymphotoxin membrane form (a complex of lymphotoxin-alpha and lymphtoxin-beta). The encoded protein and its ligand play a role in the development and organization of lymphoid tissue and tranformed cells. Activation of the encoded protein can trigger apoptosis.
13008	
13009	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It encodes a protein which shows 94% and 80% amino acid identity to the mouse Wnt5b protein and the human WNT5A protein, respectively. Alternative splicing of this gene generates 2 transcript variants.
13010	
13011	The protein encoded by this gene was identified by its ability to bind to the nuclear localization signal of BRCA1 and other proteins. It is a cytoplasmic protein which may regulate nuclear targeting by retaining proteins with a nuclear localization signal in the cytoplasm.
13012	The protein encoded by this gene contains a ring finger motif, which is known to be involved in protein-protein interactions. The specific function of this protein has not yet been determined. EST data suggests the existence of multiple alternatively spliced transcript variants, however, their full length nature is not known.
13013	
13014	The tumor suppressor WT1 represses and activates transcription. The protein encoded by this gene is a WT1-interacting protein that itself functions as a transcriptional repressor. It contains a putative leucine zipper domain which interacts with the zinc finger DNA binding domain of WT1. This protein is specifically upregulated during apoptosis of prostate cells.
13015	
13016	Potassium channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. Four sequence-related potassium channel genes - shaker, shaw, shab, and shal - have been identified in Drosophila, and each has been shown to have human homolog(s). This gene encodes a member of the potassium channel, voltage-gated, shaker-related subfamily. This member contains six membrane-spanning domains with a shaker-type repeat in the fourth segment. It belongs to the delayed rectifier class. The coding region of this gene is intronless, and the gene is clustered with genes KCNA1 and KCNA5 on chromosome 12.
13017	
13018	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Alternative splicing of this gene generates 2 transcript variants.
13019	
13020	
13021	
13022	This gene encodes a member of the ZIC family of C2H2-type zinc finger proteins. Members of this family are important during development, and have been associated X-linked visceral heterotaxy and holoprosencephaly type 5. This gene is closely linked to a gene encoding zinc finger protein of the cerebellum 2, a related family member on chromosome 13. This gene encodes a protein of unknown function.
13023	This gene encodes the insulin receptor substrate 2, a cytoplasmic signaling molecule that mediates effects of insulin, insulin-like growth factor 1, and other cytokines by acting as a molecular adaptor between diverse receptor tyrosine kinases and downstream effectors. The product of this gene is phosphorylated by the insulin receptor tyrosine kinase upon receptor stimulation, as well as by an interleukin 4 receptor-associated kinase in response to IL4 treatment.
13024	
13025	Multiprotein complexes are key determinants of Golgi apparatus structure and its capacity for intracellular transport and glycoprotein modification. Several complexes have been identified, including the Golgi transport complex (GTC), the LDLC complex, which is involved in glycosylation reactions, and the SEC34 complex, which is involved in vesicular transport. These 3 complexes are identical and have been termed the conserved oligomeric Golgi (COG) complex, which includes COG6 (Ungar et al., 2002).[supplied by OMIM]
13026	
13027	
13028	This gene encodes a member of the Kruppel-like factor subfamily of zinc finger proteins. Since the protein localizes to the nucleus and binds the epidermal growth factor response element, it is thought to be a transcription factor.
13029	The protein encoded by this gene is similar to oxidoreductases, which are enzymes involved in cellular responses to oxidative stresses and irradiation. This gene is induced by the tumor suppressor p53 and is thought to be involved in p53-mediated cell death. It contains a p53 consensus binding site in its promoter region and a downstream pentanucleotide microsatellite sequence. P53 has been shown to transcriptionally activate this gene by interacting with the downstream pentanucleotide microsatellite sequence. The microsatellite is polymorphic, with a varying number of pentanucleotide repeats directly correlated with the extent of transcriptional activation by p53. It has been suggested that the microsatellite polymorphism may be associated with differential susceptibility to cancer. At least two transcript variants encoding the same protein have been found for this gene.
13030	
13031	
13032	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein associates with the long isoform of the leptin receptor, the transforming growth factor-beta family of receptor serine-threonine kinases, and with receptor tyrosine kinases for platelet-derived growth factor, insulin, and epidermal growth factor. This protein may form oligomeric complexes with family member proteins through interactions of both the PX domain and the coiled coil regions of the molecules. Translocation of this protein from the cytoplasm to the nucleus occurs after binding to proviral integration site 1 protein. This gene results in two transcripts encoding two distinct isoforms.
13033	
13034	
13035	
13036	
13037	
13038	
13039	
13040	Functions of the cell cortex, including motility, adhesion, and cytokinesis, are mediated by the reorganization of the actin cytoskeleton and recent evidence suggests a role for the Formin homology (FH) proteins in these processes. The protein encoded by this gene contains FH domains and belongs to a novel FH protein subfamily implicated in cell polarity. Wnt/Fz signaling activates the small GTPase Rho, a key regulator of cytoskeleton architecture, to control cell polarity and movement during development. Activation requires Dvl-Rho complex formation, an assembly mediated by this gene product, which is thought to function as a scaffolding protein. Evidence of alternative splicing has been observed for this gene but the full-length nature of these variants has not been determined.
13041	The protein encoded by this gene is a member of the bone morphogenetic protein family which is part of the transforming growth factor-beta superfamily. The superfamily includes large families of growth and differentiation factors. Bone morphogenetic proteins were originally identified by an ability of demineralized bone extract to induce endochondral osteogenesis in vivo in an extraskeletal site. This particular family member plays an important role in the onset of endochondral bone formation in humans, and a reduction in expression has been associated with a variety of bone diseases, including the heritable disorder Fibrodysplasia Ossificans Progressiva. Alternative splicing in the 5@#$%& untranslated region of this gene has been described and three variants are described, all encoding an identical protein.
13042	This gene encodes one of the proteins important for centrosomal function. This protein is important for positioning and anchoring the microtubules minus-ends in epithelial cells. Localization of this protein to the centrosome requires three leucine zippers in the central coiled-coil domain. Multiple alternatively spliced transcript variants that encode different isoforms have been reported.
13043	
13044	
13045	
13046	
13047	
13048	
13049	The protein encoded by this gene is a member of the chromogranin/secretogranin family of neuroendocrine secretory proteins. Granins may serve as precursors for biologically active peptides. Some granins have been shown to function as helper proteins in sorting and proteolytic processing of prohormones; however, the function of this protein is unknown.
13050	
13051	The protein encoded by this gene is a glycoprotein and a member of the NADPH oxidase family. The synthesis of thyroid hormone is catalyzed by a protein complex located at the apical membrane of thyroid follicular cells. This complex contains an iodide transporter, thyroperoxidase, and a peroxide generating system that includes this encoded protein and DUOX2. This protein is known as dual oxidase because it has both a peroxidase homology domain and a gp91phox domain. Two alternatively spliced transcript variants encoding the same protein have been described for this gene.
13052	
13053	
13054	
13055	
13056	
13057	
13058	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intra-cellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the MDR/TAP subfamily.  Members of the MDR/TAP subfamily are involved in multidrug resistance as well as antigen presentation.  This half-transporter likely plays a role in mitochondrial function.  Localized to 2q26, this gene is considered a candidate gene for lethal neonatal metabolic syndrome, a disorder of mitochondrial function.
13059	
13060	SPRED1 is a member of the Sprouty (see SPRY1; MIM 602465)/SPRED family of proteins that regulate growth factor-induced activation of the MAP kinase cascade (see MAPK1; MIM 176948) (Nonami et al., 2004).[supplied by OMIM]
13061	
13062	This gene is a member of the H/ACA snoRNPs (small nucleolar ribonucleoproteins) gene family. snoRNPs are involved in various aspects of rRNA processing and modification and have been classified into two families: C/D and H/ACA. The H/ACA snoRNPs also include the DKC1, NOLA1 and NOLA2 proteins. These four H/ACA snoRNP proteins localize to the dense fibrillar components of nucleoli and to coiled (Cajal) bodies in the nucleus. Both 18S rRNA production and rRNA pseudouridylation are impaired if any one of the four proteins is depleted. The four H/ACA snoRNP proteins are also components of the telomerase complex. This gene encodes a protein related to Saccharomyces cerevisiae Nop10p.
13063	The protein encoded by this gene possesses long-chain acyl-CoA synthetase activity. It is thought to play a central role in brain very long-chain fatty acids metabolism and myelinogenesis.
13064	
13065	EIF2AK4 belongs to a family of kinases that phosphorylate the alpha subunit of eukaryotic translation initiation factor-2 (EIF2S1; MIM 603907) to downregulate protein synthesis in response to varied cellular stresses (Berlanga et al., 1999).[supplied by OMIM]
13066	
13067	
13068	
13069	
13070	This gene belongs to the ETS transcription factor family. ETS family members have a highly conserved 85-amino acid ETS domain that binds purine-rich DNA sequences. The alanine-rich C-terminus of this gene indicates that it may act as a transcription repressor. This gene is exclusively expressed in neurons of the central serotonin (5-HT) system, a system implicated in the pathogeny of such psychiatric diseases as depression, anxiety, and eating disorders. In some types of Ewing tumors, this gene is fused to the Ewing sarcoma (EWS) gene following chromosome translocations.
13071	
13072	The protein encoded by this gene is a member of kinesin-like protein family. This family of proteins are microtubule-dependent molecular motors that transport organelles within cells and move chromosomes during cell division. The C-terminal half of this protein has been shown to bind DNA. Studies with the Xenopus homolog suggests its essential role in metaphase chromosome alignment and maintenance.
13073	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is known as the D subunit and is found ubiquitously.
13074	
13075	The protein encoded by this gene is a death domain containing adaptor molecule that interacts with TNFRSF1A/TNFR1 and mediates programmed cell death signaling and NF-kappaB activation. This protein binds adaptor protein TRAF2, reduces the recruitment of inhibitor-of-apoptosis proteins (IAPs) by TRAF2, and thus suppresses TRAF2 mediated apoptosis. This protein can also interact with receptor TNFRSF6/FAS and adaptor protein FADD/MORT1, and is involved in the Fas-induced cell death pathway. Two alternatively spliced transcript variants encoding distinct isoforms have been reported.
13076	
13077	
13078	
13079	
13080	
13081	Apolipoprotein B48 receptor is a macrophage receptor that binds to the apolipoprotein B48 of dietary triglyceride (TG)-rich lipoproteins. This receptor may provide essential lipids, lipid-soluble vitamins and other nutrients to reticuloendothelial cells. If overwhelmed with elevated plasma triglyceride, the apolipoprotein B48 receptor may contribute to foam cell formation, endothelial dysfunction, and atherothrombogenesis.
13082	
13083	
13084	
13085	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is part of the V0 domain. This gene had the previous symbols of ATP6C and ATP6L.
13086	
13087	
13088	The accumulation of unfolded proteins in the endoplasmic reticulum (ER) triggers the ER stress response. This response includes the inhibition of translation to prevent further accumulation of unfolded proteins, the increased expression of proteins involved in polypeptide folding, known as the unfolded protein response (UPR), and the destruction of misfolded proteins by the ER-associated protein degradation (ERAD) system. This gene may play a role in both UPR and ERAD. Its expression is induced by UPR and it has an ER stress response element in its promoter region while the encoded protein has an N-terminal ubiquitin-like domain which may interact with the ERAD system. This protein has been shown to interact with presenilin proteins and to increase the level of amyloid-beta protein following its overexpression. Alternative splicing of this gene produces multiple transcript variants, some encoding different isoforms. The full-length nature of all transcript variants has not been determined.
13089	ITGAL encodes the integrin alpha L chain. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. This I-domain containing alpha integrin combines with the beta 2 chain (ITGB2) to form the integrin lymphocyte function-associated antigen-1 (LFA-1), which is expressed on all leukocytes. LFA-1 plays a central role in leukocyte intercellular adhesion through interactions with its ligands, ICAMs 1-3 (intercellular adhesion molecules 1 through 3), and also functions in lymphocyte costimulatory signaling.
13090	
13091	
13092	The protein encoded by this gene belongs to the D-isomer specific 2-hydroxyacid dehydrogenase family. The similar protein in yeast has both D-lactate and D-glycerate dehydrogenase activities. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
13093	
13094	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein.
13095	
13096	
13097	
13098	This gene is similar to a mouse gene that is expressed in the testis.
13099	
13100	
13101	
13102	
13103	This gene encodes a protein with 5-phosphatase activity toward polyphosphate inositol. The protein localizes to the cytosol in regions lacking actin stress fibers. It is thought that this protein may negatively regulate the actin cytoskeleton. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
13104	Although the function of this gene is not known, the existence of this gene is supported by mRNA and EST data. A possible function of the encoded protein can be inferred from amino acid sequence similarity to the E.coli FtsJ protein and to a mouse protein possibly involved in embryogenesis.
13105	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role in cells. The protein encoded by this gene is one of the PKC family members. This kinase has been reported to play roles in many different cellular processes, such as cell adhesion, cell transformation, cell cycle checkpoint, and cell volume control. Knockout studies in mice suggest that this kinase may be a fundamental regulator of cardiac contractility and Ca(2+) handling in myocytes.
13106	This gene encodes a protein that is similar to a serine/threonine kinase in C. elegans which is involved in axonal elongation. The structure of this protein is similar to the C. elegans protein in that both proteins have an N-terminal kinase domain, a central proline/serine rich (PS) domain, and a C-terminal (C) domain. The gene is located within the Smith-Magenis syndrome region on chromosome 17.
13107	
13108	
13109	The beta adaptin subunit is part of the clathrin coat assembly complex which links clathrin to receptors in coated pits and vesicles. These vesicles are involved in endocytosis and Golgi processing. The beta 1 subunit is one of the assembly proteins which binds to clathrin and initiates coat formation.
13110	
13111	
13112	
13113	Spinophilin is a regulatory subunit of protein phosphatase-1 catalytic subunit (PP1; see MIM 176875) and is highly enriched in dendritic spines, specialized protrusions from dendritic shafts that receive most of the excitatory input in the central nervous system.[supplied by OMIM]
13114	ITGA3 encodes the integrin alpha 3 chain. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. Alpha chain 3 undergoes post-translational cleavage in the extracellular domain to yield disulfide-linked light and heavy chains that join with beta 1 to form an integrin that interacts with many extracellular-matrix proteins. The alpha 3 beta 1 integrin is known variously as: very late (activation) antigen 3 (@#$%&VLA-3@#$%&), very common antigen 2 (@#$%&VCA-2@#$%&), extracellular matrix receptor 1 (@#$%&ECMR1@#$%&), and galactoprotein b3 (@#$%&GAPB3@#$%&).
13115	This gene is one of several Cys-Cys (CC) cytokine genes clustered on the q arm of chromosome 17. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene displays chemotactic activity for naive T cells, CD4+ and CD8+ T cells and nonactivated lymphocytes, but not for monocytes or granulocytes. This chemokine attracts naive T lymphocytes toward dendritic cells and activated macrophages in lymph nodes. It may play a role in both humoral and cell-mediated immunity responses.
13116	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, which are located on different chromosomes and consist of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXB genes located in a cluster on chromosome 17.
13117	
13118	
13119	Caveolae are small domains on the inner cell membrane involved in vesicular trafficking and signal transduction.  FLOT2 encodes a caveolae-associated, integral membrane protein.  Flotillin 2 is though to function in neuronal signaling.
13120	This gene encodes a member of the dishevelled (dsh) protein family. The vertebrate dsh proteins have approximately 40% amino acid sequence similarity with Drosophila dsh. This gene encodes a 90-kD protein that undergoes posttranslational phosphorylation to form a 95-kD cytoplasmic protein, which may play a role in the signal transduction pathway mediated by multiple Wnt proteins. The mechanisms of dishevelled function in Wnt signaling are likely to be conserved among metazoans.
13121	
13122	
13123	
13124	
13125	The protein encoded by this gene is a member of the keratin gene family. As a type I hair keratin, it is an acidic protein which heterodimerizes with type II keratins to form hair and nails. The type I hair keratins are clustered in a region of chromosome 17q12-q21 and have the same direction of transcription.
13126	KRT12 encodes the type I intermediate filament chain keratin 12, expressed in corneal epithelia. Mutations in this gene lead to Meesmann corneal dystrophy.
13127	This gene encodes an adenosine receptor that is a member of the G protein-coupled receptor superfamily. This integral membrane protein stimulates adenylate cyclase activity in the presence of adenosine. This protein also interacts with netrin-1, which is involved in axon elongation. The gene is located near the Smith-Magenis syndrome region on chromosome 17.
13128	
13129	Nucleolin (NCL), a eukaryotic nucleolar phosphoprotein, is involved in the synthesis and maturation of ribosomes. It is located mainly in dense fibrillar regions of the nucleolus. Human NCL gene consists of 14 exons with 13 introns and spans approximately 11kb. The intron 11 of the NCL gene encodes a small nucleolar RNA, termed U20.
13130	
13131	
13132	
13133	
13134	This gene was identified by the similarity of its product to the Aspergillus nidulans SUDD protein, an extragenic suppressor of the heat-sensitive bimD6 mutation that fails to attach properly to the spindle microtubules at a restrictive temperature. The specific function of this gene has not yet been determined. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
13135	This gene was identified by the similarity of its product to the Aspergillus nidulans SUDD protein, an extragenic suppressor of the heat-sensitive bimD6 mutation that fails to attach properly to the spindle microtubules at a restrictive temperature. The specific function of this gene has not yet been determined. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
13136	
13137	
13138	
13139	
13140	
13141	
13142	
13143	
13144	
13145	
13146	
13147	This gene encodes an alpha chain for one of the low abundance fibrillar collagens. Fibrillar collagen molecules are trimers that can be composed of one or more types of alpha chains. Type V collagen is found in tissues containing type I collagen and appears to regulate the assembly of heterotypic fibers composed of both type I and type V collagen. This gene product is closely related to type XI collagen and it is possible that the collagen chains of types V and XI constitute a single collagen type with tissue-specific chain combinations. Mutations in this gene are thought to be responsible for the symptoms of a subset of patients with Ehlers-Danlos syndrome type III. Messages of several sizes can be detected in northern blots but sequence information cannot confirm the identity of the shorter messages.
13148	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum. Although its endogenous substrate has not been determined, it is known to metabolize 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone, a major nitrosamine specific to tobacco. This gene is part of a large cluster of cytochrome P450 genes from the CYP2A, CYP2B and CYP2F subfamilies on chromosome 19q.
13149	
13150	
13151	N-acetylglucosamine kinase (NAGK; EC 2.7.1.59) converts endogenous N-acetylglucosamine (GlcNAc), a major component of complex carbohydrates, from lysosomal degradation or nutritional sources into GlcNAc 6-phosphate. NAGK belongs to the group of N-acetylhexosamine kinases and is a prominent salvage enzyme of amino sugar metabolism in mammals.[supplied by OMIM]
13152	
13153	
13154	
13155	
13156	
13157	TULP2 is a member of a family of tubby-like genes (TULPs) that encode proteins of unknown function. Members of this family have been identified in plants, vertebrates, and invertebrates. The TULP proteins share a conserved C-terminal region of approximately 200 amino acid residues.
13158	The @#$%&SNARE hypothesis@#$%& is a model explaining the process of docking and fusion of vesicles to their target membranes. According to this model, membrane proteins from the vesicle (v-SNAREs) and proteins from the target membrane (t-SNAREs) govern the specificity of vesicle targeting and docking through mutual recognition. Once the 2 classes of SNAREs bind to each other, they form a complex that recruits the general elements of the fusion apparatus, namely NSF (N-ethylmaleimide-sensitive factor) and SNAPs (soluble NSF-attachment proteins), to the site of membrane fusion, thereby forming the 20S fusion complex. Alpha- and gamma-SNAP are found in a wide range of tissues and act synergistically in intra-Golgi transport. The sequence of the predicted 295-amino acid human protein encoded by NAPA shares 37%, 60%, and 67% identity with the sequences of yeast, Drosophila, and squid alpha-SNAP, respectively. Platelets contain some of the same proteins, including NSF, p115/TAP, alpha-SNAP, gamma-SNAP, and the t-SNAREs syntaxin-2 and syntaxin-4, that are used in many vesicular transport processes in other cell types. Platelet exocytosis uses a molecular mechanism similar to that used by other secretory cells, such as neurons, although the proteins used by the platelet and their modes of regulation may be quite different.
13159	
13160	Docking protein 1 is constitutively tyrosine phosphorylated in hematopoietic progenitors isolated from chronic myelogenous leukemia (CML) patients in the chronic phase.  It may be a critical substrate for p210(bcr/abl), a chimeric protein whose presence is associated with CML.  Docking protein 1 contains a putative pleckstrin homology domain at the amino terminus and ten PXXP SH3 recognition motifs.  Docking protein 2 binds p120 (RasGAP) from CML cells.  It has been postulated to play a role in mitogenic signaling.
13161	
13162	
13163	
13164	
13165	Docking protein 1 is constitutively tyrosine phosphorylated in hematopoietic progenitors isolated from chronic myelogenous leukemia (CML) patients in the chronic phase.  It may be a critical substrate for p210(bcr/abl), a chimeric protein whose presence is associated with CML.  Docking protein 1 contains a putative pleckstrin homology domain at the amino terminus and ten PXXP SH3 recognition motifs.  Docking protein 2 binds p120 (RasGAP) from CML cells.  It has been postulated to play a role in mitogenic signaling.
13166	
13167	
13168	
13169	
13170	
13171	
13172	Ubiquitin is a highly conserved nuclear and cytoplasmic protein that has a major role in targeting cellular proteins for degradation by the 26S proteosome. It is also involved in the maintenance of chromatin structure, the regulation of gene expression, and the stress response. Ubiquitin is synthesized as a precursor protein consisting of either polyubiquitin chains or a single ubiquitin moiety fused to an unrelated protein. This gene encodes a fusion protein consisting of ubiquitin at the N terminus and ribosomal protein L40 at the C terminus. When expressed in yeast, the protein is post-translationally processed, generating free ubiquitin monomer and the 52-amino acid ribosomal protein L40. Ribosomal protein L40 is a component of the 60S subunit of the ribosome and belongs to the L40E family of ribosomal proteins. It contains a zinc finger-like domain and is located in the cytoplasm. Multiple processed pseudogenes derived from this gene are present in the genome. As with ribosomal protein L40, ribosomal protein S27a is also synthesized as a fusion protein with ubiquitin; similarly, ribosomal protein S30 is synthesized as a fusion protein with the ubiquitin-like protein fubi.
13173	Ubiquitin is a highly conserved nuclear and cytoplasmic protein that has a major role in targeting cellular proteins for degradation by the 26S proteosome. It is also involved in the maintenance of chromatin structure, the regulation of gene expression, and the stress response. Ubiquitin is synthesized as a precursor protein consisting of either polyubiquitin chains or a single ubiquitin moiety fused to an unrelated protein. This gene encodes a fusion protein consisting of ubiquitin at the N terminus and ribosomal protein L40 at the C terminus. When expressed in yeast, the protein is post-translationally processed, generating free ubiquitin monomer and the 52-amino acid ribosomal protein L40. Ribosomal protein L40 is a component of the 60S subunit of the ribosome and belongs to the L40E family of ribosomal proteins. It contains a zinc finger-like domain and is located in the cytoplasm. Multiple processed pseudogenes derived from this gene are present in the genome. As with ribosomal protein L40, ribosomal protein S27a is also synthesized as a fusion protein with ubiquitin; similarly, ribosomal protein S30 is synthesized as a fusion protein with the ubiquitin-like protein fubi.
13174	
13175	
13176	
13177	This gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. This subunit, in combination with two other polymerase subunits, forms the DNA binding domain of the polymerase, a groove in which the DNA template is transcribed into RNA. The product of this gene has two zinc finger motifs with conserved cysteines and the subunit does possess zinc binding activity.
13178	
13179	
13180	
13181	
13182	This gene encodes a translocase with similarity to yeast mitochondrial proteins that are involved in the import of metabolite transporters from the cytoplasm and into the mitochondrial inner membrane. The encoded protein and the TIMM8a protein form a 70 kDa complex in the intermembrane space. This gene is in a head-to-tail orientation with the gene for lamin B2.
13183	
13184	
13185	
13186	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, and White). This encoded protein is a member of the ABC1 subfamily, which is the only major ABC subfamily found exclusively in multicellular eukaryotes. Alternative splicing of this gene results in multiple transcript variants.
13187	
13188	
13189	
13190	
13191	
13192	The product of this gene belongs to the integrin alpha chain family of proteins. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. This gene encodes an alpha 4 chain. Unlike other integrin alpha chains, alpha 4 neither contains an I-domain, nor undergoes disulfide-linked cleavage. Alpha 4 chain associates with either beta 1 chain or beta 7 chain.
13193	The gene product is an immunoglobulin-type protein. It is found to be expressed strongly in endothelial cells, cytotrophoblasts, and poorly differentiated colon adenocarcinoma cells found in lymphatics. The protein contains a heparin-binding domain and mediates heparin-sensitive cell adhesion.
13194	
13195	
13196	GEMIN6 is part of a large macromolecular complex, localized to both the cytoplasm and the nucleus, that plays a role in the cytoplasmic assembly of small nuclear ribonucleoproteins (snRNPs). Other members of this complex include SMN (MIM 600354), GEMIN2 (SIP1; MIM 602595), GEMIN3 (DDX20; MIM 606168), GEMIN4 (MIM 606969), and GEMIN5 (MIM 607005).[supplied by OMIM]
13197	
13198	The protein encoded by this gene is a DNA binding and apoptosis-inducing protein and is localized in the nucleus. It is also a Rac3-interacting protein which acts as a corepressor for the thyroid hormone receptor. This protein is thought to regulate TRAX/Translin complex formation. Several alternatively spliced transcript variants of this gene have been described, but the full length nature of some of these variants has not been determined.
13199	
13200	The protein encoded by this gene is a member of the interleukin 1 receptor family. An experiment with transient gene expression demonstrated that this receptor was incapable of binding to interleukin 1 alpha and interleukin 1 beta with high affinity. This gene and four other interleukin 1 receptor family genes, including interleukin 1 receptor, type I (IL1R1), interleukin 1 receptor, type II (IL1R2), interleukin 1 receptor-like 1 (IL1RL1), and interleukin 18 receptor 1 (IL18R1), form a cytokine receptor gene cluster in a region mapped to chromosome 2q12.
13201	
13202	
13203	
13204	
13205	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). There are three separate genes which encode the subunit c of the proton channel and they specify precursors with different import sequences but identical mature proteins. The protein encoded by this gene is one of three precursors of subunit c. Alternatively spliced transcript variants encoding the same isoform have been identified.
13206	
13207	The protein encoded by this gene contains a RING-H2 finger, a motif known to be involved in protein-protein and protein-DNA interactions. This gene is highly expressed in normal cerebellum, but not in the cerebral cortex. The expression of the rat counterpart in the frontal cortex and hippocampus was shown to be induced by elctroconvulsive treatment (ECT) as well as chronic antidepressant treatment, suggesting that this gene may be a molecular target for ECT and antidepressants.
13208	In mammals, histamine is metabolized by two major pathways: N(tau)-methylation via histamine N-methyltransferase and oxidative deamination via diamine oxidase. This gene encodes the first enzyme which is found in the cytosol and uses S-adenosyl-L-methionine as the methyl donor. In the mammalian brain, the neurotransmitter activity of histamine is controlled by N(tau)-methylation as diamine oxidase is not found in the central nervous system. A common genetic polymorphism affects the activity levels of this gene product in red blood cells.
13209	
13210	The protein encoded by this gene is a homodimeric integral membrane gelatinase belonging to the serine protease family. It is selectively expressed in reactive stromal fibroblasts of epithelial cancers, granulation tissue of healing wounds, and malignant cells of bone and soft tissue sarcomas. This protein is thought to be involved in the control of fibroblast growth or epithelial-mesenchymal interactions during development, tissue repair, and epithelial carcinogenesis.
13211	
13212	This gene product belongs to the neurexin family, members of which function in the vertebrate nervous system as cell adhesion molecules and receptors. This protein, like other neurexin proteins, contains epidermal growth factor repeats and laminin G domains. In addition, it includes an F5/8 type C domain, discoidin/neuropilin- and fibrinogen-like domains, and thrombospondin N-terminal-like domains. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
13213	The product of this gene belongs to the phosphotyrosine protein phosphatase family of proteins. It functions as an acid phosphatase and a protein tyrosine phosphatase by hydrolyzing protein tyrosine phosphate to protein tyrosine and orthophosphate. This enzyme also hydrolyzes orthophosphoric monoesters to alcohol and orthophosphate. This gene is genetically polymorphic, and three common alleles segregating at the corresponding locus give rise to six phenotypes. Each allele appears to encode at least two electrophoretically different isozymes, Bf and Bs, which are produced in allele-specific ratios. Three transcript variants encoding distinct isoforms have been identified for this gene.
13214	
13215	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation. Multiple alternatively spliced transcript variants have been described for this gene but some of the full length sequences are not known.
13216	
13217	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, and it is activated by Myc protein.
13218	
13219	GABA is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA-A receptors, which are ligand-gated chloride channels. Chloride conductance of these channels can be modulated by agents such as benzodiazepines that bind to the GABA-A receptor. At least 16 distinct subunits of GABA-A receptors have been identified.
13220	Kynureninase is a pyridoxal-5@#$%&-phosphate (pyridoxal-P) dependent enzyme that catalyses the cleavage of L-kynurenine and L-3-hydroxykynurenine into anthranilic and 3-hydroxyanthranilic acids, respectively. Kynureninase is involved in the biosynthesis of NAD cofactors from tryptophan through the kynurenine pathway.
13221	
13222	This gene was identified based on its similarity with the mouse counterpart. Studies of the mouse counterpart suggest that the expression of this gene may be regulated during oocyte maturation and preimplantation following zygotic gene activation.  Alternatively spliced transcript variants encoding distinct isoforms have been observed.
13223	This gene belongs to the family of reticulon encoding genes. Reticulons are associated with the endoplasmic reticulum, and are involved in neuroendocrine secretion or in membrane trafficking in neuroendocrine cells. The product of this gene is a potent neurite outgrowth inhibitor which may also help block the regeneration of the central nervous system in higher vertebrates. Alternatively spliced transcript variants derived both from differential splicing and differential promoter usage and encoding different isoforms have been identified.
13224	
13225	
13226	
13227	The protein encoded by this gene is a cytoplasmic nonreceptor protein kinase which may function as an intracellular signal transducer in epithelial tissues. Overexpression of this gene in mammary epithelial cells leads to sensitization of the cells to epidermal growth factor and results in a partially transformed phenotype. Expression of this gene has been detected at low levels in some breast tumors but not in normal breast tissue. The encoded protein has been shown to undergo autophosphorylation.
13228	
13229	
13230	
13231	The protein encoded by this gene is involved in the import of precursor proteins into mitochondria. The encoded protein has a chaperone-like activity, binding the mature portion of unfolded proteins and aiding their import into mitochondria. This protein, which is found in the cytoplasm and sometimes associated with the outer mitochondrial membrane, has a weak ATPase activity and contains 6 TPR repeats.
13232	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptor has been found to be essential in mediating a broad variety of immune and inflammatory responses including T cell-dependent immunoglobulin class switching, memory B cell development, and germinal center formation. AT-hook transcription factor AKNA is reported to coordinately regulate the expression of this receptor and its ligand, which may be important for homotypic cell interactions. Adaptor protein TNFR2 interacts with this receptor and serves as a mediator of the signal transduction. The interaction of this receptor and its ligand is found to be necessary for amyloid-beta-induced microglial activation, and thus is thought to be an early event in Alzheimer disease pathogenesis. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
13233	
13234	
13235	Glucose transporters are integral membrane glycoproteins involved in transporting glucose into most cells. GLUT1 is a major glucose transporter in the mammalian blood-brain barrier. It is present at high levels in primate erythrocytes and brain endothelial cells.[supplied by OMIM]
13236	Transglutaminases are enzymes that catalyze the crosslinking of proteins by epsilon-gamma glutamyl lysine isopeptide bonds. While the primary structure of transglutaminases is not conserved, they all have the same amino acid sequence at their active sites and their activity is calcium-dependent. The protein encoded by this gene consists of two polypeptide chains activated from a single precursor protein by proteolysis. The encoded protein is involved the later stages of cell envelope formation in the epidermis and hair follicle.
13237	
13238	Chromogranin B is a tyrosine-sulfated secretory protein found in a wide variety of peptidergic endocrine cells. Chromogranin functions as a neuroendocrine secretory granule protein which may be the precursor for other biologically active peptides.
13239	
13240	Gamma-glutamyl transpeptidase is a membrane-bound protein that is important in the metabolism of glutathione. The protein encoded by this gene is similar in sequence to several members of the gamma-glutamyl transpeptidase family. Three transcript variants encoding the same protein have been found for this gene.
13241	
13242	Dystrophin is a large, rod-like cytoskeletal protein found at the inner surface of muscle fibers. Dystrophin is missing in Duchenne Muscular Dystrophy patients and is present in reduced amounts in Becker Muscular Dystrophy patients. The protein encoded by this gene is a peripheral membrane protein found associated with dystrophin and dystrophin-related proteins. This gene is a member of the syntrophin gene family, which contains at least two other structurally-related genes.
13243	
13244	
13245	The protein encoded by this gene is a type I membrane receptor for group B coxsackieviruses and subgroup C adenoviruses. Pseudogenes of this gene are found on chromosomes 15, 18, and 21.
13246	
13247	
13248	
13249	Chromatin assembly factor I (CAF-I) is required for the assembly of histone octamers onto newly-replicated DNA. CAF-I is composed of three protein subunits, p50, p60, and p150. The protein encoded by this gene corresponds to the p60 subunit and is required for chromatin assembly after replication. The encoded protein is differentially phosphorylated in a cell cycle-dependent manner. In addition, it is normally found in the nucleus except during mitosis, when it is released into the cytoplasm. This protein is a member of the WD-repeat HIR1 family and may also be involved in DNA repair.
13250	
13251	
13252	The protein encoded by this gene binds to calmodulin and is expressed in the centrosome. It is an integral component of the pericentriolar material (PCM). The function of this protein has not been determined.
13253	Members of the trefoil family are characterized by having at least one copy of the trefoil motif, a 40-amino acid domain that contains three conserved disulfides. They are stable secretory proteins expressed in gastrointestinal mucosa. Their functions are not defined, but they may protect the mucosa from insults, stabilize the mucus layer and affect healing of the epithelium. The encoded protein inhibits gastric acid secretion. This gene and two other related trefoil family member genes are found in a cluster on chromosome 21.
13254	
13255	The protein encoded by this gene was identified by its strong similarity with Saccharomyces cerevisiae Cdc45, an essential protein required to the initiation of DNA replication. Cdc45 is a member of the highly conserved multiprotein complex including Cdc6/Cdc18, the minichromosome maintenance proteins (MCMs) and DNA polymerase, which is important for early steps of DNA replication in eukaryotes. This protein has been shown to interact with MCM7 and DNA polymerase alpha. Studies of the similar gene in Xenopus suggested that this protein play a pivotal role in the loading of DNA polymerase alpha onto chromatin. Multiple polyadenlyation sites of this gene are reported.
13256	
13257	Leucine-zipper-like transcriptional regulator 1 is belived to be a DNA-binding protein and transcriptional regulator based on its predicted structural characteristics.  The transcript is present in several essential fetal organs and is hemizygously deleted in some DiGeorge syndrome patients. It is thought to play a critical role in embryogenesis.
13258	The protein encoded by this gene is a member of the PP2C family of Ser/Thr protein phosphatases. PP2C family members are known to be negative regulators of cell stress response pathways. This phosphatase can interact with Rho guanine nucleotide exchange factors (PIX), and thus block the effects of p21-activated kinase 1 (PAK), a protein kinase mediating biological effects downstream of Rho GTPases. Calcium/calmodulin-dependent protein kinase II gamma (CAMK2G/CAMK-II) is found to be one of the substrates of this phosphatase. The overexpression of this phosphatase or CAMK2G has been shown to mediate caspase-dependent apoptosis. An alternatively spliced transcript variant has been identified, but its full-length nature has not been determined.
13259	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. However, the enzyme encoded by this gene is activated intracellularly by furin within the constitutive secretory pathway. Also in contrast to other MMP@#$%&s, this enzyme cleaves alpha 1-proteinase inhibitor but weakly degrades structural proteins of the extracellular matrix.
13260	Oxysterols are byproducts of cholesterol that can have cytotoxic effects on many cell types. The protein encoded by this gene binds oxysterols such as 7-ketocholesterol and may inhibit their cytotoxicity. The encoded protein is membrane-bound. Two transcript variants encoding different isoforms have been found for this gene.
13261	
13262	
13263	
13264	
13265	SLC25A17 is a peroxisomal membrane protein that belongs to the family of mitochondrial solute carriers.[supplied by OMIM]
13266	
13267	The protein encoded by this gene catalyzes the transfer of galactose to lactosylceramide to form globotriaosylceramide, which has been identified as the P(k) antigen of the P blood group system. The encoded protein, which is a type II membrane protein found in the Golgi, is also required for the synthesis of the bacterial verotoxins receptor.
13268	
13269	
13270	The protein encoded by this gene is only expressed in the S and G2 phases of the cell cycle, where it colocalizes with cytoplasmic tubulin and microtubules. In response to DNA damage, the encoded protein accumulates in the nucleus and binds the tumor suppressor protein p53, shuttling it out of the nucleus and repressing its ability to induce apoptosis.
13271	NFE2 (MIM 601490), NFE2L1 (MIM 163260), and NFE2L2 comprise a family of human genes encoding basic leucine zipper (bZIP) transcription factors. They share highly conserved regions that are distinct from other bZIP families, such as JUN (MIM 165160) and FOS (MIM 164810), although remaining regions have diverged considerably from each other (Chan et al., 1995).[supplied by OMIM]
13272	
13273	
13274	
13275	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this DEAD box protein family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The function of this gene product which is a member of this family, has not been determined.
13276	
13277	
13278	
13279	
13280	
13281	
13282	
13283	Calcium-dependent membrane-binding proteins may regulate molecular events at the interface of the cell membrane and cytoplasm. This gene is one of several genes that encode a calcium-dependent protein containing two N-terminal type II C2 domains and an integrin A domain-like sequence in the C-terminus.
13284	
13285	This gene encodes a protein that has been directly linked to Bardet-Biedl syndrome. The primary features of this syndrome include retinal dystrophy, obesity, polydactyly, renal abnormalities and learning disabilities. Experimentation in non-human eukaryotes suggests that this gene is expressed in ciliated cells and that it is required for the formation of cilia. Alternate transcriptional splice variants have been observed but have not been fully characterized.
13286	The protein encoded by this gene is a zinc finger transcription factor and contains an N-terminal POZ domain. This protein acts as a sequence-specific repressor of transcription, and has been shown to modulate the transcription of START-dependent IL-4 responses of B cells. This protein can interact with a variety of POZ-containing proteins that function as transcription corepressors. This gene is found to be frequently translocated and hypermutated in diffuse large-cell lymphoma (DLCL), and may be involved in the pathogenesis of DLCL. Two alternatively spliced transcript variants that encode the identical protein have been reported for this gene.
13287	Aminoacylase-1 is a cytosolic, homodimeric, zinc-binding enzyme that catalyzes the hydrolysis of acylated L-amino acids to L-amino acids and acyl group, and has been postulated to function in the catabolism and salvage of acylated amino acids.  ACY1 has been assigned to chromosome 3p21.1, a region reduced to homozygosity in small-cell lung cancer (SCLC), and its expression has been reported to be reduced or undetectable in SCLC cell lines and tumors.  The amino acid sequence of human aminoacylase-1 is highly homologous to the porcine counterpart, and ACY1 is the first member of a new family of zinc-binding enzymes.
13288	
13289	
13290	This gene encodes a protein which is similar in structure to hyaluronidases. Hyaluronidases intracellularly degrade hyaluronan, one of the major glycosaminoglycans of the extracellular matrix. Hyaluronan is thought to be involved in cell proliferation, migration and differentiation. Varying functions have been described for this protein. It has been described as a lysosomal hyaluronidase which is active at a pH below 4 and specifically hydrolyzes high molecular weight hyaluronan. It has also been described as a GPI-anchored cell surface protein which does not display hyaluronidase activity but does serve as a receptor for the oncogenic virus Jaagsiekte sheep retrovirus. The gene is one of several related genes in a region of chromosome 3p21.3 associated with tumor suppression. This gene encodes two alternatively spliced transcript variants which differ only in the 5@#$%& UTR.
13291	
13292	Retinitis pigmentosa is an inherited progressive disease which is a major cause of blindness in western communities. It can be inherited as an autosomal dominant, autosomal recessive, or X-linked recessive disorder. In the autosomal dominant form,which comprises about 25% of total cases, approximately 30% of families have mutations in the gene encoding the rod photoreceptor-specific protein rhodopsin. This is the transmembrane protein which, when photoexcited, initiates the visual transduction cascade. Defects in this gene are also one of the causes of congenital stationary night blindness.
13293	
13294	
13295	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation.
13296	
13297	
13298	The protein encoded by this gene is a member of serine/threonine protein kinase family. This kinase contains a dual leucine-zipper motif, and has been shown to form dimers/oligomers through its leucine-zipper motif. This kinase can phosphorylate and activate MAPK8/JNK, MAP2K7/MKK7, which suggests a role in the JNK signaling pathway.
13299	This gene encodes the large subunit of  3-methylcrotonyl-CoA carboxylase. This enzyme functions as a heterodimer and catalyzes the carboxylation of 3-methylcrotonyl-CoA to form 3-methylglutaconyl-CoA. Mutations in this gene are associated with 3-Methylcrotonylglycinuria, an  autosomal recessive disorder of leucine catabolism.
13300	
13301	
13302	
13303	The protein encoded by this gene belongs to peptidase family S1. It is first synthesized as an inactive single-chain precursor before being activated to a heterodimeric form by endoproteolytic processing. It acts as serine protease that converts hepatocyte growth factor to the active form.
13304	
13305	The protein encoded by this gene is a member of the Ras superfamily of small GTPases. Expression of a chimeric transcript of LAZ3 and this gene has been reported as a result of the translocation t(3;4) in non-Hodgkin@#$%&s lymphomas. This gene encodes a small G-like protein, and unlike most other small G proteins which are expressed ubiquitously, this gene is transcribed only in hemopoietic cells.
13306	
13307	
13308	Plasma prekallikrein is a glycoprotein that participates in the surface-dependent activation of blood coagulation, fibrinolysis, kinin generation and inflammation.  It is synthesized in the liver and secreted into the blood as a single polypeptide chain.  Plasma prekallikrein is converted to plasma kallikrein by factor XIIa by the cleavage of an internal Arg-Ile bond.  Plasma kallikrein therefore is composed of a heavy chain and a light chain held together by a disulphide bond.  The heavy chain originates from the amino-terminal end of the zymogen and contains 4 tandem repeats of 90 or 91 amino acids.  Each repeat harbors a novel structure called the apple domain.  The heavy chain is required for the surface-dependent pro-coagulant activity of plasma kallikrein.  The light chain contains the active site or catalytic domain of the enzyme and is homologous to the trypsin family of serine proteases. Plasma prekallikrein deficiency causes a prolonged activated partial thromboplastin time in patients.
13309	
13310	This gene encodes alpha-fetoprotein, a major plasma protein produced by the yolk sac and the liver during fetal life. Alpha-fetoprotein expression in adults is often associated with hepatoma or teratoma. However, hereditary persistance of alpha-fetoprotein may also be found in individuals with no obvious pathology. The protein is thought to be the fetal counterpart of serum albumin, and the alpha-fetoprotein and albumin genes are present in tandem in the same transcriptional orientation on chromosome 4. Alpha-fetoprotein is found in monomeric as well as dimeric and trimeric forms, and binds copper, nickel, fatty acids and bilirubin. The level of alpha-fetoprotein in amniotic fluid is used to measure renal loss of protein to screen for spina bifida and anencephaly.
13311	
13312	
13313	This gene encodes a member of the metallocarboxypeptidase family. This enzyme displays carboxypeptidase activity towards substrates with basic C-terminal residues. It is most active at neutral pH and is inhibited by active site-directed inhibitors of metallocarboxypeptidases. Alternative splicing in the coding region results in multiple transcript variants encoding different isoforms.
13314	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21; however, this gene is located at 4p16. This protein, in addition to binding Ca2+, also binds Zn2+ and Mg2+. This protein may play a role in the etiology of prostate cancer.
13315	This gene is a member of the H/ACA snoRNPs (small nucleolar ribonucleoproteins) gene family. snoRNPs are involved in various aspects of rRNA processing and modification and have been classified into two families: C/D and H/ACA. The H/ACA snoRNPs also include the DKC1, NOLA2 and NOLA3 proteins. These four H/ACA snoRNP proteins localize to the dense fibrillar components of nucleoli and to coiled (Cajal) bodies in the nucleus. Both 18S rRNA production and rRNA pseudouridylation are impaired if any one of the four proteins is depleted. These four H/ACA snoRNP proteins are also components of the telomerase complex. The encoded protein of this gene contains two glycine- and arginine-rich domains and is related to Saccharomyces cerevisiae Gar1p. Two splice variants have been found for this gene.
13316	
13317	The protein encoded by this gene is a member of neuron-specific calcium-binding proteins family found in the retina and brain. It is highly similar to human hippocalcin protein and nearly identical to the rat and mouse hippocalcin like-1 proteins. It may be involved in the calcium-dependent regulation of rhodopsin phosphorylation and may be of relevance for neuronal signalling in the central nervous system. There are two alternatively spliced transcript variants of this gene, with multiple polyadenylation sites.
13318	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. In contrast to cyclin A1, which is present only in germ cells, this cyclin is expressed in all tissues tested. This cyclin binds and activates CDC2 or CDK2 kinases, and thus promotes both cell cycle G1/S and G2/M transitions.
13319	
13320	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are nucleic acid binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It localizes to both the nucleus and the cytoplasm. This protein is implicated in the regulation of mRNA stability. Alternative splicing of this gene results in four transcript variants.
13321	This gene encodes class I alcohol dehydrogenase, gamma subunit, which is a member of the alcohol dehydrogenase family. Members of this enzyme family metabolize a wide variety of substrates, including ethanol, retinol, other aliphatic alcohols, hydroxysteroids, and lipid peroxidation products. Class I alcohol dehydrogenase, consisting of several homo- and heterodimers of alpha, beta, and gamma subunits, exhibits high activity for ethanol oxidation and plays a major role in ethanol catabolism. Three genes encoding alpha, beta and gamma subunits are tandemly organized in a genomic segment as a gene cluster.
13322	
13323	
13324	The protein encoded by this gene is a member of the platelet-derived growth factor family. The four members of this family are mitogenic factors for cells of mesenchymal origin and are characterized by a core motif of eight cysteines. This gene product appears to form only homodimers. It differs from the platelet-derived growth factor alpha and beta polypeptides in having an unusual N-terminal domain, the CUB domain.
13325	The UGTs are of major importance in the conjugation and subsequent elimination of potentially toxic xenobiotics and endogenous compounds. UGT2B8 demonstrates reactivity with estriol. See UGT2B4 (MIM 600067).[supplied by OMIM]
13326	
13327	
13328	This gene encodes a protein with multiple ankyrin repeat domains and a single KH-domain. Co-transcription of this gene and the neighboring downstream gene (EIF4EBP3) generates a transcript (MASK-BP3) which encodes a fusion protein comprised of the MASK protein sequence for the majority of the protein and a different C-terminus due to an alternate reading frame for the EIF4EBP3 segments.
13329	KHK encodes the gene ketohexokinase that catalyzes conversion of fructose to fructose-1-phosphate.  The splice variant presented encodes the highly active form found in liver, renal cortex, and small intestine, while the alternate variant encodes the lower activity form found in most other tissues.
13330	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
13331	
13332	There are believed to be over 100 different glycosyltransferases involved in the synthesis of protein-bound and lipid-bound oligosaccharides. UDP-N-acetylglucosamine:alpha-3-D-mannoside beta-1,2-N-acetylglucosaminyltransferase I is a medial-Golgi enzyme essential for the synthesis of hybrid and complex N-glycans.  The protein, encoded by a single exon, shows typical features of a type II transmembrane protein. The protein is believed to be essential for normal embryogenesis.
13333	
13334	
13335	
13336	This gene encodes a WD repeat protein, which interacts with Cockayne syndrome type B (CSB) protein and with p44 protein, a subunit of the RNA polymerase II transcription factor IIH. Mutations in this gene have been identified in patients with hereditary disease Cockayne syndrome (CS). The CS cells are abnormally sensitive to ultraviolet radiation and are defective in the repair of transcriptionally active genes. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
13337	
13338	
13339	
13340	
13341	
13342	
13343	The eukaryotic cell cycle is governed by cyclin-dependent protein kinases (CDKs) whose activities are regulated by cyclins and CDK inhibitors. The protein encoded by this gene is a member of the cyclin family and contains the cyclin box. The encoded protein lacks the protein destabilizing (PEST) sequence that is present in other family members. Transcriptional activation of this gene can be induced by tumor protein p53. Two transcript variants encoding the same protein have been identified for this gene.
13344	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme functions in the ubiquitination of the tumor-suppressor protein p53, which is induced by an E3 ubiquitin-protein ligase. Two alternatively spliced transcript variants have been found for this gene and they encode distinct isoforms.
13345	
13346	This gene encodes a member of the RIEG/PITX homeobox family, which is in the bicoid class of homeodomain proteins. Members of this family are involved in organ development and left-right asymmetry. This protein acts as a transcriptional regulator involved in basal and hormone-regulated activity of prolactin.
13347	
13348	
13349	
13350	
13351	
13352	This gene encodes a protein which contains a conserved nuclear localization signal. The encoded protein may have a role in spermatogenesis, cell cycle control, and in meiotic cell division.
13353	
13354	
13355	The transport of protein and large RNAs through the nuclear pore complexes (NPC) is an energy-dependent and regulated process. The import of proteins with a nuclear localization signal (NLS) is accomplished by recognition of one or more clusters of basic amino acids by the importin-alpha/beta complex; see MIM 600685 and MIM 602738. The small GTPase RAN (MIM 601179) plays a key role in NLS-dependent protein import. RAN-binding protein-17 is a member of the importin-beta superfamily of nuclear transport receptors.[supplied by OMIM]
13356	Coagulation factor II (thrombin) receptor-like 1 (F2RL1) is a member of the large family of 7-transmembrane-region receptors that couple to guanosine-nucleotide-binding proteins. F2RL1 is also a member of the protease-activated receptor family. It is activated by trypsin, but not by thrombin. It is activated by proteolytic cleavage of its extracellular amino terminus. The new amino terminus functions as a tethered ligand and activates the receptor. The F2RL1 gene contains two exons and is widely expressed in human tissues. The predicted protein sequence is 83% identical to the mouse receptor sequence.
13357	
13358	PIST is a PDZ domain-containing Golgi protein. PDZ domains contain approximately 90 amino acids and bind the extreme C terminus of proteins in a sequence-specific manner.[supplied by OMIM]
13359	This protein is highly similar to FLASH, a mouse apoptotic protein identified by its interaction with the death-effector domain (DED) of caspase 8. Studies of FLASH protein suggested that this protein may be a component of the death-inducing signaling complex that includes Fas receptor, Fas-binding adapter FADD, and caspase 8, and plays a regulatory role in Fas-mediated apoptosis.
13360	
13361	
13362	TRAM2 is a component of the translocon, a gated macromolecular channel that controls the posttranslational processing of nascent secretory and membrane proteins at the endoplasmic reticulum (ER) membrane.[supplied by OMIM]
13363	
13364	This gene encodes a member of the 1-acylglycerol-3-phosphate O-acyltransferase family. This integral membrane protein converts lysophosphatidic acid to phosphatidic acid, the second step in de novo phospholipid biosynthesis. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
13365	
13366	
13367	Molybdenum cofactor biosynthesis is a conserved pathway leading to the biological activation of molybdenum. The protein encoded by this gene is involved in molybdenum cofactor biosynthesis. This gene produces a bicistronic mRNA that has the potential to produce two proteins (MOCS1A and MOCS1B) from adjacent open reading frames. However, only the first open reading frame (MOCS1A) has been found to encode a protein from the bicistronic mRNA. Three splice variants have been found for this gene, two of which are monocistronic and express proteins (MOCS1A-MOCS1B) that result from a fusion between the two open reading frames. This gene is defective in patients with molybdenum cofactor deficiency, type A.
13368	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MDR/TAP subfamily. Members of the MDR/TAP subfamily are involved in multidrug resistance. The protein encoded by this gene is involved in the pumping of degraded cytosolic peptides across the endoplasmic reticulum into the membrane-bound compartment where class I molecules assemble. Mutations in this gene may be associated with ankylosing spondylitis, insulin-dependent diabetes mellitus, and celiac disease.
13369	This gene encodes a zinc-finger transcriptional regulating protein which interacts with CBP/p300 to regulate the human gene CYP11A1. Alternative splicing of this gene results in three transcript variants.
13370	
13371	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6p22-p21.3.
13372	
13373	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
13374	HLA-E belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-E binds a restricted subset of peptides derived from the leader peptides of other class I molecules. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail.
13375	
13376	
13377	
13378	Sm-like proteins were identified in a variety of organisms based on sequence homology with the Sm protein family (see SNRPD2; MIM 601061). Sm-like proteins contain the Sm sequence motif, which consists of 2 regions separated by a linker of variable length that folds as a loop. The Sm-like proteins are thought to form a stable heteromer present in tri-snRNP particles, which are important for pre-mRNA splicing.[supplied by OMIM]
13379	This gene encodes a member of the tenascin family of extracellular matrix proteins. The tenascins have anti-adhesive effects, as opposed to fibronectin which is adhesive. The specific function of this protein is unknown, but its deficiency has been associated with the connective tissue disorder Ehlers-Danlos syndrome. This gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6. It is one of four genes in this cluster which have been duplicated. The duplicated copy of this gene is incomplete and is a pseudogene which is transcribed but does not encode a protein. The structure of this gene is unusual in that it overlaps the CREBL1 and CYP21A2 genes at its 5@#$%& and 3@#$%& ends, respectively. The gene expresses two transcripts.
13380	HLA-DPB belongs to the HLA class II beta chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DPA) and a beta chain (DPB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa and its gene contains 6 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DP molecule both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to 4 different molecules.
13381	This gene encodes a member of the retinoid X receptor (RXR) family of nuclear receptors which are involved in mediating the effects of retinoic acid (RA). This receptor forms homodimers with the retinoic acid, thyroid hormone, and vitamin D receptors, increasing both DNA binding and transcriptional function on their respective response elements. The gene lies within the major histocompatibility complex (MHC) class II region on chromosome 6. An alternatively spliced transcript variant has been described, but its full length sequence has not been determined.
13382	
13383	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes a small subunit of TFIID that is present in all TFIID complexes and interacts with TBP. This subunit also interacts with another small subunit, TAF13, to form a heterodimer with a structure similar to the histone core structure.
13384	This gene encodes a potent cyclin-dependent kinase inhibitor. The encoded protein binds to and inhibits the activity of cyclin-CDK2 or -CDK4 complexes, and thus functions as a regulator of cell cycle progression at G1. The expression of this gene is tightly controlled by the tumor suppressor protein p53, through which this protein mediates the p53-dependent cell cycle G1 phase arrest in response to a variety of stress stimuli. This protein can interact with proliferating cell nuclear antigen (PCNA), a DNA polymerase accessory factor, and plays a regulatory role in S phase DNA replication and DNA damage repair. This protein was reported to be specifically cleaved by CASP3-like caspases, which thus leads to a dramatic activation of CDK2, and may be instrumental in the execution of apoptosis following caspase activation. Two alternatively spliced variants, which encode an identical protein, have been reported.
13385	
13386	
13387	
13388	This gene encodes a protein that binds RAN, a small GTP binding protein belonging to the RAS superfamily that is essential for the translocation of RNA and proteins through the nuclear pore complex. The protein encoded by this gene has also been shown to interact with several other proteins, including met proto-oncogene, homeodomain interacting protein kinase 2, androgen receptor, and cyclin-dependent kinase 11.
13389	The Drosophila @#$%&glial cells missing@#$%& (gcm) gene is thought to act as a binary switch between neuronal and glial cell determination. The gcm protein and mammalian gcm homologs contain a conserved N-terminal gcm motif that has DNA-binding activity. See GCMA (MIM 603715).[supplied by OMIM]
13390	
13391	
13392	
13393	This gene encodes the homolog of the Drosophila headcase protein, a highly basic, cytoplasmic protein that regulates the re-entry of imaginal cells into the mitotic cycle during adult morphogenesis. In Drosophila, the encoded protein also inhibits terminal branching of neighboring cells during tracheal development.
13394	
13395	Transcription factors containing a basic helix-loop-helix (bHLH) motif regulate expression of tissue-specific genes in a number of mammalian and insect systems. DNA-binding activity of the bHLH proteins is dependent on formation of homo- and/or heterodimers. Dominant-negative HLH proteins encoded by Id-related genes, such as ID4, also contain the HLH-dimerization domain but lack the DNA-binding basic domain. Consequently, Id proteins inhibit binding to DNA and transcriptional transactivation by heterodimerization with bHLH proteins (Pagliuca et al., 1995).[supplied by OMIM]
13396	
13397	This gene encodes a member of the plakin protein family of adhesion junction plaque proteins. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the full-length nature of some variants has not been defined. It has been known that some isoforms are expressed in neural and muscle tissue, anchoring neural intermediate filaments to the actin cytoskeleton, and some isoforms are expressed in epithelial tissue, anchoring keratin-containing intermediate filaments to hemidesmosomes. Consistent with the expression, mice defective for this gene show skin blistering and neurodegeneration.
13398	
13399	The protein encoded by this gene is a member of the kinesin-like protein family. Protein family members are microtubule-dependent molecular motors that transport organelles within cells and move chromosomes during cell division. However, the particular function of this gene product has not yet been determined. Two alternatively spliced transcript variants which encode products have been described. Other splice variants have been found that lack exon 2 and the initiation codon for translation.
13400	
13401	
13402	
13403	
13404	DEAD box proteins characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of the DEAD box protein family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The function of this gene product which is a member of this family, has not been determined.
13405	
13406	
13407	The APS protein, expressed in B lymphocytes, contains pleckstrin homology and src homology 2 (SH2) domains. In Burkitt lymphoma cell lines, it is tyrosine phosphorylated in response to B cell receptor stimulation. Because it binds Shc independent of stimulation and Grb2 after stimulation, it appears to play a role in signal transduction from the receptor to Shc/Grb2.
13408	
13409	In some non-lymphoid tissues, the unrearranged T cell receptor gamma (TRG@) locus is expressed. The resulting transcript contains a subset of the TRG@ gene segments and is shorter than TRG@ transcripts expressed in lymphoid tissues. This RefSeq record represents the unrearranged TRG@ locus transcript; the complete TRG@ locus is represented by the genomic RefSeq NG_001336. The transcript represented by this RefSeq has two open reading frames (ORFs) that encode different proteins. The downstream ORF is in the same frame as TRG@ and its protein product is similar to TRG@ proteins. The upstream ORF uses a different reading frame and encodes a novel protein.
13410	
13411	
13412	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
13413	
13414	
13415	
13416	
13417	
13418	
13419	The secreted protein encoded by this gene is involved in the formation of a gel matrix that encases ejaculated spermatozoa. Proteolysis by the prostate-specific antigen (PSA) breaks down the gel matrix and allows the spermatozoa to move more freely. The encoded protein is found in lesser abundance than a similar semenogelin protein. The genes encoding these two semenogelin proteins are found in a cluster on chromosome 20.
13420	This gene is a member of the paired box (PAX) family of transcription factors.  Members of this gene family typically contain a paired box domain, an octapeptide, and a paired-type homeodomain.  These genes play critical roles during fetal development and cancer growth.  The paired box gene 4 is involved in pancreatic islet development and mouse studies have demonstrated a role for this gene in differentiation of insulin-producing beta cells.
13421	The protein encoded by this gene is a homolog of yeast Bet1 protein which is a golgi-associated protein that participates in vesicular transport from the endoplasmic reticulum (ER) to the Golgi apparatus. Studies with the rat homolog suggest that it is localized in the vesicular structures, and may be involved in the docking process of ER-derived vesicles with the cis-Golgi membrane.
13422	
13423	
13424	
13425	
13426	The CLCN family of voltage-dependent chloride channel genes comprises nine members (CLCN1-7, Ka and Kb) which demonstrate quite diverse functional characteristics while sharing significant sequence homology. The protein encoded by this gene regulates the electric excitability of the skeletal muscle membrane. Mutations in this gene cause two forms of inherited human muscle disorders: recessive generalized myotonia congenita (Becker) and dominant myotonia (Thomsen).
13427	
13428	
13429	The Makorin ring finger protein-1 gene (MKRN1) is a highly transcribed, intron-containing source for a family of intronless mammalian genes encoding a novel class of zinc finger proteins. Phylogenetic analyses indicate that the MKRN1 gene is the ancestral founder of this gene family.[supplied by OMIM]
13430	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family, and is expressed in the thalamus. It encodes a protein which shows 96% amino acid identity to the mouse Wnt2 protein. Based on the map location of this gene in the 7q31 region, and the phenotype of the diminished social interaction in the knockout mouse, this gene is suggested as a strong candidate gene for autism, a prototypical pervasive development disorder.
13431	
13432	The protein encoded by this gene is one of at least two lipid transfer proteins found in human plasma. The encoded protein transfers phospholipids from triglyceride-rich lipoproteins to high density lipoprotein (HDL). In addition to regulating the size of HDL particles, this protein may be involved in cholesterol metabolism. At least two transcript variants encoding different isoforms have been found for this gene.
13433	
13434	
13435	This gene encodes the fatty acid binding protein found in epidermal cells, and was first identified as being upregulated in psoriasis tissue. Fatty acid binding proteins are a family of small, highly conserved, cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. It is thought that FABPs roles include fatty acid uptake, transport, and metabolism.
13436	
13437	
13438	ADP-ribosylation factors (ARFs) play an important role in intracellular vesicular trafficking. The protein encoded by this gene is involved in the activation of ARFs by accelerating replacement of bound GDP with GTP and is involved in Golgi transport. It contains a Sec7 domain, which may be responsible for its guanine-nucleotide exchange activity and also brefeldin A inhibition.
13439	Zinc is an essential cofactor for hundreds of enzymes. It is involved in protein, nucleic acid, carbohydrate, and lipid metabolism, as well as in the control of gene transcription, growth, development, and differentiation. SLC39A14 belongs to a subfamily of proteins that show structural characteristics of zinc transporters (Taylor and Nicholson, 2003).[supplied by OMIM]
13440	
13441	
13442	
13443	This gene seems to be intronless. It has high sequence similarity to the gene encoding acyl-malonyl condensing enzyme on chromosome 17.
13444	
13445	
13446	This gene encodes a member of the annexin family. Members of this calcium-dependent phospholipid-binding protein family play a role in the regulation of cellular growth and in signal transduction pathways. The specific function of this gene has not yet been determined; however, it is associated with the plasma membrane of undifferentiated, proliferating endothelial cells and differentiated villus enterocytes. Alternatively spliced transcript variants encoding different isoforms have been identified.
13447	
13448	This gene encodes a serine/threonine protein kinase. Although this gene product is related to serum- and glucocorticoid-induced protein kinase (SGK), this gene is not induced by serum or glucocorticoids. This gene is induced in response to signals that activate phosphatidylinositol 3-kinase, which is also true for SGK. Two alternate transcripts encoding two different isoforms have been described.
13449	This gene encodes the protein elongin C, which is a subunit of the transcription factor B (SIII) complex. The SIII complex is composed of elongins A/A2, B and C. It activates elongation by RNA polymerase II by suppressing transient pausing of the polymerase at many sites within transcription units. Elongin A functions as the transcriptionally active component of the SIII complex, whereas elongins B and C are regulatory subunits. Elongin A2 is specifically expressed in the testis, and capable of forming a stable complex with elongins B and C. The von Hippel-Lindau tumor suppressor protein binds to elongins B and C, and thereby inhibits transcription elongation.
13450	
13451	Naturally occurring antagonist for both ANGPT1 and TIE2; expressed only at the sites of vascular remodeling; similar to angiopoietin-1
13452	
13453	
13454	
13455	
13456	This gene encodes a protein with significant sequence similarity to the ligand binding domain of platelet-derived growth factor receptor beta. Mutations in this gene, or deletion of a chromosomal segment containing this gene, are associated with sporadic hepatocellular carcinomas, colorectal cancers, and non-small cell lung cancers. This suggests this gene product may function as a tumor suppressor.
13457	
13458	The ABL1 protooncogene encodes a cytoplasmic and nuclear protein tyrosine kinase that has been implicated in processes of cell differentiation, cell division, cell adhesion, and stress response.  Activity of c-Abl protein is negatively regulated by its SH3 domain, and deletion of the SH3 domain turns ABL1 into an oncogene.  The t(9;22) translocation results in the head-to-tail fusion of the BCR (MIM:151410) and ABL1 genes present in many cases of chronic myelogeneous leukemia.  The DNA-binding activity of the ubiquitously expressed ABL1 tyrosine kinase is regulated by CDC2-mediated phosphorylation, suggesting a cell cycle function for ABL1.  The ABL1 gene is expressed as either a 6- or 7-kb mRNA transcript, with alternatively spliced first exons spliced to the common exons 2-11.
13459	
13460	The protein encoded by this gene is a basic protein of the sperm head cytoskeleton. This protein contains kelch repeats and a BTB/POZ domain and is necessary for normal morphology during sperm differentiation.
13461	This gene encodes a protein that is involved in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is a glycolipid which contains three mannose molecules in its core backbone. The GPI-anchor is found on many blood cells and serves to anchor proteins to the cell surface. This protein is involved in the transfer of ethanolaminephosphate (EtNP) to the third mannose in GPI. At least two alternatively spliced transcripts encoding distinct isoforms have been found for this gene.
13462	
13463	
13464	
13465	
13466	
13467	
13468	
13469	
13470	
13471	
13472	
13473	The protein encoded by this gene is a member of the SWI/SNF family of proteins and is highly similar to the brahma protein of Drosophila. Members of this family have helicase and ATPase activities and are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI, which is required for transcriptional activation of genes normally repressed by chromatin. Two transcript variants encoding different isoforms have been found for this gene, which contains a trinucleotide repeat (CAG) length polymorphism.
13474	This gene encodes a protein that may regulate endothelial cell differentiation. It has been postulated that the protein functions as a bridging molecule that interconnects regulatory proteins and the basal transcriptional machinery, thereby modulating the transcription of genes involved in endothelial differentiation. This protein has also been found to act as a transcriptional coactivator by interconnecting the general transcription factor TATA element-binding protein (TBP) and gene-specific activators. Two alternatively spliced transcripts which encode distinct proteins have been found for this gene.
13475	This gene encodes a member of the Notch family. Members of this Type 1 transmembrane protein family share structural characteristics including an extracellular domain consisting of multiple epidermal growth factor-like (EGF) repeats, and an intracellular domain consisting of multiple, different domain types. Notch family members play a role in a variety of developmental processes by controlling cell fate decisions. The Notch signaling network is an evolutionarily conserved intercellular signaling pathway which regulates interactions between physically adjacent cells. In Drosophilia, notch interaction with its cell-bound ligands (delta, serrate) establishes an intercellular signaling pathway that plays a key role in development. Homologues of the notch-ligands have also been identified in human, but precise interactions between these ligands and the human notch homologues remain to be determined. This protein is cleaved in the trans-Golgi network, and presented on the cell surface as a heterodimer. This protein functions as a receptor for membrane bound ligands, and may play multiple roles during development.
13476	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this DEAD box protein family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a member of this family. The function of this member has not been determined. Alternative splicing of this gene generates 2 transcript variants.
13477	Defensins are cysteine-rich cationic polypeptides that are important in the immunologic response to invading microorganisms. The protein encoded by this gene is secreted and is a member of the beta defensin protein family. Beta defensin genes are found in several clusters throughout the genome, with this gene mapping to a cluster at 20p13. The encoded protein is highly similar to an epididymal-specific secretory protein (ESP13.2) from cynomolgus monkey.
13478	
13479	This gene is a novel gene isolated based on its expression in the human adrenal gland. The full-length protein encoded by this gene contains a UBA-domain (ubiquitin associated domain), which is a motif found in several proteins having connections to ubiquitin and the ubiquitination pathway. In addition, the protein contains a region similar to a domain found in members of the atrophin-1 family. The function of this protein has not been determined. Alternative splicing has been observed at this locus and three variants, each encoding a distinct isoform, have been characterized.
13480	
13481	The outer dense fibers are cytoskeletal structures that surround the axoneme in the middle piece and principal piece of the sperm tail. The fibers function in maintaining the elastic structure and recoil of the sperm tail as well as in protecting the tail from shear forces during epididymal transport and ejaculation. Defects in the outer dense fibers lead to abnormal sperm morphology and infertility. This gene encodes one of the major outer dense fiber proteins. Multiple protein isoforms are encoded by transcript variants of this gene; however, not all isoforms and variants have been fully described.
13482	
13483	
13484	This nuclear gene encodes a mitochondrial protein which belongs to FRATAXIN family. The protein functions in regulating mitochondrial iron transport and respiration. The expansion of intronic trinucleotide repeat GAA results in Friedreich ataxia. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
13485	
13486	The removal of introns from nuclear pre-mRNAs occurs on complexes called spliceosomes, which are made up of 4 small nuclear ribonucleoprotein (snRNP) particles and an undefined number of transiently associated splicing factors. PRPF4 is 1 of several proteins that associate with U4 and U6 snRNPs.[supplied by OMIM]
13487	
13488	Phosphomannose isomerase catalyzes the interconversion of fructose-6-phosphate and mannose-6-phosphate and plays a critical role in maintaining the supply of D-mannose derivatives, which are required for most glycosylation reactions.  Mutations in the MPI gene were found in patients with carbohydrate-deficient glycoprotein syndrome, type Ib.
13489	
13490	
13491	Clathrin is the main structural component of the lattice covering the cytoplasmic face of the coated pits and coated vesicles in which specific macromolecules are entrapped in the process of receptor-mediated endocytosis. Clathrin is a large, soluble protein composed of heavy chains and light chains. Two genes encode the clathrin light chain proteins. The light chains are believed to be regulatory elements in clathrin function. The brain-specific light chain variant contains an insert encoding 18 aa immediately downstream of the region encoding the heavy chain binding domain. Two transcript variants encoding different isoforms have been found for this gene.
13492	This enzyme is involved in arachidonic acid metabolism in some epithelial tissues. The protein is similar to human 5S-, 12S-, and 15S-lipoxygenases. This is a second human 15S lipoxygenase.
13493	The protein encoded by this gene is a member of the M14 family of zinc carboxypeptidases; however, the protein has no detectable carboxypeptidase activity. The encoded protein is thought to be an extracellular and/or membrane protein, and may be involved in cell-cell interactions.
13494	
13495	
13496	
13497	
13498	
13499	
13500	
13501	The protein encoded by this gene is a catalytic subunit of the AMP-activated protein kinase (AMPK). AMPK is a heterotrimer consisting of an alpha catalytic subunit, and non-catalytic beta and gamma subunits. AMPK is an important energy-sensing enzyme that monitors cellular energy status. In response to cellular metabolic stresses, AMPK is activated, and thus phosphorylates and inactivates acetyl-CoA carboxylase (ACC) and beta-hydroxy beta-methylglutaryl-CoA reductase (HMGCR), key enzymes involved in regulating de novo biosynthesis of fatty acid and cholesterol. Studies of the mouse counterpart suggest that this catalytic subunit may control whole-body insulin sensitivity and is necessary for maintaining myocardial energy homeostasis during ischemia.
13502	This gene encodes a protein thought to participate in programmed cell death. Studies using mouse cells have shown that overexpression of this protein can block apoptosis. In addition, the product of this gene binds to the product of the PDCD6 gene, a protein required for apoptosis, in a calcium-dependent manner.  This gene product also binds to endophilins, proteins that regulate membrane shape during endocytosis. Overexpression of this gene product and endophilins results in cytoplasmic vacuolization which may be partly responsible for the protection against cell death.
13503	
13504	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein and is similar in sequence to its family member CD53 antigen. It is known to complex with integrins and other transmembrane 4 superfamily proteins.
13505	
13506	
13507	
13508	
13509	The protein encoded by this gene is phosphorylated by ZAP-70/Syk protein tyrosine kinases following activation of the T-cell antigen receptor (TCR) signal transduction pathway. This transmembrane protein localizes to lipid rafts and acts as a docking site for SH2 domain-containing proteins. Upon phosphorylation, this protein recruits multiple adaptor proteins and downstream signaling molecules into multimolecular signaling complexes located near the site of TCR engagement. Alternative splicing results in multiple transcript variants encoding different isoforms.
13510	The protein encoded by this gene catalyzes the formation of inositol 1,4,5-trisphosphate and diacylglycerol from phosphatidylinositol 4,5-bisphosphate. This reaction uses calcium as a cofactor and plays an important role in the intracellular transduction of many extracellular signals. This gene is activated by two G-protein alpha subunits, alpha-q and alpha-11. Two transcript variants encoding different isoforms have been found for this gene.
13511	
13512	
13513	
13514	
13515	The protein encoded by this gene is a pleiotropic cytokine that functions as a chemoattractant, a modulator of T cell activation, and an inhibitor of HIV replication. The signaling process of this cytokine is mediated by CD4. The product of this gene undergoes proteolytic processing, which is found to yield two functional proteins. The cytokine function is exclusively attributed to the secreted C-terminal peptide, while the N-terminal product may play a role in cell cycle control. Caspase 3 is reported to be involved in the proteolytic processing of this protein. Two alternatively spliced transcript variants encoding distinct isoforms have been reported.
13516	
13517	
13518	
13519	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L6P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
13520	This gene encodes a member of the regulator of G-protein signaling family. This protein contains one RGS domain, two Raf-like Ras-binding domains (RBDs), and one GoLoco domain. The protein attenuates the signaling activity of G-proteins by binding, through its GoLoco domain, to specific types of activated, GTP-bound G alpha subunits. Acting as a GTPase activating protein (GAP), the protein increases the rate of conversion of the GTP to GDP. This hydrolysis allows the G alpha subunits to bind G beta/gamma subunit heterodimers, forming inactive G-protein heterotrimers, thereby terminating the signal. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
13521	This gene is a member of the inositol polyphosphate-5-phosphatase (INPP5) family and encodes a protein with an N-terminal SH2 domain, an inositol phosphatase domain, and two C-terminal protein interaction domains. Expression of this protein is restricted to hematopoietic cells where its movement from the cytosol to the plasma membrane is mediated by tyrosine phosphorylation. At the plasma membrane, the protein hydrolyzes the 5@#$%& phosphate from phosphatidylinositol (3,4,5)-trisphosphate and inositol-1,3,4,5-tetrakisphosphate, thereby affecting multiple signaling pathways. Overall, the protein functions as a negative regulator of myeliod cell proliferation and survival. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
13522	
13523	
13524	
13525	
13526	
13527	
13528	
13529	
13530	
13531	
13532	
13533	This protein is ubiquitous and highly conserved. It carries out the enzymatic reduction of methionine sulfoxide to methionine. Human and animal studies have shown the highest levels of expression in kidney and nervous tissue. Its proposed function is the repair of oxidative damage to proteins to restore biological activity.
13534	Electron-transferring-flavoprotein dehydrogenase in the inner mitochondrial membrane accepts electrons from electron-transfer flavoprotein which is located in the mitochondrial matrix and reduces ubiquinone in the mitochondrial membrane.  The protein is synthesized as a 67-kDa precursor which is targeted to mitochondria and processed in a single step to a 64-kDa mature form located in the mitochondrial membrane.  Deficiency in electron-transferring-flavoprotein dehydrogenase have been demonstrated in some patients with type II  glutaricacidemia.
13535	The protein encoded by this gene is a member of the scavenger receptor cysteine-rich (SRCR) superfamily. Members of this family are secreted or membrane-anchored proteins mainly found in cells associated with the immune systems. The SRCR family is defined by a 100-110 amino acid SRCR domain, which may mediate protein-protein interaction and ligand binding. The encoded protein contains twelve SRCR domains, a transmembrane region and a cytoplasmic domain. The expression of this gene is found to be restricted to the monocyte/macrophage. An alternatively spliced transcript variant for this gene has been reported. Its full-length nature, however, has not yet been determined.
13536	
13537	
13538	
13539	
13540	
13541	Pleiomorphic adenoma gene-like 2 is a zinc-finger protein that recognizes DNA and/or RNA.
13542	
13543	
13544	
13545	
13546	Synaptobrevins/VAMPs, syntaxins, and the 25-kD synaptosomal-associated protein SNAP25 are the main components of a protein complex involved in the docking and/or fusion of synaptic vesicles with the presynaptic membrane. The protein encoded by this gene is a member of the vesicle-associated membrane protein (VAMP)/synaptobrevin family. This protein may play a role in trans-Golgi network-to-endosome transport. Two alternatively spliced transcript variants encoding the same protein have been found for this gene.
13547	
13548	
13549	This gene encodes a tyrosine kinase that binds Cdc42Hs in its GTP-bound form and inhibits both the intrinsic and GTPase-activating protein (GAP)-stimulated GTPase activity of Cdc42Hs. This binding is mediated by a unique sequence of 47 amino acids C-terminal to an SH3 domain. The protein may be involved in a regulatory mechanism that sustains the GTP-bound active form of Cdc42Hs and which is directly linked to a tyrosine phosphorylation signal transduction pathway. Several alternatively spliced transcript variants have been identified from this gene, but the full-length nature of only two transcript variants has been determined.
13550	Histidine ammonia-lyase is a cytosolic enzyme catalyzing the first reaction in histidine catabolism, the nonoxidative deamination of L-histidine to trans-urocanic acid. Histidine ammonia-lyase defects cause histidinemia which is characterized by increased histidine and histamine and decreased urocanic acid in body fluids
13551	
13552	
13553	
13554	
13555	This gene encodes a kinase that specifically phosphorylates the N-terminal region of the non-collagenous domain of the alpha 3 chain of type IV collagen, known as the Goodpasture antigen. Goodpasture disease is the result of an autoimmune response directed at this antigen. One isoform of this protein is also involved in ceramide intracellular transport. Two transcripts exist for this gene.
13556	
13557	
13558	The protein encoded by this gene has been identified as a brain-specific angiogenesis inhibitor (BAI1)-binding protein.  This interaction at the cytoplasmic membrane is crucial to the function of this protein, which may be involved in neuronal growth-cone guidance.  This protein functions as an insulin receptor tyrosine kinase substrate and suggests a role for insulin in the central nervous system.  This protein has also been identified as interacting with the dentatorubral-pallidoluysian atrophy gene, which is associated with an autosomal dominant neurodegenerative disease.  It also associates with a downstream effector of Rho small G proteins, which is associated with the formation of stress fibers and cytokinesis.  Alternative splicing of the 3@#$%&-end of this gene results in three products of undetermined function.
13559	
13560	This gene is directly involved, with Myc and IgH, in a three-way gene translocation in a Burkitt lymphoma cell line. As a result of the gene translocation, the N-terminal region of the gene product is disrupted, which is thought to be related to the pathogenesis of a subset of high-grade B cell non-Hodgkin lymphoma. The N-terminal segment involved in the translocation includes the region that shares a strong sequence similarity with those of BCL7B and BCL7C.
13561	
13562	
13563	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility.
13564	
13565	
13566	The SCAN domain is a highly conserved, leucine-rich motif of approximately 60 aa originally found within a subfamily of zinc finger proteins. This gene belongs to a family of genes that encode an isolated SCAN domain, but no zinc finger motif. Functional studies have established that the SCAN box is a protein interaction domain that mediates both hetero- and homoprotein associations, and maybe involved in regulation of transcriptional activity. Two transcript variants with different 5@#$%& UTRs, but encoding the same protein, have been described for this gene.
13567	
13568	
13569	The protein encoded by this gene belongs to the thrombospondin family. It is a disulfide-linked homotrimeric glycoprotein that mediates cell-to-cell and cell-to-matrix interactions. This protein has been shown to function as a potent inhibitor of tumor growth and angiogenesis. Studies of the mouse counterpart suggest that this protein may modulate the cell surface properties of mesenchymal cells and be involved in cell adhesion and migration.
13570	
13571	SLC6A19 is a system B(0) transporter that mediates epithelial resorption of neutral amino acids across the apical membrane in the kidney and intestine.[supplied by OMIM]
13572	
13573	This gene is related to members of the MAGEC gene family. It is not expressed in normal tissues, except for testis, and is expressed in tumors of various histological types. This gene and the MAGEC genes are clustered on chromosome Xq26-q27.
13574	This gene belongs to the TIMP gene family. The proteins encoded by this gene family are natural inhibitors of the matrix metalloproteinases, a group of peptidases involved in degradation of the extracellular matrix. This gene located within intron 6 of the synapsin I gene and is transcribed in the opposite direction.  TIMP1 is highly inducible at the transcriptional level in response to many cytokines and hormones. In addition, the expression from some but not all inactive X chromosomes suggests that this gene inactivation is polymorphic in human females.
13575	This gene is a member of a gene family that encodes tyrosine phosphatases. Myotubularin is required for muscle cell differentiation and mutations in this gene have been identified as being responsible for X-linked myotubular myopathy.
13576	This gene is a member of a gene family that encodes tyrosine phosphatases. Myotubularin is required for muscle cell differentiation and mutations in this gene have been identified as being responsible for X-linked myotubular myopathy.
13577	PQBP1 is a nuclear polyglutamine-binding protein that contains a WW domain.[supplied by OMIM]
13578	
13579	
13580	
13581	Members of the dystrophin family of proteins perform a critical role in the maintenance of membrane-associated complexes at points of intercellular contact in vertebrate cells. Dystrophin related protein 2 is predicted to resemble certain short C-terminal isoforms of dystrophin and dystrophin-related protein 1 (DRP1 or utrophin).  DRP2 is expressed principally in the brain and spinal cord.
13582	The protein encoded by this gene is expressed on the surface of T cells. It regulates B cell function by engaging CD40 on the B cell surface. A defect in this gene results in an inability to undergo immunoglobulin class switch and is associated with hyper-IgM syndrome.
13583	
13584	
13585	Gastrin-releasing peptide (GRP) regulates numerous functions of the gastrointestinal and central nervous systems, including release of gastrointestinal hormones, smooth muscle cell contraction, and epithelial cell proliferation and is a potent mitogen for neoplastic tissues. The effects of GRP are mediated through the gastrin-releasing peptide receptor. This receptor is a glycosylated, 7-transmembrane G-protein coupled receptor that activates the phospholipase C signaling pathway. The receptor is aberrantly expressed in numerous cancers such as those of the lung, colon, and prostate. An individual with autism and multiple exostoses was found to have a balanced translocation between chromosome 8 and a chromosome X breakpoint located within the gastrin-releasing peptide receptor gene.
13586	
13587	
13588	This gene encodes a member of a family of neuronal cell surface proteins. Members of this family may act as splice site-specific ligands for beta-neurexins and may be involved in the formation and remodeling of central nervous system synapses. Mutations in this gene may be associated with autism and Asperger syndrome. Multiple transcript variants encoding distinct isoforms have been identified for this gene, but their full length sequences have not been determined.
13589	
13590	
13591	
13592	
13593	NUDT11 belongs to a subgroup of phosphohydrolases that preferentially attack diphosphoinositol polyphosphates (Hidaka et al., 2002).[supplied by OMIM]
13594	
13595	The protein encoded by this gene is a member of the Toll-like receptor (TLR) family which plays a fundamental role in pathogen recognition and activation of innate immunity. TLRs are highly conserved from Drosophila to humans and share structural and functional similarities. They recognize pathogen-associated molecular patterns (PAMPs) that are expressed on infectious agents, and mediate the production of cytokines necessary for the development of effective immunity. The various TLRs exhibit different patterns of expression. This gene is predominantly expressed in lung, placenta, and spleen, and lies in close proximity to another family member, TLR8, on chromosome X.
13596	This gene encodes a zinc finger-containing protein that may function as a transcription factor. This gene was once a candidate gene for the testis-determining factor (TDF) and was erroneously referred to as TDF.
13597	
13598	
13599	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L5P family of ribosomal proteins. It is located in the cytoplasm. The protein probably associates with the 5S rRNA. Alternative splice variants encoding different isoforms may exist, but they have not been fully characterized. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
13600	The protein encoded by this gene belongs to the class II cytokine receptor family, and has been shown to be a receptor for interleukin 22 (IL22). IL22 receptor is a protein complex that consists of this protein and interleukin 10 receptor, beta (IL10BR/CRFB4), a subunit also shared by the receptor complex for interleukin 10 (IL10). This gene and interleukin 28 receptor, alpha (IL28RA) form a cytokine receptor gene cluster in the chromosomal region 1p36.
13601	The protein encoded by this gene has a nuclear and a cytoplasmic form and is a member of both the dynamin family and the family of large GTPases. The nuclear form is localized in a granular pattern in the heterochromatin region beneath the nuclear envelope. A nuclear localization signal (NLS) is present at the amino terminal end of the nuclear form but is lacking in the cytoplasmic form due to use of an alternate translation start codon. This protein is upregulated by interferon-alpha but does not contain the antiviral activity of a similar myxovirus resistance protein 1.
13602	
13603	The protein encoded by this gene is a homolog of the protein transgelin, which is one of the earliest markers of differentiated smooth muscle. The function of this protein has not yet been determined.
13604	
13605	
13606	Palmitoyl-protein thioesterase (PPT) is a small glycoprotein that removes palmitate groups from cysteine residues in lipid-modified proteins. PPT is thought to be involved in the catabolism of lipid-modified proteins (Camp et al., 1994).[supplied by OMIM]
13607	
13608	
13609	
13610	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S14P family.
13611	
13612	
13613	
13614	
13615	
13616	
13617	
13618	
13619	
13620	
13621	
13622	Guanylate binding protein expression is induced by interferon.  Guanylate binding proteins are characterized by their ability to specifically bind guanine nucleotides (GMP, GDP, and GTP) and are distinguished from the GTP-binding proteins by the presence of 2 binding motifs rather than 3
13623	ATF6 is an endoplasmic reticulum (ER) stress-regulated transmembrane transcription factor that activates the transcription of ER molecules.[supplied by OMIM]
13624	The small G protein GDP dissociation stimulator (smg GDS) is a regulator protein having two activities on a group of small G proteins including the Rho and Rap1 family members and Ki-Ras; one is to stimulate their GDP/GTP exchange reactions, and the other is to inhibit their interactions with membranes. The protein encoded by this gene contains 9 @#$%&Armadillo@#$%& repeats and interacts with the smg GDS protein through these repeats. This protein, which is highly concentrated around the endoplasmic reticulum, is phosphorylated by v-src, and this phosphorylation reduces the affinity of the protein for smg GDS. It is thought that this protein serves as a linker between human chromosome-associated polypeptide (HCAP) and KIF3A/B, a kinesin superfamily protein in the nucleus, and that it plays a role in the interaction of chromosomes with an ATPase motor protein.
13625	
13626	
13627	
13628	This gene encodes a protein with multiple coiled coil motifs which is located in the nucleus, cytoplasm and mitochondria. The protein is involved in neurite outgrowth and cortical development through its interaction with other proteins. This gene is disrupted in a t(1;11)(q42.1;q14.3) translocation which segregates with schizophrenia and related psychiatric disorders in a large Scottish family. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
13629	
13630	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that also participates in apoptotic pathways which are initiated by tumor necrosis factor-alpha, Fas ligand, and gamma interferon. This protein potentially binds ATP/GTP and might be a functional partner of the mitoribosomal protein S27. Splice variants that differ in the 5@#$%& UTR have been found for this gene; both variants encode the same protein. Pseudogenes corresponding to this gene are found on chromosomes 1q and 2q.
13631	The protein encoded by this gene has sequence similarity to yeast longevity assurance gene-1 (Lag1). In yeast Lag1 is preferentially expressed in cells that have undergone a small number of cell divisions. Yeast lifespan can be increased or decreased by mutation or overexpression of this gene. The human protein may play a role in the regulation of cell growth. Alternative splicing occurs at this locus and three transcript variants encoding two distinct isoforms have been identified.
13632	
13633	
13634	
13635	Uroguanylin and guanylin (GUCA2A; MIM 139392), peptide homologs of the bacterial heat-stable enterotoxins (e.g., the E. coli ST toxin; STa), are endogenous activators of the guanylate cyclase-2C receptor (GUCY2C; MIM 601330), which synthesizes cyclic GMP (cGMP), a key component of several intracellular signal transduction pathways.[supplied by OMIM]
13636	
13637	This gene encodes the fifth enzyme of the heme biosynthetic pathway. This enzyme is responsible for catalyzing the conversion of uroporphyrinogen to coproporphyrinogen through the removal of four carboxymethyl side chains. Mutations and deficiency in this enzyme are known to cause familial porphyria cutanea tarda and hepatoerythropoetic porphyria.
13638	The encoded gene product presumably interacts with YY1 protein; however, its exact function is not known. Five transcript variants encoding different isoforms have been noted for this gene. Expression of one of the isoforms has been associated with hepatocellular carcinoma.
13639	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that also participates in apoptotic pathways which are initiated by tumor necrosis factor-alpha, Fas ligand, and gamma interferon. This protein potentially binds ATP/GTP and might be a functional partner of the mitoribosomal protein S27. Splice variants that differ in the 5@#$%& UTR have been found for this gene; both variants encode the same protein. Pseudogenes corresponding to this gene are found on chromosomes 1q and 2q.
13640	U2 auxiliary factor (U2AF), comprising a large and a small subunit, is a non-snRNP protein required for the binding of U2 snRNP to the pre-mRNA branch site. This gene encodes the small subunit which plays a critical role in both constitutive and enhancer-dependent RNA splicing by directly mediating interactions between the large subunit and proteins bound to the enhancers. This gene is localized to chromosome 21q22.3, which is the critical region for three diseases, progressive myoclonus epilepsy, autoimmune polyglandular disease type 1, and one form of bipolar affective disorder.
13641	
13642	
13643	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which has an ATPase activity and is a component of the survival of motor neurons (SMN) complex. This protein interacts directly with SMN, the spinal muscular atrophy gene product, and may play a catalytic role in the function of the SMN complex on RNPs.
13644	
13645	
13646	Control of transcription by RNA polymerase II involves the basal transcription machinery which is a collection of proteins. These proteins with RNA polymerase II, assemble into complexes which are modulated by transactivator proteins that bind to cis-regulatory elements located adjacent to the transcription start site. Some modulators interact directly with the basal complex, whereas others may act as bridging proteins linking transactivators to the basal transcription factors. Some of these associated factors are weakly attached while others are tightly associated with TBP in the TFIID complex. Among the latter are the TAF proteins. Different TAFs are predicted to mediate the function of distinct transcriptional activators for a variety of gene promoters and RNA polymerases. TAF12 interacts directly with TBP as well as with TAF2I.
13647	The N-ras oncogene is a member of the RAS gene family. It is mapped on chromosome 1, and it is activated in HL60, a promyelocytic leukemia line. The order of nearby genes is as follows: cen--CD2--NGFB--NRAS--tel. The mammalian ras gene family consists of the harvey and kirsten ras genes (c-Hras1 and c-Kras2), an inactive pseudogene of each (c-Hras2 and c-Kras1) and the N-ras gene. They differ significantly only in the C-terminal 40 amino acids. These ras genes have GTP/GDP binding and GTPase activity, and their normal function may be as G-like regulatory proteins involved in the normal control of cell growth. Mutations which change amino acid residues 12, 13 or 61 activate the potential of N-ras to transform cultured cells and are implicated in a variety of human tumors. The N-ras gene specifies two main transcripts of 2Kb and 4.3Kb. The difference between the two transcripts is a simple extension through the termination site of the 2Kb transcript. The N-ras gene consists of seven exons (-I, I, II, III, IV, V, VI). The smaller 2Kb transcript contains the VIa exon, and the larger 4.3Kb transcript contains the VIb exon which is just a longer form of the VIa exon. Both transcipts encode identical proteins as they differ only the 3@#$%& untranslated region. The sequence of the shorter 2Kb transcript is presented here. The 4.3 Kb transcript sequence is not available.
13648	This gene encodes a TBP- (TATA box-binding protein) associated phosphoprotein that represses both basal and activated levels of transcription. The encoded protein is phosphorylated in vivo and this phosphorylation affects its interaction with TBP. This protein contains a histone fold motif at the amino terminus, a TBP-binding domain, and a glutamine- and alanine-rich region. The binding of DR1 repressor complexes to TBP-promoter complexes may establish a mechanism in which an altered DNA conformation, together with the formation of higher order complexes, inhibits the assembly of the preinitiation complex and controls the rate of RNA polymerase II transcription.
13649	The protein encoded by this gene is a microsomal enzyme that catalyzes the conversion of the stress hormone cortisol to the inactive metabolite cortisone. In addition, the encoded protein can catalyze the reverse reaction, the conversion of cortisone to cortisol. Too much cortisol can lead to central obesity, and a particular variation in this gene has been associated with obesity and insulin resistance in children. Two transcript variants encoding the same protein have been found for this gene.
13650	This gene encodes a member of the claudin family. Claudins are integral membrane proteins and components of tight junction strands. Tight junction strands serve as a physical barrier to prevent solutes and water from passing freely through the paracellular space between epithelial or endothelial cell sheets. Mutations in this gene have been found in patients with velocardiofacial syndrome.
13651	The protein encoded by this gene appears to be a zinc finger protein containing a transposase domain at the C-terminus. This protein was found to interact with the transcription factor SP1 in a yeast two-hybrid system. At least three alternatively spliced transcript variants encoding distinct isoforms have been observed.
13652	
13653	
13654	
13655	
13656	
13657	The product encoded by this gene is a serine proteinase inhibitor which rapidly inhibits thrombin in the presence of dermatan sulfate or heparin. The gene contains five exons and four introns. This protein shares homology with antithrombin III and other members of the alpha 1-antitrypsin superfamily. Mutations in this gene are associated with heparin cofactor II deficiency.
13658	
13659	Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide.  They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid.  They show extensive diversity in tissue distribution and in their subcellular localization.  CA XIV is predicted to be a type I membrane protein and shares highest sequence similarity with the other transmembrane CA isoform, CA XII; however, they have different patterns of tissue-specific expression and thus may play different physiologic roles.
13660	
13661	Heterotrimeric G proteins transduce extracellular signals received by cell surface receptors into integrated cellular responses. GPSM2 belongs to a group of proteins that modulate activation of G proteins (Blumer et al., 2002).[supplied by OMIM]
13662	The protein encoded by this gene is a member of the rhomboid family of integral membrane proteins. This family contains proteins that are related to Drosophila rhomboid protein. Members of this family are found in both prokaryotes and eukaryotes and are thought to function as intramembrane serine proteases. The encoded protein is thought to release soluble growth factors by proteolytic cleavage of certain membrane-bound substrates, including ephrin B2 and ephrin B3.
13663	
13664	
13665	
13666	
13667	The Sec4/Rab-related small GTP-binding proteins are involved in the regulation of intracellular vesicular transport. Mss4 stimulates GTP-GDP exchange in Sec4 and Rab and binds to a subset of genetically related Rab proteins.[supplied by OMIM]
13668	
13669	
13670	
13671	
13672	The protein encoded by this gene is a member of the immunoglobulin superfamily. It is a glycosylphosphatidylinositol (GPI)-anchored neuronal membrane protein that functions as a cell adhesion molecule. It may play a role in the formation of axon connections in the developing nervous system. It may also be involved in glial tumorigenesis and may provide a potential target for therapeutic intervention.
13673	
13674	
13675	
13676	
13677	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel consists of three main subunits (a, b, c). This gene encodes the gamma subunit of the catalytic core. Alternatively spliced transcript variants encoding different isoforms have been identified. This gene also has a pseudogene on chromosome 14.
13678	
13679	
13680	This gene may belong to a protease family responsible for intercellular peptide signalling.  Though its role in the cellular processing of insulin has not yet been defined, insulin-degrading enzyme is thought to be involved in the termination of the insulin response.
13681	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. This gene and the gene for a semaphorin class 4 protein (SEMA4G) overlap at map location 10q24.31 and are transcribed in opposite directions. Sequence analysis identified multiple transcript variants encoding at least four different protein isoforms.
13682	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. This gene and the gene for a semaphorin class 4 protein (SEMA4G) overlap at map location 10q24.31 and are transcribed in opposite directions. Sequence analysis identified multiple transcript variants encoding at least four different protein isoforms.
13683	ADP-ribosylation factor-like 3 is a member of the ADP-ribosylation factor family of GTP-binding proteins. ARL3 binds guanine nucleotides but lacks ADP-ribosylation factor activity.
13684	
13685	This gene encodes a protein with antioxidant function and is localized in the mitochondrion. This gene shows significant nucleotide sequence similarity to the gene coding for the C22 subunit of Salmonella typhimurium alkylhydroperoxide reductase. Expression of this gene product in E. coli deficient in the C22-subunit gene rescued resistance of the bacteria to alkylhydroperoxide. The human and mouse genes are highly conserved, and they map to the regions syntenic between mouse and human chromosomes. Sequence comparisons with recently cloned mammalian homologues suggest that these genes consist of a family that is responsible for regulation of cellular proliferation, differentiation, and antioxidant functions. Two transcript variants encoding two different isoforms have been found for this gene.
13686	This gene encodes a protein that is clearly involved in kinetochore function although an exact role is not known. It interacts with ZW10, another kinetochore protein, possibly regulating the association between ZW10 and kinetochores. The encoded protein localizes to prophase kinetochores before ZW10 does and it remains detectable on the kinetochore until late anaphase. It has a uniform distribution in the cytoplasm of interphase cells. Four transcript variants encoding three different isoforms have been found for this gene.
13687	This gene encodes a nuclear protein that has three zinc fingers at the end of its C-terminal domain, a serine/threonine-rich central region, and an acidic domain lying within the N-terminal region. The zinc fingers of this protein are responsible for the specific DNA binding with the guanine-rich core promoter elements. The central region might be involved in activation or posttranslational regulatory pathways, and the acidic N-terminal domain might play an important role in the process of transcriptional activation. It is capable of activating transcription approximately 4-fold either on homologous or heterologous promoters. The DNA binding and transcriptional activity of this protein, in conjunction with its expression pattern, suggests that this protein may participate in the regulation and/or maintenance of the basal expression of pregnancy-specific glycoprotein genes and possibly other TATA box-less genes. Two transcript variants encoding the same protein have been found for this gene.
13688	Meningioma 1 (MN1) contains two sets of CAG repeats. It is disrupted by a balanced translocation (4;22) in a meningioma, and its inactivation may contribute to meningioma 32 pathogenesis.
13689	
13690	Glutamic-oxaloacetic transaminase is a pyridoxal phosphate-dependent enzyme which exists in cytoplasmic and mitochondrial forms, GOT1 and GOT2, respectively.  GOT plays a role in amino acid metabolism and the urea and tricarboxylic acid cycles. The two enzymes are homodimeric and show close homology.
13691	
13692	
13693	
13694	This gene produces two transcripts encoding two distinct proteins. One protein is a transcriptional repressor, while the other isoform is a major component of specialized synapses known as synaptic ribbons. Both proteins contain a NAD+ binding domain similar to NAD+-dependent 2-hydroxyacid dehydrogenases. A portion of the 3@#$%& untranslated region was used to map this gene to chromosome 21q21.3; however, it was noted that similar loci elsewhere in the genome are likely. Blast analysis shows that this gene is present on chromosome 10.
13695	
13696	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This gene is located in the chromosome 10q25-26 region, which is frequently deleted in gliomas and tumors of other tissues, and is disrupted by the t(10;19) translocation rearrangement in glioblastoma cells. The gene location suggests that it is a candidate gene for the tumor suppressor locus.
13697	
13698	
13699	Adaptor protein complex 1 is found at the cytoplasmic face of coated vesicles located at the Golgi complex, where it mediates both the recruitment of clathrin to the membrane and the recognition of sorting signals within the cytosolic tails of transmembrane receptors. This complex is a heterotetramer composed of two large, one medium, and one small adaptin subunit. The protein encoded by this gene serves as one of the large subunits of this complex and is a member of the adaptin protein family. This gene is a candidate meningioma gene. Two transcript variants encoding different isoforms have been found for this gene, and variants utilizing alternative polyadenylation signals exist.
13700	
13701	
13702	
13703	This gene is a member of the myotubularin family and encodes a putative tyrosine phosphatase. Mutations in this gene are a cause of Charcot-Marie-Tooth disease type 4B, an autosomal recessive demyelinating neuropathy. Multiple alternatively spliced transcript variants have been found, but the biological validity of some variants has not been determined.
13704	
13705	
13706	The protein encoded by this gene is a brain-enriched nucleotide exchanged factor that contains an N-terminal GEF domain, 2 tandem repeats of EF-hand calcium-binding motifs, and a C-terminal diacylglycerol/phorbol ester-binding domain. This protein can activate small GTPases, including RAS and RAP1/RAS3. The nucleotide exchange activity of this protein can be stimulated by calcium and diacylglycerol. Two alternatively spliced transcript variants of this gene, encoding distinct isoforms, have been reported.
13707	
13708	This gene encodes subunit 2 of the splicing factor 3b protein complex. Splicing factor 3b, together with splicing factor 3a and a 12S RNA unit, forms the U2 small nuclear ribonucleoproteins complex (U2 snRNP). The splicing factor 3b/3a complex binds pre-mRNA upstream of the intron@#$%&s branch site in a sequence-independent manner and may anchor the U2 snRNP to the pre-mRNA. Splicing factor 3b is also a component of the minor U12-type spliceosome. Subunit 2 associates with pre-mRNA upstream of the branch site at the anchoring site. Subunit 2 also interacts directly with subunit 4 of the splicing factor 3b complex. Subunit 2 is a highly hydrophilic protein with a proline-rich N-terminus and a glutamate-rich stretch in the C-terminus.
13709	
13710	
13711	
13712	
13713	This gene encodes a selenoprotein containing selenocysteine at the active site. The selenocysteine is encoded by the ususal nonsense codon UGA. This gene is expressed in a variety of tissues, and the protein is localized to the perinuclear structures.
13714	
13715	
13716	
13717	
13718	This gene encodes a protein that is one of many involved in mechanisms to ensure proper chromosome segregation during cell division. The encoded protein binds to centromeres during the prophase, metaphase, and early anaphase cell division stages and to kinetochore microtubules during metaphase.
13719	The protein encoded by this gene belongs to the subtilisin-like proprotein convertase family. The members of this family are proprotein convertases that process latent precursor proteins into their biologically active products. This encoded protein is a calcium-dependent serine endoprotease. It is structurally related to its family members, PACE and PACE4. This protein is concentrated in the trans-Golgi network, associated with the membranes, and is not secreted. It can process proalbumin and is thought to be responsible for the activation of HIV envelope glycoproteins gp160 and gp140. This gene has been implicated in the transcriptional regulation of housekeeping genes. Multiple alternatively spliced transcripts are described for this gene but their full length nature is not yet known. Downstream of this gene@#$%&s map location at 11q23-q24, nucleotides that match part of this gene@#$%&s 3@#$%& end are duplicated and inverted. A translocation breakpoint associated with lymphoma occurs between this gene and its inverted counterpart.
13720	
13721	
13722	
13723	
13724	
13725	
13726	
13727	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
13728	This gene encodes a member of the globin superfamily and is expressed in skeletal and cardiac muscles. The encoded protein is a haemoprotein contributing to intracellular oxygen storage and transcellular facilitated diffusion of oxygen. At least three alternatively spliced transcript variants encoding the same protein have been reported.
13729	TIMM10 belongs to a family of evolutionarily conserved proteins that are organized in heterooligomeric complexes in the mitochondrial intermembrane space. These proteins mediate the import and insertion of hydrophobic membrane proteins into the mitochondrial inner membrane.[supplied by OMIM]
13730	This gene encodes a member of the vitamin B12-binding protein family. This family of proteins, alternatively referred to as R binders, is expressed in various tissues and secretions. This protein is a major constituent of secondary granules in neutrophils and facilitates the transport of cobalamin into cells.
13731	The protein encoded by this gene is a member of the fatty acid desaturase (FADS) gene family. Desaturase enzymes regulate unsaturation of fatty acids through the introduction of double bonds between defined carbons of the fatty acyl chain. FADS family members are considered fusion products composed of an N-terminal cytochrome b5-like domain and a C-terminal multiple membrane-spanning desaturase portion, both of which are characterized by conserved histidine motifs. This gene is clustered with family members FADS1 and FADS2 at 11q12-q13.1; this cluster is thought to have arisen evolutionarily from gene duplication based on its similar exon/intron organization.
13732	This gene encodes a putative GTP-binding protein similar to RAY/RAB1C. The protein is ras-related, but the function is unknown.
13733	
13734	
13735	
13736	
13737	The gamma globin genes (HBG1 and HBG2) are normally expressed in the fetal liver, spleen and bone marrow. Two gamma chains together with two alpha chains constitute fetal hemoglobin (HbF) which is normally replaced by adult hemoglobin (HbA) at birth. In some beta-thalassemias and related conditions, gamma chain production continues into adulthood. The two types of gamma chains differ at residue 136 where glycine is found in the G-gamma product (HBG2) and alanine is found in the A-gamma product (HBG1). The former is predominant at birth. The order of the genes in the beta-globin cluster is: 5@#$%&-epsilon -- gamma-G -- gamma-A -- delta -- beta--3@#$%&
13738	
13739	
13740	This gene is a member of the frizzled gene family. Members of this family encode seven-transmembrane domain proteins that are receptors for the Wingless type MMTV integration site family of signaling proteins. Most frizzled receptors are coupled to the beta-catenin canonical signaling pathway. This protein may play a role as a positive regulator of the Wingless type MMTV integration site signaling pathway. A transcript variant retaining intronic sequence and encoding a shorter isoform has been described, however, its expression is not supported by other experimental evidence.
13741	The protein encoded by this intronless gene contains a C3HC5 type of RING finger, a motif known to be involved in protein-DNA and protein-protein interactions. The expression of this gene was found to be upregulated in cancer cell lines derived from different types of cancer.
13742	The protein encoded by this gene is an A2 phospholipase, a class of enzyme that catalyzes the release of fatty acids from phospholipids. The encoded protein may play a role in phospholipid remodelling, arachidonic acid release, leukotriene and prostaglandin synthesis, fas-mediated apoptosis, and transmembrane ion flux in glucose-stimulated B-cells. Several transcript variants encoding multiple isoforms have been described, but the full-length nature of only two of them have been determined to date.
13743	
13744	
13745	
13746	
13747	
13748	
13749	Adrenomedullin is a potent vasodilator peptide that exerts major effects on cardiovascular function. This gene encodes a seven-transmembrane protein that belongs to the family 1 of G-protein coupled receptors. Studies of the rat counterpart suggest that the encoded protein may function as a receptor for adrenomedullin.
13750	
13751	This gene encodes a member of the DEAD box protein family. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure, such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The protein encoded by this gene is a component of the U5 snRNP complex; it may facilitate conformational changes in the spliceosome during nuclear pre-mRNA splicing. An alternatively spliced transcript variant has been found for this gene, but its biological validity has not been determined.
13752	MaxiK channels are large conductance, voltage and calcium-sensitive potassium channels which are fundamental to the control of smooth muscle tone and neuronal excitability. MaxiK channels can be formed by 2 subunits: the pore-forming alpha subunit and the modulatory beta subunit. The protein encoded by this gene is an auxiliary beta subunit which slows activation kinetics, leads to steeper calcium sensitivity, and shifts the voltage range of current activation to more negative potentials than does the beta 1 subunit.
13753	
13754	
13755	
13756	The protein encoded by this gene is highly expressed in the lacrimal glands and localized primarily to secretory granules and secretory fluid. It augments lacrimal acinar cell secretion, promotes ductal cell proliferation, and stimulates signaling through tyrosine phosphorylation and release of calcium.
13757	This gene encodes an enzyme included in the 2@#$%&, 5@#$%& oligoadenylate synthase family.  This enzyme is induced by interferons and catalyzes the 2@#$%&, 5@#$%& oligomers of adenosine in order to bind and activate RNase L.  This enzyme family plays a significant role in the inhibition of cellular protein synthesis and viral infection resistance.  Alternative splicing of this gene produces two isoforms which share identical N-terminal sequence but diverge at exon 7.
13758	
13759	
13760	SELPLG is the high affinity counter-receptor for P-selectin on myeloid cells and stimulated T lymphocytes. As such, it plays a critical role in the tethering of these cells to activated platelets or endothelia expressing P-selectin. The organization of the SELPG gene closely resembles that of CD43 and the human platelet glycoprotein GpIb-alpha both of which have an intron in the 5-prime-noncoding region, a long second exon containing the complete coding region, and TATA-less promoters.
13761	
13762	Potassium channels are present in most mammalian cells, where they participate in a wide range of physiologic responses. The protein encoded by this gene is an integral membrane protein and inward-rectifier type potassium channel. The encoded protein, which has a greater tendency to allow potassium to flow into a cell rather than out of a cell, is controlled by G-proteins.
13763	This gene product belongs to the family of candidate taste receptors that are members of the G-protein-coupled receptor superfamily. These proteins are specifically expressed in the taste receptor cells of the tongue and palate epithelia. They are organized in the genome in clusters and are genetically linked to loci that influence bitter perception in mice and humans. In functional expression studies, they respond to bitter tastants. This gene maps to the taste receptor gene cluster on chromosome 12p13.
13764	Natural killer (NK) cells are lymphocytes that can mediate lysis of certain tumor cells and virus-infected cells without previous activation. They can also regulate specific humoral and cell-mediated immunity. The protein encoded by this gene belongs to the killer cell lectin-like receptor (KLR) family, which is a group of transmembrane proteins preferentially expressed in NK cells. Studies in mice suggested that the expression of this gene may be regulated by MHC class I molecules. Alternatively spliced transcript variants have been reported, but their full-length natures have not yet been determined.
13765	
13766	
13767	
13768	
13769	
13770	
13771	
13772	
13773	The nuclear ACADS gene encodes a mitochondrially-targeted protein product.  Acyl-Coenzyme A dehydrogenase, short-chain specific, catalyzes the initial step of the mitochondrial fatty acid beta-oxidation pathway. Clinical phenotypes, described elsewhere, are associated with ACADS hereditary deficiency.
13774	Cytoplasmic dyneins are large enzyme complexes with a molecular mass of about 1,200 kD. They contain two force-producing heads formed primarily from dynein heavy chains, and stalks linking the heads to a basal domain, which contains a varying number of accessory intermediate chains. The predicted 89-amino acid human protein, cytoplasmic dynein light polypeptide, is 94% similar to Drosophila dlc1. This gene product physically interacts with and inhibits the activity of neuronal nitric oxide synthase (nNOS). It also appears to be one of the more conserved proteins in nature, showing 92% amino acid similarity with the nematode and rat homologs. Binding of this protein destabilizes the nNOS dimer, a conformation necessary for activity, and it may regulate numerous biologic processes through its effects on nitric oxide synthase activity.
13775	
13776	This gene encodes a member of a family of pyridine nucleotide oxidoreductases. This protein reduces thioredoxins as well as other substrates, and plays a role in selenium metabolism and protection against oxidative stress. The functional enzyme is thought to be a homodimer which uses FAD as a cofactor. Each subunit contains a selenocysteine residue which is required for the catalytic activity. Alternative splicing results in several transcript variants encoding the same protein. Additional transcript variants have been described but their biological validity has not been determined.
13777	
13778	This gene encodes a member of the UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase (GalNAc-T) family of enzymes. GalNAc-Ts initiate mucin-type O-linked glycosylation in the Golgi apparatus by catalyzing the transfer of GalNAc to serine and threonine residues on target proteins. They are characterized by an N-terminal transmembrane domain, a stem region, a lumenal catalytic domain containing a GT1 motif and Gal/GalNAc transferase motif, and a C-terminal ricin/lectin-like domain. GalNAc-Ts have different, but overlapping, substrate specificities and patterns of expression.
13779	Rho GTPases control a variety of cellular processes. There are 3 subtypes of Rho GTPases in the Ras superfamily of small G proteins: RHO (see MIM 165370), RAC (see RAC1; MIM 602048), and CDC42 (MIM 116952). GTPase-activating proteins (GAPs) bind activated forms of Rho GTPases and stimulate GTP hydrolysis. Through this catalytic function, Rho GAPs negatively regulate Rho-mediated signals. GAPs may also serve as effector molecules and play a role in signaling downstream of Rho and other Ras-like GTPases.[supplied by OMIM]
13780	
13781	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes a protein which is not a structural subunit, but may be involved in the recruitment of copper to mitochondria for incorporation into the COX apoenzyme. This protein shares 92% amino acid sequence identity with mouse and rat Cox17 proteins. This gene is no longer considered to be a candidate gene for COX deficiency. A pseudogene COX17P has been found on chromosome 13.
13782	
13783	Insulin promoter factor 1 is an islet-specific protein that activates transcription of the insulin andsomatostatin genes.  It is a key regulator of islet peptide hormone expressionn and also plays an essential role in pancreatic development.  Mutations in this gene may be involved in several disorders of the pancreas or in diabetes mellitus.
13784	
13785	
13786	
13787	This gene encodes the major type IV alpha collagen chain of basement membranes. Like the other members of the type IV collagen gene family, this gene is organized in a head-to-head conformation with another type IV collagen gene so that each gene pair shares a common promoter.
13788	
13789	
13790	This gene encodes a member of a large, diverse group of A-kinase anchor proteins that target the activity of protein kinase A to specific subcellular sites by binding to its type II regulatory subunits. Brain-specific expression and coat protein-like membrane recruitment of a highly similar protein in mouse suggest an involvement in neuronal post-Golgi membrane traffic. Mutations in this gene may be associated with a form of autism. Alternatively spliced transcript variants may exist, but their full-length nature has not been determined.
13791	
13792	
13793	In Aspergillus nidulans, lack of the serine/threonine kinase NimA (never in mitosis A) results in cell cycle arrest in G2, while overexpression causes the premature onset of mitotic events. The protein encoded by this gene is similar in sequence to the Aspergillus nidulans protein and may therefore play a role in mitotic regulation. However, the encoded protein differs from other NimA family members in that it is not cell cycle regulated and is found primarily in the cytoplasm. Three transcript variants have been found for this gene, but the full-length nature of only two of them has been characterized.
13794	
13795	
13796	
13797	
13798	
13799	
13800	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. The immunoproteasome contains an alternate regulator, referred to as the 11S regulator or PA28, that replaces the 19S regulator. Three subunits (alpha, beta and gamma) of the 11S regulator have been identified. This gene encodes the beta subunit of the 11S regulator, one of the two 11S subunits that is induced by gamma-interferon. Three beta and three alpha subunits combine to form a heterohexameric ring. Six pseudogenes have been identified on chromosomes 4, 5, 8, 10 and 13.
13801	
13802	
13803	The protein encoded by this gene is part of a large multisubunit complex required for microtubule nucleation at the centrosome. Two transcript variants encoding different isoforms have been found for this gene.
13804	Members of the Rab protein family are nontransforming monomeric GTP-binding proteins of the Ras superfamily that contain 4 highly conserved regions involved in GTP binding and hydrolysis. Rabs are prenylated, membrane-bound proteins involved in vesicular fusion and trafficking.[supplied by OMIM]
13805	
13806	This gene product is a component of the ribonucleoprotein complex responsible for telomerase activity which catalyzes the addition of new telomeres on the chromosome ends. The telomerase-associated proteins are conserved from ciliates to humans.
13807	
13808	
13809	
13810	WW domain-containing proteins are found in all eukaryotes and play an important role in the regulation of a wide variety of cellular functions such as protein degradation, transcription, and RNA splicing. This gene encodes a protein which contains 2 WW domains and a coiled-coil region. It is ubiquitously expressed in adult tissues. The encoded protein is 94% identical to the mouse protein at the amino acid level.
13811	The product of this gene is a Golgi enzyme catalyzing an essential step in the conversion of oligomannose to complex N-glycans. The enzyme has the typical glycosyltransferase domains: a short N-terminal cytoplasmic domain, a hydrophobic non-cleavable signal-anchor domain, and a C-terminal catalytic domain. Mutations in this gene may lead to carbohydrate-deficient glycoprotein syndrome, type II. Two transcript variants encoding the same protein have been identified for this gene.
13812	
13813	
13814	
13815	Aminoacyl-tRNA synthetases catalyze the aminoacylation of tRNA by their cognate amino acid. Because of their central role in linking amino acids with nucleotide triplets contained in tRNAs, aminoacyl-tRNA synthetases are thought to be among the first proteins that appeared in evolution. Two forms of tryptophanyl-tRNA synthetase exist, a cytoplasmic form, named WARS, and a mitochondrial form, named WARS2. Tryptophanyl-tRNA synthetase (WARS) catalyzes the aminoacylation of tRNA(trp) with tryptophan and is induced by interferon. Tryptophanyl-tRNA synthetase belongs to the class I tRNA synthetase family. Four transcript variants encoding two different isoforms have been found for this gene.
13816	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of aminophospholipid-transporting ATPases. The aminophospholipid translocases transport phosphatidylserine and phosphatidylethanolamine from one side of a bilayer to another. This gene is maternally expressed. It maps within the most common interval of deletion responsible for Angelman syndrome, also known as @#$%&happy puppet syndrome@#$%&.
13817	
13818	This gene encodes the fibrillin protein, an important component of extracellular microfibrils. The encoded protein is found in elastic and nonelastic connective tissues of the body. Mutations in this gene cause Marfan syndrome.
13819	
13820	
13821	
13822	
13823	
13824	Cyclin B2 is a member of the cyclin family, specifically the B-type cyclins. The B-type cyclins, B1 and B2, associate with p34cdc2 and are essential components of the cell cycle regulatory machinery. B1 and B2 differ in their subcellular localization. Cyclin B1 co-localizes with microtubules, whereas cyclin B2 is primarily associated with the Golgi region. Cyclin B2 also binds to transforming growth factor beta RII and thus cyclin B2/cdc2 may play a key role in transforming growth factor beta-mediated cell cycle control.
13825	This gene encodes a protein sharing a low level of sequence similarity with human ribosomal protein L24. Although this gene has been referred to as RPL24, L30, and 60S ribosomal protein L30 isolog in the sequence databases, it is distinct from the human genes officially named RPL24 (which itself has been referred to as ribosomal protein L30) and RPL30. The function of this gene is currently unknown. This gene utilizes alternative polyadenylation signals.
13826	
13827	
13828	
13829	
13830	
13831	
13832	
13833	
13834	
13835	
13836	The amino acid sequence the protein encoded by this gene is similar to that of KIP/CIB, calcineurin B, and calmodulin. This suggests that the encoded protein may be a Ca2+-binding regulatory protein that interacts with DNA-dependent protein kinase catalytic subunit (DNA-PKcs).
13837	Regulates inositol phosphate metabolism by phosphorylation of second messenger inositol 1,4,5-trisphosphate to Ins(1,3,4,5)P4. The activity of the inositol 1,4,5-trisphosphate 3-kinase is responsible for regulating the levels of a large number of inositol polyphosphates that are important in cellular signaling. Both calcium/calmodulin and protein phosphorylation mechanisms control its activity. It is also a substrate for the cyclic AMP-dependent protein kinase, calcium/calmodulin- dependent protein kinase II, and protein kinase C in vitro. ITPKA and ITPKB are 68% identical in the C-terminus region.
13838	This protein belongs to the basic helix-loop-helix family of transcription factors. It is a transcriptional repressor of genes that require a bHLH protein for their transcription. The protein has a particular type of basic domain that contains a helix interrupting protein that binds to the N-box rather than the canonical E-box.
13839	
13840	
13841	
13842	
13843	
13844	This gene encodes the major vault protein which is a lung resistance-related protein. Vaults are multi-subunit structures that may be involved in nucleo-cytoplasmic transport. This protein mediates drug resistance, perhaps via a transport process. It is widely distributed in normal tissues, and overexpressed in multidrug-resistant cancer cells. The protein overexpression is a potentially useful marker of clinical drug resistance. This gene produces two transcripts by using two alternative exon 2 sequences; however, the open reading frames are the same in both transcripts.
13845	
13846	
13847	
13848	Mutations in this gene lead to tuberous sclerosis complex. Its gene product is believed to be a tumor suppressor and is able to stimulate specific GTPases. The protein associates with hamartin in a cytosolic complex, possibly acting as a chaperone for hamartin. Three splice variants for this gene exist, producing three different isoforms of the protein.
13849	
13850	
13851	Three different procarboxypeptidases A and two different procarboxypeptidases B have been isolated. The B1 and B2 forms differ from each other mainly in isoelectric point. Carboxypeptidase B1 is a highly tissue-specific protein and is a useful serum marker for acute pancreatitis and dysfunction of pancreatic transplants. It is not elevated in pancreatic carcinoma.
13852	
13853	
13854	L-type calcium channels are composed of five subunits. The protein encoded by this gene represents one of these subunits, gamma, and is one of several gamma subunit proteins. It is an integral membrane protein that is thought to stabilize the calcium channel in an inactive (closed) state. This protein is similar to the mouse stargazin protein, mutations in which have been associated with absence seizures, also known as petit-mal or spike-wave seizures. This gene is a member of the neuronal calcium channel gamma subunit gene subfamily of the PMP-22/EMP/MP20 family. This gene is a candidate gene for a familial infantile convulsive disorder with paroxysomal choreoathetosis.
13855	
13856	
13857	
13858	
13859	
13860	
13861	The biosynthesis of lipid-linked oligosaccharides is highly conserved among eukaryotes and is catalyzed by 14 glycosyltransferases in an ordered stepwise manner. Mannosyltransferase I (MT I) catalyzes the first mannosylation step in this process.[supplied by OMIM]
13862	
13863	
13864	Integrins are heterodimers comprised of alpha and beta subunits, that are noncovalently associated transmembrane glycoprotein receptors. Different combinations of alpha and beta polypeptides form complexes that vary in their ligand-binding specificities. Integrins mediate cell-matrix or cell-cell adhesion, and transduced signals that regulate gene expression and cell growth. This gene encodes the integrin beta 4 subunit, a receptor for the laminins. This subunit tends to associate with alpha 6 subunit and is likely to play a pivotal role in the biology of invasive carcinoma. Mutations in this gene are associated with epidermolysis bullosa with pyloric atresia. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
13865	The multiprotein Mediator complex is a coactivator required for activation of RNA polymerase II transcription by DNA bound transcription factors. The protein encoded by this gene is thought to be a subunit of the Mediator complex. This gene is located within the Smith-Magenis syndrome region on chromosome 17.
13866	
13867	
13868	This gene encodes a member of the tweety family of proteins. Members of this family function as chloride anion channels. The encoded protein functions as a calcium(2+)-activated large conductance chloride(-) channel, and may play a role in kidney tumorigenesis. Two transcript variants encoding distinct isoforms have been identified for this gene.
13869	
13870	
13871	This gene encodes a member of the phosphatidylinositol transfer protein family. The encoded cytoplasmic protein transfers phosphatidylinositol from one membrane compartment to another. Two transcript variants encoding distinct isoforms have been identified for this gene.
13872	
13873	The protein encoded by this gene is a member of the SWI/SNF family of proteins, whose members display helicase and ATPase activities and which are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI and has sequence similarity to the yeast Swp73 protein.
13874	
13875	
13876	Heparan sulfate biosynthetic enzymes are key components in generating a myriad of distinct heparan sulfate fine structures that carry out multiple biologic activities. The enzyme encoded by this gene is a member of the heparan sulfate biosynthetic enzyme family. It is a type II integral membrane protein and possesses heparan sulfate glucosaminyl 3-O-sulfotransferase activity. The sulfotransferase domain of this enzyme is highly similar to the same domain of heparan sulfate D-glucosaminyl 3-O-sulfotransferase 3A1, and these two enzymes sulfate an identical disaccharide. This gene is widely expressed, with the most abundant expression in liver and placenta.
13877	
13878	
13879	
13880	
13881	
13882	This gene is one of several Cys-Cys (CC) cytokine genes clustered on the q-arm of chromosome 17. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene displays chemotactic activity for eosinophils, but not mononuclear cells or neutrophils. This eosinophil specific chemokine assumed to be involved in eosinophilic inflammatory diseases such as atopic dermatitis, allergic rhinitis, asthma and parasitic infections.
13883	The dystrophin-glycoprotein complex (DGC) comprises a group of proteins that are critical to the stability of muscle fiber membranes and to the linking of the actin cytoskeleton to the extracellular matrix. Components of the DGC include dystrophin (MIM 300377), which is deficient in Duchenne muscular dystrophy (DMD; MIM 310200); syntrophins (e.g., MIM 600026); dystroglycans (MIM 128239); and sarcoglycans, such as adhalin, a 50-kD transmembrane protein (Roberds et al., 1993).[supplied by OMIM]
13884	Many DNA-binding transcriptional activator proteins enhance the initiation rate of RNA polymerase II-mediated gene transcription by interacting functionally with the general transcription machinery bound at the basal promoter. Adaptor proteins are usually required for this activation, possibly to acetylate and destabilize nucleosomes, thereby relieving chromatin constraints at the promoter. The protein encoded by this gene is a transcriptional activator adaptor and has been found to be part of the PCAF histone acetylase complex. Two transcript variants encoding different isoforms have been identified for this gene.
13885	
13886	
13887	
13888	This gene encodes a protein that is involved in GPI-anchor biosynthesis. The glycosylphosphatidylinositol (GPI) anchor is a glycolipid found on many blood cells and serves to anchor proteins to the cell surface. This gene encodes an essential component of the multisubunit enzyme, GPI transamidase. GPI transamidase mediates GPI anchoring in the endoplasmic reticulum, by catalyzing the transfer of fully assembled GPI units to proteins.
13889	
13890	
13891	SLC13A5 is a tricarboxylate plasma transporter with a preference for citrate.[supplied by OMIM]
13892	
13893	
13894	
13895	
13896	
13897	The protein encoded by this gene shares strong similarity with Saccharomyces cerevisiae protein Cdc27, and the gene product of Schizosaccharomyces pombe nuc 2. This protein is a component of anaphase-promoting complex (APC), which is composed of eight protein subunits and highly conserved in eucaryotic cells. APC catalyzes the formation of cyclin B-ubiquitin conjugate that is responsible for the ubiquitin-mediated proteolysis of B-type cyclins. This protein and 3 other members of the APC complex contain the TPR (tetratricopeptide repeat), a protein domain important for protein-protein interaction. This protein was shown to interact with mitotic checkpoint proteins including Mad2, p55CDC and BUBR1, and thus may be involved in controlling the timing of mitosis.
13898	ATP citrate lyase is the primary enzyme responsible for the synthesis of cytosolic acetyl-CoA in many tissues. The enzyme is a tetramer (relative molecular weight approximately 440,000) of apparently identical subunits. It catalyzes the formation of acetyl-CoA and oxaloacetate from citrate and CoA with a concomitant hydrolysis of ATP to ADP and phosphate. The product, acetyl-CoA, serves several important biosynthetic pathways, including lipogenesis and cholesterogenesis. In nervous tissue, ATP citrate-lyase may be involved in the biosynthesis of acetylcholine. Two transcript variants encoding distinct isoforms have been identified for this gene.
13899	KRT17 encodes the type I intermediate filament chain keratin 17, expressed in nail bed, hair follicle, sebaceous glands, and other epidermal appendages. Mutations in this gene lead to Jackson-Lawler type pachyonychia congenita and steatocystoma multiplex.
13900	
13901	
13902	
13903	The protein encoded by this gene is a member of the keratin family. The keratins are intermediate filament proteins responsible for the structural integrity of epithelial cells and are subdivided into cytokeratins and hair keratins. The type I cytokeratins consist of acidic proteins which are arranged in pairs of heterotypic keratin chains. This cytokeratin is a major cellular protein of mature enterocytes and goblet cells and is specifically expressed in the gastric and intestinal mucosa. The type I cytokeratin genes are clustered in a region of chromosome 17q12-q21.
13904	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
13905	GEMIN4 is part of a high molecular mass multiprotein complex that also contains SMN (MIM 600354), GEMIN2 (SIP1; MIM 602595) and GEMIN3 (DDX20; MIM 606168). In the cytoplasm, this complex is associated with small nuclear ribonucleoproteins (snRNPs).[supplied by OMIM]
13906	GEMIN4 is part of a high molecular mass multiprotein complex that also contains SMN (MIM 600354), GEMIN2 (SIP1; MIM 602595) and GEMIN3 (DDX20; MIM 606168). In the cytoplasm, this complex is associated with small nuclear ribonucleoproteins (snRNPs).[supplied by OMIM]
13907	
13908	Members of the RGS family, such as RGS9, are signaling proteins that suppress the activity of G proteins by promoting their deactivation.[supplied by OMIM]
13909	The product of this gene is a member of the CDC42-binding protein family. Members of this family interact with Rho family GTPases and regulate the organization of the actin cytoskeleton. This protein has been shown to bind both CDC42 and TC10 GTPases in a GTP-dependent manner. When overexpressed in fibroblasts, this protein was able to induce pseudopodia formation, which suggested a role in inducing actin filament assembly and cell shape control.
13910	Neuronal CDC2-like kinase, which is involved in the regulation of neuronal differentiation, is composed of a catalytic subunit, CDK5, and an activating subunit, p25NCK5A. The protein encoded by this gene binds to p25NCK5A and therefore may be involved in neuronal differentiation. The encoded protein, which may be a substrate of neuronal CDC2-like kinase, has also been found in vascular endothelial cells, where it mediates cell proliferation. Three transcript variants encoding two different isoforms have been found for this gene.
13911	This gene encodes a protein which is necessary for normal skeletal development and brain function. Mutations in this gene are associated with two types of recessive osteochondrodysplasia, Dyggve-Melchior-Clausen (DMC) dysplasia and Smith-McCort (SMC) dysplasia, which involve both skeletal defects and mental retardation.
13912	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
13913	
13914	
13915	
13916	
13917	
13918	
13919	
13920	
13921	
13922	The protein encoded by this gene is a member of the gp130 family of cytokines. These cytokines drive the assembly of multisubunit receptor complexes, all of which contain at least one molecule of the transmembrane signaling receptor IL6ST (gp130). This cytokine is shown to stimulate the T-cell-dependent development of immunoglobulin-producing B cells. It is also found to support the proliferation of hematopoietic stem cells and megakaryocyte progenitor cells.
13923	The second major step in the catabolism of the branched-chain amino acids, isoleucine, leucine, and valine, is catalyzed by the branched-chain alpha-keto acid dehydrogenase complex (BCKD; EC 1.2.4.4), an inner-mitochondrial enzyme complex that consists of 3 catalytic components: a heterotetrameric (alpha2, beta2) branched-chain alpha-keto acid decarboxylase (E1), a homo-24-meric dihydrolipoyl transacylase (E2; MIM 248610), and a homodimeric dihydrolipoamide dehydrogenase (E3; MIM 246900). The reaction is irreversible and constitutes the first committed step in BCAA oxidation. The complex also contains 2 regulatory enzymes, a kinase and a phosphorylase. The BCKDHA gene encodes the alpha subunit of E1, and the BCKDHB gene (MIM 248611) encodes the beta subunit of E1.[supplied by OMIM]
13924	CPAMD8 belongs to the complement component-3 (C3; MIM 120700)/alpha-2-macroglobulin (A2M; MIM 103950) family of proteins, which are involved in innate immunity and damage control. Complement components recognize and eliminate pathogens by direct binding or by mediating opsonization/phagocytosis and intracellular killing, and A2M is a broad-spectrum protease inhibitor (Li et al., 2004).[supplied by OMIM]
13925	
13926	The protein encoded by this gene is a cytokine that belongs to the tumor necrosis factor (TNF) ligand family. This cytokine is a ligand for TNFRSF9/4-1BB, which is a costimulatory molecule in T lymphocytes. This cytokine and its receptor are involved in the antigen presentation process, and in the generation of cytotoxic T cells. This cytokine is also shown to be required for the optimal CD8 responses in CD8 T cells. This cytokine is found to be expressed in various carcinoma cell lines, and is thought to be involved in T cell-tumor cell interaction.
13927	
13928	
13929	Zinc finger proteins (ZNFs) bind DNA and, through this binding, regulate gene transcription. Most ZNFs contain conserved C2H2 motifs and are classified as Kruppel-type zinc fingers. For a general description of these proteins, see ZNF91 (MIM 603971).[supplied by OMIM]
13930	Calreticulin is a multifunctional protein that acts as a major Ca(2+)-binding (storage) protein in the lumen of the endoplasmic reticulum. It is also found in the nucleus, suggesting that it may have a role in transcription regulation. Calreticulin binds to the synthetic peptide KLGFFKR, which is almost identical to an amino acid sequence in the DNA-binding domain of the superfamily of nuclear receptors. Calreticulin binds to antibodies in certain sera of systemic lupus and Sjogren patients which contain anti-Ro/SSA antibodies, it is highly conserved among species, and it is located in the endoplasmic and sarcoplasmic reticulum where it may bind calcium. The amino terminus of calreticulin interacts with the DNA-binding domain of the glucocorticoid receptor and prevents the receptor from binding to its specific glucocorticoid response element. Calreticulin can inhibit the binding of androgen receptor to its hormone-responsive DNA element and can inhibit androgen receptor and retinoic acid receptor transcriptional activities in vivo, as well as retinoic acid-induced neuronal differentiation. Thus, calreticulin can act as an important modulator of the regulation of gene transcription by nuclear hormone receptors. Systemic lupus erythematosus is associated with increased autoantibody titers against calreticulin but calreticulin is not a Ro/SS-A antigen. Earlier papers referred to calreticulin as an Ro/SS-A antigen but this was later disproven. Increased autoantibody titer against human calreticulin is found in infants with complete congenital heart block of both the IgG and IgM classes.
13931	
13932	
13933	The LX receptors (LXRs) were originally identified as orphan members of the nuclear receptor superfamily because their ligands were unknown. Like other receptors in the family, LXRs heterodimerize with retinoid X receptor (see MIM 180245) and bind to specific response elements (LXREs) characterized by direct repeats separated by 4 nucleotides. Two genes, alpha (LXRA, MIM 602423) and beta, are known to encode LXR proteins (Song et al., 1995).[supplied by OMIM]
13934	
13935	
13936	
13937	Trans-dihydrodiol dehydrogenase (DD; EC 1.3.1.20) catalyzes the NADP(+)-linked oxidation of trans-dihydrodiols of aromatic hydrocarbons to the corresponding catechols. DD exists in mammalian tissues in monomeric and dimeric forms with similar but distinct enzymatic properties. See MIM 600449 and MIM 600450 for additional background information on dihydrodiol dehydrogenase.[supplied by OMIM]
13938	Sulfotransferase enzymes catalyze the sulfate conjugation of many hormones, neurotransmitters, drugs, and xenobiotic compounds. These cytosolic enzymes are different in their tissue distributions and substrate specificities. The gene structure (number and length of exons) is similar among family members. This gene is primarily expressed in liver and adrenal tissues where the encoded protein sulfates steroids and bile acids.
13939	
13940	
13941	The protein encoded by this gene is similar to a sea urchin radial spoke head protein. Radial spoke protein complexes form part of the axoneme of eukaryotic flagella and are located between the axoneme@#$%&s outer ring of doublet microtubules and central pair of microtubules. In Chlamydomonas, radial spoke proteins are thought to regulate the activity of dynein and the symmetry of flagellar bending patterns. This gene maps to a region of chromosome 19 that is linked to primary ciliary dyskinesia-2 (CILD2).
13942	The protein encoded by this gene is similar to a sea urchin radial spoke head protein. Radial spoke protein complexes form part of the axoneme of eukaryotic flagella and are located between the axoneme@#$%&s outer ring of doublet microtubules and central pair of microtubules. In Chlamydomonas, radial spoke proteins are thought to regulate the activity of dynein and the symmetry of flagellar bending patterns. This gene maps to a region of chromosome 19 that is linked to primary ciliary dyskinesia-2 (CILD2).
13943	
13944	Troponin I (TnI) is one of 3 subunits that form the troponin complex of the thin filaments of striated muscle; see also TNNI1 (MIM 191042). The others are troponin T (TnT; see MIM 191041) and troponin C (TnC; see MIM 191040).[supplied by OMIM]
13945	
13946	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator. A pseudogene has been identified on chromosome 1.
13947	
13948	The protein encoded by this gene is part of a potentially heterotetrameric voltage-independent potassium channel that is activated by intracellular calcium. Activation is followed by membrane hyperpolarization, which promotes calcium influx. The encoded protein may be part of the predominant calcium-activated potassium channel in T-lymphocytes. This gene is similar to other KCNN family potassium channel genes, but it differs enough to possibly be considered as part of a new subfamily.
13949	
13950	
13951	
13952	Ubiquitin is a highly conserved nuclear and cytoplasmic protein that has a major role in targeting cellular proteins for degradation by the 26S proteosome. It is also involved in the maintenance of chromatin structure, the regulation of gene expression, and the stress response. Ubiquitin is synthesized as a precursor protein consisting of either polyubiquitin chains or a single ubiquitin moiety fused to an unrelated protein. This gene encodes a fusion protein consisting of ubiquitin at the N terminus and ribosomal protein L40 at the C terminus. When expressed in yeast, the protein is post-translationally processed, generating free ubiquitin monomer and the 52-amino acid ribosomal protein L40. Ribosomal protein L40 is a component of the 60S subunit of the ribosome and belongs to the L40E family of ribosomal proteins. It contains a zinc finger-like domain and is located in the cytoplasm. Multiple processed pseudogenes derived from this gene are present in the genome. As with ribosomal protein L40, ribosomal protein S27a is also synthesized as a fusion protein with ubiquitin; similarly, ribosomal protein S30 is synthesized as a fusion protein with the ubiquitin-like protein fubi.
13953	
13954	
13955	Glyceraldehyde 3-phosphate dehydrogenases (EC 1.2.1.12), such as GAPDS, are NAD-dependent enzymes that remove hydrogen and add phosphate to glyceraldehyde 3-phosphate to form 1,3-diphosphoglycerate.[supplied by OMIM]
13956	Calpains are a ubiquitous, well-conserved family of calcium-dependent, cysteine proteases. Calpain families have been implicated in neurodegenerative processes, as their activation can be triggered by calcium influx and oxidative stress. Calpain I and II are heterodimeric with distinct large subunits associated with common small subunits, all of which are encoded by different genes. This gene encodes a small subunit common to both calpain I and II and is associated with myotonic dystrophy. Two transcript variants encoding the same protein have been identified for this gene.
13957	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes polypeptide 1 (muscle isoform) of subunit VIIa and the polypeptide 1 is present only in muscle tissues. Other polypeptides of subunit VIIa are present in both muscle and nonmuscle tissues, and are encoded by different genes.
13958	Peptidoglycan recognition proteins, such as PGRPL, are part of the innate immune system and recognize peptidoglycan, a ubiquitous component of bacterial cell walls.[supplied by OMIM]
13959	
13960	
13961	The protein encoded by this gene contains a cysteine-rich domain in the N-terminal region and 2 LIM domains in the C-terminal region. It also has several potential phosphorylation and N-myristoylation sites and a single potential N-glycosylation site. The presence of LIM domains implies involvement in protein-protein interactions. Expression of this gene has been detected in most tissues, with highest expression in skeletal muscle. Transcript variants utilizing alternative polyA signals have been observed.
13962	
13963	
13964	BARD1 interacts with the N-terminal region of BRCA1. In addition to its ability to bind BRCA1 in vivo and in vitro, BARD1 shares homology with the 2 most conserved regions of BRCA1: the N-terminal RING motif and the C-terminal BRCT domain. The RING motif is a cysteine-rich sequence found in a variety of proteins that regulate cell growth, including the products of tumor suppressor genes and dominant protooncogenes. The BARD1 protein also contains 3 tandem ankyrin repeats. The BARD1/BRCA1 interaction is disrupted by tumorigenic amino acid substitutions in BRCA1, implying that the formation of a stable complex between these proteins may be an essential aspect of BRCA1 tumor suppression.  BARD1 may be the target of oncogenic mutations in breast or ovarian cancer.
13965	
13966	
13967	
13968	GALNT14 (EC 2.4.1.41) belongs to a large subfamily of glycosyltransferases residing in the Golgi apparatus. GALNT enzymes catalyze the first step in the O-glycosylation of mammalian proteins by transferring N-acetyl-D-galactosamine (GalNAc) to peptide substrates.[supplied by OMIM]
13969	This gene encodes adenylyl cyclase 3 which is a membrane-associated enzyme and catalyzes the formation of the secondary messenger cyclic adenosine monophosphate (cAMP). This protein appears to be widely expressed in various human tissues and may be involved in a number of physiological and pathophysiological metabolic processes.
13970	
13971	This gene is thought to encode a GDP-mannose pyrophosphorylase. This enzyme catalyzes the reaction which converts mannose-1-phosphate and GTP to GDP-mannose which is involved in the production of N-linked oligosaccharides.
13972	
13973	The N-myc oncogene has been implicated in the pathogenesis of a number of human tumors, including childhood neuroblastoma and adult small cell lung cancer. Complementary DNA clones derived from a transcription unit, N-cym, located on the opposite DNA strand to N-myc have been characterized. There is extensive overlap between the 5@#$%& ends of the two transcription units. The N-cym gene, which can encode a 109-amino acid protein, is expressed during fetal development, as well as in tumor cell lines containing amplified N-myc loci, where it is expressed at very high levels. Although other examples of overlapping, opposite-strand eukaryotic genes exist, N-myc and N-cym are unique in that they appear to be coregulated in tumor cell lines under basal growth conditions and in response to the differentiating agent retinoic acid. This coregulation suggests that their protein products may be functionally interrelated during normal development and oncogenesis.
13974	
13975	
13976	
13977	The protein encoded by this gene is a DNA binding and apoptosis-inducing protein and is localized in the nucleus. It is also a Rac3-interacting protein which acts as a corepressor for the thyroid hormone receptor. This protein is thought to regulate TRAX/Translin complex formation. Several alternatively spliced transcript variants of this gene have been described, but the full length nature of some of these variants has not been determined.
13978	The protein encoded by this gene is a cytokine receptor that belongs to the interleukin 1 receptor family. This protein is a receptor for interleukin alpha (IL1A), interleukin beta (IL1B), and interleukin 1 receptor, type I(IL1R1/IL1RA). It is an important mediator involved in many cytokine induced immune and inflammatory responses. This gene along with interleukin 1 receptor, type II (IL1R2), interleukin 1 receptor-like 2 (IL1RL2), and interleukin 1 receptor-like 1 (IL1RL1) form a cytokine receptor gene cluster in a region mapped to chromosome 2q12.
13979	The protein encoded by this gene belongs to the family of Na+/K+ and H+/K+ ATPases beta chain proteins, and to the subfamily of Na+/K+ -ATPases. Na+/K+ -ATPase is an integral membrane protein responsible for establishing and maintaining the electrochemical gradients of Na and K ions across the plasma membrane. These gradients are essential for osmoregulation, for sodium-coupled transport of a variety of organic and inorganic molecules, and for electrical excitability of nerve and muscle. This enzyme is composed of two subunits, a large catalytic subunit (alpha) and a smaller glycoprotein subunit (beta). The beta subunit regulates, through assembly of alpha/beta heterodimers, the number of sodium pumps transported to the plasma membrane. The glycoprotein subunit of Na+/K+ -ATPase is encoded by multiple genes. This gene encodes a beta 3 subunit. This gene encodes a beta 3 subunit. A pseudogene exists for this gene, and it is located on chromosome 2.
13980	
13981	The protein encoded by this gene is a member of the STAT family of transcription factors. In response to cytokines and growth factors, STAT family members are phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. This protein is essential for mediating responses to IL12 in lymphocytes, and regulating the differentiation of T helper cells.
13982	This intronless gene is thought to be generated by retrotransposition of a fully processed PCBP-2 mRNA. This gene and PCBP-2 has paralogues PCBP3 and PCBP4 which is thought to arose as a result of duplication events of entire genes. The protein encoded by this gene appears to be multifunctional. It along with PCBP-2 and hnRNPK corresponds to the major cellular poly(rC)-binding proteins. It contains three K-homologous (KH) domains which may be involved in RNA binding. This encoded protein together with PCBP-2 also functions as translational coactivators of poliovirus RNA via a sequence-specific interaction with stem-loop IV of the IRES and promote poliovirus RNA replication by binding to its 5@#$%&-terminal cloverleaf structure. It has also been implicated in translational control of the 15-lipoxygenase mRNA, human Papillomavirus type 16 L2 mRNA, and hepatitis A virus RNA. The encoded protein is also suggested to play a part in formation of a sequence-specific alpha-globin mRNP complex which is associated with alpha-globin mRNA stability.
13983	
13984	
13985	
13986	
13987	
13988	
13989	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein that is upregulated in response to treatment with beta-interferon (IFNB) and a protein kinase C-activating compound, mezerein (MEZ). Irreversible reprogramming of melanomas can be achieved by treatment with both these agents; treatment with either agent alone only achieves reversible differentiation.
13990	
13991	There are at least four distinct but related alkaline phosphatases: intestinal, placental, placental-like, and liver/bone/kidney (tissue non-specific). The first three are located together on chromosome 2 while the tissue non-specific form is located on chromosome 1. The exact physiological function of the alkaline phosphatases is not known. The product of this gene is a membrane bound glycosylated enzyme, localized to testis, thymus and certain germ cell tumors, that is closely related to both the placental and intestinal forms of alkaline phosphatase.
13992	
13993	Sepiapterin reductase (7,8-dihydrobiopterin:NADP+ oxidoreductase; EC 1.1.1.153) catalyzes the NADPH-dependent reduction of various carbonyl substances, including derivatives of pteridines, and belongs to a group of enzymes called aldo-keto reductases. SPR plays an important role in the biosynthesis of tetrahydrobiopterin (see MIM 261640).[supplied by OMIM]
13994	This gene encodes a large abundant protein of striated muscle. The product of this gene is divided into two regions, a N-terminal I-band and a C-terminal A-band. The I-band, which is the elastic part of the molecule, contains two regions of tandem immunoglobulin domains on either side of a PEVK region that is rich in proline, glutamate, valine and lysine. The A-band, which is thought to act as a protein-ruler, contains a mixture of immunoglobulin and fibronectin repeats, and possesses kinase activity. A N-terminal Z-disc region and a C-terminal M-line region bind to the Z-line and M-line of the sarcomere respectively so that a single titin molecule spans half the length of a sarcomere. Titin also contains binding sites for muscle associated proteins so it serves as an adhesion template for the assembly of contractile machinery in muscle cells. It has also been identified as a structural protein for chromosomes. Considerable variability exists in the I-band, the M-line and the Z-disc regions of titin. Variability in the I-band region contributes to the differences in elasticity of different titin isoforms and, therefore, to the differences in elasticity of different muscle types. Of the many titin variants identified, five for which complete transcript information is available are described. Mutations in this gene are associated with familial hypertrophic cardiomyopathy 9 and autoantibodies to titin are produced in patients with the autoimmune disease scleroderma.
13995	
13996	GPR75 is a member of the G protein-coupled receptor family. GPRs are cell surface receptors that activate guanine-nucleotide binding proteins upon the binding of a ligand.[supplied by OMIM]
13997	
13998	
13999	
14000	
14001	This gene encodes a protein that belongs to the inositol phosphokinase (IPK) family. This protein is likely responsible for the conversion of inositol hexakisphosphate (InsP6) to diphosphoinositol pentakisphosphate (InsP7/PP-InsP5). It may also convert 1,3,4,5,6-pentakisphosphate (InsP5) to PP-InsP4 and affect the growth suppressive and apoptotic activities of interferon-beta in some ovarian cancers. Alternative splicing results in multiple transcript variants encoding different isoforms.
14002	
14003	The protein encoded by this gene contains an acidic domain, a putative bipartite nuclear localization signal, a nuclear export signal, a leucine-isoleucine zipper, and phosphorylation motifs. In addition, the encoded protein contains Armadillo/beta-catenin-like repeats, which have been implicated in protein-protein interactions. Although the function of this protein has not been determined, the C-terminal portion of the protein has been shown to possess apoptosis-inducing activity.
14004	This gene encodes a member of the WAP-type four-disulfide core (WFDC) domain family. The WFDC domain, or WAP signature motif, contains eight cysteines forming four disulfide bonds at the core of the protein, and functions as a protease inhibitor. Most WFDC gene members are localized to chromosome 20q12-q13 in two clusters: centromeric and telomeric. This gene belongs to the centromeric cluster.
14005	Oxygenation levels play an important role in the regulation of cellular invasiveness which occurs during early implantation when the trophoblast cells invade the uterus as well as during tumour progression and metastasis. This gene, which is regulated by oxygen tension, is induced in hypoxic trophoblast cells and is overexpressed in ovarian cancer. Two protein isoforms are encoded by transcript variants of this gene.
14006	Dolichol-phosphate mannose (Dol-P-Man) serves as a donor of mannosyl residues on the lumenal side of the endoplasmic reticulum (ER).  Lack of Dol-P-Man results in defective surface expression of GPI-anchored proteins.  Dol-P-Man is synthesized from GDP-mannose and dolichol-phosphate on the cytosolic side of the ER by the enzyme dolichyl-phosphate mannosyltransferase.  Human DPM1 lacks a carboxy-terminal transmembrane domain and signal sequence and is regulated by DPM2.
14007	
14008	The bone morphogenetic proteins (BMPs) are a family of secreted signaling molecules that can induce ectopic bone growth. Many BMPs are part of the transforming growth factor-beta (TGFB) superfamily. BMPs were originally identified by an ability of demineralized bone extract to induce endochondral osteogenesis in vivo in an extraskeletal site. Based on its expression early in embryogenesis, the BMP encoded by this gene has a proposed role in early development. In addition, the fact that this BMP is closely related to BMP5 and BMP7 has lead to speculation of possible bone inductive activity.
14009	
14010	Angiopoietins are proteins with important roles in vascular development and angiogenesis. All angiopoietins bind with similar affinity to an endothelial cell-specific tyrosine-protein kinase receptor. The mechanism by which they contribute to angiogenesis is thought to involve regulation of endothelial cell interactions with supporting perivascular cells. The protein encoded by this gene functions as an agonist and is an angiopoietin.
14011	This gene is considered a member of a gene family predicted to encode transmembrane proteins. The specific function of this gene is unknown; however, expression in the inner ear suggests that it may be crucial for normal auditory function. Mutations in this gene may underlie hereditary disorders of balance and hearing.
14012	The protein encoded by this gene hydrolyzes inosine triphosphate and deoxyinosine triphosphate to the monophosphate nucleotide and diphosphate. The encoded protein, which is a member of the HAM1 NTPase protein family, is found in the cytoplasm and acts as a homodimer. Defects in the encoded protein can result in inosine triphosphate pyrophosphorylase deficiency. Two transcript variants encoding two different isoforms have been found for this gene. Also, at least two other transcript variants have been identified which are probably regulatory rather than protein-coding.
14013	
14014	The protein encoded by this gene is one of the highly conserved mini-chromosome maintenance proteins (MCM) that are essential for the initiation of eukaryotic genome replication. The hexameric protein complex formed by the MCM proteins is a key component of the pre-replication complex (pre_RC) and may be involved in the formation of replication forks and in the recruitment of other DNA replication related proteins. This protein contains the central domain that is conserved among the MCM proteins. This protein has been shown to co-immunoprecipitate with MCM4, 6 and 7, which suggests that it may interact with other MCM proteins and play a role in DNA replication. Alternatively spliced transcript variants encoding distinct isoforms have been described.
14015	
14016	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and the kininogens. The type 2 cystatin proteins are a class of cysteine proteinase inhibitors found in a variety of human fluids and secretions. This gene encodes a glycosylated cysteine protease inhibitor with a putative role in immune regulation through inhibition of a unique target in the hematopoietic system. Expression of the protein has been observed in various human cancer cell lines established from malignant tumors.
14017	
14018	This gene encodes a member of the glycosyltransferase O-Fuc family. This enzyme adds O-fucose through an O-glycosidic linkage to conserved serine or threonine residues in the epidermal growth factor-like repeats of a number of cell surface and secreted proteins. O-fucose glycans are involved in ligand-induced receptor signaling. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
14019	The protein encoded by this gene is an integral plasma membrane protein which promotes cell adhesion. The encoded protein is thought to undergo O-linked glycosylation. Expression of this gene has been shown to be induced by gamma interferon in some cancer cells. Two transcript variants encoding the same protein have been found for this gene.
14020	This gene encodes a type I membrane protein with a carbohydrate recognition domain characteristic of C-type lectins in its extracellular portion. In other proteins, this domain is involved in endocytosis of glycoproteins and exogenous sugar-bearing pathogens. This protein localizes predominantly to the perinuclear region.
14021	
14022	The function of the protein encoded by this gene is not well defined, however it contains zinc finger motifs suggestive of a transcription factor. The protein (isoform 1) is localized to both the nucleus and cytoplasm. Three splice variant mRNAs products have been described [1]. The longer AIRE-1 mRNA appears to be more abundant and includes exons 1 through 14. Splice variant AIRE-2 includes a portion of the non-coding region of exon 1, an alternatively spliced longer exon 8, plus exons 9 through 14. Variant AIRE-3 includes the same exon 1-8-9 sequences as found in AIRE-2 but utilizes additional alternative splicing in exon 10 that shifts the reading frame such that a stop codon in exon 12 is utilized. The resulting protein products of these splice variants are differ significantly. Defects in this gene cause the autosomal-recessive systemic autoimmune disease termed autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED).
14023	Members of the trefoil family are characterized by having at least one copy of the trefoil motif, a 40-amino acid domain that contains three conserved disulfides. They are stable secretory proteins expressed in gastrointestinal mucosa. Their functions are not defined, but they may protect the mucosa from insults, stabilize the mucus layer, and affect healing of the epithelium. This gene, which is expressed in the gastric mucosa, has also been studied because of its expression in human tumors. This gene and two other related trefoil family member genes are found in a cluster on chromosome 21.
14024	The semaphorins are a family of proteins that are involved in signaling. All the family members have a secretion signal, a 500-amino acid sema domain, and 16 conserved cysteine residues (Kolodkin et al., 1993). Sequence comparisons have grouped the secreted semaphorins into 3 general classes, all of which also have an immunoglobulin domain. The semaphorin III family, consisting of human semaphorin III (SEMA3A; MIM 603961), chicken collapsin, and mouse semaphorins A, D, and E, all have a basic domain at the C terminus. Chicken collapsin contributes to path finding by axons during development by inhibiting extension of growth cones (Luo et al., 1993) through an interaction with a collapsin response mediator protein of relative molecular mass 62K (CRMP62) (Goshima et al., 1995), a putative homolog of an axonal guidance associated UNC33 gene product (MIM 601168). SEMA3F is a secreted member of the semaphorin III family.[supplied by OMIM]
14025	
14026	The protein encoded by this gene is a member of the PP2C family of Ser/Thr protein phosphatases. PP2C family members are known to be negative regulators of cell stress response pathways. This phosphatase can interact with Rho guanine nucleotide exchange factors (PIX), and thus block the effects of p21-activated kinase 1 (PAK), a protein kinase mediating biological effects downstream of Rho GTPases. Calcium/calmodulin-dependent protein kinase II gamma (CAMK2G/CAMK-II) is found to be one of the substrates of this phosphatase. The overexpression of this phosphatase or CAMK2G has been shown to mediate caspase-dependent apoptosis. An alternatively spliced transcript variant has been identified, but its full-length nature has not been determined.
14027	The protein encoded by this gene is part of a complex that relieves repressive chromatin structures, allowing the transcriptional machinery to access its targets more effectively. The encoded nuclear protein may also bind to and enhance the DNA joining activity of HIV-1 integrase. This gene has been found to be a tumor suppressor, and mutations in it have been associated with malignant rhabdoid tumors. Two transcript variants encoding different isoforms have been found for this gene.
14028	The protein encoded by this gene may regulate muscle-specific genes when in the nucleus and may influence intracellular trafficking when in the cytoplasm. The encoded protein functions as a homodimer and may interact with F actin. Mutations in this gene are associated with lung cancer.
14029	
14030	
14031	
14032	
14033	
14034	
14035	This nuclear gene encodes a mitochondrial member of the thioredoxin family, a group of small multifunctional redox-active proteins. The encoded protein may play important roles in the regulation of the mitochondrial membrane potential and in protection against oxidant-induced apoptosis.
14036	
14037	Somatostatin acts at many sites to inhibit the release of many hormones and other secretory proteins.  The biological effects of somatostatin are probably mediated by a family of G protein-coupled receptors that are expressed in a tissue-specific manner.  SSTR3 is a member of the superfamily of receptors having seven transmembrane segments and is expressed in highest levels in brain and pancreatic islets.  SSTR3 is functionally coupled to adenylyl cyclase.
14038	
14039	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L3P family of ribosomal proteins. It is located in the cytoplasm. The protein can bind to the HIV-1 TAR mRNA, and it has been suggested that the protein contributes to tat-mediated transactivation. This gene is co-transcribed with the small nucleolar RNA genes U43, U86, U83a, and U83b, which are located in its first, third, fifth, and seventh introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
14040	The protein encoded by this gene is an adaptor protein that mediates the association of the heat shock proteins HSP70 and HSP90. This protein has been shown to be involved in the assembly process of glucocorticoid receptor, which requires the assistance of multiple molecular chaperones. The expression of this gene is reported to be downregulated in colorectal carcinoma tissue suggesting that is a candidate tumor suppressor gene.
14041	
14042	The protein encoded by this gene is a GTPase-activating protein (GAP) which associates with the Golgi apparatus and which is thought to interact with ADP-ribosylation factor 1 (ARF1). The encoded protein likely promotes hydrolysis of ARF1-bound GTP, which is required for the dissociation of coat proteins from Golgi-derived membranes and vesicles. Dissociation of the coat proteins is a prerequisite for the fusion of these vesicles with target compartments. The activity of this protein is sensitive to phospholipids. This gene was originally known as ARFGAP1, but that is now the name of a related but different gene.
14043	
14044	
14045	
14046	This gene encodes one of the two alpha chains of type VIII collagen. The gene product is a short chain collagen and a major component of the basement membrane of the corneal endothelium. The type VIII collagen fibril can be either a homo- or a heterotrimer. Alternatively spliced transcript variants encoding the same isoform have been observed.
14047	This gene product belongs to the zinc finger protein superfamily, members of which are regulatory proteins characterized by nucleic acid-binding zinc finger domains. The encoded protein contains 20 tandemly arrayed C2H2-type zinc fingers, a Kruppel-associated box (KRAB) domain, and a SCAN box. This transcript turns over rapidly and contains 3@#$%& UTR AUUUA motifs, which are often a hallmark of rapid turnover. It is overexpressed in some thyroid papillary carcinomas. This gene is located in a cluster of zinc finger genes at 3p21.
14048	This gene was identified as a locus frequently mutated in hereditary nonpolyposis colon cancer (HNPCC). It is a human homolog of the E. coli DNA mismatch repair gene mutL, consistent with the characteristic alterations in microsatellite sequences (RER+ phenotype) found in HNPCC. Alternatively spliced transcript variants encoding different isoforms have been described, but their full-length natures have not been determined.
14049	
14050	The protein encoded by this gene is a member of the G protein-coupled receptor family, and is a receptor for C-C type chemokines. This receptor has been shown to bind dendritic cell- and T cell-activated chemokines including CCL19/ELC, CCL21/SLC, and CCL25/TECK. Alternatively spliced transcript variants encoding the same protein have been described.
14051	
14052	This gene encodes a protein that is a member of the seven in absentia homolog (SIAH) family. The protein is an E3 ligase and is involved in ubiquitination and proteasome-mediated degradation of specific proteins. The activity of this ubiquitin ligase has been implicated in regulating cellular response to hypoxia.
14053	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Like most members, the encoded protein contains an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain.
14054	
14055	
14056	
14057	
14058	
14059	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. Four sequence-related potassium channel genes - shaker, shaw, shab, and shal - have been identified in Drosophila, and each has been shown to have human homolog(s). This gene encodes a member of the potassium channel, voltage-gated, shal-related subfamily, members of which form voltage-activated A-type potassium ion channels and are prominent in the repolarization phase of the action potential. This member includes two isoforms with different sizes, which are encoded by alternatively spliced transcript variants of this gene.
14060	
14061	
14062	C-alpha-formylglycine (FGly), the catalytic residue in the active site of eukaryotic sulfatases, is posttranslationally generated from a cysteine in the endoplasmic reticulum (ER). The genetic defect of FGly formation causes multiple sulfatase deficiency (MSD; MIM 272200), a lysosomal storage disorder. The SUMF1 gene encodes FGly-generating enzyme (FGE) and is mutated in MSD.[supplied by OMIM]
14063	This gene encodes a family member of actin-related proteins (ARPs), which share significant amino acid sequence identity to conventional actins. Both actins and ARPs have an actin fold, which is an ATP-binding cleft, as a common feature. The ARPs are involved in diverse cellular processes, including vesicular transport, spindle orientation, nuclear migration and chromatin remodeling. This gene encodes a 53 kDa subunit protein of the BAF (BRG1/brm-associated factor) complex in mammals, which is functionally related to SWI/SNF complex in S. cerevisiae and Drosophila; the latter is thought to facilitate transcriptional activation of specific genes by antagonizing chromatin-mediated transcriptional repression. Together with beta-actin, it is required for maximal ATPase activity of BRG1, and for the association of the BAF complex with chromatin/matrix. Three transcript variants that encode two different protein isoforms have been described.
14064	
14065	
14066	This gene encodes a chemokine receptor like protein, which is predicted to be a seven transmembrane protein and most closely related to CCR1. Chemokines and their receptors mediated signal transduction are critical for the recruitment of effector immune cells to the site of inflammation. This gene is expressed at high levels in primary neutrophils and primary monocytes, and is further upregulated on neutrophil activation and during monocyte to macrophage differentiation. The function of this gene is unknown. This gene is mapped to the region where the chemokine receptor gene cluster is located.
14067	CADPS encodes a novel neural/endocrine-specific cytosolic and peripheral membrane protein required for the Ca2+-regulated exocytosis of secretory vesicles. CADPS acts at a stage in exocytosis that follows ATP-dependent priming, which involves the essential synthesis of phosphatidylinositol 4,5-bisphosphate (PtdIns(4,5)P2). Alternative splicing has been observed at this locus and three variants, encoding distinct isoforms, are described.
14068	
14069	Acetyl-Coenzyme A acyltransferase (ACAA1) is an enzyme operative in the beta-oxidation system of the peroxisomes.  Deficiency of this enzyme leads to pseudo-Zellweger syndrome
14070	This gene encodes a membrane protein, which is involved in the increase in intracellular calcium concentration that occurs upon cell adhesion to extracellular matrix. The encoded protein is also a receptor for the C-terminal cell binding domain of thrombospondin, and it may play a role in membrane transport and signal transduction. This gene has broad tissue distribution, and is reduced in expression on Rh erythrocytes. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
14071	
14072	The protein encoded by this gene is a glycoprotein with an approximate molecular weight of 76.5 kDa. It is thought to have been created as a result of an ancient gene duplication event that led to generation of homologous C and N-terminal domains each of which binds 1 ion of ferric iron. The function of this encoded protein is to transport iron from the intestine, reticuloendothelial system, and liver parenchymal cells to all proliferating cells in the body. In addition to its function in iron transport, this protein may also have a physiologic role as granulocyte/pollen-binding protein (GPBP) involved in the removal of certain organic matter/allergins from serum.
14073	
14074	
14075	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L4E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
14076	
14077	The RNA lariat debranching enzyme, or DBR1, specifically hydrolyzes 2-prime-to-5-prime branched phosphodiester bonds at the branch point of excised lariat intron RNA and converts them into linear molecules.[supplied by OMIM]
14078	The protein encoded by this gene belongs to the complex I NDUFB4 subunit family. Mammalian complex I is composed of 45 different subunits. This protein has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone.
14079	
14080	
14081	
14082	
14083	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin shows the highest similarity with cyclin G. The transcript of this gene was found to be expressed constantly during cell cycle progression. The function of this cyclin has not yet been determined.
14084	
14085	
14086	Soluble guanylate cyclase (sGC), a heterodimeric protein consisting of an alpha and a beta subunit, catalyzes the conversion of GTP to the second messenger cGMP and functions as the main receptor for nitric oxide and nitrovasodilator drugs (Zabel et al., 1998).[supplied by OMIM]
14087	This gene is an ortholog of the Drosophila fat gene, which encodes a tumor suppressor essential for controlling cell proliferation during Drosophila development. The gene product is a member of the cadherin superfamily, a group of integral membrane proteins characterized by the presence of cadherin-type repeats. In addition to containing 34 tandem cadherin-type repeats, the gene product has five epidermal growth factor (EGF)-like repeats and one laminin A-G domain. This gene is expressed at high levels in a number of fetal epithelia. Its product probably functions as an adhesion molecule and/or signaling receptor, and is likely to be important in developmental processes and cell communication. Transcript variants derived from alternative splicing and/or alternative promoter usage exist, but they have not been fully described.
14088	
14089	This gene product belongs to the UNC-5 family of netrin receptors. Netrins are secreted proteins that direct axon extension and cell migration during neural development. They are bifunctional proteins that act as attractants for some cell types and as repellents for others, and these opposite actions are thought to be mediated by two classes of receptors. The UNC-5 family of receptors mediate the repellent response to netrin; they are transmembrane proteins containing 2 immunoglobulin (Ig)-like domains and 2 type I thrombospondin motifs in the extracellular region.
14090	G protein-coupled receptors (GPCRs, or GPRs) contain 7 transmembrane domains and transduce extracellular signals through heterotrimeric G proteins.[supplied by OMIM]
14091	
14092	This intronless gene encodes a protein that may play a role in organelle biogenesis associated with melanosomes, platelet dense granules, and lysosomes. A similar protein in mouse is a component of a protein complex termed biogenesis of lysosome-related organelles complex 1 (BLOC-1), and is a model for Hermansky-Pudlak syndrome. The encoded protein may play a role in intracellular vesicular trafficking.
14093	The protein encoded by this gene is a member of the SEC24 subfamily of the SEC23/SEC24 family, which is involved in vesicle trafficking. The encoded protein has similarity to yeast Sec24p component of COPII. COPII is the coat protein complex responsible for vesicle budding from the ER. The role of this gene product is implicated in the shaping of the vesicle, and also in cargo selection and concentration.
14094	
14095	
14096	The protein encoded by this gene belongs to the annexin family of calcium-dependent phospholipid binding proteins some of which have been implicated in membrane-related events along exocytotic and endocytotic pathways. Annexin 5 is a phospholipase A2 and protein kinase C inhibitory protein with calcium channel activity and a potential role in cellular signal transduction, inflammation, growth and differentiation. Annexin 5 has also been described as placental anticoagulant protein I, vascular anticoagulant-alpha, endonexin II, lipocortin V, placental protein 4 and anchorin CII. The gene spans 29 kb containing 13 exons, and encodes a single transcript of approximately 1.6 kb and a protein product with a molecular weight of about 35 kDa.
14097	
14098	
14099	
14100	The protein encoded by this gene is a putative ATP-dependent RNA helicase implicated in pre-mRNA splicing.
14101	
14102	
14103	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
14104	
14105	
14106	
14107	This gene encodes a WD repeat protein, which interacts with Cockayne syndrome type B (CSB) protein and with p44 protein, a subunit of the RNA polymerase II transcription factor IIH. Mutations in this gene have been identified in patients with hereditary disease Cockayne syndrome (CS). The CS cells are abnormally sensitive to ultraviolet radiation and are defective in the repair of transcriptionally active genes. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
14108	
14109	
14110	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. The protein may be involved in the differentiation of developing male germ cells. Two transcript variants encoding distinct isoforms have been identified for this gene.
14111	Aminoacyl-tRNA synthetases catalyze the aminoacylation of tRNA by their cognate amino acid. Because of their central role in linking amino acids with nucleotide triplets contained in tRNAs, aminoacyl-tRNA synthetases are thought to be among the first proteins that appeared in evolution. Arginyl-tRNA synthetase belongs to the class-I aminoacyl-tRNA synthetase family.
14112	The receptor for hyaluronan mediated motility has been reported to mediate migration, transformation, and metastatic spread of murine fibroblasts. In human, it is expressed as an intracellular protein in breast cancer cells.  No correlation between the level of HMMR mRNA and protein expression with known metastatic/malignant potential of the tumour cell lines was observed.
14113	Prefoldin 1 is a heterohexameric chaperone protein which assists in the correct folding of other proteins. It binds specifically to cytosolic chaperonin and transfers target proteins. Prefoldin may function by selectively targeting nascent actin and tubulin chains pending their transfer to cytosolic chaperonin for final folding and/or assembly. It promotes folding in an environment in which there are many competing pathways for nonnative proteins.
14114	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene encodes a member of the histone H2A family. Three transcripts that encode different isoforms have been identified for this gene.
14115	The protein encoded by this gene is localized in the nuclear matrix. It may play a role in transcription or may interact with other nuclear matrix proteins to form the internal fibrogranular network. Two transcript variants encoding the same protein have been identified for this gene.
14116	The product of this gene belongs to the ALG3 family. This enzyme adds the first Dol-P-Man derived mannose in an alpha 1,3 linkage to Man5GlcNAc2-PP-Dol. Defects in this gene are the cause of congenital disorder of glycosylation type Id (CDG-Id). This disease is biochemically characterized by abnormal N-glycosylation. Alternatively spliced transcript variants encoding different isoforms have been identified.
14117	
14118	
14119	
14120	
14121	
14122	NNT is a pyridine nucleotide transhydrogenase (EC 1.6.1.1). These integral inner mitochondrial membrane proteins are part of the energy-transfer system of the respiratory chain and catalyze the transfer of a hydride ion between nicotinamide adenine dinucleotide, NAD(H), and oxidized nicotinamide dinucleotide phosphate, NADP(H).[supplied by OMIM]
14123	ITGA2 encodes integrin alpha chain 2. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. The I-domain containing alpha integrin 2 combines with beta integrin 1 to form a collagen-binding integrin referred to as glycoprotein Ia/IIa when expressed on on platelets, and very late (activation) antigen 2 (@#$%&VLA-2@#$%&) when found on T-cells. In addition to adhesion, integrins are known to participate in cell-surface mediated signalling.
14124	Aminoacyl-tRNA synthetases catalyze the aminoacylation of tRNA by their cognate amino acid. Because of their central role in linking amino acids with nucleotide triplets contained in tRNAs, aminoacyl-tRNA synthetases are thought to be among the first proteins that appeared in evolution. Threonyl-tRNA synthetase belongs to the class-II aminoacyl-tRNA synthetase family
14125	The CHD family of proteins is characterized by the presence of chromo (chromatin organization modifier) domains and SNF2-related helicase/ATPase domains.  CHD genes alter gene expression possibly by modification of chromatin structure thus altering access of the transcriptional apparatus to its chromosomal DNA template.
14126	RNX (HOX11L2, TLX3) belongs to a family of orphan homeobox genes that encode DNA-binding nuclear transcription factors. Members of the HOX11 gene family are characterized by a threonine-47 replacing cytosine in the highly conserved homeodomain (Dear et al., 1993).[supplied by OMIM]
14127	Antiquitin is a member of subfamily 7 in the aldehyde dehydrogenase gene family. These enzymes are thought to play a major role in the detoxification of aldehydes generated by alcohol metabolism and lipid peroxidation. This particular member has homology to a previously described protein from the green garden pea, the 26g pea turgor protein. Four human antiquitin-like sequences, possibly pseudogenes, have also been identified.
14128	This gene is highly conserved during evolution and it plays a key role in the regulation of cell division. The encoded protein is a tyrosine phosphatase and belongs to the Cdc25 phosphatase family. It directs dephosphorylation of cyclin B-bound CDC2 and triggers entry into mitosis. It is also thought to suppress p53-induced growth arrest. Multiple alternatively spliced transcript variants of this gene have been described, however, the full-length nature of many of them is not known.
14129	This proto-oncogene, highly-expressed in a variety of tumor cell lines, belongs to the sevenless subfamily of tyrosine kinase insulin receptor genes. The protein encoded by this gene is a type I integral membrane protein with tyrosine kinase activity. The protein may function as a growth or differentiation factor receptor.
14130	
14131	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein that belongs to the L3P ribosomal protein family. A pseudogene corresponding to this gene is found on chromosome 13q.
14132	
14133	
14134	
14135	This intronless gene encodes a member of the MAGE superfamily. It is ubiquitously expressed in normal tissues and in tumor cells. This gene includes a microsatellite repeat in the coding region.
14136	
14137	This gene encodes a member of the 1-acylglycerol-3-phosphate O-acyltransferase family. This integral membrane protein converts lysophosphatidic acid to phosphatidic acid, the second step in de novo phospholipid biosynthesis. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
14138	
14139	The protein encoded by this gene contains a RING finger motif and a FHA domain. This protein has been shown to interact with several class II ubiquitin-conjugating enzymes (E2), including UBE2E1/UBCH6, UBE2E2, and UBE2E3, and may act as an ubiquitin ligase (E3) in the ubiquitination of certain nuclear proteins. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
14140	
14141	Many polypeptide mitogens, such as basic fibroblast growth factor (FGFB; MIM 134920) and platelet-derived growth factors (MIM 173430, MIM 190040), are active on a wide range of different cell types. In contrast, vascular endothelial growth factor is a mitogen primarily for vascular endothelial cells. It is, however, structurally related to platelet-derived growth factor.[supplied by OMIM]
14142	
14143	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S10P family. Pseudogenes corresponding to this gene are found on chromosomes 1q, 3p, and 9p.
14144	
14145	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order   structures. This gene is intronless and encodes a member of the histone H1 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6.
14146	
14147	This gene is expressed in neuroblastoma; however, the function of this gene is not yet determined.
14148	
14149	
14150	This gene encodes a protein found in corneodesmosomes, which localize to human epidermis and other cornified squamous epithelia. During maturation of the cornified layers, the protein undergoes a series of cleavages, which are thought to be required for desquamation. The gene is located in the major histocompatibility complex (MHC) class I region on chromosome 6.
14151	HLA-B belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exon 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-B alleles have been described
14152	This gene encodes a 70kDa heat shock protein. In conjunction with other heat shock proteins, this protein stabilizes existing proteins against aggregation and mediates the folding of newly translated proteins in the cytosol and in organelles. The gene is located in the major histocompatibility complex class III region, in a cluster with two closely related genes which also encode isoforms of the 70kDa heat shock protein.
14153	
14154	Zinc is an essential cofactor for more than 50 classes of enzymes. It is involved in protein, nucleic acid, carbohydrate, and lipid metabolism, as well as in the control of gene transcription, growth, development, and differentiation. Zinc cannot passively diffuse across cell membranes and requires specific transporters, such as SLC39A7, to enter the cytosol from both the extracellular environment and from intracellular storage compartments.[supplied by OMIM]
14155	This gene encodes a protein that is similar to the S. cerevisiae suppressor of actin mutations 2 gene. The rat, mouse, and human proteins share a high percentage of identity to each other (95-98%). This gene lies in a head-to-head orientation with the gene encoding ribosomal protein S18. Two transcript variants have been identified for this gene.
14156	L-glutamate is the major excitatory neurotransmitter in the central nervous system and activates both ionotropic and metabotropic glutamate receptors. Glutamatergic neurotransmission is involved in most aspects of normal brain function and can be perturbed in many neuropathologic conditions. The metabotropic glutamate receptors are a family of G protein-coupled receptors, that have been divided into 3 groups on the basis of sequence homology, putative signal transduction mechanisms, and pharmacologic properties. Group I includes GRM1 and GRM5 and these receptors have been shown to activate phospholipase C. Group II includes GRM2 and GRM3 while Group III includes GRM4, GRM6, GRM7 and GRM8. Group II and III receptors are linked to the inhibition of the cyclic AMP cascade but differ in their agonist selectivities.
14157	Neuromedin B receptor binds neuromedin B, a potent mitogen and growth factor for normal and neoplastic lung and for gastrointestinal epithelial tissue.
14158	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L6P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
14159	
14160	
14161	
14162	
14163	
14164	Laminin, an extracellular protein, is a major component of the basement membrane. It is thought to mediate the attachment, migration, and organization of cells into tissues during embryonic development by interacting with other extracellular matrix components. It is composed of three subunits, alpha, beta, and gamma, which are bound to each other by disulfide bonds into a cross-shaped molecule. This gene encodes the alpha 2 chain, which constitutes one of the subunits of laminin 2 (merosin) and laminin 4 (s-merosin). Mutations in this gene have been identified as the cause of congenital merosin-deficient muscular dystrophy. A transcript variant of this gene has been reported but its full-length nature is not known.
14165	
14166	
14167	
14168	
14169	The protein encoded by this gene belongs to the small GTPase superfamily, Rab family. It may be involved in small GTPase mediated signal transduction and intracellular protein transportation. Alternative splicing occurs at this locus and two transcript variants encoding the same protein have been identified.
14170	
14171	
14172	
14173	
14174	
14175	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
14176	This gene encodes a member of a highly conserved protein family that exists from archaea to vertebrates and plants. The encoded protein may function in RNA metabolism. Mutations within this gene are associated with Shwachman-Bodian-Diamond syndrome. An alternative transcript has been described, but its biological nature has not been determined. This gene has a closely linked pseudogene that is distally located.
14177	
14178	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Alternative splicing of this gene generates two transcript variants.
14179	
14180	
14181	
14182	
14183	
14184	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. Methylation of this gene may result in the loss of its expression and, since the encoded protein upregulates the tumor suppressor p53, this protein may play an important role in tumorigenesis.
14185	Synaptic vesicles dock and fuse in the active zone of the plasma membrane at chemical synapses. The presynaptic cytoskeletal matrix (PCM), which is associated with the active zone and is situated between synaptic vesicles, is thought to be involved in maintaining the neurotransmitter release site in register with the postsynaptic reception apparatus. The cycling of synaptic vesicles is a multistep process involving a number of proteins (see MIM 603215). Among the components of the PCM that orchestrate these events are Bassoon (BSN; MIM 604020), RIM (RBBP8; MIM 604124), Oboe, and Piccolo (PCLO).[supplied by OMIM]
14186	
14187	The JTV1 gene is located on chromosome 7p22 flanked by two genes, HRI and PMS2. JTV1 and HRI overlap slightly and are arranged in a tail-to-tail fashion. JTV1 and PMS2 are separated by approximately 200 base pairs and are arranged head-to-head. JTV1 is transcribed in the opposite direction compared to HRI and PMS2. The function of the JTV1 gene product is unknown.
14188	
14189	This gene encodes a protein that binds to the stem-loop structure in replication-dependent histone mRNAs. Histone mRNAs do not contain introns or polyadenylation signals, and are processed by endonucleolytic cleavage. The stem-loop structure is essential for efficient processing but this structure also controls the transport, translation and stability of histone mRNAs. Expression of the protein is regulated during the cell cycle, increasing more than 10-fold during the latter part of G1.
14190	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
14191	The product of this gene is a membrane-associated protein that colocalizes with huntingtin. This protein has similarities to cytoskeleton proteins and its interaction with huntingtin is thought to play a functional role in the cell filament network. Loss of normal huntingtin-HIP1 interaction in Huntington disease may contribute to a defect in membrane-cytoskeletal integrity in the brain. This gene could help in the understanding of the normal function of huntingtin and also the pathogenesis of Huntington disease. It also has been implicated in the pathogenesis of hematopoietic malignancies. An alternative splice variant of this gene has been described but its full length sequence has not been determined.
14192	
14193	Members of the peripheral membrane-associated guanylate kinase (MAGUK) family function in tumor suppression and receptor clustering by forming multiprotein complexes containing distinct sets of transmembrane, cytoskeletal, and cytoplasmic signaling proteins. All MAGUKs contain a PDZ-SH3-GUK core and are divided into 4 subfamilies, DLG-like (see DLG1; MIM 601014), ZO1-like (see TJP1; MIM 601009), p55-like (see MPP1; MIM 305360), and LIN2-like (see CASK; MIM 300172), based on their size and the presence of additional domains. MPP6 is a member of the p55-like MAGUK subfamily (Tseng et al., 2001).[supplied by OMIM]
14194	
14195	Basic helix-loop-helix (bHLH) transcription factors have been implicated in cell lineage determination and differentiation. The protein encoded by this gene is a bHLH transcription factor and shares similarity with another bHLH transcription factor, Dermo1. The strongest expression of this mRNA is in placental tissue; in adults, mesodermally derived tissues express this mRNA preferentially. Mutations in this gene have been found in patients with Saethre-Chotzen syndrome.
14196	Basic helix-loop-helix (bHLH) transcription factors have been implicated in cell lineage determination and differentiation. The protein encoded by this gene is a bHLH transcription factor and shares similarity with another bHLH transcription factor, Dermo1. The strongest expression of this mRNA is in placental tissue; in adults, mesodermally derived tissues express this mRNA preferentially. Mutations in this gene have been found in patients with Saethre-Chotzen syndrome.
14197	The protein encoded by this gene is a peripheral membrane protein. It is sensitive to brefeldin A. This encoded protein contains a GRIP domain which is thought to be used in targeting. It may play a role in the organization of trans-Golgi network subcompartment involved with membrane transport.
14198	The protein encoded by this gene is a peripheral membrane protein. It is sensitive to brefeldin A. This encoded protein contains a GRIP domain which is thought to be used in targeting. It may play a role in the organization of trans-Golgi network subcompartment involved with membrane transport.
14199	
14200	
14201	
14202	Calcium-permeable channels, such as TRPV6, participate in neurotransmission, muscle contraction, and exocytosis by providing calcium as an intracellular second messenger. Depending on the tissue, transcellular calcium transport may be regulated by vitamin D, parathyroid hormone (PTH; MIM 168450), or calcitonin (CALCA; MIM 114130).[supplied by OMIM]
14203	
14204	
14205	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. Oligomers of the Sec61 complex form a transmembrane channel where proteins are translocated across and integrated into the ER membrane. This complex consists of three membrane proteins- alpha, beta, and gamma. This gene encodes the gamma-subunit protein. Alternatively spliced transcript variants encoding the same protein have been identified.
14206	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This enzyme has a low level of testosterone hydroxylase activity. Although it bears homology to some drug-metabolizing cytochrome P450s, it is unknown whether the enzyme is also involved in xenobiotic metabolism. This gene is part of a cluster of cytochrome P450 genes on chromosome 7q21.1. Alternate splicing of this gene results in three transcript variants encoding different isoforms.
14207	
14208	
14209	
14210	This gene encodes a member of the RRM family of RNA-binding proteins. The RRM domain is between 80-100 amino acids in length and family members contain one to four copies of the domain. The RRM domain consists of two short stretches of conserved sequence called RNP1 and RNP2, as well as a few highly conserved hydrophobic residues. The protein encoded by this gene has a single, putative RRM domain in its N-terminus. Alternative splicing results in multiple transcript variants encoding different isoforms.
14211	
14212	
14213	This gene encodes a member of the von Willebrand factor A domain containing protein family. This family of proteins is thought to be involved in the formation of filamentous networks in the extracellular matrices of various tissues. This protein contains five von Willebrand factor A domains. The specific function of this gene has not yet been determined. Two transcript variants encoding different isoforms have been found for this gene.
14214	This gene encodes a trypsin inhibitor. The specific function of this protein has not yet been determined; however, this protein shares similarity to insect venom allergens, mammalian testis-specific proteins and plant pathogenesis-related proteins. It is frequently expressed in human neuroblastoma and glioblastoma cell lines, and thus may play a role in the central nervous system.
14215	Initially named for its response to in vivo retinal oxygen levels (designated ORP1 for @#$%&oxygen-regulated protein-1@#$%&), this gene was subsequently linked to autosomal dominant retinitis pigmentosa and was renamed RP1 for @#$%&retinitis pigmentosa 1@#$%&. The data suggest that mutations in this gene cause dominant RP, and that the encoded protein has an important but unknown role in photoreceptor biology.
14216	
14217	This gene encodes a myotubularin-related protein that is atypical to most other members of the myotubularin-related protein family because it has no dual-specificity phosphatase domain. The encoded protein contains a double-helical motif similar to the SET interaction domain, which is thought to have a role in the control of cell proliferation. In mouse, a protein similar to the encoded protein binds with MTMR7, and together they dephosphorylate phosphatidylinositol 3-phosphate and inositol 1,3-bisphosphate.
14218	
14219	The protein encoded by this gene is one of the eight subunits of COP9 signalosome, a highly conserved protein complex that functions as an important regulator in multiple signaling pathways. The structure and function of COP9 signalosome is similar to that of the 19S regulatory particle of 26S proteasome. COP9 signalosome has been shown to interact with SCF-type E3 ubiquitin ligases and act as a positive regulator of E3 ubiquitin ligases. This protein is reported to be involved in the degradation of  cyclin-dependent kinase inhibitor CDKN1B/p27Kip1. It is also known to be an coactivator that increases the specificity of JUN/AP1 transcription factors.
14220	
14221	
14222	Chemokines are a group of small (approximately 8 to 14 kD), mostly basic, structurally related molecules that regulate cell trafficking of various types of leukocytes through interactions with a subset of 7-transmembrane, G protein-coupled receptors. Chemokines also play fundamental roles in the development, homeostasis, and function of the immune system, and they have effects on cells of the central nervous system as well as on endothelial cells involved in angiogenesis or angiostasis. Chemokines are divided into 2 major subfamilies, CXC and CC, based on the arrangement of the first 2 of the 4 conserved cysteine residues; the 2 cysteines are separated by a single amino acid in CXC chemokines and are adjacent in CC chemokines. CXC chemokines are further subdivided into ELR and non-ELR types based on the presence or absence of a glu-leu-arg sequence adjacent and N terminal to the CXC motif.[supplied by OMIM]
14223	
14224	This gene encodes an accessory enzyme which participates in the beta-oxidation and metabolism of unsaturated fatty enoyl-CoA esters.
14225	Defensins form a family of microbicidal and cytotoxic peptides made by neutrophils. Members of the defensin family are highly similar in protein sequence. This gene encodes defensin, beta 1, an antimicrobial peptide implicated in the resistance of epithelial surfaces to microbial colonization. This gene maps in close proximity to defensin family member, defensin, alpha 1 and has been implicated in the pathogenesis of cystic fibrosis.
14226	
14227	
14228	
14229	The protein encoded by this gene is a member of the TNF-receptor superfamily. This protein is an osteoblast-secreted decoy receptor that functions as a negative regulator of bone resorption. This protein specifically binds to its ligand, osteoprotegerin ligand (TNFSF11/OPGL), both of which are key extracellular regulators of osteoclast development. Studies of the mouse counterpart also suggest this protein and its ligand play a role in lymph-node organogenesis and vascular calcification. Alternatively spliced transcript variants of this gene have been reported, but their full length nature has not been determined.
14230	
14231	
14232	
14233	
14234	
14235	The PAX5 gene is a member of the paired box (PAX) family of transcription factors.  The central feature of this gene family is a novel, highly conserved DNA-binding motif, known as the paired box.  The PAX proteins are important regulators in early development, and alterations in the expression of their genes are thought to contribute to neoplastic transformation.  The PAX5 gene encodes the B-cell lineage specific activator protein (BSAP) that is expressed at early, but not late stages of B-cell differentiation.  Its expression has also been detected in developing CNS and testis, therefore, PAX5 gene product may not only play an important role in B-cell differentiation, but also in neural development and spermatogenesis.  The PAX5 gene is located in chromosome 9p13 region, which is involved in t(9;14)(p13;q32) translocations recurring in small lymphocytic lymphomas of the plasmacytoid subtype, and in derived large-cell lymphomas.  This translocation brings the potent E-mu enhancer of the IgH gene into close proximity of the PAX5 promoters, suggesting that the deregulation of PAX5 gene transcription contributes to the pathogenesis of these lymphomas.  A transcript variant arising as a consequence of alternative promoter usage, and containing a different coding exon 1(B), has been described, however, its full-length nature is not known.
14236	
14237	FANCG, involved in  Fanconi anemia, confers resistance to both hygromycin and mitomycin C. FANCG contains a 5-prime GC-rich untranslated region characteristic of housekeeping genes. The putative 622-amino acid protein has a leucine-zipper motif at its N-terminus.  Fanconi anemia is an autosomal recessive disorder with diverse clinical symptoms, including developmental anomalies, bone marrow failure, and early occurrence of malignancies. A minimum of 8 FA genes have been identified. The FANCG gene is responsible for complementation group G.
14238	Intercellular communication is often mediated by receptors on the surface of one cell that recognize and are activated by specific protein ligands released by other cells. Members of one class of cell surface receptors, receptor tyrosine kinases (RTKs), are characterized by having a cytoplasmic domain containing intrinsic tyrosine kinase activity. This kinase activity is regulated by the binding of a cognate ligand to the extracellular portion of the receptor. DeChiara et al. (1996) noted that the RTKs, known to be expressed in cell type-specific fashions, play a role critical for the growth and differentiation of those cell types. For example, members of the neural-specific TRK family that recognize nerve growth factor are absolutely required for the survival and development of discrete neuronal subpopulations, and the receptor tyrosine kinases TIE1 (MIM 600222) and TIE2 (MIM 600221) play a critical role in the development of normal blood vessels.[supplied by OMIM]
14239	The protein encoded by this gene delays the onset of apoptosis and promotes homologous recombination repair of damaged DNA. Mutations in this gene result in Fanconi anemia.
14240	The protein encoded by this gene is a member of the NR1 subfamily of nuclear hormone receptors. It is a DNA-binding protein that can bind as a monomer or as a homodimer to hormone response elements upstream of several genes to enhance the expression of those genes. The specific functions of this protein are not known, but it has been shown to interact with NM23-2, a nucleoside diphosphate kinase involved in organogenesis and differentiation.
14241	
14242	
14243	CKS2 protein binds to the catalytic subunit of the cyclin dependent kinases and is essential for their biological function. The CKS2 mRNA is found to be expressed in different patterns through the cell cycle in HeLa cells, which reflects specialized role for the encoded protein.
14244	
14245	
14246	
14247	
14248	
14249	
14250	The protein encoded by this gene is a liver enzyme that catalyzes the transfer of the bile acid moiety from the acyl-CoA thioester to either glycine or taurine, the second step in the formation of bile acid-amino acid conjugates which serve as detergents in the gastrointestinal tract.
14251	
14252	The protein encoded by this gene functions as a docking protein acting downstream of Tec tyrosine kinase in B cell antigen receptor signaling. The protein is directly phosphorylated by Tec in vitro where it participates in a postive feedback loop, increasing Tec activity. A mouse ortholog, stem cell adaptor protein 1, shares 83% identity with its human counterpart.
14253	
14254	
14255	Interleukin 11 is a stromal cell-derived cytokine that belongs to a family of pleiotropic and redundant cytokines that use the gp130 transducing subunit in their high affinity receptors. This gene encodes the IL-11 receptor, which is a member of the hematopoietic cytokine receptor family. This particular receptor is very similar to ciliary neurotrophic factor, since both contain an extracellular region with a 2-domain structure composed of an immunoglobulin-like domain and a cytokine receptor-like domain. Alternative splicing has been observed at this locus and two variants, each encoding a distinct isoform, have been identified.
14256	
14257	The outer dense fibers are cytoskeletal structures that surround the axoneme in the middle piece and principal piece of the sperm tail. The fibers function in maintaining the elastic structure and recoil of the sperm tail as well as in protecting the tail from shear forces during epididymal transport and ejaculation. Defects in the outer dense fibers lead to abnormal sperm morphology and infertility. This gene encodes one of the major outer dense fiber proteins. Multiple protein isoforms are encoded by transcript variants of this gene; however, not all isoforms and variants have been fully described.
14258	
14259	Cyclic AMP-dependent protein kinase (PKA) consists of two catalytic subunits and a regulatory subunit dimer. This gene encodes the gamma form of its catalytic subunit. The gene is intronless and is thought to be a retrotransposon derived from the gene for the alpha form of the PKA catalytic subunit.
14260	
14261	
14262	
14263	
14264	
14265	
14266	
14267	
14268	
14269	
14270	The protein encoded by this gene is located at the mitochondrial inner membrane where it transfers fatty acylcarnitines into mitochondria. Thus it is critical in the fatty acid oxidation process. A defect in this translocase impairs oxidation of fatty acids and can cause a varieties of pathological conditions such as hypoglycemia, cardiac arrest, hepatomegaly, hepatic dysfunction and muscle weakness.
14271	
14272	This gene encodes a cytoplasmic protein tyrosine kinase which is found concentrated in the focal adhesions that form between cells growing in the presence of extracellular matrix constituents. The encoded protein is a member of the FAK subfamily of protein tyrosine kinases but lacks significant sequence similarity to kinases from other subfamilies. Activation of this gene may be an important early step in cell growth and intracellular signal transduction pathways triggered in response to certain neural peptides or to cell interactions with the extracellular matrix. At least four transcript variants encoding four different isoforms have been found for this gene, but the full-length natures of only two of them have been determined.
14273	
14274	
14275	The protein encoded by this gene is a type I transmembrane protein identified as a subunit of the interleukin 12 receptor complex. The coexpression of this and IL12RB1 proteins was shown to lead to the formation of high-affinity IL12 binding sites and reconstitution of IL12 dependent signaling. The expression of this gene is up-regulated by interferon gamma in Th1 cells, and plays a role in Th1 cell differentiation. The up-regulation of this gene is found to be associated with a number of infectious diseases, such as Crohn@#$%&s disease and leprosy, which is thought to contribute to the inflammatory response and host defense.
14276	
14277	
14278	
14279	
14280	
14281	
14282	The activation of gene transcription is a multistep process that is triggered by factors that recognize transcriptional enhancer sites in DNA. These factors work with co-activators to direct transcriptional initiation by the RNA polymerase II apparatus. The protein encoded by this gene is a subunit of the CRSP (cofactor required for SP1 activation) complex, which, along with TFIID, is required for efficient activation by SP1. This protein is also a component of other multisubunit complexes e.g. thyroid hormone receptor-(TR-) associated proteins which interact with TR and facilitate TR function on DNA templates in conjunction with initiation factors and cofactors. This protein contains a bipartite nuclear localization signal. This gene is known to escape chromosome X-inactivation.
14283	
14284	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L34E family of ribosomal proteins. It is located in the cytoplasm. This gene originally was thought to be located at 17q21, but it has been mapped to 4q. Transcript variants derived from alternative splicing, alternative transcription initiation sites, and/or alternative polyadenylation exist; these variants encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
14285	
14286	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 100% identical to the mouse ortholog. It interacts with CDC25 phosphatases, RAF1 and IRS1 proteins, suggesting its role in diverse biochemical activities related to signal transduction, such as cell division and regulation of insulin sensitivity. It has also been implicated in the pathogenesis of small cell lung cancer.
14287	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L34E family of ribosomal proteins. It is located in the cytoplasm. This gene originally was thought to be located at 17q21, but it has been mapped to 4q. Transcript variants derived from alternative splicing, alternative transcription initiation sites, and/or alternative polyadenylation exist; these variants encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
14288	
14289	The protein encoded by this gene acts as receptor for arginine vasopressin. This receptor belongs to the subfamily of G-protein coupled receptors which includes AVPR1A, V2R and OXT receptors. Its activity is mediated by G proteins which stimulate a phosphatidylinositol-calcium second messenger system. The receptor is primarily located in the anterior pituitary, where it stimulates ACTH release. It is expressed at high levels in ACTH-secreting pituitary adenomas as well as in bronchial carcinoids responsible for the ectopic ACTH syndrome. A spliced antisense transcript of this gene has been reported but its function is not known.
14290	The protein encoded by this gene is a Src binding partner. It may represent a potential modulator of actin filament integrity in response to cellular signals, and may function as an adaptor protein by linking Src family members and/or other signaling proteins to actin filaments. Two alternative transcripts encoding the same protein have been identified.
14291	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. The expression pattern of a similar rat protein suggests that this protein may be involved in neurovascular function in the brain.
14292	
14293	
14294	Breakdown products of phosphoinositides are ubiquitous second messengers that function downstream of many G protein-coupled receptors and tyrosine kinases regulating cell growth, calcium metabolism, and protein kinase C activity. This gene encodes an enzyme which regulates the amount of phosphatidylinositol available for signaling by catalyzing the conversion of phosphatidic acid to CDP-diacylglycerol. This enzyme is an integral membrane protein localized to two subcellular domains, the matrix side of the inner mitochondrial membrane where it is thought to be involved in the synthesis of phosphatidylglycerol and cardiolipin and the cytoplasmic side of the endoplasmic reticulum where it functions in phosphatidylinositol biosynthesis. Two genes encoding this enzyme have been identified in humans, one mapping to human chromosome 4q21 and a second to 20p13.
14295	
14296	This gene encodes a protein containing domains found in the La related protein of Drosophila melanogaster. La motif-containing proteins are thought to be RNA-binding proteins, where the La motif and adjacent amino acids fold into an RNA recognition motif. The La motif is also found in proteins unrelated to the La protein. Alternative splicing has been observed at this locus and three variants, encoding distinct isoforms, are described. Additional splice variation has been identified but the full-length nature of these transcripts has not been determined.
14297	This gene encodes a flavoprotein essential for nuclear disassembly in apoptotic cells that is found in the mitochondrial intermembrane space in healthy cells. Induction of apoptosis results in the translocation of this protein to the nucleus where it effects chromosome condensation and fragmentation. In addition, this gene product induces mitochondria to release the apoptogenic proteins cytochrome c and caspase-9. Three alternative transcripts encoding different isoforms have been identified for this gene.
14298	
14299	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein associated with the long isoform of the leptin receptor and with receptor tyrosine kinases for platelet-derived growth factor, insulin, and epidermal growth factor in cell cultures, but its function is unknown. This protein may form oligomeric complexes with family members.
14300	
14301	The protein encoded by this gene is a member of the AAA family of adenosine triphosphatases (ATPases), is related to the Clp protease/heat shock family and is expressed prominently in the substantia nigra pars compacta. Mutations in this gene result in the autosomal dominant disorder, torsion dystonia 1.
14302	
14303	
14304	This gene encodes disintegrin and metalloprotease (ADAM) domain 9, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. The member encoded by this gene interacts with SH3 domain-containing proteins, binds mitotic arrest deficient 2 beta protein, and is also involved in TPA-induced ectodomain shedding of membrane-anchored heparin-binding EGF-like growth factor. Two alternative splice variants have been identified, encoding distinct isoforms.
14305	
14306	This gene encodes a member of the IgLON subfamily in the immunoglobulin protein superfamily. The encoded protein is localized in the plasma membrane and may have an accessory role in opioid receptor function. This gene has an ortholog in rat and bovine. The opioid binding-cell adhesion molecule encoded by the rat gene binds opioid alkaloids in the presence of acidic lipids, exhibits selectivity for mu ligands and acts as a GPI-anchored protein. Since the encoded protein is highly conserved in species during evolution, it may have a fundamental role in mammalian systems. Two transcript variants encoding different isoforms have been found for this gene.
14307	This gene encodes a calcium-independent phospholipid-binding protein whose expression increases in serum-starved cells. This protein has also been shown to be a substrate for protein kinase C (PKC) phosphorylation.
14308	
14309	
14310	
14311	Galactocerebrosides are abundant sphingolipids of the myelin membrane of the central nervous system and peripheral nervous system and are also present in small amounts in kidney. The key enzymatic step in the biosynthesis of galactocerebrosides consists of the transfer of galactose to ceramide catalyzed by UDP-galactose ceramide galactosyltransferase (CGT, EC 2.4.1.45). The enzyme encoded by the CGT gene is the first involved in complex lipid biosynthesis in the myelinating oligodendrocyte.[supplied by OMIM]
14312	
14313	The protein encoded by this gene is a member of the bone morphogenetic protein (BMP) family and the TGF-beta superfamily. This group of proteins is characterized by a polybasic proteolytic processing site which is cleaved to produce a mature protein containing seven conserved cysteine residues. The members of this family are regulators of cell growth and differentiation in both embryonic and adult tissues. The function of this protein is unknown, but expression studies suggest it may be involved in regulation of the adult lymphatic and erythroid systems and embryonic development.
14314	
14315	Dolichol-phosphate mannose (Dol-P-Man) serves as a donor of mannosyl residues on the lumenal side of the endoplasmic reticulum (ER). Lack of Dol-P-Man results in defective surface expression of GPI-anchored proteins. Dol-P-Man is synthesized from GDP-mannose and dolichol-phosphate on the cytosolic side of the ER by the enzyme dolichyl-phosphate mannosyltransferase. The protein encoded by this gene is a subunit of dolichyl-phosphate mannosyltransferase and acts as a stabilizer subunit of the dolichyl-phosphate mannosyltransferase complex.
14316	
14317	
14318	
14319	
14320	
14321	
14322	
14323	
14324	
14325	
14326	
14327	
14328	Regulator of G protein signaling (RGS) proteins are regulatory and structural components of G protein-coupled receptor complexes. RGS proteins are GTPase-activating proteins for Gi (see GNAI1; MIM 139310) and Gq (see GNAQ; MIM 600998) class G-alpha proteins. They accelerate transit through the cycle of GTP binding and hydrolysis and thereby accelerate signaling kinetics and termination.[supplied by OMIM]
14329	
14330	
14331	
14332	
14333	
14334	
14335	
14336	
14337	
14338	
14339	
14340	
14341	
14342	Bone marrow stromal cell antigen-1 is a stromal cell line-derived glycosylphosphatidylinositol-anchored molecule that facilitates pre-B-cell growth. The deduced amino acid sequence exhibits 33% similarity with CD38. BST1 expression is enhanced in bone marrow stromal cell lines derived from patients with rheumatoid arthritis. The polyclonal B-cell abnormalities in rheumatoid arthritis may be, at least in part, attributed to BST1 overexpression in the stromal cell population.
14343	
14344	
14345	The protein encoded by this gene is a member of the membrane-associated guanylate kinase homologue (MAGUK) family. Characterized by two WW domains, a guanylate kinase domain, and five PDZ domains, this protein interacts with the cytoplasmic region of BAI1. Together, these proteins may play a role in cell adhesion and signal transduction.
14346	
14347	
14348	
14349	
14350	
14351	This gene is one member of a family of sulfate/anion transporter genes. Family members are well conserved in their genomic (number and size of exons) and protein (aa length among species) structures yet have markedly different tissue expression patterns. This gene has abundant and specific expression in the kidney. Splice variants that use both alternate transcription initiation and polyadenylation sites have been described for this gene.
14352	
14353	
14354	
14355	
14356	Caltractin belongs to a family of calcium-binding proteins and is a structural component of the centrosome.  The high level of conservation from algae to humans and its association with the centrosome suggested that caltractin plays a fundamental role in the structure and function of the microtubule-organizing center, possibly required for the proper duplication and segregation of the centrosome.
14357	
14358	
14359	
14360	This gene encodes a protein containing domains found in the La related protein of Drosophila melanogaster. La motif-containing proteins are thought to be RNA-binding proteins, where the La motif and adjacent amino acids fold into an RNA recognition motif. The La motif is also found in proteins unrelated to the La protein. Alternative splicing has been observed at this locus and three variants, encoding distinct isoforms, are described. Additional splice variation has been identified but the full-length nature of these transcripts has not been determined.
14361	This gene encodes a member of the myotubularin related family of proteins. Members of this family contain the consensus sequence for the active site of protein tyrosine phosphatases. Alternative splicing results in two transcript variants encoding different isoforms. Additional variants have been described but their biological validity has not been determined.
14362	
14363	GRASP1 is a neuron-specific guanine nucleotide exchange factor for the Ras family of small G proteins (RasGEF) and is associated with the GRIP/AMPA receptor complex in brain.[supplied by OMIM]
14364	FGD1 contains Dbl (DH) and pleckstrin (PH) homology domains. It can bind specifically to the Rho family GTPase Cdc42Hs and stimulate the GDP-GTP exchange of the isoprenylated form of Cdc42Hs. It also stimulates the mitogen activated protein kinase cascade leading to c-Jun kinase SAPK/JNK1 activation. FGD1 has an essential role in embryonic development, and FGD1 gene mutations result in the human developmental disorder, Aarskog-Scott syndrome.
14365	
14366	
14367	Moesin (for membrane-organizing extension spike protein) is a member of the ERM family which includes ezrin and radixin. ERM proteins appear to function as cross-linkers between plasma membranes and actin-based cytoskeletons. Moesin is localized to filopodia and other membranous protrusions that are important for cell-cell recognition and signaling and for cell movement.
14368	Moesin (for membrane-organizing extension spike protein) is a member of the ERM family which includes ezrin and radixin. ERM proteins appear to function as cross-linkers between plasma membranes and actin-based cytoskeletons. Moesin is localized to filopodia and other membranous protrusions that are important for cell-cell recognition and signaling and for cell movement.
14369	
14370	NDP is the genetic locus identified as harboring mutations that result in Norrie disease. Norrie disease is a rare genetic disorder characterized by bilateral congenital blindness that is caused by a vascularized mass behind each lens due to a maldeveloped retina (pseudoglioma).
14371	
14372	This gene encodes a protein that contains a DNA-binding domain. The encoded protein acts as a dominant-negative regulator of transcription which is mediated by the retinoic acid receptor. This protein also functions as an anti-testis gene by acting antagonistically to Sry. Mutations in this gene result in both X-linked congenital adrenal hypoplasia and hypogonadotropic hypogonadism.
14373	
14374	The protein encoded by this gene is thought to be part of a large multisubunit complex involved in the targeting and fusion of endoplasmic reticulum-to-Golgi transport vesicles with their acceptor compartment. In addition, the encoded protein can bind MBP1 and block its transcriptional repression capability. Mutations in this gene are a cause of spondyloepiphyseal dysplasia tarda (SEDT). A processed pseudogene of this gene is located on chromosome 19, and other pseuodogenes of it are found on chromosome 8 and the Y chromosome. Two transcript variants encoding the same protein have been found for this gene.
14375	
14376	
14377	
14378	
14379	Ras-homologous GTPases constitute a large family of signal transducers that alternate between an activated, GTP-binding state and an inactivated, GDP-binding state. These proteins represent cellular switches that are operated by GTP-exchange factors and factors that stimulate their intrinsic GTPase activity. All GTPases of the Ras superfamily have in common the presence of six conserved motifs involved in GTP/GDP binding, three of which are phosphate-/magnesium-binding sites (PM1-PM3) and three of which are guanine nucleotide-binding sites (G1-G3). Transcript variants encoding distinct isoforms have been identified.
14380	This gene encodes a member of the ALEX family of proteins and may play a role in tumor suppression. The encoded protein contains a potential N-terminal transmembrane domain and a single Armadillo (arm) repeat. Other proteins containing the arm repeat are involved in development, maintenance of tissue integrity, and tumorigenesis. This gene is closely localized with other family members, including ALEX1 and ALEX3, on the X chromosome. Two alternative transcripts that encode the same protein but differ in the 5@#$%& UTR have been described. Additional alternative transcripts may exist but their full length natures have not been determined. A pseudogene for this locus is located on chromosome 7.
14381	
14382	This gene belongs to the ESO/LAGE gene family, members of which are clustered together on chromosome Xq28, and have similar exon-intron structures. Unlike the other family members which are normally expressed only in testis and activated in a wide range of human tumors, this gene is ubiquitously expressed in somatic tissues. The latter, combined with the finding that it is highly conserved in mouse and rat, suggests that the encoded protein is functionally important. An intronless pseudogene highly homologous to this gene is located on chromosome 9.
14383	IRAK1 encodes the interleukin-1 receptor-associated kinase 1, one of two putative serine/threonine kinases that become associated with the interleukin-1 receptor (IL1R) upon stimulation. IRAK1 is partially responsible for IL1-induced upregulation of the transcription factor NF-kappa B.
14384	NUDT10 belongs to a subgroup of phosphohydrolases that preferentially attack diphosphoinositol polyphosphates (Hidaka et al., 2002).[supplied by OMIM]
14385	
14386	
14387	The protein encoded by this gene is a member of the Toll-like receptor (TLR) family which plays a fundamental role in pathogen recognition and activation of innate immunity. TLRs are highly conserved from Drosophila to humans and share structural and functional similarities. They recognize pathogen-associated molecular patterns (PAMPs) that are expressed on infectious agents, and mediate the production of cytokines necessary for the development of effective immunity. The various TLRs exhibit different patterns of expression. This gene is predominantly expressed in lung and peripheral blood leukocytes, and lies in close proximity to another family member, TLR7, on chromosome X.
14388	
14389	There are two copies of this pseudogene within the P1 palindrome on chromosome Y. This record represents the more telomeric copy.
14390	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to the histone deacetylase/acuc/apha family. It has histone deacetylase activity and represses transcription when tethered to a promoter. It may participate in the regulation of transcription through its binding with the zinc-finger transcription factor YY1. This protein can also down-regulate p53 function and thus modulate cell growth and apoptosis. This gene is regarded as a potential tumor suppressor gene.
14391	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
14392	
14393	
14394	
14395	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21.  This protein is proposed to be involved in specific calcium-dependent signal transduction pathways and its regulatory effect on cytoskeletal components may modulate various neutrophil activities.
14396	
14397	The second major step in the catabolism of the branched-chain amino acids, isoleucine, leucine, and valine, is catalyzed by the branched-chain alpha-keto acid dehydrogenase complex (BCKD; EC 1.2.4.4), an inner-mitochondrial enzyme complex that consists of 3 catalytic components: a heterotetrameric (alpha2, beta2) branched-chain alpha-ketoacid decarboxylase (E1-alpha; MIM 608348 and E1-beta; MIM 248611), a homo-24-meric dihydrolipoyl transacylase (E2), and a homodimeric dihydrolipoamide dehydrogenase (E3; MIM 246900). The reaction is irreversible and constitutes the first committed step in BCAA oxidation. The complex also contains 2 regulatory enzymes, a kinase and a phosphorylase. The DBT gene encodes the E2 component.[supplied by OMIM]
14398	The protein encoded by this gene is a calcyclin binding protein. It may be involved in calcium-dependent ubiquitination and subsequent proteosomal degradation of target proteins. It probably serves as a molecular bridge in ubiquitin E3 complexes and participates in the ubiquitin-mediated degradation of beta-catenin. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
14399	This gene encodes a member of the latrophilin subfamily of G-protein coupled receptors (GPCR). Latrophilins may function in both cell adhesion and signal transduction. In experiments with non-human species, endogenous proteolytic cleavage within a cysteine-rich GPS (G-protein-coupled-receptor proteolysis site) domain resulted in two subunits (a large extracellular N-terminal cell adhesion subunit and a subunit with substantial similarity to the secretin/calcitonin family of GPCRs) being non-covalently bound at the cell membrane. While several transcript variants have been described, the biological validity of only one has been determined.
14400	
14401	
14402	
14403	This gene encodes the protein elongin A, which is a subunit of the transcription factor B (SIII) complex. The SIII complex is composed of elongins A/A2, B and C. It activates elongation by RNA polymerase II by suppressing transient pausing of the polymerase at many sites within transcription units. Elongin A functions as the transcriptionally active component of the SIII complex, whereas elongins B and C are regulatory subunits. Elongin A2 is specifically expressed in the testis, and capable of forming a stable complex with elongins B and C. The von Hippel-Lindau tumor suppressor protein binds to elongins B and C, and thereby inhibits transcription elongation.
14404	The protein encoded by this gene belongs to the class II cytokine receptor family. This protein forms a receptor complex with interleukine 10 receptor, beta (IL10RB). The receptor complex has been shown to interact with three closely related cytokines, including interleukin 28A (IL28A), interleukin 28B (IL28B), and interleukin 29 (IL29). The expression of all three cytokines can be induced by viral infection. The cells overexpressing this protein have been found to have enhanced responses to IL28A and IL29, but decreased response to IL28B. Three alternatively spliced transcript variants encoding distinct isoforms have been reported.
14405	
14406	The protein encoded by this gene belongs to the DEAD-like helicase superfamily, and shares similarity with Saccharomyces cerevisiae Rad54, a protein known to be involved in the homologous recombination and repair of DNA. This protein has been shown to play a role in homologous recombination related repair of DNA double-strand breaks. The binding of this protein to double-strand DNA induces a DNA topological change, which is thought to facilitate homologous DNA paring, and stimulate DNA recombination.
14407	
14408	This gene is located in the RH gene locus, between the RHD and RHCE genes. The function of its protein product is unknown; however, its sequence has potential transmembrane domains suggesting that it may be an integral membrane protein. Its position between the RH genes suggests that polymorphisms in this gene may be tightly linked to RH haplotypes and may contribute to selective pressure for or against certain RH haplotypes.
14409	The protein encoded by this gene is a homolog of the protein transgelin, which is one of the earliest markers of differentiated smooth muscle. The function of this protein has not yet been determined.
14410	
14411	
14412	HCRTR1, or orexin 1 receptor (OX1R), is a G-protein coupled receptor expressed in the hypothalamus and involved in the regulation of feeding behaviour.  HCRTR1 selectively binds the orexin A neuropeptide.  It shares 64% identity with HCRTR2.
14413	
14414	
14415	
14416	
14417	
14418	The polyamines putrescine, spermine, and spermidine are ubiquitous polycationic mediators of cell growth and differentiation. Spermidine synthase is one of four enzymes in the polyamine-biosynthetic pathway and carries out the final step of spermidine biosynthesis. This enzyme catalyzes the conversion of putrescine to spermidine using decarboxylated S-adenosylmethionine as the cofactor.
14419	
14420	
14421	This gene encodes a cytosolic protein that binds to the signaling molecule, neuronal nitric oxide synthase (nNOS). This protein has a C-terminal PDZ-binding domain that mediates interactions with nNOS and an N-terminal phosphotyrosine binding (PTB) domain that binds to the small monomeric G protein, Dexras1. Studies of the related mouse and rat proteins have shown that this protein functions as an adapter protein linking nNOS to specific targets, such as Dexras1 and the synapsins.
14422	
14423	
14424	The protein encoded by this gene is highly similar to Saccharomyces cerevisiae Nuf2, a component of a conserved protein complex associated with the centromere. Yeast Nuf2 disappears from the centromere during meiotic prophase when centromeres lose their connection to the spindle pole body, and plays a regulatory role in chromosome segregation. This protein is found to be associated with centromeres of mitotic HeLa cells, which suggests that this protein is a functional homolog of yeast Nuf2. Two alternatively spliced transcript variants encoding the same protein have been reported.
14425	CSRP1 is a member of the CSRP family of genes encoding a group of LIM domain proteins, which may be involved in regulatory processes important for development and cellular differentiation.  The LIM/double zinc-finger motif found in CRP1 is found in a group of proteins with critical functions in gene regulation, cell growth, and somatic differentiation  Other genes in the family include CSRP2 and CSRP3.
14426	Aminoacyl-tRNA synthetases catalyze the aminoacylation of tRNA by their cognate amino acid. Because of their central role in linking amino acids with nucleotide triplets contained in tRNAs, aminoacyl-tRNA synthetases are thought to be among the first proteins that appeared in evolution. Two forms of tryptophanyl-tRNA synthetase exist, a cytoplasmic form, named WARS, and a mitochondrial form, named WARS2. This gene encodes the mitochondrial tryptophanyl-tRNA synthetase. Two alternative transcripts encoding different isoforms have been described.
14427	The product of the CHML gene supports geranylgeranylation of most Rab proteins and may substitute for REP-1 in tissues other than retina. CHML is localized close to the gene for Usher syndrome type II.
14428	
14429	
14430	
14431	
14432	The protein encoded by this gene is a member of the peptidyl-prolyl cis-trans isomerase (PPIase) family. PPIases catalyze the cis-trans isomerization of proline imidic peptide bonds in oligopeptides and accelerate the folding of proteins. This protein contains a highly conserved cyclophilin (CYP) domain as well as an RNA-binding domain. It was shown to possess PPIase and protein folding activities and also exhibit RNA-binding activity. Three alternatively spliced transcript variants encoding distinct isoforms have been observed.
14433	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein that is known to complex with integrins. It contributes to the transduction of CD2-generated signals in T cells and natural killer cells and has been suggested to play a role in growth regulation. Familial deficiency of this gene has been linked to an immunodeficiency associated with recurrent infectious diseases caused by bacteria, fungi and viruses.
14434	
14435	CD1D is the sole group-2 member of the CD1 family of major histocompatibility (MHC)-like glycoproteins. See CD1A (MIM 188370) for background on CD1 molecules.[supplied by OMIM]
14436	
14437	
14438	
14439	This gene product belongs to the selenium-binding protein family. Selenium is an essential nutrient that exhibits potent anticarcinogenic properties, and deficiency of selenium may cause certain neurologic diseases. It has been proposed that the effects of selenium in preventing cancer and neurologic diseases may be mediated by selenium-binding proteins. The exact function of this gene is not known.
14440	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S27E family of ribosomal proteins. It contains a C4-type zinc finger domain that can bind to zinc. The encoded protein has been shown to be able to bind to nucleic acid. It is located in the cytoplasm as a ribosomal component, but it has also been detected in the nucleus. Studies in rat indicate that ribosomal protein S27 is located near ribosomal protein S18 in the 40S subunit and is covalently linked to translation initiation factor eIF3. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
14441	
14442	Initiation of transcription by RNA polymerase I requires the formation of a complex composed of the TATA-binding protein (TBP) and three TBP-associated factors (TAFs) specific for RNA polymerase I. This complex, known as SL1, binds to the core promoter of ribosomal RNA genes to position the polymerase properly and acts as a channel for regulatory signals. This gene encodes the smallest SL1-specific TAF. Two transcripts encoding different isoforms have been identified.
14443	
14444	
14445	
14446	This gene encodes a protein that contains a death effector domain (DED). DED is a protein-protein interaction domain shared by adaptors, regulators and executors of the programmed cell death pathway. Overexpression of this gene was shown to induce weak apoptosis. Upon stimulation, this protein was found to translocate from cytoplasm to nucleus and colocalize with UBTF, a basal factor required for RNA polymerase I transcription, in the nucleolus. Alternative splicing of this gene results in two transcript variants which encode the same protein.
14447	
14448	
14449	This gene encodes a member of the eyes absent (EYA) family of proteins. The encoded protein may act as a transcriptional activator and have a role during development. A similar protein in mice can act as a transcriptional activator. Two transcript variants encoding distinct isoforms have been identified for this gene.
14450	
14451	
14452	
14453	
14454	This gene encodes a member of the interferon regulatory transcription factor (IRF) family. Family members share a highly-conserved N-terminal helix-turn-helix DNA-binding domain and a less conserved C-terminal protein-binding domain. Mutations in this gene can cause van der Woude syndrome and popliteal pterygium syndrome. This protein is involved in palate formation.
14455	
14456	
14457	
14458	
14459	
14460	
14461	
14462	
14463	
14464	This gene is one of the at least 20 genes expressed during epidermal differentiation and located on chromosomal band 1q21. This gene is involved in epidermal differentiation, and it is expressed at high levels in normal and psoriatic skin, but not in cultured keratinocytes or in any other tested cell types or tissues.
14465	
14466	This gene encodes a protein belonging to the vacuolar-protein-sorting-13 gene family. In yeast, vacuolar-protein-sorting-13 proteins are involved in trafficking of membrane proteins between the trans-Golgi network and the prevacuolar compartment. While several transcript variants may exist for this gene, the full-length natures of only two have been described to date. These two represent the major variants of this gene and encode distinct isoforms.
14467	This gene encodes a protein belonging to the vacuolar-protein-sorting-13 gene family. In yeast, vacuolar-protein-sorting-13 proteins are involved in trafficking of membrane proteins between the trans-Golgi network and the prevacuolar compartment. While several transcript variants may exist for this gene, the full-length natures of only two have been described to date. These two represent the major variants of this gene and encode distinct isoforms.
14468	
14469	
14470	
14471	This gene encodes one of several forms of glutamic acid decarboxylase, identified as a major autoantigen in insulin-dependent diabetes.  The enzyme encoded is responsible for catalyzing the production of gamma-aminobutyric acid from L-glutamic acid.  A pathogenic role for this enzyme has been identified in the human pancreas since it has been identified as an autoantibody and an autoreactive T cell target in insulin-dependent diabetes.  This gene may also play a role in the stiff man syndrome.
14472	The protein encoded by this gene is a member of the immunoglobulin binding factor family. It is synthesized by the epithelial cells of the prostate gland and secreted into the seminal plasma. This protein has inhibin-like activity. It may have a role as an autocrine paracrine factor in uterine, breast and other female reproductive tissues. The expression of the encoded protein is found to be decreased in prostate cancer. Two alternatively spliced transcript variants encoding different isoforms are described for this gene. The use of alternate polyadenylation sites has been found for this gene.
14473	
14474	
14475	
14476	
14477	
14478	
14479	
14480	The protein encoded by this gene was first identified by its ability to bind the adenovirus E1A protein. The protein localizes to the nucleus. It functions as a transcriptional repressor, and expression of E1A inhibits this repression. Two transcript variants encoding different isoforms have been found for this gene.
14481	
14482	
14483	
14484	The correlation of anti-sperm antibodies with cases of unexplained infertility implicates a role for these antibodies in blocking fertilization. Improved diagnosis and treatment of immunologic infertility, as well as identification of proteins for targeted contraception, are dependent on the identification and characterization of relevant sperm antigens. The protein expressed by this gene is recognized by anti-sperm antibodies from an infertile man. This protein localizes to the tail of permeabilized human sperm and contains eight contiguous armadillo repeats, a motif known to mediate protein-protein interactions. Studies in mice suggest that this protein is involved in sperm flagellar motility and maintenance of the structural integrity of mature sperm. Alternatively spliced variants that encode different protein isoforms have been described but the full-length sequences of only two have been determined.
14485	
14486	
14487	
14488	
14489	This gene belongs to a group of vacuolar protein sorting (VPS) genes. The encoded protein is a component of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. The close structural similarity between the yeast and human proteins that make up this complex suggests a similarity in function. Expression studies in yeast and mammalian cells indicate that this protein interacts directly with VPS35, which serves as the core of the retromer complex.
14490	
14491	
14492	
14493	This protein belongs to the lipocalin family and is the specific carrier for retinol (vitamin A alcohol) in the blood. It delivers retinol from the liver stores to the peripheral tissues. In plasma, the RBP-retinol complex interacts with transthyretin which prevents its loss by filtration through the kidney glomeruli. A deficiency of vitamin A blocks secretion of the binding protein posttranslationally and results in defective delivery and supply to the epidermal cells.
14494	
14495	
14496	
14497	
14498	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has three repeats of quasi-RRM domains that bind to RNAs which have guanosine-rich sequences. This protein is very similar to the family member hnRPH.
14499	Histamine is a ubiquitous messenger molecule released from mast cells, enterochromaffin-like cells, and neurons. Its various actions are mediated by histamine receptors H1, H2, H3 and H4. Histamine receptor H2 belongs to the family 1 of G protein-coupled receptors. It is an integral membrane protein and stimulates gastric acid secretion. It also regulates gastrointestinal motility and intestinal secretion and is thought to be involved in regulating cell growth and differentiation.
14500	
14501	Through alternate splicing, this gene encodes two proteins with similarity to subunits of the vacuolar ATPase (V-ATPase) but the encoded proteins seem to have different functions. V-ATPase is a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, and receptor-mediated endocytosis. V-ATPase is comprised of a cytosolic V1 domain and a transmembrane V0 domain. Mutations in this gene are associated with infantile malignant osteopetrosis.
14502	AIP may play a positive role in AHR-mediated signalling possibly by influencing its receptivity for ligand and/or its nuclear targeting. AIP is the cellular negative regulator of the HBV X protein.
14503	
14504	
14505	The ZFYVE16 gene encodes endofin, an endosomal protein implicated in regulating membrane trafficking. It is characterized by the presence of a phosphatidylinositol 3-phosphate-binding FYVE domain positioned in the middle of the molecule (Seet et al., 2004).[supplied by OMIM]
14506	The protein encoded by this gene contains a region that is highly similar to the N-terminal segment of BCL7A protein. The BCL7A protein is encoded by the gene known to be directly involved in a three-way gene translocation in a Burkitt lymphoma cell line. This gene is located at a chromosomal region commonly deleted in Williams syndrome. The function of this gene has not yet been determined. Two alternatively spliced transcript variants encoding distinct isoforms have been reported.
14507	
14508	
14509	BRMS1 is a human breast carcinoma metastasis suppressor gene that maps to 11q13, a hotspot in breast cancer progression.[supplied by OMIM]
14510	The protein encoded by this gene is a member of the LAR protein-tyrosine phosphatase-interacting protein (liprin) family. Liprins interact with members of LAR family of transmembrane protein tyrosine phosphatases, which are known to be important for axon guidance and mammary gland development. This protein binds to the intracellular membrane-distal phosphatase domain of tyrosine phosphatase LAR, and appears to localize LAR to cell focal adhesions. This interaction may regulate the disassembly of focal adhesion and thus help orchestrate cell-matrix interactions. Alternatively spliced transcript variants encoding distinct isoforms have been described.
14511	The protein encoded by this gene is a member of the LAR protein-tyrosine phosphatase-interacting protein (liprin) family. Liprins interact with members of LAR family of transmembrane protein tyrosine phosphatases, which are known to be important for axon guidance and mammary gland development. This protein binds to the intracellular membrane-distal phosphatase domain of tyrosine phosphatase LAR, and appears to localize LAR to cell focal adhesions. This interaction may regulate the disassembly of focal adhesion and thus help orchestrate cell-matrix interactions. Alternatively spliced transcript variants encoding distinct isoforms have been described.
14512	
14513	
14514	
14515	This gene encodes a subunit of interleukin 12, a cytokine that acts on T and natural killer cells, and has a broad array of biological activities. Interleukin 12 is a disulfide-linked heterodimer composed of the 40 kD cytokine receptor like subunit encoded by this gene, and a 35 kD subunit encoded by IL12A. This cytokine is expressed by activated macrophages that serve as an essential inducer of Th1 cells development. This cytokine has been found to be important for sustaining a sufficient number of memory/effector Th1 cells to mediate long-term protection to an intracellular pathogen. Overexpression of this gene was observed in the central nervous system of patients with multiple sclerosis (MS), suggesting a role of this cytokine in the pathogenesis of the disease. The promoter polymorphism of this gene has been reported to be associated with the severity of atopic and non-atopic asthma in children.
14516	
14517	Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. It is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, F0, which comprises the proton channel. The F1 complex consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled in a ratio of 3 alpha, 3 beta, and a single representative of the other 3. The F0 seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the g subunit of the F0 complex.
14518	The protein encoded by this gene is an apolipoprotein and an important determinant of plasma triglyceride levels, a major risk factor for coronary artery disease. It is a component of high density lipoprotein and is highly similar to a rat protein that is upregulated in response to liver injury. This gene uses alternate polyadenylation sites and is located proximal to the apolipoprotein gene cluster on chromosome 11q23.
14519	The protein encoded by this gene belongs to the heat shock protein 70 family. This gene has three mRNAs from the use of alternative transcription sites. A cis-acting segment is found at the 5@#$%& end of exon 1A which is involved in the stress-dependent induction. The transcript that begins with exon 1B is preferentially induced by hypoxia, resulting in the accumulation of this protein in the endoplasmic reticulum (ER). The protein encoded by this gene is thought to play an important role in protein folding and secretion in the ER. Since suppression of the protein is associated with accelerated apoptosis, it is also suggested to have an important cytoprotective role in hypoxia-induced cellular perturbation. This protein has been shown to be up-regulated in tumors, especially in breast tumors, and thus it is associated with tumor invasiveness. There is also an alternative translation site of this gene which lacks the signal peptide. This signal peptide-lacking protein, is only 3 amino acids shorter than the mature protein in the ER, and it is thought to have a housekeeping function in the cytosol.
14520	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes (E1s), ubiquitin-conjugating enzymes (E2s) and ubiquitin-protein ligases (E3s). This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme is highly similar in primary structure to the enzyme encoded by UBE2L3 gene. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
14521	
14522	This gene encodes a member of the membrane-spanning 4A gene family. Members of this nascent protein family are characterized by common structural features and similar intron/exon splice boundaries and display unique expression patterns among hematopoietic cells and nonlymphoid tissues. Alternative splicing of this gene results in several transcript variants; however, not all transcripts have been fully described.
14523	
14524	
14525	
14526	
14527	This gene is a member of the solute carrier family and encodes a cell surface, transmembrane protein with an alpha amylase domain. The protein exists as the heavy chain of a heterodimer, covalently bound through di-sulfide bonds to one of several possible light chains. It associates with integrins and mediates integrin-dependent signaling related to normal cell growth and tumorigenesis. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
14528	
14529	
14530	The protein encoded by this gene belongs to a group of apparently inactive homologs of ubiquitin-conjugating enzymes. The gene product contains a coiled-coil domain that interacts with stathmin, a cytosolic phosphoprotein implicated in tumorigenesis. The protein may play a role in cell growth and differentiation and act as a negative growth regulator. In vitro steady-state expression of this tumor susceptibility gene appears to be important for maintenance of genomic stability and cell cycle regulation. Mutations and alternative splicing in this gene occur in high frequency in breast cancer and suggest that defects occur during breast cancer tumorigenesis and/or progression.
14531	
14532	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator. Two transcripts encoding different isoforms have been described.
14533	
14534	
14535	The protein encoded by this gene is a member of the TNF receptor associated factor (TRAF) protein family. TRAF proteins are associated with, and mediate signal transduction from members of the TNF receptor superfamily. This protein mediates the signaling not only from the members of the TNF receptor superfamily, but also from the members of the Toll/IL-1 family. Signals from receptors such as CD40, TNFSF11/RANCE and IL-1 have been shown to be mediated by this protein. This protein also interacts with various protein kinases including IRAK1/IRAK, SRC and PKCzeta, which provides a link between distinct signaling pathways. This protein functions as a signal transducer in the NF-kappaB pathway that activates IkappaB kinase (IKK) in response to proinflammatory cytokines. The interaction of this protein with UBE2N/UBC13, and UBE2V1/UEV1A, which are ubiquitin conjugating enzymes catalyzing the formation of polyubiquitin chains, has been found to be required for IKK activation by this protein. Two alternatively spliced transcript variants encoding identical proteins have been reported.
14536	
14537	The protein encoded by this gene contains a RING finger, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions. Multiple alternatively spliced transcript variants encoding distinct isoforms have been reported. Three of them are supported by at least two independent transcripts or ESTs, the full length natures of others are not clear.
14538	
14539	PAK proteins are critical effectors that link RhoGTPases to cytoskeleton reorganization and nuclear signaling. PAK proteins, a family of serine/threonine p21-activating kinases, include PAK1, PAK2, PAK3 and PAK4. These proteins serve as targets for the small GTP binding proteins Cdc42 and Rac and have been implicated in a wide range of biological activities. PAK1 regulates cell motility and morphology. Alternative transcripts of this gene have been found, but their full-length natures have not yet been determined.
14540	This gene encodes a member of the serpin superfamily of serine proteinase inhibitors. Its expression is induced by heat shock. The protein localizes to the endoplasmic reticulum lumen and binds collagen; thus it is thought to be a molecular chaperone involved in the maturation of collagen molecules. Autoantibodies to this protein have been found in patients with rheumatoid arthritis.
14541	
14542	
14543	
14544	This gene encodes a 50-kD subunit of dynactin, a macromolecular complex consisting of 10-11 subunits ranging in size from 22 to 150 kD. Dynactin binds to both microtubules and cytoplasmic dynein. It is involved in a diverse array of cellular functions, including ER-to-Golgi transport, the centripetal movement of lysosomes and endosomes, spindle formation, chromosome movement, nuclear positioning, and axonogenesis. This subunit is present in 4-5 copies per dynactin molecule. It contains three short alpha-helical coiled-coil domains that may mediate association with self or other dynactin subunits. It may interact directly with the largest subunit (p150) of dynactin and may affix p150 in place.
14545	This gene encodes a member of the beta-1,3-N-acetylglucosaminyltransferase protein family. The encoded enzyme is involved in the biosynthesis of poly-N-acetyllactosamine chains and prefers lacto-N-neotetraose as a substrate. It is a type II transmembrane protein.
14546	This gene encodes a subunit of the heterodimeric cytokine interleukin 23 (IL23). IL23 is composed of this protein and the p40 subunit of interleukin 12 (IL12B). The receptor of IL23 is formed by the beta 1 subunit of IL12 (IL12RB1) and an IL23 specific subunit, IL23R. Both IL23 and IL12 can activate the transcription activator STAT4, and stimulate the production of interferon-gamma (IFNG). In contrast to IL12, which acts mainly on naive CD4(+) T cells, IL23 preferentially acts on memory CD4(+) T cells.
14547	The protein encoded by this gene is a member of the STAT protein family. In response to cytokines and growth factors, STAT family members are phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. In response to interferon (IFN), this protein forms a complex with STAT1 and IFN regulatory factor family protein p48 (ISGF3G), in which this protein acts as a transactivator, but lacks the ability to bind DNA directly. Transcription adaptor P300/CBP (EP300/CREBBP) has been shown to interact specifically with this protein, which is thought to be involved in the process of blocking IFN-alpha response by adenovirus.
14548	
14549	The protein encoded by this gene is a member of the bone morphogenetic protein (BMP) family and the TGF-beta superfamily. This group of proteins is characterized by a polybasic proteolytic processing site which is cleaved to produce a mature protein containing seven conserved cysteine residues. The members of this family are regulators of cell growth and differentiation in both embryonic and adult tissues. Studies in mice and Xenopus suggest that this protein is involved in mesodermal formation and neurogenesis during embryonic development.
14550	
14551	
14552	
14553	
14554	
14555	
14556	
14557	
14558	
14559	The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is specifically expressed in the basal layer of the epidermis with family member KRT14. Mutations in these genes have been associated with a complex of diseases termed epidermolysis bullosa simplex. The type II cytokeratins are clustered in a region of chromosome 12q12-q13.
14560	
14561	
14562	The protein encoded by this gene belongs to the phosphoinositide 3-kinase (PI3K) family. PI3-kinases play roles in signaling pathways involved in cell proliferation, oncogenic transformation, cell survival, cell migration, and intracellular protein trafficking. This protein contains a lipid kinase catalytic domain as well as a C-terminal C2 domain, a characteristic of class II PI3-kinases. C2 domains act as calcium-dependent phospholipid binding motifs that mediate translocation of proteins to membranes, and may also mediate protein-protein interactions. The biological function of this gene has not yet been determined.
14563	This gene is a member of the regulatory factor X gene family, which encodes transcription factors that contain a highly-conserved winged helix DNA binding domain. The protein encoded by this gene is structurally related to regulatory factors X1, X2, X3, and X5. It has been shown to interact with itself as well as with regulatory factors X2 and X3, but it does not interact with regulatory factor X1. This protein may be a transcriptional repressor rather than a transcriptional activator. Three transcript variants encoding different isoforms have been described for this gene.
14564	Heterotrimeric guanine nucleotide-binding proteins (G proteins), which integrate signals between receptors and effector proteins, are composed of an alpha, a beta, and a gamma subunit. These subunits are encoded by families of related genes. This gene encodes a beta subunit. Beta subunits are important regulators of alpha subunits, as well as of certain signal transduction receptors and effectors. A single-nucleotide polymorphism (C825T) in this gene is associated with essential hypertension and obesity. This polymorphism is also associated with the occurrence of the splice variant GNB3-s, which appears to have increased activity. GNB3-s is an example of alternative splicing caused by a nucleotide change outside of the splice donor and acceptor sites. Additional splice variants may exist for this gene, but they have not been fully described.
14565	The encoded protein is a homolog of yeast securin proteins, which prevent separins from promoting sister chromatid separation. It is an anaphase-promoting complex (APC) substrate that associates with a separin until activation of the APC. The gene product has transforming activity in vitro and tumorigenic activity in vivo, and the gene is highly expressed in various tumors. The gene product contains 2 PXXP motifs, which are required for its transforming and tumorigenic activities, as well as for its stimulation of basic fibroblast growth factor expression. It also contains a destruction box (D box) that is required for its degradation by the APC. The acidic C-terminal region of the encoded protein can act as a transactivation domain. The gene product is mainly a cytosolic protein, although it partially localizes in the nucleus.
14566	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein that is known to complex with integrins and other transmembrane 4 superfamily proteins. It can modulate cell adhesion and migration and also trigger platelet activation and aggregation. In addition, the protein appears to promote muscle cell fusion and support myotube maintenance.
14567	
14568	
14569	
14570	
14571	Pleckstrin homology domains bind polyphosphoinositides and are involved in platelet cytoskeletal changes. This gene contains a (CAG)n trinucleotide repeat which has been associated with several neurological disorders in which the repeats code for polyglutamine in the protein. This gene may play an important role in the anti-apoptotic effects of insulin-like growth factor I.
14572	
14573	
14574	
14575	
14576	
14577	
14578	
14579	
14580	
14581	
14582	
14583	
14584	
14585	
14586	
14587	
14588	
14589	
14590	
14591	In Aspergillus nidulans, lack of the serine/threonine kinase NimA (never in mitosis A) results in cell cycle arrest in G2, while overexpression causes the premature onset of mitotic events. The protein encoded by this gene is similar in sequence to the Aspergillus nidulans protein and may therefore play a role in mitotic regulation. However, the encoded protein differs from other NimA family members in that it is not cell cycle regulated and is found primarily in the cytoplasm. Three transcript variants have been found for this gene, but the full-length nature of only two of them has been characterized.
14592	
14593	
14594	
14595	
14596	This gene encodes a member of the bicoid subfamily of the paired (PRD) homeobox family of proteins. The encoded protein acts as a transcription factor and may be autoregulatory. A similar protein in mice plays a role in craniofacial and rib cage development during embryogenesis.
14597	Nucleoporins are the main components of the nuclear pore complex (NPC) of eukaryotic cells. They are involved in the bidirectional trafficking of molecules, especially mRNAs and proteins, between the nucleus and the cytoplasm. The protein encoded by this gene does not contain the typical FG repeat sequences found in most vertebrate nucleoporins. Two protein isoforms are encoded by transcript variants of this gene.
14598	
14599	
14600	
14601	
14602	
14603	
14604	
14605	The vertebrate SIX genes are homologs of the Drosophila @#$%&sine oculis@#$%& (so) gene, which is expressed primarily in the developing visual system of the fly. Members of the SIX gene family encode proteins that are characterized by a divergent DNA-binding homeodomain and an upstream SIX domain, which may be involved both in determining DNA-binding specificity and in mediating protein-protein interactions. Genes in the SIX family have been shown to play roles in vertebrate and insect development or have been implicated in maintenance of the differentiated state of tissues.[supplied by OMIM]
14606	TIMM9 belongs to a family of evolutionarily conserved proteins that are organized in heterooligomeric complexes in the mitochondrial intermembrane space. These proteins mediate the import and insertion of hydrophobic membrane proteins into the mitochondrial inner membrane.[supplied by OMIM]
14607	
14608	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit and a member of the S14P family of ribosomal proteins. The protein, which contains a C2-C2 zinc finger-like domain that can bind to zinc, can enhance the tumor suppressor activity of Ras-related protein 1A (KREV1). It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
14609	
14610	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein associates with formin-binding protein 17, but its function is unknown. This protein may form oligomeric complexes with family members.
14611	
14612	
14613	
14614	
14615	
14616	
14617	
14618	
14619	
14620	
14621	
14622	The protein encoded by this gene belongs to the small GTPase superfamily, Rab family. It is associated with both constitutive and regulated secretory pathways, and may be involved in protein transport.
14623	This gene encodes a transmembrane protein that is located in the endoplasmic reticulum and is involved in GPI-anchor biosynthesis. The glycosylphosphatidylinositol (GPI) anchor is a glycolipid found on many blood cells and serves to anchor proteins to the cell surface. This gene is thought to encode a member of a family of dolichol-phosphate-mannose (Dol-P-Man) dependent mannosyltransferases.
14624	
14625	
14626	
14627	
14628	
14629	
14630	
14631	
14632	The protein encoded by this gene contains four EF-hand calcium binding domains, and is a member of the centrin protein family. Centrins are evolutionarily conserved proteins similar to the CDC31 protein of S. cerevisiae. Yeast CDC31 is located at the centrosome of interphase and mitotic cells, where it plays a fundamental role in centrosome duplication and separation. Multiple forms of the proteins similar to the yeast centrin have been identified in human and other mammalian cells, some of which have been shown to be associated with centrosome fractions. This protein appears to be one of the most abundant centrins associated with centrosome, which suggests a similar function to its yeast counterpart.
14633	
14634	GTP cyclohydrolase I feedback regulatory protein binds to and mediates tetrahydrobiopterin inhibition of GTP cyclohydrolase I. The regulatory protein, GCHFR, consists of a homodimer. It is postulated that GCHFR may play a role in regulating phenylalanine metabolism in the liver and in the production of biogenic amine neurotransmitters and nitric oxide.
14635	
14636	
14637	
14638	
14639	
14640	
14641	
14642	This gene encodes the major vault protein which is a lung resistance-related protein. Vaults are multi-subunit structures that may be involved in nucleo-cytoplasmic transport. This protein mediates drug resistance, perhaps via a transport process. It is widely distributed in normal tissues, and overexpressed in multidrug-resistant cancer cells. The protein overexpression is a potentially useful marker of clinical drug resistance. This gene produces two transcripts by using two alternative exon 2 sequences; however, the open reading frames are the same in both transcripts.
14643	
14644	
14645	
14646	
14647	
14648	The protein encoded by this gene binds to the beta-amyloid precursor protein. Beta-amyloid precursor protein is a cell surface protein with signal-transducing properties, and it is thought to play a role in the pathogenesis of Alzheimer@#$%&s disease. In addition, the encoded protein can form a heterodimer with UBE1C and bind and activate NEDD8, a ubiquitin-like protein. This protein is required for cell cycle progression through the S/M checkpoint. Three transcript variants encoding different isoforms have been found for this gene.
14649	
14650	
14651	This gene encodes a member of the polycystin protein family. The encoded glycoprotein contains a large N-terminal extracellular region, multiple transmembrane domains and a cytoplasmic C-tail. It may function as an integral membrane protein involved in cell-cell/matrix interactions, and may modulate intracellular calcium homoeostasis and other signal-transduction pathways. It plays a role in renal tubular development, and mutations in this gene have been associated with autosomal dominant polycystic kidney disease. Splice variants encoding different isoforms have been noted for this gene. Also, six pseudogenes, closely linked in a known duplicated region on chromosome 16p, have been described.
14652	
14653	An antibody that reacts with ovarian cancers and mesotheliomas was used to isolate a cell surface antigen named mesothelin.  Although the function of mesothelin is unknown, it may play a role in cellular adhesion and is present on mesothelium, mesotheliomas, and ovarian cancers.  Three transcript variants have been described for this gene and their protein products were initially named megakaryocyte potentiating factor (MPF), mesothelin, and the soluble member(s) of the mesothelin/MPF family. The protein isoform named MPF is a cytokine with megakaryocyte potentiating activity.  The soluble member encoded by the third splice variant is detected in ovarian carcinoma; the full length sequence of the third variant has not been described.
14654	
14655	
14656	
14657	
14658	
14659	
14660	
14661	
14662	The protein encoded by this gene is a type II membrane protein that catalyzes the transfer of sialic acid from CMP-sialic acid to galactose-containing substrates. The encoded protein is normally found in the Golgi but can be proteolytically processed to a soluble form. This protein, which is a member of glycosyltransferase family 29, can use the same acceptor substrates as does sialyltransferase 4A.
14663	The product of this gene is a subunit of neuronal microtubule-associated MAP1A and MAP1B proteins, which are involved in microtubule assembly and important for neurogenesis. Studies on the rat homolog implicate a role for this gene in autophagy, a process that involves the bulk degradation of cytoplasmic component.
14664	
14665	
14666	
14667	
14668	
14669	Crystallins are separated into two classes: taxon-specific and ubiquitous. The former class is also called phylogenetically-restricted crystallins. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. This gene encodes a taxon-specific crystallin protein that binds NADPH and has sequence similarity to bacterial ornithine cyclodeaminases. The encoded protein does not perform a structural role in lens tissue, and instead it binds thyroid hormone for possible regulatory or developmental roles. Multiple alternatively spliced transcript variants have been found for this gene.
14670	Galanin is an important neuromodulator present in the brain, gastrointestinal system, and hypothalamopituitary axis. It is a 30-amino acid non-C-terminally amidated peptide that potently stimulates growth hormone secretion, inhibits cardiac vagal slowing of heart rate, abolishes sinus arrhythmia, and inhibits postprandial gastrointestinal motility. The actions of galanin are mediated through interaction with specific membrane receptors that are members of the 7-transmembrane family of G protein-coupled receptors. GALR2 interacts with the N-terminal residues of the galanin peptide. The primary signaling mechanism for GALR2 is through the phospholipase C/protein kinase C pathway (via Gq), in contrast to GALR1, which communicates its intracellular signal by inhibition of adenylyl cyclase through Gi. However, it has been demonstrated that GALR2 couples efficiently to both the Gq and Gi proteins to simultaneously activate 2 independent signal transduction pathways.
14671	
14672	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L19E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
14673	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
14674	This gene is located within the Smith-Magenis syndrome region on chromosome 17. It is highly similar to its mouse counterpart and is expressed at high levels mainly in neuronal tissues. The protein encoded by this gene includes a polymorphic polyglutamine tract in the N-terminal domain. Expression of the mouse counterpart in neurons is induced by retinoic acid. This gene is associated with both the severity of the phenotype and the response to medication in schizophrenic patients.
14675	
14676	Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. It is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, F0, which comprises the proton channel. The F1 complex consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled in a ratio of 3 alpha, 3 beta, and a single representative of the other 3. The F0 seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the d subunit of the F0 complex. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene. In addition, three pseudogenes are located on chromosomes 9, 12 and 15.
14677	
14678	
14679	The protein encoded by this gene binds the epidermal growth factor receptor and contains one SH2 domain and two SH3 domains. Its two SH3 domains direct complex formation with proline-rich regions of other proteins, and its SH2 domain binds tyrosine phosphorylated sequences. This gene is similar to the Sem5 gene of C.elegans, which is involved in the signal transduction pathway. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
14680	
14681	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator. Two transcripts encoding different isoforms have been described. A pseudogene has been identified on chromosome 3.
14682	Phosphatidylinositol transfer protein is a member of a diverse set of cytosolic phospholipid transfer proteins that are distinguished by their ability to transfer phospholipids between membranes in vitro (Wirtz, 1991).[supplied by OMIM]
14683	
14684	
14685	Heparan sulfate biosynthetic enzymes are key components in generating a myriad of distinct heparan sulfate fine structures that carry out multiple biologic activities. The enzyme encoded by this gene is a member of the heparan sulfate biosynthetic enzyme family. It is a type II integral membrane protein and possesses heparan sulfate glucosaminyl 3-O-sulfotransferase activity. The sulfotransferase domain of this enzyme is highly similar to the same domain of heparan sulfate D-glucosaminyl 3-O-sulfotransferase 3A1, and these two enzymes sulfate an identical disaccharide. This gene is widely expressed, with the most abundant expression in liver and placenta.
14686	
14687	ITGA2B encodes integrin alpha chain 2b. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. Alpha chain 2b undergoes post-translational cleavage to yield disulfide-linked light and heavy chains that join with beta 3 to form a fibronectin receptor expressed in platelets that plays a crucial role in coagulation. Mutations that interfere with this role result in thrombasthenia. In addition to adhesion, integrins are known to participate in cell-surface mediated signalling.
14688	The CD233 gene is located on chromosome 17q21-q22 and is part of the anion exchanger (AE) family. CD233 is expressed in the erythrocyte plasma membrane where it functions as a chloride/bicarbonate exchanger involved in carbon dioxide transport from tissues to lungs. The protein comprises two domains that are structurally and functionally distinct. The N-terminal 40kDa domain is located in the cytoplasm and acts as an attachment site for the red cell skeleton by binding ankyrin. The glycosylated C-terminal membrane-associated domain contains 12-14 membrane spanning segments and carries out the stilbene disulphonate-sensitive exchange transport of anions. The cytoplasmic tail at the extreme C-terminus of the membrane domain binds carbonic anhydrase II. CD233 associates with the red cell membrane protein glycophorin A and this association promotes the correct folding and translocation of CD233. CD233 is predominantly dimeric but forms tetramers in the presence of ankyrin. Many CD233 mutations are known in man and these mutations can lead to two types of disease; destabilization of red cell membrane leading to hereditary spherocytosis, and defective kidney acid secretion leading to distal renal tubular acidosis. Other CD233 mutations that do not give rise to disease result in novel blood group antigens, which form the Diego blood group system. Southeast Asian ovalocytosis (SAO, Melanesian ovalocytosis) results from the heterozygous presence of a deletion in the CD233 protein and is common in areas where Plasmodium falciparum malaria is endemic. One CD233 null human is known also with very severe anemia and nephrocalcinosis [PROW]
14689	
14690	
14691	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, and White). This encoded protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. This gene is clustered among 4 other ABC1 family members on 17q24, but neither the substrate nor the function of this gene is known. Alternative splicing of this gene results in several transcript variants; however, not all variants have been fully described.
14692	
14693	
14694	This gene encodes a member of a large protein family which contains the LIM domain, a unique cysteine-rich zinc-binding domain. The encoded protein may function as a transcriptional regulator and be involved in control of differentiation and development of neural and lymphoid cells. A similar protein in mice is an essential regulator of the vertebrate head organizer.
14695	
14696	The protein encoded by this gene is a member of the pexin family. It is found in serum and tissues and promotes cell adhesion and spreading, inhibits the membrane-damaging effect of the terminal cytolytic complement pathway, and binds to several serpin serine protease inhibitors. It is a secreted protein and exists in either a single chain form or a clipped, two chain form held together by a disulfide bond.
14697	
14698	
14699	
14700	
14701	
14702	
14703	The protein encoded by this gene is a member of the keratin gene family. As a type I hair keratin, it is an acidic protein which heterodimerizes with type II keratins to form hair and nails. The type I hair keratins are clustered in a region of chromosome 17q12-q21 and have the same direction of transcription.
14704	
14705	CX31.9 is a member of the large family of connexins that are required for the formation of gap junctions. Six connexin monomers form a hemichannel, or connexon, on the cell surface. This connexon can interact with a connexon from a neighboring cell, thus forming a channel linking the cytoplasm of the 2 cells.[supplied by OMIM]
14706	
14707	The protein encoded by this gene is a member of the keratin gene family. As a type I hair keratin, it is an acidic protein which heterodimerizes with type II keratins to form hair and nails. The type I hair keratins are clustered in a region of chromosome 17q12-q21 and have the same direction of transcription.
14708	
14709	
14710	
14711	
14712	
14713	The protein encoded by this gene is part of a nuclear pore complex, Nup107-160. This protein contains WD repeats and shares 34% amino acid identity with yeast Seh1 and 30% identity with yeast Sec13. All constituents of the Nup107-160 complex, including this protein, specifically localize to kinetochores in mitosis. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
14714	CD226 is a ~65 kDa glycoprotein expressed on the surface of NK cells, platelets, monocytes and a subset of T cells. It is a member of the Ig-superfamily containing 2 Ig-like domains of the V-set and is encoded by a gene on human chromosome 18q22.3. CD226 mediates cellular adhesion to other cells bearing an unidentified ligand and cross-linking CD226 with antibodies causes cellular activation [PROW]
14715	The protein encoded by this gene is a type I integral membrane protein localized in the intermediate region between the endoplasmic reticulum and the Golgi, presumably recycling between the two compartments. The protein is a mannose-specific lectin and is a member of a novel family of plant lectin homologs in the secretory pathway of animal cells. Mutations in the gene are associated with a coagulation defect. Using positional cloning, the gene was identified as the disease gene leading to combined factor V-factor VIII deficiency, a rare, autosomal recessive disorder in which both coagulation factors V and VIII are diminished.
14716	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands.
14717	Meprins are multidomain zinc metalloproteases that are highly expressed in mammalian kidney and intestinal brush border membranes and in leukocytes and certain cancer cells. Mature meprins are oligomers of evolutionarily related, separately encoded alpha and/or beta subunits. Homooligomers of meprin-alpha (MEP1A; MIM 600388) are secreted; oligomers containing meprin-beta are associated with the plasma membrane. Substrates include bioactive peptides and extracellular matrix proteins. See MIM 600388 for further information on meprins.[supplied by OMIM]
14718	
14719	
14720	
14721	The protein encoded by this gene has substantial phospholipase activity and may be involved in lipoprotein metabolism and vascular biology. This protein is designated a member of the TG lipase family by its sequence and characteristic lid region which provides substrate specificity for enzymes of the TG lipase family.
14722	
14723	
14724	
14725	
14726	
14727	
14728	
14729	
14730	
14731	
14732	
14733	
14734	This gene encodes a subunit of the heterotetrameric adaptor-related protein comlex 1 (AP-1), which belongs to the adaptor complexes medium subunits family. This protein is capable of interacting with tyrosine-based sorting signals.
14735	Members of the F-box protein family, such as FBXL12, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
14736	Ubiquitin-like proteins (UBLs) are thought to be reversible modulators of protein function rather than protein degraders like ubiquitin (MIM 191339).[supplied by OMIM]
14737	
14738	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and is known to dehydrogenate 3-methylindole, an endogenous toxin derived from the fermentation of tryptophan, as well as xenobiotic substrates such as naphthalene and ethoxycoumarin. This gene is part of a large cluster of cytochrome P450 genes from the CYP2A, CYP2B and CYP2F subfamilies on chromosome 19q.
14739	
14740	
14741	The protein encoded by this gene is a dual specificity protein kinase that belongs to the MAP kinase kinase family. This kinase specifically activates MAPK8/JNK1 and MAPK9/JNK2, and this kinase itself is phosphorylated and activated by MAP kinase kinase kinases including MAP3K1/MEKK1, MAP3K2/MEKK2,MAP3K3/MEKK5, and MAP4K2/GCK. This kinase is involved in the signal transduction mediating the cell responses to proinflammatory cytokines, and environmental stresses. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found, but only one transcript variant has been supported and defined.
14742	
14743	
14744	Aminoacyl-tRNA synthetases are a class of enzymes that charge tRNAs with their cognate amino acids. FARSL encodes a product which is similar to the catalytic subunit of prokaryotic and Saccharomyces cerevisiae phenylalanyl-tRNA synthetases (PheRS). FARSL has been shown to be expressed in a tumor-selective and cell cycle stage- and differentiation-dependent manner, the first member of the tRNA synthetase gene family shown to exhibit this type of regulated expression
14745	
14746	The protein encoded by this gene is one of the highly conserved mini-chromosome maintenance proteins (MCM) that are involved in the initiation of eukaryotic genome replication. The hexameric protein complex formed by MCM proteins is a key component of the pre-replication complex (pre_RC) and may be involved in the formation of replication forks and in the recruitment of other DNA replication related proteins. This protein is a subunit of the protein complex that consists of MCM2-7. It has been shown to interact directly with MCM5/CDC46. This protein also interacts with, and thus is acetlyated by MCM3AP, a chromatin-associated acetyltransferase. The acetylation of this protein inhibits the initiation of DNA replication and cell cycle progression.
14747	MYH14 is a member of the nonmuscle myosin II family of ATP-dependent molecular motors, which interact with cytoskeletal actin and regulate cytokinesis, cell motility, and cell polarity (Golomb et al., 2004).[supplied by OMIM]
14748	
14749	The protein encoded by this gene is a vesicle-bound, sodium-dependent phosphate transporter that is specifically expressed in the neuron-rich regions of the brain. It is preferentially associated with the membranes of synaptic vesicles and functions in glutamate transport. The protein shares 82% identity with the differentiation-associated Na-dependent inorganic phosphate cotransporter and they appear to form a distinct class within the Na+/Pi cotransporter family.
14750	
14751	
14752	Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide.  They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid.  They show extensive diversity in tissue distribution and in their subcellular localization.  CA XI is likely a secreted protein, however, radical changes at active site residues completely conserved in CA isozymes with catalytic activity, make it unlikely that it has carbonic anhydrase activity. It shares properties in common with two other acatalytic CA isoforms, CA VIII and CA X.  CA XI is most abundantly expressed in brain, and may play a general role in the central nervous system.
14753	
14754	The protein encoded by this gene belongs to the protein kinase D (PKD) family of serine/threonine protein kinases. This kinase can be activated by phorbol esters as well as by gastrin via the cholecystokinin B receptor (CCKBR) in gastric cancer cells. It can bind to diacylglycerol (DAG) in the trans-Golgi network (TGN) and may regulate basolateral membrane protein exit from TGN.
14755	
14756	
14757	
14758	
14759	The RYR1 gene encodes the skeletal muscle ryanodine receptor, which serves as a calcium release channel of the sarcoplasmic reticulum as well as a bridging structure connecting the sarcoplasmic reticulum and transverse tubule (MacLennan et al., 1989). See also RYR2 (MIM 180902) and RYR3 (MIM 180903), which encode the cardiac and brain ryanodine receptors, respectively.[supplied by OMIM]
14760	
14761	This gene product is a component of a nucleolar small nuclear ribonucleoprotein (snRNP) particle thought to participate in the first step in processing preribosomal RNA. It is associated with the U3, U8, and U13 small nuclear RNAs and is located in the dense fibrillar component (DFC) of the nucleolus. The encoded protein contains an N-terminal repetitive domain that is rich in glycine and arginine residues, like fibrillarins in other species. Its central region resembles an RNA-binding domain and contains an RNP consensus sequence. Antisera from approximately 8% of humans with the autoimmune disease scleroderma recognize fibrillarin.
14762	This gene product is a component of a nucleolar small nuclear ribonucleoprotein (snRNP) particle thought to participate in the first step in processing preribosomal RNA. It is associated with the U3, U8, and U13 small nuclear RNAs and is located in the dense fibrillar component (DFC) of the nucleolus. The encoded protein contains an N-terminal repetitive domain that is rich in glycine and arginine residues, like fibrillarins in other species. Its central region resembles an RNA-binding domain and contains an RNP consensus sequence. Antisera from approximately 8% of humans with the autoimmune disease scleroderma recognize fibrillarin.
14763	
14764	
14765	Anti-Mullerian hormone is a member of the transforming growth factor-beta gene family which mediates male sexual differentiation. Anti-Mullerian hormone causes the regression of Mullerian ducts which would otherwise differentiate into the uterus and fallopian tubes. Some mutations in the anti-Mullerian hormone result in persistent Mullerian duct syndrome.
14766	This gene encodes a member of the type I phosphatidylinositol-4-phosphate 5-kinase family of enzymes. A similar protein in mice is found in synapses and focal adhesion plaques, and binds the FERM domain of talin through its C-terminus.
14767	This gene encodes a calcium-binding protein, which may play a role in the regulation of ion transport. A similar protein was first described as a potentially important regulatory protein in the dog thyroid and was termed as R2D5 antigen in rabbit. Alternative splicing of this gene generates two transcript variants.
14768	
14769	
14770	This gene encodes an enzyme that is a member of the beta-1,3-N-acetylglucosaminyltransferase family. This enzyme is a type II transmembrane protein and contains a signal anchor that is not cleaved. It prefers the substrates of lacto-N-tetraose and lacto-N-neotetraose, and is involved in the biosynthesis of poly-N-acetyllactosamine chains and the biosynthesis of the backbone structure of dimeric sialyl Lewis a. It plays dominant roles in L-selectin ligand biosynthesis, lymphocyte homing and lymphocyte trafficking.
14771	The protein encoded by this gene is a member of the bone morphogenetic protein (BMP) family and the TGF-beta superfamily. This group of proteins is characterized by a polybasic proteolytic processing site which is cleaved to produce a mature protein containing seven conserved cysteine residues. The members of this family are regulators of cell growth and differentiation in both embryonic and adult tissues. Studies in yeast suggest that the protein is involved in aging. This protein is transcribed from both a monocistronic mRNA and a bicistronic mRNA that also encodes growth differentiation factor 1.
14772	
14773	
14774	
14775	NFKB1 (MIM 164011) or NFKB2 (MIM 164012) is bound to REL (MIM 164910), RELA (MIM 164014), or RELB (MIM 604758) to form the NFKB complex. The NFKB complex is inhibited by I-kappa-B proteins (NFKBIA, MIM 164008, or NFKBIB), which inactivate NF-kappa-B by trapping it in the cytoplasm. Phosphorylation of serine residues on the I-kappa-B proteins by kinases (IKBKA, MIM 600664 or IKBKB, MIM 603258) marks them for destruction via the ubiquitination pathway, thereby allowing activation of the NF-kappa-B complex. Activated NFKB complex translocates into the nucleus and binds DNA at kappa-B-binding motifs such as 5-prime GGGRNNYYCC 3-prime or 5-prime HGGARNYYCC 3-prime (where H is A, C, or T; R is an A or G purine; and Y is a C or T pyrimidine).[supplied by OMIM]
14776	
14777	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of aminophospholipid-transporting ATPases. The aminophospholipid translocases transport phosphatidylserine and phosphatidylethanolamine from one side of a bilayer to another. This gene encodes the member 3 of the phospholipid-transporting ATPase 8B.
14778	
14779	
14780	
14781	
14782	
14783	
14784	
14785	The protein encoded by this gene belongs to the family 1 of glycosyl hydrolases. The protein is integral to plasma membrane and has both phlorizin hydrolase activity and lactase activity.
14786	Activins are dimeric growth and differentiation factors which belong to the transforming growth factor-beta (TGF-beta) superfamily of structurally related signaling proteins. Activins signal through a heteromeric complex of receptor serine kinases which include at least two type I ( I and IB) and two type II (II and IIB) receptors. These receptors are all transmembrane proteins, composed of a ligand-binding extracellular domain with cysteine-rich region, a transmembrane domain, and a cytoplasmic domain with predicted serine/threonine specificity. Type I receptors are essential for signaling; and type II receptors are required for binding ligands and for expression of type I receptors. Type I and II receptors form a stable complex after ligand binding, resulting in phosphorylation of type I receptors by type II receptors. This gene encodes activin A type I receptor which signals a particular transcriptional response in concert with activin type II receptors.
14787	This gene encodes a transcription factor that is a member of the leucine zipper family of DNA binding proteins. This protein binds as a homodimer to the cAMP-responsive element, an octameric palindrome. The protein is phosphorylated by several protein kinases, and induces transcription of genes in response to hormonal stimulation of the cAMP pathway. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
14788	
14789	
14790	
14791	
14792	The protein encoded by this gene is a neuron-specific activator of CDK5 kinase. It associates with CDK5 to form an active kinase. This protein and neuron-specific CDK5 activator CDK5R1/p39NCK5A both share limited similarity to cyclins, and thus may define a distinct family of cyclin-dependent kinase activating proteins.
14793	
14794	Nitric oxide (NO) is a potent mediator in biologic processes such as neurotransmission, inflammatory response, and vascular homeostasis. NOSTRIN binds the enzyme responsible for NO production, endothelial NO synthase (ENOS; MIM 163729), and triggers the translocation of ENOS from the plasma membrane to vesicle-like subcellular structures, thereby attenuating ENOS-dependent NO production.[supplied by OMIM]
14795	The protein encoded by this gene is a cytokine receptor that belongs to the interleukin 1 receptor family. This protein binds interleukin alpha (IL1A), interleukin beta (IL1B), and interleukin 1 receptor, type I(IL1R1/IL1RA), and acts as a decoy receptor that inhibits the activity of its ligands. Interleukin 4 (IL4) is reported to antagonize the activity of interleukin 1 by inducing the expression and release of this cytokine. This gene and three other genes form a cytokine receptor gene cluster on chromosome 2q12. Two alternatively spliced transcript variants encoding the same protein have been reported.
14796	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein that belongs to the EcoL2 ribosomal protein family. A pseudogene corresponding to this gene is found on chromosome 12q.
14797	
14798	
14799	
14800	
14801	
14802	
14803	This gene encodes a protein that acts in the sorting of transmembrane proteins into lysosomes/vacuoles via the multivesicular body (MVB) pathway. This protein, along with other soluble coiled-coil containing proteins, forms part of the ESCRT-III protein complex that binds to the endosomal membrane and recruits additional cofactors for protein sorting into the MVB. This protein may also co-immunoprecipitate with a member of the IFG-binding protein superfamily. Alternative splicing results in multiple transcript variants encoding different isoforms.
14804	The protein encoded by this gene is a member of the interleukin 1 cytokine family. This cytokine is produced by activated macrophages as a proprotein, which is proteolytically processed to its active form by caspase 1 (CASP1/ICE). This cytokine is an important mediator of the inflammatory response, and is involved in a variety of cellular activities, including cell proliferation, differentiation, and apoptosis. The induction of cyclooxygenase-2 (PTGS2/COX2) by this cytokine in the central nervous system (CNS) is found to contribute to inflammatory pain hypersensitivity. This gene and eight other interleukin 1 family genes form a cytokine gene cluster on chromosome 2.
14805	
14806	FABP1 encodes the fatty acid binding protein found in liver.  Fatty acid binding proteins are a family of small, highly conserved, cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. FABP1 and FABP6 (the ileal fatty acid binding protein) are also able to bind bile acids. It is thought that FABPs roles include fatty acid uptake, transport, and metabolism.
14807	Glycine N-methyltransferase (GNMT; EC 2.1.1.20) catalyzes the synthesis of N-methylglycine (sarcosine) from glycine using S-adenosylmethionine (AdoMet) as the methyl donor. The enzyme was first described by Blumenstein and Williams (1960) in guinea pig liver. GNMT acts as an enzyme to regulate the ratio of S-adenosylmethionine to S-adenosylhomocysteine (AdoHcy) and participates in the detoxification pathway in liver cells.[supplied by OMIM]
14808	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to class II of the histone deacetylase/acuc/apha family. It possesses histone deacetylase activity and represses transcription when tethered to a promoter. This protein does not bind DNA directly, but through transcription factors MEF2C and MEF2D. It seems to interact in a multiprotein complex with RbAp48 and HDAC3.
14809	There are at least four distinct but related alkaline phosphatases: intestinal, placental, placental-like, and liver/bone/kidney (tissue non-specific). The first three are located together on chromosome 2 while the tissue non-specific form is located on chromosome 1. The exact physiological function of the alkaline phosphatases is not known. The product of this gene is a membrane bound glycosylated enzyme, also referred to as the heat stable form, that is expressed primarily in the placenta although it is closely related to the intestinal form of the enzyme as well as to the placental-like form. The coding sequence for this form of alkaline phosphatase is unique in that the 3@#$%& untranslated region contains multiple copies of an Alu family repeat. In addition, this gene is polymorphic and three common alleles (type 1, type 2 and type 3) for this form of alkaline phosphatase have been well characterized.
14810	Apolipoprotein B (ApoB) is the main apolipoprotein of chylomicrons and low density lipoproteins (LDL). The protein occurs in the plasma in 2 main isoforms, apoB-48 and apoB-100. The first is synthesized exclusively by the gut, the second by the liver.  The intestinal (B-48) and hepatic (B-100) forms of apoB are coded by a single gene and by a single mRNA transcript larger than 16 kb. The 2 proteins share a common amino terminal sequence. In the ApoB-100 isoform the precursor has 4,563 amino acids, and the mature apoB-100 has 4,536 amino acid residues.  Mature, circulating B-48 is homologous over its entire length (estimated to be between 2,130 and 2,144 amino acid residues) with the amino-terminal portion of B-100 and contains no sequence from the carboxyl end of B-100. From structural studies, it is thought that apoB-48 represents the amino-terminal 47% of apoB-100 and that the carboxyl terminus of apoB-48 is in the vicinity of residue 2151 of apoB-100. Apolipoprotein B-48 may be the product of an intestinal mRNA with an in-frame UAA stop codon resulting from a C-to-U change in the codon CAA encoding Gln(2153) in apoB-100 mRNA.  Since only the sequence that codes B-100 is present in genomic DNA, this presents the possibility of an  organ-specific introduction of a stop codon to an mRNA and the change from CAA to UAA of codon 2153 of the message as a unique RNA editing process.
14811	
14812	The protein encoded by this gene belongs to the FKBP-type peptidyl-prolyl cis/trans isomerase (PPIase) family. Members of this family exhibit PPIase activity and function as molecular chaperones. A similar protein in mouse is located in the endoplasmic reticulum and binds calcium. Two alternatively spliced variants, which encode different isoforms, are reported.
14813	
14814	
14815	
14816	The HADHA (MIM 600890) and HADHB genes encode the alpha and beta subunits of the mitochondrial trifunctional protein, respectively. The heterocomplex contains 4 alpha and 4 beta subunits and catalyzes 3 steps in the beta-oxidation of fatty acids, including the long-chain 3-hydroxyl-CoA dehydrogenase step. The beta subunit harbors the 3-ketoacyl-CoA thiolase activity (Kamijo et al., 1994).[supplied by OMIM]
14817	
14818	Ubiquitin, a highly conserved protein that has a major role in targeting cellular proteins for degradation by the 26S proteosome, is synthesized as a precursor protein consisting of either polyubiquitin chains or a single ubiquitin fused to an unrelated protein. This gene encodes a fusion protein consisting of ubiquitin at the N terminus and ribosomal protein S27a at the C terminus. When expressed in yeast, the protein is post-translationally processed, generating free ubiquitin monomer and ribosomal protein S27a. Ribosomal protein S27a is a component of the 40S subunit of the ribosome and belongs to the S27AE family of ribosomal proteins. It contains C4-type zinc finger domains and is located in the cytoplasm. Pseudogenes derived from this gene are present in the genome. As with ribosomal protein S27a, ribosomal protein L40 is also synthesized as a fusion protein with ubiquitin; similarly, ribosomal protein S30 is synthesized as a fusion protein with the ubiquitin-like protein fubi.
14819	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order   structures. This gene is intronless and encodes a member of the histone H1 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6.
14820	The protein encoded by this gene is a member of the family of inhibitor of apoptosis proteins (IAP) and contains a single copy of a baculovirus IAP repeat (BIR) as well as a RING-type zinc finger domain. The BIR domain is essential for inhibitory activity and interacts with caspases, while the RING finger domain sometimes enhances antiapoptotic activity but does not inhibit apoptosis alone. Two transcript variants encoding different isoforms have been found for this gene. The two isoforms have different antiapoptotic properties, with isoform alpha protecting cells from apoptosis induced by staurosporine and isoform b protecting cells from apoptosis induced by etoposide.
14821	The protein encoded by this gene is a DNA-binding homeobox protein and a transcriptional repressor. The encoded protein appears to repress transcription by recruiting histone deacetylases to TGF beta-responsive genes. This gene is amplified and overexpressed in some ovarian cancers, and mutations in this gene can cause holoprosencephaly.
14822	
14823	
14824	
14825	
14826	
14827	Butyrophilin is the major protein associated with fat droplets in the milk. It is a member of the immunoglobulin superfamily. It may have a cell surface receptor function. The human butyrophilin gene is localized in the major histocompatibility complex (MHC) class I region of 6p and may have arisen relatively recently in evolution by the shuffling of exons between 2 ancestral gene families
14828	The protein encoded by this gene is a member of the RAB family of small GTPases. The GTP-bound form of the encoded protein has been shown to interact with early-endosomal antigen 1, and may be involved in the trafficking of and interaction between endosomal compartments.
14829	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a protein that inhibits the activation of the proteasome by the 11S and 19S regulators. Three alternative transcripts encoding two isoforms have been identified for this gene.
14830	Nop56p is a yeast nucleolar protein that is part of a complex with the nucleolar proteins Nop58p and fibrillarin. Nop56p is required for assembly of the 60S ribosomal subunit and is involved in pre-rRNA processing. The protein encoded by this gene is similar in sequence to Nop56p and is also found in the nucleolus. Multiple transcript variants encoding several different isoforms have been found for this gene, but the full-length nature of most of them have not been determined.
14831	
14832	
14833	The protein encoded by this gene belongs to the subtilisin-like proprotein convertase family. The members of this family are proprotein convertases that process latent precursor proteins into their biologically active products. This encoded protein is a proinsulin-processing enzyme that plays a key role in regulating insulin biosynthesis. It is also known to cleave proopiomelanocortin, proenkephalin, prodynorphin and proluteinizing-hormone-releasing hormone. The use of alternate polyadenylation sites has been found for this gene.
14834	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and the kininogens. The type 2 cystatin proteins are a class of cysteine proteinase inhibitors found in a variety of human fluids and secretions. The cystatin locus on chromosome 20 contains the majority of the type 2 cystatin genes and pseudogenes. This gene is located in the cystatin locus and encodes a protein similar to type 2 cystatins. The protein exhibits highly tissue-specific expression in the reproductive tract, suggesting implicit roles in reproduction. Alternative splicing identified in mouse is suggested in human based on EST evidence but the full-length nature of putative variants has not been determined.
14835	ENTPD6 is similar to E-type nucleotidases (NTPases). NTPases, such as CD39, mediate catabolism of extracellular nucleotides. ENTPD6 contains 4 apyrase-conserved regions which is characteristic of NTPases.
14836	This gene encodes a myosin light chain kinase, a calcium/calmodulin dependent enzyme, that is exclusively expressed in adult skeletal muscle.
14837	The protein encoded by this gene interacts with the amino-terminal domain of the neuron-specific X11-like protein (X11L), inhibits the association of X11L with amyloid precursor protein through a non-competitive mechanism, and abolishes the suppression of beta-amyloid production by X11L. This protein, together with X11L, may play an important role in the regulatory system of amyloid precursor protein metabolism and beta-amyloid generation. This gene consists of at least 13 exons and its alternative splicing generates at least 2 transcript variants.
14838	The protein encoded by this gene is a receptor for activated protein C, a serine protease activated by and involved in the blood coagulation pathway. The encoded protein is an N-glycosylated type I membrane protein that enhances the activation of protein C. Mutations in this gene have been associated with venous thromboembolism and myocardial infarction, as well as with late fetal loss during pregnancy.
14839	
14840	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes one of the larger subunits of TFIID that has been shown to potentiate transcriptional activation by retinoic acid, thyroid hormone and vitamin D3 receptors. In addition, this subunit interacts with the transcription factor CREB, which has a glutamine-rich activation domain, and binds to other proteins containing glutamine-rich regions. Aberrant binding to this subunit by proteins with expanded polyglutamine regions has been suggested as one of the pathogenetic mechanisms underlying a group of neurodegenerative disorders referred to as polyglutamine diseases.
14841	The protein encoded by this gene is a member of the BTG/Tob family. This family has structurally related proteins that appear to have antiproliferative properties. This encoded protein might play a role in neurogenesis in the central nervous system.
14842	The protein encoded by this gene belongs to the methyltransferase superfamily. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
14843	The protein encoded by this gene is a trifunctional polypeptide. It has phosphoribosylglycinamide formyltransferase, phosphoribosylglycinamide synthetase, phosphoribosylaminoimidazole synthetase activity which is required for de novo purine biosynthesis. This enzyme is highly conserved in vertebrates. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
14844	This gene encodes a basic nuclear protein of unknown function. The gene is widely expressed in adult and fetal tissues. Since the region proposed to contain the gene(s) for congenital heart disease (CHD) in Down syndrome (DS) patients has been restricted to 21q22.2-22.3, this gene, which maps to 21q22.3, has a potential role in the pathogenesis of Down syndrom congenital heart disease.
14845	
14846	
14847	
14848	
14849	This gene encodes a protein that belongs to the serine protease family. The encoded protein contains a serine protease domain, a transmembrane domain, a LDL receptor-like domain, and a scavenger receptor cysteine-rich domain. Serine proteases are known to be involved in a variety of biological processes, whose malfunction often leads to human diseases and disorders. This gene was identified by its association with both congenital and childhood onset autosomal recessive deafness. This gene is expressed in fetal cochlea and many other tissues, and is thought to be involved in the development and maintenance of the inner ear or the contents of the perilymph and endolymph. This gene was also identified as a tumor associated gene that is overexpressed in ovarian tumors. Four alternatively spliced variants have been described, two of which encode identical products.
14850	
14851	
14852	This gene encodes the sixth largest subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes, that is also shared by the other two DNA-directed RNA polymerases. In yeast, this polymerase subunit, in combination with at least two other subunits, forms a structure that stabilizes the transcribing polymerase on the DNA template.
14853	The protein encoded by this gene is an integral membrane protein of the mitochondrial outer membrane. The encoded protein interacts with TOMM20 and TOMM40, and forms a complex with several other proteins to import cytosolic preproteins into the mitochondrion.
14854	The protein encoded by this gene was identified as a regulator of the MAP kinase kinase kinase MAP3K7/TAK1, which is known to mediate various intracellular signaling pathways, such as those induced by TGF beta, interleukin 1, and WNT-1. This protein interacts and thus activates TAK1 kinase. It has been shown that the C-terminal portion of this protein is sufficient for binding and activation of TAK1, while a portion of the N-terminus acts as a dominant-negative inhibitor of TGF beta, suggesting that this protein may function as a mediator between TGF beta receptors and TAK1. This protein can also interact with and activate the mitogen-activated protein kinase 14 (MAPK14/p38alpha), and thus represents an alternative activation pathway, in addition to the MAPKK pathways, which contributes to the biological responses of MAPK14 to various stimuli. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
14855	
14856	Originally named because of its sequence similarity to the Saccharomyces cerevisiae NHP2 (non-histone protein 2), this protein appears to be a highly conserved nuclear protein that is a component of the [U4/U6.U5] tri-snRNP. It binds to the 5@#$%& stem-loop of U4 snRNA. Two transcript variants encoding the same protein have been found for this gene.
14857	
14858	
14859	
14860	
14861	
14862	The protein encoded by this gene is a nuclear hormone receptor for triiodothyronine. It is one of the several receptors for thyroid hormone, and has been shown to mediate the biological activities of thyroid hormone. Knockout studies in mice suggest that the different receptors, while having certain extent of redundancy, may mediate different functions of thyroid hormone. Defects in this gene are known to be a cause of generalized thyroid hormone resistance (GTHR), a syndrome characterized by goiter and high levels of circulating thyroid hormone (T3-T4), with normal or slightly elevated thyroid stimulating hormone (TSH).
14863	
14864	
14865	RBP2 is an abundant protein present in the small intestinal epithelium. It is thought to participate in the uptake and/or intracellular metabolism of vitamin A. Vitamin A is a fat-soluble vitamin necessary for growth, reproduction, differentiation of epithelial tissues, and vision. RBP2 may also modulate the supply of retinoic acid to the nuclei of endometrial cells during the menstrual cycle.
14866	Bilateral symmetric nervous systems have special midline structures that establish a partition between the two mirror image halves. Some axons project toward and across the midline in response to long-range chemoattractants emanating from the midline. In Drosophila, the roundabout gene, a member of the immunoglobulin gene superfamily, encodes an integral membrane protein that is both an axon guidance receptor and a cell adhesion receptor. This receptor is involved in the decision by axons to cross the central nervous system midline. The protein encoded by this gene is structurally similar to the Drosophila roundabout protein. Two transcript variants encoding different isoforms have been found for this gene.
14867	
14868	
14869	
14870	
14871	The protein encoded by this gene is member of the GAP1 family of GTPase-activating proteins. The gene product stimulates the GTPase activity of normal RAS p21 but not its oncogenic counterpart. Acting as a suppressor of RAS function, the protein enhances the weak intrinsic GTPase activity of RAS proteins resulting in the inactive GDP-bound form of RAS, thereby allowing control of cellular proliferation and differentiation. This particular family member has a perinuclear localization and is an inositol 1,3,4,5-tetrakisphosphate-binding protein; a compound suggested to function as a second messenger.
14872	Microsomal arylacetamide deacetylase competes against the activity of cytosolic arylamine N-acetyltransferase, which catalyzes one of the initial biotransformation pathways for arylamine and heterocyclic amine carcinogens
14873	
14874	
14875	
14876	IRAK2 encodes the interleukin-1 receptor-associated kinase 2, one of two putative serine/threonine kinases that become associated with the interleukin-1 receptor (IL1R) upon stimulation. IRAK2 is reported to participate in the IL1-induced upregulation of NF-kappaB.
14877	
14878	
14879	
14880	
14881	
14882	Nectins (e.g., PVRL1; MIM 600644) are immunoglobulin-like adhesion molecules that interact with afadin (AF6; MIM 159559). Afadin is an actin filament-binding protein that connects nectins to the actin cytoskeleton. The nectin-afadin system organizes adherens junctions cooperatively with the cadherin (see MIM 192090)-catenin (see MIM 116805) system in epithelial cells.[supplied by OMIM]
14883	Nectins (e.g., PVRL1; MIM 600644) are immunoglobulin-like adhesion molecules that interact with afadin (AF6; MIM 159559). Afadin is an actin filament-binding protein that connects nectins to the actin cytoskeleton. The nectin-afadin system organizes adherens junctions cooperatively with the cadherin (see MIM 192090)-catenin (see MIM 116805) system in epithelial cells.[supplied by OMIM]
14884	The protein encoded by this gene has similarity to mouse protein which is a subunit of the signal recognition particle receptor (SR). This subunit is a transmembrane GTPase belonging to the GTPase superfamily. It anchors alpha subunit, a peripheral membrane GTPase, to the ER membrane. SR is required for the cotranslational targeting of both secretory and membrane proteins to the ER membrane.
14885	
14886	
14887	
14888	
14889	
14890	
14891	
14892	BTLA expression is induced during activation of T cells, and BTLA remains expressed on Th1 cells but not Th2 cells. Like PD1 (MIM 600244) and CTLA4 (MIM 123890), BTLA interacts with a B7 homolog, B7H4.[supplied by OMIM]
14893	
14894	
14895	
14896	The protein encoded by this gene is a member of the purine/pyrimidine phosphoribosyltransferase family. This protein is a regulatory allosteric enzyme that catalyzes the first step of de novo purine nucleotide biosynthesis. This gene and PAICS/AIRC, a bifunctional enzyme catalyzing steps six and seven in the purine nucleotide biosynthesis pathway, are located in close proximity on chromosome 4.
14897	The protein encoded by this gene contains a RING finger, which is a motif known to be involved in protein-protein interactions. This protein is a membrane-bound ubiquitin ligase. It can regulate cell motility by targeting paxillin ubiquitination and altering the distribution and localization of paxillin in cytoplasm and cell focal adhesions.
14898	This gene is a member of a family of sulfate/anion transporter genes. Family members are well conserved in their genomic (number and size of exons) and protein (aa length among species) structures, but have markedly different tissue expression patterns. This gene is primarily expressed in the liver, pancreas, and brain. Three splice variants that encode different isoforms have been identified.
14899	
14900	
14901	
14902	This gene is a member of the stromal interaction molecule (STIM) family and likely arose, with related family member STIM1, from a common ancestral gene. It encodes a type 1 transmembrane protein whose function is currently unknown, but is a likely adhesion molecule with a role in early hematopoiesis. Alternative translation initiation from an AUG and a non-AUG (UUG) start site results in the production of two different isoforms.
14903	BMP3 belongs to the transforming growth factor-beta (TGFB) superfamily. Bone morphogenic protein induces bone formation.
14904	HLA-DQB1 belongs to the HLA class II beta chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DQA) and a beta chain (DQB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa and it contains 6 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DQ molecule both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to 4 different molecules. Typing for these polymorphisms is routinely done for bone marrow transplantation.
14905	
14906	
14907	This gene encodes a member of the SFRP family that contains a cysteine-rich domain homologous to the putative Wnt-binding site of Frizzled proteins. SFRPs act as soluble modulators of Wnt signaling. Methylation of this gene is a potential marker for the presence of colorectal cancer.
14908	
14909	
14910	
14911	This gene encodes class I alcohol dehydrogenase, gamma subunit, which is a member of the alcohol dehydrogenase family. Members of this enzyme family metabolize a wide variety of substrates, including ethanol, retinol, other aliphatic alcohols, hydroxysteroids, and lipid peroxidation products. Class I alcohol dehydrogenase, consisting of several homo- and heterodimers of alpha, beta, and gamma subunits, exhibits high activity for ethanol oxidation and plays a major role in ethanol catabolism. Three genes encoding alpha, beta and gamma subunits are tandemly organized in a genomic segment as a gene cluster.
14912	
14913	
14914	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S18P family. The encoded protein is one of three that has significant sequence similarity to bacterial S18 proteins. The primary sequences of the three human mitochondrial S18 proteins are no more closely related to each other than they are to the prokaryotic S18 proteins. Pseudogenes corresponding to this gene are found on chromosomes 8p, 12p, 15q, and 22q.
14915	
14916	
14917	
14918	The protein encoded by this gene belongs to a small group of evolutionarily conserved proteins with three transmembrane domains. It is a potential target for ubiquitination by the Nedd4 family of proteins. This protein is thought to be part of a family of integral Golgi membrane proteins.
14919	The protein encoded by this gene belongs to a small group of evolutionarily conserved proteins with three transmembrane domains. It is a potential target for ubiquitination by the Nedd4 family of proteins. This protein is thought to be part of a family of integral Golgi membrane proteins.
14920	The intronless gene for this transcription coactivator is located between the protocadherin beta and gamma gene clusters on chromosome 5. The protein encoded by this gene is a component of the TFIID protein complex, a complex which binds to the TATA box in class II promoters and recruits RNA polymerase II and other factors. This particular subunit interacts with the largest TFIID subunit, as well as multiple transcription activators. The protein is required for transcription by promoters targeted by RNA polymerase II.
14921	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
14922	
14923	
14924	This gene encodes a disintegrin and metalloprotease (ADAM) domain 19, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. This member is a type I transmembrane protein and serves as a marker for dendritic cell differentiation. It has also been demonstrated to be an active metalloproteinase, which may be involved in normal physiological and pathological processes such as cells migration, cell adhesion, cell-cell and cell-matrix interactions, and signal transduction. Alternative splicing generates 2 transcript variants for this gene.
14925	
14926	
14927	
14928	The protein encoded by this gene belongs to the methionine adenosyltransferase (MAT) family. MAT catalyzes the biosynthesis of S-adenosylmethionine from methionine and ATP. This protein is the regulatory beta subunit of MAT. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
14929	The protein encoded by this gene is a cofactor needed by pancreatic lipase for efficient dietary lipid hydrolysis. It binds to the C-terminal, non-catalytic domain of lipase, thereby stabilizing an active conformation and considerably increasing the overall hydrophobic binding site. The gene product allows lipase to anchor noncovalently to the surface of lipid micelles, counteracting the destabilizing influence of intestinal bile salts. This cofactor is only expressed in pancreatic acinar cells, suggesting regulation of expression by tissue-specific elements.
14930	
14931	
14932	
14933	
14934	This gene is the second identified human homolog of the Drosophila fat gene, which encodes a tumor suppressor essential for controlling cell proliferation during Drosophila development. The gene product is a member of the cadherin superfamily, a group of integral membrane proteins characterized by the presence of cadherin-type repeats. In addition to containing 34 tandem cadherin-type repeats, the gene product has two epidermal growth factor (EGF)-like repeats and one laminin G domain. This protein most likely functions as a cell adhesion molecule, controlling cell proliferation and playing an important role in cerebellum development.
14935	ISL1 encodes islet 1, a transcription factor containing two amino-terminal LIM domains and one carboxy-terminal homeodomain. ISL1 plays an important role in the embryogenesis of pancreatic islets of Langerhans. In addition, mouse embryos made deficient in ISL1 fail to undergo neural tube motor neuron differentiation.
14936	Betaine-homocysteine methyltransferase is a cytosolic enzyme that catalyzes the conversion of betaine and homocysteine to dimethylglycine and methionine, respectively. Defects in BHMT could lead to hyperhomocyst(e)inemia,but such a defect has not yet been observed.
14937	PDE6A encodes the cyclic-GMP (cGMP) specific phosphodiesterase 6A alphha subunit, expressed in cells of the retinal rod outer segment. The phosphodiesterase 6 holoenzyme is a heterotrimer composed of an alpha, beta, and two gamma subunits.cGMP is an important regulator of rod cell membrane current, and its dynamic concentration is established by phosphodiesterase 6A cGMP hydrolysis and gunylate cyclase cGMP synthesis. Mutations in PDE6A have been identified as one cause of autosomal recessive retinitis pigmentosa.
14938	
14939	
14940	
14941	
14942	This gene encodes a member of the WNT1 inducible signaling pathway (WISP) protein subfamily, which belongs to the connective tissue growth factor (CTGF) family. WNT1 is a member of a family of cysteine-rich, glycosylated signaling proteins that mediate diverse developmental processes. The CTGF family members are characterized by four conserved cysteine-rich domains: insulin-like growth factor-binding domain, von Willebrand factor type C module, thrombospondin domain and C-terminal cystine knot-like domain. This gene is overexpressed in colon tumors. It may be downstream in the WNT1 signaling pathway that is relevant to malignant transformation. Mutations of this gene are associated with progressive pseudorheumatoid dysplasia, an autosomal recessive skeletal disorder, indicating that the gene is essential for normal postnatal skeletal growth and cartilage homeostasis. Alternative splicing generates at least three transcript variants.
14943	
14944	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes polypeptide 2 (liver isoform) of subunit VIIa and the polypeptide 2 is present in both muscle and nonmuscle tissues. In addition to polypeptide 2, subunit VIIa includes polypeptide 1 (muscle isoform), which is present only in muscle tissues, and a related protein, present in all tissues. This gene may have several pseudogenes.
14945	
14946	
14947	This gene encodes a C2H2 zinc finger protein with transactivation and DNA-binding activity. This gene has been shown to exhibit antiproliferative activities and is a tumor suppressor gene candidate. Two transcript variants encoding different isoforms have been found for this gene.
14948	
14949	The protein encoded by this gene belongs to a small class of prenylated protein tyrosine phosphatases (PTPs), which contains a PTP domain and a characteristic C-terminal prenylation motif. PTPs are cell signaling molecules that play regulatory roles in a variety of cellular processes. This tyrosine phosphatase is a nuclear protein, but may primarily associate with plasma membrane. The surface membrane association of this protein depends on its C-terminal prenylation. Overexpression of this gene in mammalian cells conferred a transformed phenotype, which implicated its role in the tumorigenesis. Studies in rat suggested that this gene may be an immediate-early gene in mitogen-stimulated cells.
14950	Many polypeptide mitogens, such as basic fibroblast growth factor (FGFB; MIM 134920) and platelet-derived growth factors (MIM 173430, MIM 190040), are active on a wide range of different cell types. In contrast, vascular endothelial growth factor is a mitogen primarily for vascular endothelial cells. It is, however, structurally related to platelet-derived growth factor.[supplied by OMIM]
14951	
14952	
14953	
14954	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3.
14955	
14956	
14957	
14958	
14959	
14960	
14961	This gene encodes a cytosolic, NADP-dependent enzyme that generates NADPH for fatty acid biosynthesis. The activity of this enzyme, the reversible oxidative decarboxylation of malate, links the glycolytic and citric acid cycles. The regulation of expression for this gene is complex. Increased expression can result from elevated levels of thyroid hormones or by higher proportions of carbohydrates in the diet.
14962	
14963	
14964	
14965	N-Oct-3 is a protein belonging to a large family of transcription factors that bind to the octameric DNA sequence ATGCAAAT. Most of these proteins share a highly homologous region, referred to as the POU domain, which occurs in several mammalian transcription factors, including the octamer-binding proteins Oct1 (POU2F1; MIM 164175) and Oct2 (POU2F2; MIM 164176), and the pituitary protein Pit1 (PIT1; MIM 173110). Class III POU genes are expressed predominantly in the CNS. It is likely that CNS-specific transcription factors such as these play an important role in mammalian neurogenesis by regulating their diverse patterns of gene expression.[supplied by OMIM]
14966	
14967	
14968	
14969	
14970	This gene is expressed in postmitotic-differentiating neurons of the developmental nervous system and neuronal structures associated with plasticity in the adult. The expression of this gene can be induced by neural activity and neurotrophins. The encoded protein contains a consensus cleavage signal found in glycosylphoshatidylinositol (GPI)-anchored proteins. In vitro assays demonstrated that this protein promotes neurite outgrowth and arborization, suggesting its role in promoting neuritogenesis.
14971	NADPH oxidases, such as NOX3, are plasma membrane-associated enzymes found in many cell types. They catalyze the production of superoxide by a 1-electron reduction of oxygen, using NADPH as the electron donor.[supplied by OMIM]
14972	
14973	
14974	
14975	This intronless gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins, such as syndecan binding protein (syntenin). The protein may function in the apoptosis pathway leading to cell death as well as to tumorigenesis and may mediate downstream effects of parathyroid hormone (PTH) and PTH-related protein (PTHrP) in bone development. The solution structure has been resolved for the HMG-box of a similar mouse protein.
14976	
14977	
14978	
14979	Heterotrimeric guanine nucleotide-binding proteins (G proteins), which integrate signals between receptors and effector proteins, are composed of an alpha, a beta, and a gamma subunit. These subunits are encoded by families of related genes. This gene encodes a beta subunit. Beta subunits are important regulators of alpha subunits, as well as of certain signal transduction receptors and effectors. This gene contains a trinucleotide (CCG) repeat length polymorphism in its 5@#$%& UTR.
14980	
14981	
14982	The protein encoded by this gene belongs to the membrane-associated guanylate kinase (MAGUK) family, a class of proteins that functions as molecular scaffolds for the assembly of multiprotein complexes at specialized regions of the plasma membrane. This protein is also a member of the CARD protein family, which is defined by carrying a characteristic caspase-associated recruitment domain (CARD). This protein has a domain structure similar to that of CARD14 protein. The CARD domains of both proteins have been shown to specifically interact with BCL10, a protein known to function as a positive regulator of cell apoptosis and NF-kappaB activation. When expressed in cells, this protein activated NF-kappaB and induced the phosphorylation of BCL10.
14983	
14984	This gene is downregulated by parathyroid hormone in osteoblastic cells, and therefore, is thought to be involved in parathyroid hormone action in bones. The exact function of this gene has not yet been determined. Alternatively spliced transcript variants encoding different isoforms have been identified.
14985	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. Methylation of this gene may result in the loss of its expression and, since the encoded protein upregulates the tumor suppressor p53, this protein may play an important role in tumorigenesis.
14986	This gene encodes a member of the alpha-2/delta subunit family, a protein in the voltage-dependent calcium channel complex. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization and consist of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. Research on a highly similar protein in rabbit suggests the protein described in this record is cleaved into alpha-2 and delta subunits. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
14987	
14988	
14989	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. This protein is similar to xenopus early mitotic inhibitor-1 (Emi1), which is a mitotic regulator that interacts with Cdc20 and inhibits the anaphase promoting complex.
14990	
14991	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
14992	Hearing impairment is a heterogeneous condition with over 40 loci described. The protein encoded by this gene is expressed in fetal cochlea, however, its function is not known. Nonsyndromic hearing impairment is associated with a mutation in this gene.
14993	
14994	
14995	
14996	
14997	
14998	Members of the RAET1 family, such as RAET1E, are major histocompatibility complex (MHC) class I-related genes located within a 180-kb cluster on chromosome 6q24.2-q25.3. RAET1 proteins contain MHC class I-like alpha-1 and alpha-2 domains. RAET1E and RAET1G (MIM 609244) differ from the other RAET1 proteins (e.g., RAET1I, or ULBP1; MIM 605697) in that they have type I membrane-spanning sequences at their C termini rather than glycosylphosphatidylinositol anchor sequences. (Radosavljevic et al., 2002).[supplied by OMIM]
14999	
15000	
15001	
15002	
15003	Chromosomal region 7p12, which contains GBAS, is amplified in approximately 40% of glioblastomas, the most common and malignant form of central nervous system tumor.The predicted 286-amino acid protein contains a signal peptide, a transmembrane domain, and 2 tyrosine phosphorylation sites. The GBAS transcript is expressed most abundantly in heart and skeletal muscle. GBAS protein might be involved in vesicular transport.
15004	The poly(A)-binding protein (PABP), which is found complexed to the 3-prime poly(A) tail of eukaryotic mRNA, is required for poly(A) shortening and translation initiation. In humans, the PABPs comprise a small nuclear isoform and a conserved gene family that displays at least 3 functional proteins: PABP1 (PABPC1), inducible PABP (iPABP, or PABPC4; MIM 603407), and PABP3 (PABPC3; MIM 604680). In addition, there are at least 4 pseudogenes, PABPCP1 to PABPCP4.[supplied by OMIM]
15005	The protein encoded by this gene belongs to the DEAD-like helicase superfamily. It shares similarity with Saccharomyces cerevisiae RAD54 and RDH54, both of which are involved in homologous recombination and repair of DNA. This protein binds to double-stranded DNA, and displays ATPase activity in the presence of DNA. This gene is highly expressed in testis and spleen, which suggests active roles in meiotic and mitotic recombination. Homozygous mutations of this gene were observed in primary lymphoma and colon cancer. Two alternatively spliced transcript variants encoding different isoforms have been reported for this gene. Also, transcript variants resulting from use of alternative polyA site exist.
15006	
15007	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins.
15008	IFNGR1 encodes the ligand-binding chain (alpha) of the heterodimeric gamma interferon receptor. IFNGR2 encodes the non-ligand-binding partner of the heterodimeric receptor.
15009	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class and contains an F-box domain. This protein is highly expressed during muscle atrophy, whereas mice deficient in this gene were found to be resistant to atrophy. This protein is thus a potential drug target for the treatment of muscle atrophy. Alternative splicing of this gene results in two transcript variants encoding two isoforms of different sizes.
15010	
15011	Lysophosphatidic acid acyltransferases (EC 2.3.1.51) catalyze the conversion of lysophosphatidic acid (LPA) to phosphatidic acid (PA). LPA and PA are involved in signal transduction and lipid biosynthesis.[supplied by OMIM]
15012	
15013	This gene encodes tissue-type plasminogen activator, a secreted serine protease which converts the proenzyme plasminogen to plasmin, a fibrinolytic enzyme. Tissue-type plasminogen activator is synthesized as a single chain which is cleaved by plasmin to a two chain disulfide linked protein. This enzyme plays a role in cell migration and tissue remodeling. Increased enzymatic activity causes hyperfibrinolysis, which manifests as excessive bleeding; decreased activity leads to hypofibrinolysis which can result in thrombosis or embolism. Alternative splicing of this gene produces three transcripts.
15014	Defensins are a family of microbicidal and cytotoxic peptides thought to be involved in host defense. They are abundant in the granules of neutrophils and also found in the epithelia of mucosal surfaces such as those of the intestine, respiratory tract, urinary tract, and vagina. Members of the defensin family are highly similar in protein sequence and distinguished by a conserved cysteine motif. Several alpha defensin genes appear to be clustered on chromosome 8. The protein encoded by this gene, defensin, alpha 6, is highly expressed in the secretory granules of Paneth cells of the small intestine, and likely plays a role in host defense of human bowel.
15015	
15016	
15017	ADP-ribosylation factors (ARFs) play an important role in intracellular vesicular trafficking. The protein encoded by this gene is involved in the activation of ARFs by accelerating replacement of bound GDP with GTP. It contains a Sec7 domain, which may be responsible for the guanine-nucleotide exchange activity and also the brefeldin A inhibition.
15018	
15019	FOXH1 encodes a human homolog of Xenopus forkhead activin signal transducer-1. FOXH1 protein binds SMAD2 and activates an activin response element via binding the DNA motif TGT(G/T)(T/G)ATT.
15020	
15021	
15022	This gene encodes a protein which induces ectopic bone formation in conjunction with transforming growth factor beta. This protein is a small proteoglycan which contains tandem leucine-rich repeats (LRR). The gene expresses three transcript variants which differ only in the 5@#$%& UTR.
15023	
15024	The protein encoded by this gene is a cysteine-rich, extracellular protein with protease inhibitor-like domains whose expression is suppressed strongly in many tumors and cells transformed by various kinds of oncogenes. In normal cells, this membrane-anchored glycoprotein may serve as a negative regulator for matrix metalloproteinase-9, a key enzyme involved in tumor invasion and metastasis.
15025	Valosin-containing protein (VCP) is a member of a family that includes putative ATP-binding proteins involved in vesicle transport and fusion, 26S proteasome function, and assembly of peroxisomes. VCP, as a structural protein, is associated with clathrin, and heat-shock protein Hsc70, to form a complex. VCP has been implicated in a number of cellular events that are regulated during mitosis, including homotypic membrane fusion, spindle pole body function, and ubiquitin-dependent protein degradation.
15026	
15027	
15028	This protein belongs to the aldehyde dehydrogenases family of proteins. Aldehyde dehydrogenase is the second enzyme of the major oxidative pathway of alcohol metabolism. Two major liver isoforms of this enzyme, cytosolic and mitochondrial, can be distinguished by their electrophoretic mobilities, kinetic properties, and subcellular localizations. Most Caucasians have two major isozymes, while approximately 50% of Orientals have only the cytosolic isozyme, missing the mitochondrial isozyme. A remarkably higher frequency of acute alcohol intoxication among Orientals than among Caucasians could be related to the absence of the mitochondrial isozyme. This gene encodes a cytosolic isoform, which has a high affinity for aldehydes.
15029	This gene encodes a member of a superfamily of divalent cation-dependent phosphodiesterases. The encoded protein associates with CD40, tumor necrosis factor (TNF) receptor-75 and TNF receptor associated factors (TRAFs), and inhibits nuclear factor-kappa-B activation. This protein has sequence and structural similarities with APE1 endonuclease, which is involved in both DNA repair and the activation of transcription factors.
15030	Neuronal CDC2-like kinase, which is involved in the regulation of neuronal differentiation, is composed of a catalytic subunit, CDK5, and an activating subunit, p25NCK5A. The protein encoded by this gene binds to p25NCK5A and therefore may be involved in neuronal differentiation. The encoded protein may also be a substrate of neuronal CDC2-like kinase. Multiple transcript variants exist for this gene, but the full-length nature of only two has been determined.
15031	Growth arrest-specific 1 plays a role in growth suppression.  GAS1 blocks entry to S phase and prevents cycling of normal and transformed cells.  Gas1 is a putative tumor suppressor gene.
15032	
15033	
15034	
15035	
15036	
15037	
15038	The protein encoded by this gene is a nuclear encoded endonuclease that is localized in the mitochondrion. The encoded protein is widely distributed among animals and cleaves DNA at GC tracts. This protein is capable of generating the RNA primers required by DNA polymerase gamma to initiate replication of mitochondrial DNA.
15039	Interleukin 11 is a stromal cell-derived cytokine that belongs to a family of pleiotropic and redundant cytokines that use the gp130 transducing subunit in their high affinity receptors. This gene encodes the IL-11 receptor, which is a member of the hematopoietic cytokine receptor family. This particular receptor is very similar to ciliary neurotrophic factor, since both contain an extracellular region with a 2-domain structure composed of an immunoglobulin-like domain and a cytokine receptor-like domain. Alternative splicing has been observed at this locus and two variants, each encoding a distinct isoform, have been identified.
15040	
15041	
15042	The DNA-associated protein encoded by this gene is a member of the paired family of homeobox proteins. Expression is localized to proliferating fetal fibroblasts and the developing dermal layer, with downregulated expression in adult skin. Increases in expression of this gene during fetal but not adult wound healing suggest a possible role in mechanisms that control mammalian dermal regeneration and prevent formation of scar response to wounding. The expression patterns provide evidence consistent with a role in fetal skin development and a possible role in cellular proliferation.
15043	
15044	
15045	
15046	In rat brain, Cip98 interacts with a calmodulin-dependent serine kinase, CASK (MIM 300172), and may be involved in the formation of scaffolding protein complexes that facilitate synaptic transmission in the central nervous system (CNS) (Yap et al., 2003 [PubMed 12641734]). Mutations in this gene, also known as WHRN, cause DFNB31 (MIM 607084).[supplied by OMIM]
15047	
15048	
15049	
15050	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein that belongs to the L18P ribosomal protein family. Three polymorphic sites exist in this gene, one of which is three nt in length which causes an extra aa near the N-terminus.
15051	
15052	The protein encoded by this gene is one of the three catalytic subunits of protein phosphatase 1 (PP1). PP1 is a serine/threonine specific protein phosphatase known to be involved in the regulation of a variety of cellular processes, such as cell division, glycogen metabolism, muscle contractility, protein synthesis, and HIV-1 viral transcription. Mouse studies suggest that PP1 functions as a suppressor of learning and memory. At least three alternatively spliced transcript variants encoding two distinct isoforms have been observed.
15053	Xaa-Pro dipeptidase is a cytosolic dipeptidase that hydrolyzes dipeptides with proline or hydroxyproline at the carboxy terminus (but not Pro-Pro). It is important in collagen metabolism because of the high levels of iminoacids.
15054	
15055	The protein encoded by this gene is a member of the SWI/SNF family of proteins, whose members display helicase and ATPase activities and which are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI and contains a predicted leucine zipper motif typical of many transcription factors.
15056	This gene encodes one of the selenoproteins containing selenocysteine at the active site. The selenocysteine is encoded by the nonsense (stop) codon TGA. Studies in mouse suggest that this selenoprotein may have redox function. This gene is localized on chromosome 1p31, a genetic locus commonly mutated or deleted in human cancers. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
15057	
15058	This gene encodes a member of the IGF-II mRNA-binding protein (IMP) family. The protein encoded by this gene contains four K homology domains and two RNA recognition motifs. It functions by binding to the 5@#$%& UTR of the insulin-like growth factor 2 (IGF2) mRNA and regulating IGF2 translation.
15059	This gene encodes a member of an adapter protein family that binds to several tyrosine-phosphorylated proteins. The product of this gene has several SH2 and SH3 domains (src-homology domains) and is involved in several signaling pathways, recruiting cytoplasmic proteins in the vicinity of tyrosine kinase through SH2-phosphotyrosine interaction. The N-terminal SH2 domain of this protein functions as a positive regulator of transformation whereas the C-terminal SH3 domain functions as a negative regulator of transformation. Two alternative transcripts encoding different isoforms with distinct biological activity have been described.
15060	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily H. This member is a pore-forming (alpha) subunit. Several alternatively spliced transcript variants have been identified from this gene, but the full-length nature of only two transcript variants has been determined.
15061	The aryl hydrocarbon receptor nuclear translocator (ARNT; MIM 126110) is a member of a novel transcription factor family consisting of a conserved basic helix-loop-helix (bHLH) structural motif contiguous with a PAS domain. Members of this family include PER (see MIM 602260), the aryl hydrocarbon receptor (AHR; MIM 600253), SIM1 (MIM 603128), and HIF1A (MIM 603348).[supplied by OMIM]
15062	The human AGXT protein product is normally localized in the peroxisomes of liver where it is involved in glyoxylate detoxification. Defects in the AGXT gene, some of which alter subcellular targetting, are the cause of Oxalosis I.
15063	
15064	This gene encodes one of the numerous actin-binding proteins which regulate the actin cytoskeleton. This protein binds F-actin, and also interacts with 5-lipoxygenase, which is the first committed enzyme in leukotriene biosynthesis. Although this gene has been reported to map to chromosome 17 in the Smith-Magenis syndrome region, the best alignments for this gene are to chromosome 16. The Smith-Magenis syndrome region is the site of two related pseudogenes.
15065	
15066	
15067	Aminopeptidase P is a hydrolase specific for N-terminal imido bonds, which are common to several collagen degradation products, neuropeptides, vasoactive peptides, and cytokines. Structurally, the enzyme is a member of the @#$%&pita bread fold@#$%& family and occurs in mammalian tissues in both soluble and GPI-anchored membrane-bound forms.  A membrane-bound and soluble form of this enzyme have been identified as products of two separate genes.
15068	This protein is the human homolog of mouse morc and like the mouse protein it is testis-specific. Mouse studies support a testis-specific function since only male knockout mice are infertile; infertility is the only apparent defect. These studies further support a role for this protein early in spermatogenesis, possibly by affecting entry into apoptosis since testis from knockout mice show greatly increased numbers of apoptotic cells.
15069	
15070	This gene is a member of the inhibitor of growth (ING) family. Members of the ING family associate with and modulate the activity of histone acetyltransferase (HAT) and histone deacetylase (HDAC) complexes and function in DNA repair and apoptosis.
15071	
15072	Pleckstrin homology, Sec7 and coiled/coil domains 1 (PSCD1) is a member of the PSCD family. Members of this family have identical structural organization that consists of an N-terminal coiled-coil motif, a central Sec7 domain, and a C-terminal pleckstrin homology (PH) domain. The coiled-coil motif is involved in homodimerization, the Sec7 domain contains guanine-nucleotide exchange protein (GEP) activity, and the PH domain interacts with phospholipids and is responsible for association of PSCDs with membranes. Members of this family appear to mediate the regulation of protein sorting and membrane trafficking. The PSCD1 is highly expressed in natural killer and peripheral T cells, and regulates the adhesiveness of integrins at the plasma membrane of lymphocytes. PSCD1 protein is 83% homologous to PSCD2.
15073	The specific function of this gene has yet to be determined in humans; however,  in rodents, it is necessary for survival of the forebrain mesenchyme and may  also be involved in development of the cervix.  Mutations in the mouse gene lead  to neural tube defects such as acrania and meroanencephaly.
15074	
15075	This gene encodes an alpha chain for one of the low abundance fibrillar collagens. Fibrillar collagen molecules are trimers that can be composed of one or more types of alpha chains. Type V collagen is found in tissues containing type I collagen and appears to regulate the assembly of heterotypic fibers composed of both type I and type V collagen. This gene product is closely related to type XI collagen and it is possible that the collagen chains of types V and XI constitute a single collagen type with tissue-specific chain combinations. Mutations in this gene are associated with Ehlers-Danlos syndrome, types I and II.
15076	
15077	The protein encoded by this gene was initially named CA-related protein because of sequence similarity to other known carbonic anhydrase genes. However, the gene product lacks carbonic anhydrase activity (i.e., the reversible hydration of carbon dioxide). The gene product continues to carry a carbonic anhydrase designation based on clear sequence identity to other members of the carbonic anhydrase gene family. The absence of CA8 gene transcription in the cerebellum of the lurcher mutant in mice with a neurologic defect suggests an important role for this acatalytic form.
15078	This gene encodes monoamine oxidase A, an enzyme that degrades amine neurotransmitters, such as dopamine, norepinephrine, and serotonin. The protein localizes to the mitochondrial outer membrane. The gene is adjacent to a related gene on the opposite strand of chromosome X. Mutation in this gene results in monoamine oxidase deficiency, or Brunner syndrome.
15079	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains at least 6 highly degenerated leucine-rich repeats.
15080	
15081	
15082	
15083	
15084	
15085	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
15086	L-type calcium channels are composed of five subunits. The protein encoded by this gene represents one of these subunits, gamma, and is one of several gamma subunit proteins. It is an integral membrane protein that is thought to stabilize the calcium channel in an inactive (closed) state. This protein is similar to the mouse stargazin protein, mutations in which have been associated with absence seizures, also known as petit-mal or spike-wave seizures. This gene is a member of the neuronal calcium channel gamma subunit gene subfamily of the PMP-22/EMP/MP20 family.
15087	
15088	
15089	This gene encodes a member of the IgLON (LAMP, OBCAM, Ntm) family of immunoglobulin (Ig) domain-containing glycosylphosphatidylinositol (GPI)-anchored cell adhesion molecules. The encoded protein may promote neurite outgrowth and adhesion via a homophilic mechanism. An alternative splice variant has been described but its full length sequence has not been determined. This gene is closely linked to a related family member, opioid binding protein/cell adhesion molecule-like (OPCML), on chromosome 11.
15090	
15091	
15092	Glial cell line-derived neurotrophic factor (GDNF) and neurturin (NTN) are two structurally related, potent neurotrophic factors that play key roles in the control of neuron survival and differentiation. The protein encoded by this gene is a member of the GDNF receptor family. It is a glycosylphosphatidylinositol(GPI)-linked cell surface receptor for both GDNF and NTN, and mediates activation of the RET tyrosine kinase receptor. This encoded protein acts preferentially as a receptor for NTN compared to its other family member, GDNF family receptor alpha 1. This gene is a candidate gene for RET-associated diseases.
15093	The protein encoded by this gene is a member of the interleukin 1 receptor family and is similar to the interleukin 1 accessory proteins. It is most closely related to interleukin 1 receptor accessory protein-like 2 (IL1RAPL2). This gene and IL1RAPL2 are located at a region on chromosome X that is associated with X-linked non-syndromic mental retardation. Deletions and mutations in this gene were found in patients with mental retardation. This gene is expressed at a high level in post-natal brain structures involved in the hippocampal memory system, which suggests a specialized role in the physiological processes underlying memory and learning abilities.
15094	This gene encodes a type II classical cadherin from the cadherin superfamily of integral membrane proteins that mediate calcium-dependent cell-cell adhesion. Mature cadherin proteins are composed of a large N-terminal extracellular domain, a single membrane-spanning domain, and a small, highly conserved C-terminal cytoplasmic domain. Type II (atypical) cadherins are defined based on their lack of a HAV cell adhesion recognition sequence specific to type I cadherins. This particular cadherin is expressed specifically in the central nervous system and is putatively involved in synaptic adhesion, axon outgrowth and guidance.
15095	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
15096	
15097	
15098	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
15099	
15100	
15101	This gene encodes a membrane-bound member of the mammalian triglyceride lipase family. It catalyzes the production of 2-acyl lysophosphatidic acid (LPA), which is a lipid mediator with diverse biological properties that include platelet aggregation, smooth muscle contraction, and stimulation of cell proliferation and motility.
15102	This gene encodes a member of the homeodomain (HD) family. HD family proteins are helix-turn-helix transcription factors that play key roles in the specification of cell fates. This protein may function during brain development.
15103	
15104	
15105	
15106	
15107	The protein encoded by this gene is one of the eight subunits of COP9 signalosome, a highly conserved protein complex that functions as an important regulator in multiple signaling pathways. The structure and function of COP9 signalosome is similar to that of the 19S regulatory particle of 26S proteasome. COP9 signalosome has been shown to interact with SCF-type E3 ubiquitin ligases and act as a positive regulator of E3 ubiquitin ligases. This protein belongs to translation initiation factor 3 (eIF3) superfamily. It is involved in the regulation of cell cycle and likely to be a cellular cofactor for HIV-1 accessory gene product Vpr.
15108	
15109	
15110	
15111	The product of this gene belongs to the family of highly homologous synovial sarcoma X (SSX) breakpoint proteins. These proteins may function as transcriptional repressors. They are also capable of eliciting spontaneously humoral and cellular immune responses in cancer patients, and are potentially useful targets in cancer vaccine-based immunotherapy. SSX1, SSX2 and SSX4 genes have been involved in the t(X;18) translocation characteristically found in all synovial sarcomas. This gene appears not to be involved in this type of chromosome translocation. Two transcript variants encoding distinct isoforms have been identified for this gene.
15112	This gene encodes a member of the neurexin family which functions in the vertebrate nervous system as cell adhesion molecules and receptors. This protein, like other neurexin proteins, contains epidermal growth factor repeats and laminin G domains. In addition, it includes an F5/8 type C domain, discoidin/neuropilin- and fibrinogen-like domains, thrombospondin N-terminal-like domains and a putative PDZ binding site. This protein is localized at the juxtaparanodes of myelinated axons and associated with potassium channels. It may play a role in the local differentiation of the axon into distinct functional subdomains. This gene encompasses almost 1.5% of chromosome 7 and is one of the largest genes in the human genome. It may represent a positional candidate gene for the DFNB13 form of nonsyndromic deafness.
15113	Microtubules of the eukaryotic cytoskeleton perform essential and diverse functions and are composed of a heterodimer of alpha and beta tubulin. The genes encoding these microtubule constituents are part of the tubulin superfamily, which is composed of six distinct families. Genes from the alpha, beta and gamma tubulin families are found in all eukaryotes. The alpha and beta tubulins represent the major components of microtubules, while gamma tubulin plays a critical role in the nucleation of microtubule assembly. There are multiple alpha and beta tubulin genes and they are highly conserved among and between species. This gene encodes an alpha tubulin that is a highly conserved homolog of a rat testis-specific alpha tubulin.
15114	
15115	FSCN2, or retinal fascin, is a photoreceptor-specific paralog of the actin-bundling protein fascin (FSCN1; MIM 602689). Fascins crosslink F-actin into highly ordered bundles within dynamic cell extensions, such as neuronal growth cone filopodia.[supplied by OMIM]
15116	
15117	
15118	This gene complements a temperature-sensitive mutant isolated from the BHK-21 Syrian hamster cell line. It leads to a block in progression through the G1 phase of the cell cycle at nonpermissive temperatures.
15119	
15120	The WNK1 gene encodes a cytoplasmic serine-threonine kinase expressed in distal nephron.[supplied by OMIM]
15121	
15122	The POT1 gene encodes a protein conserved across widely diverged eukaryotes that binds the G-rich strand of its own telomeric repeat sequence, consistent with a direct role in protecting chromosome ends.[supplied by OMIM]
15123	
15124	
15125	Iduronate-2-sulfatase is required for the lysosomal degradation of heparan sulfate and dermatan sulfate. Mutations in this X-chromosome gene that result in enzymatic deficiency lead to the sex-linked Mucopolysaccharidosis Type II, also known as Hunter Syndrome. Iduronate-2-sulfatase has a strong sequence homology with human arylsulfatases A, B, and C, and human glucosamine-6-sulfatase. A splice variant of this gene has been described.
15126	The ZNF9 protein contains 7 zinc finger domains and is believed to function as an RNA-binding protein. A CCTG expansion in intron 1 of the ZNF9 gene results in myotonic dystrophy type 2 (MIM 602668).[supplied by OMIM]
15127	
15128	The protein encoded by this gene is a critical subunit of N-methyl-D-aspartate receptors, members of the glutamate receptor channel superfamily which are heteromeric protein complexes with multiple subunits arranged to form a ligand-gated ion channel. These subunits play a key role in the plasticity of synapses, which is believed to underlie memory and learning. The gene consists of 21 exons and is alternatively spliced, producing transcript variants differing in the C-terminus. Although the sequence of exon 5 is identical in human and rat, the alternative exon 5 splicing in rat has yet to be demonstrated in human. Cell-specific factors are thought to control expression of different isoforms, possibly contributing to the functional diversity of the subunits.
15129	The protein encoded by this gene is a lymphocyte-specific member of the tumor necrosis factor (TNF) receptor superfamily. It interacts with calcium-modulator and cyclophilin ligand (CAML). The protein induces activation of the transcription factors NFAT, AP1, and NF-kappa-B and plays a crucial role in humoral immunity by interacting with a TNF ligand. This gene is located within the Smith-Magenis syndrome region on chromosome 17.
15130	
15131	
15132	
15133	
15134	
15135	
15136	
15137	This gene encodes a protein that is highly similar to the CDC10 protein of Saccharomyces cerevisiae. The protein also shares similarity with Diff 6 of Drosophila and with H5 of mouse. Each of these similar proteins, including the yeast CDC10, contains a GTP-binding motif. The yeast CDC10 protein is a structural component of the 10 nm filament which lies inside the cytoplasmic membrane and is essential for cytokinesis. Although the exact function of this gene has not yet been determined, its high similarity to yeast CDC10 and the high conservative nature of eukaryotic cell cycle machinery suggest a similar role to that of its yeast counterpart. Alternative splicing results in two transcript variants encoding different isoforms.
15138	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein that belongs to the L1 ribosomal protein family.
15139	This gene encodes the puromycin-sensitive aminopeptidase, a zinc metallopeptidase which hydrolyzes amino acids from the N-terminus of its substrate. The protein has been localized to both the cytoplasm and to cellular membranes. This enzyme degrades enkaphalins in the brain, and studies in mouse suggest that it is involved in proteolytic events regulating the cell cycle.
15140	Voltage-dependent Ca(2+) channels are multisubunit complexes found in the membrane of many excitable cells that regulate calcium entry (see MIM 601011). N-type calcium channels, which control neurotransmitter release from neurons, are dihydropyridine-insensitive and omega-conotoxin-sensitive. The alpha-1 subunit forms the pore through which calcium enters the cell, and is encoded by a family of at least 5 genes.[supplied by OMIM]
15141	
15142	
15143	This gene product is a likely zinc finger family transcription factor. It contains KRAB-A and KRAB-B domains that act as transcriptional repressors in related proteins, and multiple zinc finger DNA binding motifs and finger linking regions characteristic of the Kruppel family. This gene is part of a gene cluster on chromosome Xp11.23.
15144	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to class I of the histone deacetylase/acuc/apha family. It has histone deacetylase activity and represses transcription when tethered to a promoter.
15145	
15146	
15147	
15148	This gene, expressed in the pituitary, encodes a receptor for growth hormone-releasing hormone. Binding of this hormone to the receptor leads to synthesis and release of growth hormone. Mutations in this gene have been associated with isolated growth hormone deficiency (IGHD), also known as Dwarfism of Sindh, a disorder characterized by short stature. Many alternate transcriptional splice variants encoding different isoforms have been described, but only two have been characterized to date.
15149	
15150	
15151	This gene may play a role in the development and normal function of the ovaries. Mutations of this gene have been linked to premature ovarian failure. Alternative splicing results in two protein isoforms. Furthermore, each splice variant undergoes additional splicing in the 3@#$%& UTR. Overall 4 splice variants have been described.
15152	The protein encoded by this gene belongs to the flavin monoamine oxidase family. It is a enzyme located in the mitochondrial outer membrane. It catalyzes the oxidative deamination of biogenic and xenobiotic amines and plays an important role in the metabolism of neuroactive and vasoactive amines in the central nervous sysytem and peripheral tissues. This protein preferentially degrades benzylamine and phenylethylamine.
15153	The protein encoded by this gene belongs to the flavin monoamine oxidase family. It is a enzyme located in the mitochondrial outer membrane. It catalyzes the oxidative deamination of biogenic and xenobiotic amines and plays an important role in the metabolism of neuroactive and vasoactive amines in the central nervous sysytem and peripheral tissues. This protein preferentially degrades benzylamine and phenylethylamine.
15154	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein belongs to a family of cytoplasmic proteins that activate the Ras-like family of Rho proteins by exchanging bound GDP for GTP. It may form a complex with G proteins and stimulate Rho-dependent signals. This protein is activated by PI3-kinase. Mutations in this gene can cause X-chromosomal non-specific mental retardation.
15155	IRS4 encodes the insulin receptor substrate 4, a cytoplasmic protein that contains many potential tyrosine and serine/threonine phosphorylation sites. Tyrosine-phosphorylated IRS4 protein has been shown to associate with cytoplasmic signalling molecules that contain SH2 domains. The IRS4 protein is phosphorylated by the insulin receptor tyrosine kinase upon receptor stimulation..
15156	This gene encodes U2AF small subunit related protein 2 which is an essential splicing factor. This protein is associated with the U2AF heterodimer composed of a large and a small subunit. It specifically interacts with the large subunit through an U2AF small subunit homologous region, and with SR proteins through its RS domain. This protein and U2AF small subunit are not functionally redundant.
15157	
15158	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members. This gene encodes a protein whose function has not been determined.
15159	
15160	This gene encodes a member of the deoxyribonuclease family and the protein shows high sequence similarity to lysosomal DNase I. Alternate transcriptional splice variants, encoding the same protein, have been characterized.
15161	Emerin is a serine-rich nuclear membrane protein and a member of the nuclear lamina-associated protein family. It mediates membrane anchorage to the cytoskeleton. Dreifuss-Emery muscular dystrophy is an X-linked inherited degenerative myopathy resulting from mutation in the emerin gene.
15162	This gene encodes a ubiquitously expressed protein of unknown function.
15163	
15164	This gene encodes a transmembrane proteolipid protein that is the predominant myelin protein present in the central nervous system. The encoded protein functions in myelination. This protein may play a role in the compaction, stabilization, and maintenance of myelin sheaths, as well as in oligodendrocyte development and axonal survival. Mutations associated with this gene cause X-linked Pelizaeus-Merzbacher disease and spastic paraplegia type 2. Two transcript variants encoding distinct isoforms have been identified for this gene.
15165	The protein encoded by this gene is closely related to Il13RA1, a subuint of the interleukin 13 receptor complex. This protein binds IL13 with high affinity, but lacks cytoplasmic domain, and does not appear to function as a signal mediator. It is reported to play a role in the internalization of IL13.
15166	
15167	
15168	
15169	The protein encoded by this gene is a member of the Toll-like receptor (TLR) family which plays a fundamental role in pathogen recognition and activation of innate immunity. TLRs are highly conserved from Drosophila to humans and share structural and functional similarities. They recognize pathogen-associated molecular patterns (PAMPs) that are expressed on infectious agents, and mediate the production of cytokines necessary for the development of effective immunity. The various TLRs exhibit different patterns of expression. This gene is predominantly expressed in lung, placenta, and spleen, and lies in close proximity to another family member, TLR8, on chromosome X.
15170	
15171	
15172	
15173	Retention of resident soluble proteins in the lumen of the endoplasmic reticulum (ER) is achieved in both yeast and animal cells by their continual retrieval from the cis-Golgi, or a pre-Golgi compartment.  Sorting of these proteins is dependent on a C-terminal tetrapeptide signal, usually lys-asp-glu-leu (KDEL) in animal cells, and his-asp-glu-leu (HDEL) in S. cerevisiae.  This process is mediated by a receptor that recognizes, and binds the tetrapeptide-containing protein, and returns it to the ER.  In yeast, the sorting receptor encoded by a single gene, ERD2, is a seven-transmembrane protein.  Unlike yeast, several human homologs of the ERD2 gene, constituting the KDEL receptor gene family, have been described. KDELR2 was the second member of the family to be identified, and it encodes a protein which is 83% identical to the KDELR1 gene product.
15174	
15175	
15176	
15177	
15178	
15179	The protein encoded by this gene is a member of the INK4 family of cyclin-dependent kinase inhibitors. This protein has been shown to interact with CDK4 or CDK6, and prevent the activation of the CDK kinases, thus function as a cell growth regulator that controls cell cycle G1 progression. Ectopic expression of this gene was shown to suppress the growth of human cells in a manner that appears to correlate with the presence of a wild-type RB1 function. Studies in the knockout mice suggested the roles of this gene in regulating spermatogenesis, as well as in suppressing tumorigenesis. Two alternatively spliced transcript variants of this gene, which encode an identical protein, have been reported.
15180	Peptidoglycan recognition proteins, such as PGRPI-beta, are part of the innate immune system and recognize peptidoglycan, a ubiquitous component of bacterial cell walls.[supplied by OMIM]
15181	
15182	Members of the G protein-coupled receptor (GPR) family play important roles in signal transduction from the external environment to the inside of the cell.[supplied by OMIM]
15183	Polycomb group (PcG) of proteins form the multiprotein complexes that are important for the transcription repression of various genes involved in development and cell proliferation. The protein encoded by this gene is one of the PcG proteins. It has been shown to interact with, and suppress the activity of, transcription factor CP2 (TFCP2/CP2). Studies of the mouse counterpart suggested the involvement of this gene in the specification of anterior-posterior axis, as well as in cell proliferation in early development. This protein was also found to interact with huntingtin interacting protein 2 (HIP2), an ubiquitin-conjugating enzyme, and possess ubiquitin ligase activity.
15184	
15185	
15186	
15187	
15188	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
15189	
15190	
15191	The Ra-reactive factor (RARF) is a complement-dependent bactericidal factor that binds to the Ra and R2 polysaccharides expressed by certain enterobacteria. Alternate splicing of this gene results in two transcript variants encoding two RARF components that are involved in the mannan-binding lectin pathway of complement activation. The longer isoform is cleaved into two chains which form a heterodimer linked by a disulfide bond. The encoded proteins are members of the trypsin family of peptidases.
15192	Nuclear factor I (NFI) proteins constitute a family of dimeric DNA-binding proteins with similar, and possibly identical, DNA-binding specificity. They function as cellular transcription factors and as replication factors for adenovirus DNA replication. Diversity in this protein family is generated by multiple genes, differential splicing, and heterodimerization.[supplied by OMIM]
15193	Interferons are cytokines that have antiviral effects and inhibit tumor cell proliferation. They induce a large number of genes in their target cells, including those coding for the guanylate-binding proteins (GBPs). GBPs are characterized by their ability to specifically bind guanine nucleotides (GMP, GDP, and GTP). The protein encoded by this gene is a GTPase that converts GTP to GDP and GMP.
15194	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. It has also been determined to be a type-1 diabetes autoantigen, also known as islet cell antibody 12.
15195	The scavenger receptor cysteine-rich (SRCR) superfamily is an ancient and highly conserved group of cell surface and/or secreted proteins, some of which are involved in the development of the immune system and the regulation of both innate and adaptive immune responses. Group B SRCR domains usually contain 8 regularly spaced cysteines that give rise to a well-defined intradomain disulfide-bond pattern.[supplied by OMIM]
15196	
15197	The adenovirus E1A protein both activates and represses gene expression to promote cellular proliferation and inhibit differentiation. The protein encoded by this gene antagonizes transcriptional activation and cellular transformation by E1A. This protein shares limited sequence similarity with E1A and binds both the general transcription factor TBP and the tumor suppressor pRb in vitro. This gene may contribute to the transcriptional control of cell growth and differentiation.
15198	L type voltage-dependent calcium channel controls release of calcium in skeletal muscle cells. Alpha 1 (CACNA1S) is one of the five subunits that constitute this calcium channel.  Mutation in CACNA1S has been reported to associate with malignant hyperthermia susceptibility and hypokalemic periodic paralysis.
15199	
15200	3-Phosphoglycerate dehydrogenase (PHGDH; EC 1.1.1.95) catalyzes the transition of 3-phosphoglycerate into 3-phosphohydroxypyruvate, which is the first and rate-limiting step in the phosphorylated pathway of serine biosynthesis, using NAD+/NADH as a cofactor.[supplied by OMIM]
15201	
15202	
15203	
15204	The encoded gene product presumably interacts with YY1 protein; however, its exact function is not known. Five transcript variants encoding different isoforms have been noted for this gene. Expression of one of the isoforms has been associated with hepatocellular carcinoma.
15205	The protein encoded by this gene is a member of the CLK family of dual specificity protein kinases. CLK family members have shown to interact with, and phosphorylate, serine- and arginine-rich (SR) proteins of the spliceosomal complex, which is a part of the regulatory mechanism that enables the SR proteins to control RNA splicing. This protein kinase is found to be involved in the regulation of several cellular processes and may serve as a link between cell cycle progression, apoptosis, and telomere length regulation. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
15206	
15207	Adenylosuccinate synthetase catalyzes the first committed step in the conversion of IMP to AMP
15208	The protein encoded by this gene is a member of the Toll-like receptor (TLR) family which plays a fundamental role in pathogen recognition and activation of innate immunity. TLRs are highly conserved from Drosophila to humans and share structural and functional similarities. They recognize pathogen-associated molecular patterns (PAMPs) that are expressed on infectious agents, and mediate the production of cytokines necessary for the development of effective immunity. The various TLRs exhibit different patterns of expression. This gene product is expressed in myelomonocytic cells, and recognizes bacterial flagellin, a principal component of bacterial flagella and a virulence factor. The activation of this receptor mobilizes the nuclear factor NF-kappaB and stimulates tumour necrosis factor-alpha production.
15209	
15210	
15211	
15212	
15213	
15214	
15215	
15216	
15217	LAMB3 encodes the beta 3 subunit of laminin. Laminin is composed of three subunits (alpha, beta, and gamma), and refers to a family of basement membrane proteins. For example, LAMB3 serves as the beta chain in laminin-5. Mutations in LAMB3 have been identified as the cause of various types of epidermolysis bullosa. Two alternatively spliced transcript variants encoding the same protein have been found for this gene.
15218	
15219	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes (E1s), ubiquitin-conjugating enzymes (E2s), and ubiquitin-protein ligases (E3s). This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. The encoded protein is 98% identical to the mouse counterpart.
15220	
15221	
15222	
15223	Transcriptional repression is a general mechanism for regulating transcriptional initiation in organisms ranging from yeast to humans.  Accurate initiation of transcription from eukaryotic protein-encoding genes requires the assembly of a large multiprotein complex consisting of RNA polymerase II and general transcription factors such as TFIIA, TFIIB, and TFIID. DR1 is a repressor that interacts with the TATA-binding protein (TBP) of TFIID and prevents the formation of an active transcription complex by precluding the entry of TFIIA and/or TFIIB into the preinitiation complex. The protein encoded by this gene is a corepressor of transcription that interacts with DR1 to enhance DR1-mediated repression. The interaction between this corepressor and DR1 is required for corepressor function and appears to stabilize the TBP-DR1-DNA complex.
15224	
15225	
15226	
15227	
15228	
15229	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S21P family. Pseudogenes corresponding to this gene are found on chromosomes 1p, 1q, 9p, 10p, 10q, 16q, and 17q. Available sequence data analyses identified splice variants that differ in the 5@#$%& UTR; both transcripts encode the same protein.
15230	
15231	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent  process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the peptidase T1A family, that is a 20S core alpha subunit.
15232	
15233	The MYCBP gene encodes a protein that binds to the N-terminal region of MYC (MIM 190080) and stimulates the activation of E box-dependent transcription by MYC.[supplied by OMIM]
15234	
15235	This gene is significantly downregulated in primary brain tumors. The exact function of the protein encoded by this gene is unknown.
15236	This gene encodes an endoplasmic reticulum associated protein that is involved in glycosylphosphatidylinositol (GPI) lipid anchor biosynthesis. The GPI lipid anchor is a glycolipid found on many blood cells and serves to anchor proteins to the cell surface. The encoded protein is one subunit of the GPI N-acetylglucosaminyl (GlcNAc) transferase that transfers GlcNAc to phosphatidylinositol (PI) on the cytoplasmic side of the endoplasmic reticulum. Two alternatively spliced transcripts that encode the same protein have been found for this gene. A pseudogene on chromosome 11 has also been characterized.
15237	
15238	This gene encodes a protein that contains laminin EGF motifs, a pentaxin domain, and many fibronectin type III motifs. The encoded basement membrane-associated protein may be important in development and homeostasis of the inner ear and retina. Mutations within this gene have been associated with Usher syndrome type IIa. Alternatively spliced transcript variants that encode different isoforms have been described.
15239	This gene encodes a protein that is similar to other immediate early response proteins. In the mouse, a similar gene may play an important role in mediating the cellular response to mitogenic signals. Studies in rats found the expression of a similar gene to be increased after waking and sleep deprivation.
15240	
15241	NRP1 is a membrane-bound coreceptor to a tyrosine kinase receptor for both vascular endothelial growth factor (VEGF; MIM 192240) and semaphorin (see SEMA3A; MIM 603961) family members. NRP1 plays versatile roles in angiogenesis, axon guidance, cell survival, migration, and invasion.[supplied by OMIM]
15242	
15243	
15244	The protein encoded by this gene belongs to the myosin superfamily. Myosins are actin-dependent motor proteins and are categorized into conventional myosins (class II) and unconventional myosins (classes I and III through XV) based on their variable C-terminal cargo-binding domains. Class III myosins, such as this one, have a kinase domain N-terminal to the conserved N-terminal motor domains and are expressed in photoreceptors. The protein encoded by this gene plays an important role in hearing in humans. 3 different recessive, loss of function mutations in the encoded protein have been shown to cause nonsyndromic progressive hearing loss. Expression of this gene is highly restricted, with the strongest expression in retina and cochlea.
15245	
15246	
15247	
15248	
15249	This is an intronless gene that is involved in cholesterol and lipid metabolism. The encoded protein is a membrane protein and contains clusters of histidine residues essential for catalytic activity. Unlike most other sterol hydroxylases, this enzyme is a member of a small family of enzymes that utilize diiron cofactors to catalyze the hydroxylation of hydrophobic substrates.
15250	
15251	
15252	
15253	
15254	
15255	
15256	
15257	
15258	the protein encoded by this gene is a type II single transmembrane protein. It is required for maximal stimulated calcium release after stimulation of purinergic or muscarinic but not beta-adrenergic receptors. The encoded protein interacts with D1 dopamine receptor and may interact with other DA receptor subtypes and/or GPCRs.
15259	
15260	
15261	Loss of sequences from human chromosome 10q has been associated with the progression of human cancers.  The gene DMBT1 was originally isolated based on its deletion in a medulloblastoma cell line.  DMBT1 is expressed with transcripts of 6.0, 7.5, and 8.0 kb in fetal lung and with one transcript of 8.0 kb in adult lung, although the 7.5 kb transcript has not been characterized.  The DMBT1 protein is a glycoprotein containing multiple scavenger receptor cysteine-rich (SRCR) domains separated by SRCR-interspersed domains (SID).  Transcript variant 2 (8.0 kb) has been shown to bind surfactant protein D independently of carbohydrate recognition.  This indicates that DMBT1 may not be a classical tumor supressor gene, but rather play a role in the interaction of tumor cells and the immune system.
15262	
15263	
15264	
15265	
15266	This gene encodes a protein best known as a hematopoietic cell granule proteoglycan. Proteoglycans stored in the secretory granules of many hematopoietic cells also contain a protease-resistant peptide core, which may be important for neutralizing hydrolytic enzymes. This encoded protein was found to be associated with the macromolecular complex of granzymes and perforin, which may serve as a mediator of granule-mediated apoptosis.
15267	This locus shares sequence similarity with heterogeneous nuclear ribonuclear proteins and represents an apparent transcribed pseudogene of the HNRPA3 gene, which maps to chromosome 2.
15268	Mutations in dysferlin, a protein associated with the plasma membrane, can cause muscle weakness that affects both proximal and distal muscles. The protein encoded by this gene is a type II membrane protein that is structurally similar to dysferlin. It is a member of the ferlin family and associates with both plasma and nuclear membranes. The protein contains C2 domains that play a role in calcium-mediated membrane fusion events, suggesting that it may be involved in membrane regeneration and repair. Two transcript variants encoding different isoforms have been found for this gene. Other possible variants have been detected, but their full-length natures have not been determined.
15269	
15270	MOK2 proteins are DNA- and RNA-binding proteins that are mainly associated with nuclear RNP components, including the nucleoli and extranucleolar structures (Arranz et al., 1997).[supplied by OMIM]
15271	
15272	
15273	RNA-editing deaminase-2 (RED2, or ADARB2) is a member of the double-stranded RNA (dsRNA) adenosine deaminase family of RNA-editing enzymes. Adenosine deamination of pre-mRNA results in a change in the amino acid sequence of the gene product, which differs from that predicted by the genomic DNA sequence. Other members of this family include DRADA (ADAR; MIM 601059) and RED1 (ADARB1; MIM 601218) (Mittaz et al., 1997 [PubMed 9272162]).[supplied by OMIM]
15274	
15275	This protein belongs to a family of proteins that are receptor associated proteins of the synapse. It contains a conserved cAMP-dependent protein kinase phosphorylation site. It is believed to play some role in anchoring or stabilizing the nicotinic acetylcholine receptor at synaptic sites. It may link the receptor to the underlying postsynaptic cytoskeleton, possibly by direct association with actin or spectrin. Two splice variants have been identified for this gene.
15276	The protein encoded by this gene is an integral peroxisomal membrane protein. An inactivating nonsense mutation localized to this gene was observed in a patient with Zellweger syndrome of the complementation group CGD/CG9. Expression of this gene product morphologically and biochemically restores the formation of new peroxisomes, suggesting a role in peroxisome organization and biogenesis. Alternative splicing has been observed for this gene and two variants have been described.
15277	
15278	
15279	
15280	This gene encodes two proteins, the SART1(800) protein expressed in the nucleus of the majority of proliferating cells, and the SART1(259) protein expressed in the cytosol of epithelial cancers. The SART1(259) protein is translated by the mechanism of -1 frameshifting during posttranscriptional regulation; its full-length sequence is not published yet. The two encoded proteins are thought to be involved in the regulation of proliferation. Both proteins have tumor-rejection antigens. The SART1(259) protein possesses tumor epitopes capable of inducing HLA-A2402-restricted cytotoxic T lymphocytes in cancer patients. This SART1(259) antigen may be useful in specific immunotherapy for cancer patients and may serve as a paradigmatic tool for the diagnosis and treatment of patients with atopy. The SART1(259) protein is found to be essential for the recruitment of the tri-snRNP to the pre-spliceosome in the spliceosome assembly pathway.
15281	This gene encodes an enzyme that is a member of the beta-1,3-N-acetylglucosaminyltransferase family. This enzyme is a type II transmembrane protein. It is essential for the synthesis of poly-N-acetyllactosamine, a determinant for the blood group i antigen.
15282	The protein encoded by this gene is an adaptor molecule that interacts with various cell surface receptors and mediates cell apoptotic signals. Through its C-terminal death domain, this protein can be recruited by TNFRSF6/Fas-receptor, tumor necrosis factor receptor, TNFRSF25, and TNFSF10/TRAIL-receptor, and thus it participates in the death signaling initiated by these receptors. Interaction of this protein with the receptors unmasks the N-terminal effector domain of this protein, which allows it to recruit caspase-8, and thereby activate the cysteine protease cascade. Knockout studies in mice also suggest the importance of this protein in early T cell development.
15283	
15284	
15285	This gene encodes a disintegrin and metalloproteinase with thrombospondin motifs-8 (ADAMTS8), which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The enzyme encoded by this gene contains 2 C-terminal TS motifs, and disrupts angiogenesis in vivo and in vitro more efficiently than thrombospondin-1 or endostatin but somewhat less efficiently than ADAMTS1 protein. A number of disorders have been mapped in the vicinity of this gene, most notably, lung neoplasms.
15286	
15287	The product of this gene belongs to the ligand-gated ion channel receptor superfamily. This gene encodes subunit B of the type 3 receptor for 5-hydroxytryptamine (serotonin), a biogenic hormone that functions as a neurotransmitter, a hormone, and a mitogen. This receptor causes fast, depolarizing responses in neurons after activation. It appears that the heteromeric combination of A and B subunits is necessary to provide the full functional features of this receptor, since either subunit alone results in receptors with very low conductance and response amplitude.
15288	The protein encoded by this gene is a transformation and shape-change sensitive actin cross-linking/gelling protein found in fibroblasts and smooth muscle. Its expression is down-regulated in many cell lines, and this down-regulation may be an early and sensitive marker for the onset of transformation. A functional role of this protein is unclear. Two transcript variants encoding the same protein have been found for this gene.
15289	The protein encoded by this gene is a transformation and shape-change sensitive actin cross-linking/gelling protein found in fibroblasts and smooth muscle. Its expression is down-regulated in many cell lines, and this down-regulation may be an early and sensitive marker for the onset of transformation. A functional role of this protein is unclear. Two transcript variants encoding the same protein have been found for this gene.
15290	
15291	
15292	The protein encoded by this gene belongs to the innexin family. Innexin family members are known to be the structural components of gap junctions.
15293	
15294	
15295	The product encoded by this gene is preferentially expressed in hematopoietic cells and is most homologous to the focal adhesion protein, paxillin. It may function in cell type-specific signaling by associating with PYK2, a member of focal adhesion kinase family. As a substrate for a tyrosine kinase in lymphoid cells, this protein may also function in, and be regulated by tyrosine kinase activity.
15296	
15297	This gene is a member of the solute carrier family and encodes a cell surface, transmembrane protein with an alpha amylase domain. The protein exists as the heavy chain of a heterodimer, covalently bound through di-sulfide bonds to one of several possible light chains. It associates with integrins and mediates integrin-dependent signaling related to normal cell growth and tumorigenesis. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
15298	
15299	
15300	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein.
15301	
15302	
15303	LMO1 enocdes a cysteine-rich, two LIM domain transcriptional regulator. It is mapped to an area of consistent chromosomal translocation in chromosome 11, disrupting it in T-cell leukemia, although more rarely than the related gene, LMO2 is disrupted.
15304	
15305	
15306	The delta (HBD) and beta (HBB) genes are normally expressed in the adult: two alpha chains plus two beta chains constitute HbA, which in normal adult life comprises about 97% of the total hemoglobin. Two alpha chains plus two delta chains constitute HbA-2, which with HbF comprises the remaining 3% of adult hemoglobin. Five beta-like globin genes are found within a 45 kb cluster on chromosome 11 in the following order:      5@#$%&-epsilon--Ggamma--Agamma--delta--beta-3@#$%&. Mutations in the delta-globin gene are associated with beta-thalassemia.
15307	This gene encodes one of two non-identical subunits which constitute ribonucleoside-diphosphate reductase, an enzyme essential for the production of deoxyribonucleotides prior to DNA synthesis in S phase of dividing cells. It is one of several genes located in the imprinted gene domain of 11p15.5, an important tumor-suppressor gene region. Alterations in this region have been associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocrotical carcinoma, and lung, ovarian, and breast cancer. This gene may play a role in malignancies and disease that involve this region. This gene is oriented in a head-to-tail configuration with the stromal interaction molecule 1 gene (STIM1), with the 3@#$%& end of STIM1 situated 1.6 kb from the 5@#$%& end of this gene.
15308	
15309	This gene encodes a member of the trypsin family of serine proteases. Its specific function is unknown.
15310	
15311	
15312	
15313	GM2 and GD2 gangliosides are sialic acid-containing glycosphingolipids. GalNAc-T is the enzyme involved in the biosynthesis of G(M2) and G(D2) glycosphingolipids. GalNAc-T catalyzes the transfer of GalNAc into G(M3) and G(D3) by a beta-1,4 linkage, resulting in the synthesis of G(M2) and G(D2), respectively.
15314	
15315	
15316	
15317	This gene encodes the cytosolic form of the enzyme branched-chain amino acid transaminase. This enzyme catalyzes the reversible transamination of branched-chain alpha-keto acids to branched-chain L-amino acids essential for cell growth. Two different clinical disorders have been attributed to a defect of branched-chain amino acid transamination: hypervalinemia and hyperleucine-isoleucinemia. As there is also a gene encoding a mitochondrial form of this enzyme, mutations in either gene may contribute to these disorders.
15318	
15319	This gene encodes a bifunctional enzyme that catalyzes the first two steps in the mammalian lysine degradation pathway. The N-terminal and the C-terminal portions of this enzyme contain lysine-ketoglutarate reductase and saccharopine dehydrogenase activity, respectively, resulting in the conversion of lysine to alpha-aminoadipic semialdehyde. Mutations in this gene are associated with familial hyperlysinemia.
15320	
15321	The BTG1 gene locus has been shown to be involved in a t(8;12)(q24;q22) chromosomal translocation in a case of B-cell chronic lymphocytic leukemia. It is a member of a family of antiproliferative genes. BTG1 expression is maximal in the G0/G1 phases of the cell cycle and downregulated when cells progressed through G1. It negatively regulates cell proliferation.
15322	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin tightly associates with CDK9 kinase, and was found to be a major subunit of the transcription elongation factor p-TEFb. The kinase complex containing this cyclin and the elongation factor can interact with, and act as a cofactor of human immunodeficiency virus type 1 (HIV-1) Tat protein, and was shown to be both necessary and sufficient for full activation of viral transcription. This cyclin and its kinase partner were also found to be involved in the phosphorylation and regulation of the carboxy-terminal domain (CTD) of the largest RNA polymerase II subunit.
15323	Phosphoprotein 32 (PP32) is a tumor suppressor that can inhibit several types of cancers, including prostate and breast cancers. The protein encoded by this gene is one of at least two proteins that are similar in amino acid sequence to PP32 and are part of the same acidic nuclear phosphoprotein gene family. However, unlike PP32, the encoded protein is tumorigenic. The tumor suppressor function of PP32 has been localized to a 25 amino acid region that is absent in the protein encoded by this gene. This gene does not contain introns.
15324	The protein encoded by this gene is found in the nucleoli. It has high sequence homology to human MLLT1, and yeast and human MLLT3 proteins. Both MLLT1 and MLLT3 proteins belong to a class of transcription factors, indicating that the encoded protein might also represent a transcription factor. This protein is thought to be required for RNA transcription. This gene has been shown to be amplified in tumors.
15325	HIV-1, the causative agent of acquired immunodeficiency syndrome (AIDS), contains an RNA genome that produces a chromosomally integrated DNA during the replicative cycle. Activation of HIV-1 gene expression by the transactivator Tat is dependent on an RNA regulatory element (TAR) located downstream of the transcription initiation site. The protein encoded by this gene binds between the bulge and the loop of the HIV-1 TAR RNA regulatory element and activates HIV-1 gene expression in synergy with the viral Tat protein. Three transcript variants encoding different isoforms as well as a pseudogene have been found for this gene.
15326	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). There are three separate genes which encode the subunit c of the proton channel and they specify precursors with different import sequences but identical mature proteins. The protein encoded by this gene is one of three precursors of subunit c. Alternatively spliced transcript variants encoding different isoforms have been identified.
15327	
15328	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains a C-terminal PEST motif, which serves as a protein-protein interaction domain, and may be related to protein intracellular half-life. This PTP was found to bind and dephosphorylate the product of oncogene c-ABL, thus may play a role in oncogenesis. This PTP was shown to interact with, and dephosphorylate, various of cytoskeleton and cell adhesion molecules, such as p130 (Cas), CAKbeta/PTK2B, PSTPIP1, and paxillin, which suggested its regulatory roles in controlling cell shape and mobility.
15329	
15330	The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is specifically expressed in the basal layer of the epidermis with family member KRT14. Mutations in these genes have been associated with a complex of diseases termed epidermolysis bullosa simplex. The type II cytokeratins are clustered in a region of chromosome 12q12-q13.
15331	The protein encoded by this gene is an RNA-binding nuclear protein that is a tumor-rejection antigen. This antigen possesses tumor epitopes capable of inducing HLA-A24-restricted and tumor-specific cytotoxic T lymphocytes in cancer patients and may be useful for specific immunotherapy. This gene product is found to be an important cellular factor for HIV-1 gene expression and viral replication. It also associates transiently with U6 and U4/U6 snRNPs during the recycling phase of the spliceosome cycle. This encoded protein is thought to be involved in the regulation of mRNA splicing.
15332	Dermatan sulfate proteoglycan 3 is a member of the small leucine-rich repeat proteoglycan family. This gene is composed of seven exons. It regulates fibrillogenesis by interacting with collagen fibrils and other extracellular matrix proteins.
15333	
15334	This gene encodes a 25-kD microfibril-associated glycoprotein which is rich in serine and threonine residues. It lacks a hydrophobic carboxyl terminus and proline-, glutamine-, and tyrosine-rich regions, which are characteristics of a related 31-kDa microfibril-associated glycoprotein (MFAP2). The close similarity between these two proteins is confined to a central region of 60 aa where precise alignment of 7 cysteine residues occurs. The structural differences suggest that this encoded protein has some functions that are distinct from those of MFAP2.
15335	
15336	This gene encodes an actin capping protein, one of the F-actin capping protein alpha subunit family. The encoded protein is predominantly localized to the neck region of ejaculated sperm, other immunohistochemical signals were found in the tail and postacrosomal regions. The encoded protein may also form heterodimers of alpha and beta subunits. This protein may be important in determining sperm architecture and male fertility.
15337	
15338	This gene is a member of the intermediate filament family. Intermediate filaments are proteins which are primordial components of the cytoskeleton and nuclear envelope. The proteins encoded by the members of this gene family are evolutionarily and structurally related but have limited sequence homology, with the exception of the central rod domain. Alternative splicing has been observed for this gene and three transcript variants have been identified.
15339	
15340	
15341	
15342	
15343	
15344	
15345	
15346	
15347	
15348	Dual-specificity kinases, such as DYRK4, play key roles in cell proliferation, survival, and development (Zhang et al., 2005).[supplied by OMIM]
15349	
15350	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been determined.
15351	
15352	
15353	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. The encoded protein may play a role in chondrogenesis. A pseudogene of this gene is located on chromosome 8. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
15354	
15355	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
15356	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies near the nucleus, but its function is unknown. The gene is located on chromosome 13 within the minimal deletion region for B-cell chronic lymphocytic leukemia. Multiple alternatively spliced transcript variants have been found for this gene.
15357	The protein encoded by this gene belongs to the family of P-type cation transport ATPases. This gene encodes a catalytic subunit of the ouabain-sensitive H+/K+ -ATPase that catalyzes the hydrolysis of ATP coupled with the exchange of H(+) and K(+) ions across the plasma membrane. It is also responsible for potassium absorption in various tissues.
15358	This gene,CYP3A5, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by glucocorticoids and some pharmacological agents. The enzyme metabolizes drugs such as nifedipine and cyclosporine as well as the steroid hormones testosterone, progesterone and androstenedione. This gene is part of a cluster of cytochrome P450 genes on chromosome 7q21.1. This cluster includes a pseudogene, CYP3A5P1, which is very similar to CYP3A5. This similarity has caused some difficulty in determining whether cloned sequences represent the gene or the pseudogene.
15359	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. A mutation in this gene is associated with autosomal dominant cerebral ataxia. Alternatively spliced transcript variants have been found for this gene.
15360	
15361	
15362	
15363	This gene encodes a protein that is a homolog of Saccharomyces cerevisiae CDC16. This protein is a part of the APC complex that is composed of eight proteins and functions as a protein ubiquitin ligase. The APC complex is a cyclin degradation system that governs exit from mitosis. Each component protein of the APC complex is highly conserved among eucaryotic organisms. CDC16 and two other APC complex proteins, CDC23 and CDC27, contain a tetratricopeptide repeat (TPR), a protein domain that may be involved in protein-protein interaction.
15364	
15365	
15366	
15367	
15368	
15369	
15370	
15371	
15372	
15373	
15374	FABP4 encodes the fatty acid binding protein found in adipocytes.  Fatty acid binding proteins are a family of small, highly conserved, cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. It is thought that FABPs roles include fatty acid uptake, transport, and metabolism.
15375	
15376	
15377	
15378	
15379	
15380	
15381	
15382	
15383	
15384	
15385	This gene encodes a member of the epoxide hydrolase family. The protein, found in both the cytosol and peroxisomes, binds to specific epoxides and converts them to the corresponding dihydrodiols. Mutations in this gene have been associated with familial hypercholesterolemia. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
15386	
15387	
15388	The protein encoded by this gene plays a role in the determination of cell fates during development. The encoded protein, whose degradation is induced in a proteasome-dependent manner by MDM2, is a membrane-bound protein that has been shown to associate with EPS15, LNX1, and NOTCH1. Four transcript variants encoding different isoforms have been found for this gene.
15389	
15390	
15391	
15392	Checkpoint suppressor 1 is a member of the forkhead/winged helix transcription factor family. Checkpoints are eukaryotic DNA damage-inducible cell cycle arrests at G1 and G2. Checkpoint suppressor 1 suppresses multiple yeast checkpoint mutations including mec1, rad9, rad53 and dun1 by activating a MEC1-independent checkpoint pathway.
15393	
15394	
15395	TRPCs, mammalian homologs of the Drosophila transient receptor potential (trp) protein, are ion channels that are thought to mediate capacitative calcium entry into the cell. TRP-PLIK is a protein that is both an ion channel and a kinase. As a channel, it conducts calcium and monovalent cations to depolarize cells and increase intracellular calcium. As a kinase, it is capable of phosphorylating itself and other substrates. The kinase activity is necessary for channel function, as shown by its dependence on intracellular ATP and by the kinase mutants.[supplied by OMIM]
15396	
15397	
15398	
15399	
15400	
15401	This gene encodes a member of the neurotrophic tyrosine receptor kinase (NTRK) family. This kinase is a membrane-bound receptor that, upon neurotrophin binding, phosphorylates itself and members of the MAPK pathway. Signalling through this kinase leads to cell differentiation and may play a role in the development of proprioceptive neurons that sense body position. Mutations in this gene have been associated with medulloblastomas, secretory breast carcinomas and other cancers.
15402	
15403	LIPC encodes hepatic triglyceride lipase, which is expressed in liver. LIPC has the dual functions of triglyceride hydrolase and ligand/bridging factor for receptor-mediated lipoprotein uptake.
15404	
15405	This gene is a member of the ARID (AT-rich interaction domain) family of proteins which bind DNA. It is homologous with two proteins that bind to the retinoblastoma gene product and also with the mouse Bright and Drosophila dead ringer proteins. A pseudogene on chromosome 1p31 also exists for this gene. Other ARID family members have roles in embryonic patterning, cell lineage gene regulation, cell cycle control, transcriptional regulation and possibly in chromatin structure modification.
15406	
15407	
15408	Aminopeptidase N is located in the small-intestinal and renal microvillar membrane, and also in other plasma membranes. In the small intestine aminopeptidase N plays a role in the final digestion of peptides generated from hydrolysis of proteins by gastric and pancreatic proteases. Its function in proximal tubular epithelial cells and other cell types is less clear. The large extracellular carboxyterminal domain contains a pentapeptide consensus sequence characteristic of members of the zinc-binding metalloproteinase superfamily. Sequence comparisons with known enzymes of this class showed that CD13 and aminopeptidase N are identical. The latter enzyme was thought to be involved in the metabolism of regulatory peptides by diverse cell types, including small intestinal and renal tubular epithelial cells, macrophages, granulocytes, and synaptic membranes from the CNS. Human aminopeptidase N is a receptor for one strain of human coronavirus that is an important cause of upper respiratory tract infections. Defects in this gene appear to be a cause of various types of leukemia or lymphoma.
15409	The Bloom syndrome gene product is related to the RecQ subset of DExH box-containing DNA helicases and has both DNA-stimulated ATPase and ATP-dependent DNA helicase activities. Mutations causing Bloom syndrome delete or alter helicase motifs and may disable the 3@#$%&-5@#$%& helicase activity.  The normal protein may act to suppress inappropriate recombination.
15410	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
15411	
15412	
15413	
15414	
15415	
15416	The protein encoded by this gene is a member of the RAD51 protein family. RAD51 family members are highly similar to bacterial RecA and Saccharomyces cerevisiae Rad51, and are known to be involved in the homologous recombination and repair of DNA. This protein can interact with the ssDNA-binding protein RPA and RAD52, and it is thought to play roles in homologous pairing and strand transfer of DNA. This protein is also found to interact with BRCA1 and BRCA2, which may be important for the cellular response to DNA damage. BRCA2 is shown to regulate both the intracellular localization and DNA-binding ability of this protein. Loss of these controls following BRCA2 inactivation may be a key event leading to genomic instability and tumorigenesis. Two alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported. Transcript variants utilizing alternative polyA signals exist.
15417	
15418	
15419	
15420	
15421	
15422	
15423	This gene encodes the major vault protein which is a lung resistance-related protein. Vaults are multi-subunit structures that may be involved in nucleo-cytoplasmic transport. This protein mediates drug resistance, perhaps via a transport process. It is widely distributed in normal tissues, and overexpressed in multidrug-resistant cancer cells. The protein overexpression is a potentially useful marker of clinical drug resistance. This gene produces two transcripts by using two alternative exon 2 sequences; however, the open reading frames are the same in both transcripts.
15424	
15425	
15426	
15427	This gene encodes a member of the cytoskeletal BTB/kelch (Broad-Complex, Tramtrack and Bric a brac) repeat family. (Kelch repeats are predicted to form a beta-propeller shape.) The encoded protein plays a role in neurofilament architecture and is mutated in giant axonal neuropathy.
15428	The protein encoded by this gene is a nuclear protein that belongs to the family of Kruppel-like C2H2 zinc finger proteins. It functions as a DNA-binding transcription factor by using distinct zinc fingers in different signaling pathways. Thus, it is thought that this gene may have multiple roles in signal transduction during development.
15429	
15430	
15431	This gene encodes an integral membrane protein. The gene belongs to the synaptogyrin gene family. Like other members of the family the protein contains four transmembrane regions. The exact function of this protein is unclear.
15432	
15433	This gene product, Aldolase A (fructose-bisphosphate aldolase) is a glycolytic enzyme that catalyzes the reversible conversion of fructose-1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate. Three aldolase isozymes (A, B, and C), encoded by three different genes, are differentially expressed during development. Aldolase A is found in the developing embryo and is produced in even greater amounts in adult muscle. Aldolase A expression is repressed in adult liver, kidney and intestine and similar to aldolase C levels in brain and other nervous tissue. Aldolase A deficiency has been associated with myopathy and hemolytic anemia. Alternative splicing of this gene results in multiple transcript variants which encode the same protein.
15434	
15435	
15436	Cytochrome c oxidase (COX), the terminal enzyme of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. It is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may be involved in the regulation and assembly of the complex. This nuclear gene encodes subunit VIc, which has 77% amino acid sequence identity with mouse COX subunit VIc. This gene is up-regulated in prostate cancer cells. A pseudogene COX6CP1 has been found on chromosomes 16p12.
15437	
15438	
15439	Members of the F-box protein family, such as FBXO31, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
15440	
15441	This gene was identified by its differential expression in cancer cells. The product of this gene is predicted to be a transmembrane protein. The level of this protein was found to be lower in tumor derived cell lines compared to normal cells. This gene was thus proposed to be a candidate tumor suppressor gene.
15442	
15443	
15444	
15445	
15446	
15447	CA2 is one of several (at least 7) isozymes of carbonic anhydrase.  Carbonic anhydrase catalyzes reversible hydration of carbon dioxide.  Defects in this enzyme are associated with osteopetrosis and renal tubular acidosis.
15448	The zona pellucida is an extracellular matrix that surrounds the oocyte and early embryo. It is composed primarily of three or four glycoproteins with various functions during fertilization and preimplantation development. The protein encoded by this gene is a structural component of the zona pellucida and functions in secondary binding and penetration of acrosome-reacted spermatozoa. The nascent protein contains a N-terminal signal peptide sequence, a conserved ZP domain, a consensus furin cleavage site, and a C-terminal transmembrane domain. It is hypothesized that furin cleavage results in release of the mature protein from the plasma membrane for subsequent incorporation into the zona pellucida matrix. However, the requirement for furin cleavage in this process remains controversial based on mouse studies.
15449	
15450	CDK3 complements cdc28 mutants of Saccharomyces cerevisiae suggesting that it may be involved in cell cycle control.  CDK3 can phosphorylate histone H1 and interacts with an unknown type of cyclin.
15451	
15452	
15453	The protein encoded by this gene possesses kinase activity that phosphorylates regulators involved in signal transduction. It phosphorylates I kappa-Balpha, p105, and c-Jun. It acts as a docking site for complex-mediated phosphorylation. The gene is located within the Smith-Magenis syndrome region on chromosome 17.
15454	FN3K catalyzes phosphorylation of fructosamines formed by glycation, the nonenzymatic reaction of glucose with primary amines followed by Amadori rearrangement. Phosphorylation of fructosamines may initiate metabolism of the modified amine and result in deglycation of glycated proteins (Delpierre et al., 2000).[supplied by OMIM]
15455	This gene encodes a member of the epidermal growth factor (EGF) receptor family of receptor tyrosine kinases. This protein has no ligand binding domain of its own and therefore cannot bind growth factors. However, it does bind tightly to other ligand-bound EGF receptor family members to form a heterodimer, stabilizing ligand binding and enhancing kinase-mediated activation of downstream signalling pathways, such as those involving mitogen-activated protein kinase and phosphatidylinositol-3 kinase. Allelic variations at amino acid positions 654 and 655 of isoform a (positions 624 and 625 of isoform b) have been reported, with the most common allele, Ile654/Ile655, shown here. Amplification and/or overexpression of this gene has been reported in numerous cancers, including breast and ovarian tumors. Alternative splicing results in several additional transcript variants, some encoding different isoforms and others that have not been fully characterized.
15456	
15457	
15458	Phosphatidylinositol transfer protein is a member of a diverse set of cytosolic phospholipid transfer proteins that are distinguished by their ability to transfer phospholipids between membranes in vitro (Wirtz, 1991).[supplied by OMIM]
15459	L-type calcium channels are composed of five subunits. The protein encoded by this gene represents one of these subunits, gamma, and is one of several gamma subunit proteins. This particular gamma subunit is part of skeletal muscle 1,4-dihydropyridine-sensitive calcium channels and is an integral membrane protein that plays a role in excitation-contraction coupling. This gene is a member of the neuronal calcium channel gamma subunit gene subfamily of the PMP-22/EMP/MP20 family and is located in a cluster with two similar gamma subunit-encoding genes.
15460	This gene encodes a protein which interacts with clathrin and adaptor-related protein complex 2, alpha 1 subunit. The protein is found in a brain-derived clathrin-coated vesicle fraction and localizes to the peri-Golgi region and the cell periphery. The protein is thought to be involved in clathrin-mediated endocytosis. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
15461	
15462	
15463	
15464	The CD233 gene is located on chromosome 17q21-q22 and is part of the anion exchanger (AE) family. CD233 is expressed in the erythrocyte plasma membrane where it functions as a chloride/bicarbonate exchanger involved in carbon dioxide transport from tissues to lungs. The protein comprises two domains that are structurally and functionally distinct. The N-terminal 40kDa domain is located in the cytoplasm and acts as an attachment site for the red cell skeleton by binding ankyrin. The glycosylated C-terminal membrane-associated domain contains 12-14 membrane spanning segments and carries out the stilbene disulphonate-sensitive exchange transport of anions. The cytoplasmic tail at the extreme C-terminus of the membrane domain binds carbonic anhydrase II. CD233 associates with the red cell membrane protein glycophorin A and this association promotes the correct folding and translocation of CD233. CD233 is predominantly dimeric but forms tetramers in the presence of ankyrin. Many CD233 mutations are known in man and these mutations can lead to two types of disease; destabilization of red cell membrane leading to hereditary spherocytosis, and defective kidney acid secretion leading to distal renal tubular acidosis. Other CD233 mutations that do not give rise to disease result in novel blood group antigens, which form the Diego blood group system. Southeast Asian ovalocytosis (SAO, Melanesian ovalocytosis) results from the heterozygous presence of a deletion in the CD233 protein and is common in areas where Plasmodium falciparum malaria is endemic. One CD233 null human is known also with very severe anemia and nephrocalcinosis [PROW]
15465	
15466	This gene is one of several cytokine genes clustered on the q-arm of chromosome 17. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The protein encoded by this gene is structurally related to the CXC subfamily of cytokines. Members of this subfamily are characterized by two cysteines separated by a single amino acid. This cytokine displays chemotactic activity for monocytes and basophils but not for neutrophils or eosinophils. It has been implicated in the pathogenesis of diseases characterized by monocytic infiltrates, like psoriasis, rheumatoid arthritis and atherosclerosis. It binds to chemokine receptors CCR2 and CCR4.
15467	
15468	The protein encoded by this gene was identified on the basis of its interaction with MAP3K12/DLK, a protein kinase known to be involved in the induction of cell apoptosis. This gene product contains a leucine zipper, which is a characteristic motif of transcription factors, and was shown to exhibit strong transactivation activity when fused to Gal4 DNA binding domain. Overexpression of this gene interfered with MAP3K12 induced apoptosis.
15469	
15470	This gene encodes a protein associated with the mitotic spindle apparatus. The encoded protein may be involved in the functional and dynamic regulation of mitotic spindles.
15471	
15472	
15473	
15474	
15475	The WNT gene family consists of structurally related genes that encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. Study of its expression in the teratocarcinoma cell line NT2 suggests that it may be implicated in the early process of neuronal differentiation of NT2 cells induced by retinoic acid. This gene is clustered with WNT3, another family member, in the chromosome 17q21 region.
15476	This gene is a member of the Period family of genes and is expressed in a circadian pattern in the suprachiasmatic nucleus, the primary circadian pacemaker in the mammalian brain. Genes in this family encode components of the circadian rhythms of locomotor activity, metabolism, and behavior. Circadian expression in the suprachiasmatic nucleus continues in constant darkness, and a shift in the light/dark cycle evokes a proportional shift of gene expression in the suprachiasmatic nucleus. The specific function of this gene is not yet known. Alternative splicing has been observed in this gene; however, these variants have not been fully described.
15477	The protein encoded by this gene is a member of the keratin gene family. As a type I hair keratin, it is an acidic protein which heterodimerizes with type II keratins to form hair and nails. The type I hair keratins are clustered in a region of chromosome 17q12-q21 and have the same direction of transcription.
15478	The protein encoded by this gene is a cytokine important for B and T cell development. This cytokine and the hepatocyte growth factor (HGF) form a heterodimer that functions as a pre-pro-B cell growth-stimulating factor. This cytokine is found to be a cofactor for V(D)J rearrangement of the T cell receptor beta (TCRB) during early T cell development. This cytokine can be produced locally by intestinal epithelial and epithelial goblet cells, and may serve as a regulatory factor for intestinal mucosal lymphocytes. Knockout studies in mice suggested that this cytokine plays an essential role in lymphoid cell survival.
15479	
15480	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
15481	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the high sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
15482	The protein encoded by this gene is a member of the keratin gene family. It is one of the type I hair keratin genes which are clustered in a region of chromosome 17q12-q21 and have the same direction of transcription. As a type I hair keratin, it is an acidic protein which heterodimerizes with type II keratins to form hair and nails. There are two isoforms of this protein, encoded by two separate genes, KRTHA3A and KRTHA3B.
15483	
15484	
15485	
15486	Vitamin B6, or pyridoxal 5-prime-phosphate (PLP), is critical for normal cellular function, and some cancer cells have notable differences in vitamin B6 metabolism compared to their normal counterparts. The rate-limiting enzyme in vitamin B6 synthesis is pyridoxine-5-prime-phosphate (PNP) oxidase (PNPO; EC 1.4.3.5).[supplied by OMIM]
15487	
15488	This gene encodes a protein which interacts with the carboxy-terminus of transcription initiation factor TFIIF, a transcription factor which regulates elongation as well as initiation by RNA polymerase II. The protein may also represent a component of an RNA polymerase II holoenzyme complex. Alternative splicing of this gene results in two transcript variants.
15489	
15490	The protein encoded by this gene contains a RING finger, a motif present in a variety of functionally distinct proteins and known to be involved in protein-DNA and protein-protein interactions. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
15491	
15492	
15493	Laminins are basement membrane components thought to mediate the attachment, migration and organization of cells into tissues during embryonic development by interacting with other extracellular matrix components. The protein encoded by this gene is the alpha-3 chain of laminin 5, which is a complex glycoprotein composed of three subunits (alpha, beta, and gamma). Laminin 5 is thought to be involved in cell adhesion, signal transduction and differentiation of keratinocytes. Mutations in this gene have been identified as the cause of Herlitz type junctional epidermolysis bullosa. Alternatively spliced transcript variants encoding different isoforms have been identified.
15494	Ferrochelatase is localized to the mitochondrion where it catalyzes the insertion of the ferrous form of iron into protoporphyrin IX in the heme synthesis pathway. Defects in ferrochelatase are associated with protoporphyria. Two transcript variants encoding different isoforms have been found for this gene.
15495	The encoded protein catalyzes the last step of the mitochondrial fatty acid beta-oxidation spiral. Unlike most mitochondrial matrix proteins, it contains a non-cleavable amino-terminal targeting signal.
15496	The protein encoded by this gene plays a key role in the acute regulation of steroid hormone synthesis by enhancing the conversion of cholesterol into pregnenolone. This protein permits the cleavage of cholesterol into pregnenolone by mediating the transport of cholesterol from the outer mitochondrial membrane to the inner mitochondrial membrane. Mutations in this gene are a cause of congenital lipoid adrenal hyperplasia (CLAH), also called lipoid CAH. A pseudogene of this gene is located on chromosome 13. While several transcript variants may exist for this gene, the full-length natures of only two have been described to date. These two represent the major variants of this gene and encode distinct isoforms.
15497	CNDP2, also known as tissue carnosinase and peptidase A (EC 3.4.13.18), is a nonspecific dipeptidase rather than a selective carnosinase (Teufel et al., 2003).[supplied by OMIM]
15498	
15499	
15500	
15501	The protein encoded by this gene mediates transcriptional control by interaction with the Kruppel-associated box repression domain found in many transcription factors. The protein localizes to the nucleus and is thought to associate with specific chromatin regions. The protein is a member of the tripartite motif family. This tripartite motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region.
15502	
15503	The protein encoded by this gene is a component of the core SMN complex, which is required for pre-mRNA splicing in the nucleus. The encoded protein is found in the nucleoplasm, in nuclear @#$%!gems@#$%! (Gemini of Cajal bodies), and in the cytoplasm. Three transcript variants encoding the same protein have been found for this gene.
15504	
15505	The protein encoded by this gene is a member of the INK4 family of cyclin-dependent kinase inhibitors. This protein has been shown to form a stable complex with CDK4 or CDK6, and prevent the activation of the CDK kinases, thus function as a cell growth regulator that controls cell cycle G1 progression. The abundance of the transcript of this gene was found to oscillate in a cell-cycle dependent manner with the lowest expression at mid G1 and a maximal expression during S phase. The negative regulation of the cell cycle involved in this protein was shown to participate in repressing neuronal proliferation, as well as spermatogenesis. Two alternatively spliced variants of this gene, which encode an identical protein, have been reported.
15506	
15507	
15508	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and is known to dehydrogenate 3-methylindole, an endogenous toxin derived from the fermentation of tryptophan, as well as xenobiotic substrates such as naphthalene and ethoxycoumarin. This gene is part of a large cluster of cytochrome P450 genes from the CYP2A, CYP2B and CYP2F subfamilies on chromosome 19q.
15509	
15510	
15511	
15512	
15513	
15514	The protein encoded by this gene belongs to the acyl-CoA dehydrogenase family. It catalyzes the oxidative decarboxylation of glutaryl-CoA to crotonyl-CoA and CO(2) in the degradative pathway of L-lysine, L-hydroxylysine, and L-tryptophan metabolism. It uses electron transfer flavoprotein as its electron acceptor. The enzyme exists in the mitochondrial matrix as a homotetramer of 45-kD subunits. Alternatively spliced transcript variants encoding different isoforms have been identified.
15515	
15516	The LX receptors (LXRs) were originally identified as orphan members of the nuclear receptor superfamily because their ligands were unknown. Like other receptors in the family, LXRs heterodimerize with retinoid X receptor (see MIM 180245) and bind to specific response elements (LXREs) characterized by direct repeats separated by 4 nucleotides. Two genes, alpha (LXRA, MIM 602423) and beta, are known to encode LXR proteins (Song et al., 1995).[supplied by OMIM]
15517	The activation peptides of aspartic proteinases plays role as inhibitors of the active site. These peptide segments, or pro-parts, are deemed important for correct folding, targeting, and control of the activation of aspartic proteinase zymogens. The pronapsin A gene is expressed predominantly in lung and kidney. Its translation product is predicted to be a fully functional, glycosylated aspartic proteinase precursor containing an RGD motif and an additional 18 residues at its C-terminus.
15518	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L13P family of ribosomal proteins. It is located in the cytoplasm. Transcript variants utilizing alternative polyA signals have been observed. This gene is co-transcribed with the small nucleolar RNA genes U32, U33, U34, and U35, which are located in its second, fourth, fifth, and sixth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
15519	
15520	
15521	This gene is a member of a group of genes whose transcript levels are increased following stressful growth arrest conditions and treatment with DNA-damaging agents. The induction of this gene by ionizing radiation occurs in certain cell lines regardless of p53 status, and its protein response is correlated with apoptosis following ionizing radiation.
15522	
15523	
15524	
15525	
15526	
15527	
15528	
15529	
15530	DYRK1B is a member of the DYRK family of protein kinases.  DYRK1B contains a bipartite nuclear localization signal and is found mainly in muscle and testis.  The protein is proposed to be involved in the regulation of nuclear functions.  Three isoforms of DYRK1B have been identified differing in the presence of two alternatively spliced exons within the catalytic domain.
15531	TRIF, or TICAM1, is a Toll/IL1R (MIM 147810) (TIR) domain-containing adaptor molecule, like MYD88 (MIM 602170) and TIRAP (MIM 606252), that induces interferon-beta (IFNB; MIM 147640), specifically interacts with TLR3 (MIM 603029), and activates nuclear factor kappa-B (NFKB; see MIM 164011).[supplied by OMIM]
15532	
15533	
15534	This gene encodes subunit 2 of the splicing factor 3a protein complex. The splicing factor 3a heterotrimer includes subunits 1, 2 and 3 and is necessary for the in vitro conversion of 15S U2 snRNP into an active 17S particle that performs pre-mRNA splicing. Subunit 2 interacts with subunit 1 through its amino-terminus while the single zinc finger domain of subunit 2 plays a role in its binding to the 15S U2 snRNP. Subunit 2 may also function independently of its RNA splicing function as a microtubule-binding protein.
15535	
15536	Neurturin is a member of the TGF-beta subfamily, TRN.  This gene signals through RET and a GPI-linked coreceptor, and promotes survival of neuronal populations.  A neurturin mutation has been described in a family with Hirschsprung Disease.
15537	
15538	
15539	
15540	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L18AE family of ribosomal proteins. It is located in the cytoplasm. This gene is co-transcribed with the U68 snoRNA, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
15541	
15542	This gene encodes a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes which undergo proteolytic processing at conserved aspartic residues to produce 2 subunits, large and small, that dimerize to form the active enzyme. This caspase has been shown to be processed and activated by caspase 8 and caspase 10 in vitro, and by anti-Fas agonist antibody or TNF-related apoptosis inducing ligand in vivo. The expression and processing of this caspase may be involved in the keratinocyte terminal differentiation, which is important for the formation of the skin barrier.
15543	This gene encodes a homeodomain protein with similarity to a transcription factor involved in translocations in pre-B-cell leukemias. The presence of a homeobox domain suggests that this protein acts as a transcription factor, but this function has not been demonstrated.
15544	The protein encoded by this gene is a noncollagenous extracellular matrix (ECM) protein. It consists of five identical glycoprotein subunits, each with EGF-like and calcium-binding (thrombospondin-like) domains. Oligomerization results from formation of a five-stranded coiled coil and disulfides. Binding to other ECM proteins such as collagen appears to depend on divalent cations. Mutations can cause the osteochondrodysplasias pseudochondroplasia (PSACH) and multiple epiphyseal dysplasia (MED).
15545	
15546	
15547	Sm-like proteins were identified in a variety of organisms based on sequence homology with the Sm protein family (see SNRPD2; MIM 601061). Sm-like proteins contain the Sm sequence motif, which consists of 2 regions separated by a linker of variable length that folds as a loop. The Sm-like proteins are thought to form a stable heteromer present in tri-snRNP particles, which are important for pre-mRNA splicing.[supplied by OMIM]
15548	
15549	
15550	
15551	
15552	
15553	
15554	
15555	Heparan sulfate biosynthetic enzymes are key components in generating a myriad of distinct heparan sulfate fine structures that carry out multiple biologic activities. The enzyme encoded by this gene is a member of the heparan sulfate biosynthetic enzyme family. It is a type II integral membrane protein and responsible for 6-O-sulfation of heparan sulfate. This enzyme does not share significant sequence similarity to other known sulfotransferases.
15556	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein.
15557	
15558	
15559	The protein encoded by this gene is one of the highly conserved mini-chromosome maintenance proteins (MCM) that are essential for the initiation of eukaryotic genome replication. The hexameric protein complex formed by the MCM proteins is a key component of the pre-replication complex (pre_RC) and may be involved in the formation of replication forks and in the recruitment of other DNA replication related proteins. The MCM complex consisting of this protein and MCM2, 4 and 7 proteins possesses DNA helicase activity, and may act as a DNA unwinding enzyme. The phosphorylation of the complex by CDC2 kinase reduces the helicase activity, suggesting a role in the regulation of DNA replication.
15560	The protein encoded by this gene contains 2 leucine zipper domains and a putative C-terminal nuclear targeting signal, and it does not have any hydrophobic regions. This protein is expressed weakly in resting NK and T cells.
15561	
15562	This gene encodes a protein that has four predicted transmembrane domains. The function of this gene has not yet been determined; however, studies in the mouse homolog suggest a role in the transport of small molecules across endosomal and lysosomal membranes.
15563	
15564	
15565	
15566	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and a single catalytic domain, and thus represents a receptor-type PTP. This PTP was found to be an autoantigen that is reactive with insulin-dependent diabetes mellitus (IDDM) patient sera, and thus may be a potential target of autoimmunity in diabetes mellitus.
15567	
15568	This gene is a type I subclass member of the Reg gene family. The Reg gene family is a multigene family grouped into four subclasses, types I, II, III and IV based on the primary structures of the encoded proteins. This gene encodes a protein that is secreted by the exocrine pancreas. It is associated with islet cell regeneration and diabetogenesis and may be involved in pancreatic lithogenesis. Reg family members REG1B, REGL, PAP and this gene are tandomly clustered on chromosome 2p12 and may have arisen from the same ancestral gene by gene duplication.
15569	
15570	
15571	
15572	The protein encoded by this gene is a member of the serine/threonine protein kinase family. This kinase has been shown to specifically activate MAPK8/JNK. The activation of MAPK8 by this kinase is found to be inhibited by the dominant-negative mutants of MAP3K7/TAK1, MAP2K4/MKK4, and MAP2K7/MKK7, which suggests that this kinase may function through the MAP3K7-MAP2K4-MAP2K7 kinase cascade, and mediate the TNF-alpha signaling pathway. Alternatively spliced transcript variants encoding different isoforms have been identified.
15573	The product encoded by this gene is a member of a RNA-binding protein family and possesses nucleolytic activity against cytotoxic lymphocyte (CTL) target cells. It has been suggested that this protein may be involved in the induction of apoptosis as it preferentially recognizes poly(A) homopolymers and induces DNA fragmentation in CTL targets. The major granule-associated species is a 15-kDa protein that is thought to be derived from the carboxyl terminus of the 40-kDa product by proteolytic processing. Alternative splicing resulting in different isoforms of this gene product has been described in the literature.
15574	
15575	
15576	The muscle acetylcholine receptor consiststs of 5 subunits of 4 different types: 2 alpha isoforms and 1 each of beta, gamma, and delta subunits.2 This gene encodes an alpha subunit that plays a role in acetlycholine binding/channel gating.
15577	
15578	
15579	The protein encoded by this gene is a member of the interleukin 1 cytokine family. Protein structure modeling indicated that this cytokine may contain a 12-stranded beta-trefoil structure that is conserved between IL1A (IL-A alpha) and IL1B (IL-1 beta). This gene and eight other interleukin 1 family genes form a cytokine gene cluster on chromosome 2. Two alternatively spliced transcript variants encoding distinct isoforms have been reported.
15580	
15581	
15582	
15583	
15584	
15585	
15586	
15587	This 9-exon gene encodes a carcinoma-associated antigen and is a member of a family that includes at least two type I membrane proteins. This antigen is expressed on most normal epithelial cells and gastrointestinal carcinomas and functions as a homotypic calcium-independent cell adhesion molecule. The antigen is being used as a target for immunotherapy treatment of human carcinomas.
15588	
15589	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. The gene product is an outwardly rectifying channel that is sensitive to changes in extracellular pH and is inhibited by extracellular acidification. Also referred to as an acid-sensitive potassium channel, it is activated by the anesthetics halothane and isoflurane. Although three transcripts are detected in northern blots, there is currently no sequence available to confirm transcript variants for this gene.
15590	This gene encodes a protein that specifically binds to a Pit1-binding element of the prolactin (PRL) promoter. This protein may act as a transcriptional regulator and is thought to be involved in some of the developmental abnormalities observed in patients with partial trisomy 2p. This gene overlaps the abhydrolase domain containing 1 (ABHD1) gene on the opposite strand.
15591	
15592	
15593	
15594	This gene encodes a member of the septin family of cytoskeletal proteins with GTPase activity. This protein localizes to the cytoplasm and nucleus and displays GTP-binding and GTPase activity. Alternate splicing results in two transcript variants encoding different isoforms.
15595	
15596	Chromosomal DNA of eukaryotic cells is compacted by nuclear proteins to form chromatin, an organized nucleoprotein structure that can inhibit gene expression. Several multisubunit protein complexes exist to remodel the chromatin to allow patterns of cell type-specific gene expression. The protein encoded by this gene is thought to be a core member of one or more of these complexes. The encoded protein, which is a member of the SNF2/RAD54 helicase family, contains two chromodomains, a helicase domain, and an ATPase domain.
15597	The protein encoded by this gene is a member of the cAMP-dependent protein kinase (PKA) inhibitor family. Studies of a similar protein in mice suggest that this protein acts as a potent competitive PKA inhibitor, and is a predominant form of PKA inhibitors in various tissues. Three alternatively spliced transcript variants encoding the same protein have been reported.
15598	This gene encodes a secreted inhibitor which protects epithelial tissues from serine proteases.  It is found in various secretions including seminal plasma, cervical mucus, and bronchial secretions, and has affinity for trypsin, leukocyte elastase, and cathepsin G.  Its inhibitory effect contributes to the immune response by protecting epithelial surfaces from attack by endogenous proteolytic enzymes; the protein is also thought to have broad-spectrum anti-biotic activity.
15599	
15600	The product of this gene is a member of the nuclear factors of activated T cells DNA-binding transcription complex. This complex consists of at least two components: a preexisting cytosolic component that translocates to the nucleus upon T cell receptor (TCR) stimulation and an inducible nuclear component. Other members of this family participate to form this complex also. Proteins belonging to this family of transcription factors play a central role in inducible gene transcription during the immune response. The product of this gene is a preexisting cytosolic component. It is present in the cytosolic fraction of unstimulated T cells. After T-cell activation it is found in the nucleus as a part of the DNA-binding transcription complex. Two transcript variants encoding distinct isoforms have been identified for this gene.
15601	
15602	
15603	Nop56p is a yeast nucleolar protein that is part of a complex with the nucleolar proteins Nop58p and fibrillarin. Nop56p is required for assembly of the 60S ribosomal subunit and is involved in pre-rRNA processing. The protein encoded by this gene is similar in sequence to Nop56p and is also found in the nucleolus. Multiple transcript variants encoding several different isoforms have been found for this gene, but the full-length nature of most of them have not been determined.
15604	The protein encoded by this gene contains an atypical heat shock protein 70 (Hsp70) ATPase domain and is therefore a distant member of the mammalian Hsp70 family. This gene may be involved in susceptibility to atherosclerosis.
15605	This gene encodes a protein belonging to the pantothenate kinase family. Pantothenate kinase is a key regulatory enzyme in the biosynthesis of coenzyme A (CoA) in bacteria and mammalian cells. It catalyzes the first committed step in the universal biosynthetic pathway leading to CoA and is itself subject to regulation through feedback inhibition by CoA. Mutations in this gene are associated with HARP syndrome and pantothenate kinase-associated neurodegeneration (PKAN), formerly Hallervorden-Spatz syndrome. Alternative splicing has been observed at this locus, including the use of 4 alternate first exons. Six transcript variants have been identified, encoding four distinct isoforms.
15606	The protein encoded by this gene belongs to the NCP family of cell-recognition molecules. This family represents a distinct subgroup of the neurexins. NCP proteins mediate neuron-glial interactions in vertebrates and glial-glial contact in invertebrates. The protein encoded by this gene may play a role in cell recognition within the nervous system. Alternatively spliced transcript variants encoding different isoforms have been described but their biological nature has not been determined.
15607	
15608	The RAB5 protein is a small GTPase involved in membrane trafficking in the early endocytic pathway. The protein encoded by this gene binds the GTP-bound form of the RAB5 protein preferentially over the GDP-bound form, and functions as a guanine nucleotide exchange factor for RAB5. The encoded protein is found primarily as a tetramer in the cytoplasm and does not bind other members of the RAB family.
15609	The protein encoded by this intronless gene is an endothelial-specific type I membrane receptor that binds thrombin. This binding results in the activation of protein C, which degrades clotting factors Va and VIIIa and reduces the amount of thrombin generated. Mutations in this gene are a cause of thromboembolic disease, also known as inherited thrombophilia.
15610	
15611	
15612	The protein encoded by this gene interacts with the amino-terminal domain of the neuron-specific X11-like protein (X11L), inhibits the association of X11L with amyloid precursor protein through a non-competitive mechanism, and abolishes the suppression of beta-amyloid production by X11L. This protein, together with X11L, may play an important role in the regulatory system of amyloid precursor protein metabolism and beta-amyloid generation. This gene consists of at least 13 exons and its alternative splicing generates at least 2 transcript variants.
15613	This gene encodes a nuclear protein which is involved in cell-cycle regulation. The encoded protein is a ubiquitin-ligase capable of ubiquinating PCNP (PEST-containing nuclear protein), and together they may play a role in tumorigenesis. Alternative splicing of this gene results in several transcripts; however, not all variants have been fully described.
15614	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. However, the protein encoded by this gene is a member of the membrane-type MMP (MT-MMP) subfamily; each member of this subfamily contains a potential transmembrane domain suggesting that these proteins are expressed at the cell surface rather than secreted. This protein activates MMP2 by cleavage. The gene has previously been referred to as MMP25 but has been renamed matrix metalloproteinase 24 (MMP24).
15615	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the peptidase T1A family, that is a 20S core alpha subunit. This particular subunit has been shown to interact specifically with the hepatitis B virus X protein, a protein critical to viral replication. In addition, this subunit is involved in regulating hepatitis virus C internal ribosome entry site (IRES) activity, an activity essential for viral replication. This core alpha subunit is also involved in regulating the hypoxia-inducible factor-1alpha, a transcription factor important for cellular responses to oxygen tension. Multiple isoforms of this subunit arising from alternative splicing may exist but alternative transcripts for only two isoforms have been defined. A pseudogene has been identified on chromosome 9.
15616	
15617	
15618	Tight junctions represent one mode of cell-to-cell adhesion in epithelial or endothelial cell sheets, forming continuous seals around cells and serving as a physical barrier to prevent solutes and water from passing freely through the paracellular space. These junctions are comprised of sets of continuous networking strands in the outwardly facing cytoplasmic leaflet, with complementary grooves in the inwardly facing extracytoplasmic leaflet. The protein encoded by this gene, a member of the claudin family, is an integral membrane protein and a component of tight junction strands. The encoded protein also binds specifically to the WW domain of Yes-associated protein. Defects in this gene are the cause of an autosomal recessive form of nonsyndromic sensorineural deafness. Two transcript variants encoding the same protein have been found for this gene.
15619	
15620	
15621	FTCD is a bifunctional enzyme that channels 1-carbon units from formiminoglutamate, a metabolite of the histidine degradation pathway, to the folate pool.[supplied by OMIM]
15622	Ran/TC4-binding protein, RanBP1, interacts specifically with GTP-charged RAN. RANBP1 encodes a 23-kD protein that binds to RAN complexed with GTP but not GDP. RANBP1 does not activate GTPase activity of RAN but does markedly increase GTP hydrolysis by the RanGTPase-activating protein (RanGAP1). The RANBP1 cDNA encodes a 201-amino acid protein that is 92% similar to its mouse homolog. In both mammalian cells and in yeast, RANBP1 acts as a negative regulator of RCC1 by inhibiting RCC1-stimulated guanine nucleotide release from RAN.
15623	
15624	This gene encodes a lymphokine involved in cell-mediated immunity, immunoregulation, and inflammation. It plays a role in the regulation of macrophage function in host defense through the suppression of anti-inflammatory effects of glucocorticoids. This lymphokine and the JAB1 protein form a complex in the cytosol near the peripheral plasma membrane, which may indicate an additional role in integrin signaling pathways.
15625	
15626	
15627	This gene encodes a putative RNA binding protein of unknown function.  Mutations in this gene, specifically a t(11;22)(q24;q12) translocation, are known to cause Ewing sarcoma as well as neuroectodermal and various other tumors.  Alternative splicing of this gene results in two products.
15628	This gene encodes a cytoskeletal protein which is concentrated in areas of cell-substratum and cell-cell contacts. This protein plays a significant role in the assembly of actin filaments and in spreading and migration of various cell types, including fibroblasts and osteoclasts. It codistributes with integrins in the cell surface membrane in order to assist in the attachment of adherent cells to extracellular matrices and of lymphocytes to other cells. The N-terminus of this protein contains elements for localization to cell-extracellular matrix junctions. The C-terminus contains binding sites for proteins such as beta-1-integrin, actin, and vinculin.
15629	
15630	This gene, which is one of the largest in the human genome, encodes a member of the N-acetylglucosaminyltransferase gene family. The function of this gene has not yet been established; however, it may involve a role in tumor-specific genomic rearrangements. Mutations in this gene may be involved in the development and progression of meningioma through modification of ganglioside composition and other glycosylated molecules in tumor cells. Alternative splicing of this gene results in two transcript variants that encode the same protein.
15631	Eukaryotic translation initiation factor-3 (eIF3), the largest of the eIFs, is a multiprotein complex composed of at least ten nonidentical subunits. The complex binds to the 40S ribosome and helps maintain the 40S and 60S ribosomal subunits in a dissociated state. It is also thought to play a role in the formation of the 40S initiation complex by interacting with the ternary complex of eIF2/GTP/methionyl-tRNA, and by promoting mRNA binding. The protein encoded by this gene is the major RNA binding subunit of the eIF3 complex.
15632	Eukaryotic translation initiation factor-3 (eIF3), the largest of the eIFs, is a multiprotein complex composed of at least ten nonidentical subunits. The complex binds to the 40S ribosome and helps maintain the 40S and 60S ribosomal subunits in a dissociated state. It is also thought to play a role in the formation of the 40S initiation complex by interacting with the ternary complex of eIF2/GTP/methionyl-tRNA, and by promoting mRNA binding. The protein encoded by this gene is the major RNA binding subunit of the eIF3 complex.
15633	The protein encoded by this gene contains a PDZ domain, through which it interacts with protein kinase C, alpha (PRKCA). This protein may function as an adaptor that binds to and organizes the subcellular localization of a variety of membrane proteins. It has been shown to interact with multiple glutamate receptor subtypes, monoamine plasma membrane transporters, as well as non-voltage gated sodium channels, and may target PRKCA to these membrane proteins and thus regulate their distribution and function. This protein has also been found to act as an anchoring protein that specifically targets PRKCA to mitochondria in a ligand-specific manner.
15634	There are believed to be over 100 different glycosyltransferases involved in the synthesis of protein-bound and lipid-bound oligosaccharides. N-acetylglucosaminyltransferase III transfers a GlcNAc residue to the beta-linked mannose of the trimannosyl core of N-linked oligosaccharides and produces a bisecting GlcNAc. Expression of this gene may be controlled by a multiple-promoter system.
15635	Thyrotroph embryonic factor (TEF), a transcription factor, is a member of the PAR (proline and acidic amino acid-rich) subfamily of basic region/leucine zipper (bZIP) transcription factors.  It is expressed in a broad range of cells and tissues in adult animals, however, during embryonic development, TEF expression appears to be restricted to the developing anterior pituitary gland, coincident with the appearance of thyroid-stimulating hormone, beta (TSHB).  Indeed, TEF can bind to, and transactivate the TSHB promoter.  It shows homology (in the functional domains) with other members of the PAR-bZIP subfamily of transcription factors, which include albumin D box-binding protein (DBP), human hepatic leukemia factor (HLF) and chicken vitellogenin gene-binding protein (VBP); VBP is considered the chicken homologue of TEF.  Different members of the subfamily can readily form heterodimers, and share DNA-binding, and transcriptional regulatory properties.
15636	B cell-activating factor (BAFF) enhances B-cell survival in vitro and is a regulator of the peripheral B-cell population. Overexpression of Baff in mice results in mature B-cell hyperplasia and symptoms of systemic lupus erythematosus (SLE). Also, some SLE patients have increased levels of BAFF in serum. Therefore, it has been proposed that abnormally high levels of BAFF may contribute to the pathogenesis of autoimmune diseases by enhancing the survival of autoreactive B cells. The protein encoded by this gene is a receptor for BAFF and is a type III transmembrane protein containing a single extracellular cysteine-rich domain. It is thought that this receptor is the principal receptor required for BAFF-mediated mature B-cell survival.
15637	
15638	This gene encodes a protein with a proline-rich domain. This gene is located in a region of chromosome 22 reported to contain a tumor suppressor gene that may be involved in breast and colorectal tumorigenesis. Rare read-through transcripts, containing exons from the ARHGAP8 gene which is located immediately downstream, led to the original description of PRR5 and ARHGAP8 as a single gene. Alternative splicing and the use of alternative promoters results in transcripts encoding different isoforms.
15639	Platelet-derived endothelial cell growth factor (ECGF1) is an angiogenic factor which promotes angiogenesis in vivo and stimulates the in vitro growth of a variety of endothelial cells. ECGF1 has a highly restricted target cell specificity acting only on endothelial cells. Because it limits glial cell proliferation, ECGF1 is also known as thymidine phosphorylase and as gliostatin. The ECGF1 gene contains 10 exons spanning more than 4.3 kb. Thymidine phosphorylase activity of ECGF1 in leukocytes from mitochondrial neurogastrointestinal encephalomyopathy (MNGIE) patients was less than 5 percent of controls, indicating that loss-of-function mutations in thymidine phosphorylase cause MNGIE.
15640	
15641	
15642	
15643	
15644	
15645	The protein encoded by this gene is a member of a family of actin-related proteins (ARPs) which share significant amino acid sequence identity to conventional actins. Both actins and ARPs have an actin fold, which is an ATP-binding cleft, as a common feature. The ARPs are involved in diverse cellular processes, including vesicular transport, spindle orientation, nuclear migration and chromatin remodeling. This gene (ACTL7B), and related gene, ACTL7A, are intronless, and are located approximately 4 kb apart in a head-to-head orientation within the familial dysautonomia candidate region on 9q31. Based on mutational analysis of the ACTL7B gene in patients with this disorder, it was concluded that it is unlikely to be involved in the pathogenesis of dysautonomia. Unlike ACTL7A, the ACTL7B gene is expressed predominantly in the testis, however, its exact function is not known.
15646	
15647	The Golgi coatomer complex (see MIM 601924) constitutes the coat of nonclathrin-coated vesicles and is essential for Golgi budding and vesicular trafficking. It consists of 7 protein subunits, including COPB2.[supplied by OMIM]
15648	
15649	
15650	
15651	ROD1 is a functional homolog of nrd1, an S. pombe RNA-binding protein that suppresses the onset of differentiation.[supplied by OMIM]
15652	
15653	This gene encodes a subunit of a cytokine that acts on T and natural killer cells, and has a broad array of biological activities. The cytokine is a disulfide-linked heterodimer composed of the 35-kD subunit encoded by this gene, and a 40-kD subunit that is a member of the cytokine receptor family. This cytokine is required for the T-cell-independent induction of interferon (IFN)-gamma, and is important for the differentiation of both Th1 and Th2 cells. The responses of lymphocytes to this cytokine are mediated by the activator of transcription protein STAT4. Nitric oxide synthase 2A (NOS2A/NOS2) is found to be required for the signaling process of this cytokine in innate immunity.
15654	This gene encodes a beta chemokine receptor, which is predicted to be a seven transmembrane protein similar to G protein-coupled receptors. Chemokines and their receptor-mediated signal transduction are critical for the recruitment of effector immune cells to the inflammation site. This gene is expressed in a range of tissues and hemopoietic cells. The expression of this receptor in lymphatic endothelial cells and overexpression in vascular tumors suggested its function in chemokine-driven recirculation of leukocytes and possible chemokine effects on the development and growth of vascular tumors. This receptor appears to bind the majority of beta-chemokine family members; however, its specific function remains unknown. This gene is mapped to chromosome 3p21.3, a region that includes a cluster of chemokine receptor genes.
15655	The protein encoded by this gene is member of the GAP1 family of GTPase-activating proteins. The gene product stimulates the GTPase activity of normal RAS p21 but not its oncogenic counterpart. Acting as a suppressor of RAS function, the protein enhances the weak intrinsic GTPase activity of RAS proteins resulting in the inactive GDP-bound form of RAS, thereby allowing control of cellular proliferation and differentiation. This particular family member has a perinuclear localization and is an inositol 1,3,4,5-tetrakisphosphate-binding protein; a compound suggested to function as a second messenger.
15656	
15657	
15658	tRNA splicing is a fundamental process required for cell growth and division. SEN2 is a subunit of the tRNA splicing endonuclease, which catalyzes the removal of introns, the first step in tRNA splicing (Paushkin et al., 2004).[supplied by OMIM]
15659	
15660	
15661	
15662	
15663	
15664	The protein encoded by this gene belongs to the PARP family. These enzymes modify nuclear proteins by poly-ADP-ribosylation, which is required for DNA repair, regulation of apoptosis, and maintenance of genomic stability. This gene encodes the poly(ADP-ribosyl)transferase 3, which is preferentially localized to the daughter centriole throughout the cell cycle. Alternatively spliced transcript variants encoding different isoforms have been identified.
15665	
15666	The protein encoded by this gene is a member of the flamingo subfamily, part of the cadherin superfamily. The flamingo subfamily consists of nonclassic-type cadherins; a subpopulation that does not interact with catenins. The flamingo cadherins are located at the plasma membrane and have nine cadherin domains, seven epidermal growth factor-like repeats and two laminin A G-type repeats in their ectodomain. They also have seven transmembrane domains, a characteristic unique to this subfamily. It is postulated that these proteins are receptors involved in contact-mediated communication, with cadherin domains acting as homophilic binding regions and the EGF-like domains involved in cell adhesion and receptor-ligand interactions. The specific function of this particular member has not been determined.
15667	
15668	
15669	
15670	This gene encodes an enzyme that functions in the biosynthetic pathways of sialic acids. In vitro, the encoded protein uses N-acetylmannosamine 6-phosphate and mannose 6-phosphate as substrates to generate phosphorylated forms of N-acetylneuraminic acid (Neu5Ac) and 2-keto-3-deoxy-D-glycero-D-galacto-nononic acid (KDN), respectively; however, it exhibits much higher activity toward the Neu5Ac phosphate product. In insect cells, expression of this gene results in Neu5Ac and KDN production. This gene is related to the E. coli sialic acid synthase gene neuB, and it can partially restore sialic acid synthase activity in an E. coli neuB-negative mutant.
15671	
15672	
15673	This intronless gene encodes a protein containing several WD40 repeats. WD repeats are approximately 30- to 40-amino acid domains containing several conserved residues, including a trp-asp at the C-terminal end. The encoded protein may mediate protein-protein interactions.
15674	
15675	This gene is thought to encode a GDP-mannose pyrophosphorylase. This enzyme catalyzes the reaction which converts mannose-1-phosphate and GTP to GDP-mannose which is involved in the production of N-linked oligosaccharides. The gene encodes two transcript variants.
15676	
15677	
15678	
15679	
15680	
15681	
15682	
15683	This gene is a member of the beta-1,6-N-acetylglucosaminyltransferase gene family.  It is essential to the formation of Gal beta 1-3(GlcNAc beta 1-6)GalNAc structures and the core 2 O-glycan branch.  The gene coding this enzyme was originally mapped to 9q21, but was later localized to 9q13 by another group.
15684	The nuclear envelope creates distinct nuclear and cytoplasmic compartments in eukaryotic cells. It consists of two concentric membranes perforated by nuclear pores, large protein complexes that form aqueous channels to regulate the flow of macromolecules between the nucleus and the cytoplasm. These complexes are composed of at least 100 different polypeptide subunits, many of which belong to the nucleoporin family. This gene encodes a member of the phe-gly (FG) repeat-containing nucleoporin subset.
15685	The protein encoded by this gene is a member of the fibroblast growth factor receptor (FGFR) family, where amino acid sequence is highly conserved between members and throughout evolution. FGFR family members differ from one another in their ligand affinities and tissue distribution. A full-length representative protein would consist of an extracellular region, composed of three immunoglobulin-like domains, a single hydrophobic membrane-spanning segment and a cytoplasmic tyrosine kinase domain. The extracellular portion of the protein interacts with fibroblast growth factors, setting in motion a cascade of downstream signals, ultimately influencing mitogenesis and differentiation. A marked difference between this gene product and the other family members is its lack of a cytoplasmic tyrosine kinase domain. The result is a transmembrane receptor that could interact with other family members and potentially inhibit signaling. Multiple alternatively spliced transcript variants encoding the same isoform have been found for this gene.
15686	
15687	
15688	This gene encodes a protein which is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes which undergo proteolytic processing at conserved aspartic residues to produce 2 subunits, large and small, that dimerize to form the active enzyme. This protein was shown to cleave and activate caspases 6, 7 and 9, and itself could be processed by caspases 8, 9 and 10. It is the predominant caspase involved in the cleavage of amyloid-beta 4A precursor protein, which is associated with neuronal death in Alzheimer@#$%&s disease. Alternative splicing of this gene results in two transcript variants which encode the same protein.
15689	
15690	MAD2L1 is a component of the mitotic spindle assembly checkpoint that prevents the onset of anaphase until all chromosomes are properly aligned at the metaphase plate.  MAD2L1 is related to the MAD2L2 gene located on chromosome 1.  A MAD2 pseudogene has been mapped to chromosome 14.
15691	
15692	
15693	
15694	The protein encoded by this gene is a member of the Toll-like receptor (TLR) family which plays a fundamental role in pathogen recognition and activation of innate immunity. TLRs are highly conserved from Drosophila to humans and share structural and functional similarities. They recognize pathogen-associated molecular patterns (PAMPs) that are expressed on infectious agents, and mediate the production of cytokines necessary for the development of effective immunity. The various TLRs exhibit different patterns of expression. This gene is expressed most abundantly in peripheral blood leukocytes, and mediates host response to Gram-positive bacteria and yeast via stimulation of NF-kappaB.
15695	
15696	
15697	
15698	
15699	This gene encodes class IV alcohol dehydrogenase 7 mu or sigma subunit, which is a member of the alcohol dehydrogenase family. Members of this family metabolize a wide variety of substrates, including ethanol, retinol, other aliphatic alcohols, hydroxysteroids, and lipid peroxidation products. The enzyme encoded by this gene is inefficient in ethanol oxidation, but is the most active as a retinol dehydrogenase; thus it may participate in the synthesis of retinoic acid, a hormone important for cellular differentiation. The expression of this gene is much more abundant in stomach than liver, thus differing from the other known gene family members.
15700	
15701	
15702	
15703	
15704	HEL308 is a single-stranded DNA-dependent ATPase and DNA helicase.[supplied by OMIM]
15705	
15706	
15707	This gene encodes a protein with multiple ankyrin repeat domains and a single KH-domain. Co-transcription of this gene and the neighboring downstream gene (EIF4EBP3) generates a transcript (MASK-BP3) which encodes a fusion protein comprised of the MASK protein sequence for the majority of the protein and a different C-terminus due to an alternate reading frame for the EIF4EBP3 segments.
15708	Located between the protocadherin beta and gamma gene clusters on chromosome 5, this intronless gene encodes a protein that is highly similar to an ornithine transporter localized in the mitochondrial inner membrane. Although its particular function is unknown, it most likely plays a role in metabolism as a mitochondrial transport protein.
15709	DAP gene encodes a basic, proline-rich, 15-kD protein.  Death-associated protein acts as a positive mediator of programmed cell death that is induced by interferon-gamma.
15710	RHOBTB3 is a member of the evolutionarily conserved RHOBTB subfamily of Rho GTPases. For background information on RHOBTBs, see RHOBTB1 (MIM 607351).[supplied by OMIM]
15711	
15712	
15713	The protein encoded by this gene is a member of the fibroblast growth factor receptor family, where amino acid sequence is highly conserved between members and throughout evolution. FGFR family members differ from one another in their ligand affinities and tissue distribution. A full-length representative protein would consist of an extracellular region, composed of three immunoglobulin-like domains, a single hydrophobic membrane-spanning segment and a cytoplasmic tyrosine kinase domain. The extracellular portion of the protein interacts with fibroblast growth factors, setting in motion a cascade of downstream signals, ultimately influencing mitogenesis and differentiation. The genomic organization of this gene, compared to members 1-3, encompasses 18 exons rather than 19 or 20. Although alternative splicing has been observed, there is no evidence that the C-terminal half of the IgIII domain of this protein varies between three alternate forms, as indicated for members 1-3. This particular family member preferentially binds acidic fibroblast growth factor and, although its specific function is unknown, it is overexpressed in gynecological tumor samples, suggesting a role in breast and ovarian tumorigenesis.
15714	
15715	
15716	
15717	
15718	
15719	
15720	UNG2 is a glycosylase that removes uracil from both double- and single-stranded DNA in nuclear chromatin, thus contributing to base excision repair.[supplied by OMIM]
15721	This gene is a member of the regulatory factor X gene family, which encodes transcription factors that contain a highly-conserved winged helix DNA binding domain. The protein encoded by this gene is structurally related to regulatory factors X1, X2, X4, and X5. It is a transcriptional activator that can bind DNA as a monomer or as a heterodimer with other RFX family members. Two transcript variants encoding different isoforms have been described for this gene, and at least one of the variants utilizes alternative polyadenylation signals.
15722	
15723	
15724	
15725	
15726	
15727	Tight junction proteins (TJPs) belong to a family of membrane-associated guanylate kinase (MAGUK) homologs that are involved in the organization of epithelial and endothelial intercellular junctions. TJPs bind to the cytoplasmic C termini of junctional transmembrane proteins and link them to the actin cytoskeleton.[supplied by OMIM]
15728	
15729	
15730	Eukaryotic molybdoenzymes use a unique molybdenum cofactor (MoCo) consisting of a pterin, termed molybdopterin, and the catalytically active metal molybdenum. MoCo is synthesized from precursor Z by the heterodimeric enzyme molybdopterin synthase. A bicistronic transcript encodes both the large and small subunits of molybdopterin synthase in two overlapping ORFs. The downstream ORF is thought to be translated by leaky scanning by ribosomes. A monocistronic transcript, in which only the downstream ORF is complete, also exists. Mutations in both ORFs have been associated with molybdenum cofactor deficiency B.
15731	
15732	Homocysteine is a sulfur-containing amino acid that plays a crucial role in methylation reactions. Transfer of the methyl group from betaine to homocysteine creates methionine, which donates the methyl group to methylate DNA, proteins, lipids, and other intracellular metabolites. The protein encoded by this gene is one of two methyl transferases that can catalyze the transfer of the methyl group from betaine to homocysteine. Anomalies in homocysteine metabolism have been implicated in disorders ranging from vascular disease to neural tube birth defects such as spina bifida.
15733	
15734	This gene encodes a type II classical cadherin from the cadherin superfamily, integral membrane proteins that mediate calcium-dependent cell-cell adhesion. Mature cadherin proteins are composed of a large N-terminal extracellular domain, a single membrane-spanning domain, and a small, highly conserved C-terminal cytoplasmic domain. The extracellular domain consists of 5 subdomains, each containing a cadherin motif, and appears to determine the specificity of the protein@#$%&s homophilic cell adhesion activity. Type II (atypical) cadherins are defined based on their lack of a HAV cell adhesion recognition sequence specific to type I cadherins.
15735	APH1 is a multipass transmembrane protein that interacts with presenilin (see PSEN1; MIM 104311) and nicastrin (APH2; MIM 605254) as a functional component of the gamma-secretase complex. The gamma-secretase complex is required for the intramembrane proteolysis of a number of membrane proteins, including the amyloid-beta precursor protein (APP; MIM 104760) and Notch (MIM 190198).[supplied by OMIM]
15736	
15737	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth, and invasion. It has been shown in vitro that this protein is able to induce neurite outgrowth in PC12 cells. Studies of the similar proteins in mouse and chick suggested that this protein is a pleiotropic growth factor that stimulates proliferation in a number of tissues, most notably the liver and small intestine. Knockout studies of the similar gene in mice implied the role of this protein in regulating proliferation and differentiation of midline cerebellar structures. Two alternatively spliced variants, which encode the identical isoform, are reported.
15738	
15739	The protein encoded by this gene is a component of the exocyst complex, a multiple protein complex essential for targeting exocytic vesicles to specific docking sites on the plasma membrane. Though best characterized in yeast, the component proteins and functions of exocyst complex have been demonstrated to be highly conserved in higher eukaryotes. At least eight components of the exocyst complex, including this protein, are found to interact with the v-ral simian leukemia viral oncogene homolog A (RALA), a Ras-related small GTPase involved in controlling actin cytoskeletal remodeling and vesicle transport. The protein complex containing the similar protein in rat was reported to be involved in synaptic vesicle docking and fusion at the presynaptic terminal.
15740	
15741	
15742	GABA is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA receptors, which are ligand-gated chloride channels. GABRR2 is a member of the rho subunit family
15743	
15744	
15745	
15746	
15747	The protein encoded by this gene is involved in the sodium-independent transport and excretion of organic anions, some of which are potentially toxic. The encoded protein is an integral membrane protein and appears to be localized to the basolateral membrane of the kidney. Two transcript variants encoding different isoforms have been found for this gene.
15748	The protein encoded by this gene is involved in the sodium-independent transport and excretion of organic anions, some of which are potentially toxic. The encoded protein is an integral membrane protein and appears to be localized to the basolateral membrane of the kidney. Two transcript variants encoding different isoforms have been found for this gene.
15749	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The function of this protein has not been identified.
15750	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. This structure consists of approximately 146 bp of DNA wrapped around a nucleosome, an octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H3 family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6p22-p21.3.
15751	This gene encodes a member of the Kruppel family of zinc finger proteins. Members of this DNA-binding protein family act as transcriptional regulators. This gene is located within a cluster of zinc finger family members. The encoded protein may play a role in spermatogenesis.
15752	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H3 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3.
15753	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
15754	
15755	Receptor tyrosine kinases (RTKs) play a key role in the communication of cells with their microenvironment. These molecules are involved in the regulation of cell growth, differentiation and metabolism. The protein encoded by this gene is a RTK that is widely expressed in normal and transformed epithelial cells and is activated by various types of collagen. This protein belongs to a subfamily of tyrosine kinase receptors with a homology region to the Dictyostelium discoideum protein discoidin I in their extracellular domain. Its autophosphorylation is achieved by all collagens so far tested (type I to type VI). In situ studies and Northern-blot analysis showed that expression of this encoded protein is restricted to epithelial cells, particularly in the kidney, lung, gastrointestinal tract, and brain. In addition, this protein is significantly over-expressed in several human tumors from breast, ovarian, esophageal, and pediatric brain. This gene is located on chromosome 6p21.3 in proximity to several HLA class I genes. Three isoforms of this gene are generated by alternative splicing. [PROW]
15756	
15757	Lymphotoxin beta is a type II membrane protein of the TNF family. It anchors lymphotoxin-alpha to the cell surface through heterotrimer formation.  The predominant form on the lymphocyte surface is the lymphotoxin-alpha 1/beta 2 complex (e.g. 1 molecule alpha/2 molecules beta) and this complex is the primary ligand for the lymphotoxin-beta receptor.  The minor complex is lymphotoxin-alpha 2/beta 1.  LTB is an inducer of the inflammatory response system and involved in normal development of lymphoid tissue.  Lymphotoxin-beta isoform b is unable to complex with lymphotoxin-alpha suggesting a function for lymphotoxin-beta which is independent of lympyhotoxin-alpha.
15758	This gene encodes a member of the immunoglobulin superfamily of cell surface molecules. It is a receptor for various molecules, including the amyloidogenic form of serum amyloid A, amyloid-beta protein, members of the S100/calgranulin superfamily and advanced glycation end products. The gene lies within the major histocompatibility complex (MHC) class III region on chromosome 6. Alternative splicing results in two transcript variants encoding different isoforms.
15759	
15760	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
15761	
15762	
15763	
15764	Branched-chain keto acid dehydrogenase is a multienzyme complex associated with the inner membrane of mitochondria, and functions in the catabolism of branched-chain amino acids. The complex consists of multiple copies of 3 components: branched-chain alpha-keto acid decarboxylase (E1), lipoamide acyltransferase (E2) and lipoamide dehydrogenase (E3). This gene encodes the E1 beta subunit, and mutations therein have been associated with maple syrup urine disease (MSUD), type 1B. Alternative splicing at this locus results in transcript variants with different 3@#$%& noncoding regions, but encoding the same isoform.
15765	This gene encodes a subunit of the N-methyl-D-aspartate (NMDA) receptors, which belong to the superfamily of glutamate-regulated ion channels, and function in physiological and pathological processes in the central nervous system. This subunit shows greater than 90% identity to the corresponding subunit in rat. Studies in the knockout mouse deficient in this subunit suggest that this gene may be involved in the development of synaptic elements by modulating NMDA receptor activity.
15766	The transcription of genes from mitochondrial DNA requires a mitochondrial RNA polymerase (see POLRMT, MIM 601778) and a DNA-binding transcription factor (see TFAM, MIM 600438). Transcription factor B1 (TFB1M) is a part of this transcription complex.[supplied by OMIM]
15767	
15768	
15769	
15770	
15771	
15772	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit in the proteasome. Expression of this catalytic subunit is downregulated by gamma interferon and proteolytic processing is required to generate a mature subunit. This subunit is not present in the immunoproteasome and is replaced by catalytic subunit 2i (proteasome beta 10 subunit).
15773	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is the V1 domain F subunit protein.
15774	The protein encoded by this gene mediates transcriptional control by interaction with the activation function 2 (AF2) region of several nuclear receptors, including the estrogen, retinoic acid, and vitamin D3 receptors. The protein localizes to nuclear bodies and is thought to associate with chromatin and heterochromatin-associated factors. The protein is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains - a RING, a B-box type 1 and a B-box type 2 - and a coiled-coil region. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
15775	This gene encodes a member of the aldo/keto reductase superfamily, which consists of more than 40 known enzymes and proteins. This member can efficiently reduce aliphatic and aromatic aldehydes, and it is less active on hexoses. It is highly expressed in adrenal gland, small intestine, and colon, and may play an important role in liver carcinogenesis.
15776	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
15777	
15778	The protein encoded by this gene is one of the highly conserved mini-chromosome maintenance proteins (MCM) that are essential for the initiation of eukaryotic genome replication. The hexameric protein complex formed by the MCM proteins is a key component of the pre-replication complex (pre_RC) and may be involved in the formation of replication forks and in the recruitment of other DNA replication related proteins. The MCM complex consisting of this protein and MCM2, 4 and 6 proteins possesses DNA helicase activity, and may act as a DNA unwinding enzyme. Cyclin D1-dependent kinase, CDK4, is found to associate with this protein, and may regulate the binding of this protein with the tumorsuppressor protein RB1/RB. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
15779	
15780	
15781	
15782	
15783	
15784	The protein encoded by this gene is a type II transmembrane protein and is similar to two families of cell adhesion molecules, the protocadherins and ependymins. This protein may play a role in calcium dependent cell adhesion.
15785	Sm-like proteins were identified in a variety of organisms based on sequence homology with the Sm protein family (see SNRPD2; MIM 601061). Sm-like proteins contain the Sm sequence motif, which consists of 2 regions separated by a linker of variable length that folds as a loop. The Sm-like proteins are thought to form a stable heteromer present in tri-snRNP particles, which are important for pre-mRNA splicing.[supplied by OMIM]
15786	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. Methylation of this gene may result in the loss of its expression and, since the encoded protein upregulates the tumor suppressor p53, this protein may play an important role in tumorigenesis.
15787	Hepatocyte growth factor regulates cell growth, cell motility, and morphogenesis by activating a tyrosine kinase signaling cascade after binding to the proto-oncogenic c-Met receptor. Hepatocyte growth factor is secreted by mesenchymal cells and acts as a multi-functional cytokine on cells of mainly epithelial origin. Its ability to stimulate mitogenesis, cell motility, and matrix invasion gives it a central role in angiogenesis, tumorogenesis, and tissue regeneration. It is secreted as a single inactive polypeptide and is cleaved by serine proteases into a 69-kDa alpha-chain and 34-kDa beta-chain. A disulfide bond between the alpha and beta chains produces the active, heterodimeric molecule. The protein belongs to the plasminogen subfamily of S1 peptidases but has no detectable protease activity. Alternative splicing of this gene produces multiple transcript variants encoding different isoforms.
15788	Hepatocyte growth factor regulates cell growth, cell motility, and morphogenesis by activating a tyrosine kinase signaling cascade after binding to the proto-oncogenic c-Met receptor. Hepatocyte growth factor is secreted by mesenchymal cells and acts as a multi-functional cytokine on cells of mainly epithelial origin. Its ability to stimulate mitogenesis, cell motility, and matrix invasion gives it a central role in angiogenesis, tumorogenesis, and tissue regeneration. It is secreted as a single inactive polypeptide and is cleaved by serine proteases into a 69-kDa alpha-chain and 34-kDa beta-chain. A disulfide bond between the alpha and beta chains produces the active, heterodimeric molecule. The protein belongs to the plasminogen subfamily of S1 peptidases but has no detectable protease activity. Alternative splicing of this gene produces multiple transcript variants encoding different isoforms.
15789	This gene is one of the PMS2 gene family members which are found in clusters on chromosome 7. The product of this gene is involved in DNA mismatch repair. The protein forms a heterodimer with MLH1 and this complex interacts with MSH2 bound to mismatched bases. Defects in this gene are associated with hereditary nonpolyposis colorectal cancer, with a Turcot syndrome, and are a cause of supratentorial primitive neuroectodermal tumors. Alternatively spliced transcript variants encoding different isoforms have been identified.
15790	
15791	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
15792	TOM7 is a small regulatory component of the translocase of the outer mitochondrial membrane (TOM), a general import pore complex that translocates preproteins into mitochondria.[supplied by OMIM]
15793	This gene is a human homolog of rat DLP11 and a member of the dynein heavy chain family. It is a microtubule-dependent motor ATPase and has been reported to be involved in the movement of respiratory cilia. Mutations in this gene have been implicated in causing Kartagener Syndrome (a combination of situs inversus totalis and Primary Ciliary Dyskinesia (PCD), also called Immotile Cilia Syndrome 1 (ICS1)) and male sterility.
15794	
15795	
15796	
15797	
15798	
15799	
15800	Calcium-permeable channels, such as TRPV5, participate in neurotransmission, muscle contraction, and exocytosis by providing calcium as an intracellular second messenger. Depending on the tissue, transcellular calcium transport may be regulated by vitamin D, parathyroid hormone (PTH; MIM 168450), or calcitonin (CALCA; MIM 114130).[supplied by OMIM]
15801	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form complex with G proteins and stimulate Rho-dependent signals. This protein may be involved in the control of cytoskeletal organization. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
15802	
15803	
15804	This gene encodes a hematopoeitin/interferon-class receptor belonging to the cytokine superfamily of receptors. The encoded gene product represents the CNTF-specific alpha subunit of a heterotrimer forming the CNTF receptor complex, which also includes LIFR and gp130. The receptor is attached to the membrane by a glycosyl-phosphatidylinositol linkage and contains an immunoglobulin-like C2-type domain and a fibronectin type-III domain. Signal transduction requires that CNTF bind first to this alpha component, which permits the recruitment of gp130 and LIFR beta to form the tripartite receptor complex. Signal transduction stimulates gene expression, cell survival or differentiation in a variety of neuronal cell types. Alternative splicing has been observed at this locus and two variants, both encoding the same protein, have been identified.
15805	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes.  They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis.  Laminins are composed of 3 non identical chains: laminin alpha, beta and gamma (formerly A, B1, and B2, respectively) and they form a cruciform structure consisting of 3 short arms, each formed by a different chain, and a long arm composed of all 3 chains.  Each laminin chain is a multidomain protein encoded by a distinct gene.  Several isoforms of each chain have been described.  Different alpha, beta and gamma chain isomers combine to give rise to different heterotrimeric laminin isoforms which are designated by Arabic numerals in the order of their discovery, i.e. alpha1beta1gamma1 heterotrimer is laminin 1.  The biological functions of the different chains and trimer molecules are largely unknown, but some of the chains have been shown to differ with respect to their tissue distribution, presumably reflecting diverse functions in vivo.  This gene encodes the beta chain isoform laminin, beta 1.  The beta 1 chain has 7 structurally distinct domains which it shares with other beta chain isomers.  The C-terminal helical region containing domains I and II are separated by domain alpha, domains III and V contain several EGF-like repeats, and domains IV and VI have a globular conformation.  Laminin, beta 1 is expressed in most tissues that produce basement membranes, and is one of the 3 chains constituting laminin 1, the first laminin isolated from Engelbreth-Holm-Swarm (EHS) tumor.  A sequence in the beta 1 chain that is involved in cell attachment, chemotaxis, and binding to the laminin receptor was identified and shown to have the capacity to inhibit metastasis.
15806	
15807	
15808	
15809	
15810	
15811	NEIL2 belongs to a class of DNA glycosylases homologous to the bacterial Fpg/Nei family. These glycosylases initiate the first step in base excision repair by cleaving bases damaged by reactive oxygen species and introducing a DNA strand break via the associated lyase reaction (Bandaru et al., 2002)[supplied by OMIM]
15812	
15813	The protein encoded by this gene belongs to the DEAD-like helicase superfamily. It shares similarity with Saccharomyces cerevisiae RAD54 and RDH54, both of which are involved in homologous recombination and repair of DNA. This protein binds to double-stranded DNA, and displays ATPase activity in the presence of DNA. This gene is highly expressed in testis and spleen, which suggests active roles in meiotic and mitotic recombination. Homozygous mutations of this gene were observed in primary lymphoma and colon cancer. Two alternatively spliced transcript variants encoding different isoforms have been reported for this gene. Also, transcript variants resulting from use of alternative polyA site exist.
15814	Aconitase 1, also known as iron regulatory element binding protein 1 (IREB1), is a cytosolic protein which binds to iron-responsive elements (IREs).  IREs are stem-loop structures found in the 5@#$%& UTR of ferritin mRNA, and in the 3@#$%& UTR of transferrin receptor mRNA.  The iron-induced binding to the IRE results in repression of translation of ferritin mRNA, and inhibition of degradation of the otherwise rapidly degrading transferrin receptor mRNA.  Thus, IREB1 plays a central role in cellular iron homeostasis.  It was also shown to have aconitase activity, and hence grouped with the aconitase family of enzymes.
15815	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Three alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
15816	Aconitase 1, also known as iron regulatory element binding protein 1 (IREB1), is a cytosolic protein which binds to iron-responsive elements (IREs).  IREs are stem-loop structures found in the 5@#$%& UTR of ferritin mRNA, and in the 3@#$%& UTR of transferrin receptor mRNA.  The iron-induced binding to the IRE results in repression of translation of ferritin mRNA, and inhibition of degradation of the otherwise rapidly degrading transferrin receptor mRNA.  Thus, IREB1 plays a central role in cellular iron homeostasis.  It was also shown to have aconitase activity, and hence grouped with the aconitase family of enzymes.
15817	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein that belongs to the EcoL15 ribosomal protein family. A pseudogene corresponding to this gene is found on chromosome 15q.
15818	
15819	
15820	
15821	The MD-2 protein appears to associate with toll-like receptor 4 on the cell surface and confers responsiveness to lipopolysaccyaride (LPS), thus providing a link between the receptor and LPS signaling.
15822	The structure of the protein encoded by this gene is highly related to both the protein ankyrin and transmembrane proteins.  The specific function of this protein has not yet been determined; however, studies indicate the function may involve a role in signal transduction and growth control.
15823	The Golgi apparatus, which participates in glycosylation and transport of proteins and lipids in the secretory pathway, consists of a series of stacked cisternae (flattened membrane sacs). Interactions between the Golgi and microtubules are thought to be important for the reorganization of the Golgi after it fragments during mitosis. The golgins are a family of proteins, of which the protein encoded by this gene is a member, that are localized to the Golgi. This encoded protein has been postulated to play roles in the stacking of Golgi cisternae and in vesicular transport. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of these variants has not been determined.
15824	This gene encodes a protein that is a member of the dickkopf family. The secreted protein contains two cysteine rich regions and is involved in embryonic development through its interactions with the Wnt signaling pathway. Activity of this protein is modulated by binding to the Wnt co-receptor and the co-factor kremen 2.
15825	Tissue specific transplantation antigen P35B is a NADP(H)-binding protein. It catalyze the two-step epimerase and the reductase reactions in GDP-D-mannose metabolism, converting GDP-4-keto-6-D-deoxymannose to GDP-L-fucose. GDP-L-fucose is the substrate of several fucosyltransferases involved in the expression of many glycoconjugates, including blood group ABH antigens and developmental adhesion antigens. Mutations in this gene may cause leukocyte adhesion deficiency, type II.
15826	
15827	
15828	
15829	Calcium-dependent membrane-binding proteins may regulate molecular events at the interface of the cell membrane and cytoplasm.  Copine III contains two type II C2 domains in the amino-terminus and an A domain-like sequence in the carboxy-terminus.  The A domain mediates interactions between integrins and extracellular ligands.
15830	
15831	
15832	
15833	
15834	
15835	
15836	
15837	
15838	This gene encodes a gluconeogenesis regulatory enzyme which catalyzes the hydrolysis of fructose 1,6-bisphosphate to fructose 6-phosphate and inorganic phosphate.
15839	
15840	Annexin I belongs to a family of Ca(2+)-dependent phospholipid binding proteins which have a molecular weight of approximately 35,000 to 40,000 and are preferentially located on the cytosolic face of the plasma membrane. Annexin I protein has an apparent relative molecular mass of 40 kDa, with phospholipase A2 inhibitory activity.  Since phospholipase A2 is required for the biosynthesis of the potent mediators of inflammation, prostaglandins and leukotrienes, annexin I may have potential anti-inflammatory activity.
15841	The protein encoded by this gene is a lysosomal cysteine proteinase that plays a major role in intracellular protein catabolism. Its substrates include collagen and elastin, as well as alpha-1 protease inhibitor, a major controlling element of neutrophil elastase activity. The encoded protein has been implicated in several pathologic processes, including myofibril necrosis in myopathies and in myocardial ischemia, and in the renal tubular response to proteinuria. This protein, which is a member of the peptidase C1 family, is a dimer composed of disulfide-linked heavy and light chains, both produced from a single protein precursor. At least two transcript variants encoding the same protein have been found for this gene.
15842	
15843	
15844	
15845	
15846	
15847	
15848	
15849	
15850	This gene encodes the folylpolyglutamate synthetase enzyme. This enzyme has a central role in establishing and maintaining both cytosolic and mitochondrial folylpolyglutamate concentrations and, therefore, is essential for folate homeostasis and the survival of proliferating cells. This enzyme catalyzes the ATP-dependent addition of glutamate moieties to folate and folate derivatives. While several transcript variants may exist for this gene, the full-length natures of only two have been biologically validated to date. These two variants encode distinct isoforms.
15851	
15852	
15853	
15854	
15855	
15856	The protein encoded by this gene may control steps in the cycling of proteins through the trans-Golgi network to endosomes, lysosomes and the plasma membrane. Mutations in this gene cause the autosomal recessive disorder, chorea-acanthocytosis. Alternative splicing of this gene results in multiple transcript variants.
15857	The protein encoded by this gene is a type II membrane protein that catalyzes the transfer of sialic acid from CMP-sialic acid to galactose-containing substrates. The encoded protein prefers glycoproteins rather than glycolipids as substrates and shows restricted substrate specificity, utilizing only the trisaccharide sequence Neu5Ac-alpha-2,3-Gal-beta-1,3-GalNAc. In addition, it is involved in the synthesis of ganglioside GD1A from GM1B. The encoded protein is normally found in the Golgi apparatus but can be proteolytically processed to a soluble form. This protein is a member of glycosyltransferase family 29. Three transcript variants encoding two different isoforms have been found for this gene.
15858	
15859	
15860	This gene encodes a member of the transcriptional enhancer factor (TEF) family. The family members contain the TEA/ATTS DNA-binding domain. This member is preferentially expressed in skeletal muscle, and binds to the M-CAT regulatory element which directs muscle-specific gene expression. The protein is encoded through the use of a non-AUG (TTG) translation initiation codon. Three alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
15861	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in motility, invasion, and tubulin polymerization. Chromosomal rearrangements and altered expression of this gene have been implicated in tumor metastasis.  Alternative splicing of the 5@#$%& UTR results in two gene products.
15862	
15863	Ryanodine receptors, such as RYR3, are intracellular calcium ion release channels responsible for the release of Ca(2+) from intracellular stores following transduction of many different extracellular stimuli.[supplied by OMIM]
15864	
15865	This gene encodes a key acute phase plasma protein.  Because of its increase due to acute inflammation, this protein is classified as an acute-phase reactant.  The specific function of this protein has not yet been determined; however, it may be involved in aspects of immunosuppression.
15866	
15867	
15868	The protein encoded by this gene is an inositol 1,4,5-trisphosphate (InsP3) 5-phosphatase. InsP3 5-phosphatases hydrolyze Ins(1,4,5)P3, which mobilizes intracellular calcium and acts as a second messenger mediating cell responses to various stimulation. Studies of the mouse counterpart suggest that this protein may hydrolyze phosphatidylinositol 3,4, 5-trisphosphate and phosphatidylinositol 3,5-bisphosphate on the cytoplasmic Golgi membrane and thereby regulate Golgi-vesicular trafficking.
15869	Coagulation factor II is proteolytically cleaved to form thrombin in the first step of the coagulation cascade which ultimately results in the stemming of blood loss.  F2 also plays a role in maintaining vascular integrity during development and postnatal life.  Mutations in F2 leads to various forms of thrombosis and dysprothrombinemia.
15870	
15871	
15872	
15873	
15874	The protein encoded by this gene is similar to the Maf1, a Saccharomyces cerevisiae protein highly conserved in eukaryotic cells. Yeast Maf1 is a negative effector of RNA polymerase III (Pol III). It responses to the changes in the cellular environment and represses the pol III transcription. Biochemical studies identified the initiation factor TFIIIB as a target for Maf1-dependent repression.
15875	
15876	
15877	Dystrophin is a large, rod-like cytoskeletal protein found at the inner surface of muscle fibers. Dystrophin is missing in Duchenne Muscular Dystrophy patients and is present in reduced amounts in Becker Muscular Dystrophy patients. The protein encoded by this gene is a peripheral membrane protein found associated with dystrophin and dystrophin-related proteins. This gene is a member of the syntrophin gene family, which contains at least two other structurally-related genes.
15878	
15879	Ephrin receptors and their ligands, the ephrins, mediate numerous developmental processes, particularly in the nervous system. Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. The Eph family of receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. Ephrin receptors make up the largest subgroup of the receptor tyrosine kinase (RTK) family. The protein encoded by this gene is a receptor for ephrin-B family members.
15880	
15881	
15882	
15883	
15884	This gene encodes a protein that was identified in a screen for genes expressed in metastatic cells, specifically, mammary adenocarcinoma cell lines. Expression of this gene has been correlated with the metastatic potential of at least two types of carcinomas although it is also expressed in many normal tissues. The role it plays in metastasis is unclear. It was initially thought to be the 70kD component of a nucleosome remodeling deacetylase complex, NuRD, but it is more likely that this component is a different but very similar protein. These two proteins are so closely related, though, that they share the same types of domains. These domains include two DNA binding domains, a dimerization domain, and a domain commonly found in proteins that methylate DNA. The profile and activity of this gene product suggest that it is involved in regulating transcription and that this may be accomplished by chromatin remodeling.
15885	
15886	
15887	
15888	
15889	
15890	
15891	
15892	Coagulation factor II is proteolytically cleaved to form thrombin in the first step of the coagulation cascade which ultimately results in the stemming of blood loss.  F2 also plays a role in maintaining vascular integrity during development and postnatal life.  Mutations in F2 leads to various forms of thrombosis and dysprothrombinemia.
15893	The elongation of primed DNA templates by DNA polymerase delta and DNA polymerase epsilon requires the accessory proteins proliferating cell nuclear antigen (PCNA) and replication factor C (RFC). RFC, also named activator 1, is a protein complex consisting of five distinct subunits of 140, 40, 38, 37, and 36 kD. This gene encodes the 36 kD subunit. This subunit can interact with the C-terminal region of PCNA. It forms a core complex with the 38 and 40 kDa subunits. The core complex possesses DNA-dependent ATPase activity, which was found to be stimulated by PCNA in an in vitro system. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
15894	
15895	DNAJC6 belongs to the evolutionarily conserved DNAJ/HSP40 family of proteins, which regulate molecular chaperone activity by stimulating ATPase activity. DNAJ proteins may have up to 3 distinct domains: a conserved 70-amino acid J domain, usually at the N terminus, a glycine/phenylalanine (G/F)-rich region, and a cysteine-rich domain containing 4 motifs resembling a zinc-finger domain (Ohtsuka and Hata, 2000).[supplied by OMIM]
15896	
15897	Neurexins are a family of proteins that function in the vertebrate nervous system as cell adhesion molecules and receptors. They are encoded by several unlinked genes of which two, NRXN1 and NRXN3, are among the largest known human genes. Three of the genes (NRXN1-3) utilize two alternate promoters and include numerous alternatively spliced exons to generate thousands of distinct mRNA transcripts and protein isoforms. The majority of transcripts are  produced from the upstream promoter and encode alpha-neurexin isoforms; a much smaller number of transcripts are produced from the downstream promoter and encode beta-neurexin isoforms. The alpha-neurexins contain epidermal growth factor-like (EGF-like) sequences and laminin G domains, and have been shown to interact with neurexophilins. The beta-neurexins lack EGF-like sequences and contain fewer laminin G domains than alpha-neurexins.
15898	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein and is similar in sequence to its family member CD53 antigen. It is known to complex with integrins and other transmembrane 4 superfamily proteins.
15899	
15900	
15901	
15902	
15903	
15904	
15905	
15906	
15907	
15908	Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide.  They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid.  They show extensive diversity in tissue distribution and in their subcellular localization.  CA VA is localized in the mitochondria and expressed primarily in the liver.  It may play an important role in ureagenesis and gluconeogenesis.  CA5A gene maps to chromosome 16q24.3 and an unprocessed pseudogene has been assigned to 16p12-p11.2.
15909	TATA-binding protein associated factors (TAFs) participate, with TATA binding protein (TBP; MIM 600075), in the formation of the TFIID protein complex (see MIM 313650), which is involved in the initiation of gene transcription by RNA polymerase II (see MIM 180660).[supplied by OMIM]
15910	
15911	The protein encoded by this gene is a transforming protein that is related to Rho-specific exchange factors and yeast cell cycle regulators. The expression of this gene is elevated with the onset of DNA synthesis and remains elevated during G2 and M phases. In situ hybridization analysis showed that expression is at a high level in cells undergoing mitosis in regenerating liver. Thus, this protein is expressed in a cell cycle-dependent manner during liver regeneration, and is thought to have an important role in the regulation of cytokinesis.
15912	
15913	
15914	
15915	
15916	This gene encodes a member of the arm-repeat (armadillo) and plakophilin gene families. Plakophilin proteins contain numerous armadillo repeats, localize to cell desmosomes and nuclei, and participate in linking cadherins to intermediate filaments in the cytoskeleton. This protein may act in cellular desmosome-dependent adhesion and signaling pathways.
15917	
15918	
15919	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L38E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome, including one located in the promoter region of the type 1 angiotensin II receptor gene.
15920	
15921	HLA-C belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domain, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Over one hundred HLA-C alleles have been described
15922	
15923	
15924	This gene complements a temperature-sensitive mutant isolated from the BHK-21 Syrian hamster cell line. It leads to a block in progression through the G1 phase of the cell cycle at nonpermissive temperatures.
15925	
15926	
15927	This gene has been found to be recurrently rearranged in chromosomal translocation with two other genes - baculoviral IAP repeat-containing protein 3 (also known as apoptosis inhibitor 2) and immunoglobulin heavy chain locus - in mucosa-associated lymphoid tissue lymphomas. The protein encoded by this gene may play a role in NF-kappaB activation. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
15928	
15929	The PRKDC gene encodes the catalytic subunit of a nuclear DNA-dependent serine/threonine protein kinase (DNA-PK). The second component is the autoimmune antigen Ku (MIM 152690), which is encoded by the G22P1 gene on chromosome 22q. On its own, the catalytic subunit of DNA-PK is inactive and relies on the G22P1 component to direct it to the DNA and trigger its kinase activity; PRKDC must be bound to DNA to express its catalytic properties.[supplied by OMIM]
15930	
15931	
15932	The protein encoded by this gene is a critical subunit of N-methyl-D-aspartate receptors, members of the glutamate receptor channel superfamily which are heteromeric protein complexes with multiple subunits arranged to form a ligand-gated ion channel. These subunits play a key role in the plasticity of synapses, which is believed to underlie memory and learning. The gene consists of 21 exons and is alternatively spliced, producing transcript variants differing in the C-terminus. Although the sequence of exon 5 is identical in human and rat, the alternative exon 5 splicing in rat has yet to be demonstrated in human. Cell-specific factors are thought to control expression of different isoforms, possibly contributing to the functional diversity of the subunits.
15933	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene encodes a member of the histone H1 family.
15934	This gene encodes a member of the bone morphogenetic protein (BMP) receptor family of transmembrane serine/threonine kinases. The ligands of this receptor are BMPs, which are members of the TGF-beta superfamily. BMPs are involved in endochondral bone formation and embryogenesis. These proteins transduce their signals through the formation of heteromeric complexes of 2 different types of serine (threonine) kinase receptors: type I receptors of about 50-55 kD and type II receptors of about 70-80 kD. Type II receptors bind ligands in the absence of type I receptors, but they require their respective type I receptors for signaling, whereas type I receptors require their respective type II receptors for ligand binding. Mutations in this gene have been associated with primary pulmonary hypertension. This gene expresses two transcript variants.
15935	KRT17 encodes the type I intermediate filament chain keratin 17, expressed in nail bed, hair follicle, sebaceous glands, and other epidermal appendages. Mutations in this gene lead to Jackson-Lawler type pachyonychia congenita and steatocystoma multiplex.
15936	
15937	
15938	
15939	
15940	
15941	
15942	
15943	
15944	This gene is a member of the MAGEA gene family. The members of this family encode proteins with 50 to 80% sequence identity to each other. The promoters and first exons of the MAGEA genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. The MAGEA genes are clustered at chromosomal location Xq28. They have been implicated in some hereditary disorders, such as dyskeratosis congenita.
15945	Temporally regulated transcription and translation of several testis-specific genes is required to initiate the series of molecular and morphological changes in the male germ cell lineage necessary for the formation of mature spermatozoa. This gene is a member of the SPANX family of cancer/testis-associated genes, which are located in a cluster on chromosome X. The SPANX genes encode differentially expressed testis-specific proteins that localize to various subcellular compartments. This particular gene maps to chromosome X in a head-to-head orientation with SPANX family member A1 and appears to be a duplication of that locus. The protein encoded by this gene targets to the nucleus where it associates with nuclear vacuoles and the redundant nuclear envelope. Based on its association with these poorly characterized regions of the sperm nucleus, this protein provides a biochemical marker to study unique structures in spermatazoa while attempting to further define its role in spermatogenesis.
15946	
15947	
15948	
15949	
15950	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and a SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation. The SOCS box domain has not yet been annotated for this encoded protein.
15951	This gene belongs to the RBMY gene family which includes candidate Y chromosome spermatogenesis genes. This gene, an active X chromosome homolog of the Y chromosome RBMY gene, is widely expressed whereas the RBMY gene evolved a male-specific function in spermatogenesis. Pseudogenes of this gene, found on chromosomes 1, 4, 9, 11, and 6, were likely derived by retrotransposition from the original gene. Alternatively spliced transcript variants encoding different isoforms have been identified but their biological nature has not been determined.
15952	
15953	
15954	This gene is X-linked and is expressed in only male germ cells. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
15955	
15956	Glycerol kinase (EC 2.7.1.30) catalyzes the phosphorylation of glycerol by ATP, yielding ADP and glycerol-3-phosphate.[supplied by OMIM]
15957	Actin-binding protein, or filamin, is a 280-kD protein that crosslinks actin filaments into orthogonal networks in cortical cytoplasm and participates in the anchoring of membrane proteins for the actin cytoskeleton. Remodeling of the cytoskeleton is central to the modulation of cell shape and migration. Filamin A, encoded by the FLNA gene, is a widely expressed protein that regulates reorganization of the actin cytoskeleton by interacting with integrins, transmembrane receptor complexes, and second messengers.[supplied by OMIM]
15958	
15959	
15960	
15961	The protein encoded by this gene is a member of the kelch family of proteins, which is characterized by a 50 amino acid repeat which interacts with actin. Transcript variants have been described but their full-length nature has not been determined.
15962	
15963	
15964	
15965	
15966	This gene encodes a protein that contains a SET domain. SET domains appear to be protein-protein interaction domains that mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). The function of this gene has not been determined. Two alternatively spliced transcript variants have been found for this gene.
15967	This gene encodes the medium-chain specific (C4 to C12 straight chain) acyl-Coenzyme A dehydrogenase. The homotetramer enzyme catalyzes the initial step of the mitochondrial fatty acid beta-oxidation pathway. Clinical phenotypes are associated with ACADM hereditary deficiency.
15968	
15969	
15970	
15971	The product of the POMGNT1 gene, protein O-mannose beta-1,2-N-acetylglucosaminyltransferase, participates in O-mannosyl glycan synthesis. It is mutant in muscle-eye-brain disease (MIM 253280).[supplied by OMIM]
15972	The protein encoded by this gene is a glycoprotein, belonging to the pentraxin family of proteins, which has a characteristic pentameric organization. These family members have considerable sequence homology which is thought to be the result of gene duplication. The binding of the encoded protein to proteins in the pathological amyloid cross-beta fold suggests its possible role as a chaperone. This protein is also thought to control the degradation of chromatin. It has been demonstrated that this protein binds to apoptotic cells at an early stage, which raises the possibility that it is involved in dealing with apoptotic cells in vivo. This gene has multiple polyadenylation sites.
15973	CD2 (MIM 186990) family members, such as SLAMF9, are encoded by genes near the centromere of chromosome 1. They function as coreceptors for lymphocyte activation and/or adhesion via their extracellular and cytoplasmic domains (Fennelly et al., 2001).[supplied by OMIM]
15974	Interaction of Fas ligand (TNFSF6) with the FAS antigen (TNFRSF6) mediates programmed cell death, also called apoptosis, in a number of organ systems. The protein encoded by this gene binds to FAS antigen and can initiate apoptosis or enhance apoptosis initiated through FAS antigen. Initiation of apoptosis by the protein encoded by this gene requires a ubiquitin-like domain but not the FAS-binding domain. Two transcript variants encoding different protein isoforms have been described for this gene.
15975	
15976	
15977	Peptidoglycan recognition proteins, such as PGRPI-alpha, are part of the innate immune system and recognize peptidoglycan, a ubiquitous component of bacterial cell walls.[supplied by OMIM]
15978	
15979	
15980	
15981	
15982	Alzheimer@#$%&s disease (AD) patients with an inherited form of the disease carry mutations in the presenilin proteins (PSEN1; PSEN2) or the amyloid precursor protein (APP).  These disease-linked mutations result in increased production of the    longer form of amyloid-beta (main component of amyloid deposits found in AD brains).  Presenilins are postulated to regulate APP processing through their effects on gamma-secretase, an enzyme that cleaves APP.  Also, it is thought that the presenilins are involved in the cleavage of the Notch receptor, such that that they either directly regulate gamma-secretase activity or themselves are protease enzymes.  Two alternative transcripts of PSEN2 have been identified.
15983	
15984	This gene encodes a member of the Argonaute family of proteins which play a role in RNA interference. The encoded protein is highly basic, contains a PAZ domain and a PIWI domain, and may play a role in short-interfering-RNA-mediated gene silencing. This gene is located on chromosome 1 in a tandem cluster of closely related family members including argonaute 4 and eukaryotic translation initiation factor 2C, 1. Two transcript variants encoding distinct isoforms have been identified for this gene.
15985	There are at least four distinct but related alkaline phosphatases: intestinal, placental, placental-like, and liver/bone/kidney (tissue non-specific). The first three are located together on chromosome 2 while the tissue non-specific form is located on chromosome 1. The product of this gene is a membrane bound glycosylated enzyme that is not expressed in any particular tissue and is, therefore, referred to as the tissue-nonspecific form of the enzyme. The exact physiological function of the alkaline phosphatases is not known. A proposed function of this form of the enzyme is matrix mineralization, however, mice that lack a functional form of this enzyme show normal skeletal development. This enzyme has been linked directly to a disorder known as hypophosphatasia, a disorder that is characterized by hypercalcemia and includes skeletal defects. The character of this disorder can vary, however, depending on the specific mutation since this determines age of onset and severity of symptoms.
15986	
15987	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class, and its C-terminal region is highly similar to that of rat NFB42 (neural F Box 42 kDa) which may be involved in the control of the cell cycle.
15988	
15989	Opticin belongs to class III of the small leucine-rich repeat protein (SLRP) family. Members of this family are typically associated with the extracellular matrix. Opticin is present in significant quantities in the vitreous of the eye and also localizes to the cornea, iris, ciliary body, optic nerve, choroid, retina, and fetal liver. Opticin may noncovalently bind collagen fibrils and regulate fibril morphology, spacing, and organization. The opticin gene is mapped to a region of chromosome 1 that is associated with the inherited eye diseases age-related macular degeneration (AMD) and posterior column ataxia with retinosa pigmentosa (AXPC1).
15990	
15991	The endothelial leukocyte adhesion molecule-1 is expressed by cytokine-stimulated endothelial cells. It is thought to be responsible for the accumulation of blood leukocytes at sites of inflammation by mediating the adhesion of cells to the vascular lining. It exhibits structural features such as the presence of lectin- and EGF-like domains followed by short consensus repeat (SCR) domains that contain 6 conserved cysteine residues. These proteins are part of the selectin family of cell adhesion molecules. This gene is present in single copy in the human genome and contains 14 exons spanning about 13 kb of DNA. Adhesion molecules participate in the interaction between leukocytes and the endothelium and appear to be involved in the pathogenesis of atherosclerosis.
15992	
15993	This gene encodes a member of the regulator of G-protein signalling family. This protein is located on the cytosolic side of the plasma membrane and contains a conserved, 120 amino acid motif called the RGS domain. The protein attenuates the signalling activity of G-proteins by binding to activated, GTP-bound G alpha subunits and acting as a GTPase activating protein (GAP), increasing the rate of conversion of the GTP to GDP. This hydrolysis allows the G alpha subunits to bind G beta/gamma subunit heterodimers, forming inactive G-protein heterotrimers, thereby terminating the signal.
15994	
15995	This gene encodes a member of the SWI2/SNF2 family of proteins, which play a critical role in altering protein-DNA interactions. The encoded protein has been shown to have dsDNA-dependent ATPase activity and RNA polymerase II termination activity. This protein interacts with cell division cycle 5-like, associates with human splicing complexes, and plays a role in pre-mRNA splicing.
15996	
15997	
15998	
15999	
16000	The VMD2 gene (MIM 607854) underlying vitelliform macular dystrophy and the 3 VMD2-like genes, VMD2L1 (MIM 607335), VMD2L2, and VMD2L3 (MIM 607337), encode transmembrane (TM) spanning proteins that share a homology region with a high content of aromatic residues including an invariant arginine (R), phenylalanine (F), and proline (P) motif. VMD2 and the 3 VMD2-related genes share a conserved gene structure with almost identical sizes of the 8 RFP-TM domain encoding exons and highly conserved positions of their corresponding exon-intron boundaries. Each of the 4 paralogous genes contains a unique 3-prime end of variable length without significant homology to known proteins or motifs (Stohr et al., 2002).[supplied by OMIM]
16001	The encoded gene product presumably interacts with YY1 protein; however, its exact function is not known. Five transcript variants encoding different isoforms have been noted for this gene. Expression of one of the isoforms has been associated with hepatocellular carcinoma.
16002	
16003	This gene product belongs to the SCAMP family of proteins which are secretory carrier membrane proteins. They function as carriers to the cell surface in post-golgi recycling pathways. Different family members are highly related products of distinct genes, and are usually expressed together. These findings suggest that the SCAMPs may function at the same site during vesicular transport rather than in separate pathways. Two transcript variants encoding different isoforms have been found for this gene.
16004	
16005	
16006	This gene encodes a protein which is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes which undergo proteolytic processing at conserved aspartic residues to produce 2 subunits, large and small, that dimerize to form the active enzyme. This protein is processed by caspase APAF1; this step is thought to be one of the earliest in the caspase activation cascade. Alternative splicing of this gene results in two transcript variants which encode different isoforms.
16007	
16008	
16009	Breast tumors are initially dependent on estrogens for growth and progression and can be inhibited by anti-estrogens such as tamoxifen. However, breast cancers progress to become anti-estrogen resistant. Breast cancer anti-estrogen resistance gene 3 was identified in the search for genes involved in the development of estrogen resistance. The gene encodes a component of intracellular signal transduction that causes estrogen-independent proliferation in human breast cancer cells. The protein contains a putative src homology 2 (SH2) domain, a hall mark of cellular tyrosine kinase signaling molecules, and is partly homologous to the cell division cycle protein CDC48.
16010	This gene encodes a protein similar to calcium/calmodulin dependent protein kinase, however, its exact function is not known.
16011	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes (E1s), ubiquitin-conjugating enzymes (E2s), and ubiquitin-protein ligases (E3s). This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. The encoded protein is 98% identical to the mouse counterpart.
16012	
16013	
16014	
16015	
16016	
16017	Synaptobrevins/VAMPs, syntaxins, and the 25-kD synaptosomal-associated protein are the main components of a protein complex involved in the docking and/or fusion of synaptic vesicles with the presynaptic membrane. This gene is a member of the vesicle-associated membrane protein (VAMP)/synaptobrevin family. Because of its high homology to other known VAMPs, its broad tissue distribution, and its subcellular localization, the protein encoded by this gene was shown to be the human equivalent of the rodent cellubrevin. In platelets the protein resides on a compartment that is not mobilized to the plasma membrane on calcium or thrombin stimulation.
16018	This gene encodes a member of a superfamily of proteins composed predominantly of tandemly arrayed short consensus repeats of approximately 60 amino acids. Along with a single, unique beta-chain, seven identical alpha-chains encoded by this gene assemble into the predominant isoform of C4b-binding protein, a multimeric protein that controls activation of the complement cascade through the classical pathway. The genes encoding both alpha and beta chains are located adjacent to each other on human chromosome 1 in the regulator of complement activation gene cluster. Two pseudogenes of this gene are also found in the cluster.
16019	
16020	The protein encoded by this gene is a Ro RNP-binding protein. It interacts with Ro RNPs and their interaction is thought to represent a gain of function for Ro RNPs. This protein also forms a ternary complex with far upstream element (FUSE) and FUSE-binding protein. It can repress a c-myc reporter via the FUSE. It is also known to target transcription factor IIH and inhibit activated transcription. This gene is implicated in the xeroderma pigmentosum disorder. There are two alternatively spliced transcript variants of this gene encoding different isoforms. There seems to be evidence of multiple polyadenylation sites for this gene.
16021	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and hydroxylates medium-chain fatty acids such as laurate and myristate.
16022	The removal of introns from nuclear pre-mRNAs occurs on complexes called spliceosomes, which are made up of 4 small nuclear ribonucleoprotein (snRNP) particles and an undefined number of transiently associated splicing factors. PRPF3 is 1 of several proteins that associate with U4 and U6 snRNPs.[supplied by OMIM]
16023	
16024	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-transferase that belongs to the mu class. The mu class of enzymes functions in the detoxification of electrophilic compounds, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress, by conjugation with glutathione. The genes encoding the mu class of enzymes are organized in a gene cluster on chromosome 1p13.3 and are known to be highly polymorphic. These genetic variations can change an individual@#$%&s susceptibility to carcinogens and toxins as well as affect the toxicity and efficacy of certain drugs. Diversification of these genes has occurred in regions encoding substrate-binding domains, as well as in tissue expression patterns, to accommodate an increasing number of foreign compounds.
16025	
16026	RRAGC is a monomeric guanine nucleotide-binding protein, or G protein. By binding GTP or GDP, small G proteins act as molecular switches in numerous cell processes and signaling pathways.[supplied by OMIM]
16027	RRAGC is a monomeric guanine nucleotide-binding protein, or G protein. By binding GTP or GDP, small G proteins act as molecular switches in numerous cell processes and signaling pathways.[supplied by OMIM]
16028	Aminoacyl-tRNA synthetases are a class of enzymes that charge tRNAs with their cognate amino acids. The protein encoded by this gene is a multifunctional aminoacyl-tRNA synthetase that catalyzes the aminoacylation of glutamic acid and proline tRNA species. Alternative splicing has been observed for this gene, but the full-length nature and biological validity of the variant have not been determined.
16029	
16030	This gene encodes a member of von Willebrand factor A domain containing protein family. This family of proteins are thought to be involved in the formation of filamentous networks in the extracellular matrices of various tissues. Mutations of this gene have been associated with variety of inherited chondrodysplasias.
16031	
16032	
16033	
16034	This gene encodes a leucine-rich cytoplasmic protein, which is highly similar to a mouse protein that negatively regulates Ca/calmodulin-dependent protein kinase II phosphorylation and may be essential for spatial learning processes. Several alternatively spliced transcript variants of this gene have been described.
16035	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene includes 2 alternatively spliced transcripts encoding 2 different isoforms. The larger isoform is a DEAD box protein with RNA helicase activity. It may participate in melting of DNA:RNA hybrids, such as those that occur during transcription, and may play a role in X-linked gene expression. It contains 2 copies of a double-stranded RNA-binding domain, a DEXH core domain and an RGG box. The RNA-binding domains and RGG box influence and regulate RNA helicase activity. The smaller isoform is a lymphocyte granule protein. It lacks RNA-binding domains and DEXH core domain, but contains an RGG box, which may render this isoform RNA binding function.
16036	
16037	Integrins are heterodimeric proteins made up of alpha and beta subunits. At least 18 alpha and 8 beta subunits have been described in mammals. Integrin family members are membrane receptors involved in cell adhesion and recognition in a variety of processes including embryogenesis, hemostasis, tissue repair, immune response and metatastatic diffusion of tumor cells. The protein encoded by this gene is a beta subunit. Six alternatively spliced variants have been found for this gene which encode five proteins with alternate carboxy termini.
16038	IRX5 is a member of the Iroquois homeobox gene family. Members of this family appear to play multiple roles during pattern formation of vertebrate embryos.[supplied by OMIM]
16039	
16040	The genes for alpha-tectorin (MIM 602574) and beta-tectorin encode the major noncollagenous proteins of the tectorial membrane of the cochlea.[supplied by OMIM]
16041	
16042	
16043	This gene encodes lipase A, the lysosomal acid lipase (also known as cholesterol ester hydrolase). This enzyme functions in the lysosome to catalyze the hydrolysis of cholesteryl esters and triglycerides. Mutations in this gene can result in Wolman disease and cholesteryl ester storage disease.
16044	
16045	
16046	
16047	Purine 5-prime-nucleotidase (EC 3.1.3.5) preferentially hydrolyzes inosine 5-prime-monophosphate (IMP) and other purine nucleotides, and is allosterically activated by various compounds, including ATP. The enzyme is exclusively located in the cytoplasmic matrix of cells and may have a critical role in the maintenance of a constant composition of intracellular purine/pyrimidine nucleotides in cooperation with other nucleotidases.[supplied by OMIM]
16048	
16049	
16050	
16051	
16052	This gene encodes a member of the trypsin family of serine proteases. This protein is a secreted enzyme that is proposed to regulate the availability of insulin-like growth factors (IGFs) by cleaving IGF-binding proteins. It has also been suggested to be a regulator of cell growth.
16053	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and is induced by ethanol, the diabetic state, and starvation. The enzyme metabolizes both endogenous substrates, such as ethanol, acetone, and acetal, as well as exogenous substrates including benzene, carbon tetrachloride, ethylene glycol, and nitrosamines which are premutagens found in cigarette smoke. Due to its many substrates, this enzyme may be involved in such varied processes as gluconeogenesis, hepatic cirrhosis, diabetes, and cancer.
16054	This gene encodes a member of the sirtuin family of proteins, homologs to the yeast Sir2 protein. Members of the sirtuin family are characterized by a sirtuin core domain and grouped into four classes. The functions of human sirtuins have not yet been determined; however, yeast sirtuin proteins are known to regulate epigenetic gene silencing and suppress recombination of rDNA. Studies suggest that the human sirtuins may function as intracellular regulatory proteins with mono-ADP-ribosyltransferase activity. The protein encoded by this gene is included in class I of the sirtuin family.
16055	
16056	
16057	
16058	This gene was identified as a tumor suppressor that is mutated in a large number of cancers at high frequency. The protein encoded this gene is a phosphatidylinositol-3,4,5-trisphosphate 3-phosphatase. It contains a tension like domain as well as a catalytic domain similar to that of the dual specificity protein tyrosine phosphatases. Unlike most of the protein tyrosine phosphatases, this protein preferentially dephosphorylates phosphoinositide substrates. It negatively regulates intracellular levels of phosphatidylinositol-3,4,5-trisphosphate in cells and functions as a tumor suppressor by negatively regulating AKT/PKB signaling pathway.
16059	OAT encodes the mitochondrial enzyme ornithine aminotransferase, which is a key enzyme is the pathway that converts arginine and ornithine into the major excitatory and inhibitory neurotransmitters glutamate and GABA. Mutations that result in a deficiency of this enzyme cause the autosomal recessive eye disease Gyrate Atrophy.
16060	This gene encodes a member of a family of small membrane proteins that share a 35-amino acid signature sequence domain, beginning with the sequence PFXYD and containing 7 invariant and 6 highly conserved amino acids. The approved human gene nomenclature for the family is FXYD-domain containing ion transport regulator. FXYD4, originally named CHIF for channel-inducing factor, has been shown to modulate the properties of the Na,K-ATPase, as has FXYD2, also known as the gamma subunit of the Na,K-ATPase, and FXYD7. Transmembrane topology has been established for FXYD4 and two family members (FXYD1 and FXYD2), with the N-terminus extracellular and the C-terminus on the cytoplasmic side of the membrane.
16061	Carboxypeptidase N (CPN)is a plasma metallo-protease that cleaves basic amino acid residues from the C terminal of peptides and proteins. The enzyme plays a central role in regulating the biologic activity of peptides such as kinins and anaphylatoxins, and is also known as kininase-1 and anaphylatoxin inactivator.  CPN is a tetrameric complex with a molecular weight of 280-kDa consisting of two identical small enzymatically active (catalytic) subunits of 50-kDa (this record) and two large 83-kDa regulatory subunits.  The gene that encodes the 50-kDa catalytic subunit has been mapped to chromosome 10 but its exact location is unknown.
16062	
16063	This gene encodes a protein with similarity to the C. elegans unc93 protein. The Unc93 protein is involved in the regulation or coordination of muscle contraction in the worm.
16064	The protein encoded by this gene was identified as a transmembrane phosphoprotein specifically associated with tyrosine phosphatase PTPRC/CD45, a key regulator of T- and B-lymphocyte activation. The interaction with PTPRC may be required for the stable expression of this protein.
16065	
16066	
16067	MONDOA forms heterodimers with MLX (MIM 602976) that can bind to and activate transcription from CACGTG E boxes (Billin et al., 2000 [PubMed 11073985]).[supplied by OMIM]
16068	
16069	
16070	The ADP-ribosylation factor (ARF) genes are small GTP-binding proteins of the RAS superfamily.  ARL2 is a member of a functionally distinct group of ARF-like genes.
16071	
16072	The protein encoded by this gene is a member of the serine/threonine kinase family. This kinase contains a SH3 domain and a leucine zipper-basic motif. This kinase preferentially activates MAPK8/JNK kinase, and functions as a positive regulator of JNK signaling pathway. This kinase can directly phosphorylate, and activates IkappaB kinase alpha and beta, and is found to be involved in the transcription activity of NF-kappaB mediated by Rho family GTPases and CDC42.
16073	
16074	
16075	The protein encoded by this gene is a member of the thiol-specific antioxidant protein family. This protein is a bifunctional enzyme with two distinct active sites. It is involved in redox regulation of the cell; it can reduce H(2)O(2) and short chain organic, fatty acid, and phospholipid hydroperoxides. It may play a role in the regulation of phospholipid turnover as well as in protection against oxidative injury.
16076	
16077	CSTF3 is one of three (including CSTF1 and CSTF2) cleavage stimulation factors that combine to form CSTF.  CSTF is involved in the polyadenylation and 3@#$%&end cleavage of pre-mRNAs.
16078	CDON and BOC (MIM 608708) are cell surface receptors of the immunoglobulin (Ig)/fibronectin type III (FNIII; see MIM 135600) repeat family involved in myogenic differentiation. CDON and BOC are coexpressed during development, form complexes with each other in a cis fashion, and are related to each other in their ectodomains, but each has a unique long cytoplasmic tail.[supplied by OMIM]
16079	
16080	The protein encoded by this gene is a member of a family of proteins that inhibits apoptosis by binding to tumor necrosis factor receptor-associated factors TRAF1 and TRAF2, probably by interfering with activation of ICE-like proteases. The encoded protein inhibits apoptosis induced by serum deprivation but does not affect apoptosis resulting from exposure to menadione, a potent inducer of free radicals. The amino acid sequence predicts three baculovirus IAP repeat domains and a ring finger domain. Transcript variants encoding the same isoform have been identified.
16081	
16082	Human glutamate receptor 4 (GRIA4) is a new member of the family of ionotropic glutamate receptors which are the predominant excitatory neurotransmitter receptors in the mammalian brain.  Binding studies showed that human GRIA4 transfected into simian kidney cells (COS-1) exhibits high specific binding for[3H](RS)-alpha-amino- 3-hydroxy-5-methylisoxazole-4-propionic acid ([3H]AMPA) but not [3H]kainate.  Ion substitution experiments indicate that hGluR4 receptor-linked ion channels in their homomeric state are permeable to both CA2+ and Na+ ions.  Immunoprecipitation studies suggest that GRIA1 exists in situ in the form of a pentamer.
16083	TIMM8B belongs to a family of evolutionarily conserved proteins that are organized in heterooligomeric complexes in the mitochondrial intermembrane space. These proteins mediate the import and insertion of hydrophobic membrane proteins into the mitochondrial inner membrane.[supplied by OMIM]
16084	The product of this gene belongs to the ligand-gated ion channel receptor superfamily. This gene encodes subunit A of the type 3 receptor for 5-hydroxytryptamine (serotonin), a biogenic hormone that functions as a neurotransmitter, a hormone, and a mitogen. This receptor causes fast, depolarizing responses in neurons after activation. It appears that the heteromeric combination of A and B subunits is necessary to provide the full functional features of this receptor, since either subunit alone results in receptors with very low conductance and response amplitude. Alternatively spliced transcript variants encoding different isoforms have been identified.
16085	
16086	T cell antigen receptor (TCR) is associated on the T cell surface with a complex of protein called CD3.  CD3G (gamma chain) is one of the four peptides (gamma, delta, epsilon and zeta) that form CD3.  Defects in CD3G are associated with T cell immunodeficiency.
16087	This gene encodes a member of the hydroxymethylbilane synthase superfamily. The encoded protein is the third enzyme of the heme biosynthetic pathway and catalyzes the head to tail condensation of four porphobilinogen molecules into the linear hydroxymethylbilane. Mutations in this gene are associated with the autosomal dominant disease acute intermittent porphyria. Alternatively spliced transcript variants encoding different isoforms have been described.
16088	This gene encodes an enzyme that is involved in cholesterol biosynthesis. It is thought to be an integral membrane protein.
16089	
16090	
16091	
16092	Translation initiation is mediated by specific recognition of the cap structure by eukaryotic translation initiation factor 4F (eIF4F), which is a cap binding protein complex that consists of three subunits: eIF4A, eIF4E and eIF4G. The protein encoded by this gene shares similarity with the C-terminal region of eIF4G, that contains the binding sites for eIF4A and eIF3; eIF4G in addition, contains a binding site for eIF4E at the N-terminus. Unlike eIF4G which supports cap-dependent and independent translation, this gene product functions as a general repressor of translation by forming translationally inactive complexes. In vitro and in vivo studies indicate that translation of this mRNA initiates exclusively at a non-AUG (GUG) codon.
16093	
16094	This gene encodes a member of the membrane-spanning 4A gene family, members of which are characterized by common structural features and similar intron/exon splice boundaries and display unique expression patterns in hematopoietic cells and nonlymphoid tissues. This family member is associated with mature cellular function in the monocytic lineage, and it may be a component of a receptor complex involved in signal transduction. This gene is localized to 11q12, in a cluster of other family members. At least four alternatively spliced transcript variants encoding two distinct isoforms have been observed.
16095	
16096	The protein encoded by this gene is a member of the fatty acid desaturase (FADS) gene family. Desaturase enzymes regulate unsaturation of fatty acids through the introduction of double bonds between defined carbons of the fatty acyl chain. FADS family members are considered fusion products composed of an N-terminal cytochrome b5-like domain and a C-terminal multiple membrane-spanning desaturase portion, both of which are characterized by conserved histidine motifs. This gene is clustered with family members FADS1 and FADS2 at 11q12-q13.1; this cluster is thought to have arisen evolutionarily from gene duplication based on its similar exon/intron organization.
16097	
16098	
16099	This gene encodes a protein that may play a role in organelle biogenesis associated with melanosomes, platelet dense granules, and lysosomes. This protein interacts with Hermansky-Pudlak syndrome 6 protein and may interact with the cytoplasmic domain of integrin, alpha-3. Mutations in this gene are associated with Hermansky-Pudlak syndrome type 5. Multiple transcript variants encoding two distinct isoforms have been identified for this gene.
16100	This gene is a member of the cadherin superfamily whose members encode calcium-dependent cell-cell adhesion molecules. The encoded protein has a signal peptide, 27 cadherin repeat domains and a unique cytoplasmic region. This particular cadherin family member is expressed in fibroblasts but not in melanocytes or keratinocytes. The cell-cell adhesion of fibroblasts is thought to be necessary for wound healing.
16101	
16102	
16103	This gene encodes a member of the M20 metalloprotease family. The encoded protein is specifically expressed in the brain, is a homodimeric dipeptidase which was identified as human carnosinase. This gene contains trinucleotide (CTG) repeat length polymorphism in the coding region.
16104	
16105	
16106	This gene product is a serine/threonine protein kinase that contains an N-terminal protein kinase domain that is structurally similar to the kinase domains of testis-specific protein kinase-1 and the LIM motif-containing protein kinases (LIMKs). Its overall structure is most related to the former, indicating that it belongs to the TESK subgroup of the LIMK/TESK family of protein kinases. This gene is predominantly expressed in testis and prostate. The developmental expression pattern of the rat gene in testis suggests an important role for this gene in meitoic stages and/or early stages of spermiogenesis.
16107	
16108	
16109	This gene encodes a pyrin-like protein which contains a pyrin domain, a nucleotide-binding site (NBS) domain, and a leucine-rich repeat (LRR) motif. This protein interacts with apoptosis-associated speck-like protein containing a CARD. Proteins which contain the caspase recruitment domain, CARD, have been shown to be involved in inflammation and immune response. This protein may function as an activator of NF-kappaB signaling. The encoded protein may play a role in the regulation of inflammation and apoptosis. Mutations in this gene have been associated with familial cold autoinflammatory syndrome (FCAS), Muckle-Wells syndrome (MWS), chronic infantile neurological cutaneous and articular (CINCA) syndrome, and neonatal-onset multisystem inflammatory disease (NOMID). Two transcript variants encoding distinct isoforms have been identified for this gene.
16110	Summary: This gene encodes an ubiquitin-like protein (ubiquilin) that shares high degree of similarity with related products in yeast, rat and frog. Ubiquilins contain a N-terminal ubiquitin-like domain and a C-terminal ubiquitin-associated domain. They physically associate with both proteasomes and ubiquitin ligases, and thus thought to functionally link the ubiquitination machinery to the proteasome to affect in vivo protein degradation. This gene is specifically expressed in the testis, and proposed to regulate cell-cycle progression during spermatogenesis.
16111	
16112	The Golgi apparatus, which participates in glycosylation and transport of proteins and lipids in the secretory pathway, consists of a series of stacked cisternae (flattened membrane sacs). Interactions between the Golgi and microtubules are thought to be important for the reorganization of the Golgi after it fragments during mitosis. The golgins are a family of proteins, of which the protein encoded by this gene is a member, that are localized to the Golgi. This encoded protein has been postulated to play a role in nuclear transport and Golgi apparatus localization. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of these variants has not been determined.
16113	Zinc is an essential cofactor for hundreds of enzymes. It is involved in protein, nucleic acid, carbohydrate, and lipid metabolism, as well as in the control of gene transcription, growth, development, and differentiation. SLC39A5 belongs to a subfamily of proteins that show structural characteristics of zinc transporters (Taylor and Nicholson, 2003).[supplied by OMIM]
16114	The protein encoded by this gene contains a RING finger, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions. The specific function of this protein has not yet been determined. Three alternatively spliced transcript variants encoding two distinct isoforms have been reported.
16115	The protein encoded by this gene is a member of the Ser/Thr protein kinase family. This protein kinase is highly similar to the gene products of S. cerevisiae cdc28, and S. pombe cdc2. It is a catalytic subunit of the cyclin-dependent protein kinase complex, whose activity is restricted to the G1-S phase, and essential for cell cycle G1/S phase transition. This protein associates with and regulated by the regulatory subunits of the complex including cyclin A or E, CDK inhibitor p21Cip1 (CDKN1A) and p27Kip1 (CDKN1B). Its activity is also regulated by its protein phosphorylation. Two alternatively spliced variants and multiple transcription initiation sites of this gene have been reported.
16116	
16117	The protein encode dby this gene is expressed in a developmentally regulated manner in lymphoid cell lines and tissues. The protein is localized to the cytoplasmic face of the endoplasmic reticulum.
16118	The importin-alpha/beta complex and the GTPase Ran mediate nuclear import of proteins with a classical nuclear localization signal. The protein encoded by this gene is a member of a class of approximately 20 potential Ran targets that share a sequence motif related to the Ran-binding site of importin-beta. This protein binds to the nuclear pore complex and, along with RanGTP and RANBP1, inhibits the GAP stimulation of the Ran GTPase.
16119	
16120	
16121	
16122	
16123	
16124	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains two tandem Src homology-2 domains, which function as phospho-tyrosine binding domains and mediate the interaction of this PTP with its substrates. This PTP is widely expressed in most tissues and plays a regulatory role in various cell signaling events that are important for a diversity of cell functions, such as mitogenic activation, metabolic control, transcription regulation, and cell migration. Mutations in this gene are a cause of Noonan syndrome as well as acute myeloid leukemia.
16125	Autophagy is the process by which endogenous proteins and damaged organelles are destroyed intracellularly. Autophagy is postulated to be essential for cell homeostasis and cell remodeling during differentiation, metamorphosis, non-apoptotic cell death, and aging. Reduced levels of autophagy have been described in some malignant tumors, and a role for autophagy in controlling the unregulated cell growth linked to cancer has been proposed. This gene encodes a member of the autophagin protein family. The encoded protein is also designated as a member of the C-54 family of cysteine proteases. Two transcript variants that encode different isoforms were identified through sequence analysis.
16126	The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is specifically expressed in the basal layer of the epidermis with family member KRT14. Mutations in these genes have been associated with a complex of diseases termed epidermolysis bullosa simplex. The type II cytokeratins are clustered in a region of chromosome 12q12-q13.
16127	This gene encodes a member of the small leucine-rich proteoglycan (SLRP) family that includes decorin, biglycan, fibromodulin, keratocan, epiphycan, and osteoglycin. In these bifunctional molecules, the protein moiety binds collagen fibrils and the highly charged hydrophilic glycosaminoglycans regulate interfibrillar spacings. Lumican is the major keratan sulfate proteoglycan of the cornea but is also distributed in interstitial collagenous matrices throughout the body. Lumican may regulate collagen fibril organization and circumferential growth, corneal transparency, and epithelial cell migration and tissue repair.
16128	
16129	
16130	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Pseudogenes corresponding to this gene are found on chromosomes 4p and 21q.
16131	This gene was identified upon genomic analysis of a gene-dense region at human chromosome 12p13. It appears to be mainly expressed in the brain, however, its function is not known. Alternatively spliced transcript variants encoding different isoforms have been identified.
16132	
16133	
16134	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E3 ubiquitin-conjugating enzyme family. The encoded protein may interact with other proteins and play a role in stress response. Multiple transcript variants encoding two distinct isoforms have been identified for this gene.
16135	
16136	
16137	
16138	
16139	
16140	Natural killer (NK) cells are lymphocytes that mediate cytotoxicity and secrete cytokines after immune stimulation. Several genes of the C-type lectin superfamily, including the rodent NKRP1 family of glycoproteins, are expressed by NK cells and may be involved in the regulation of NK cell function. The KLRB1 protein contains an extracellular domain with several motifs characteristic of C-type lectins, a transmembrane domain, and a cytoplasmic domain. The KLRB1 protein is classified as a type II membrane protein because it has an external C terminus.
16141	
16142	
16143	
16144	This gene encodes a protein localized to the nucleus in proliferating cells. Expression of this gene is modulated by cytokines in natural killer and T cells. The gene product is thought to play a role in apoptosis but the specific role has not yet been determined. Two transcripts encoding different isoforms have been identified.
16145	KRT18 encodes the type I intermediate filament chain keratin 18. Keratin 18, together with its filament partner keratin 8, are perhaps the most commonly found members of the intermediate filament gene family. They are expressed in single layer epithelial tissues of the body. Mutations in this gene have been linked to cryptogenic cirrhosis. Two transcript variants encoding the same protein have been found for this gene.
16146	
16147	This gene is similar to a mouse gene that encodes a testis-specific protein containing a RING finger domain. Two splice variants have been found for this gene.
16148	This gene encodes a member of the tumor necrosis factor (TNF) cytokine family which is a ligand for osteoprotegerin and functions as a key factor for osteoclast differentiation and activation. This protein was shown to be a dentritic cell survival factor and is involved in the regulation of T cell-dependent immune response. T cell activation was reported to induce expression of this gene and lead to an increase of osteoclastogenesis and bone loss. This protein was shown to activate antiapoptotic kinase AKT/PKB through a signaling complex involving SRC kinase and tumor necrosis factor receptor-associated factor (TRAF) 6, which indicated this protein may have a role in the regulation of cell apoptosis. Targeted disruption of the related gene in mice led to severe osteopetrosis and a lack of osteoclasts. The deficient mice exhibited defects in early differentiation of T and B lymphocytes, and failed to form lobulo-alveolar mammary structures during pregnancy. Two alternatively spliced transcript variants have been found.
16149	
16150	
16151	
16152	
16153	
16154	
16155	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. The encoded protein is one of two functional homologs to yeast Upf3p. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein binds to the mRNA and remains bound after nuclear export, acting as a nucleocytoplasmic shuttling protein. It forms with Y14 a complex that binds specifically 20 nt upstream of exon-exon junctions. This gene is located on the long arm of chromosome 13. Two splice variants encoding different isoforms have been found for this gene.
16156	
16157	
16158	Members of the peripheral membrane-associated guanylate kinase (MAGUK) family function in tumor suppression and receptor clustering by forming multiprotein complexes containing distinct sets of transmembrane, cytoskeletal, and cytoplasmic signaling proteins. All MAGUKs contain a PDZ-SH3-GUK core and are divided into 4 subfamilies, DLG-like (see DLG1; MIM 601014), ZO1-like (see TJP1; MIM 601009), p55-like (see MPP1; MIM 305360), and LIN2-like (see CASK; MIM 300172), based on their size and the presence of additional domains (Tseng et al., 2001). MPP5 is a member of the p55-like MAGUK subfamily.[supplied by OMIM]
16159	
16160	
16161	
16162	
16163	
16164	
16165	
16166	
16167	
16168	
16169	
16170	
16171	This gene product is a member of a large family of CDC2-related serine/threonine protein kinases. It accumulates primarily in the nucleus. Alternative splice variants encoding different protein isoforms have been described, but their full-length nature has not been determined.
16172	This gene product is a member of a large family of CDC2-related serine/threonine protein kinases. It accumulates primarily in the nucleus. Alternative splice variants encoding different protein isoforms have been described, but their full-length nature has not been determined.
16173	
16174	
16175	
16176	
16177	
16178	
16179	
16180	The sodium-potassium-chloride cotransporter isoform 2 is kidney-specific and is found on the apical membrane of the thick ascending limb of Henle@#$%&s loop and the macula densa. It accounts for most of the NaCl resorption with the stoichiometry of 1Na:1K:2Cl and is sensitive to such diuretics as furosemide and bumetanide. Some Bartter-like syndromes result from defects in this gene.
16181	
16182	
16183	
16184	
16185	
16186	
16187	
16188	This protein belongs to the aldehyde dehydrogenase family of proteins. The product of this gene is an enzyme that catalyzes the synthesis of retinoic acid (RA) from retinaldehyde. Retinoic acid, the active derivative of vitamin A (retinol), is a hormonal signaling molecule that functions in developing and adult tissues. The studies of a similar mouse gene suggest that this enzyme and  the cytochrome CYP26A1, concurrently establish local embryonic retinoic acid levels which facilitate posterior organ development and prevent spina bifida. Three transcript variants encoding distinct isoforms have been identified for this gene.
16189	This gene encodes a protein which contains tetratricopeptide repeats (TPR), similar to O-linked N-acetyglucosamine transferase. Mutations in this gene have been observed in patients with Bardet-Biedl syndrome type 4. The encoded protein may play a role in pigmentary retinopathy, obesity, polydactyly, renal malformation and mental retardation.
16190	
16191	
16192	
16193	
16194	
16195	The protein encoded by this gene belongs to the BCL-2 protein family. BCL-2 family members form hetero- or homodimers and act as anti- or pro-apoptotic regulators that are involved in a wide variety of cellular activities. The proteins encoded by this gene are located at the outer mitochondrial membrane, and have been shown to regulate outer mitochondrial membrane channel (VDAC) opening. VDAC regulates mitochondrial membrane potential, and thus controls the production of reactive oxygen species and release of cytochrome C by mitochondria, both of which are the potent inducers of cell apoptosis. Two alternatively spliced transcript variants, which encode distinct isoforms, have been reported. The longer isoform acts as an apoptotic inhibitor and the shorter form acts as an apoptotic activator.
16196	
16197	This gene encodes a transcription factor that regulates MHC class II genes by binding to a promoter element referred to as an X box. The protein contains two domains commonly found in transcription factors that confer DNA binding and dimerization capability. This gene product was also separately identified as a cellular transcription factor that binds to an enhancer in the promoter of the T cell leukemia virus type 1 promoter. It may increase expression of viral proteins by acting as the DNA binding partner of a viral transactivator. There is also a recent finding that the same agents that cause an ER stress response, the unfolded protein response, also increase transcription of this gene. This increase in transcription requires an ER stress response consensus binding element in the promoter. Adding to the list of functional possibilities is a recent report from studies of knockout mice showing that this gene is essential for hepatocyte growth. A pseudogene has been identified and localized to chromosome 5.
16198	
16199	The protein encoded by this gene is an adaptor protein that mediates the association of the heat shock proteins HSP70 and HSP90. This protein has been shown to be involved in the assembly process of glucocorticoid receptor, which requires the assistance of multiple molecular chaperones. The expression of this gene is reported to be downregulated in colorectal carcinoma tissue suggesting that is a candidate tumor suppressor gene.
16200	Two forms of NADH-cytochrome b5 reductase are known, a membrane-bound form in somatic cells (anchored in the endoplasmic reticulum, mitochondrial and other membranes) and a soluble form in erythrocytes. The former exists mainly on the cytoplasmic side of the endoplasmic reticulum and functions in desaturation and elongation of fatty acids, in cholesterol biosynthesis, and in drug metabolism. The erythrocyte form is located in a soluble fraction of circulating erythrocytes and is involved in methemoglobin reduction. Membrane-bound enzyme consists of 300 amino acid residues having both membrane-binding and catalytic domains. The soluble type consists of 275 amino acids having only the catalytic domain. The 2 forms of the enzyme are coded by the same gene, DIA1. The two forms are thought to be generated from tissue-specific alternative transcripts differing in the first exon. Deficiency of DIA1 has been associated with methemoglobinemias.
16201	
16202	
16203	Glycerol kinase (EC 2.7.1.30) catalyzes the phosphorylation of glycerol by ATP, yielding ADP and glycerol-3-phosphate.[supplied by OMIM]
16204	Glycerol kinase (EC 2.7.1.30) catalyzes the phosphorylation of glycerol by ATP, yielding ADP and glycerol-3-phosphate.[supplied by OMIM]
16205	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a membrane protein which has sequence similarity to the KCNE1 gene product, a member of the potassium channel, voltage-gated, isk-related subfamily. This intronless gene is deleted in AMME contiguous gene syndrome and may be involved in the cardiac and neurologic abnormalities found in the AMME contiguous gene syndrome.
16206	
16207	This gene encodes a protein containing multiple SH3 domains. It has been shown to bind many components of the JNK apoptosis signaling pathway including MLKs, MKKs and JNKs.
16208	
16209	
16210	
16211	
16212	This gene encodes a protein with sequence similarity to the chaperonin family. The encoded protein may have a role in protein processing in limb, cardiac and reproductive system development. Mutations in this gene have been observed in patients with Bardet-Biedl syndrome type 6 and McKusick-Kaufman syndrome. Two transcript variants encoding the same protein have been identified for this gene.
16213	
16214	
16215	
16216	
16217	
16218	
16219	
16220	
16221	
16222	
16223	
16224	
16225	
16226	
16227	
16228	
16229	
16230	
16231	
16232	
16233	
16234	
16235	
16236	
16237	
16238	
16239	
16240	Nop56p is a yeast nucleolar protein that is part of a complex with the nucleolar proteins Nop58p and fibrillarin. Nop56p is required for assembly of the 60S ribosomal subunit and is involved in pre-rRNA processing. The protein encoded by this gene is similar in sequence to Nop56p and is also found in the nucleolus. Multiple transcript variants encoding several different isoforms have been found for this gene, but the full-length nature of most of them have not been determined.
16241	
16242	
16243	
16244	
16245	
16246	
16247	
16248	
16249	
16250	
16251	
16252	
16253	
16254	
16255	
16256	
16257	The protein encoded by this gene is a receptor protein tyrosine kinase and type I transmembrane protein that belongs to the ROR subfamily of cell surface receptors.  The protein may be involved in the early formation of the chondrocytes and may be required for cartilage and growth plate development.  Mutations in this gene can cause brachydactyly type B, a skeletal disorder characterized by hypoplasia/aplasia of distal phalanges and nails. In addition, mutations in this gene can cause the autosomal recessive form of Robinow syndrome, which is characterized by skeletal dysplasia with generalized limb bone shortening, segmental defects of the spine, brachydactyly, and a dysmorphic facial appearance.
16258	
16259	
16260	
16261	
16262	
16263	
16264	
16265	
16266	
16267	Members of the Rab protein family are nontransforming monomeric GTP-binding proteins of the Ras superfamily that contain 4 highly conserved regions involved in GTP binding and hydrolysis. Rabs are prenylated, membrane-bound proteins involved in vesicular fusion and trafficking. The mammalian RAB proteins show striking similarities to the S. cerevisiae YPT1 and SEC4 proteins, Ras-related GTP-binding proteins involved in the regulation of secretion.[supplied by OMIM]
16268	
16269	
16270	STXBP6 binds components of the SNARE complex (see MIM 603215) and may be involved in regulating SNARE complex formation (Scales et al., 2002).[supplied by OMIM]
16271	
16272	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem catalytic domains, and thus represents a receptor-type PTP. The extracellular region contains a meprin-A5 antigen-PTP mu (MAM) domain, an Ig-like domain and four fibronectin type III-like repeats. This PTP has been shown to mediate cell-cell aggregation through the interaction with another molecule of this PTP on an adjacent cell. This PTP can interact with scaffolding protein RACK1/GNB2L1, which may be necessary for the downstream signaling in response to cell-cell adhesion.
16273	
16274	Members of the parvin family, including PARVG, are actin-binding proteins associated with focal contacts.[supplied by OMIM]
16275	
16276	
16277	Semaphorins are a large family, including both secreted and membrane associated proteins, many of which have been implicated as inhibitors or chemorepellents in axon pathfinding, fasciculation and branching, and target selection. All semaphorins possess a semaphorin (Sema) domain and a PSI domain (found in plexins, semaphorins and integrins) in the N-terminal extracellular portion. Additional sequence motifs C-terminal to the semaphorin domain allow classification into distinct subfamilies. Results demonstrate that transmembrane semaphorins, like the secreted ones, can act as repulsive axon guidance cues. This gene encodes a class 6 vertebrate transmembrane semaphorin that demonstrates alternative splicing. Six transcript variants have been identified and expression of the distinct encoded isoforms is thought to be regulated in a tissue- and development-dependent manner.
16278	
16279	The SMADIP1 gene (also known as SIP1) is a member of the delta-EF1 (TCF8; MIM 189909)/Zfh1 family of 2-handed zinc finger/homeodomain proteins. SMADIP1 interacts with receptor-mediated, activated full-length SMADs (see MIM 605568).[supplied by OMIM]
16280	
16281	DNAJC6 belongs to the evolutionarily conserved DNAJ/HSP40 family of proteins, which regulate molecular chaperone activity by stimulating ATPase activity. DNAJ proteins may have up to 3 distinct domains: a conserved 70-amino acid J domain, usually at the N terminus, a glycine/phenylalanine (G/F)-rich region, and a cysteine-rich domain containing 4 motifs resembling a zinc-finger domain (Ohtsuka and Hata, 2000).[supplied by OMIM]
16282	This gene encodes a protein that belongs to the serine/threonine protein kinase family. This protein contains a N-terminal protein kinase domain followed by a conserved calmodulin-binding domain with significant similarity to that of death-associated protein kinase 1 (DAPK1), a positive regulator of programmed cell death. Overexpression of this gene was shown to induce cell apoptosis.  It uses multiple polyadenylation sites.
16283	
16284	
16285	
16286	
16287	This gene encodes one of the three subunits of the RNA polymerase III transcription factor complex. This complex plays a central role in transcription initiation by RNA polymerase III on genes encoding tRNA, 5S rRNA, and other small structural RNAs. The gene product belongs to the TF2B family. Three alternatively spliced variants encoding three different isoforms, that function at different promoters transcribed by RNA polymerase III, have been identified. A transcript encoding a fourth isoform has not yet been completely characterized.
16288	
16289	
16290	
16291	Calcineurin plays an important role in the T-cell receptor-mediated signal transduction pathway. The protein encoded by this gene binds specifically to the activated form of calcineurin and inhibits calcineurin-mediated signal transduction. The encoded protein is found in the nucleus and contains a leucine zipper domain as well as several PEST motifs, sequences which confer targeted degradation to those proteins which contain them. At least four alternatively spliced transcripts have been found for this gene, but the full-length nature of most of them has not been determined.
16292	Retinoblastoma (RB) is an embryonic malignant neoplasm of retinal origin. It almost always presents in early childhood and is often bilateral. Spontaneous regression (@#$%&cure@#$%&) occurs in some cases.[supplied by OMIM]
16293	
16294	
16295	This gene encodes the penultimate enzyme of heme biosynthesis, which catalyzes the 6-electron oxidation of protoporphyrinogen IX to form protoporphyrin IX. This protein is a flavoprotein associated with the outer surface of the inner mitochondrial membrane. Mutations in this gene cause variegate porphyria.
16296	
16297	
16298	
16299	This gene belongs to the neuron navigator family and is expressed predominantly in the nervous system. The encoded protein contains coiled-coil domains and a conserved AAA domain characteristic for ATPases associated with a variety of cellular activities. This gene is similar to unc-53, a Caenorhabditis elegans gene involved in axon guidance. The exact function of this gene is not known.
16300	
16301	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This gene is highly expressed in testis and the protein is localized to the nucleus. This gene may play important roles in the mechanisms of cytodifferentiation and/or DNA recombination. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
16302	
16303	Retrovirus receptors allow infection of human and murine cells by various retroviruses. The receptors that have been identified at the molecular level include CD4 (MIM 186940) for human immunodeficiency virus, Rec1 for murine ecotropic virus, and GLVR1 for gibbon ape leukemia virus (see MIM 182090). These 3 proteins show no homology to one another at the DNA or protein level. GLVR1 is a sodium-dependent phosphate symporter.[supplied by OMIM]
16304	This intronless gene is thought to be generated by retrotransposition of a fully processed PCBP-2 mRNA. This gene and PCBP-2 has paralogues PCBP3 and PCBP4 which is thought to arose as a result of duplication events of entire genes. The protein encoded by this gene appears to be multifunctional. It along with PCBP-2 and hnRNPK corresponds to the major cellular poly(rC)-binding proteins. It contains three K-homologous (KH) domains which may be involved in RNA binding. This encoded protein together with PCBP-2 also functions as translational coactivators of poliovirus RNA via a sequence-specific interaction with stem-loop IV of the IRES and promote poliovirus RNA replication by binding to its 5@#$%&-terminal cloverleaf structure. It has also been implicated in translational control of the 15-lipoxygenase mRNA, human Papillomavirus type 16 L2 mRNA, and hepatitis A virus RNA. The encoded protein is also suggested to play a part in formation of a sequence-specific alpha-globin mRNP complex which is associated with alpha-globin mRNA stability.
16305	
16306	Galactocerebrosides are abundant sphingolipids of the myelin membrane of the central nervous system and peripheral nervous system and are also present in small amounts in kidney. The key enzymatic step in the biosynthesis of galactocerebrosides consists of the transfer of galactose to ceramide catalyzed by UDP-galactose ceramide galactosyltransferase (CGT, EC 2.4.1.45). The enzyme encoded by the CGT gene is the first involved in complex lipid biosynthesis in the myelinating oligodendrocyte.[supplied by OMIM]
16307	This intronless gene belongs to the forkhead family of transcription factors which is characterized by a distinct forkhead domain.  The specific function of this gene has not yet been determined; however, it may play a role in tumor formation.
16308	
16309	
16310	Lysophosphatidic acid acyltransferases (EC 2.3.1.51) catalyze the conversion of lysophosphatidic acid (LPA) to phosphatidic acid (PA). LPA and PA are involved in signal transduction and lipid biosynthesis.[supplied by OMIM]
16311	
16312	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. This gene is unique among the beta4GalT genes because it encodes an enzyme that participates both in glycoconjugate and lactose biosynthesis. For the first activity, the enzyme adds galactose to N-acetylglucosamine residues that are either monosaccharides or the nonreducing ends of glycoprotein carbohydrate chains. The second activity is restricted to lactating mammary tissues where the enzyme forms a heterodimer with alpha-lactalbumin to catalyze UDP-galactose + D-glucose <=> UDP + lactose. The two enzymatic forms result from alternate transcription initiation sites and post-translational processing. Two transcripts, which differ only at the 5@#$%& end, with approximate lengths of 4.1 kb and 3.9 kb encode the same protein. The longer transcript encodes the type II membrane-bound, trans-Golgi resident protein involved in glycoconjugate biosynthesis. The shorter transcript encodes a protein which is cleaved to form the soluble lactose synthase.
16313	IDI1 encodes a peroxisomally-localized enzyme that catalyzes the interconversion of isopentenyl diphosphate (IPP) to its highly electrophilic isomer, dimethylallyl diphosphate (DMAPP), which are the substrates for the successive reaction that results in the synthesis of farnesyl diphosphate and, ultimately, cholesterol.  It has been shown in peroxisomal deficiency diseases such as Zellweger syndrome and neonatal adrenoleukodystrophy that there is reduction in IPP isomerase activity.
16314	
16315	This gene encodes a putative retinal G-protein coupled receptor. The gene is a member of the opsin subfamily of the 7 transmembrane, G-protein coupled receptor 1 family. Like other opsins which bind retinaldehyde, it contains a conserved lysine residue in the seventh transmembrane domain. The protein acts as a photoisomerase to catalyze the conversion of all-trans-retinal to 11-cis-retinal. The reverse isomerization occurs with rhodopsin in retinal photoreceptor cells. The protein is exclusively expressed in tissue adjacent to retinal photoreceptor cells, the retinal pigment epithelium and Mueller cells. This gene may be associated with autosomal recessive and autosomal dominant retinitis pigmentosa (arRP and adRP, respectively). Alternative splicing results in multiple transcript variants encoding different isoforms.
16316	This gene produces two transcripts encoding two distinct proteins. One protein is a transcriptional repressor, while the other isoform is a major component of specialized synapses known as synaptic ribbons. Both proteins contain a NAD+ binding domain similar to NAD+-dependent 2-hydroxyacid dehydrogenases. A portion of the 3@#$%& untranslated region was used to map this gene to chromosome 21q21.3; however, it was noted that similar loci elsewhere in the genome are likely. Blast analysis shows that this gene is present on chromosome 10.
16317	The protein encoded by this gene belongs to the eukaryotic diacylglycerol kinase family. It may attenuate protein kinase C activity by regulating diacylglycerol levels in intracellular signaling cascade and signal transduction. Alternative splicing occurs at this locus and three transcript variants encoding three distinct isoforms have been identified.
16318	
16319	
16320	
16321	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
16322	
16323	The protein encoded by this gene is an intermediate filament (IF) family member. IF proteins are cytoskeletal proteins that confer resistance to mechanical stress and are encoded by a dispersed multigene family. This protein has been found to form a linkage between desmin, which is a subunit of the IF network, and the extracellular matrix, and provides an important structural support in muscle. Two alternatively spliced variants encoding different isoforms have been described for this gene.
16324	
16325	This gene was identified by the reactivity of its encoded protein to a monoclonal antibody prepared against brain homogenates from patients with Alzheimer@#$%&s disease. Analysis of the original protein (fetal Alz-50 reactive clone 1, or FAC1), identified as an 810 aa protein containing a DNA-binding domain and a zinc finger motif, suggested it might play a role in the regulation of transcription. High levels of FAC1 were detected in fetal brain and in patients with neurodegenerative diseases. The protein encoded by this gene is actually much larger than originally thought, and it also contains a C-terminal bromodomain characteristic of proteins that regulate transcription during proliferation. The encoded protein is highly similar to the largest subunit of the Drosophila NURF (nucleosome remodeling factor) complex. In Drosophila, the NURF complex, which catalyzes nucleosome sliding on DNA and interacts with sequence-specific transcription factors, is necessary for the chromatin remodeling required for transcription. Two alternative transcripts encoding different isoforms have been described completely.
16326	
16327	
16328	
16329	Immunoglobulins recognize foreign antigens and initiate immune responses such as phagocytosis and the complement system. Each immunoglobulin molecule consists of two identical heavy chains and two identical light chains. There are two classes of light chains, kappa and lambda. This region represents the germline organization of the lambda light chain locus. The locus includes V (variable), J (joining), and C (constant) segments. During B cell development, a recombination event at the DNA level joins a single V segment with a J segment; the C segment is later joined by splicing at the RNA level. Recombination of many different V segments with several J segments provides a wide range of antigen recognition. Additional diversity is attained by junctional diversity, resulting from the random additional of nucleotides by terminal deoxynucleotidyltransferase, and by somatic hypermutation, which occurs during B cell maturation in the spleen and lymph nodes. Several V segments and three C segments are known to be incapable of encoding a protein and are considered pseudogenes. The locus also includes several non-immunoglobulin genes, many of which are pseudogenes or are predicted by automated computational analysis or homology to other species.
16330	The protein encoded by this gene is a member of the serine/threonine protein kinase family. This kinase preferentially phosphorylates and inactivates cell division cycle 2 protein (CDC2), and thus negatively regulates cell cycle G2/M transition. This kinase is associated with the membrane throughout the cell cycle. Its activity is highly regulated during the cell cycle. Protein kinases AKT1/PKB and PLK (Polo-like kinase) have been shown to phosphorylate and regulate the activity of this kinase. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
16331	
16332	
16333	
16334	
16335	
16336	
16337	
16338	
16339	This gene product, grancalcin, is a calcium-binding protein abundant in neutrophils and macrophages. It belongs to the penta-EF-hand subfamily of proteins which includes sorcin, calpain, and ALG-2. Grancalcin localization is dependent upon calcium and magnesium. In the absence of divalent cation, grancalcin localizes to the cytosolic fraction; with magnesium alone, it partitions with the granule fraction; and in the presence of magnesium and calcium, it associates with both the granule and membrane fractions, suggesting a role for grancalcin in granule-membrane fusion and degranulation.
16340	The protein encoded by this gene localizes to focal adhesions, regions of the plasma membrane where the cell attaches to the extracellular matrix. This protein crosslinks actin filaments and contains a Src homology 2 (SH2) domain, which is often found in molecules involved in signal transduction. This protein is a substrate of calpain II. A second transcript from this gene has been described, but its full length nature has not been determined.
16341	
16342	This gene encodes a zinc finger protein that contains a jumonji domain.
16343	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 non-identical kinase catalytic domains and phosphorylates various substrates, including members of the mitogen-activated kinase (MAPK) signalling pathway. The activity of this protein has been implicated in controlling cell growth and differentiation. Mutations in this gene have been associated with Coffin-Lowry syndrome (CLS).
16344	
16345	
16346	
16347	
16348	This gene encodes a member of the polycystin protein family. The encoded protein contains multiple transmembrane domains, and cytoplasmic N- and C-termini. The protein may be an integral membrane protein involved in cell-cell/matrix interactions. The encoded protein may function in renal tubular development, morphology, and function, and may modulate intracellular calcium homoeostasis and other signal transduction pathways. This protein interacts with polycystin 1 to produce cation-permeable currents. Mutations in this gene have been associated with autosomal dominant polycystic kidney disease.
16349	This gene encodes a protein containing 9 WD repeats. WD repeats are approximately 30- to 40-amino acid domains containing several conserved residues, mostly including a trp-asp at the C-terminal end. WD domains are involved in protein-protein interactions. The encoded protein may help induce the disassembly of actin filaments. Two transcript variants encoding different isoforms have been found for this gene.
16350	
16351	
16352	This gene encodes a protein that shares high sequence similarity with ribosomal protein L26. It is not currently known whether the encoded protein is a functional ribosomal protein or whether it has evolved a function that is independent of the ribosome. Transcript variants utilizing alternative polyA signals exist.
16353	
16354	
16355	
16356	HLA-E belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-E binds a restricted subset of peptides derived from the leader peptides of other class I molecules. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail.
16357	
16358	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Alternative splicing of this gene generates at least 3 transcript variants diverging at the 5@#$%& terminus.
16359	
16360	
16361	This gene encodes a protein-disulfide isomerase. Its expression is induced by hypoxia and its role may be to protect hypoxic cells from apoptosis. This gene can be co-transcribed with the upstream gene MU.
16362	This gene belongs to a family of nonclustered, diverged homeobox genes. It is an important regulator of vertebrate limb myogenesis. Mutations in the gene may be involved in craniofacial and/or skeletal abnormalities.
16363	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 100% identical to the rat ortholog. It is induced by growth factors in human vascular smooth muscle cells, and is also highly expressed in skeletal and heart muscles, suggesting an important role for this protein in muscle tissue. It has been shown to interact with RAF1 and protein kinase C, proteins involved in various signal transduction pathways.
16364	
16365	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. The specific function of this protein has not been elucidated. Alternative splice variants have been described but their full length nature has not been determined.
16366	
16367	This gene encodes a member of the patched gene family. The encoded protein is the receptor for sonic hedgehog, a secreted molecule implicated in the formation of embryonic structures and in tumorigenesis. This gene functions as a tumor suppressor. Mutations of this gene have been associated with nevoid basal cell carcinoma syndrome, esophageal squamous cell carcinoma, trichoepitheliomas, transitional cell carcinomas of the bladder, as well as holoprosencephaly. Alternative spliced variants have been described, but their full length sequences have not be determined.
16368	
16369	This gene encodes a member of the annexin family, a group of calcium-dependent phospholipid-binding proteins. Annexins have unique N-terminal domains and conserved C-terminal domains, which contain the calcium-dependent phospholipid-binding sites. The encoded protein is a 56-kD antigen recognized by sera from patients with various autoimmune diseases. Transcript variants encoding the same isoform have been identified.
16370	
16371	
16372	Transcriptional repression is a general mechanism for regulating transcriptional initiation in organisms ranging from yeast to humans.  Accurate initiation of transcription from eukaryotic protein-encoding genes requires the assembly of a large multiprotein complex consisting of RNA polymerase II and general transcription factors such as TFIIA, TFIIB, and TFIID. DR1 is a repressor that interacts with the TATA-binding protein (TBP) of TFIID and prevents the formation of an active transcription complex by precluding the entry of TFIIA and/or TFIIB into the preinitiation complex. The protein encoded by this gene is a corepressor of transcription that interacts with DR1 to enhance DR1-mediated repression. The interaction between this corepressor and DR1 is required for corepressor function and appears to stabilize the TBP-DR1-DNA complex.
16373	This gene encodes a constant regulatory subunit of protein phosphatase 2. Protein phosphatase 2 is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The constant regulatory subunit A serves as a scaffolding molecule to coordinate the assembly of the catalytic subunit and a variable regulatory B subunit. This gene encodes a beta isoform of the constant regulatory subunit A. Defects in this gene could be the cause of some lung and colon cancers. At least two transcript variants encoding different isoforms have been found for this gene.
16374	The protein encoded by this gene is a receptor for interleukin 10. This protein is structurally related to interferon receptors. It has been shown to mediate the immunosuppressive signal of interleukin 10, and thus inhibits the synthesis of proinflammatory cytokines. This receptor is reported to promote survival of progenitor myeloid cells through the insulin receptor substrate-2/PI 3-kinase/AKT pathway. Activation of this receptor leads to tyrosine phosphorylation of JAK1 and TYK2 kinases.
16375	The protein encoded by this gene is an endothelial cell adhesion molecule that interacts preferentially with the leukocyte beta7 integrin LPAM-1 (alpha4beta7), L-selectin, and VLA-4 (alpha4beta1) on myeloid cells to direct leukocytes into mucosal and inflamed tissues. It is a member of the immunoglobulin family and is similar to ICAM1 and VCAM1. At least seven alternatively spliced transcripts encoding different protein isoforms have been found for this gene, but the full-length nature of some variants has not been determined.
16376	Ubiquitin (MIM 191339), a highly conserved protein involved in the regulation of intracellular protein breakdown, cell cycle regulation, and stress response, is released from degraded proteins by disassembly of the polyubiquitin chains. The disassembly process is mediated by ubiquitin-specific proteases (USPs). Also see USP1 (MIM 603478).[supplied by OMIM]
16377	
16378	The protein encoded by this gene is a nuclear transcription factor which binds DNA as a homodimer. The encoded protein controls the expression of several genes, including hepatocyte nuclear factor 1 alpha, a transcription factor which regulates the expression of several hepatic genes. This gene may play a role in development of the liver, kidney, and intestines. Mutations in this gene have been associated with monogenic autosomal dominant non-insulin-dependent diabetes mellitus type I. At least three different transcript variants encoding three different isoforms have been found for this gene.
16379	
16380	
16381	Alpha actinins belong to the spectrin gene superfamily which represents a diverse group of cytoskeletal proteins, including the alpha and beta spectrins and dystrophins.  Alpha actinin is an actin-binding protein with multiple roles in different cell types.  In nonmuscle cells, the cytoskeletal isoform is found along microfilament bundles and adherens-type junctions, where it is involved in binding actin to the membrane.  In contrast, skeletal, cardiac, and smooth muscle isoforms are localized to the Z-disc and analogous dense bodies, where they help anchor the myofibrillar actin filaments.  This gene encodes a nonmuscle, cytoskeletal, alpha actinin isoform and maps to the same site as the structurally similar erythroid beta spectrin gene.
16382	SPRED1 is a member of the Sprouty (see SPRY1; MIM 602465)/SPRED family of proteins that regulate growth factor-induced activation of the MAP kinase cascade (see MAPK1; MIM 176948) (Nonami et al., 2004).[supplied by OMIM]
16383	
16384	
16385	The product of this gene is a subunit of neuronal microtubule-associated MAP1A and MAP1B proteins, which are involved in microtubule assembly and important for neurogenesis. Studies on the rat homolog implicate a role for this gene in autophagy, a process that involves the bulk degradation of cytoplasmic component.
16386	
16387	The protein encoded by this gene is a member of the immunoglobulin superfamily. The protein contains two immunoglobulin domains and thirteen fibronectin type III domains. Fibronectin type III domains are present in both extracellular and intracellular proteins and tandem repeats are known to contain binding sites for DNA, heparin and the cell surface. This protein, and a homologous mouse sequence, are very similar to the Drosophila sidekick gene product but the specific function of this superfamily member is not yet known. Evidence for alternative splicing at this gene locus has been observed but the full-length nature of additional variants has not yet been determined.
16388	
16389	
16390	
16391	
16392	
16393	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. Its protein product is a protease present in seminal plasma. It is thought to function normally in the liquefaction of seminal coagulum, presumably by hydrolysis of the high molecular mass seminal vesicle protein. Serum level of this protein, called PSA in the clinical setting, is useful in the diagnosis and monitoring of prostatic carcinoma. Alternate splicing of this gene generates two transcript variants encoding different isoforms. Additional transcript variants have been described, but it is unclear if these transcripts are normally expressed or if they are specific to benign or malignant tumors.
16394	
16395	This gene belongs to the GPI family whose members encode multifunctional phosphoglucose isomerase proteins involved in energy pathways. The protein encoded by this gene is a dimeric enzyme that catalyzes the reversible isomerization of glucose-6-phosphate and fructose-6-phosphate. The protein functions in different capacities inside and outside the cell. In the cytoplasm, the gene product is involved in glycolysis and gluconeogenesis, while outside the cell it functions as a neurotrophic factor for spinal and sensory neurons. Defects in this gene are the cause of nonspherocytic hemolytic anemia and a severe enzyme deficiency can be associated with hydrops fetalis, immediate neonatal death and neurological impairment.
16396	This gene encodes a member of the Crumbs family of proteins. The encoded transmembrane protein binds to membrane protein, palmitoylated 5 and to PDZ domain protein (Drosophila inaD-like). This protein may play a role in epithelial cell polarity and is associated with tight junctions at the apical surface of epithelial cells. Three transcript variants encoding two distinct isoforms have been identified for this gene.
16397	The protein encoded by this gene is similar to  SCARF1/SREC-I, a scavenger receptor protein that mediates the binding and degradation of acetylated low density lipoprotein (Ac-LDL). This protein has only little activity of internalizing modified low density lipoproteins (LDL), but it can interact with SCARF1 through its extracellular domain. The association of this protein with SCARF1 is suppressed by the presence of scavenger ligands. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
16398	
16399	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional activator after forming a protein complex with other proteins. This protein acts as a nucleocytoplasmic shuttle protein and is important for neural crest and peripheral nervous system development. Mutations in this gene are associated with Waardenburg-Shah and Waardenburg-Hirschsprung disease.
16400	This gene encodes a member of the GRB2/Sem5/Drk family. This member is an adaptor-like protein involved in leukocyte-specific protein-tyrosine kinase signaling. Like its related family member, GRB2-related adaptor protein (GRAP), this protein contains an SH2 domain flanked by two SH3 domains. This protein interacts with other proteins, such as GRB2-associated binding protein 1 (GAB1) and the SLP-76 leukocyte protein (LCP2), through its SH3 domains. Transcript variants utilizing alternative polyA sites exist.
16401	
16402	
16403	
16404	Selenoproteins, such as GPX6, contain the rare twenty-first amino acid, selenocysteine (sec). These proteins lack common amino acid sequence motifs, but the 3-prime untranslated regions of selenoprotein genes have a common stem-loop structure, the sec insertion sequence (SECIS), that is necessary for the recognition of UGA as a sec codon rather than as a stop signal.[supplied by OMIM]
16405	This gene is a member of the cyclophilin family of peptidylprolyl isomerases. The cyclophilins are a highly conserved family, members of which play an important role in protein folding, immunosuppression by cyclosporin A, and infection of HIV-1 virions.
16406	
16407	
16408	
16409	Adducins are a family of cytoskeleton proteins encoded by three genes (alpha, beta, gamma). Adducin is a heterodimeric protein that consists of related subunits, which are produced from distinct genes but share a similar structure. Alpha- and beta-adducin include a protease-resistant N-terminal region and a protease-sensitive, hydrophilic C-terminal region. Alpha- and gamma-adducins are ubiquitously expressed. In contrast, beta-adducin is expressed at high levels in brain and hematopoietic tissues. Adducin binds with high affinity to Ca(2+)/calmodulin and is a substrate for protein kinases A and C. Alternative splicing results in multiple variants encoding distinct isoforms; however, not all variants have been fully described.
16410	
16411	
16412	
16413	
16414	Cytoplasmic ribosomes, organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein, which shares sequence similarity with yeast ribosomal protein L44, belongs to the L44E (L36AE) family of ribosomal proteins. Although this gene has been referred to as ribosomal protein L44 (RPL44), its official name is ribosomal protein L36a (RPL36A). This gene and the human gene officially named ribosomal protein L36a-like (RPL36AL) encode nearly identical proteins; however, they are distinct genes. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
16415	
16416	The ADF (actin-depolymerizing factor)/cofilin family (see MIM 601442) is composed of stimulus-responsive mediators of actin dynamics. ADF/cofilin proteins are inactivated by kinases such as LIM domain kinase-1 (LIMK1; MIM 601329). The SSH family appears to play a role in actin dynamics by reactivating ADF/cofilin proteins in vivo (Niwa et al., 2002).[supplied by OMIM]
16417	This gene encodes a centrosomal protein required for the centrosome to function as a microtubule organizing center. The gene product is also associated with centrosome maturation. One version of stem cell myeloproliferative disorder is the result of a reciprocal translocation between chromosomes 8 and 9, with the breakpoint associated with fibroblast growth factor receptor 1 and centrosomal protein 1.
16418	
16419	
16420	
16421	
16422	
16423	
16424	
16425	V7-3 shows structural characteristics of an Na(+) and Cl(-)-dependent neurotransmitter transporter, including 12 transmembrane (TM) domains, intracellular N and C termini, and large extracellular loops containing multiple N-glycosylation sites.[supplied by OMIM]
16426	
16427	
16428	
16429	
16430	
16431	
16432	HLA-C belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domain, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Over one hundred HLA-C alleles have been described
16433	
16434	
16435	
16436	
16437	
16438	
16439	
16440	
16441	
16442	
16443	
16444	
16445	
16446	
16447	
16448	
16449	
16450	
16451	
16452	
16453	
16454	
16455	This gene encodes a member of a family of tumor antigens. The protein is expressed in chondrosarcomas, but may also be expressed in normal tissues such as testis. Alternative splicing of this gene results in two transcript variants encoding different isoforms. An additional transcript variant has been described, but its biological validity has not been determined.
16456	
16457	
16458	The protein encoded by this gene catalyzes the conversion of (S)-2,3 oxidosqualene to lanosterol. The encoded protein is a member of the terpene cyclase/mutase family and catalyzes the first step in the biosynthesis of cholesterol, steroid hormones, and vitamin D. Two transcript variants encoding the same protein have been found for this gene.
16459	
16460	
16461	
16462	
16463	
16464	
16465	
16466	The protein encoded by this gene is a member of the AKT, also called PKB,  serine/threonine protein kinase family. AKT kinases are known to be regulators of cell signaling in response to insulin and growth factors. They are involved in a wide variety of biological processes including cell proliferation, differentiation, apoptosis, tumorigenesis, as well as glycogen synthesis and glucose uptake. This kinase has been shown to be stimulated by platelet-derived growth factor (PDGF), insulin, and insulin-like growth factor 1 (IGF1). Alternatively splice transcript variants encoding distinct isoforms have been described.
16467	
16468	
16469	
16470	
16471	
16472	The protein encoded by this gene is a cytoplasmic membrane-associated protein that indirectly coordinates endocytic membrane traffic with the actin assembly machinery. In addition, the encoded protein may regulate the formation of clathrin-coated vesicles and could be involved in synaptic vesicle recycling. This protein has been shown to interact with dynamin, CDC42, SNAP23, SNAP25, SPIN90, EPS15, EPN1, EPN2, and STN2. Multiple transcript variants encoding different isoforms have been found for this gene, but the full-length nature of only two of them have been characterized so far.
16473	
16474	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This protein functions as a modifier of endothelial cell migration and proliferation, as well as an angiogenic factor. It acts as a mitogen for a variety of mesoderm- and neuroectoderm-derived cells in vitro, thus is thought to be involved in organogenesis. Three alternatively spliced variants encoding different isoforms have been described.
16475	
16476	
16477	
16478	
16479	This gene encodes a protein which is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes which undergo proteolytic processing at conserved aspartic residues to produce 2 subunits, large and small, that dimerize to form the active enzyme. This protein is processed by caspase APAF1; this step is thought to be one of the earliest in the caspase activation cascade. Alternative splicing of this gene results in two transcript variants which encode different isoforms.
16480	
16481	
16482	
16483	
16484	
16485	CD1D is the sole group-2 member of the CD1 family of major histocompatibility (MHC)-like glycoproteins. See CD1A (MIM 188370) for background on CD1 molecules.[supplied by OMIM]
16486	
16487	
16488	
16489	
16490	
16491	
16492	
16493	
16494	
16495	
16496	
16497	PAFAH1B1 was identified as encoding a gene that when mutated or lost caused the lissencephaly associated with Miller-Dieker lissencephaly syndrome. PAFAH1B1 encodes the non-catalytic alpha subunit of the intracellular Ib isoform of platelet-activating factor acteylhydrolase, a heterotrimeric enzyme that specifically catalyzes the removal of the acetyl group at the SN-2 position of platelet-activating factor (identified as 1-O-alkyl-2-acetyl-sn-glyceryl-3-phosphorylcholine). Two other isoforms of intracellular platelet-activating factor acetylhydrolase exist: one composed of multiple subunits, the other, a single subunit. In addition, a single-subunit isoform of this enzyme is found in serum.
16498	This gene encodes ubiquitin, one of the most conserved proteins known. Ubiquitin is required for ATP-dependent, nonlysosomal intracellular protein degradation of abnormal proteins and normal proteins with a rapid turnover. Ubiquitin is covalently bound to proteins to be degraded, and presumably labels these proteins for degradation. Ubiquitin also binds to histone H2A in actively transcribed regions but does not cause histone H2A degradation, suggesting that ubiquitin is also involved in regulation of gene expression. This gene consists of three direct repeats of the ubiquitin coding sequence with no spacer sequence. Consequently, the protein is expressed as a polyubiquitin precursor with a final amino acid after the last repeat. Aberrant form of this protein has been noticed in patients with Alzheimer@#$%&s and Down syndrome.
16499	
16500	
16501	
16502	Members of the F-box protein family, such as FBXO27, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
16503	One of two major clathrin-associated adaptor complexes, AP-2, is a heterotetramer which is associated with the plasma membrane. This complex is composed of two large chains, a medium chain, and a small chain. This gene encodes the small chain of this complex. Alternative splicing has been observed in this gene and results in two known transcripts.
16504	
16505	
16506	
16507	
16508	HLA-B belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exon 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-B alleles have been described
16509	
16510	
16511	This gene is located in the RH gene locus, between the RHD and RHCE genes. The function of its protein product is unknown; however, its sequence has potential transmembrane domains suggesting that it may be an integral membrane protein. Its position between the RH genes suggests that polymorphisms in this gene may be tightly linked to RH haplotypes and may contribute to selective pressure for or against certain RH haplotypes.
16512	MIG6 is a cytoplasmic protein whose expression is upregulated with cell growth (Wick et al., 1995). It shares significant homology with the protein product of rat gene-33, which is induced during cell stress and mediates cell signaling (Makkinje et al., 2000; Fiorentino et al., 2000).[supplied by OMIM]
16513	Dermatopontin is an extracellular matrix protein with possible functions in cell-matrix interactions and matrix assembly. The protein is found in various tissues and many of its tyrosine residues are sulphated. Dermatopontin is postulated to modify the behavior of TGF-beta through interaction with decorin.
16514	
16515	This gene encodes a member of the ephrin (EPH) family. The ephrins and EPH-related receptors comprise the largest subfamily of receptor protein-tyrosine kinases and have been implicated in mediating developmental events, especially in the nervous system and in erythropoiesis. Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. This gene encodes an EFNA class ephrin.
16516	
16517	
16518	The protein encoded by this gene is a member of the immunoglobulin superfamily. It is a glycosylphosphatidylinositol (GPI)-anchored neuronal membrane protein that functions as a cell adhesion molecule. It may play a role in the formation of axon connections in the developing nervous system. It may also be involved in glial tumorigenesis and may provide a potential target for therapeutic intervention.
16519	The isoprene biosynthetic pathway provides the cell with cholesterol, ubiquinone, dolichol, and other nonsterol metabolites. Farnesylpyrophosphate synthetase (EC 2.5.1.10) catalyzes the formation of both geranyl and farnesylpyrophosphate from isopentenylpyrophosphate and dimethylallyl pyrophosphate.[supplied by OMIM]
16520	This gene encodes UDP-GalNAc transferase 3, a member of the GalNAc-transferases family. This family transfers an N-acetyl galactosamine to the hydroxyl group of a serine or threonine residue in the first step of O-linked oligosaccharide biosynthesis. Individual GalNAc-transferases have distinct activities and initiation of O-glycosylation is regulated by a repertoire of GalNAc-transferases.  The protein encoded by this gene is highly homologous to other family members, however the enzymes have different substrate specificities.
16521	
16522	
16523	
16524	
16525	
16526	
16527	
16528	
16529	
16530	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation.
16531	
16532	
16533	
16534	The product of this gene belongs to the serine/threonine protein kinase family and to the Ca(2+)/calmodulin-dependent protein kinase subfamily. Calcium signaling is crucial for several aspects of plasticity at glutamatergic synapses. In mammalian cells, the enzyme is composed of four different chains: alpha, beta, gamma, and delta. The product of this gene is a delta chain. Four alternatively spliced transcript variants that encode three different isoforms have been characterized to date. Distinct isoforms of this chain have different expression patterns.
16535	
16536	Kynureninase is a pyridoxal-5@#$%&-phosphate (pyridoxal-P) dependent enzyme that catalyses the cleavage of L-kynurenine and L-3-hydroxykynurenine into anthranilic and 3-hydroxyanthranilic acids, respectively. Kynureninase is involved in the biosynthesis of NAD cofactors from tryptophan through the kynurenine pathway.
16537	
16538	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
16539	
16540	
16541	Bin/amphiphysin/Rvs (BAR) domain adaptor proteins, such as BIN3, are named for their unique N-terminal region and appear to integrate signal transduction pathways that regulate membrane dynamics, F-actin cytoskeleton, and nuclear processes.[supplied by OMIM]
16542	Bin/amphiphysin/Rvs (BAR) domain adaptor proteins, such as BIN3, are named for their unique N-terminal region and appear to integrate signal transduction pathways that regulate membrane dynamics, F-actin cytoskeleton, and nuclear processes.[supplied by OMIM]
16543	This gene encodes a member of the patched gene family. The encoded protein is the receptor for sonic hedgehog, a secreted molecule implicated in the formation of embryonic structures and in tumorigenesis. This gene functions as a tumor suppressor. Mutations of this gene have been associated with nevoid basal cell carcinoma syndrome, esophageal squamous cell carcinoma, trichoepitheliomas, transitional cell carcinomas of the bladder, as well as holoprosencephaly. Alternative spliced variants have been described, but their full length sequences have not be determined.
16544	
16545	
16546	
16547	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes a protein which is not a structural subunit, but may be essential for the biogenesis of COX formation and may function in the hydroxylation of heme O, according to the yeast mutant studies. This protein is predicted to contain 5 transmembrane domains localized in the mitochondrial inner membrane. Alternative splicing of this gene generates several transcript variants diverging in the 3@#$%& region including alternate poly A sites. In total, 2 different isoforms are encoded by these variants.
16548	Motor neurons are among the earliest neurons to appear after the commencement of cell patterning and the beginning of cell differentiation. Differentiation occurs in a ventral-to-dorsal gradient and is mediated, at least in part, by the concentration of ventrally expressed sonic hedgehog protein (SHH; MIM 600725). Dorsally expressed factors, such as members of the bone morphogenic protein (e.g., BMP4; MIM 112262) and transforming growth factor-beta (e.g., TGFB1; MIM 190180) families, can repress the induction of these neurons. CRIM1 may interact with growth factors implicated in motor neuron differentiation and survival (Kolle et al., 2000).[supplied by OMIM]
16549	The protein encoded by this gene, a member of the peptidase C1 family, is a lysosomal cysteine proteinase that appears to be a central coordinator for activation of many serine proteinases in immune/inflammatory cells. It is composed of a dimer of disulfide-linked heavy and light chains, both produced from a single protein precursor. It requires chloride ions for activity and can degrade glucagon. Defects in the encoded protein have been shown to be a cause of Papillon-Lefevre syndrome, an autosomal recessive disorder characterized by palmoplantar keratosis and periodontitis. Two transcript variants encoding different isoforms have been found for this gene.
16550	The protein encoded by this gene is a member of a family of proteins that inhibits apoptosis by binding to tumor necrosis factor receptor-associated factors TRAF1 and TRAF2, probably by interfering with activation of ICE-like proteases. This encoded protein inhibits apoptosis induced by serum deprivation and menadione, a potent inducer of free radicals.
16551	
16552	
16553	G protein-coupled receptors (GPCRs), such as GPR44, are integral membrane proteins containing 7 putative transmembrane domains (TMs). These proteins mediate signals to the interior of the cell via activation of heterotrimeric G proteins that in turn activate various effector proteins, ultimately resulting in a physiologic response.[supplied by OMIM]
16554	
16555	
16556	
16557	
16558	
16559	
16560	
16561	
16562	
16563	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. However, the protein encoded by this gene is a member of the membrane-type MMP (MT-MMP) subfamily; each member of this subfamily contains a potential transmembrane domain suggesting that these proteins are expressed at the cell surface rather than secreted.
16564	Hydrolases are transported to lysosomes after binding to mannose 6-phosphate receptors in the trans-Golgi network. This gene encodes the enzyme that catalyzes the second step in the formation of the mannose 6-phosphate recognition marker on lysosomal hydrolases. Commonly known as @#$%&uncovering enzyme@#$%& or UCE, this enzyme removes N-acetyl-D-glucosamine (GlcNAc) residues from GlcNAc-alpha-P-mannose moieties and thereby produces the recognition marker. This reaction most likely occurs in the trans-Golgi network. This enzyme functions as a homotetramer of two disulfide-linked homodimers. In addition to having an N-terminal signal peptide, the protein@#$%&s C-terminus contains multiple signals for trafficking it between lysosomes, the plasma membrane, and trans-Golgi network.
16565	
16566	
16567	
16568	
16569	
16570	The protein encoded by this gene is a member of the STAT protein family. In response to cytokines and growth factors, STAT family members are phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. This protein is activated through phosphorylation in response to various cytokines and growth factors including IFNs, EGF, IL5, IL6, HGF, LIF and BMP2. This protein mediates the expression of a variety of genes in response to cell stimuli, and thus plays a key role in many cellular processes such as cell growth and apoptosis. The small GTPase Rac1 has been shown to bind and regulate the activity of this protein. PIAS3 protein is a specific inhibitor of this protein. Three alternatively spliced transcript variants encoding distinct isoforms have been described.
16571	
16572	The low density lipoprotein receptor (LDLR) gene family consists of cell surface proteins involved in receptor-mediated endocytosis of specific ligands. Low density lipoprotein (LDL) is normally bound at the cell membrane and taken into the cell ending up in lysosomes where the protein is degraded and the cholesterol is made available for repression of microsomal enzyme 3-hydroxy-3-methylglutaryl coenzyme A (HMG CoA) reductase, the rate-limiting step in cholesterol synthesis. At the same time, a reciprocal stimulation of cholesterol ester synthesis takes place. Mutations in this gene cause the autosomal dominant disorder, familial hypercholesterolemia.
16573	
16574	
16575	
16576	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members. The specific function of this protein has not been determined. Two transcript variants encoding distinct isoforms have been identified for this gene.
16577	The minichromosome maintenance protein 3 (MCM3) is one of the MCM proteins essential for the initiation of DNA replication. The protein encoded by this gene is a MCM3 binding protein. It was reported to have phosphorylation-dependent DNA-primase activity, which was up-regulated in antigen immunization induced germinal center. This protein was demonstrated to be an acetyltransferase that acetylates MCM3 and plays a role in DNA replication. The mutagenesis of a nuclear localization signal of MCM3 affects the binding of this protein with MCM3, suggesting that this protein may also facilitate MCM3 nuclear localization.
16578	
16579	
16580	
16581	
16582	
16583	
16584	
16585	
16586	This gene encodes a secreted protein that dorsalizes early vertebrate embryonic tissues by binding to ventralizing TGF-beta-like bone morphogenetic proteins and sequestering them in latent complexes. The encoded protein may also have roles in organogenesis and during adulthood. Multiple transcript variants encoding distinct isoforms have been identified for this gene. Other alternative splice variants have been described but their full length sequence has not been determined.
16587	
16588	
16589	The protein encoded by this gene is a cytoplasmic serine-threonine kinase that contains cysteine in place of the lysine found at the conserved ATP-binding location in subdomain II of protein kinases. Since this protein does have kinase activity, it is possible that another lysine in the kinase subdomain I can substitute for the missing conserved lysine.
16590	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of aminophospholipid-transporting ATPases. The aminophospholipid translocases transport phosphatidylserine and phosphatidylethanolamine from one side of a bilayer to another. Alternatively spliced transcript variants encoding different isoforms have been identified.
16591	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Like most members, the encoded protein contains an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain.
16592	The nuclear import of karyophilic proteins is directed by short amino acid sequences termed nuclear localization signals (NLSs). Karyopherins, or importins, are cytoplasmic proteins that recognize NLSs and dock NLS-containing proteins to the nuclear pore complex. The protein encoded by this gene shares the sequence similarity with Xenopus importin-alpha and Saccharomyces cerevisiae Srp1. This protein is found to interact with the NLSs of DNA helicase Q1 and SV40 T antigen.
16593	
16594	
16595	
16596	
16597	TPIP is a member of a large class of membrane-associated phosphatases with substrate specificity for the 3-position phosphate of inositol phospholipids.[supplied by OMIM]
16598	
16599	
16600	
16601	
16602	
16603	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. This gene encodes the subunit s, also known as factor B, of the proton channel. This subunit is necessary for the energy transduction activity of the ATP synthase complexes. Alternatively spliced transcript variants encoding different isoforms have been identified.
16604	
16605	This gene encodes two isoforms of a receptor for monocyte chemoattractant protein-1, a chemokine which specifically mediates monocyte chemotaxis. Monocyte chemoattractant protein-1 is involved in monocyte infiltration in inflammatory diseases such as rheumatoid arthritis as well as in the inflammatory response against tumors. The receptors encoded by this gene mediate agonist-dependent calcium mobilization and inhibition of adenylyl cyclase. This gene is located in the chemokine receptor gene cluster region. Two alternatively spliced transcript variants are expressed by the gene.
16606	
16607	
16608	
16609	
16610	
16611	
16612	
16613	
16614	
16615	
16616	
16617	
16618	
16619	
16620	
16621	
16622	
16623	
16624	
16625	
16626	
16627	
16628	
16629	
16630	
16631	
16632	
16633	
16634	
16635	
16636	
16637	
16638	
16639	
16640	
16641	
16642	
16643	
16644	
16645	
16646	
16647	
16648	
16649	
16650	
16651	
16652	This gene encodes a protein for a voltage-gated potassium channel required for the repolarization phase of the cardiac action potential. The gene product can form heteromultimers with two other potassium channel proteins, KCNE1 and KCNE3. Mutations in this gene are associated with hereditary long QT syndrome, Romano-Ward syndrome, Jervell and Lange-Nielsen syndrome and familial atrial fibrillation. The gene is located in a region of chromosome 11 that contains a large number of contiguous genes that are abnormally imprinted in cancer and the Beckwith-Wiedemann syndrome. Three alternative transcripts encoding distinct isoforms have been described.
16653	This gene encodes a protein for a voltage-gated potassium channel required for the repolarization phase of the cardiac action potential. The gene product can form heteromultimers with two other potassium channel proteins, KCNE1 and KCNE3. Mutations in this gene are associated with hereditary long QT syndrome, Romano-Ward syndrome, Jervell and Lange-Nielsen syndrome and familial atrial fibrillation. The gene is located in a region of chromosome 11 that contains a large number of contiguous genes that are abnormally imprinted in cancer and the Beckwith-Wiedemann syndrome. Three alternative transcripts encoding distinct isoforms have been described.
16654	
16655	
16656	The protein encoded by this gene is a transcriptional coactivator that can interact with nuclear hormone receptors to enhance their transcriptional activator functions. The encoded protein has been shown to be involved in the hormone-dependent coactivation of several receptors, including prostanoid, retinoid, vitamin D3, thyroid hormone, and steroid receptors. The encoded protein may also act as a general coactivator since it has been shown to interact with some basal transcription factors, histone acetyltransferases, and methyltransferases.
16657	
16658	This gene encodes a large, transmembrane receptor protein which may function in angiogenesis, lymphocyte homing, cell adhesion, or receptor scavenging. The protein contains 7 fasciclin, 16 epidermal growth factor (EGF)-like, and 2 laminin-type EGF-like domains as well as a C-type lectin-like hyaluronan-binding Link module. The protein is primarily expressed on sinusoidal endothelial cells of liver, spleen, and lymph node. The receptor has been shown to endocytose ligands such as low density lipoprotein, Gram-positive and Gram-negative bacteria, and advanced glycosylation end products. Supporting its possible role as a scavenger receptor, the protein rapidly cycles between the plasma membrane and early endosomes.
16659	
16660	
16661	Eukaryotes, in contrast to prokaryotes, contain more than one DNA ligase, and these enzymes have distinct roles in DNA metabolism.  Five DNA ligase activities, I-V, have been purified from mammalian cell extracts.  Ligase III is more closely related to the DNA ligase encoded by pox viruses, than to replicative DNA ligases, such as mammalian DNA ligase 1, and thus may be involved in DNA repair and recombination.  Two biochemically distinct isoforms of ligase III, alpha and beta, encoding polypeptides with different C-terminal amino acids have been identified, and result from alternative splicing of precursor mRNA.
16662	
16663	
16664	
16665	
16666	
16667	
16668	
16669	Alpha actinins belong to the spectrin gene superfamily which represents a diverse group of cytoskeletal proteins, including the alpha and beta spectrins and dystrophins.  Alpha actinin is an actin-binding protein with multiple roles in different cell types.  In nonmuscle cells, the cytoskeletal isoform is found along microfilament bundles and adherens-type junctions, where it is involved in binding actin to the membrane.  In contrast, skeletal, cardiac, and smooth muscle isoforms are localized to the Z-disc and analogous dense bodies, where they help anchor the myofibrillar actin filaments.  This gene encodes a muscle-specific, alpha actinin isoform that is expressed in both skeletal and cardiac muscles.  Transcript variants resulting from the use of multiple poly_A sites have been observed.
16670	The obscurin gene spans more than 150 kb, contains over 80 exons and encodes a protein of approximately 720 kDa. The encoded protein contains 68 Ig domains, 2 fibronectin domains, 1 calcium/calmodulin-binding domain, 1 RhoGEF domain with an associated PH domain, and 2 serine-threonine kinase domains. This protein belongs to the family of giant sacromeric signaling proteins that includes titin and nebulin, and may have a role in the organization of myofibrils during assembly and may mediate interactions between the sarcoplasmic reticulum and myofibrils. While several transcript variants likely exist for this gene, the full-length nature of only one has been described to date.
16671	This gene encodes fibronectin, a glycoprotein present in a soluble dimeric form in plasma, and in a dimeric or multimeric form at the cell surface and in extracellular matrix. Fibronectin is involved in cell adhesion and migration processes including embryogenesis, wound healing, blood coagulation, host defense, and metastasis. The gene has three regions subject to alternative splicing, with the potential to produce 20 different transcript variants. However, the full-length nature of some variants has not been determined.
16672	
16673	
16674	
16675	
16676	
16677	
16678	
16679	
16680	
16681	
16682	The eukaryotic cell cycle is governed by cyclin-dependent protein kinases (CDKs) whose activities are regulated by cyclins and CDK inhibitors. The 8 species of cyclins reported in mammals, cyclins A through H, share a conserved amino acid sequence of about 90 residues called the cyclin box. The amino acid sequence of cyclin G is well conserved among mammals. The nucleotide sequence of cyclin G1 and cyclin G2 are 53% identical. Unlike cyclin G1, cyclin G2 contains a C-terminal PEST protein destabilization motif, suggesting that cyclin G2 expression is tightly regulated through the cell cycle.
16683	
16684	
16685	
16686	This gene is a member of a group of genes whose transcript levels are increased following stressful growth arrest conditions and treatment with DNA-damaging agents. The protein encoded by this gene responds to environmental stresses by mediating activation of the p38/JNK pathway via MTK1/MEKK4 kinase. The GADD45G is highly expressed in placenta.
16687	
16688	The protein encoded by this gene is one subunit of a cleavage factor required for 3@#$%& RNA cleavage and polyadenylation processing. The interaction of the protein with the RNA is one of the earliest steps in the assembly of the 3@#$%& end processing complex and facilitates the recruitement of other processing factors. The cleavage factor complex is composed of four polypeptides. This gene encodes the 68kD subunit. It has a domain organization reminiscent of spliceosomal proteins.
16689	
16690	
16691	Actins are highly conserved proteins that are involved in various types of cell motility. Polymerization of globular actin (G-actin) leads to a structural filament (F-actin) in the form of a two-stranded helix. Each actin can bind to four others. The protein encoded by this gene belongs to the actin family which is comprised of three main groups of actin isoforms, alpha, beta, and gamma. The alpha actins are found in muscle tissues and are a major constituent of the contractile apparatus. Defects in this gene have been associated with idiopathic dilated cardiomyopathy (IDC) and familial hypertrophic cardiomyopathy (FHC).
16692	
16693	
16694	
16695	
16696	
16697	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a protein that inhibits the activation of the proteasome by the 11S and 19S regulators. Three alternative transcripts encoding two isoforms have been identified for this gene.
16698	Leukaemia inhibitory factor is a cytokine that induces macrophage differentiation. Neurotransmitters and neuropeptides, well known for their role in the communication between neurons, are also capable of activating monocytes and macrophages and inducing chemotaxis in immune cells. LIF signals through different receptors and transcription factors. LIF in conjunction with BMP2 acts in synergy on primary fetal neural progenitor cells to induce astrocytes.
16699	
16700	
16701	
16702	This gene encodes a nuclear protein containing 10 WD repeats. WD repeats are approximately 30- to 40-amino acid domains containing several conserved residues, which usually include a trp-asp at the C-terminal end. Proteins belonging to the WD repeat family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
16703	BPNT1, also called bisphosphate 3-prime-nucleotidase, or BPntase, is a member of a magnesium-dependent phosphomonoesterase family. Lithium, a major drug used to treat manic depression, acts as an uncompetitive inhibitor of BPntase. The predicted human protein is 92% identical to mouse BPntase. BPntase@#$%&s physiologic role in nucleotide metabolism may be regulated by inositol signaling pathways. The inhibition of human BPntase may account for lithium-induced nephrotoxicity.
16704	This gene encodes a member of the eyes absent (EYA) family of proteins. The encoded protein may act as a transcriptional activator and have a role during development. A similar protein in mice can act as a transcriptional activator. Two transcript variants encoding distinct isoforms have been identified for this gene.
16705	
16706	
16707	
16708	The heterotetrameric adaptor protein (AP) complexes sort integral membrane proteins at various stages of the endocytic and secretory pathways. AP4 is composed of 2 large chains, beta-4 (AP4B1; MIM 607245) and epsilon-4 (AP4E1; MIM 607244), a medium chain, mu-4 (AP4M1; MIM 602296), and a small chain, sigma-4 (AP4S1).[supplied by OMIM]
16709	
16710	
16711	
16712	
16713	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 100% identical to the mouse ortholog. It interacts with CDC25 phosphatases, RAF1 and IRS1 proteins, suggesting its role in diverse biochemical activities related to signal transduction, such as cell division and regulation of insulin sensitivity. It has also been implicated in the pathogenesis of small cell lung cancer.
16714	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 100% identical to the mouse ortholog. It interacts with CDC25 phosphatases, RAF1 and IRS1 proteins, suggesting its role in diverse biochemical activities related to signal transduction, such as cell division and regulation of insulin sensitivity. It has also been implicated in the pathogenesis of small cell lung cancer.
16715	
16716	
16717	
16718	
16719	
16720	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are nucleic acid binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It localizes to both the nucleus and the cytoplasm. This protein is implicated in the regulation of mRNA stability. Alternative splicing of this gene results in four transcript variants.
16721	
16722	The protein encoded by this gene is a member of the purine/pyrimidine phosphoribosyltransferase family. This protein is a regulatory allosteric enzyme that catalyzes the first step of de novo purine nucleotide biosynthesis. This gene and PAICS/AIRC, a bifunctional enzyme catalyzing steps six and seven in the purine nucleotide biosynthesis pathway, are located in close proximity on chromosome 4.
16723	
16724	
16725	Dendritic cells (DCs) are potent antigen-presenting cells which are involved in initiation of antigen-specific immune responses. The protein encoded by this gene is thought to be involved in dynamic changes in structure and function of DCs during the maturation processes. It is localized in the nuclear membrane. This gene is found to be intronless.
16726	The protein encoded by this gene belongs to the CDC2-like protein kinase (CLK) family. This protein kinase can interact with and phosphorylate the serine- and arginine-rich (SR) proteins, which are known to play an important role in the formation of spliceosomes, and thus may be involved in the regulation of alternative splicing. Studies in the Israeli sand rat Psammomys obesus suggested that the ubiquitin-like 5 (UBL5/BEACON), a highly conserved ubiquitin-like protein, may interact with and regulate the activity of this kinase. Multiple alternatively spliced transcript variants have been observed, but the full-length natures of which have not yet been determined.
16727	Receptor tyrosine kinases (RTKs) play a key role in the communication of cells with their microenvironment. These molecules are involved in the regulation of cell growth, differentiation and metabolism. The protein encoded by this gene is a RTK that is widely expressed in normal and transformed epithelial cells and is activated by various types of collagen. This protein belongs to a subfamily of tyrosine kinase receptors with a homology region to the Dictyostelium discoideum protein discoidin I in their extracellular domain. Its autophosphorylation is achieved by all collagens so far tested (type I to type VI). In situ studies and Northern-blot analysis showed that expression of this encoded protein is restricted to epithelial cells, particularly in the kidney, lung, gastrointestinal tract, and brain. In addition, this protein is significantly over-expressed in several human tumors from breast, ovarian, esophageal, and pediatric brain. This gene is located on chromosome 6p21.3 in proximity to several HLA class I genes. Three isoforms of this gene are generated by alternative splicing. [PROW]
16728	This intronless gene encodes a 70kDa heat shock protein which is a member of the heat shock protein 70 family. In conjuction with other heat shock proteins, this protein stabilizes existing proteins against aggregation and mediates the folding of newly translated proteins in the cytosol and in organelles. It is also involved in the ubiquitin-proteasome pathway through interaction with the AU-rich element RNA-binding protein 1. The gene is located in the major histocompatibility complex class III region, in a cluster with two closely related genes which encode similar proteins.
16729	
16730	
16731	The HRI gene is localized to 7p22 where its 3@#$%& end slightly overlaps the 3@#$%& end of the gene JTV1. The two genes are transcribed from opposite strands. Studies in rat and rabbit suggest that the HRI gene product phosphorylates the alpha subunit of eukaryotic initiation factor 2. Its kinase activity is induced by low levels of heme and inhibited by the presence of heme.
16732	
16733	
16734	Many polypeptide mitogens, such as basic fibroblast growth factor (FGFB; MIM 134920) and platelet-derived growth factors (MIM 173430, MIM 190040), are active on a wide range of different cell types. In contrast, vascular endothelial growth factor is a mitogen primarily for vascular endothelial cells. It is, however, structurally related to platelet-derived growth factor.[supplied by OMIM]
16735	
16736	
16737	TRAM2 is a component of the translocon, a gated macromolecular channel that controls the posttranslational processing of nascent secretory and membrane proteins at the endoplasmic reticulum (ER) membrane.[supplied by OMIM]
16738	Members of the Ikaros (ZNFN1A1; MIM 603023) family of transcription factors, which includes Pegasus, are expressed in lymphocytes and are implicated in the control of lymphoid development.[supplied by OMIM]
16739	The protein encoded by this gene is a member of the platelet-derived growth factor family. The four members of this family are mitogenic factors for cells of mesenchymal origin and are characterized by a core motif of eight cysteines, seven of which are found in this factor. This gene product only forms homodimers and, therefore, does not dimerize with the other three family members. It differs from alpha and beta members of this family in having an unusual N-terminal domain, the CUB domain. Two splice variants have been identified for this gene.
16740	
16741	Members of the parvin family, including PARVA, are actin-binding proteins associated with focal contacts.[supplied by OMIM]
16742	
16743	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance throughout the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin forms a complex with and functions as a regulatory subunit of CDK4 or CDK6, whose activity is required for cell cycle G1/S transition. This protein has been shown to interact with tumor suppressor protein Rb and the expression of this gene is regulated positively by Rb. Mutations, amplification and overexpression of this gene, which alters cell cycle progression, are observed frequently in a variety of tumors and may contribute to tumorigenesis.
16744	
16745	This gene encodes a receptor protein which belongs to the C-type lectin superfamily. This gene is regulated through the cyclic AMP signaling pathway. The encoded protein binds, internalizes and degrades oxidized low-density lipoprotein. This protein may be involved in the regulation of Fas-induced apoptosis. This protein may play a role as a scavenger receptor. Mutations of this gene have been associated with atherosclerosis, risk of myocardial infarction, and may modify the risk of Alzheimer@#$%&s disease. Alternative transcript variants have been described but their full-length sequence has not been determined.
16746	
16747	The protein encoded by this gene is a member of the serine/threonine kinase family. In response to cellular stress and proinflammatory cytokines, this kinase is activated through its phosphorylation by MAP kinases including MAPK1/ERK, MAPK14/p38-alpha, and MAPK11/p38-beta. In vitro, this kinase phosphorylates heat shock protein HSP27 at its physiologically relevant sites. Two alternately spliced transcript variants of this gene encoding distinct isoforms have been reported.
16748	Cytoplasmic dyneins are large enzyme complexes with a molecular mass of about 1,200 kD. They contain two force-producing heads formed primarily from dynein heavy chains, and stalks linking the heads to a basal domain, which contains a varying number of accessory intermediate chains. The predicted 89-amino acid human protein, cytoplasmic dynein light polypeptide, is 94% similar to Drosophila dlc1. This gene product physically interacts with and inhibits the activity of neuronal nitric oxide synthase (nNOS). It also appears to be one of the more conserved proteins in nature, showing 92% amino acid similarity with the nematode and rat homologs. Binding of this protein destabilizes the nNOS dimer, a conformation necessary for activity, and it may regulate numerous biologic processes through its effects on nitric oxide synthase activity.
16749	
16750	
16751	Otubains are deubiquitylating cysteine proteases (DUBs; see MIM 602519) that belong to the ovarian tumor (OTU) protein superfamily. Like other DUBs, otubains cleave proteins precisely at the ubiquitin (UB; see MIM 191339)-protein bond.[supplied by OMIM]
16752	The protein encoded by this gene belongs to the small GTPase superfamily, Rab family. It is associated with both constitutive and regulated secretory pathways, and may be involved in protein transport.
16753	This intronless gene encodes a member of the MAGE superfamily. The encoded protein is 76% identical to the mouse mage-g1 protein.
16754	
16755	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the peptidase T1A family, that is a 20S core alpha subunit.
16756	
16757	
16758	
16759	
16760	ADP-ribosylation factor 4-like is a member of the ADP-ribosylation factor family of GTP-binding proteins.  ARF4L is closely similar to ARL4 and ARL7 and each has a nuclear localization signal and an unusually high guanine nucleotide exchange rate.
16761	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to the class II histone deacetylase/acuc/apha family. It possesses histone deacetylase activity and represses transcription when tethered to a promoter. It coimmunoprecipitates only with HDAC3 family member and might form multicomplex proteins. It also interacts with myocyte enhancer factor-2 (MEF2) proteins, resulting in repression of MEF2-dependent genes. This gene is thought to be associated with colon cancer. Two transcript variants encoding different isoforms have been found for this gene.
16762	
16763	This gene is one of several Cys-Cys (CC) cytokine genes clustered on the q-arm of chromosome 17. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene displays chemotactic activity for monocytes, lymphocytes, basophils and eosinophils, but not neutrophils. This chemokine plays a role in accumulation of leukocytes during inflammation.  It may also be involved in the recruitment of monocytes into the arterial wall during artherosclerosis.
16764	
16765	
16766	
16767	The protein encoded by this gene has substantial phospholipase activity and may be involved in lipoprotein metabolism and vascular biology. This protein is designated a member of the TG lipase family by its sequence and characteristic lid region which provides substrate specificity for enzymes of the TG lipase family.
16768	This gene product was isolated on the basis of its interaction with BRCA2 and p21 proteins. It is an evolutionarily conserved nuclear protein with multiple interacting domains. The N-terminal half shares moderate homology with regions of calmodulin and M-calpain, suggesting that it may also bind calcium. Functional studies indicate that this protein may be an important cofactor for BRCA2 in tumor suppression, and a modulator of CDK2 kinase activity via p21. Several transcript variants encoding different isoforms have been described for this gene.
16769	
16770	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein.
16771	This gene is one of two Cys-Cys (CC) cytokine genes clustered on the q arm of chromosome 7. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene displays chemotactic activity for normal peripheral blood eosinophils and basophils. The product of this gene is one of three related chemokines that specifically activate chemokine receptor CCR3. This chemokine may contribute to the eosinophil accumulation in atopic diseases.
16772	
16773	The protein encoded by this gene is a serine/threonine protein kinase that interacts with protein kinase C-delta. The encoded protein can also activate NFkappaB and is required for keratinocyte differentiation. This kinase undergoes autophosphorylation.
16774	Deletions of the 22q11.2 have been associated with a wide range of developmental defects (notably DiGeorge syndrome, velocardiofacial syndrome, conotruncal anomaly face syndrome and isolated conotruncal cardiac defects) classified under the acronym CATCH 22. The DGCR2 gene encodes a novel putative adhesion receptor protein, which could play a role in neural crest cells migration, a process which has been proposed to be altered in DiGeorge syndrome.
16775	The protein encoded by this gene is a member of the PP2C family of Ser/Thr protein phosphatases. PP2C family members are known to be negative regulators of cell stress response pathways. This phosphatase can interact with Rho guanine nucleotide exchange factors (PIX), and thus block the effects of p21-activated kinase 1 (PAK), a protein kinase mediating biological effects downstream of Rho GTPases. Calcium/calmodulin-dependent protein kinase II gamma (CAMK2G/CAMK-II) is found to be one of the substrates of this phosphatase. The overexpression of this phosphatase or CAMK2G has been shown to mediate caspase-dependent apoptosis. An alternatively spliced transcript variant has been identified, but its full-length nature has not been determined.
16776	
16777	
16778	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which interacts specifically with hepatitis C virus core protein resulting a change in intracellular location. This gene has a homolog located in the nonrecombining region of the Y chromosome. The protein sequence is 91% identical between this gene and the Y-linked homolog. Alternative splicing of this gene generates 2 transcript variants, which are divergent in the 3@#$%& UTR.
16779	
16780	
16781	
16782	
16783	The scaffolding protein encoded by this gene is the main component of the caveolae plasma membranes found in most cell types. The protein links integrin subunits to the tyrosine kinase FYN, an initiating step in coupling integrins to the Ras-ERK pathway and promoting cell cycle progression. The gene is a tumor suppressor gene candidate and a negative regulator of the Ras-p42/44 MAP kinase cascade. CAV1 and CAV2 are located next to each other on chromosome 7 and express colocalizing proteins that form a stable hetero-oligomeric complex. By using alternative initiation codons in the same reading frame, two isoforms (alpha and beta) are encoded by a single transcript from this gene.
16784	
16785	
16786	
16787	
16788	
16789	This gene encodes a protein of unknown function. Alternatively spliced transcript variants have been identified, but their biological validity has not been determined.
16790	
16791	
16792	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The function of this protein has not been identified. Alternate splicing results in two transcript variants encoding different isoforms.
16793	
16794	
16795	
16796	
16797	
16798	
16799	
16800	
16801	
16802	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L6P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
16803	
16804	
16805	
16806	
16807	
16808	
16809	
16810	
16811	
16812	
16813	
16814	
16815	
16816	A reciprocal translocation between chromosomes 22 and 9 produces the Philadelphia chromosome, which is often found in patients with chronic myelogenous leukemia. The chromosome 22 breakpoint for this translocation is located within the BCR gene. The translocation produces a fusion protein which is encoded by sequence from both BCR and ABL, the gene at the chromosome 9 breakpoint. Although the BCR-ABL fusion protein has been extensively studied, the function of the normal BCR gene product is not clear. The protein has serine/threonine kinase activity and is a GTPase-activating protein for p21rac.
16817	
16818	
16819	
16820	
16821	
16822	
16823	
16824	
16825	
16826	
16827	
16828	The protein encoded by this gene is highly similar to Saccharomyces cerevisiae Rad50, a protein involved in DNA double-strand break repair. This protein forms a complex with MRE11 and NBS1. The protein complex binds to DNA and displays numerous enzymatic activities that are required for nonhomologous joining of DNA ends. This protein, cooperating with its partners, is important for DNA double-strand break repair, cell cycle checkpoint activation, telomere maintenance, and meiotic recombination. Knockout studies of the mouse homolog suggest this gene is essential for cell growth and viability. Two alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported.
16829	
16830	
16831	
16832	
16833	
16834	
16835	
16836	Members of the RAB3 protein family (see RAB3A; MIM 179490) are implicated in Ca(2+)-dependent exocytosis. RAB3GAP, which is involved in regulation of RAB3 activity, is a heterodimeric complex consisting a 130-kD catalytic subunit (MIM 602536) and a 150-kD noncatalytic subunit (Nagano et al., 1998).[supplied by OMIM]
16837	
16838	
16839	
16840	Complex I, or NADH:ubiquinone oxidoreductase, the first multisubunit enzyme complex of the mitochondrial respiratory chain, plays a vital role in cellular ATP production, the primary source of energy for many crucial processes in living cells. It removes electrons from NADH and passes them by a series of different protein-coupled redox centers to the electron acceptor ubiquinone. In well-coupled mitochondria, the electron flux leads to ATP generation via the building of a proton gradient across the inner membrane. Complex I is composed of at least 41 subunits, of which 7 are encoded by the mitochondrial genome (ND1-6, ND4L) and the remainder by nuclear genes (van den Heuvel et al., 1998).[supplied by OMIM]
16841	The protein encoded by this gene contains an ATPase/helicase domain, and thus it belongs to the SWI/SNF family of chromatin remodeling proteins. The mutations of this gene are associated with an X-linked mental retardation (XLMR) syndrome most often accompanied by alpha-thalassemia (ATRX) syndrome. Theses mutations have been shown to cause diverse changes in the pattern of DNA methylation, which may provide a link between chromatin remodeling, DNA methylation, and gene expression in developmental processes. This protein is found to undergo cell cycle-dependent phosphorylation, which regulates its nuclear matrix and chromatin association, and suggests its involvement in the gene regulation at interphase and chromosomal segregation in mitosis. Three alternatively spliced transcript variants encoding distinct isoforms have been reported.
16842	
16843	
16844	
16845	
16846	
16847	
16848	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes the largest non-ATPase subunit of the 19S regulator lid, which is responsible for substrate recognition and binding.
16849	
16850	The protein encoded by this gene is an integral membrane protein and tetrodotoxin-resistant voltage-gated sodium channel subunit. The encoded protein is found primarily in cardiac muscle and is responsible for the initial upstroke of the action potential in an electrocardiogram. Defects in this gene are a cause of long QT syndrome type 3 (LQT3), an autosomal dominant cardiac disease. Alternative splicing results in two transcript variants encoding separate isoforms which differ by a single amino acid. Mutation nomenclature has been assigned with respect to the longer isoform.
16851	
16852	
16853	This gene encodes a nuclear protein that has three zinc fingers at the end of its C-terminal domain, a serine/threonine-rich central region, and an acidic domain lying within the N-terminal region. The zinc fingers of this protein are responsible for the specific DNA binding with the guanine-rich core promoter elements. The central region might be involved in activation or posttranslational regulatory pathways, and the acidic N-terminal domain might play an important role in the process of transcriptional activation. It is capable of activating transcription approximately 4-fold either on homologous or heterologous promoters. The DNA binding and transcriptional activity of this protein, in conjunction with its expression pattern, suggests that this protein may participate in the regulation and/or maintenance of the basal expression of pregnancy-specific glycoprotein genes and possibly other TATA box-less genes. Two transcript variants encoding the same protein have been found for this gene.
16854	
16855	
16856	
16857	There are believed to be over 100 different glycosyltransferases involved in the synthesis of protein-bound and lipid-bound oligosaccharides. N-acetylglucosaminyltransferase III transfers a GlcNAc residue to the beta-linked mannose of the trimannosyl core of N-linked oligosaccharides and produces a bisecting GlcNAc. Expression of this gene may be controlled by a multiple-promoter system.
16858	This gene encodes a protein containing domains that associate it with the peptidase family C19, also known as family 2 of ubiquitin carboxyl-terminal hydrolases. Family members function as deubiquitinating enzymes, recognizing and hydrolyzing the peptide bond at the C-terminal glycine of ubiquitin. Enzymes in peptidase family C19 are involved in the processing of poly-ubiquitin precursors as well as that of ubiquitinated proteins.
16859	
16860	This gene encodes the third of three enzymes that posttranslationally modify isoprenylated C-terminal cysteine residues in certain proteins and target those proteins to the cell membrane. This enzyme localizes to the endoplasmic reticulum. Alternative splicing results in two transcript variants encoding different isoforms.
16861	Nucleocytoplasmic transport, a signal- and energy-dependent process, takes place through nuclear pore complexes embedded in the nuclear envelope. The import of proteins containing a nuclear localization signal (NLS) requires the NLS import receptor, a heterodimer of importin alpha and beta subunits also known as karyopherins. Importin alpha binds the NLS-containing cargo in the cytoplasm and importin beta docks the complex at the cytoplasmic side of the nuclear pore complex. In the presence of nucleoside triphosphates and the small GTP binding protein Ran, the complex moves into the nuclear pore complex and the importin subunits dissociate. Importin alpha enters the nucleoplasm with its passenger protein and importin beta remains at the pore. The protein encoded by this gene is a member of the importin alpha family.
16862	Galactose-1-phosphate uridyl transferase (GALT) catalyzes the second step of the Leloir pathway of galactose metabolism, namely the conversion of UDP-glucose + galactose-1-phosphate to glucose-1-phosphate + UDP-galactose. The absence of this enzyme results in classic galactosemia in humans and can be fatal in the newborn period if lactose is not removed from the diet. The pathophysiology of galactosemia has not been clearly defined. Alternative splicing has been observed at this locus and three transcript variants, each encoding a distinct isoform, have been identified.
16863	
16864	
16865	
16866	
16867	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation.
16868	
16869	
16870	This gene belongs to the forkhead family of transcription factors which is characterized by a distinct forkhead domain. The specific function of this gene has not yet been determined; however, it is possible that this gene plays an important role in the development of the cochlea and vestibulum, as well as embryogenesis. Mutations in this gene may be associated with the common cavity phenotype. Two transcript variants encoding different isoforms have been found for this gene.
16871	In the mouse, Nkd is a Dishevelled (see DVL1; MIM 601365)-binding protein that functions as a negative regulator of the Wnt (see WNT1; MIM 164820)-beta-catenin (see MIM 116806)-Tcf (see MIM 602272) signaling pathway.[supplied by OMIM]
16872	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains at least 9 tandem leucine-rich repeats.
16873	
16874	
16875	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S16P family. The encoded protein is one of the most highly conserved ribosomal proteins between mammalian and yeast mitochondria. Three pseudogenes (located at 8q21.3, 20q13.32, 22q12-q13.1) for this gene have been described.
16876	This gene is specifically expressed in a subpopulation of neuroendocrine cells, and is upregulated by nerve growth factor. The structural organization of this gene is similar to that of the rat gene, and both the translated and the untranslated regions show a high degree of sequence similarity to the rat gene. The encoded secretory protein also shares similarities with the secretogranin/chromogranin family, however, its exact function is not known.
16877	This gene encodes a member of the aldo/keto reductase superfamily, which consists of more than 40 known enzymes and proteins. This member can efficiently reduce aliphatic and aromatic aldehydes, and it is less active on hexoses. It is highly expressed in adrenal gland, small intestine, and colon, and may play an important role in liver carcinogenesis.
16878	
16879	
16880	Intercellular communication is often mediated by receptors on the surface of one cell that recognize and are activated by specific protein ligands released by other cells. Members of one class of cell surface receptors, receptor tyrosine kinases (RTKs), are characterized by having a cytoplasmic domain containing intrinsic tyrosine kinase activity. This kinase activity is regulated by the binding of a cognate ligand to the extracellular portion of the receptor. DeChiara et al. (1996) noted that the RTKs, known to be expressed in cell type-specific fashions, play a role critical for the growth and differentiation of those cell types. For example, members of the neural-specific TRK family that recognize nerve growth factor are absolutely required for the survival and development of discrete neuronal subpopulations, and the receptor tyrosine kinases TIE1 (MIM 600222) and TIE2 (MIM 600221) play a critical role in the development of normal blood vessels.[supplied by OMIM]
16881	GPR120 is a member of the rhodopsin family of G protein-coupled receptors (GPRs) (Fredriksson et al., 2003).[supplied by OMIM]
16882	
16883	The protein encoded by this gene is a member of the SWI/SNF family of proteins, whose members display helicase and ATPase activities and which are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI and contains a predicted leucine zipper motif typical of many transcription factors. Two transcript variants encoding different isoforms have been found for this gene.
16884	Tryptophan hydroxylase (TPH; EC 1.14.16.4) is the rate-limiting enzyme in the synthesis of serotonin (5-hydroxytryptamine, or 5HT). 5HT is causally involved in numerous central nervous activities, and it has several functions in peripheral tissues, including the maintenance of vascular tone and gut motility.[supplied by OMIM]
16885	This gene encodes a member of the family of voltage-gated potassium (Kv) channel-interacting proteins (KCNIPs), which belongs to the recoverin branch of the EF-hand superfamily. Members of the KCNIP family are small calcium binding proteins. They all have EF-hand-like domains, and differ from each other in the N-terminus. They are integral subunit components of native Kv4 channel complexes. They may regulate A-type currents, and hence neuronal excitability, in response to changes in intracellular calcium. Multiple alternatively spliced transcript variants encoding distinct isoforms have been identified from this gene.
16886	This gene encodes one family member of vacuolar protein sorting 10 (VPS10) domain-containing receptor proteins. The VPS10 domain name comes from the yeast carboxypeptidase Y sorting receptor Vps10 protein. Members of this gene family are large with many exons but the CDS lengths are usually less than 3700 nt. Very large introns typically separate the exons encoding the VPS10 domain; the remaining exons are separated by much smaller-sized introns. These genes are strongly expressed in the central nervous system. Two of the five family members (sortilin and sortilin-related receptor) are synthesized as preproproteins; it is not yet known if this encoded protein is also a preproprotein.
16887	The protein encoded by this gene is a member of the BAG1-related protein family. BAG1 is an anti-apoptotic protein that functions through interactions with a variety of cell apoptosis and growth related proteins including BCL-2, Raf-protein kinase, steroid hormone receptors, growth factor receptors and members of the heat shock protein 70 kDa family. This protein contains a BAG domain near the C-terminus, which could bind and inhibit the chaperone activity of Hsc70/Hsp70. Three transcript variants encoding two different isoforms have been found for this gene.
16888	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This endoplasmic reticulum protein participates in the synthesis of cholesterol by catalyzing the removal of the 14alpha-methyl group from lanosterol. Homologous genes are found in all three eukaryotic phyla, fungi, plants, and animals, suggesting that this is one of the oldest cytochrome P450 genes.
16889	
16890	
16891	
16892	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme is closely related to a stimulator of iron transport (SFT), and is up-regulated in hereditary hemochromatosis. It also functions in the ubiquitination of the tumor-suppressor protein p53 and the hypoxia-inducible transcription factor HIF1alpha by interacting with the E1 ubiquitin-activating enzyme and the E3 ubiquitin-protein ligases.
16893	This gene is located within the Smith-Magenis syndrome region on chromosome 17. It encodes a protein of unknown function.
16894	
16895	
16896	Voltage-dependent calcium channels mediate the entry of calcium ions into excitable cells, and are also involved in a variety of calcium-dependent processes, including muscle contraction, hormone or neurotransmitter release, and gene expression. Calcium channels are multisubunit complexes composed of alpha-1, beta, alpha-2/delta, and gamma subunits. The channel activity is directed by the pore-forming alpha-1 subunit, whereas, the others act as auxiliary subunits regulating this activity. The distinctive properties of the calcium channel types are related primarily to the expression of a variety of alpha-1 isoforms, alpha-1A, B, C, D, E, and S. This gene encodes the alpha-1A subunit, which is predominantly expressed in neuronal tissue. Mutations in this gene are associated with 2 neurologic disorders, familial hemiplegic migraine and episodic ataxia 2. This gene also exhibits polymorphic variation due to (CAG)n-repeats. Multiple transcript variants have been described, however, the full-length nature of not all is known. In one set of transcript variants, the (CAG)n-repeats occur in the 3@#$%& UTR, and are not associated with any disease. But in another set of variants, an insertion extends the coding region to include the (CAG)n-repeats which encode a polyglutamine tract. Expansion of the (CAG)n-repeats from the normal 4-16 to 21-28 in the coding region is associated with spinocerebellar ataxia 6.
16897	
16898	The protein encoded by this gene is a type II transmembrane serine proteinase that is found attached to the cell surface. The encoded protein may be involved in matrix remodeling processes in the liver.
16899	
16900	
16901	
16902	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, and it has a homolog on the X chromosome. The gene mutation causes male infertility, Sertoli cell-only syndrome or severe hypospermatogenesis, suggesting that this gene plays a key role in the spermatogenic process. Alternative splicing of this gene generates 2 transcripts, which differ only in the length of the 3@#$%& UTR.
16903	
16904	
16905	
16906	
16907	
16908	
16909	
16910	The protein encoded by this gene is a type I transmembrane protein and is a tumor-specific endothelial marker that has been implicated in colorectal cancer. This protein has been shown to also be a docking protein or receptor for Bacillus anthracis toxin, the causative agent of the disease, anthrax. The binding of the protective antigen (PA) component, of the tripartite anthrax toxin, to this receptor protein mediates delivery of toxin components to the cytosol of cells. Once inside the cell, the other two components of anthrax toxin, edema factor (EF) and lethal factor (LF) disrupt normal cellular processes. Three alternatively spliced variants have been described.
16911	There are at least four distinct but related alkaline phosphatases: intestinal, placental, placental-like, and liver/bone/kidney (tissue non-specific). The first three are located together on chromosome 2 while the tissue non-specific form is located on chromosome 1. The exact physiological function of the alkaline phosphatases is not known. The product of this gene is a membrane bound glycosylated enzyme, localized to testis, thymus and certain germ cell tumors, that is closely related to both the placental and intestinal forms of alkaline phosphatase.
16912	This gene encodes a type I membrane protein that is a member of the immunoglobulin superfamily. This protein is expressed by antigen-presenting cells, and it is the ligand for two proteins at the cell surface of T cells, CD28 antigen and cytotoxic T-lymphocyte-associated protein 4. Binding of this protein with CD28 antigen is a costimulatory signal for activation of the T-cell. Binding of this protein with cytotoxic T-lymphocyte-associated protein 4 negatively regulates T-cell activation and diminishes the immune response. Alternative splicing results in two transcript variants encoding different isoforms. Additional transcript variants have been described, but their full-length sequences have not been determined.
16913	
16914	MYL3 encodes myosin light chain 3, an alkali light chain also referred to in the literature as both the ventricular isoform and the slow skeletal muscle isoform. Mutations in MYL3 have been identified as a cause of mid-left ventricular chamber type hypertrophic cardiomyopathy.
16915	
16916	
16917	
16918	
16919	
16920	Mutations in this gene are associated with craniosynostosis and enlarged parietal foramina. Wildtype levels of this gene activity may establish a balance between survival and apoptosis of neural crest-derived cells required for proper craniofacial morphogenesis. This protein promotes cell growth under certain conditions and may be an important target for the RAS signaling pathways.
16921	Mutations in this gene are associated with craniosynostosis and enlarged parietal foramina. Wildtype levels of this gene activity may establish a balance between survival and apoptosis of neural crest-derived cells required for proper craniofacial morphogenesis. This protein promotes cell growth under certain conditions and may be an important target for the RAS signaling pathways.
16922	
16923	Thyroid hormone receptors are hormone-dependent transcription factors that regulate expression of a variety of specific target genes. The protein encoded by this gene binds thyroid hormone receptor beta, but only in the presence of thyroid hormone. The encoded protein, a member of the HMGN protein family, is thought to reduce the compactness of the chromatin fiber in nucleosomes, thereby enhancing transcription from chromatin templates. Two transcript variants encoding different isoforms have been found for this gene.
16924	
16925	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
16926	
16927	NIPSNAP3B belongs to a family of proteins with putative roles in vesicular trafficking (Buechler et al., 2004).[supplied by OMIM]
16928	
16929	
16930	
16931	
16932	This gene encodes a member of the nucleosome assembly protein (NAP) family which can interact with both core and linker histones. It can shuttle between the cytoplasm and nucleus, suggesting a role as a histone chaperone. This gene is one of several located near the imprinted gene domain of 11p15.5, an important tumor-suppressor gene region. Alterations in this region have been associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocortical carcinoma, and lung, ovarian, and breast cancer.
16933	
16934	
16935	
16936	The protein encoded by this gene is a member of the ETS-domain transcription factor family and the ternary complex factor (TCF) subfamily. Proteins in this subfamily regulate transcription when recruited by serum response factor to bind to serum response elements. This protein is activated by signal-induced phosphorylation; studies in rodents suggest that it is a transcriptional inhibitor in the absence of Ras, but activates transcription when Ras is present.
16937	This gene encodes the ligand of the tyrosine-kinase receptor encoded by the KIT locus. This ligand is a pleiotropic factor that acts in utero in germ cell and neural cell development, and hematopoiesis, all believed to reflect a role in cell migration. In adults, it functions pleiotropically, while mostly noted for its continued requirement in hematopoiesis. Two transcript variants encoding different isoforms have been found for this gene.
16938	This gene encodes a member of the arm-repeat (armadillo) and plakophilin gene families. Plakophilin proteins contain numerous armadillo repeats, localize to cell desmosomes and nuclei, and participate in linking cadherins to intermediate filaments in the cytoskeleton. This gene product may regulate the signaling activity of beta-catenin. Two alternately spliced transcripts encoding two protein isoforms have been identified. A processed pseudogene with high similarity to this locus has been mapped to chromosome 12p13.
16939	
16940	The function of this gene is not known, but its existence is supported by mRNAs and EST data. A similar gene in mice was found to be expressed preferentially in hematopoietic progenitors and mature blood cells. Two alternatively spliced transcript variants encoding the same protein have been noted for this gene.
16941	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This phosphoprotein localizes to nuclear bodies where it functions as a transcription factor and tumor suppressor. Its expression is cell-cycle related and it regulates the p53 response to oncogenic signals. The gene is often involved in the translocation with the retinoic acid receptor alpha gene associated with acute promyelocytic leukemia (APL). Extensive alternative splicing of this gene results in several variations of the protein@#$%&s central and C-terminal regions; all variants encode the same N-terminus. Fourteen transcript variants for this gene have been described; however, the full length nature of three of the variants has not been determined.
16942	
16943	
16944	This gene encodes a member of the OSM9-like transient receptor potential channel (OTRPC) subfamily in the transient receptor potential (TRP) superfamily of ion channels. The encoded protein is a Ca2+-permeable, nonselective cation channel that is thought to be involved in the regulation of systemic osmotic pressure. Two transcript variants encoding different isoforms have been found for this gene.
16945	
16946	
16947	This gene is one of several cytokine genes clustered on the q-arm of chromosome 17. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene displays chemotactic activity on resting T lymphocytes and monocytes, lower activity on neutrophils and no activity on activated T lymphocytes. The protein is also a strong suppressor of colony formation by a multipotential hematopoietic progenitor cell line. In addition, the product of this gene is a potent agonist at CC chemokine receptor 1. Two alternatively spliced variants encoding different active isoforms have been identified.
16948	
16949	
16950	
16951	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S19E family of ribosomal proteins. It is located in the cytoplasm. Mutations in this gene cause Diamond-Blackfan anemia (DBA), a constitutional erythroblastopenia characterized by absent or decreased erythroid precursors, in a subset of patients. This suggests a possible extra-ribosomal function for this gene in erythropoietic differentiation and proliferation, in addition to its ribosomal function. Higher expression levels of this gene in some primary colon carcinomas compared to matched normal colon tissues has been observed. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
16952	
16953	
16954	
16955	
16956	
16957	N-ethylmaleimide-sensitive factor (NSF) and valosin-containing protein (p97) are two ATPases known to be involved in transport vesicle/target membrane fusion and fusions between membrane compartments. A trimer of the protein encoded by this gene binds a hexamer of cytosolic p97 and is required for p97-mediated regrowth of Golgi cisternae from mitotic Golgi fragments. Multiple transcript variants encoding several different isoforms have been found for this gene.
16958	
16959	The protein encoded by this gene acts as a heterodimer with kinesin family member 3A to aid in chromosome movement during mitosis and meiosis. The encoded protein is a plus end-directed microtubule motor and can interact with the SMC3 subunit of the cohesin complex. In addition, the encoded protein may be involved in the intracellular movement of membranous organelles. This protein and kinesin family member 3A form the kinesin II subfamily of the kinesin superfamily.
16960	The protein encoded by this gene is a member of the heat shock protein 70 family and is found associated with microsomes. Members of this protein family play a role in the processing of cytosolic and secretory proteins, as well as in the removal of denatured or incorrectly-folded proteins. The encoded protein contains an ATPase domain and has been shown to associate with a ubiquitin-like protein.
16961	
16962	Present mainly in the mitochondrial compartment of peripheral tissues, PBR interacts with some benzodiazepines and has different affinities than its endogenous counterpart. PBR appears to be a key factor in the flow of cholesterol into mitochondria to permit the initiation of steroid hormone synthesis.  It is speculated that patients with congenital lipoid adrenal hyperplasia, who cannot make any steroids, might have a genetic lesion in BZRP.  A short form, PBR-S is also expressed in the same tissues, but at a level about ten times that of PBR.
16963	The RP2 locus has been implicated as one cause of X-linked retinitis pigmentosa. The predicted gene product shows homology with human cofactor C, a protein involved in the ultimate step of beta-tubulin folding. Progressive retinal degeneration may therefore be due to the accumulation of incorrectly-folded photoreceptor or neuron-specific tubulin isoforms followed by progressive cell death
16964	
16965	Human chromosomal region Xp22.3-p21.3 comprises the area between the pseudoautosomal boundary and the Duchenne muscular dystrophy gene (MIM 300377). This region harbors several disease loci, including OFD1 (MIM 311200), CFNS (MIM 304110), DFN6 (MIM 300066), and SEDT (MIM 313400). It also contains a region of homology with both the short and the long arms of the Y chromosome and undergoes frequent chromosomal rearrangements.[supplied by OMIM]
16966	
16967	This gene encodes a mitochondrial transcription factor that is a key activator of mitochondrial transcription as well as a participant in mitochondrial genome replication. Studies in mice have demonstrated that this gene product is required to regulate the mitochondrial genome copy number and is essential for embryonic development. A mouse model for Kearns-Sayre syndrome was produced when expression of this gene was eliminated by targeted disruption in heart and muscle cells.
16968	
16969	
16970	
16971	
16972	
16973	
16974	Von Hippel-Lindau syndrome (VHL) is a dominantly inherited familial cancer syndrome predisposing to a variety of malignant and benign tumors. A germline mutation of this gene is the basis of familial inheritance of VHL syndrome. The protein encoded by this gene is a component of the protein complex that includes elongin B, elongin C, and cullin-2, and possesses ubiquitin ligase E3 activity. This protein is involved in the ubiquitination and degradation of hypoxia-inducible-factor (HIF), which is a transcription factor that plays a central role in the regulation of gene expression by oxygen. RNA polymerase II subunit POLR2G/RPB7 is also reported to be a target of this protein. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
16975	The product of this gene belongs to the phosphatase 2A regulatory subunit B family. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes an alpha isoform of the regulatory subunit B56 subfamily.
16976	
16977	
16978	
16979	
16980	
16981	
16982	NRP1 is a membrane-bound coreceptor to a tyrosine kinase receptor for both vascular endothelial growth factor (VEGF; MIM 192240) and semaphorin (see SEMA3A; MIM 603961) family members. NRP1 plays versatile roles in angiogenesis, axon guidance, cell survival, migration, and invasion.[supplied by OMIM]
16983	The protein encoded by this gene contains a leucine-rich repeat (LRR). It specifically interacts with TNFRSF9/4-1BB, a member of the tumor necrosis factor receptor (TNFR) superfamily. Overexpression of this gene suppresses the activation of NF-kappa B induced by TNFRSF9 or TNF receptor-associated factor 2 (TRAF2), which suggests that this protein is a negative regulator of TNFRSF9-mediated signaling cascades. At least three alternatively spliced transcript variants encoding distinct isoforms have been observed.
16984	RICS is a neuron-associated GTPase-activating protein that may regulate dendritic spine morphology and strength by modulating Rho GTPase (see RHOA; MIM 165390) activity (Okabe et al., 2003 [PubMed 12531901]).[supplied by OMIM]
16985	
16986	
16987	The protein encoded by this gene is a member of the interleukin 1 receptor family. This protein is similar to the interleukin 1 accessory proteins, and is most closely related to interleukin 1 receptor accessory protein-like 1 (IL1RAPL1). This gene and IL1RAPL1 are located at a region on chromosome X that is associated with X-linked non-syndromic mental retardation.
16988	Autocrine motility factor is a tumor motility-stimulating protein secreted by tumor cells. The protein encoded by this gene is a glycosylated transmembrane protein and a receptor for autocrine motility factor. The receptor, which shows some sequence similarity to tumor protein p53, is localized to the leading and trailing edges of carcinoma cells. Two transcript variants encoding different isoforms have been found for this gene.
16989	
16990	This gene encodes a member of the nucleoporin family that shares 87% sequence identity with rat nucleoporin p58. The protein is localized to the nuclear rim and is a component of the nuclear pore complex (NPC). All molecules entering or leaving the nucleus either diffuse through or are actively transported by the NPC. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
16991	
16992	
16993	
16994	
16995	
16996	
16997	
16998	
16999	
17000	
17001	
17002	
17003	
17004	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
17005	
17006	
17007	
17008	
17009	The sodium-potassium-chloride cotransporter isoform 2 is kidney-specific and is found on the apical membrane of the thick ascending limb of Henle@#$%&s loop and the macula densa. It accounts for most of the NaCl resorption with the stoichiometry of 1Na:1K:2Cl and is sensitive to such diuretics as furosemide and bumetanide. Some Bartter-like syndromes result from defects in this gene.
17010	
17011	
17012	
17013	
17014	
17015	
17016	
17017	
17018	This gene encodes a highly efficient N-acetylgalactosamine (GalNAc) kinase, which has galactokinase activity when galactose is present at high concentrations. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
17019	
17020	
17021	
17022	
17023	
17024	
17025	
17026	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are produced by RNA polymerase II and they form a component of the heterogeneous nuclear RNA (hnRNA) complexes. They are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It acts as the catalytic component of the multiprotein editosome complex which mediates apolipoprotein B mRNA editing. Two alternatively spliced transcript variants has been described for this gene which encode different isoforms.
17027	
17028	
17029	
17030	
17031	
17032	The protein encoded by this gene is a member of the basic helix-loop-helix (bHLH) E-protein family that recognizes the consensus binding site (E-box) CANNTG. This encoded protein is expressed in many tissues, among them skeletal muscle, thymus, B- and T-cells, and may participate in regulating lineage-specific gene expression through the formation of heterodimers with other bHLH E-proteins. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
17033	The protein encoded by this gene is a subunit of the multisubunit NADH:ubiquinone oxidoreductase (complex I). Mammalian complex I is composed of 45 different subunits. It locates at the mitochondrial inner membrane. This protein has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
17034	
17035	
17036	
17037	The protein encoded by this gene binds to the intronic cluster of RNA regulatory elements, downstream control sequence (DCS). It is implicated in controlling the assembly of other splicing-regulatory proteins. This protein is very similar to the polypyrimidine tract binding protein but it is expressed primarily in the brain.
17038	
17039	The protein encoded by this gene is the large subunit of replication factor C, which is a five subunit DNA polymerase accessory protein. Replication factor C is a DNA-dependent ATPase that is required for eukaryotic DNA replication and repair. The protein acts as an activator of DNA polymerases, binds to the 3@#$%& end of primers, and promotes coordinated synthesis of both strands. It also may have a role in telomere stability.
17040	
17041	
17042	In cardiac myocytes, Ca(2+) concentrations alternate between high levels during contraction and low levels during relaxation. The increase in Ca(2+) concentration during contraction is primarily due to release of Ca(2+) from intracellular stores. However, some Ca(2+) also enters the cell through the sarcolemma (plasma membrane). During relaxation, Ca(2+) is sequestered within the intracellular stores. To prevent overloading of intracellular stores, the Ca(2+) that entered across the sarcolemma must be extruded from the cell. The Na(+)-Ca(2+) exchanger is the primary mechanism by which the Ca(2+) is extruded from the cell during relaxation. In the heart, the exchanger may play a key role in digitalis action. The exchanger is the dominant mechanism in returning the cardiac myocyte to its resting state following excitation.[supplied by OMIM]
17043	
17044	
17045	The protein encoded by this gene belongs to the family of P-type cation transport ATPases. This magnesium-dependent enzyme catalyzes the hydrolysis of ATP coupled with the transport of the calcium. Defects in this gene cause Hailey-Hailey disease, an autosomal dominant disorder. Alternatively spliced transcript variants encoding different isoforms have been identified.
17046	The protein encoded by this gene belongs to the family of P-type cation transport ATPases. This magnesium-dependent enzyme catalyzes the hydrolysis of ATP coupled with the transport of the calcium. Defects in this gene cause Hailey-Hailey disease, an autosomal dominant disorder. Alternatively spliced transcript variants encoding different isoforms have been identified.
17047	
17048	
17049	This gene encodes a member of the SFRP family that contains a cysteine-rich domain homologous to the putative Wnt-binding site of Frizzled proteins. SFRPs act as soluble modulators of Wnt signaling. Methylation of this gene is a potential marker for the presence of colorectal cancer.
17050	
17051	
17052	
17053	Eukaryotes contain two protein translational systems, one in the cytoplasm and one in the mitochondria. Mitochondrial translation is crucial for maintaining mitochondrial function and mutations in this system lead to a breakdown in the respiratory chain-oxidative phosphorylation system and to impaired maintenance of mitochondrial DNA. This gene encodes one of mitochondrial translation elongation factors. Its role in the regulation of normal mitochondrial function and in different disease states attributed to mitochondrial dysfunction is not known. Alternative splicing results in at least three transcript variants encoding distinct isoforms.
17054	
17055	Carboxylesterase 2 is a member of a large multigene family. The enzymes encoded by these genes are responsible for the hydrolysis of ester- and amide-bond-containing drugs such as cocaine and heroin. They also hydrolize long-chain fatty acid esters and thioesters. The specific function of this enzyme has not yet been determined; however, it is speculated that carboxylesterases may play a role in lipid metabolism and/or the blood-brain barrier system. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
17056	
17057	
17058	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This protein contains a C-terminal PTP domain and an N-terminal domain homologous to the band 4.1 superfamily of cytoskeletal-associated proteins. P97, a cell cycle regulator involved in a variety of membrane related functions, has been shown to be a substrate of this PTP. This PTP was also found to interact with, and be regulated by adaptor protein 14-3-3 beta.
17059	
17060	
17061	The translation initiation factor eIF2 catalyzes the first regulated step of protein synthesis initiation, promoting the binding of the initiator tRNA to 40S ribosomal subunits. Binding occurs as a ternary complex of methionyl-tRNA, eIF2, and GTP. eIF2 is composed of 3 nonidentical subunits, alpha (36 kD), beta (38 kD, MIM 603908), and gamma (52 kD, MIM 300161). The rate of formation of the ternary complex is modulated by the phosphorylation state of eIF2-alpha (Ernst et al., 1987).[supplied by OMIM]
17062	
17063	
17064	
17065	
17066	
17067	
17068	This gene encodes one of the two alpha chains of type VIII collagen. The gene product is a short chain collagen and a major component of the basement membrane of the corneal endothelium. The type VIII collagen fibril can be either a homo- or a heterotrimer. Alternatively spliced transcript variants encoding the same isoform have been observed.
17069	
17070	
17071	
17072	
17073	This gene encodes a ubiquitin-specific protease, an enzyme that removes ubiquitin from ubiquitinated proteins. The encoded protein belongs to the C19 peptidase family, also known as family 2 of ubiquitin carboxyl-terminal hydrolases. This protein has been reported to be capable of removing NEDD8 from NEDD8 conjugates. Alternatively spliced transcript variants encoding different isoforms have been identified.
17074	The protein encoded by this gene is highly similar to human hippocalcin protein and hippocalcin like-1 protein. It also has similarity to rat neural visinin-like Ca2+-binding protein-type 1 and 2 proteins. This encoded protein may be involved in the calcium-dependent regulation of rhodopsin phosphorylation. The transcript of this gene has multiple polyadenylation sites.
17075	
17076	
17077	This gene, through alternative splicing, encodes three different isoforms. Two of the protein isoforms encoded by this gene are specific inhibitors of type 1 serine/threonine protein phosphatases and can bind but not cleave RNA. The third protein isoform lacks the phosphatase inhibitory function but is a single-strand endoribonuclease comparable to RNase E of E. coli. This isoform requires magnesium for its function and cleaves specific sites in A+U-rich regions of RNA.
17078	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S21P family. Pseudogenes corresponding to this gene are found on chromosomes 1p, 1q, 9p, 10p, 10q, 16q, and 17q. Available sequence data analyses identified splice variants that differ in the 5@#$%& UTR; both transcripts encode the same protein.
17079	
17080	
17081	
17082	Glycophorin C (GYPC) is an integral membrane glycoprotein. It is a minor species carried by human erythrocytes, but plays an important role in regulating the mechanical stability of red cells. A number of glycophorin C mutations have been described. The Gerbich and Yus phenotypes are due to  deletion of exon 3 and 2, respectively. The Webb and Duch antigens, also known as glycophorin D, result from single point mutations of the glycophorin C gene. The glycophorin C protein has very little homology with glycophorins A and B.
17083	The product of this gene belongs to the family of basic helix-loop-helix leucine zipper (bHLH-Zip) transcription factors. These factors form heterodimers with Mad proteins and play a role in proliferation, determination and differentiation. This gene product may act to diversify Mad family function by its restricted association with a subset of the Mad family of transcriptional repressors, namely, Mad1 and Mad4. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
17084	The protein encoded by this gene shares similarity with mouse Hs1bp3, an Hcls1/Hs1-interacting protein that may be involved in lymphocyte activation. Two alternatively spliced transcript variants of this gene, which encode distinct isoforms, have been reported.
17085	This gene encodes a protein which is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes which undergo proteolytic processing at conserved aspartic residues to produce 2 subunits, large and small, that dimerize to form the active enzyme. This protein was shown to cleave and activate caspases 3 and 7, and itself could be processed by caspase 8.  Mutations in this gene are associated with apoptosis defects seen in type II autoimmune lymphoproliferative syndrome. Four alternatively spliced transcript variants encoding different isoforms have been described for this gene.
17086	This gene encodes a protein that is a member of the S9B family in clan SC of the serine proteases. This protein has no detectable protease activity, most likely due to the absence of the conserved serine residue normally present in the catalytic domain of serine proteases. A link between mutations in this gene and asthma is suspected. Two splice variants of this gene, encoding different isoforms, have been described.
17087	
17088	The protein encoded by this gene is similar to the chicken and mouse CRTAP genes. The encoded protein is a scaffolding protein that may influence the activity of at least one member of the cytohesin/ARNO family in response to specific cellular stimuli.
17089	
17090	
17091	This gene encodes a disintegrin and metalloproteinase with thrombospondin (ADAMTS) motifs-3, which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The protein encoded by this gene is the major procollagen II N-propeptidase. A deficiency of this protein may be responsible for dermatosparaxis, a genetic defect of connective tissues.
17092	Arg and c-Abl represent the mammalian members of the Abelson family of non-receptor protein-tyrosine kinases.  They interact with the Arg/Abl binding proteins via the SH3 domains present in the carboxy end of the latter group of proteins.  The subcellular localization of Arg/Abl-interacting  protein 2 encoded by this gene in epithelial and cardiac muscle cells, suggests that it functions as an adapter protein to assemble signaling complexes in stress fibers, and that it is a potential link between Abl family kinases and the actin cytoskeleton.  Several transcript variants encoding different isoforms of Arg/Abl-interacting protein 2 have been described.
17093	
17094	
17095	
17096	
17097	Three classes of protein carboxyl methyltransferases, distinguished by their methyl-acceptor substrate specificity, have been found in prokaryotic and eukaryotic cells. The type II enzyme catalyzes the transfer of a methyl group from S-adenosyl-L-methionine to the free carboxyl groups of D-aspartyl and L-isoaspartyl residues. These methyl-accepting residues result from the spontaneous deamidation, isomerization, and racemization of normal L-aspartyl and L-asparaginyl residues and represent sites of covalent damage to aging proteins PCMT1 (EC 2.1.1.77) is a protein repair enzyme that initiates the conversion of abnormal D-aspartyl and L-isoaspartyl residues to the normal L-aspartyl form.[supplied by OMIM]
17098	
17099	
17100	
17101	
17102	This gene encodes an enzyme with hydroxypyruvate reductase, glyoxylate reductase, and D-glycerate dehydrogenase enzymatic activities. The enzyme has widespread tissue expression and has a role in metabolism. Type II hyperoxaluria is caused by mutations in this gene.
17103	
17104	
17105	HEC is one of several proteins involved in spindle checkpoint signaling. This surveillance mechanism assures correct segregation of chromosomes during cell division by detecting unaligned chromosomes and causing prometaphase arrest until the proper bipolar attachment of chromosomes is achieved.[supplied by OMIM]
17106	
17107	
17108	
17109	This gene encodes one of the SERCA Ca(2+)-ATPases, which are intracellular pumps located in the sarcoplasmic or endoplasmic reticula of muscle cells. This enzyme catalyzes the hydrolysis of ATP coupled with the translocation of calcium from the cytosol to the sarcoplasmic reticulum lumen, and is involved in regulation of the contraction/relaxation cycle. Mutations in this gene cause Darier-White disease, also known as keratosis follicularis, an autosomal dominant skin disorder characterized by loss of adhesion between epidermal cells and abnormal keratinization. Alternative splicing results in two transcript variants encoding different isoforms.
17110	
17111	
17112	
17113	
17114	The protein encoded by this gene is a subunit of a disulfide-linked homotrimeric protein. This protein is an adhesive glycoprotein that mediates cell-to-cell and cell-to-matrix interactions. This protein can bind to fibrinogen, fibronectin, laminin, type V collagen and integrins alpha-V/beta-1. This protein has been shown to play roles in platelet aggregation, angiogenesis, and tumorigenesis.
17115	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L13E family of ribosomal proteins. It is located in the cytoplasm. This gene is expressed at significantly higher levels in benign breast lesions than in breast carcinomas. Transcript variants derived from alternative splicing and/or alternative polyadenylation exist; these variants encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
17116	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L13E family of ribosomal proteins. It is located in the cytoplasm. This gene is expressed at significantly higher levels in benign breast lesions than in breast carcinomas. Transcript variants derived from alternative splicing and/or alternative polyadenylation exist; these variants encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
17117	The HIRA protein shares sequence similarity with Hir1p and Hir2p, the two corepressors of histone gene transcription characterized in the yeast, Saccharomyces cerevisiae.  The structural features of the HIRA protein suggest that it may function as part of a multiprotein complex.  Recently, several cDNAs encoding HIRA-interacting proteins, or HIRIPs, have been identified.  In vitro, the HIRIP3 gene product binds HIRA, as well as H2B and H3 core histones, indicating that a complex containing HIRA-HIRIP3 could function in some aspects of chromatin and histone metabolism.
17118	The human alpha globin gene cluster located on chromosome 16 spans about 30 kb and includes the following five loci:  5@#$%&- zeta - pseudozeta - pseudoalpha-1 - alpha-2 - alpha-1 -3@#$%&. The alpha-2 (HBA2) and alpha-1 (HBA1) coding sequences are identical. These genes differ slightly over the 5@#$%& untranslated regions and the introns, but they differ significantly over the 3@#$%& untranslated regions. Two alpha chains plus two beta chains constitute HbA, which in normal adult life comprises about 97% of  the total hemoglobin; alpha chains combine with delta chains to constitute HbA-2, which with HbF (fetal hemoglobin) makes up the  remaining 3% of adult hemoglobin.  Alpha thalassemias result from deletions of each of the alpha genes as well as deletions of both HBA2 and HBA1 respectively; some nondeletion alpha thalassemias have also been reported.
17119	
17120	This gene encodes a protein that interacts with the DNA polymerase delta p50 subunit. The encoded protein also interacts with proliferating cell nuclear antigen. Some transcripts of this gene overlap in a tail-to-tail orientation with the gene for tumor necrosis factor, alpha-induced protein 1 (TNFAIP1).
17121	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the high sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
17122	This gene is located within the Smith-Magenis syndrome region on chromosome 17. It encodes a protein of unknown function.
17123	This gene encodes a member of the Asp-Glu-Ala-Asp (DEAD) box protein family. Members of this protein family are putative RNA helicases, and are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. Two transcript variants encoding the same protein have been identified for this gene.
17124	
17125	Caspase recruitment domain (CARD)-containing proteins, such as CARD8, are involved in pathways leading to activation of caspases or nuclear factor kappa-B (NFKB; see MIM 164011) in the context of apoptosis or inflammation, respectively (Bouchier-Hayes et al., 2001).[supplied by OMIM]
17126	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
17127	
17128	This gene encodes a phosphatidylinositol (PI) 4-kinase which catalyzes the first committed step in the biosynthesis of phosphatidylinositol 4,5-bisphosphate. The mammalian PI 4-kinases have been classified into two types, II and III, based on their molecular mass, and modulation by detergent and adenosine. Two transcript variants encoding different isoforms have been described for this gene.
17129	CDC42 is a member of the Rho GTPase family that regulates multiple cellular activities, including actin polymerization. The protein encoded by this gene is a CDC42 binding protein that mediates actin cytoskeleton reorganization at the plasma membrane. The encoded protein, which is secreted, is primarily found in bone marrow. Two transcript variants encoding different isoforms have been found for this gene.
17130	
17131	
17132	This gene encodes a member of the PMP22/EMP/claudin protein family. The encoded protein is localized to the ER and the plasma membrane. In dogs, transcripts of this gene exist at high levels in the brain.
17133	
17134	
17135	
17136	Roadblock/LC7 is one member of an ancient dynein light chain protein family conserved in nematode, fruit fly, mouse, and rat. The protein encoded by this gene is one of two human proteins that share sequence similarity with roadblock/LC7. Expression of this gene is up-regulated in hepatocellular carcinomas, suggesting that this gene may be involved in tumor progression. Multiple transcript variants encoding different isoforms have been found for this gene.
17137	
17138	
17139	The protein encoded by this gene is a member of the signal-regulatory-protein (SIRP) family, and also belongs to the immunoglobulin superfamily. SIRP family members are receptor-type transmembrane glycoproteins known to be involved in the negative regulation of receptor tyrosine kinase-coupled signaling processes. This protein was found to interact with TYROBP/DAP12, a protein bearing immunoreceptor tyrosine-based activation motifs. This protein was also reported to participate in the recruitment of tyrosine kinase SYK.
17140	The protein encoded by this gene is an integral membrane ATPase. The encoded protein is probably phosphorylated in its intermediate state and likely drives the transport of ions such as calcium across membranes. Two transcript variants encoding different isoforms have been found for this gene.
17141	
17142	
17143	
17144	
17145	
17146	
17147	
17148	
17149	
17150	
17151	
17152	
17153	
17154	
17155	
17156	
17157	
17158	
17159	
17160	
17161	
17162	
17163	The protein encoded by this gene is a subunit of the multisubunit NADH:ubiquinone oxidoreductase (complex I). Mammalian complex I is composed of 45 different subunits. It is located at the mitochondrial inner membrane. This protein has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone.
17164	
17165	
17166	
17167	
17168	
17169	
17170	
17171	
17172	This gene encodes a subunit of the BRCA1-BRCA2-containing complex (BRCC), which is an E3 ubiquitin ligase. This protein is also thought to be involved in the cellular response to ionizing radiation and progression through the G2/M checkpoint. Alternative splicing results in multiple transcript variants.
17173	
17174	
17175	
17176	
17177	
17178	
17179	
17180	
17181	
17182	
17183	In mouse, Hils1 displays characteristics of a linker histone and is expressed in nuclei of late maturing spermatids (Yan et al., 2003).[supplied by OMIM]
17184	
17185	This gene encodes a member of the polycystin protein family. Members of this protein family may function in renal tubular development, morphology, and function, and may modulate intracellular calcium homoeostasis and other signal transduction pathways. Two transcript variants encoding distinct isoforms have been identified for this gene.
17186	
17187	HLA-A belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-A alleles have been described.
17188	
17189	
17190	
17191	
17192	
17193	
17194	
17195	Exportin-5 belongs to a large family of karyopherins (see MIM 602738) that mediate the transport of proteins and other cargo between the nuclear and cytoplasmic compartments.[supplied by OMIM]
17196	
17197	Translokin binds basic fibroblast growth factor (FGF2; MIM 134920) and mediates its nuclear translocation and mitogenic activity (Bossard et al., 2003).[supplied by OMIM]
17198	Oncostatin M is a member of the IL6 family of cytokines. Functional receptors for IL6 family cytokines are multisubunit complexes involving members of the hematopoietin receptor superfamily. Many IL6 cytokines utilize gp130 as a common receptor subunit. OSM binds to the gp130 receptor subunit and, in association with the leukemia inhibitory factor receptor, induces a proliferative response in permissive cells. OSMR is an alternative subunit (for an OSM receptor complex (a heterodimer of gp130 and OSMR) that is activated by OSM but not by LIF
17199	This gene is located within a chromosomal region that shows loss of heterozygosity in some bladder cancers.  It contains a 5@#$%& CpG island that may be a frequent target of hypermethylation, and it may undergo hypermethylation-based silencing in some bladder cancers.
17200	
17201	
17202	
17203	
17204	This gene encodes a protein that may play a role in organelle biogenesis associated with melanosomes, platelet dense granules, and lysosomes. A similar protein in mouse is a component of a protein complex termed biogenesis of lysosome-related organelles complex 1 (BLOC-1), and binds to alpha- and beta-dystrobrevins, which are components of the dystrophin-associated protein complex (DPC). Mutations in this gene are associated with Hermansky-Pudlak syndrome type 7. This gene may also be associated with schizophrenia. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
17205	
17206	
17207	
17208	
17209	
17210	This gene encodes a member of the serine/threonine protein kinase family. The encoded kinase was identified by its interaction with MAP3K5/ASK, a protein kinase and an activator of c-Jun kinase (MAPK7/JNK) and MAPK14/p38 kinase. This kinase was found to weakly activate MAPK7, but not MAPK1/ERK or MAPK14. An alternatively spliced transcript variant has been found for this gene, but its biological validity has not been determined.
17211	
17212	
17213	
17214	
17215	
17216	
17217	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H2B family. Two transcripts that encode the same protein have been identified for this gene, which is found in the large histone gene cluster on chromosome 6p22-p21.3.
17218	
17219	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which is an antigen recognized by autoimmune antibodies from a patient with watermelon stomach disease. This protein unwinds double-stranded RNA, folds single-stranded RNA, and may play important roles in ribosomal RNA biogenesis, RNA editing, RNA transport, and general transcription.
17220	
17221	The protein encoded by this gene is a member of the immunoglobulin binding factor family. It is synthesized by the epithelial cells of the prostate gland and secreted into the seminal plasma. This protein has inhibin-like activity. It may have a role as an autocrine paracrine factor in uterine, breast and other female reproductive tissues. The expression of the encoded protein is found to be decreased in prostate cancer. Two alternatively spliced transcript variants encoding different isoforms are described for this gene. The use of alternate polyadenylation sites has been found for this gene.
17222	
17223	
17224	Exonucleolytic degradation of the poly(A) tail is often the first step in the decay of eukaryotic mRNAs.  The amino acid sequence of poly(A)-specific ribonuclease shows homology to the RNase D family of 3@#$%&-exonucleases. The protein appears to be localized in both the nucleus and the cytoplasm. It is not stably associated with polysomes or ribosomal subunits.
17225	Ataxin-2 binding protein 1 has an RNP motif that is highly conserved among RNA-binding proteins. This protein binds to the C-terminus of ataxin-2 and may contribute to the restricted pathology of spinocerebellar ataxia type 2 (SCA2). Ataxin-2 is the gene product of the SCA2 gene which causes familial neurodegenerative diseases. Ataxin-2 binding protein 1 and ataxin-2 are both localized in the trans-Golgi network. Four alternatively spliced transcript variants have been found for this gene. Additional transcript variants have been found but their full length nature has not been determined.
17226	
17227	
17228	Members of the subtilisin-like proprotein convertase family process latent precursor proteins into their biologically active products. The protein encoded by this gene appears to function as an endogenous inhibitor of proprotein convertase subtilisin/kexin type 1.
17229	
17230	
17231	
17232	
17233	
17234	
17235	
17236	
17237	
17238	
17239	Leptin (LEP; MIM 164160), an adipocyte-specific hormone that regulates adipose-tissue mass through hypothalamic effects on satiety and energy expenditure, acts through the leptin receptor (LEPR), a single-transmembrane-domain receptor of the cytokine receptor family.[supplied by OMIM]
17240	This gene encodes a protein which specifically interacts with translin, a DNA-binding protein that binds consensus sequences at breakpoint junctions of chromosomal translocations. The encoded protein contains bipartite nuclear targeting sequences that may provide nuclear transport for translin, which lacks any nuclear targeting motifs.
17241	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S27E family of ribosomal proteins. It contains a C4-type zinc finger domain that can bind to zinc. The encoded protein has been shown to be able to bind to nucleic acid. It is located in the cytoplasm as a ribosomal component, but it has also been detected in the nucleus. Studies in rat indicate that ribosomal protein S27 is located near ribosomal protein S18 in the 40S subunit and is covalently linked to translation initiation factor eIF3. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
17242	
17243	
17244	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene encodes a member of the histone H2B family, and generates two transcripts through the use of the conserved stem-loop termination motif, and the polyA addition motif.
17245	
17246	
17247	This gene encodes a protein that is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes composed of a prodomain, a large protease subunit, and a small protease subunit. Activation of caspases requires proteolytic processing at conserved internal aspartic residues to generate a heterodimeric enzyme consisting of the large and small subunits. This protein is involved in the programmed cell death induced by Fas and various apoptotic stimuli. The N-terminal FADD-like death effector domain of this protein suggests that it may interact with Fas-interacting protein FADD. This protein was detected in the insoluble fraction of the affected brain region from Huntington disease patients but not in those from normal controls, which implicated the role in neurodegenerative diseases. Alternative splicing of this gene results in five transcript variants.
17248	
17249	
17250	
17251	This gene encodes a member of the beta chemokine receptor family, which is predicted to be a seven transmembrane protein similar to G protein-coupled receptors. The ligands of this receptor include macrophage inflammatory protein 1 alpha (MIP-1 alpha), regulated on activation normal T expressed and secreted protein (RANTES), monocyte chemoattractant protein 3 (MCP-3), and myeloid progenitor inhibitory factor-1 (MPIF-1). Chemokines and their receptors mediated signal transduction are critical for the recruitment of effector immune cells to the site of inflammation. Knockout studies of the mouse homolog suggested the roles of this gene in host protection from inflammatory response, and susceptibility to virus and parasite. This gene and other chemokine receptor genes, including CCR2, CCRL2, CCR3, CCR5 and CCXCR1, are found to form a gene cluster on chromosome 3p.
17252	
17253	Interleukin 1 induces synthesis of acute phase and proinflammatory proteins during infection, tissue damage, or stress, by forming a complex at the cell membrane with an interleukin 1 receptor and an accessory protein. This gene encodes an interleukin 1 receptor accessory protein. Alternative splicing of this gene results in two transcript variants encoding two different isoforms, one membrane-bound and one soluble.
17254	
17255	
17256	
17257	This gene is similar to the C. elegans MAB-21 cell fate-determining gene, a downstream target of transforming growth factor-beta signaling. It is thought that this gene may be involved in neural development. The protein encoded by this gene is primarily nuclear, although some cytoplasmic localization has been observed.
17258	
17259	
17260	
17261	
17262	
17263	
17264	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
17265	WD40 repeat proteins are key components of many essential biologic functions. They regulate the assembly of multiprotein complexes by presenting a beta-propeller platform for simultaneous and reversible protein-protein interactions. Members of the WIPI subfamily of WD40 repeat proteins, such as WIPI2, have a 7-bladed propeller structure and contain a conserved motif for interaction with phospholipids (Proikas-Cezanne et al., 2004).[supplied by OMIM]
17266	The protein encoded by this gene is a Ro RNP-binding protein. It interacts with Ro RNPs and their interaction is thought to represent a gain of function for Ro RNPs. This protein also forms a ternary complex with far upstream element (FUSE) and FUSE-binding protein. It can repress a c-myc reporter via the FUSE. It is also known to target transcription factor IIH and inhibit activated transcription. This gene is implicated in the xeroderma pigmentosum disorder. There are two alternatively spliced transcript variants of this gene encoding different isoforms. There seems to be evidence of multiple polyadenylation sites for this gene.
17267	This gene encodes a subunit of the N-methyl-D-aspartate (NMDA) receptors, which belong to the superfamily of glutamate-regulated ion channels, and function in physiological and pathological processes in the central nervous system. This subunit shows greater than 90% identity to the corresponding subunit in rat. Studies in the knockout mouse deficient in this subunit suggest that this gene may be involved in the development of synaptic elements by modulating NMDA receptor activity.
17268	
17269	The product of this gene belongs to the ligand-gated ion channel receptor superfamily. This gene encodes a subunit D of the type 3 receptor for 5-hydroxytryptamine (serotonin), a biogenic hormone that functions as a neurotransmitter, a hormone, and a mitogen. This receptor causes fast, depolarizing responses in neurons after activation. Genes encoding subunits C, D and E form a cluster on chromosome 3.
17270	
17271	The protein encoded by this gene has been found to form a complex with EPS8 and SOS1, and is thought to be involved in the transduction of signals from Ras to Rac. In addition, the encoded protein may play a role in the regulation of EGF-induced Erk pathway activation as well as cytoskeletal reorganization and EGFR signaling. Several transcript variants encoding multiple isoforms have been found for this gene.
17272	SUFU encodes a component of the sonic hedgehog (SHH; MIM 600725)/patched (PTCH; MIM 601309) signaling pathway. Mutations in genes encoding components of this pathway are deleterious for normal development and are associated with cancer-predisposing syndromes (e.g., HPE3, MIM 142945; BCNS, MIM 109400; and GCPS, MIM 175700).[supplied by OMIM]
17273	SUFU encodes a component of the sonic hedgehog (SHH; MIM 600725)/patched (PTCH; MIM 601309) signaling pathway. Mutations in genes encoding components of this pathway are deleterious for normal development and are associated with cancer-predisposing syndromes (e.g., HPE3, MIM 142945; BCNS, MIM 109400; and GCPS, MIM 175700).[supplied by OMIM]
17274	
17275	The product of this gene belongs to the ligand-gated ion channel receptor superfamily. This gene encodes subunit A of the type 3 receptor for 5-hydroxytryptamine (serotonin), a biogenic hormone that functions as a neurotransmitter, a hormone, and a mitogen. This receptor causes fast, depolarizing responses in neurons after activation. It appears that the heteromeric combination of A and B subunits is necessary to provide the full functional features of this receptor, since either subunit alone results in receptors with very low conductance and response amplitude. Alternatively spliced transcript variants encoding different isoforms have been identified.
17276	The protein encoded by this gene is a member of the fatty acid desaturase (FADS) gene family. Desaturase enzymes regulate unsaturation of fatty acids through the introduction of double bonds between defined carbons of the fatty acyl chain. FADS family members are considered fusion products composed of an N-terminal cytochrome b5-like domain and a C-terminal multiple membrane-spanning desaturase portion, both of which are characterized by conserved histidine motifs. This gene is clustered with family members FADS1 and FADS2 at 11q12-q13.1; this cluster is thought to have arisen evolutionarily from gene duplication based on its similar exon/intron organization.
17277	
17278	
17279	
17280	
17281	
17282	Serine palmitoyltransferase (SPT; EC 2.3.1.50) is the key enzyme in sphingolipid biosynthesis. It catalyzes the pyridoxal-5-prime-phosphate-dependent condensation of L-serine and palmitoyl-CoA to 3-oxosphinganine.[supplied by OMIM]
17283	
17284	
17285	Junctional complexes between the plasma membrane and endoplasmic/sarcoplasmic reticulum are a common feature of all excitable cell types and mediate cross talk between cell surface and intracellular ion channels. The protein encoded by this gene is a component of junctional complexes and is composed of a C-terminal hydrophobic segment spanning the endoplasmic/sarcoplasmic reticulum membrane and a remaining cytoplasmic domain that shows specific affinity for the plasma membrane. CAG/CTG repeat expansions at the Huntington@#$%&s disease (HD)-like 2 locus have been identified in this gene, which is a member of the junctophilin gene family.
17286	This gene encodes a trypsinogen, which is a member of the trypsin family of serine proteases. This enzyme is highly expressed in prostate epithelia and is one of several proteolytic enzymes found in seminal fluid. The proprotein is cleaved to produce a light chain and a heavy chain which are associated by a disulfide bond. It is active on peptide linkages involving the carboxyl group of lysine or arginine.
17287	
17288	
17289	
17290	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. Four sequence-related potassium channel genes - shaker, shaw, shab, and shal - have been identified in Drosophila, and each has been shown to have human homolog(s). This gene encodes a member of the potassium channel, voltage-gated, shaker-related subfamily. This member is one of the beta subunits, which are auxiliary proteins associating with functional Kv-alpha subunits. This member alters functional properties of the KCNA4 gene product. Alternative splicing of this gene results in two transcript variants encoding distinct isoforms.
17291	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the high sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
17292	
17293	CNDP2, also known as tissue carnosinase and peptidase A (EC 3.4.13.18), is a nonspecific dipeptidase rather than a selective carnosinase (Teufel et al., 2003).[supplied by OMIM]
17294	The protein encoded by this gene is a plasma glycoprotein of unknown function. The protein shows sequence similarity to the variable regions of some immunoglobulin supergene family member proteins.
17295	
17296	This gene product belongs to the zinc finger protein superfamily, members of which are regulatory proteins characterized by nucleic acid-binding zinc finger domains. The encoded protein is a member of the Kruppel family of zinc finger proteins, and contains Kruppel-associated box (KRAB) A and B domains and 15 tandemly arrayed C2H2-type zinc fingers. It is an ortholog of the mouse Zfp93 protein. This gene is located in a cluster of zinc finger genes on 19q13.2.
17297	
17298	
17299	
17300	
17301	This gene encodes a serine/threonine protein kinase. Although this gene product is similar to serum- and glucocorticoid-induced protein kinase (SGK), this gene is not  induced by serum or glucocorticoids. This gene is induced in response to signals that activate phosphatidylinositol 3-kinase, which is also true for SGK. Two alternate transcripts encoding two different isoforms have been described.
17302	The protein encoded by this gene binds copper and zinc ions and is one of two isozymes responsible for destroying free superoxide radicals in the body. The encoded isozyme is a soluble cytoplasmic protein, acting as a homodimer to convert naturally-occuring but harmful superoxide radicals to molecular oxygen and hydrogen peroxide. The other isozyme is a mitochondrial protein. Mutations in this gene have been implicated as causes of familial amyotrophic lateral sclerosis. Rare transcript variants have been reported for this gene.
17303	
17304	
17305	
17306	
17307	
17308	This gene encodes a cytoplasmic protein that contains a coiled-coil structure and a BTB/POZ domain at its N-terminus, ankyrin repeats in the middle portion, and a FYVE-finger motif at its C-terminus. This protein belongs to a subgroup of double zinc finger proteins which may be involved in vesicle or protein transport. Alternative splicing has been observed at this locus and two variants, each encoding a distinct isoform, have been identified.
17309	
17310	
17311	
17312	
17313	
17314	
17315	
17316	
17317	
17318	
17319	
17320	
17321	
17322	
17323	
17324	
17325	
17326	
17327	Synaptic vesicles are responsible for regulating the storage and release of neurotransmitters in the nerve terminal. The protein encoded by this gene is an abundant integral membrane protein of cholinergic synaptic vesicles and is thought to be involved in vesicular transport. It belongs to the quinone oxidoreductase subfamily of zinc-containing alcohol dehydrogenase proteins.
17328	
17329	
17330	
17331	
17332	
17333	A reciprocal translocation between chromosomes 22 and 9 produces the Philadelphia chromosome, which is often found in patients with chronic myelogenous leukemia. The chromosome 22 breakpoint for this translocation is located within the BCR gene. The translocation produces a fusion protein which is encoded by sequence from both BCR and ABL, the gene at the chromosome 9 breakpoint. Although the BCR-ABL fusion protein has been extensively studied, the function of the normal BCR gene product is not clear. The protein has serine/threonine kinase activity and is a GTPase-activating protein for p21rac.
17334	
17335	
17336	
17337	
17338	
17339	
17340	
17341	
17342	
17343	
17344	
17345	
17346	
17347	This gene encodes a member of the aldo/keto reductase superfamily, which consists of more than 40 known enzymes and proteins. These enzymes catalyze the conversion of aldehydes and ketones to their corresponding alcohols by utilizing NADH and/or NADPH as cofactors. The enzymes display overlapping but distinct substrate specificity. This enzyme binds bile acid with high affinity, and shows minimal 3-alpha-hydroxysteroid dehydrogenase activity. This gene shares high sequence identity with three other gene members and is clustered with those three genes at chromosome 10p15-p14.
17348	
17349	
17350	
17351	
17352	
17353	
17354	
17355	
17356	
17357	
17358	
17359	
17360	
17361	
17362	
17363	
17364	
17365	
17366	SHPRH is a ubiquitously expressed protein that contains motifs characteristics of several DNA repair proteins, transcription factors, and helicases.[supplied by OMIM]
17367	
17368	
17369	
17370	
17371	
17372	
17373	This gene encodes an enzyme which converts phosphatidylethanolamine to phosphatidylcholine by sequential methylation in the liver. The protein localizes to the endoplasmic reticulum and mitochondria-associated membranes. The gene is within the Smith-Magenis syndrome region on chromosome 17. Alternate splicing of this gene results in three transcript variants encoding two different isoforms.
17374	
17375	
17376	Transcription of protein-coding genes can be reconstituted on naked DNA with only the general transcription factors and RNA polymerase II. However, this minimal system cannot transcribe DNA packaged into chromatin, indicating that accessory factors may facilitate access to DNA. One such factor, FACT (facilitates chromatin transcription), interacts specifically with histones H2A/H2B to effect nucleosome disassembly and transcription elongation. FACT is composed of an 80 kDa subunit and a 140 kDa subunit, the latter of which is the protein encoded by this gene.
17377	ARHJ belongs to the Rho family of small GTP-binding proteins. Rho proteins regulate the dynamic assembly of cytoskeletal components for several physiologic processes, such as cell proliferation and motility and the establishment of cell polarity. They are also involved in pathophysiologic process, such as cell transformation and metastasis.[supplied by OMIM]
17378	
17379	
17380	This gene encodes a member of the tripartite motif (TRIM) family, whose members are involved in diverse cellular functions such as developmental patterning and oncogenesis. The TRIM motif includes zinc-binding domains, a RING finger region, a B-box motif and a coiled-coil domain. The RING finger and B-box domains chelate zinc and might be involved in protein-protein and/or protein-nucleic acid interactions. The gene mutations are associated with mulibrey (muscle-liver-brain-eye) nanism, an autosomal recessive disorder that involves several tissues of mesodermal origin. Alternatively spliced transcript variants encoding the same protein have been identified.
17381	
17382	
17383	This gene encodes menin, a putative tumor suppressor associated with a syndrome known as multiple endocrine neoplasia type 1. In vitro studies have shown menin is localized to the nucleus, possesses two functional nuclear localization signals, and inhibits transcriptional activation by JunD, however, the function of this protein is not known. Two messages have been detected on northern blots but the larger message has not been characterized. Two variants of the shorter transcript have been identified where alternative splicing affects the CDS. Five variants where alternative splicing takes place in the 5@#$%& UTR have also been identified.
17384	
17385	
17386	
17387	This gene encodes the insulin receptor substrate 2, a cytoplasmic signaling molecule that mediates effects of insulin, insulin-like growth factor 1, and other cytokines by acting as a molecular adaptor between diverse receptor tyrosine kinases and downstream effectors. The product of this gene is phosphorylated by the insulin receptor tyrosine kinase upon receptor stimulation, as well as by an interleukin 4 receptor-associated kinase in response to IL4 treatment.
17388	Endothelin-converting enzyme-1 is involved in the proteolytic processing of endothelin-1 (EDN1; MIM 131240), -2 (EDN2; MIM 131241), and -3 (EDN3; MIM 131242) to biologically active peptides.[supplied by OMIM]
17389	GPR100 is a member of the rhodopsin family of G protein-coupled receptors (GPRs) (Fredriksson et al., 2003).[supplied by OMIM]
17390	
17391	
17392	
17393	
17394	
17395	Aminoacyl-tRNA synthetases are a class of enzymes that charge tRNAs with their cognate amino acids.  Glycyl-tRNA synthetase is an (alpha)2 dimer which belongs to the class II family of tRNA synthetases.  It has been shown to be a target of autoantibodies in the human autoimmune diseases, polymyositis or dermatomyositis.
17396	
17397	
17398	NFKB1 (MIM 164011) or NFKB2 (MIM 164012) is bound to REL (MIM 164910), RELA (MIM 164014), or RELB (MIM 604758) to form the NFKB complex. The NFKB complex is inhibited by I-kappa-B proteins (NFKBIA, MIM 164008, or NFKBIB, MIM 604495), which inactivate NF-kappa-B by trapping it in the cytoplasm. Phosphorylation of serine residues on the I-kappa-B proteins by kinases (IKBKA, MIM 600664, or IKBKB) marks them for destruction via the ubiquitination pathway, thereby allowing activation of the NF-kappa-B complex. Activated NFKB complex translocates into the nucleus and binds DNA at kappa-B-binding motifs such as 5-prime GGGRNNYYCC 3-prime or 5-prime HGGARNYYCC 3-prime (where H is A, C, or T; R is an A or G purine; and Y is a C or T pyrimidine).[supplied by OMIM]
17399	
17400	This gene encodes a member of the peroxiredoxin family of antioxidant enzymes, which reduce hydrogen peroxide and alkyl hydroperoxides. The encoded protein may play an antioxidant protective role in different tissues under normal conditions and during inflammatory processes. This protein interacts with peroxisome receptor 1. The crystal structure of this protein in its reduced form has been resolved to 1.5 angstrom resolution. This gene uses alternate in-frame translation initiation sites to generate mitochondrial or peroxisomal/cytoplasmic forms. Three transcript variants encoding distinct isoforms have been identified for this gene.
17401	
17402	
17403	
17404	
17405	The protein encoded by this gene belongs to a family of ankyrin repeat proteins that, along with four other protein families, contains a C-terminal SOCS box motif. Growing evidence suggests that the SOCS box acts as a bridge between specific substrate-binding domains and the more generic proteins that comprise a large family of E3 ubiquitin protein ligases. In this way, SOCS box containing proteins may regulate protein turnover by targeting proteins for polyubiquination and, therefore, for proteasome-mediated degradation. Two alternative transcripts encoding different isoforms have been described.
17406	
17407	Zinc is an essential cofactor for hundreds of enzymes. It is involved in protein, nucleic acid, carbohydrate, and lipid metabolism, as well as in the control of gene transcription, growth, development, and differentiation. SLC39A6 belongs to a subfamily of proteins that show structural characteristics of zinc transporters (Taylor and Nicholson, 2003).[supplied by OMIM]
17408	
17409	
17410	
17411	
17412	This gene encodes a cytosolic enzyme that catalyzes the activation of acetate for use in lipid synthesis and energy generation. The protein acts as a monomer and produces acetyl-CoA from acetate in a reaction that requires ATP. Expression of this gene is regulated by sterol regulatory element-binding proteins, transcription factors that activate genes required for the synthesis of cholesterol and unsaturated fatty acids. Two transcript variants encoding different isoforms have been found for this gene.
17413	
17414	
17415	MaxiK channels are large conductance, voltage and calcium-sensitive potassium channels which are fundamental to the control of smooth muscle tone and neuronal excitability. MaxiK channels can be formed by 2 subunits: the pore-forming alpha subunit, which is the product of this gene, and the modulatory beta subunit. Intracellular calcium regulates the physical association between the alpha and beta subunits. Alternatively spliced transcript variants encoding different isoforms have been identified.
17416	
17417	
17418	
17419	
17420	
17421	
17422	
17423	
17424	
17425	
17426	
17427	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene encodes a member of the histone H2B family, and generates two transcripts through the use of the conserved stem-loop termination motif, and the polyA addition motif.
17428	
17429	
17430	
17431	
17432	
17433	
17434	The protein encoded by this gene is a member of the serine/threonine protein kinase family. This kinase has been shown to specifically activate MAPK8/JNK. The activation of MAPK8 by this kinase is found to be inhibited by the dominant-negative mutants of MAP3K7/TAK1, MAP2K4/MKK4, and MAP2K7/MKK7, which suggests that this kinase may function through the MAP3K7-MAP2K4-MAP2K7 kinase cascade, and mediate the TNF-alpha signaling pathway. Alternatively spliced transcript variants encoding different isoforms have been identified.
17435	Von Hippel-Lindau syndrome (VHL) is a dominantly inherited familial cancer syndrome predisposing to a variety of malignant and benign tumors. A germline mutation of this gene is the basis of familial inheritance of VHL syndrome. The protein encoded by this gene is a component of the protein complex that includes elongin B, elongin C, and cullin-2, and possesses ubiquitin ligase E3 activity. This protein is involved in the ubiquitination and degradation of hypoxia-inducible-factor (HIF), which is a transcription factor that plays a central role in the regulation of gene expression by oxygen. RNA polymerase II subunit POLR2G/RPB7 is also reported to be a target of this protein. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
17436	The protein encoded by this gene is a cytokine receptor. This receptor specifically binds to IL17B and IL17E, but does not bind to IL17 and IL17C. This receptor has been shown to mediate the activation of NF-kappaB and the production of IL8 induced by IL17E. The expression of the rat counterpart of this gene was found to be significantly up-regulated during intestinal inflammation, which suggested the immunoregulatory activity of this receptor. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
17437	
17438	P-type ATPases, such as ATP11B, are phosphorylated in their intermediate state and drive uphill transport of ions across membranes. Several subfamilies of P-type ATPases have been identified. One subfamily transports heavy metal ions, such as Cu(2+) or Cd(2+). Another subfamily transports non-heavy metal ions, such as H(+), Na(+), K(+), or Ca(+). A third subfamily transports amphipaths, such as phosphatidylserine.[supplied by OMIM]
17439	
17440	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit. This gene is located in the class II region of the MHC (major histocompatibility complex). Expression of this gene is induced by gamma interferon and this gene product replaces catalytic subunit 1 (proteasome beta 6 subunit) in the immunoproteasome. Proteolytic processing is required to generate a mature subunit. Two alternative transcripts encoding different isoforms have been identified; both isoforms are processed to yield the same mature subunit.
17441	
17442	
17443	
17444	
17445	
17446	This gene was identified as a gene whose expression is rapidly induced by the tumor necrosis factor (TNF). The protein encoded by this gene is a zinc finger protein, and has been shown to inhibit NF-kappa B activation as well as TNF-mediated apoptosis. Knockout studies of a similar gene in mice suggested that this gene is critical for limiting inflammation by terminating TNF-induced NF-kappa B responses.
17447	
17448	
17449	
17450	
17451	
17452	
17453	
17454	
17455	Cylicin II (CYCL2) is specifically expressed in testis and is part of the cytoskeletal calyx of mammalian sperm heads.  Cylicin II may play a role in the morphogenesis of the sperm head.
17456	
17457	This gene encodes a member of the tenascin family of extracellular matrix proteins. The tenascins have anti-adhesive effects, as opposed to fibronectin which is adhesive. The specific function of this protein is unknown, but its deficiency has been associated with the connective tissue disorder Ehlers-Danlos syndrome. This gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6. It is one of four genes in this cluster which have been duplicated. The duplicated copy of this gene is incomplete and is a pseudogene which is transcribed but does not encode a protein. The structure of this gene is unusual in that it overlaps the CREBL1 and CYP21A2 genes at its 5@#$%& and 3@#$%& ends, respectively. The gene expresses two transcripts.
17458	
17459	
17460	
17461	
17462	The protein encoded by this gene belongs to the Rho family of the small GTPase superfamily. It plays a role in small GTPase mediated signal transduction. Alternative splicing occurs at this locus and two transcript variants encoding the same protein have been identified.
17463	
17464	
17465	EPN1 is an endocytic accessory protein that interacts with EPS15 (MIM 600051), the alpha subunit of the clathrin adaptor AP2 (AP2A1; MIM 601026), and clathrin (see MIM 118960), as well as with other accessory proteins for the endocytosis of clathrin-coated vesicles.[supplied by OMIM]
17466	
17467	EPN1 is an endocytic accessory protein that interacts with EPS15 (MIM 600051), the alpha subunit of the clathrin adaptor AP2 (AP2A1; MIM 601026), and clathrin (see MIM 118960), as well as with other accessory proteins for the endocytosis of clathrin-coated vesicles.[supplied by OMIM]
17468	This gene encodes an alpha-1 subunit of a voltage-dependent calcium channel. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization. The alpha-1 subunit consists of 24 transmembrane segments and forms the pore through which ions pass into the cell. The calcium channel consists of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. There are multiple isoforms of each of these proteins, either encoded by different genes or the result of alternative splicing of transcripts. The protein encoded by the gene described in this record binds to and is inhibited by dihydropyridine. Many alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
17469	Interleukin-1 receptor (see IL1R; MIM 147810)-associated kinases (e.g., IRAK1; MIM 300283) are important mediators in the signal transduction of Toll-like receptor (TLR, e.g., TLR4; MIM 603030) and IL1R family members, collectively referred to as TIRs. IRAK4 functions in this signal transduction pathway.[supplied by OMIM]
17470	
17471	Cytoplasmic ribosomes, organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein, which shares sequence similarity with yeast ribosomal protein L44, belongs to the L44E (L36AE) family of ribosomal proteins. This gene and the human gene officially named ribosomal protein L36a (RPL36A) encode nearly identical proteins; however, they are distinct genes. Although the name of this gene has been referred to as ribosomal protein L36a (RPL36A), its official name is ribosomal protein L36a-like (RPL36AL). As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
17472	
17473	
17474	
17475	This gene encodes a protein with sequence similarity to a mouse protein originally identified in embryonic stem cells. In mouse T-cell lines, this protein appears to be related to glucocorticoid- and staurine-induced apoptotic pathways, and to be linked to ceramide-mediated signalling. These observations suggest that this gene product is involved in specific apoptotic processes in T-cells.
17476	This gene encodes a protein that shares a high degree of sequence similarity with p21-activated kinase (PAK) family members. The proteins of this family are Rac/Cdc42-associated Ste20-like Ser/Thr protein kinases, characterized by a highly conserved amino-terminal Cdc42/Rac interactive binding (CRIB) domain and a carboxyl-terminal kinase domain. PAK kinases are implicated in the regulation of a number of cellular processes, including cytoskeleton rearrangement, apoptosis and the MAP kinase signaling pathway. The protein encoded by this gene was found to interact with androgen receptor (AR), which is a steroid hormone-dependent transcription factor that is important for male sexual differentiation and development. The p21-activated protein kinase 6 gene was found to be highly expressed in testis and prostate tissues and the encoded protein was shown to cotranslocate into the nucleus with AR in response to androgen.
17477	
17478	The CMRF35 antigen (CMRF35A; MIM 606786), which was identified by reactivity with a monoclonal antibody, is present on monocytes, neutrophils, and some T and B lymphocytes. CMRF35H is recognized by the same antibody and is distinct from CMRF35 (Green et al., 1998).[supplied by OMIM]
17479	TBKBP1 is an adaptor protein that binds to TBK1 (MIM 604834) and is part of the interaction network in the TNF (MIM 191160)/NFKB (see MIM 164011) pathway (Bouwmeester et al., 2004 [PubMed 14743216]).[supplied by OMIM]
17480	
17481	
17482	The protein encoded by this gene has been identified as a brain-specific angiogenesis inhibitor (BAI1)-binding protein.  This interaction at the cytoplasmic membrane is crucial to the function of this protein, which may be involved in neuronal growth-cone guidance.  This protein functions as an insulin receptor tyrosine kinase substrate and suggests a role for insulin in the central nervous system.  This protein has also been identified as interacting with the dentatorubral-pallidoluysian atrophy gene, which is associated with an autosomal dominant neurodegenerative disease.  It also associates with a downstream effector of Rho small G proteins, which is associated with the formation of stress fibers and cytokinesis.  Alternative splicing of the 3@#$%&-end of this gene results in three products of undetermined function.
17483	
17484	The protein encoded by this gene is a type I integral membrane protein localized in the intermediate region between the endoplasmic reticulum and the Golgi, presumably recycling between the two compartments. The protein is a mannose-specific lectin and is a member of a novel family of plant lectin homologs in the secretory pathway of animal cells. Mutations in the gene are associated with a coagulation defect. Using positional cloning, the gene was identified as the disease gene leading to combined factor V-factor VIII deficiency, a rare, autosomal recessive disorder in which both coagulation factors V and VIII are diminished.
17485	This gene encodes a protein that contains four domains present in other developmental proteins: a PWWP domain, an HMG box, a SET domain, and a PHD-type zinc finger. It is expressed ubiquitously in early development. Wolf-Hirschhorn syndrome (WHS) is a malformation syndrome associated with a hemizygous deletion of the distal short arm of chromosome 4. This gene maps to the 165 kb WHS critical region and has also been involved in the chromosomal translocation t(4;14)(p16.3;q32.3) in multiple myelomas. This gene contains 25 exons. Alternative splicing of this gene generates nine transcript variants, which encode four different isoforms.
17486	
17487	This gene, which encodes a member of the serine/threonine kinase family, regulates cell polarity and functions as a tumor suppressor. Mutations in this gene have been associated with Peutz-Jeghers syndrome, an autosomal dominant disorder characterized by the growth of polyps in the gastrointestinal tract, pigmented macules on the skin and mouth, and other neoplasms. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
17488	
17489	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds the immunosuppressants FK506 and rapamycin. It interacts with several intracellular signal transduction proteins including type I TGF-beta receptor. It also interacts with multiple intracellular calcium release channels including the tetrameric skeletal muscle ryanodine receptor. In mouse, deletion of this homologous gene causes congenital heart disorder known as noncompaction of left ventricular myocardium. There is evidence of multiple alternatively spliced transcript variants for this gene, but the full length nature of some variants has not been determined.
17490	
17491	
17492	The protein encoded by this gene belongs to the innexin family. Innexin family members are the structural components of gap junctions. This protein and pannexin 1 are abundantly expressed in central nerve system (CNS) and are coexpressed in various neuronal populations. Studies in Xenopus oocytes suggest that this protein alone and in combination with pannexin 1 may form cell type-specific gap junctions with distinct properties.
17493	
17494	
17495	
17496	The protein encoded by this gene is a transmembrane protein containing three extracellular CUB domains. This protein is found to be overexpressed in colon and lung cancers. Its expression level is correlated with the metastatic ability of carcinoma cells. This protein is located on the cell surface. It has been shown to be tyrosine phosphorylated in a cancer cell line. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
17497	
17498	
17499	
17500	
17501	
17502	
17503	
17504	
17505	
17506	
17507	
17508	
17509	
17510	
17511	
17512	
17513	
17514	
17515	
17516	
17517	
17518	
17519	NFKB1 (MIM 164011) or NFKB2 (MIM 164012) is bound to REL (MIM 164910), RELA (MIM 164014), or RELB (MIM 604758) to form the NFKB complex. The NFKB complex is inhibited by I-kappa-B proteins (NFKBIA, MIM 164008, or NFKBIB), which inactivate NF-kappa-B by trapping it in the cytoplasm. Phosphorylation of serine residues on the I-kappa-B proteins by kinases (IKBKA, MIM 600664 or IKBKB, MIM 603258) marks them for destruction via the ubiquitination pathway, thereby allowing activation of the NF-kappa-B complex. Activated NFKB complex translocates into the nucleus and binds DNA at kappa-B-binding motifs such as 5-prime GGGRNNYYCC 3-prime or 5-prime HGGARNYYCC 3-prime (where H is A, C, or T; R is an A or G purine; and Y is a C or T pyrimidine).[supplied by OMIM]
17520	
17521	
17522	
17523	
17524	
17525	
17526	
17527	
17528	This gene is located at chromosome 7p13 which is close to the Williams Beuren syndrome chromosome region 7q11.23.
17529	
17530	
17531	
17532	
17533	
17534	
17535	
17536	
17537	
17538	
17539	
17540	
17541	
17542	
17543	
17544	
17545	
17546	
17547	
17548	
17549	
17550	
17551	
17552	
17553	
17554	HLA-B belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exon 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-B alleles have been described
17555	This gene belongs to the polypeptide N-acetylgalactosaminyltransferase (pp-GalNAc-T) gene family. Polypeptide GalNAc transferases initiate the synthesis of mucin-type oligosaccharides by transferring GalNAc from UDP-GalNAc to the hydroxyl group of either a serine or threonine residue on the polypeptide acceptor. Following expression in insect cells, recombinant GalNAc transferase 10 showed significant GalNAcT activity toward mucin-derived peptides, and it utilized both nonglycosylated and glycosylated peptide substrates. Two transcript variants encoding distinct isoforms have been identified for this gene.
17556	The protein encoded by this gene is an integral membrane ATPase. The encoded protein is probably phosphorylated in its intermediate state and likely drives the transport of ions such as calcium across membranes. Two transcript variants encoding different isoforms have been found for this gene.
17557	A reciprocal translocation between chromosomes 22 and 9 produces the Philadelphia chromosome, which is often found in patients with chronic myelogenous leukemia. The chromosome 22 breakpoint for this translocation is located within the BCR gene. The translocation produces a fusion protein which is encoded by sequence from both BCR and ABL, the gene at the chromosome 9 breakpoint. Although the BCR-ABL fusion protein has been extensively studied, the function of the normal BCR gene product is not clear. The protein has serine/threonine kinase activity and is a GTPase-activating protein for p21rac.
17558	
17559	
17560	Ankyrins are a family of proteins that are believed to link the integral membrane proteins to the underlying spectrin-actin cytoskeleton and play key roles in activities such as cell motility, activation, proliferation, contact, and the maintenance of specialized membrane domains. Multiple isoforms of ankyrin with different affinities for various target proteins are expressed in a tissue-specific, developmentally regulated manner. Most ankyrins are typically composed of three structural domains: an amino-terminal domain containing multiple ankyrin repeats; a central region with a highly conserved spectrin binding domain; and a carboxy-terminal regulatory domain which is the least conserved and subject to variation. Ankyrin 3 is an immunologically distinct gene product from ankyrins 1 and 2, and was originally found at the axonal initial segment and nodes of Ranvier of neurons in the central and peripheral nervous systems. Alternatively spliced variants may be expressed in other tissues. Although multiple transcript variants encoding several different isoforms have been found for this gene, the full-length nature of only two have been characterized.
17561	The protein encoded by this gene catalyzes the conversion of (S)-2,3 oxidosqualene to lanosterol. The encoded protein is a member of the terpene cyclase/mutase family and catalyzes the first step in the biosynthesis of cholesterol, steroid hormones, and vitamin D. Two transcript variants encoding the same protein have been found for this gene.
17562	
17563	This gene codes for a centrosomal protein shown to be required for centriole-centriole cohesion during the cell cycle. The protein interacts with NIMA-related kinase 2 (NEK2), which is also associated with the centrosome.
17564	
17565	
17566	
17567	
17568	This gene encodes a WD repeat-containing protein. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
17569	
17570	
17571	
17572	
17573	
17574	
17575	The nuclear lamina consists of a two-dimensional matrix of proteins located next to the inner nuclear membrane. The lamin family of proteins make up the matrix and are highly conserved in evolution. During mitosis, the lamina matrix is reversibly disassembled as the lamin proteins are phosphorylated. Lamin proteins are thought to be involved in nuclear stability, chromatin structure and gene expression. Vertebrate lamins consist of two types, A and B. Through alternate splicing, this gene encodes three type A lamin isoforms. Mutations in this gene lead to several diseases: Emery-Dreifuss muscular dystrophy, familial partial lipodystrophy, limb girdle muscular dystrophy, dilated cardiomyopathy, Charcot-Marie-Tooth disease, and Hutchinson-Gilford progeria syndrome.
17576	
17577	
17578	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role. The protein encoded by this gene is one of the PKC family members. This kinase can be activated rapidly by the agonists of G protein-coupled receptors. It resides in both cytoplasm and nucleus, and its nuclear accumulation is found to be dramatically enhanced in response to its activation. This kinase can also be activated after B-cell antigen receptor (BCR) engagement, which requires intact phopholipase C gamma and the involvement of other PKC family members.
17579	
17580	
17581	
17582	The protein encoded by this gene is a member of the platelet-derived growth factor family. The four members of this family are mitogenic factors for cells of mesenchymal origin and are characterized by a core motif of eight cysteines. This gene product appears to form only homodimers. It differs from the platelet-derived growth factor alpha and beta polypeptides in having an unusual N-terminal domain, the CUB domain.
17583	
17584	
17585	
17586	
17587	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein localizes to the nuclear matrix. It interacts with the enhancer of polycomb protein and represses gene transcription. It is also thought to be involved in the differentiation of male germ cells. Rearrangements of this gene and the RET gene result in a gene encoding the ret transforming protein. Two alternatively spliced transcript variants for this gene have been described.
17588	
17589	
17590	This gene encodes a member of the DEAD box protein family. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The protein encoded by this gene shows ATPase activity in the presence of polynucleotides and associates with nucleoplasmic 65S preribosomal particles. This gene may be involved in ribosome synthesis, most likely during assembly of the large 60S ribosomal subunit.
17591	
17592	
17593	
17594	
17595	
17596	The protein encoded by this gene is predicted to be a five-pass transmembrane protein. This gene may be predominately expressed in brain.
17597	
17598	
17599	
17600	
17601	
17602	
17603	
17604	
17605	
17606	
17607	
17608	
17609	
17610	The region of chromosome 21 between genes CBR and ERG (CBR-ERG region), which spans 2.5 Mb on 21q22.2, has been defined by analysis of patients with partial trisomy 21. It contributes significantly to the pathogenesis of many characteristics of Down syndrome, including morphological features, hypotonia, and mental retardation. The DSCR3 (Down syndrome critical region gene 3) gene is found in this region and is predictated to contain eight exons. DSCR3 is expressed in most tissues examined.
17611	
17612	
17613	
17614	
17615	
17616	Due to its chemical instability and low solubility in aqueous solution, vitamin A requires cellular retinol-binding proteins (CRBPs), such as RBP7, for stability, internalization, intercellular transfer, homeostasis, and metabolism.[supplied by OMIM]
17617	
17618	
17619	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role. The protein encoded by this gene is one of the PKC family members. This kinase can be activated rapidly by the agonists of G protein-coupled receptors. It resides in both cytoplasm and nucleus, and its nuclear accumulation is found to be dramatically enhanced in response to its activation. This kinase can also be activated after B-cell antigen receptor (BCR) engagement, which requires intact phopholipase C gamma and the involvement of other PKC family members.
17620	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, and it is activated by Myc protein.
17621	
17622	
17623	
17624	Tight junctions represent one mode of cell-to-cell adhesion in epithelial or endothelial cell sheets, forming continuous seals around cells and serving as a physical barrier to prevent solutes and water from passing freely through the paracellular space. These junctions are comprised of sets of continuous networking strands in the outwardly facing cytoplasmic leaflet, with complementary grooves in the inwardly facing extracytoplasmic leaflet. The protein encoded by this gene, a member of the claudin family, is an integral membrane protein and a component of tight junction strands.
17625	This gene is a highly conserved lung cancer candidate gene. No other information about this gene is currently available.
17626	The GATA family of transcription factors, which contain zinc fingers in their DNA binding domain, have emerged as candidate regulators of gene expression in hematopoietic cells (Tsai et al., 1994 [PubMed 8078582]). GATA1 (MIM 305371) is essential for normal primitive and definitive erythropoiesis and is expressed at high levels in erythroid cells, mast cells, and megakaryocytes. GATA2 is expressed in hematopoietic progenitors, including early erythroid cells, mast cells, and megakaryocytes, and also in nonhematopoietic embryonic stem cells. In chicken erythroid progenitors, forced expression of GATA2 promotes proliferation at the expense of differentiation (Briegel et al., 1993). GATA3 (MIM 131320) expression is restricted to T-lymphoid cells and some nonhematopoietic cell types, including embryonic stem cells.[supplied by OMIM]
17627	
17628	This gene encodes the rate-limiting enzyme in de novo guanine nucleotide biosynthesis, maintaining cellular guanine deoxy- and ribonucleotide pools needed for DNA and RNA synthesis. The encoded protein catalyzes the NAD-dependent oxidation of inosine-5@#$%&-monophosphate into xanthine-5@#$%&-monophosphate, which is then converted into guanosine-5@#$%&-monophosphate. In some neoplasms, the encoded protein is up-regulated, suggesting it may play a role in malignant transformation. A gene encoding a related protein with an identical catalytic site is located on chromosome 7.
17629	
17630	
17631	
17632	
17633	In a study identifying genes in rat that are upregulated in response to nerve damage, a gene which is highly expressed in ganglia and in the central nervous system was found. The protein encoded by the rat gene contains both a zinc finger and a RING finger motif and is localized in the endosome/lysosome compartment, indicating that it may be involved in ubiquitin-mediated protein modification. The protein encoded by this human gene is highly similar in sequence to that encoded by the rat gene.
17634	
17635	
17636	
17637	
17638	
17639	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order   structures. This gene is intronless and encodes a member of the histone H4 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6.
17640	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein localizes to cytoplasmic bodies. Studies in mice suggest that this protein plays a role in terminal differentiation of erythroid cells. Alternate splicing of this gene generates two transcript variants encoding different isoforms.
17641	
17642	HLA-DPB belongs to the HLA class II beta chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DPA) and a beta chain (DPB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa and its gene contains 6 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DP molecule both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to 4 different molecules.
17643	
17644	This gene was identified as a gene whose expression is rapidly induced by the tumor necrosis factor (TNF). The protein encoded by this gene is a zinc finger protein, and has been shown to inhibit NF-kappa B activation as well as TNF-mediated apoptosis. Knockout studies of a similar gene in mice suggested that this gene is critical for limiting inflammation by terminating TNF-induced NF-kappa B responses.
17645	
17646	The protein encoded by this gene is a type II transmembrane protein and is similar to two families of cell adhesion molecules, the protocadherins and ependymins. This protein may play a role in calcium dependent cell adhesion.
17647	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
17648	
17649	The protein encoded by this gene is a member of the cyclin-dependent protein kinase (CDK) family. CDK family members are highly similar to the gene products of Saccharomyces cerevisiae cdc28, and Schizosaccharomyces pombe cdc2, and are known to be important regulators of cell cycle progression. This kinase is a catalytic subunit of the protein kinase complex that is important for cell cycle G1 phase progression and G1/S transition. The activity of this kinase first appears in mid-G1 phase, which is controlled by the regulatory subunits including D-type cyclins and members of INK4 family of CDK inhibitors. This kinase, as well as CDK4, has been shown to phosphorylate, and thus regulate the activity of, tumor suppressor protein Rb.
17650	
17651	
17652	
17653	
17654	
17655	
17656	The protein encoded by this gene is a cytokine that belongs to the tumor necrosis factor (TNF) ligand family. This protein is abundantly expressed in endothelial cells, but is not expressed in either B or T cells. The expression of this protein is inducible by TNF and IL-1 alpha. This cytokine is a ligand for receptor TNFRSF25 and decoy receptor TNFRSF21/DR6. It can activate NF-kappaB and MAP kinases, and acts as an autocrine factor to induce apoptosis in endothelial cells. This cytokine is also found to inhibit endothelial cell proliferation, and thus may function as an angiogenesis inhibitor. An additional isoform encoded by an alternatively spliced transcript variant has been reported but the sequence of this transcript has not been determined.
17657	
17658	The protein encoded by this gene is the medium subunit of AP-3, which is an adaptor-related protein complex associated with the Golgi region as well as more peripheral intracellular structures. AP-3 facilitates the budding of vesicles from the Golgi membrane and may be directly involved in protein sorting to the endosomal/lysosomal system. AP-3 is a heterotetrameric protein complex composed of two large subunits (delta and beta3), a medium subunit (mu3), and a small subunit (sigma 3). Mutations in one of the large subunits of AP-3 have been associated with the Hermansky-Pudlak syndrome, a genetic disorder characterized by defective lysosome-related organelles. Alternatively spliced transcript variants encoding the same protein have been observed.
17659	The protein encoded by this gene is the human ortholog of yeast mitochondrial AAA metalloprotease, Yme1p. It is localized in the mitochondria and can functionally complement a yme1 disruptant yeast strain. It is proposed that this gene plays a role in mitochondrial protein metabolism and could be involved in mitochondrial pathologies. Several transcript variants encoding different isoforms have been found for this gene.
17660	
17661	
17662	
17663	
17664	
17665	
17666	This gene encodes one of seven subunits of the human Arp2/3 protein complex. The Arp2/3 protein complex has been implicated in the control of actin polymerization in cells and has been conserved through evolution. The exact role of the protein encoded by this gene, the p21 subunit, has yet to be determined.
17667	
17668	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene is a candidate tumor suppressor gene and located in the critical region of loss of heterozygosity (LOH). It encodes a DEAD box protein, which is highly conserved in evolution, indicating an important cellular function.
17669	This gene encodes a member of the GTP cyclohydrolase I family. The encoded protein is the first and rate-limiting enzyme in tetrahydrobiopterin (BH4) biosynthesis, catalyzing the conversion of GTP into 7,8-dihydroneopterin triphosphate. BH4 is an essential cofactor required by aromatic amino acid hydroxylases as well as nitric oxide synthases. Mutations in this gene are associated with malignant hyperphenylalaninemia and dopa-responsive dystonia. Several alternatively spliced transcript variants encoding different isoforms have been described, however, only one gives rise to the functional enzyme.
17670	
17671	The protein encoded by this gene belongs to the family of latent transforming growth factor (TGF)-beta binding proteins (LTBP), which are extracellular matrix proteins with multi-domain structure. This protein is the largest member of the LTBP family possessing unique regions and with most similarity to the fibrillins. It has thus been suggested that it may have multiple functions: as a member of the TGF-beta latent complex, as a structural component of microfibrils, and a role in cell adhesion.
17672	
17673	
17674	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. This gene is the human ortholog of mouse Tbx6. Knockout experiments in mice indicate that Tbx6 is important for specification of paraxial mesoderm structures. Transcript variants encoding different isoforms have been noted for this gene.
17675	
17676	FOXJ1 is a member of the forkhead gene family, which was originally identified in Drosophila. The forkhead family is composed of transcription factors with a conserved 100-amino acid DNA-binding motif.[supplied by OMIM]
17677	
17678	
17679	This gene encodes a member of the peroxiredoxin family of antioxidant enzymes, which reduce hydrogen peroxide and alkyl hydroperoxides. The encoded protein may play an antioxidant protective role in cells, and may contribute to the antiviral activity of CD8(+) T-cells. This protein may have a proliferative effect and play a role in cancer development or progression. The crystal structure of this protein has been resolved to 2.7 angstroms. Three transcript variants encoding distinct isoforms have been identified for this gene.
17680	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel consists of three main subunits (a, b, c). This gene encodes the delta subunit of the catalytic core. Alternatively spliced transcript variants encoding the same isoform have been identified.
17681	This gene, CYP4F3, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum. The enzyme starts the process of inactivating and degrading leukotriene B4, a potent mediator of inflammation. This gene is part of a cluster of cytochrome P450 genes on chromosome 19. Another member of this family, CYP4F8, is approximately 18 kb away.
17682	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum. The enzyme starts the process of inactivating and degrading leukotriene B4, a potent mediator of inflammation. This gene is part of a cluster of cytochrome P450 genes on chromosome 19. Another member of this family, CYP4F11, is approximately 16 kb away.
17683	
17684	This gene has a highly complex imprinted expression pattern. It encodes maternally, paternally, and biallelically expressed proteins which are derived from alternatively spliced transcripts with alternate 5@#$%& exons. Each of the upstream exons is within a differentially methylated region, commonly found in imprinted genes. However, the close proximity (14 kb) of two oppositely expressed promoter regions is unusual. In addition, one of the alternate 5@#$%& exons introduces a frameshift relative to the other transcripts, resulting in one isoform which is structurally unrelated to the others. An antisense transcript exists, and may regulate imprinting in this region. Mutations in this gene result in pseudohypoparathyroidism type 1a (PHP1a), which has an atypical autosomal dominant inheritance pattern requiring maternal transmission for full penetrance. There are RefSeqs representing four transcript variants of this gene. Other transcript variants including four additional exons have been described; however, their full length sequences have not been determined.
17685	
17686	
17687	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes one of the larger subunits of TFIID that is stably associated with the TFIID complex. It contributes to interactions at and downstream of the transcription initiation site, interactions that help determine transcription complex response to activators.
17688	
17689	
17690	HLA-DRB1 belongs to the HLA class II beta chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DRA) and a beta chain (DRB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa. It is encoded by 6 exons, exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DR molecule the beta chain contains all the polymorphisms specifying the peptide binding specificities. Hundreds of DRB1 alleles have been described and typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. DRB1 is expressed at a level five times higher than its paralogues DRB3, DRB4 and DRB5. DRB1 is present in all individuals. Allelic variants of DRB1 are linked with either none or one of the genes DRB3, DRB4 and DRB5. There are 4 related pseudogenes: DRB2, DRB6, DRB7, DRB8 and DRB9.
17691	
17692	
17693	
17694	
17695	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. The specific function of this protein has not yet been determined; however, this protein may play a role in cellular detoxification as a pump for its substrate, organic anions.
17696	
17697	
17698	
17699	
17700	
17701	
17702	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S7E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
17703	
17704	
17705	
17706	Membrane mucins, such as MUC17, function in epithelial cells to provide cytoprotection, maintain luminal structure, provide signal transduction, and confer antiadhesive properties upon cancer cells that lose their apical/basal polarization.[supplied by OMIM]
17707	
17708	
17709	
17710	
17711	
17712	This gene encodes a heavily glycosylated protein which is a ligand for the NKG2D type II receptor. Binding of the ligand activates the cytolytic response of natural killer (NK) cells, CD8 alphabeta T cells, and gammadelta T cells which express the receptor. This protein is stress-induced and is similar to MHC class I molecules; however, it does not associate with beta-2-microglobulin or bind peptides.
17713	Immunoglobulins recognize foreign antigens and initiate immune responses such as phagocytosis and the complement system. Each immunoglobulin molecule consists of two identical heavy chains and two identical light chains. There are two classes of light chains, kappa and lambda. This region represents the germline organization of the lambda light chain locus. The locus includes V (variable), J (joining), and C (constant) segments. During B cell development, a recombination event at the DNA level joins a single V segment with a J segment; the C segment is later joined by splicing at the RNA level. Recombination of many different V segments with several J segments provides a wide range of antigen recognition. Additional diversity is attained by junctional diversity, resulting from the random additional of nucleotides by terminal deoxynucleotidyltransferase, and by somatic hypermutation, which occurs during B cell maturation in the spleen and lymph nodes. Several V segments and three C segments are known to be incapable of encoding a protein and are considered pseudogenes. The locus also includes several non-immunoglobulin genes, many of which are pseudogenes or are predicted by automated computational analysis or homology to other species.
17714	
17715	
17716	
17717	
17718	
17719	
17720	
17721	
17722	This gene encodes a member of the serine/threonine protein kinase family, that is highly similar to yeast SPS1/STE20 kinase. Yeast SPS1/STE20 functions near the beginning of the MAP kinase signal cascades that is essential for yeast pheromone response. This kinase was shown to activate Jun kinase in mammalian cells, which suggested a role in stress response. Two alternatively spliced transcript variants encoding the same protein have been described for this gene.
17723	
17724	
17725	
17726	
17727	
17728	
17729	
17730	
17731	
17732	
17733	
17734	This gene encodes a protein with similarity to the C. elegans unc93 protein. The Unc93 protein is involved in the regulation or coordination of muscle contraction in the worm.
17735	
17736	
17737	The protein encoded by this gene is a member of the parathyroid hormone family. This hormone regulates endochondral bone development and epithelial-mesenchymal interactions during the formation of the mammary glands and teeth. This hormone is involved in lactation possibly by regulating the mobilization and transfer of calcium to the milk. The receptor of this hormone, PTHR1, is responsible for most cases of humoral hypercalcemia of malignancy. Four alternatively spliced transcript variants encoding two distinct isoforms have been observed.
17738	
17739	
17740	
17741	
17742	
17743	
17744	
17745	
17746	This gene, expressed in the pituitary, encodes a receptor for growth hormone-releasing hormone. Binding of this hormone to the receptor leads to synthesis and release of growth hormone. Mutations in this gene have been associated with isolated growth hormone deficiency (IGHD), also known as Dwarfism of Sindh, a disorder characterized by short stature. Many alternate transcriptional splice variants encoding different isoforms have been described, but only two have been characterized to date.
17747	
17748	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
17749	
17750	The RYR1 gene encodes the skeletal muscle ryanodine receptor, which serves as a calcium release channel of the sarcoplasmic reticulum as well as a bridging structure connecting the sarcoplasmic reticulum and transverse tubule (MacLennan et al., 1989). See also RYR2 (MIM 180902) and RYR3 (MIM 180903), which encode the cardiac and brain ryanodine receptors, respectively.[supplied by OMIM]
17751	
17752	
17753	
17754	This gene encodes a hormone inducible transcriptional repressor. Repression of transcription by this gene product can occur through interactions with other repressors, by the recruitment of proteins involved in histone deacetylation, or through sequestration of transcriptional activators. The product of this gene contains a carboxy-terminal domain that permits binding to other corepressor proteins. This domain also permits interaction with members of the NuRD complex, a nucleosome remodeling protein complex that contains deacetylase activity. In addition, this repressor contains several RNA recognition motifs that confer binding to a steroid receptor RNA coactivator; this binding can modulate the activity of both liganded and nonliganded steroid receptors.
17755	
17756	
17757	
17758	
17759	
17760	
17761	
17762	
17763	
17764	
17765	
17766	
17767	
17768	
17769	
17770	
17771	
17772	
17773	
17774	
17775	
17776	
17777	
17778	
17779	
17780	The RAS oncogene (MIM 190020) is mutated in nearly one-third of all human cancers. Members of the RAS superfamily are plasma membrane GTP-binding proteins that modulate intracellular signal transduction pathways. A subfamily of RAS effectors, including NORE1, share a RAS association (RA) domain.[supplied by OMIM]
17781	LOR encodes loricrin, a major protein component of the cornified cell envelope found in terminally differentiated epidermal cells. Mutations in LOR may be the cause of both Vohwinkel@#$%&s syndrome and progressive symmetric erythrokeratoderma, both inherited skin diseases.
17782	This gene encodes a member of the PUF family. The encoded protein contains a sequence-specific RNA binding domain comprised of eight repeats and N- and C-terminal flanking regions, and serves as a translational regulator of specific mRNAs by binding to their 3@#$%& untranslated regions. The evolutionarily conserved function of the encoded protein in invertebrates and lower vertebrates suggests that the human protein may be involved in translational regulation of embryogenesis, and cell development and differentiation. Alternatively spliced transcript variants encoding different isoforms have been described.
17783	This gene was identified as a c-Myc responsive gene, and behaves as a direct c-Myc target gene. Overexpression of this gene is found to enhance the transformation of lymphoblastoid cells, and it complements a transformation-defective Myc Box II mutant, suggesting its involvement in c-Myc-mediated cell transformation. Two alternatively spliced transcript variants encoding distinct isoforms have been reported.
17784	This gene encodes a membrane-anchored protein with a hydrophobic amino terminal domain and a cyclophilin-like PPIase domain. It is present on the surface of natural killer cells and facilitates their binding to targets. Alternate transcriptional splice variants, encoding different isoforms, have been characterized and expression of the splice variants is regulated by IL2 activation of the cells.
17785	
17786	
17787	
17788	
17789	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein localizes to the cytoplasm and its expression is induced by interferon. The protein down-regulates transcription from the HIV-1 LTR promoter region, suggesting that function of this protein may be to mediate interferon@#$%&s antiviral effects.
17790	
17791	This gene encodes a water channel protein located in the kidney collecting tubule. It belongs to the MIP/aquaporin family, some members of which are clustered together on chromosome 12q13. Mutations in this gene have been linked to autosomal dominant, and recessive forms of nephrogenic diabetes insipidus.
17792	This gene encodes coagulation factor VII which is a vitamin K-dependent factor essential for hemostasis. This factor circulates in the blood in a zymogen form, and is converted to an active form by either factor IXa, factor Xa, factor XIIa, or thrombin by minor proteolysis. Upon activation of the factor VII, a heavy chain containing a catalytic domain and a light chain containing 2 EGF-like domains are generated, and two chains are held together by a disulfide bond. In the presence of factor III and calcium ions, the activated factor then further activates the coagulation cascade by converting factor IX to factor IXa and/or factor X to factor Xa. Alternative splicing of this gene results in 2 transcripts. Defects in this gene can cause coagulopathy.
17793	
17794	The human alanyl-tRNA synthetase (AARS) belongs to a family of tRNA synthases, of the class II enzymes.  Class II tRNA synthases evolved early in evolution and are highly conserved.  This is reflected by the fact that 498 of the 968-residue polypeptide human AARS shares 41% identity witht the E.coli protein.  tRNA synthases are the enzymes that interpret the RNA code and attach specific aminoacids to the tRNAs that contain the cognate trinucleotide anticodons.  They consist of a catalytic domain which interacts with the amino acid acceptor-T psi C helix of the tRNA, and a second domain which interacts with the rest of the tRNA structure.
17795	
17796	
17797	The protein encoded by this gene is a member of the STAT family of transcription factors. In response to cytokines and growth factors, STAT family members are phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. This protein is activated by, and mediates the responses of many cell ligands, such as IL2, IL3, IL7 GM-CSF, erythropoietin, thrombopoietin, and different growth hormones. Activation of this protein in myeloma and lymphoma associated with a TEL/JAK2 gene fusion is independent of cell stimulus and has been shown to be essential for the tumorigenesis. The mouse counterpart of this gene is found to induce the expression of BCL2L1/BCL-X(L), which suggests the antiapoptotic function of this gene in cells.
17798	The process of differentiation from mesodermal precursor cells to myoblasts has led to the discovery of a variety of tissue-specific factors that regulate muscle gene expression. The myogenic basic helix-loop-helix proteins, including myoD (MIM 159970), myogenin (MIM 159980), MYF5 (MIM 159990), and MRF4 (MIM 159991) are 1 class of identified factors. A second family of DNA binding regulatory proteins is the myocyte-specific enhancer factor-2 (MEF2) family. Each of these proteins binds to the MEF2 target DNA sequence present in the regulatory regions of many, if not all, muscle-specific genes. The MEF2 genes are members of the MADS gene family (named for the yeast mating type-specific transcription factor MCM1, the plant homeotic genes @#$%&agamous@#$%& and @#$%&deficiens@#$%& and the human serum response factor SRF (MIM 600589)), a family that also includes several homeotic genes and other transcription factors, all of which share a conserved DNA-binding domain.[supplied by OMIM]
17799	This gene is located within the Smith-Magenis syndrome region on chromosome 17. Mutations in this gene are associated with Birt-Hogg-Dube syndrome, which is characterized by fibrofolliculomas, renal tumors, lung cysts, and pneumothorax. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
17800	
17801	
17802	Ubiquitin is a highly conserved nuclear and cytoplasmic protein that has a major role in targeting cellular proteins for degradation by the 26S proteosome. It is also involved in the maintenance of chromatin structure, the regulation of gene expression, and the stress response. Ubiquitin is synthesized as a precursor protein consisting of either polyubiquitin chains or a single ubiquitin moiety fused to an unrelated protein. This gene encodes a fusion protein consisting of ubiquitin at the N terminus and ribosomal protein L40 at the C terminus. When expressed in yeast, the protein is post-translationally processed, generating free ubiquitin monomer and the 52-amino acid ribosomal protein L40. Ribosomal protein L40 is a component of the 60S subunit of the ribosome and belongs to the L40E family of ribosomal proteins. It contains a zinc finger-like domain and is located in the cytoplasm. Multiple processed pseudogenes derived from this gene are present in the genome. As with ribosomal protein L40, ribosomal protein S27a is also synthesized as a fusion protein with ubiquitin; similarly, ribosomal protein S30 is synthesized as a fusion protein with the ubiquitin-like protein fubi.
17803	
17804	Immunoglobulins recognize foreign antigens and initiate immune responses such as phagocytosis and the complement system. Each immunoglobulin molecule consists of two identical heavy chains and two identical light chains. There are two classes of light chains, kappa and lambda. This region represents the germline organization of the lambda light chain locus. The locus includes V (variable), J (joining), and C (constant) segments. During B cell development, a recombination event at the DNA level joins a single V segment with a J segment; the C segment is later joined by splicing at the RNA level. Recombination of many different V segments with several J segments provides a wide range of antigen recognition. Additional diversity is attained by junctional diversity, resulting from the random additional of nucleotides by terminal deoxynucleotidyltransferase, and by somatic hypermutation, which occurs during B cell maturation in the spleen and lymph nodes. Several V segments and three C segments are known to be incapable of encoding a protein and are considered pseudogenes. The locus also includes several non-immunoglobulin genes, many of which are pseudogenes or are predicted by automated computational analysis or homology to other species.
17805	This gene encodes a protein involved in import of peroxisomal matrix proteins. This protein localizes to the peroxisomal membrane. Mutations in this gene result in phenotypes within the Zellweger spectrum of peroxisomal biogenesis disorders, ranging from neonatal adrenoleukodystrophy to Zellweger syndrome. Alternative splicing results in two transcript variants encoding different isoforms.
17806	
17807	
17808	
17809	
17810	
17811	
17812	
17813	
17814	
17815	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin and its kinase partner CDK9 were found to be subunits of the transcription elongation factor p-TEFb. The p-TEFb complex containing this cyclin was reported to interact with, and act as a negative regulator of human immunodeficiency virus type 1 (HIV-1) Tat protein. Two alternatively spliced transcript variants, which encode distinct isoforms, have been described.
17816	
17817	
17818	
17819	
17820	The enzyme system for cleavage of glycine (glycine cleavage system; EC 2.1.2.10), which is confined to the mitochondria, is composed of 4 protein components: P protein (a pyridoxal phosphate-dependent glycine decarboxylase; MIM 238300), H protein (a lipoic acid-containing protein; MIM 238330), T protein (a tetrahydrofolate-requiring enzyme), and L protein (a lipoamide dehydrogenase; MIM 238331). Glycine encephalopathy (GCE; MIM 605899) may be due to a defect in any one of these enzymes.[supplied by OMIM]
17821	
17822	
17823	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene was previously referred to as FBX30, and belongs to the Fbws class; in addition to an F-box, this protein contains 7 tandem WD40 repeats. This protein binds directly to cyclin E and probably targets cyclin E for ubiquitin-mediated degradation. Mutations in this gene are detected in ovarian and breast cancer cell lines, implicating the gene@#$%&s potential role in the pathogenesis of human cancers. Three transcript variants encoding three different isoforms have been found for this gene.
17824	The protein encoded by this gene is a LIM domain protein. LIM domains are cysteine-rich double zinc fingers composed of 50 to 60 amino acids that are involved in protein-protein interactions. LIM domain-containing proteins are scaffolds for the formation of multiprotein complexes. The proteins are involved in cytoskeleton organization, cell lineage specification, organ development, and oncogenesis. The encoded protein is also a member of the Enigma class of proteins, a family of proteins that possess a 100-amino acid PDZ domain in the N terminus and 1 to 3 LIM domains in the C terminus. Multiple transcript variants encoding different isoforms have been found for this gene, although not all of them have been fully characterized.
17825	Polyspecific organic cation transporters in the liver, kidney, intestine, and other organs are critical for elimination of many endogenous small organic cations as well as a wide array of drugs and environmental toxins. The encoded protein is a plasma integral membrane protein which functions both as an organic cation transporter and as a sodium-dependent high affinity carnitine transporter. The encoded protein is involved in the active cellular uptake of carnitine. Mutations in this gene are the cause of systemic primary carnitine deficiency (CDSP), an autosomal recessive disorder manifested early in life by hypoketotic hypoglycemia and acute metabolic decompensation, and later in life by skeletal myopathy or cardiomyopathy.
17826	
17827	The protein encoded by this gene belongs to the subtilisin-like proprotein convertase family. The members of this family are proprotein convertases that process latent precursor proteins into their biologically active products. This encoded protein is a type I proinsulin-processing enzyme that plays a key role in regulating insulin biosynthesis. It is also known to cleave proopiomelanocortin, prorenin, proenkephalin, prodynorphin, prosomatostatin and progastrin. Mutations in this gene are thought to cause obesity. This encoded protein is associated with carcinoid tumors. The use of alternate polyadenylation sites has been found for this gene. Multiple alternatively spliced transcript variants have been described for this gene but their full length nature is not known.
17828	
17829	
17830	
17831	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. This structure consists of approximately 146 bp of DNA wrapped around a nucleosome, an octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H3 family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6p22-p21.3.
17832	
17833	This gene encodes a member of the C-type lectin/C-type lectin-like domain (CTL/CTLD) superfamily. Members of this family share a common protein fold and have diverse functions, such as cell adhesion, cell-cell signalling, glycoprotein turnover, and roles in inflammation and immune response. The encoded type II transmembrane protein interacts with dnax-activation protein 12 and may play a role in cell activation. Alternative splice variants have been described but their full-length sequence has not been determined.
17834	
17835	
17836	
17837	
17838	
17839	
17840	
17841	
17842	
17843	
17844	Aminoacyl-tRNA synthetases are a class of enzymes that charge tRNAs with their cognate amino acids. The protein encoded by this gene belongs to the class I family of tRNA synthetases.
17845	This gene belongs to a group of vacuolar protein sorting (VPS) genes that, when functionally impaired, disrupt the efficient delivery of vacuolar hydrolases. The protein encoded by this gene is a component of a large multimeric complex, termed the retromer complex, which is involved in retrograde transport of proteins from endosomes to the trans-Golgi network. This VPS protein may be involved in the formation of the inner shell of the retromer coat for retrograde vesicles leaving the prevacuolar compartment. Alternative splice variants encoding different isoforms, and usage of multiple polyadenylation sites have been found for this gene.
17846	The protein encoded by this gene is a member of the dynamin superfamily of GTPases. Members of the dynamin-related subfamily, including the S. cerevisiae proteins Dnm1 and Vps1, contain the N-terminal tripartite GTPase domain but do not have the pleckstrin homology or proline-rich domains. This protein establishes mitochondrial morphology through a role in distributing mitochondrial tubules throughout the cytoplasm. The gene has 3 alternatively spliced transcripts encoding different isoforms. These transcripts are alternatively polyadenylated.
17847	This gene encodes a member of the Wiskott-Aldrich syndrome protein family. The gene product is a protein that forms a multiprotein complex that links receptor kinases and actin. Binding to actin occurs through a C-terminal verprolin homology domain in all family members. The multiprotein complex serves to tranduce signals that involve changes in cell shape, motility or function.
17848	This gene encodes a protein that contains two WD domains and an FYVE zinc finger region. The function of this gene is unknown. An alternatively spliced transcript variant of this gene may exist.
17849	The protein encoded by this gene belongs to the family of latent transforming growth factor (TGF)-beta binding proteins (LTBP), which are extracellular matrix proteins with multi-domain structure. This protein is the largest member of the LTBP family possessing unique regions and with most similarity to the fibrillins. It has thus been suggested that it may have multiple functions: as a member of the TGF-beta latent complex, as a structural component of microfibrils, and a role in cell adhesion.
17850	This gene encodes the GA-binding protein transcription factor, beta subunit. This protein forms a tetrameric complex with the alpha subunit, and stimulates transcription of target genes. The encoded protein may be involved in activation of cytochrome oxidase expression and nuclear control of mitochondrial function. The crystal structure of a similar protein in mouse has been resolved as a ternary protein complex. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
17851	
17852	
17853	
17854	The enzyme encoded by this gene is classified as a thiolesterase and is responsible for the hydrolysis of S-lactoyl-glutathione to reduced glutathione and D-lactate.
17855	
17856	
17857	
17858	The B9 protein function is not known. Identification of a conserved mouse homolog and representation in the EST database indicates that this transcript does represent a gene. This gene is located within the Smith-Magenis syndrome region on chromosome 17.
17859	
17860	
17861	UGT2B17 belongs to the family of UDP-glucuronosyltransferases (UGTs; EC 2.4.1.17), enzymes that catalyze the transfer of glucuronic acid from uridine diphosphoglucuronic acid to a variety of substrates, including steroid hormones.[supplied by OMIM]
17862	The protein encoded by this gene is a member of the ubiquitin-conjugating enzyme family. Ubiquitin-conjugating enzyme catalyzes the covalent attachment of ubiquitin to other proteins. This protein is a part of the large multiprotein complex, which is required for ubiquitin-mediated degradation of cell cycle G1 regulators, and for the initiation of DNA replication.
17863	
17864	
17865	
17866	
17867	
17868	THO2 is part of the TREX (transcription/export) complex, which includes TEX1 (MIM 606929), HPR1 (MIM 606930), ALY (MIM 604171), and UAP56 (MIM 606390).[supplied by OMIM]
17869	
17870	
17871	
17872	
17873	
17874	
17875	
17876	
17877	
17878	
17879	
17880	
17881	
17882	
17883	
17884	
17885	
17886	
17887	
17888	
17889	This gene encodes twinfilin, an actin monomer-binding protein conserved from yeast to mammals. Studies of the mouse counterpart suggest that this protein may be an actin monomer-binding protein, and its localization to cortical G-actin-rich structures may be regulated by the small GTPase RAC1. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
17890	
17891	This gene is located at chromosome 7p13 which is close to the Williams Beuren syndrome chromosome region 7q11.23.
17892	
17893	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has one RRM domain that binds to RNAs. Two alternatively spliced transcript variants have been described for this gene.
17894	
17895	
17896	The protein encoded by this gene is a member of the cyclin-dependent serine/threonine protein kinase family. Members of this family are well known for their essential roles as master switches in cell cycle control. Some of the cell cycle control kinases are able to phosphorylate proteins that are important for cell differentiation and apoptosis, thus provide connections between cell proliferation, differentiation, and apoptosis. Proteins of this family may also be involved in non-cell cycle-related functions, such as neurocytoskeleton dynamics. The exact function of this protein has not yet been determined. It has unusually large N- and C-termini and is ubiquitously expressed in many tissues. Two alternatively spliced variants are described.
17897	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. This protein is thought to form ATP-sensitive potassium channels in cardiac, skeletal, and vascular and non-vascular smooth muscle. Protein structure suggests a role as the drug-binding channel-modulating subunit of the extrapancreatic ATP-sensitive potassium channels. No disease has been associated with this gene thus far. Alternative splicing of this gene results in several products, two of which result from differential usage of two terminal exons and one which results from exon deletion.
17898	
17899	
17900	
17901	
17902	
17903	
17904	
17905	
17906	
17907	
17908	
17909	
17910	
17911	
17912	
17913	
17914	
17915	
17916	
17917	
17918	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein that belongs to the EcoL2 ribosomal protein family. A pseudogene corresponding to this gene is found on chromosome 12q.
17919	
17920	Eukaryotic protein kinases are enzymes that belong to a very extensive family of proteins which share a conserved catalytic core common with both serine/threonine and tyrosine protein kinases. This gene encodes an intestinal serine/threonine kinase harboring a dual phosphorylation site found in mitogen-activating protein (MAP) kinases. The protein localizes to the intestinal crypt region and is thought to be important in intestinal epithelial cell proliferation and differentiation. Alternative splicing has been observed at this locus and two variants, encoding the same isoform, have been identified.
17921	
17922	This gene encodes a member of the von Willebrand factor A domain containing protein family. This family of proteins is thought to be involved in the formation of filamentous networks in the extracellular matrices of various tissues. This protein contains five von Willebrand factor A domains. The specific function of this gene has not yet been determined. Two transcript variants encoding different isoforms have been found for this gene.
17923	This gene is a member of the tropomyosin family of highly conserved, widely distributed actin-binding proteins involved in the contractile system of striated and smooth muscles and the cytoskeleton of non-muscle cells. Tropomyosin is composed of two alpha-helical chains arranged as a coiled-coil. It is polymerized end to end along the two grooves of actin filaments and provides stability to the filaments. The encoded protein is one type of alpha helical chain that forms the predominant tropomyosin of striated muscle, where it also functions in association with the troponin complex to regulate the calcium-dependent interaction of actin and myosin during muscle contraction. In smooth muscle and non-muscle cells, alternatively spliced transcript variants encoding a range of isoforms have been described. Mutations in this gene are associated with type 3 familial hypertrophic cardiomyopathy.
17924	
17925	
17926	
17927	
17928	
17929	
17930	Many polypeptide mitogens, such as basic fibroblast growth factor (FGFB; MIM 134920) and platelet-derived growth factors (MIM 173430, MIM 190040), are active on a wide range of different cell types. In contrast, vascular endothelial growth factor is a mitogen primarily for vascular endothelial cells. It is, however, structurally related to platelet-derived growth factor.[supplied by OMIM]
17931	
17932	
17933	
17934	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role. The protein encoded by this gene is one of the PKC family members. This kinase can be activated rapidly by the agonists of G protein-coupled receptors. It resides in both cytoplasm and nucleus, and its nuclear accumulation is found to be dramatically enhanced in response to its activation. This kinase can also be activated after B-cell antigen receptor (BCR) engagement, which requires intact phopholipase C gamma and the involvement of other PKC family members.
17935	
17936	
17937	
17938	
17939	
17940	Dehydrodolichyl diphosphate (dedol-PP) synthase catalyzes cis-prenyl chain elongation to produce the polyprenyl backbone of dolichol, a glycosyl carrier lipid required for the biosynthesis of several classes of glycoproteins.[supplied by OMIM]
17941	
17942	Adenylate kinases are involved in regulating the adenine nucleotide composition within a cell by catalyzing the reversible transfer of phosphate groups among adenine nucleotides. Three isozymes of adenylate kinase, namely 1, 2, and 3, have been identified in vertebrates; this gene encodes isozyme 2. Expression of these isozymes is tissue-specific and developmentally regulated. Isozyme 2 is localized in the mitochondrial intermembrane space and may play a role in apoptosis. Transcript variants encoding distinct isoforms have been identified for this gene.
17943	Dolichol-phosphate mannose (Dol-P-Man) serves as a donor of mannosyl residues on the lumenal side of the endoplasmic reticulum (ER). Lack of Dol-P-Man results in defective surface expression of GPI-anchored proteins. Dol-P-Man is synthesized from GDP-mannose and dolichol-phosphate on the cytosolic side of the ER by the enzyme dolichyl-phosphate mannosyltransferase. The protein encoded by this gene is a subunit of dolichyl-phosphate mannosyltransferase and acts as a stabilizer subunit of the dolichyl-phosphate mannosyltransferase complex.
17944	
17945	Interleukin 1 induces synthesis of acute phase and proinflammatory proteins during infection, tissue damage, or stress, by forming a complex at the cell membrane with an interleukin 1 receptor and an accessory protein. This gene encodes an interleukin 1 receptor accessory protein. Alternative splicing of this gene results in two transcript variants encoding two different isoforms, one membrane-bound and one soluble.
17946	
17947	Huntingtin is a disease gene linked to Huntington@#$%&s disease, a neurodegenerative disorder characterized by loss of striatal neurons.  This is thought to be caused by an expanded,  unstable trinucleotide repeat in the huntingtin gene, which translates as a polyglutamine repeat in the protein product.  The huntingtin locus is large, spanning 180 kb and consisting of 67 exons. The huntingtin gene is widely expressed and is required for normal development.  It is expressed as 2 alternatively polyadenylated forms displaying different relative abundance in various fetal and adult tissues.  The larger transcript is approximately 13.7 kb and is expressed predominantly in adult and fetal brain whereas the smaller transcript of approximately 10.3 kb is more widely expressed.  The genetic defect leading to Huntington@#$%&s disease may not necessarily eliminate transcription, but may confer a new property on the mRNA or alter the function of the protein. One candidate is the huntingtin-associated protein-1, highly expressed in brain, which has increased affinity for huntingtin protein with expanded polyglutamine repeats.
17948	This gene is a member of the stromal interaction molecule (STIM) family and likely arose, with related family member STIM1, from a common ancestral gene. It encodes a type 1 transmembrane protein whose function is currently unknown, but is a likely adhesion molecule with a role in early hematopoiesis. Alternative translation initiation from an AUG and a non-AUG (UUG) start site results in the production of two different isoforms.
17949	
17950	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members, but contains a SH3 domain. The specific function of this protein has not been determined.
17951	This gene encodes the homolog of the Drosophila melanogaster Nipped-B gene product and fungal Scc2-type sister chromatid cohesion proteins. The Drosophila protein facilitates enhancer-promoter communication of remote enhancers and plays a role in developmental regulation. It is also homologous to a family of chromosomal adherins with broad roles in sister chromatid cohesion, chromosome condensation, and DNA repair. The human protein has a bipartite nuclear targeting sequence and a putative HEAT repeat. Condensins, cohesins and other complexes with chromosome-related functions also contain HEAT repeats. Mutations in this gene result in Cornelia de Lange syndrome.
17952	
17953	
17954	CREB3L2 is a member of the old astrocyte specifically induced substance (OASIS) DNA binding and basic leucine zipper dimerization (bZIP) family of transcription factors, which includes CREB3 (MIM 606443) and CREB4 (MIM 607138).[supplied by OMIM]
17955	
17956	
17957	
17958	
17959	This gene is thought to play an important role in calcium homeostasis. Alternative splicing of this gene results in five transcript variants which vary in protein translation, the coding of catalytic domains, and tissue expression. Variation among these transcripts impacts their functions which involve roles in the calcium storage and release process in the endoplasmic and sarcoplasmic reticulum as well as hydroxylation of aspartic acid and asparagine in epidermal growth factor-like domains of various proteins.
17960	
17961	
17962	Prefoldin is a heterohexameric chaperone protein which assists in the correct folding of other proteins. It binds specifically to cytosolic chaperonin and transfers target proteins. Prefoldin may function by selectively targeting nascent actin and tubulin chains pending their transfer to cytosolic chaperonin for final folding and/or assembly. It promotes folding in an environment in which there are many competing pathways for nonnative proteins. This gene encodes a subunit of prefoldin, prefoldin 5. This subunit is associated with c-myc and is thought to repress the transcriptional activity of c-myc.
17963	
17964	
17965	
17966	
17967	
17968	
17969	
17970	
17971	
17972	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Like most members, the encoded protein contains an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain. Two transcript variants encoding the same isoform have been identified.
17973	
17974	
17975	Transforming growth factor-beta, or TGFB (see TGFB1; MIM 190180), remains biologically latent after secretion, blocked from activity by its N-terminal propeptide (the latency-associated peptide, or LAP). TGFB is activated by its dissociation from LAP. The latent form of TGFB is secreted in association with a latent TGFB-binding protein, or LTBP (see LTBP1; MIM 150390), and assembled into the extracellular matrix. LTBPs are mainly composed of epidermal growth factor (EGF; MIM 131530)-like repeats and protein domains with 8-cysteine repeats, both of which are also found in fibrillins (e.g., MIM 134797). See Oklu and Hesketh (2000) for a review of the LTBP gene family.[supplied by OMIM]
17976	When Chinese hamster K12 cells are starved of glucose, the synthesis of several proteins, called glucose-regulated proteins (GRPs), is markedly increased. Hendershot et al. (1994) pointed out that one of these, GRP78 (HSPA5), also referred to as @#$%&immunoglobulin heavy chain-binding protein@#$%& (BiP), is a member of the heat-shock protein-70 (HSP70) family and is involved in the folding and assembly of proteins in the endoplasmic reticulum (ER). Because so many ER proteins interact transiently with GRP78, it may play a key role in monitoring protein transport through the cell.[supplied by OMIM]
17977	
17978	
17979	
17980	
17981	
17982	This gene encodes a member of the amelogenin family of extracellular matrix proteins. Amelogenins are involved in biomineralization during tooth enamel development. Mutations in a related gene on chromosome X cause X-linked amelogenesis imperfecta.
17983	
17984	
17985	
17986	Dual specificity protein phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the MAPK superfamily (MAPK/ERK, SAPK/JNK, p38), which is associated with cellular proliferation and differentiation. Different members of this family of dual specificity phosphatases show distinct substrate specificities for MAPKs, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. This gene product binds to and inactivates p38 and SAPK/JNK, but not MAPK/ERK. Its subcellular localization is unique; it is evenly distributed in both the cytoplasm and the nucleus. This gene is widely expressed in various tissues and organs, and its expression is elevated by stress stimuli. Three transcript variants encoding two different isoforms have been found for this gene.
17987	Autophagy is the process by which endogenous proteins and damaged organelles are destroyed intracellularly. Autophagy is postulated to be essential for cell homeostasis and cell remodeling during differentiation, metamorphosis, non-apoptotic cell death, and aging. Reduced levels of autophagy have been described in some malignant tumors, and a role for autophagy in controlling the unregulated cell growth linked to cancer has been proposed. This gene encodes a member of the autophagin protein family. The encoded protein is also designated as a member of the C-54 family of cysteine proteases. Two transcript variants that encode different isoforms were identified through sequence analysis.
17988	
17989	The protein encoded by this gene is a member of the flamingo subfamily, part of the cadherin superfamily. The flamingo subfamily consists of nonclassic-type cadherins; a subpopulation that does not interact with catenins. The flamingo cadherins are located at the plasma membrane and have nine cadherin domains, seven epidermal growth factor-like repeats and two laminin A G-type repeats in their ectodomain. They also have seven transmembrane domains, a characteristic unique to this subfamily. It is postulated that these proteins are receptors involved in contact-mediated communication, with cadherin domains acting as homophilic binding regions and the EGF-like domains involved in cell adhesion and receptor-ligand interactions. The specific function of this particular member has not been determined.
17990	
17991	
17992	
17993	
17994	Potassium channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. Four sequence-related potassium channel genes - shaker, shaw, shab, and shal - have been identified in Drosophila, and each has been shown to have human homolog(s). This gene encodes a member of the potassium channel, voltage-gated, shaker-related subfamily. This member includes three distinct isoforms which are encoded by three alternatively spliced transcript variants of this gene. These three isoforms are beta subunits, which form heteromultimeric complex with alpha subunits and modulate the activity of the pore-forming alpha subunits.
17995	This gene encodes a protein which may function in signal transduction pathways and whose expression is elevated in germinal cell lymphomas. It contains a putative PDZ-interacting domain, an immunoreceptor tyrosine-based activation motif (ITAM), and two putative SH2 binding sites. In B cells, its expression is specifically induced by interleukin-4. Alternative splicing results in multiple transcript variants encoding different isoforms.
17996	
17997	
17998	
17999	
18000	
18001	This gene encodes the phosphatase 2A catalytic subunit. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. This gene encodes an alpha isoform of the catalytic subunit.
18002	
18003	
18004	
18005	
18006	
18007	
18008	The protein encoded by this gene is similar to Saccharomyces cerevisiae LYS5, which is required for the activation of the alpha-aminoadipate dehydrogenase in the biosynthetic pathway of lysine. Yeast alpha-aminoadipate dehydrogenase converts alpha-biosynthetic-aminoadipate semialdehyde to alpha-aminoadipate. It has been suggested that defects in the human gene result in pipecolic acidemia.
18009	
18010	
18011	
18012	
18013	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development,reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. The function of the protein encoded by this gene has not been determined. This gene was previously referred to as MMP18 but has been renamed matrix metalloproteinase 19 (MMP19). Multiple transcript variants encoding distict isoforms have been identified for this gene.
18014	
18015	
18016	
18017	
18018	
18019	This gene encodes a WD repeat-containing protein. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
18020	The protein encoded by this gene is highly conserved in human, mouse, and chicken, showing 94% and 79% amino acid identity of human to mouse and chicken sequences, respectively. Hybridization to this gene was detected in spindle-shaped cells located along nerve fibers between the auditory ganglion and sensory epithelium. These cells accompany neurites at the habenula perforata, the opening through which neurites extend to innervate hair cells. This and the pattern of expression of this gene in chicken inner ear paralleled the histologic findings of acidophilic deposits, consistent with mucopolysaccharide ground substance, in temporal bones from DFNA9 (autosomal dominant nonsyndromic sensorineural deafness 9) patients. Mutations that cause DFNA9 have been reported in this gene.
18021	The protein encoded by this gene is highly conserved in human, mouse, and chicken, showing 94% and 79% amino acid identity of human to mouse and chicken sequences, respectively. Hybridization to this gene was detected in spindle-shaped cells located along nerve fibers between the auditory ganglion and sensory epithelium. These cells accompany neurites at the habenula perforata, the opening through which neurites extend to innervate hair cells. This and the pattern of expression of this gene in chicken inner ear paralleled the histologic findings of acidophilic deposits, consistent with mucopolysaccharide ground substance, in temporal bones from DFNA9 (autosomal dominant nonsyndromic sensorineural deafness 9) patients. Mutations that cause DFNA9 have been reported in this gene.
18022	
18023	
18024	
18025	
18026	
18027	
18028	The protein encoded by this gene is a scavenger receptor that is expressed in endothelial cells and mediates the binding and degradation of acetylated low density lipoprotein (Ac-LDL). Transcription factors ZNF444/EZF-2 and SP1 are found to be responsible for the regulation of this gene. Five alternatively spliced transcript variants encoding distinct isoforms have been described.
18029	
18030	
18031	
18032	
18033	NSF and SNAPs are general elements of the cellular membrane transport apparatus. The sequence of the predicted 312-amino acid human protein encoded by NAPG is 95% identical to that of bovine gamma-SNAP. NAPG mediates platelet exocytosis and controls the membrane fusion events of this process.
18034	
18035	
18036	
18037	
18038	
18039	
18040	
18041	
18042	
18043	Oxygenation levels play an important role in the regulation of cellular invasiveness which occurs during early implantation when the trophoblast cells invade the uterus as well as during tumour progression and metastasis. This gene, which is regulated by oxygen tension, is induced in hypoxic trophoblast cells and is overexpressed in ovarian cancer. Two protein isoforms are encoded by transcript variants of this gene.
18044	An acetyl-CoA synthetase found in the mitochondrial matrix has been characterized in both cow and mouse. It is primarily a cardiac enzyme which produces acetyl-CoA mainly for the oxidation of acetate. The protein encoded by this gene is structurally similar to the cow and mouse proteins and therefore may serve a similar function in humans.
18045	
18046	This gene is a member of the MAGEA gene family. The members of this family encode proteins with 50 to 80% sequence identity to each other. The promoters and first exons of the MAGEA genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. The MAGEA genes are clustered at chromosomal location Xq28. They have been implicated in some hereditary disorders, such as dyskeratosis congenita. At least four variants encoding the same protein have been found for this gene.
18047	
18048	
18049	
18050	
18051	
18052	
18053	
18054	
18055	
18056	
18057	
18058	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been determined.
18059	
18060	ANAPC1 is 1 of at least 10 subunits of the anaphase-promoting complex (APC), which functions at the metaphase-to-anaphase transition of the cell cycle and is regulated by spindle checkpoint proteins. The APC is an E3 ubiquitin ligase that targets cell cycle regulatory proteins for degradation by the proteasome, thereby allowing progression through the cell cycle.[supplied by OMIM]
18061	
18062	Moesin (for membrane-organizing extension spike protein) is a member of the ERM family which includes ezrin and radixin. ERM proteins appear to function as cross-linkers between plasma membranes and actin-based cytoskeletons. Moesin is localized to filopodia and other membranous protrusions that are important for cell-cell recognition and signaling and for cell movement.
18063	
18064	
18065	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
18066	
18067	
18068	
18069	
18070	
18071	
18072	
18073	
18074	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
18075	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
18076	
18077	
18078	
18079	
18080	This gene is a member of the cadherin superfamily, genes encoding calcium dependent cell-cell adhesion glycoproteins. The protein encoded by this gene is a large, single-pass transmembrane protein composed of an extracellular domain containing 27 repeats that show significant homology to the cadherin ectodomain. Expressed in the neurosensory epithelium, the protein is thought to be involved in stereocilia organization and hair bundle formation. The gene is located in a region containing the human deafness loci DFNB12 and USH1D. Usher syndrome 1D and nonsyndromic autosomal recessive deafness DFNB12 are caused by allelic mutations of this novel cadherin-like gene. Two alternative splice variants have been identified, encoding different isoforms. Additional variants have been observed but their full-length nature has not been determined.
18081	
18082	
18083	
18084	
18085	
18086	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. The mRNA encodes a cadherin-related neuronal receptor that localizes to synaptic junctions and is putatively involved in specific neuronal connections and signal transduction. Sharing a characteristic with other protocadherin genes, this gene has a notably large exon that encodes 6 cadherin domains and a transmembrane region. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
18087	
18088	
18089	
18090	
18091	
18092	
18093	
18094	
18095	High density lipoprotein-binding protein, also known as vigilin, is a 110-kD protein that specifically binds HDL molecules and may function in the removal of excess cellular cholesterol.[supplied by OMIM]
18096	
18097	
18098	
18099	Thyrotroph embryonic factor (TEF), a transcription factor, is a member of the PAR (proline and acidic amino acid-rich) subfamily of basic region/leucine zipper (bZIP) transcription factors.  It is expressed in a broad range of cells and tissues in adult animals, however, during embryonic development, TEF expression appears to be restricted to the developing anterior pituitary gland, coincident with the appearance of thyroid-stimulating hormone, beta (TSHB).  Indeed, TEF can bind to, and transactivate the TSHB promoter.  It shows homology (in the functional domains) with other members of the PAR-bZIP subfamily of transcription factors, which include albumin D box-binding protein (DBP), human hepatic leukemia factor (HLF) and chicken vitellogenin gene-binding protein (VBP); VBP is considered the chicken homologue of TEF.  Different members of the subfamily can readily form heterodimers, and share DNA-binding, and transcriptional regulatory properties.
18100	
18101	
18102	
18103	
18104	Huntington@#$%&s disease (HD), a neurodegenerative disorder characterized by loss of striatal neurons, is caused by an expansion of a polyglutamine tract in the HD protein huntingtin. This gene encodes a protein belonging to a class of huntingtin interacting proteins characterized by WW motifs. Two transcripts encoding different isoforms have been completely described.
18105	
18106	This gene is a member of the cyclophilin family of peptidylprolyl isomerases. The cyclophilins are a highly conserved family, members of which play an important role in protein folding, immunosuppression by cyclosporin A, and infection of HIV-1 virions.
18107	This gene encodes a member of the homer family of dendritic proteins. Members of this family regulate group 1 metabotrophic glutamate receptor function. The encoded protein may be involved in cell growth. Four transcript variants encoding distinct isoforms have been identified for this gene.
18108	
18109	
18110	This gene encodes a death agonist that heterodimerizes with either agonist BAX or antagonist BCL2. The encoded protein is a member of the BCL-2 family of cell death regulators. It is a mediator of mitochondrial damage induced by caspase-8 (CASP8); CASP8 cleaves this encoded protein, and the COOH-terminal part translocates to mitochondria where it triggers cytochrome c release. Multiple alternatively spliced transcript variants have been found, but the full-length nature of some variants has not been defined.
18111	
18112	Vesicle mediated protein sorting plays an important role in segregation of intracellular molecules into distinct organelles. Genetic studies in yeast have identified more than 40 vacuolar protein sorting (VPS) genes involved in vesicle transport to vacuoles. This gene encodes the human homolog of yeast class C Vps18 protein. The mammalian class C Vps proteins are predominantly associated with late endosomes/lysosomes, and like their yeast counterparts, may mediate vesicle trafficking steps in the endosome/lysosome pathway.
18113	
18114	
18115	
18116	
18117	
18118	
18119	Myosin is a hexameric ATPase cellular motor protein. It is composed of two myosin heavy chains, two nonphosphorylatable myosin alkali light chains, and two phosphorylatable myosin regulatory light chains. This gene encodes a myosin alkali light chain that is found in embryonic muscle and adult atria. Two alternatively spliced transcript variants encoding the same protein have been found for this gene.
18120	
18121	
18122	
18123	
18124	
18125	The protein encoded by this gene belongs to a small class of the protein tyrosine phosphatase (PTP) family. PTPs are cell signaling molecules that play regulatory roles in a variety of cellular processes. PTPs in this class contain a protein tyrosine phosphatase catalytic domain and a characteristic C-terminal prenylation motif. This PTP has been shown to primarily associate with plasmic and endosomal membrane through its C-terminal prenylation. This PTP was found to interact with the beta-subunit of Rab geranylgeranyltransferase II (beta GGT II), and thus may function as a regulator of GGT II activity. Overexpression of this gene in mammalian cells conferred a transformed phenotype, which suggested its role in tumorigenesis. Three alternatively spliced transcript variants, which encode two distinct isoforms, have been described.
18126	
18127	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L34E family of ribosomal proteins. It is located in the cytoplasm. This gene originally was thought to be located at 17q21, but it has been mapped to 4q. Transcript variants derived from alternative splicing, alternative transcription initiation sites, and/or alternative polyadenylation exist; these variants encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
18128	
18129	
18130	
18131	
18132	
18133	
18134	Short/branched chain acyl-CoA dehydrogenase(ACADSB) is a member of the acyl-CoA dehydrogenase family of enzymes that catalyze the dehydrogenation of acyl-CoA derivatives in the metabolism of fatty acids or branch chained amino acids. Substrate specificity is the primary characteristic used to define members of this gene family. The ACADSB gene product has the greatest activity towards the short branched chain acyl-CoA derivative, (S)-2-methylbutyryl-CoA, but also reacts significantly with other 2-methyl branched chain substrates and with short straight chain acyl-CoAs. The cDNA encodes for a mitochondrial precursor protein which is cleaved upon mitochondrial import and predicted to yield a mature peptide of approximately 43.7-KDa.
18135	
18136	
18137	
18138	The protein encoded by this gene belongs to the calcium channel beta subunit family. It plays an important role in the calcium channel by modulating G protein inhibition, increasing peak calcium current, controlling the alpha-1 subunit membrane targeting and shifting the voltage dependence of activation and inactivation. Alternative splicing occurs at this locus and three transcript variants encoding three distinct isoforms have been identified.
18139	
18140	
18141	
18142	The protein encoded by this gene is a member of the RAS superfamily which are small GTP/GDP-binding proteins with an average size of 200 amino acids. The RAS-related proteins of the RAB/YPT family may play a role in the transport of proteins from the endoplasmic reticulum to the Golgi and the plasma membrane. This protein shares 97%, 96%, and 51% similarity with the dog RAB8, mouse MEL, and mouse YPT1 proteins, respectively and contains the 4 GTP/GDP-binding sites that are present in all the RAS proteins. The putative effector-binding site of this protein is similar to that of the RAB/YPT proteins. However, this protein contains a C-terminal CAAX motif that is characteristic of many RAS superfamily members but which is not found in YPT1 and the majority of RAB proteins. Although this gene was isolated as a transforming gene from a melanoma cell line, no linkage between MEL and malignant melanoma has been demonstrable. This oncogene is located 800 kb distal to MY09B on chromosome 19p13.1.
18143	This gene encodes a secreted inhibitor which protects epithelial tissues from serine proteases.  It is found in various secretions including seminal plasma, cervical mucus, and bronchial secretions, and has affinity for trypsin, leukocyte elastase, and cathepsin G.  Its inhibitory effect contributes to the immune response by protecting epithelial surfaces from attack by endogenous proteolytic enzymes; the protein is also thought to have broad-spectrum anti-biotic activity.
18144	The product of this gene is a member of the CDC42-binding protein family. Members of this family interact with Rho family GTPases and regulate the organization of the actin cytoskeleton. This protein has been shown to bind both CDC42 and TC10 GTPases in a GTP-dependent manner. When overexpressed in fibroblasts, this protein was able to induce pseudopodia formation, which suggested a role in inducing actin filament assembly and cell shape control.
18145	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein contains 2 bromodomains and multiple WD repeats, and the function of this protein is not known. This gene is located within the Down syndrome region-2 on chromosome 21. Alternative splicing of this gene generates 3 transcript variants diverging at the 3@#$%& ends.
18146	
18147	
18148	
18149	Although the protein encoded by this gene is similar to N2,N2-dimethylguanosine tRNA methyltransferase from other organisms, the true function of the encoded protein is not known.
18150	ADP-ribosylation factor 1 (ARF1) is a member of the human ARF gene family. The family members encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking as activators of phospholipase D. The gene products, including 6 ARF proteins and 11 ARF-like proteins, constitute a family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2 and ARF3), class II (ARF4 and ARF5) and class III (ARF6), and members of each class share a common gene organization. The ARF1 protein is localized to the Golgi apparatus and has a central role in intra-Golgi transport. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene.
18151	
18152	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and hydroxylates medium-chain fatty acids such as laurate and myristate.
18153	
18154	
18155	
18156	
18157	Myosin is a hexameric ATPase cellular motor protein. It is composed of two heavy chains, two nonphosphorylatable alkali light chains, and two phosphorylatable regulatory light chains. This gene encodes a myosin alkali light chain expressed in fast skeletal muscle. Two transcript variants have been identified for this gene.
18158	
18159	
18160	
18161	The Golgi complex plays a key role in the sorting and modification of proteins exported from the endoplasmic reticulum. The protein encoded by this gene is a type II Golgi-resident protein. It may process proteins synthesized in the rough endoplasmic reticulum and assist in the transport of protein cargo through the Golgi apparatus.
18162	
18163	
18164	
18165	SFRS12 belongs to the superfamily of serine/arginine-rich (SR) splicing factors. It modulates splice site selection by regulating the activities of other SR proteins (Barnard et al., 2002).[supplied by OMIM]
18166	
18167	Mutations linked to neurofibromatosis type 1 led to the identification of NF1. NF1 encodes the protein neurofibromin, which appears to be a negative regulator of the ras signal transduction pathway. In addition to type 1 neurofibromatosis, mutations in NF1 can also lead to juvenile myelomonocytic leukemia. Alternatively spliced NF1 mRNA transcripts have been isolated, although their functions, if any, remain unclear.
18168	This gene is a member of the protocadherin alpha gene cluster, one of three related gene clusters tandemly linked on chromosome five that demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The alpha gene cluster is composed of 15 cadherin superfamily genes related to the mouse CNR genes and consists of 13 highly similar and 2 more distantly related coding sequences. The tandem array of 15 N-terminal exons, or variable exons, are followed by downstream C-terminal exons, or constant exons, which are shared by all genes in the cluster. The large, uninterrupted N-terminal exons each encode six cadherin ectodomains while the C-terminal exons encode the cytoplasmic domain. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins that most likely play a critical role in the establishment and function of specific cell-cell connections in the brain. Alternative splicing has been observed and additional variants have been suggested but their full-length nature has yet to be determined.
18169	
18170	
18171	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. The protein encoded by this gene and SEC62 protein are found to be associated with ribosome-free SEC61 complex. It is speculated that Sec61-Sec62-Sec63 may perform post-translational protein translocation into the ER. The Sec61-Sec62-Sec63 complex might also perform the backward transport of ER proteins that are subject to the ubiquitin-proteasome-dependent degradation pathway. The encoded protein is an integral membrane protein located in the rough ER.
18172	Werner@#$%&s syndrome is a rare autosomal recessive disorder characterized by premature aging. The protein encoded by this gene interacts with the N-terminal portion of Werner protein containing the exonuclease domain. This protein shows homology to replication factor C family proteins, and is conserved from E. coli to human. Studies in yeast suggest that this gene may influence the aging process. Two transcript variants encoding different isoforms have been isolated for this gene.
18173	
18174	The product of this gene belongs to the small GTPase superfamily, Ras family of proteins. GTP-binding proteins mediate the transmembrane signaling initiated by the occupancy of certain cell surface receptors. This gene encodes a low molecular mass ras-like GTP-binding protein that shares about 50% similarity with other ras proteins.
18175	The protein encoded by this gene is a methyltransferase that converts guanidoacetate to creatine, using S-adenosylmethionine as the methyl donor. Defects in this gene have been implicated in neurologic syndromes and muscular hypotonia, probably due to creatine deficiency and accumulation of guanidinoacetate in the brain of affected individuals. Two transcript variants encoding different isoforms have been described for this gene.
18176	
18177	
18178	This gene encodes a telomere specific protein which is a component of the telomere nucleoprotein complex. This protein is present at telomeres throughout the cell cycle and functions as an inhibitor of telomerase, acting in cis to limit the elongation of individual chromosome ends. The protein structure contains a C-terminal Myb motif, a dimerization domain near its N-terminus and an acidic N-terminus. Two transcripts of this gene are alternatively spliced products.
18179	The protein encoded by this gene is a member of the TNF-receptor superfamily. This protein is an osteoblast-secreted decoy receptor that functions as a negative regulator of bone resorption. This protein specifically binds to its ligand, osteoprotegerin ligand (TNFSF11/OPGL), both of which are key extracellular regulators of osteoclast development. Studies of the mouse counterpart also suggest this protein and its ligand play a role in lymph-node organogenesis and vascular calcification. Alternatively spliced transcript variants of this gene have been reported, but their full length nature has not been determined.
18180	
18181	
18182	
18183	
18184	
18185	This gene encodes a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, namely a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region.
18186	
18187	
18188	
18189	This gene encodes a protein that contains a ubiquitin associated domain at the N-terminus, an SH3 domain, and a C-terminal domain with similarities to the catalytic motif of phosphoglycerate mutase. The encoded protein was found to inhibit endocytosis of epidermal growth factor receptor (EGFR) and platelet-derived growth factor receptor.
18190	
18191	
18192	The protein encoded by this gene is a member of the fatty acid desaturase (FADS) gene family. Desaturase enzymes regulate unsaturation of fatty acids through the introduction of double bonds between defined carbons of the fatty acyl chain. FADS family members are considered fusion products composed of an N-terminal cytochrome b5-like domain and a C-terminal multiple membrane-spanning desaturase portion, both of which are characterized by conserved histidine motifs. This gene is clustered with family members FADS1 and FADS2 at 11q12-q13.1; this cluster is thought to have arisen evolutionarily from gene duplication based on its similar exon/intron organization.
18193	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance throughout the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin forms a complex with and functions as a regulatory subunit of CDK4 or CDK6, whose activity is required for cell cycle G1/S transition. This protein has been shown to interact with tumor suppressor protein Rb and the expression of this gene is regulated positively by Rb. Mutations, amplification and overexpression of this gene, which alters cell cycle progression, are observed frequently in a variety of tumors and may contribute to tumorigenesis.
18194	The product of this gene is one of the minor apolipoproteins found in plasma. This protein forms complexes with lipoproteins and may be involved in transport and/or esterification of cholesterol.
18195	Natural killer (NK) cells are lymphocytes that can mediate lysis of certain tumor cells and virus-infected cells without previous activation. They can also regulate specific humoral and  cell-mediated immunity. NK cells preferentially express several calcium-dependent (C-type) lectins, which have been implicated in the regulation of NK cell function. D12S2489E is a member of the NKG2 group which are expressed primarily in natural killer (NK) cells and encodes a family of transmembrane proteins characterized by a type II membrane orientation (extracellular C terminus) and the presence of a C-type lectin domain. The NKG2 gene family is located within the NK complex, a region that contains several C-type lectin genes preferentially expressed on NK cells. The first non-coding exon at the 5@#$%& end of the D12S2489E transcript is included in the 3@#$%& end of the KLRC4 transcript.
18196	
18197	
18198	In Drosophila, the Additional sex combs (Asx) gene encodes a chromatin-binding protein required for normal determination of segment identity in the developing embryo. The protein is a member of the Polycomb group of proteins, which are necessary for the maintenance of stable repression of homeotic and other loci. The protein is thought to disrupt chromatin in localized areas, enhancing transcription of certain genes while repressing the transcription of other genes. Although the function of the protein encoded by this gene is not known, it does show some sequence similarity to the protein encoded by the Drosophila Asx gene.
18199	
18200	
18201	The protein encoded by this gene contains a structural module related to that of the seven transmembrane domain G protein-coupled receptor superfamily. This protein has sequence and structural similarities to the beta-amyloid binding protein (BBP), but, unlike BBP, it does not regulate a response to beta-amyloid peptide. This protein may have regulatory roles in cell death or proliferation signal cascades. Several alternatively spliced transcript variants of this gene are described but the full length nature of some variants has not been determined. Multiple polyadenylation sites have been found in this gene.
18202	
18203	
18204	GPT (MIM 138200) and GPT2 (EC 2.6.1.2), also known as alanine transaminases, are pyridoxal enzymes that catalyze the reversible transamination between alanine and 2-oxoglutarate to form pyruvate and glutamate. By mediating the conversion of these 4 major intermediate metabolites, these transaminases have roles in gluconeogenesis and in amino acid metabolism.[supplied by OMIM]
18205	The protein encoded by this gene shares significant homology to the adenomatous polyposis coli (APC) protein-binding EB1 gene family. The function of this protein is unknown; however, its homology suggests involvement in tumorigenesis of colorectal cancers and proliferative control of normal cells. This gene may belong to the intermediate/early gene family, involved in the signal transduction cascade downstream of the TCR.
18206	This gene encodes a protein that is involved in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is a glycolipid found on many blood cells and serves to anchor proteins to the cell surface. This protein is expressed in the endoplasmic reticulum and transfers phosphoethanolamine (EtNP) to the first mannose of the GPI anchor. Two alternatively spliced variants, which encode an identical isoform, have been reported.
18207	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. This channel protein, considered an open rectifier, is widely expressed. It is stimulated by arachidonic acid, and inhibited by internal acidification and volatile anaesthetics.
18208	
18209	
18210	
18211	This reference sequence was derived from AF161462.1 and ESTs; validated by multiple replicate ESTs and human genomic sequence. This gene encodes a member of a family of small membrane proteins that share a 35-amino acid signature sequence domain, beginning with the sequence PFXYD and containing 7 invariant and 6 highly conserved amino acids. The approved human gene nomenclature for the family is FXYD-domain containing ion transport regulator. Mouse FXYD5 has been termed RIC (Related to Ion Channel). FXYD2, also known as the gamma subunit of the Na,K-ATPase, regulates the properties of that enzyme. FXYD1 (phospholemman), FXYD2 (gamma), FXYD3 (MAT-8), FXYD4 (CHIF), and FXYD5 (RIC) have been shown to induce channel activity in experimental expression systems. Transmembrane topology has been established for two family members (FXYD1 and FXYD2), with the N-terminus extracellular and the C-terminus on the cytoplasmic side of the membrane. This gene product, FXYD5, has not been characterized as a protein. Two transcript variants have been found for this gene, and they are both predicted to encode the same protein. [RefSeq curation by Kathleen J. Sweadner, Ph.D., sweadner@helix.mgh.harvard.edu.]
18212	
18213	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein is located only in the cytoplasm. When translation ends, it interacts with the protein that is a functional homolog of yeast Upf2p to trigger mRNA decapping. Use of multiple polyadenylation sites has been noted for this gene.
18214	
18215	Junctional complexes between the plasma membrane and endoplasmic/sarcoplasmic reticulum are a common feature of all excitable cell types and mediate cross talk between cell surface and intracellular ion channels. The protein encoded by this gene is a component of junctional complexes and is composed of a C-terminal hydrophobic segment spanning the endoplasmic/sarcoplasmic reticulum membrane and a remaining cytoplasmic domain that shows specific affinity for the plasma membrane. This gene is a member of the junctophilin gene family. Alternative splicing has been observed at this locus and two variants encoding distinct isoforms are described.
18216	
18217	
18218	Similarity to a transcriptional repressor suggests that this gene@#$%&s protein product is involved in the regulation of transcription. Alternative splicing of this gene results in four transcript variants encoding different isoforms. Additional transcript variants have been described, but their full length sequences have not been determined.
18219	
18220	
18221	
18222	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
18223	
18224	
18225	
18226	
18227	
18228	
18229	
18230	
18231	The protein encoded by this gene is a GTPase which belongs to the RAS superfamily of small GTP-binding proteins. Members of this superfamily appear to regulate a diverse array of cellular events, including the control of cell growth, cytoskeletal reorganization, and the activation of protein kinases. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
18232	
18233	
18234	NPTX1 is a member of the neuronal pentraxin gene family. Neuronal pentraxin 1 is similar to the rat NP1 gene which encodes a binding protein for the snake venom toxin taipoxin. Human NPTX1 mRNA is exclusively localized to the nervous system.
18235	
18236	
18237	
18238	
18239	
18240	
18241	
18242	
18243	
18244	
18245	
18246	
18247	
18248	
18249	
18250	
18251	
18252	
18253	
18254	
18255	
18256	
18257	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
18258	
18259	
18260	
18261	
18262	
18263	
18264	
18265	
18266	
18267	
18268	
18269	
18270	The protein encoded by this gene shares strong similarity with Saccharomyces cerevisiae protein Cdc27, and the gene product of Schizosaccharomyces pombe nuc 2. This protein is a component of anaphase-promoting complex (APC), which is composed of eight protein subunits and highly conserved in eucaryotic cells. APC catalyzes the formation of cyclin B-ubiquitin conjugate that is responsible for the ubiquitin-mediated proteolysis of B-type cyclins. This protein and 3 other members of the APC complex contain the TPR (tetratricopeptide repeat), a protein domain important for protein-protein interaction. This protein was shown to interact with mitotic checkpoint proteins including Mad2, p55CDC and BUBR1, and thus may be involved in controlling the timing of mitosis.
18271	
18272	
18273	
18274	This gene encodes a protein with ankyrin repeats, which are associated with protein-protein interactions. Studies in mice suggest that this protein is involved in liver development. Two transcript variants encoding different isoforms have been found for this gene.
18275	
18276	
18277	
18278	
18279	
18280	
18281	
18282	
18283	Myosin light chain phosphatase (MLCP) consists of three subunits- catalytic subunit, large subunit/myosin binding subunit (MBS) and small subunit (sm-M20). This gene is a multi-functional gene which encodes both MBS and sm-M20. MLCP regulates myosins and the dephosphorylation is enhanced by the presence of MBS. The sm-M20 is suggested to play a regulatory role in muscle contraction by binding to MBS. MBS is also encoded by another gene, myosin light chain phosphatase target subunit 1. sm-M20 shows higher binding affinity to this gene product than to myosin light chain phosphatase target subunit 2-MBS even though the two MBS proteins are highly similar. Although both MBSs increase the activity of MLCP, myosin light chain phosphatase target subunit 1-MBS is a more efficient activator. There are four alternatively spliced transcript variants described; two alter the MBS coding region and two alter the sm-M20 coding region of this gene.
18284	This gene encodes a member of the SLC2A facilitative glucose transporter family. Members of this family play a significant role in maintaining glucose homeostasis. The encoded protein may play a role in the development and survival of chondrocytes in cartilage matrices. Two transcript variants encoding distinct isoforms have been identified for this gene.
18285	
18286	
18287	
18288	
18289	
18290	
18291	
18292	
18293	This gene encodes an enzyme that plays a major role in polyamine metabolism and is important for the salvage of both adenine and methionine. The encoded enzyme is deficient in many cancers because this gene and the tumor suppressor p16 gene are co-deleted. Multiple alternatively spliced transcript variants have been described for this gene, but their full-length natures remain unknown.
18294	This gene encodes a protein which is similar to the Drosophila crumbs protein and localizes to the inner segment of mammalian photoreceptors. In Drosophila crumbs localizes to the stalk of the fly photoreceptor and may be a component of the molecular scaffold that controls proper development of polarity in the eye. Mutations in this gene are associated with a severe form of retinitis pigmentosa, RP12, and with Leber congenital amaurosis. Two alternative transcripts encoding different isoforms have been described.
18295	
18296	This gene encodes a member of a family of small membrane proteins that share a 35-amino acid signature sequence domain, beginning with the sequence PFXYD and containing 7 invariant and 6 highly conserved amino acids. The approved human gene nomenclature for the family is FXYD-domain containing ion transport regulator. Mouse FXYD5 has been termed RIC (Related to Ion Channel). FXYD2, also known as the gamma subunit of the Na,K-ATPase, regulates the properties of that enzyme. FXYD1 (phospholemman), FXYD2 (gamma), FXYD3 (MAT-8), FXYD4 (CHIF), and FXYD5 (RIC) have been shown to induce channel activity in experimental expression systems. Transmembrane topology has been established for two family members (FXYD1 and FXYD2), with the N-terminus extracellular and the C-terminus on the cytoplasmic side of the membrane. The Type III integral membrane protein encoded by this gene is the gamma subunit of the Na,K-ATPase present on the plasma membrane. Although the Na,K-ATPase does not depend on the gamma subunit to be functional, it is thought that the gamma subunit modulates the enzyme@#$%&s activity by inducing ion channel activity. Mutations in this gene have been associated with renal hypomagnesaemia.
18297	
18298	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix for both normal physiological processes, such as embryonic development, reproduction and tissue remodeling, and disease processes, such as asthma and metastasis. This gene encodes a secreted enzyme that degrades casein. Its expression pattern suggests that it plays a role in tissue homeostasis and in wound repair. Transcript variants encoding different isoforms have been described.
18299	
18300	
18301	
18302	Caspase recruitment domain (CARD)-containing proteins, such as CARD8, are involved in pathways leading to activation of caspases or nuclear factor kappa-B (NFKB; see MIM 164011) in the context of apoptosis or inflammation, respectively (Bouchier-Hayes et al., 2001).[supplied by OMIM]
18303	
18304	
18305	HLA-DQA1 belongs to the HLA class II alpha chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DQA) and a beta chain (DQB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B Lymphocytes, dendritic cells, macrophages). The alpha chain is approximately 33-35 kDa. It is encoded by 5 exons; exon 1 encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, and exon 4 encodes the transmembrane domain and the cytoplasmic tail. Within the DQ molecule both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to four different molecules. Typing for these polymorphisms is routinely done for bone marrow transplantation.
18306	This gene encodes a cytoskeletal protein which is concentrated in areas of cell-substratum and cell-cell contacts. This protein plays a significant role in the assembly of actin filaments and in spreading and migration of various cell types, including fibroblasts and osteoclasts. It codistributes with integrins in the cell surface membrane in order to assist in the attachment of adherent cells to extracellular matrices and of lymphocytes to other cells. The N-terminus of this protein contains elements for localization to cell-extracellular matrix junctions. The C-terminus contains binding sites for proteins such as beta-1-integrin, actin, and vinculin.
18307	
18308	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intra-cellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the MDR/TAP subfamily.  Members of the MDR/TAP subfamily are involved in multidrug resistance as well as antigen presentation.  This half-transporter likely plays a role in mitochondrial function.  Localized to 2q26, this gene is considered a candidate gene for lethal neonatal metabolic syndrome, a disorder of mitochondrial function.
18309	
18310	
18311	
18312	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies and its function has not been determined. Four alternatively spliced transcript variants for this gene have been described.
18313	
18314	
18315	Calcitonin is a peptide hormone synthesized by the parafollicular cells of the thyroid. It causes reduction in serum calcium, an effect opposite to that of parathyroid hormone (PTH; MIM 168450).[supplied by OMIM]
18316	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. Overexpression of this gene results in a decrease in the processing of insulin and hepatocyte growth factor receptors to their mature subunits. This decrease is caused by the mislocalization of furin, the endoprotease responsible for cleavage of insulin and hepatocyte growth factor receptors. This protein is involved in endosomal trafficking from the plasma membrane to recycling endosomes or the trans-Golgi network. This gene encodes two transcript variants encoding distinct isoforms.
18317	
18318	
18319	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This phosphoprotein localizes to nuclear bodies where it functions as a transcription factor and tumor suppressor. Its expression is cell-cycle related and it regulates the p53 response to oncogenic signals. The gene is often involved in the translocation with the retinoic acid receptor alpha gene associated with acute promyelocytic leukemia (APL). Extensive alternative splicing of this gene results in several variations of the protein@#$%&s central and C-terminal regions; all variants encode the same N-terminus. Fourteen transcript variants for this gene have been described; however, the full length nature of three of the variants has not been determined.
18320	
18321	ZFP67 is an early growth response gene that encodes a zinc finger-containing transcription factor that binds to the promoter regions of type I collagen genes (e.g., COL1A1; MIM 120150) and has a role in development.[supplied by OMIM]
18322	
18323	The protein encoded by this gene is involved in the import of precursor proteins into mitochondria. The encoded protein has a chaperone-like activity, binding the mature portion of unfolded proteins and aiding their import into mitochondria. This protein, which is found in the cytoplasm and sometimes associated with the outer mitochondrial membrane, has a weak ATPase activity and contains 6 TPR repeats.
18324	The protein encoded by this gene is found in the nucleus and may be involved in the early transcriptional regulation of patterning of the mesoderm. The encoded basic helix-loop-helix protein requires dimerization with another basic helix-loop-helix protein for efficient DNA binding.
18325	The protein encoded by this gene localizes to perinuclear endosomes. This protein associates with peptidylglycine alpha-amidating monooxygenase, and may be involved with the trafficking of this enzyme through secretory or endosomal pathways.
18326	Angiopoietins are members of the vascular endothelial growth factor family and the only known growth factors largely specific for vascular endothelium. Angiopoietin-1, angiopoietin-2, and angiopoietin-4 participate in the formation of blood vessels. The protein encoded by this gene is another member of the angiopoietin family that is widely expressed in adult tissues with mRNA levels highest in highly vascularized tissues. This protein was found to be a secretory protein that does not act as an endothelial cell mitogen in vitro.
18327	Polycomb group (PcG) of proteins form the multiprotein complexes that are important for the transcription repression of various genes involved in development and cell proliferation. The protein encoded by this gene is one of the PcG proteins. It has been shown to interact with, and suppress the activity of, transcription factor CP2 (TFCP2/CP2). Studies of the mouse counterpart suggested the involvement of this gene in the specification of anterior-posterior axis, as well as in cell proliferation in early development. This protein was also found to interact with huntingtin interacting protein 2 (HIP2), an ubiquitin-conjugating enzyme, and possess ubiquitin ligase activity.
18328	
18329	
18330	
18331	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 99% identical to the mouse and rat orthologs. This gene is upregulated in patients with amyotrophic lateral sclerosis. It contains in its 5@#$%& UTR a 6 bp tandem repeat sequence which is polymorphic, however, there is no correlation between the repeat number and the disease.
18332	
18333	
18334	The protein encoded by this gene is a cytokine receptor that belongs to the interleukin 1 receptor family. This protein is a receptor for interleukin alpha (IL1A), interleukin beta (IL1B), and interleukin 1 receptor, type I(IL1R1/IL1RA). It is an important mediator involved in many cytokine induced immune and inflammatory responses. This gene along with interleukin 1 receptor, type II (IL1R2), interleukin 1 receptor-like 2 (IL1RL2), and interleukin 1 receptor-like 1 (IL1RL1) form a cytokine receptor gene cluster in a region mapped to chromosome 2q12.
18335	
18336	
18337	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is found in the asymmetrical unit membrane (AUM) where it can form a complex with other transmembrane 4 superfamily proteins. It may play a role in normal bladder epithelial physiology, possibly in regulating membrane permeability of superficial umbrella cells or in stabilizing the apical membrane through AUM/cytoskeletal interactions. The use of alternate polyadenylation sites has been found for this gene.
18338	
18339	
18340	A lack of MHC-II expression results in a severe immunodeficiency syndrome called MHC-II deficiency, or the bare lymphocyte syndrome (BLS; MIM 209920). At least 4 complementation groups have been identified in B-cell lines established from patients with BLS. The molecular defects in complementation groups B, C, and D all lead to a deficiency in RFX, a nuclear protein complex that binds to the X box of MHC-II promoters. The lack of RFX binding activity in complementation group C results from mutations in the RFX5 gene encoding the 75-kD subunit of RFX (Steimle et al., 1995). RFX5 is the fifth member of the growing family of DNA-binding proteins sharing a novel and highly characteristic DNA-binding domain called the RFX motif.
18341	
18342	
18343	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme is required for post-replicative DNA damage repair. Its protein sequence is 100% identical to the mouse, rat, and rabbit homologs, which indicates that this enzyme is highly conserved in eukaryotic evolution.
18344	
18345	
18346	This gene belongs to the RING finger family, members of which encode proteins characterized by a RING domain, a zinc-binding motif related to the zinc finger domain. The gene product can bind DNA and can act as a transcriptional repressor. It is associated with the multimeric polycomb group protein complex. The gene product interacts with the polycomb group proteins BMI1, EDR1, and CBX4, and colocalizes with these proteins in large nuclear domains. It interacts with the CBX4 protein via its glycine-rich C-terminal domain. The gene maps to the HLA class II region, where it is contiguous with the RING finger genes FABGL and HKE4.
18347	Bidirectional transport of macromolecules between the cytoplasm and nucleus occurs through nuclear pore complexes (NPCs) embedded in the nuclear envelope. NPCs are composed of subcomplexes, and NUP43 is part of one such subcomplex, Nup107-160 (Loiodice et al., 2004).[supplied by OMIM]
18348	
18349	
18350	
18351	
18352	
18353	
18354	
18355	
18356	Tight junction proteins (TJPs) belong to a family of membrane-associated guanylate kinase (MAGUK) homologs that are involved in the organization of epithelial and endothelial intercellular junctions. TJPs bind to the cytoplasmic C termini of junctional transmembrane proteins and link them to the actin cytoskeleton.[supplied by OMIM]
18357	
18358	
18359	
18360	
18361	
18362	The protein encoded by this gene is an isozyme of the long-chain fatty-acid-coenzyme A ligase family. Although differing in substrate specificity, subcellular localization, and tissue distribution, all isozymes of this family convert free long-chain fatty acids into fatty acyl-CoA esters, and thereby play a key role in lipid biosynthesis and fatty acid degradation. This isozyme is highly expressed in uterus and spleen, and in trace amounts in normal brain, but has markedly increased levels in malignant gliomas. This gene functions in mediating fatty acid-induced glioma cell growth. Three transcript variants encoding two different isoforms have been found for this gene.
18363	
18364	
18365	This gene is an ortholog of the C. elegans unc-76 gene, which is necessary for normal axonal bundling and elongation within axon bundles. Expression of this gene in C. elegans unc-76 mutants can restore to the mutants partial locomotion and axonal fasciculation, suggesting that it also functions in axonal outgrowth. The N-terminal half of the gene product is highly acidic. Alternatively spliced transcript variants encoding different isoforms of this protein have been described.
18366	This gene encodes a member of the C-type lectin/C-type lectin-like domain (CTL/CTLD) superfamily. Members of this family share a common protein fold and have diverse functions, such as cell adhesion, cell-cell signalling, glycoprotein turnover, and roles in inflammation and immune response. The encoded type 2 transmembrane protein may function as a cell activation antigen. An alternative splice variant has been described but its full-length sequence has not been determined. This gene is closely linked to other CTL/CTLD superfamily members on chromosome 12p13 in the natural killer gene complex region.
18367	
18368	T-cell activation requires stimulation of the T-cell receptor (TCR; see MIM 186880)-CD3 (see CD3Z; MIM 186780) complex, followed by recruitment of an array of intracellular signaling proteins (e.g., GRB2 (MIM 108355) and PLCG1 (MIM 172420)). Mediating the interaction between the extracellular receptors and intracellular signaling pathways are adaptor proteins such as LAT (MIM 602354), TRIM (MIM 604962), and LNK.[supplied by OMIM]
18369	
18370	
18371	
18372	This gene product is a component of the ribonucleoprotein complex responsible for telomerase activity which catalyzes the addition of new telomeres on the chromosome ends. The telomerase-associated proteins are conserved from ciliates to humans.
18373	This gene encodes a subfamily member of the G-protein couple receptor (GPCR) superfamily. The encoded protein is a high-affinity receptor for lysophosphatidylcholine (LPC), a major phospholipid component of oxidized low density lipoprotein. This protein may react to LPC levels at sites of inflammation to limit the expansion of tissue-infiltrating cells. A similar protein in mouse is involved in cell cycle progression.
18374	
18375	
18376	
18377	
18378	
18379	
18380	
18381	Vitamin B6, or pyridoxal 5-prime-phosphate (PLP), is critical for normal cellular function, and some cancer cells have notable differences in vitamin B6 metabolism compared to their normal counterparts. The rate-limiting enzyme in vitamin B6 synthesis is pyridoxine-5-prime-phosphate (PNP) oxidase (PNPO; EC 1.4.3.5).[supplied by OMIM]
18382	
18383	The protein encoded by this gene contains a RING finger, a motif present in a variety of functionally distinct proteins and known to be involved in protein-DNA and protein-protein interactions. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
18384	This gene encodes a member of the aquaporin family of intrinsic membrane proteins that function as water-selective channels in the plasma membranes of many cells. The encoded protein is the predominant aquaporin found in brain. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
18385	The protein encoded by this gene belongs to the SMAD, a family of proteins similar to the gene products of the Drosophila gene @#$%&mothers against decapentaplegic@#$%& (Mad) and the C. elegans gene Sma. SMAD proteins are signal transducers and transcriptional modulators that mediate multiple signaling pathways. This protein mediates the signal of the transforming growth factor (TGF)-beta, and thus regulates multiple cellular processes, such as cell proliferation, apoptosis, and differentiation. This protein is recruited to the TGF-beta receptors through its interaction with the SMAD anchor for receptor activation (SARA) protein. In response to TGF-beta signal, this protein is phosphorylated by the TGF-beta receptors. The phosphorylation induces the dissociation of this protein with SARA and the association with the family member SMAD4. The association with SMAD4 is important for the translocation of this protein into the nucleus, where it binds to target promoters and forms a transcription repressor complex with other cofactors. This protein can also be phosphorylated by activin type 1 receptor kinase, and mediates the signal from the activin. Alternatively spliced transcript variants encoding the same protein have been observed.
18386	
18387	
18388	Inositol 1,4,5-trisphosphate 3-kinase (ITPK) catalyzes the phosphorylation of Ins(1,4,5)P3 to Ins(1,3,4,5)P4, both of which are modulators of calcium homeostasis. ITPK isoforms include ITPKA (MIM 147521), ITPKB (MIM 147522), and ITPKC, all of which contain a conserved catalytic unit in their C termini, but have unique N-terminal sequences and tissue distributions.[supplied by OMIM]
18389	
18390	
18391	This gene encodes subunit 2 of the splicing factor 3a protein complex. The splicing factor 3a heterotrimer includes subunits 1, 2 and 3 and is necessary for the in vitro conversion of 15S U2 snRNP into an active 17S particle that performs pre-mRNA splicing. Subunit 2 interacts with subunit 1 through its amino-terminus while the single zinc finger domain of subunit 2 plays a role in its binding to the 15S U2 snRNP. Subunit 2 may also function independently of its RNA splicing function as a microtubule-binding protein.
18392	
18393	
18394	This intronless gene encodes a 70kDa heat shock protein which is a member of the heat shock protein 70 family. In conjuction with other heat shock proteins, this protein stabilizes existing proteins against aggregation and mediates the folding of newly translated proteins in the cytosol and in organelles. It is also involved in the ubiquitin-proteasome pathway through interaction with the AU-rich element RNA-binding protein 1. The gene is located in the major histocompatibility complex class III region, in a cluster with two closely related genes which encode similar proteins.
18395	
18396	
18397	
18398	DiGeorge syndrome, and more widely, the CATCH 22 syndrome, are associated with microdeletions in chromosomal region 22q11.2. This gene product shares homology with the Drosophila melanogaster gonadal protein, which participates in gonadal and germ cell development, and with the human laminin gamma-1 chain, which upon polymerization with alpha- and beta-chains forms the laminin molecule. Laminin binds to cells through interaction with a receptor and has functions in cell attachment, migration, and tissue organization during development. This gene could be a candidate for involvement in the DiGeorge syndrome pathology by playing a role in neural crest cell migration into the third and fourth pharyngeal pouches, the structures from which derive the organs affected in DiGeorge syndrome.
18399	The protein encoded by this gene is a dual-specificity protein phosphatase that can dephosphorylate proteins at serine, threonine, and tyrosine residues. This protein is structurally similar to myotubularin but in addition contains a FYVE domain, a double zinc finger motif which specifically binds phosphatidylinositol(3)-phosphate. The encoded protein has been observed to have a perinuclear, possibly membrane-bound, distribution in cells, but it has also been found free in the cytoplasm. Three transcript variants encoding three different isoforms have been found for this gene.
18400	The interleukin 2 receptor, which is involved in T cell-mediated immune responses, is present in 3 forms with respect to ability to bind interleukin 2. The low affinity form is a monomer of the alpha subunit and is not involved in signal transduction. The intermediate affinity form consists of an alpha/beta subunit heterodimer, while the high affinity form consists of an alpha/beta/gamma subunit heterotrimer. Both the intermediate and high affinity forms of the receptor are involved in receptor-mediated endocytosis and transduction of mitogenic signals from interleukin 2. The protein encoded by this gene represents the beta subunit and is a type I membrane protein.
18401	This gene is directly involved, with Myc and IgH, in a three-way gene translocation in a Burkitt lymphoma cell line. As a result of the gene translocation, the N-terminal region of the gene product is disrupted, which is thought to be related to the pathogenesis of a subset of high-grade B cell non-Hodgkin lymphoma. The N-terminal segment involved in the translocation includes the region that shares a strong sequence similarity with those of BCL7B and BCL7C.
18402	This gene is part of a gene cluster on chromosome Xp11.23. The encoded protein contains a zinc finger motif often found in transcriptional regulators, however, its exact function is not known.
18403	
18404	
18405	
18406	
18407	
18408	
18409	
18410	
18411	Homeobox genes encode DNA-binding proteins, many of which are thought to be involved in early embryonic development. Homeobox genes encode a DNA-binding domain of 60 to 63 amino acids referred to as the homeodomain. This pseudogene is a member of the TPRX homeobox gene family.
18412	
18413	
18414	
18415	
18416	
18417	
18418	
18419	
18420	This gene encodes a mitochondrially localized enzyme that catalyzes the reversible formation of acetoacetyl-CoA from two molecules of acetyl-CoA. This gene spans approximately 27 kb and contains 12 exons interrupted by 11 introns. Defects in this gene are associated with the alpha-methylacetoaceticaciduria disorder, an inborn error of isoleucine catabolism characterized by urinary excretion of 2-methyl-3-hydroxybutyric acid, 2-methylacetoacetic acid, tiglylglycine, and butanone.
18421	
18422	Transport of small hydrophilic substances across cell membranes is mediated by substrate-specific transporter proteins which have been classified into several families of related genes. The protein encoded by this gene is a member of the subgroup of transporter with unidentified substrates within the Na+ and Cl- coupled transporter family. This gene is expressed in kidney, and its alternative splicing generates 2 transcript variants.
18423	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum; its substrate has not yet been determined. This gene, which produces two transcript variants, is part of a large cluster of cytochrome P450 genes from the CYP2A, CYP2B and CYP2F subfamilies on chromosome 19q.
18424	
18425	
18426	
18427	
18428	
18429	This gene encodes one of seven subunits of the human Arp2/3 protein complex. The Arp2/3 protein complex has been implicated in the control of actin polymerization in cells and has been conserved through evolution. The exact role of the protein encoded by this gene, the p20 subunit, has yet to be determined.
18430	
18431	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L34E family of ribosomal proteins. It is located in the cytoplasm. This gene originally was thought to be located at 17q21, but it has been mapped to 4q. Transcript variants derived from alternative splicing, alternative transcription initiation sites, and/or alternative polyadenylation exist; these variants encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
18432	
18433	
18434	
18435	
18436	This gene encodes a member of the forkhead box (FOX) family of transcription factors. FOX family members are monomeric, helix-turn-helix proteins with a core DNA-binding domain of approximately 110 aa. Many FOX transcription factors play roles in determining cell fates during early development. This forkhead box protein lacks the C-terminal basic region found in many other FOX family members. It is located within the 11q23.3 region which is commonly deleted in neuroblastomas.
18437	
18438	
18439	
18440	
18441	
18442	
18443	
18444	
18445	
18446	This gene encodes a protein that mediates ligand-independent transcription repression of thyroid-hormone and retinoic-acid receptors by promoting chromatin condensation and preventing access of the transcription machinery. It is part of a complex which also includes histone deacetylases and transcriptional regulators similar to the yeast protein Sin3p. This gene is located between the Charcot-Marie-Tooth and Smith-Magenis syndrome critical regions on chromosome 17. An alternatively spliced transcript variant has been described, but its full length sequence has not been determined.
18447	
18448	
18449	
18450	
18451	
18452	
18453	This gene encodes an enzyme that catalyzes the NAD(P)H-dependent conversion of pyrroline-5-carboxylate to proline. This enzyme may also play a physiologic role in the generation of NADP(+) in some cell types. The protein forms a homopolymer and localizes to the mitochondrion. Alternate splicing results in two transcript variants encoding different isoforms.
18454	
18455	
18456	
18457	
18458	
18459	
18460	
18461	
18462	
18463	
18464	
18465	
18466	
18467	
18468	
18469	
18470	
18471	
18472	
18473	
18474	
18475	tRNA splicing is a fundamental process required for cell growth and division. SEN54 is a subunit of the tRNA splicing endonuclease, which catalyzes the removal of introns, the first step in tRNA splicing (Paushkin et al., 2004).[supplied by OMIM]
18476	
18477	
18478	
18479	
18480	
18481	The zinc finger protein encoded by this gene is one of several cellular transcription factors whose DNA-binding activities are regulated through the action of adenovirus E1A. A 50-kDa amino-terminal product is generated from the full-length protein through proteolytic cleavage. The protein is differentially regulated by E1A-induced phosphorylation. The full-length gene product represses transcription from the E4 promoter in the absence of E1A, while the 50-kDa form acts as a transcriptional activator in its presence.
18482	This gene encodes a nuclear-encoded mitochondrial metalloprotease protein that is a member of the AAA (ATPases associated with a variety of cellular activities) protein family. Members of this protein family share an ATPase domain and have roles in diverse cellular processes including membrane trafficking, intracellular motility, organelle biogenesis, protein folding, and proteolysis. Two transcript variants encoding distinct isoforms have been identified for this gene. Mutations associated with this gene cause autosomal recessive spastic paraplegia 7.
18483	
18484	
18485	
18486	
18487	
18488	Arylsulfatase B encoded by this gene belongs to the sulfatase family. The arylsulfatase B homodimer hydrolyzes sulfate groups of N-Acetyl-D-galactosamine, chondriotin sulfate, and dermatan sulfate. The protein is targetted to the lysozyme. Defects in this gene cause Maroteaux-Lamy syndrome. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
18489	
18490	
18491	
18492	
18493	
18494	
18495	
18496	
18497	
18498	
18499	
18500	The lethal (2) giant larvae protein of Drosophila plays a role in asymmetric cell division, epithelial cell polarity, and cell migration. This human gene encodes a protein similar to lethal (2) giant larvae of Drosophila. In fly, the protein@#$%&s ability to localize cell fate determinants is regulated by the atypical protein kinase C (aPKC). In human, this protein interacts with aPKC-containing complexes and is cortically localized in mitotic cells. Alternative splicing results in multiple transcript variants encoding different isoforms.
18501	
18502	
18503	Peptidylarginine deiminases (PADs; EC 3.5.3.15), such as PADI2, are a family of posttranslational modification enzymes that convert arginine residues in peptides to citrulline residues in the presence of calcium ion.[supplied by OMIM]
18504	
18505	The protein encoded by this gene is a member of the SEC22 family of vesicle trafficking proteins. It is localized at the endoplasmic reticulum and it is thought to play a role in the early stages of the ER-Golgi protein trafficking. There are two alternatively spliced transcript variants encoding different isoforms described for this gene.
18506	
18507	
18508	Chemokines are a group of small (approximately 8 to 14 kD), mostly basic, structurally related molecules that regulate cell trafficking of various types of leukocytes through interactions with a subset of 7-transmembrane, G protein-coupled receptors. Chemokines also play fundamental roles in the development, homeostasis, and function of the immune system, and they have effects on cells of the central nervous system as well as on endothelial cells involved in angiogenesis or angiostasis. Chemokines are divided into 2 major subfamilies, CXC and CC. This gene is a CXC member of the chemokine superfamily. Its encoded protein induces a chemotactic response in activated T-cells and is the dominant ligand for CXC receptor-3. The gene encoding this protein contains 4 exons and at least three polyadenylation signals which might reflect cell-specific regulation of expression. IFN-gamma is a potent inducer of transcription of this gene.
18509	MaxiK channels are large conductance, voltage and calcium-sensitive potassium channels which are fundamental to the control of smooth muscle tone and neuronal excitability. MaxiK channels can be formed by 2 subunits: the pore-forming alpha subunit and the product of this gene, the modulatory beta subunit. Intracellular calcium regulates the physical association between the alpha and beta subunits.
18510	
18511	Lymphoid enhancer-binding factor-1 (LEF1) is a 48-kD nuclear protein that is expressed in pre-B and T cells. It binds to a functionally important site in the T-cell receptor-alpha (TCRA; MIM 186880) enhancer and confers maximal enhancer activity. LEF1 belongs to a family of regulatory proteins that share homology with high mobility group protein-1 (HMG1; MIM 163905).[supplied by OMIM]
18512	
18513	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene has sequence homology to members of the histone deacetylase family. This gene is orthologous to the Xenopus and mouse MITR genes. The MITR protein lacks the histone deacetylase catalytic domain. It represses MEF2 activity through recruitment of multicomponent corepressor complexes that include CtBP and HDACs. This encoded protein may play a role in hematopoiesis. Multiple alternatively spliced transcripts have been described for this gene but the full-length nature of some of them has not been determined.
18514	
18515	
18516	
18517	
18518	
18519	
18520	The protein encoded by this gene is a member of the fibroblast growth factor receptor family, where amino acid sequence is highly conserved between members and throughout evolution. FGFR family members differ from one another in their ligand affinities and tissue distribution. A full-length representative protein consists of an extracellular region, composed of three immunoglobulin-like domains, a single hydrophobic membrane-spanning segment and a cytoplasmic tyrosine kinase domain. The extracellular portion of the protein interacts with fibroblast growth factors, setting in motion a cascade of downstream signals, ultimately influencing mitogenesis and differentiation. This particular family member is a high-affinity receptor for acidic, basic and/or keratinocyte growth factor, depending on the isoform. Mutations in this gene are associated with many craniosynostotic syndromes and bone malformations. The genomic organization of this gene encompasses 20 exons. Alternative splicing in multiple exons, including those encoding the Ig-like domains, the transmembrane region and the carboxyl terminus, results in varied isoforms which differ in structure and specificity.
18521	MBL2 encodes the soluble mannose-binding protein found in serum.  MBL2 recognizes mannose and N-acetylglucosamine on bacterial pathogens, and is capable of activating the classical complement pathway.
18522	
18523	
18524	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. Overexpression of this gene results in a decrease in the processing of insulin and hepatocyte growth factor receptors to their mature subunits. This decrease is caused by the mislocalization of furin, the endoprotease responsible for cleavage of insulin and hepatocyte growth factor receptors. This protein is involved in endosomal trafficking from the plasma membrane to recycling endosomes or the trans-Golgi network. This gene encodes two transcript variants encoding distinct isoforms.
18525	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Alpha crystallins are composed of two gene products: alpha-A and alpha-B, for acidic and basic, respectively. Alpha crystallins can be induced by heat shock and are members of the small heat shock protein (sHSP also known as the HSP20) family. They act as molecular chaperones although they do not renature proteins and release them in the fashion of a true chaperone; instead they hold them in large soluble aggregates. Post-translational modifications decrease the ability to chaperone. These heterogeneous aggregates consist of 30-40 subunits; the alpha-A and alpha-B subunits have a 3:1 ratio, respectively. Two additional functions of alpha crystallins are an autokinase activity and participation in the intracellular architecture. Alpha-A and alpha-B gene products are differentially expressed; alpha-A is preferentially restricted to the lens and alpha-B is expressed widely in many tissues and organs. Elevated expression of alpha-B crystallin occurs in many neurological diseases; a missense mutation cosegregated in a family with a desmin-related myopathy.
18526	
18527	
18528	This gene encodes the insulin receptor substrate 2, a cytoplasmic signaling molecule that mediates effects of insulin, insulin-like growth factor 1, and other cytokines by acting as a molecular adaptor between diverse receptor tyrosine kinases and downstream effectors. The product of this gene is phosphorylated by the insulin receptor tyrosine kinase upon receptor stimulation, as well as by an interleukin 4 receptor-associated kinase in response to IL4 treatment.
18529	
18530	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This phosphoprotein localizes to nuclear bodies where it functions as a transcription factor and tumor suppressor. Its expression is cell-cycle related and it regulates the p53 response to oncogenic signals. The gene is often involved in the translocation with the retinoic acid receptor alpha gene associated with acute promyelocytic leukemia (APL). Extensive alternative splicing of this gene results in several variations of the protein@#$%&s central and C-terminal regions; all variants encode the same N-terminus. Fourteen transcript variants for this gene have been described; however, the full length nature of three of the variants has not been determined.
18531	
18532	
18533	
18534	
18535	
18536	
18537	
18538	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L37E family of ribosomal proteins. It is located in the cytoplasm. The protein contains a C2C2-type zinc finger-like motif. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
18539	This gene is one of the at least 20 genes expressed during epidermal differentiation and located on chromosomal band 1q21. This gene is involved in epidermal differentiation, and it is expressed at high levels in normal and psoriatic skin, but not in cultured keratinocytes or in any other tested cell types or tissues.
18540	
18541	The protein encoded by this gene is an axonal glycoprotein belonging to the immunoglobulin supergene family. The ectodomain, consisting of several immunoglobulin-like domains and fibronectin-like repeats (type III), is linked via a single transmembrane sequence to a conserved cytoplasmic domain. This cell adhesion molecule plays an important role in nervous system development, including neuronal migration and differentiation. Mutations in the gene cause three X-linked neurological syndromes known by the acronym CRASH (corpus callosum hypoplasia, retardation, aphasia, spastic paraplegia and hydrocephalus). Alternative splicing of a neuron-specific exon is thought to be functionally relevant.
18542	
18543	
18544	
18545	
18546	
18547	
18548	
18549	This gene encodes a protein that has been directly linked to Bardet-Biedl syndrome. The primary features of this syndrome include retinal dystrophy, obesity, polydactyly, renal abnormalities and learning disabilities. Experimentation in non-human eukaryotes suggests that this gene is expressed in ciliated cells and that it is required for the formation of cilia. Alternate transcriptional splice variants have been observed but have not been fully characterized.
18550	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It is overexpressed in cervical cancer cell line and strongly coexpressed with another family member, WNT10A, in colorectal cancer cell line. The gene overexpression may play key roles in carcinogenesis. This gene and the WNT10A gene are clustered in the chromosome 2q35 region. The protein encoded by this gene is 97% identical to the mouse Wnt6 protein at the amino acid level.
18551	The protein encoded by this gene is a cytokine receptor that belongs to the interleukin 1 receptor family. This receptor specifically binds interleukin 18 (IL18), and is essential for IL18 mediated signal transduction. IFN-alpha and IL12 are reported to induce the expression of this receptor in NK and T cells. This gene along with four other members of the interleukin 1 receptor family, including IL1R2, IL1R1, ILRL2 (IL-1Rrp2), and IL1RL1 (T1/ST2), form a gene cluster on chromosome 2q.
18552	The protein encoded by this gene is a type I transmembrane protein and is a tumor-specific endothelial marker that has been implicated in colorectal cancer. This protein has been shown to also be a docking protein or receptor for Bacillus anthracis toxin, the causative agent of the disease, anthrax. The binding of the protective antigen (PA) component, of the tripartite anthrax toxin, to this receptor protein mediates delivery of toxin components to the cytosol of cells. Once inside the cell, the other two components of anthrax toxin, edema factor (EF) and lethal factor (LF) disrupt normal cellular processes. Three alternatively spliced variants have been described.
18553	
18554	
18555	
18556	The protein encoded by this gene is a member of the immunoglobulin superfamily. It is a glycosylphosphatidylinositol (GPI)-anchored neuronal membrane protein that functions as a cell adhesion molecule. It may play a role in the formation of axon connections in the developing nervous system. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
18557	Ghrelin is an endogenous ligand for the growth hormone secretagogue receptor (GHSR; MIM 601898) and is involved in regulating GH release.[supplied by OMIM]
18558	This gene encodes a lysosomal hyaluronidase. Hyaluronidases intracellularly degrade hyaluronan, one of the major glycosaminoglycans of the extracellular matrix. Hyaluronan is thought to be involved in cell proliferation, migration and differentiation. This enzyme is active at an acidic pH and is the major hyaluronidase in plasma. Mutations in this gene are associated with mucopolysaccharidosis type IX, or hyaluronidase deficiency. The gene is one of several related genes in a region of chromosome 3p21.3 associated with tumor suppression. Eight alternatively spliced transcript variants of this gene encoding six distinct isoforms have been described.
18559	This gene encodes the enzyme acylpeptide hydrolase, which catalyzes the hydrolysis of the terminal acetylated amino acid preferentially from small acetylated peptides. The acetyl amino acid formed by this hydrolase is further processed to acetate and a free amino acid by an aminoacylase. This gene is located within the same region of chromosome 3 (3p21) as the aminoacylase gene, and deletions at this locus are also associated with a decrease in aminoacylase activity. The acylpeptide hydrolase is a homotetrameric protein of 300 kDa with each subunit consisting of 732 amino acid residues. It can play an important role in destroying oxidatively damaged proteins in living cells. Deletions of this gene locus are found in various types of carcinomas, including small cell lung carcinoma and renal cell carcinoma.
18560	
18561	
18562	
18563	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic filaments. Its function has not been identified.
18564	
18565	This gene encodes a protein that binds to the C-terminus of adenovirus E1A proteins. This phosphoprotein is a transcriptional repressor and may play a role during cellular proliferation. This protein and the product of a second closely related gene, CTBP2, can dimerize. Both proteins can also interact with a polycomb group protein complex which participates in regulation of gene expression during development. Alternative splicing of transcripts from this gene results in multiple transcript variants.
18566	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein is also a member of the ADP ribosylation factor family of guanine nucleotide-binding family of proteins. Its carboxy terminus contains an ADP-ribosylation factor domain and a guanine nucleotide binding site, while the amino terminus contains a GTPase activating protein domain which acts on the guanine nucleotide binding site. The protein localizes to lysosomes and the Golgi apparatus. It plays a role in the formation of intracellular transport vesicles, their movement from one compartment to another, and phopholipase D activation. Three alternatively spliced transcript variants for this gene have been described.
18567	The protein encoded by this gene is a cytokine that acts as a growth and differentiation factor for both B cells and eosinophils. This cytokine is a main regulator of eosinopoiesis, eosinophil maturation and activation. The elevated production of this cytokine is reported to be related to asthma or hypereosinophilic syndromes. The receptor of this cytokine is a heterodimer, whose beta subunit is shared with the receptors for interleukine 3 (IL3) and colony stimulating factor 2 (CSF2/GM-CSF). This gene, together with those for interleukin 4 (IL4), interleukin 13 (IL13), and CSF2, form a cytokine gene cluster on chromosome 5. This cytokine, IL4, and IL13 are found to be regulated coordinately by long-range regulatory elements spread over 120 kilobases on chromosome 5q31.
18568	The protein encoded by this gene belongs to a small group of evolutionarily conserved proteins with three transmembrane domains. It is a potential target for ubiquitination by the Nedd4 family of proteins. This protein is thought to be part of a family of integral Golgi membrane proteins.
18569	The protein encoded by this gene belongs to a small group of evolutionarily conserved proteins with three transmembrane domains. It is a potential target for ubiquitination by the Nedd4 family of proteins. This protein is thought to be part of a family of integral Golgi membrane proteins.
18570	
18571	
18572	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S10P family. Pseudogenes corresponding to this gene are found on chromosomes 1q, 3p, and 9p.
18573	
18574	
18575	
18576	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
18577	
18578	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 99% identical to the mouse, rat and sheep orthologs. The encoded protein interacts with IRS1 protein, suggesting a role in regulating insulin sensitivity. Two transcript variants differing in the 5@#$%& UTR, but encoding the same protein, have been identified for this gene.
18579	
18580	This gene encodes the smallest subunit of dynactin, a macromolecular complex consisting of 10 subunits ranging in size from 22 to 150 kD. Dynactin binds to both microtubules and cytoplasmic dynein. It is involved in a diverse array of cellular functions, including ER-to-Golgi transport, the centripetal movement of lysosomes and endosomes, spindle formation, cytokinesis, chromosome movement, nuclear positioning, and axonogenesis. This subunit, like most other dynactin subunits, exists only as a part of the dynactin complex. It is primarily an alpha-helical protein with very little coiled coil, and binds directly to the largest subunit (p150) of dynactin. Alternative splicing of this gene generates 2 transcript variants.
18581	
18582	
18583	
18584	The protein encoded by this gene is a receptor for mannose-6-phosphate groups on lysosomal enzymes. The receptor forms a homodimer or homotetramer for intracellular targeting of lysosomal enzymes and export of newly synthesized lysosomal enzymes into the cell secretions. The receptor is an integral membrane protein which localizes to the trans-Golgi reticulum, endosomes, and the plasma membrane.
18585	
18586	This gene encodes a member of the Kruppel-like factor subfamily of zinc finger proteins. Since the protein localizes to the nucleus and binds the epidermal growth factor response element, it is thought to be a transcription factor.
18587	
18588	
18589	
18590	
18591	The protein encoded by this gene is a member of the chromogranin/secretogranin family of neuroendocrine secretory proteins. Granins may serve as precursors for biologically active peptides. Some granins have been shown to function as helper proteins in sorting and proteolytic processing of prohormones; however, the function of this protein is unknown.
18592	
18593	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
18594	
18595	
18596	
18597	
18598	
18599	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptors can interact with various TRAF family proteins, through which this receptor induces the activation of NF-kappa B and MAPK8/JNK. This receptor and its ligand are important regulators of the interaction between T cells and dendritic cells. This receptor is also an essential mediator for osteoclast and lymph node development. Studies of the mouse counterpart suggest that this receptor directly mediates the osteoprotegerin ligand (OPGL)-induced osteoclastogenesis in osteoclast precursor cells.
18600	
18601	
18602	
18603	
18604	
18605	
18606	
18607	Aminopeptidase P is a hydrolase specific for N-terminal imido bonds, which are common to several collagen degradation products, neuropeptides, vasoactive peptides, and cytokines. Structurally, the enzyme is a member of the @#$%&pita bread fold@#$%& family and occurs in mammalian tissues in both soluble and GPI-anchored membrane-bound forms.  A membrane-bound and soluble form of this enzyme have been identified as products of two separate genes.
18608	
18609	
18610	
18611	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-transferase that belongs to the mu class. The mu class of enzymes functions in the detoxification of electrophilic compounds, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress, by conjugation with glutathione. The genes encoding the mu class of enzymes are organized in a gene cluster on chromosome 1p13.3 and are known to be highly polymorphic. These genetic variations can change an individual@#$%&s susceptibility to carcinogens and toxins as well as affect the toxicity and efficacy of certain drugs. Diversification of these genes has occurred in regions encoding substrate-binding domains, as well as in tissue expression patterns, to accommodate an increasing number of foreign compounds.
18612	
18613	
18614	
18615	This gene belongs to the neuron navigator family and is expressed predominantly in the nervous system. The encoded protein contains coiled-coil domains and a conserved AAA domain characteristic for ATPases associated with a variety of cellular activities. This gene is similar to unc-53, a Caenorhabditis elegans gene involved in axon guidance. The exact function of this gene is not known.
18616	
18617	
18618	
18619	
18620	
18621	
18622	
18623	
18624	
18625	
18626	
18627	
18628	
18629	
18630	
18631	
18632	
18633	
18634	
18635	
18636	
18637	
18638	
18639	The protein encoded by this gene has similarity to a yeast protein. It seems to be part of a peripheral membrane protein complex localized on cis/medial Golgi cisternae where it may participate in tethering intra-Golgi transport vesicles.
18640	
18641	The protein encoded by this gene is a member of the SWI/SNF family of proteins and is similar to the brahma protein of Drosophila. Members of this family have helicase and ATPase activities and are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI, which is required for transcriptional activation of genes normally repressed by chromatin. In addition, this protein can bind BRCA1, as well as regulate the expression of the tumorigenic protein CD44. Alternatively spliced transcripts have been found for this gene, but their full-length natures have not been determined.
18642	
18643	
18644	
18645	
18646	
18647	
18648	
18649	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes a protein which is not a structural subunit, but may be a heme A biosynthetic enzyme involved in COX formation, according to the yeast mutant studies. However, the studies in Rhodobacter sphaeroides suggest that this gene is not required for heme A biosynthesis, but required for stable formation of the Cu(B) and magnesium centers of COX. This human protein is predicted to contain a transmembrane domain localized in the mitochondrial inner membrane. A pseudogene COX11P has been found on chromosome 6.
18650	
18651	
18652	
18653	
18654	
18655	
18656	
18657	
18658	
18659	
18660	
18661	
18662	
18663	
18664	
18665	
18666	
18667	
18668	
18669	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. Two alternatively spliced transcript variants of this gene have been reported, one of which encodes a receptor-type PTP that possesses a short extracellular domain, a single transmembrane region, and two tandem intracytoplasmic catalytic domains; Another one encodes a PTP that contains a distinct hydrophilic N-terminus, and thus represents a nonreceptor-type isoform of this PTP. Studies of the similar gene in mice suggested the regulatory roles of this PTP in RAS related signal transduction pathways, cytokines induced SATA signaling, as well as the activation of voltage-gated K+ channels.
18670	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. Two alternatively spliced transcript variants of this gene have been reported, one of which encodes a receptor-type PTP that possesses a short extracellular domain, a single transmembrane region, and two tandem intracytoplasmic catalytic domains; Another one encodes a PTP that contains a distinct hydrophilic N-terminus, and thus represents a nonreceptor-type isoform of this PTP. Studies of the similar gene in mice suggested the regulatory roles of this PTP in RAS related signal transduction pathways, cytokines induced SATA signaling, as well as the activation of voltage-gated K+ channels.
18671	
18672	
18673	
18674	
18675	
18676	
18677	The protein encoded by this gene is a member of the kelch family of proteins, which is characterized by a 44-56 amino acid repeat motif. The kelch motif appears in many different polypeptide contexts and contains multiple potential protein-protein contact sites. Members of this family are present both throughout the cell and extracellularly, with diverse activities.
18678	
18679	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L31E family of ribosomal proteins. It is located in the cytoplasm. Higher levels of expression of this gene in familial adenomatous polyps compared to matched normal tissues have been observed. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
18680	
18681	The protein encoded by this gene is highly similar to Nocturnin, a gene identified as a circadian clock regulated gene in Xenopus laevis. This protein and Nocturnin protein share similarity with the C-terminal domain of a yeast transcription factor, carbon catabolite repression 4 (CCR4). The mRNA abundance of a similar gene in mouse has been shown to exhibit circadian rhythmicity, which suggests a role for this protein in clock function or as a circadian clock effector.
18682	
18683	This gene encodes a member of the tenascin family of extracellular matrix proteins. The tenascins have anti-adhesive effects, as opposed to fibronectin which is adhesive. The specific function of this protein is unknown, but its deficiency has been associated with the connective tissue disorder Ehlers-Danlos syndrome. This gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6. It is one of four genes in this cluster which have been duplicated. The duplicated copy of this gene is incomplete and is a pseudogene which is transcribed but does not encode a protein. The structure of this gene is unusual in that it overlaps the CREBL1 and CYP21A2 genes at its 5@#$%& and 3@#$%& ends, respectively. The gene expresses two transcripts.
18684	
18685	
18686	
18687	
18688	
18689	The nucleolus is a dense subnuclear membraneless organelle that assembles around clusters of rRNA genes and functions in ribosome biogenesis. This gene encodes a nucleolar RNA-associated protein that is highly conserved between species. RNase treatment of permeabilized cells indicates that the nucleolar localization is RNA dependent. Further studies suggest that the protein is associated with ribosome biogenesis through an interaction with pre-rRNA primary transcripts. Alternative splicing has been observed at this locus and three splice variants, each encoding a distinct isoform, have been identified.
18690	
18691	This gene is located in the surfeit gene cluster, a group of very tightly linked genes that do not share sequence similarity. The gene demonstrates features of a housekeeping gene, being ubiquitously expressed, and the encoded protein has been localized to the nucleolus. The protein includes motifs found in both the mouse and fish orthologs, which suggests a putative function as a nucleolar-matrix protein with nucleic acid-binding properties, based on characteristics determined in mouse.
18692	The protein encoded by this gene belongs to the SEC13 family of WD-repeat proteins. It has similarity to the yeast SEC13 protein, which is required for vesicle biogenesis from endoplasmic reticulum during the transport of proteins. Two transcript variants encoding different isoforms have been found for this gene.
18693	
18694	
18695	
18696	The protein encoded by this gene is a receptor for glucocorticoids and can act as both a transcription factor and a regulator of other transcription factors. The encoded protein can bind DNA as a homodimer or as a heterodimer with another protein such as the retinoid X receptor. This protein can also be found in heteromeric cytoplasmic complexes along with heat shock factors and immunophilins. The protein is typically found in the cytoplasm until it binds a ligand, which induces transport into the nucleus. Mutations in this gene are a cause of glucocorticoid resistance, or cortisol resistance. Alternate splicing, the use of at least three different promoters, and alternate translation initiation sites result in several transcript variants encoding the same protein or different isoforms, but the full-length nature of some variants has not been determined.
18697	
18698	
18699	
18700	The protein encoded by this gene was identified as a binding protein of the protein kinase C, delta (PRKCD). The expression of this gene in cultured cell lines is strongly induced by serum starvation. The expression of this protein was found to be down-regulated in various cancer cell lines, suggesting the possible tumor suppressor function of this protein.
18701	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein. It may contribute to the cell proliferation and carcinogenesis.
18702	The protein encoded by this gene contains a paired-like homeodomain most similar to that of the Drosophila aristaless gene product. This protein is expressed specifically in noradrenergic cell types. It regulates the expression of tyrosine hydroxylase and dopamine beta-hydroxylase, two catecholaminergic biosynthetic enzymes essential for the differentiation and maintenance of noradrenergic phenotype. Mutations in this gene have been associated with autosomal recessive congenital fibrosis of the extraocular muscles (CFEOM2).
18703	The protein encoded by this gene contains a C2H2 zinc finger, and has been shown to function as a transcriptional repressor. The Kruppel-associated box (KRAB) domain of this protein is found to be responsible for its transcriptional repression activity. RING finger containing protein TIF1 was reported to interact with the KRAB domain, and may serve as a mediator for the repression activity of this protein.
18704	
18705	
18706	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of aminophospholipid-transporting ATPases. The aminophospholipid translocases transport phosphatidylserine and phosphatidylethanolamine from one side of a bilayer to another. This gene is maternally expressed. It maps within the most common interval of deletion responsible for Angelman syndrome, also known as @#$%&happy puppet syndrome@#$%&.
18707	
18708	Protein disulfide isomerases (EC 5.3.4.1), such as PDIP, are endoplasmic reticulum (ER) resident proteins that catalyze protein folding and thiol-disulfide interchange reactions (Desilva et al., 1996).[supplied by OMIM]
18709	
18710	
18711	
18712	The product of this gene is a membrane-associated guanylate kinase found at the postsynaptic density in neuronal cells. It is a signaling molecule that can interact with potassium channels and receptors, as well as other signaling molecules. The protein encoded by this gene can interact with PSD-95 through its guanylate kinase domain and may be involved in clustering PSD-95 in the postsynaptic density region. The encoded protein is one of at least four similar proteins that have been found. Alternatively spliced transcript variants encoding different isoforms have been found for this gene.
18713	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and the kininogens. The type 2 cystatin proteins are a class of cysteine proteinase inhibitors found in a variety of human fluids and secretions, where they appear to provide protective functions. The cystatin locus on chromosome 20 contains the majority of the type 2 cystatin genes and pseudogenes. This gene is located in the cystatin locus and encodes the most abundant extracellular inhibitor of cysteine proteases, which is found in high concentrations in biological fluids and is expressed in virtually all organs of the body. A mutation in this gene has been associated with amyloid angiopathy. Expression of this protein in vascular wall smooth muscle cells is severely reduced in both atherosclerotic and aneurysmal aortic lesions, establishing its role in vascular disease.
18714	
18715	
18716	
18717	
18718	
18719	
18720	
18721	
18722	
18723	
18724	
18725	
18726	
18727	This gene product is highly similar to Schizosaccharomyces pombe rad9, a cell cycle checkpoint protein required for cell cycle arrest and DNA damage repair in response to DNA damage. This protein is found to possess 3@#$%& to 5@#$%& exonuclease activity, which may contribute to its role in sensing and repairing DNA damage. It forms a checkpoint protein complex with RAD1 and HUS1. This complex is recruited by checkpoint protein RAD17 to the sites of DNA damage, which is thought to be important for triggering the checkpoint-signaling cascade. Use of alternative polyA sites has been noted for this gene.
18728	
18729	
18730	
18731	The inhibitory glycine receptor mediates postsynaptic inhibition in the spinal cord and other regions of the central nervous system. It is a pentameric receptor composed of alpha (GLRA1, MIM 138491; GLRA2, MIM 305990) and beta subunits.[supplied by OMIM]
18732	
18733	Epiregulin is a member of the epidermal growth factor family. Epiregulin can function as a ligand of EGFR (epidermal growth factor receptor), as well as a ligand of most members of the ERBB (v-erb-b2 oncogene homolog) family of tyrosine-kinase receptors.
18734	
18735	The protein encoded by this gene is a transcription factor of the PR-domain protein family. It contains a PR-domain and multiple zinc finger motifs. Ttranscription factors of the PR-domain family are known to be involved in cell differentiation and tumorigenesis.
18736	
18737	This gene belongs to the polypeptide N-acetylgalactosaminyltransferase (pp-GalNAc-T) gene family. Polypeptide GalNAc transferases initiate the synthesis of mucin-type oligosaccharides by transferring GalNAc from UDP-GalNAc to the hydroxyl group of either a serine or threonine residue on the polypeptide acceptor. Following expression in insect cells, recombinant GalNAc transferase 10 showed significant GalNAcT activity toward mucin-derived peptides, and it utilized both nonglycosylated and glycosylated peptide substrates. Two transcript variants encoding distinct isoforms have been identified for this gene.
18738	
18739	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. This structure consists of approximately 146 bp of DNA wrapped around a nucleosome, an octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene lack polyA tails; instead, they contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6p22-p21.3.
18740	
18741	
18742	
18743	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
18744	This gene encodes a protein associated with the cytoplasmic surface of synaptic vesicles. A subset of patients with stiff-man syndrome who were also affected by breast cancer are positive for autoantibodies against this protein. Alternate splicing of this gene results in two transcript variants encoding different isoforms. Additional splice variants have been described, but their full length sequences have not been determined.
18745	
18746	
18747	The protein encoded by this gene is a member of the TNF-receptor superfamily, and contains an intracelluar death domain. This receptor can be activated by tumor necrosis factor-related apoptosis inducing ligand (TNFSF10/TRAIL/APO-2L), and transduces apoptosis signal. Studies with FADD-deficient mice suggested that FADD, a death domain containing adaptor protein, is required for the apoptosis mediated by this protein.
18748	
18749	
18750	The activation of gene transcription is a multistep process that is triggered by factors that recognize transcriptional enhancer sites in DNA. These factors work with co-activators to direct transcriptional initiation by the RNA polymerase II apparatus. The protein encoded by this gene is a subunit of the CRSP (cofactor required for SP1 activation) complex, which, along with TFIID, is required for efficient activation by SP1. This protein is also a component of other multisubunit complexes e.g. thyroid hormone receptor-(TR-) associated proteins which interact with TR and facilitate TR function on DNA templates in conjunction with initiation factors and cofactors.
18751	This gene encodes a protein that is a member of the dickkopf family. The secreted protein contains two cysteine rich regions and is involved in embryonic development through its interactions with the Wnt signaling pathway. The expression of this gene is decreased in a variety of cancer cell lines and it may function as a tumor suppressor gene. Alternative splicing results in multiple transcript variants encoding the same protein.
18752	The aldehyde dehydrogenases are a family of isozymes that may play a major role in the detoxification of aldehydes generated by alcohol metabolism and lipid peroxidation. This particular gene spans about 20 kb of genomic DNA and is composed of 9 coding exons. The gene encodes a single transcript of 2.8 kb which is highly expressed in kidney and lung. The functional significance of this gene as well as the cellular localization of its product are presently unknown.
18753	
18754	
18755	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXC genes located in a cluster on chromosome 12. The protein level is controlled during cell differentiation and proliferation, which may indicate this protein role in origin activation. The exact role of this gene has to be determined.
18756	
18757	
18758	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility.
18759	
18760	
18761	
18762	
18763	
18764	
18765	YY1 is a ubiquitously distributed transcription factor belonging to the GLI-Kruppel class of zinc finger proteins. The protein is involved in repressing and activating a diverse number of promoters.  YY1 may direct histone deacetylases and histone acetyltransferases to a promoter in order to activate or repress the promoter, thus implicating histone modification in the function of YY1.
18766	
18767	The mitochondrial deoxyribonucleotide (dNTP) pool is separated from the cytosolic pool because the mitochondria inner membrane is impermeable to charged molecules. The mitochondrial pool is maintained by either import of cytosolic dNTPs through dedicated transporters or by salvaging deoxynucleosides within the mitochondria; apparently, enzymes of the de novo dNTP synthesis pathway are not present in the mitochondria. In nonreplicating cells, where cytosolic dNTP synthesis is downregulated, mtDNA synthesis depends solely on the mitochondrial salvage pathway enzymes, the deoxyribonucleoside kinases. Two of the 4 human deoxyribonucleoside kinases, deoxyguanosine kinase (DGK) and thymidine kinase-2, are expressed in mitochondria. Human DGK, encoded by the DGUOK gene (MIM 601465), efficiently phosphorylates deoxyguanosine and deoxyadenosine, whereas TK2 phosphorylates deoxythymidine, deoxycytidine, and deoxyuridine. Thymidine kinase-2 (TK2) is a deoxyribonucleoside kinase that phosphorylates thymidine, deoxycytidine, and deoxyuridine, and also phosphorylates antiviral and anticancer nucleoside analogs.[supplied by OMIM]
18768	
18769	
18770	
18771	
18772	
18773	
18774	
18775	
18776	
18777	
18778	
18779	The protein encoded by this gene was first identified by its binding to the APC protein which is often mutated in familial and sporadic forms of colorectal cancer. This protein localizes to microtubules, especially the growing ends, in interphase cells. During mitosis, the protein is associated with the centrosomes and spindle microtubules. The protein also associates with components of the dynactin complex and the intermediate chain of cytoplasmic dynein. Because of these associations, it is thought that this protein is involved in the regulation of microtubule structures and chromosome stability. This gene is a member of the RP/EB family.
18780	This gene encodes a type I integral membrane protein found only in the rough endoplasmic reticulum. The encoded protein is part of an N-oligosaccharyl transferase complex that links high mannose oligosaccharides to asparagine residues found in the Asn-X-Ser/Thr consensus motif of nascent polypeptide chains. This protein forms part of the regulatory subunit of the 26S proteasome and may mediate binding of ubiquitin-like domains to this proteasome.
18781	The protein encoded by this gene is a member of the homeodomain family of DNA binding proteins. It may regulate gene expression, morphogenesis, and differentiation and it may also play a role in the cell cycle progession. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
18782	
18783	
18784	
18785	
18786	
18787	This gene encodes a member of the diacylglycerol kinase (DGK) enzyme family of proteins, specifically the type II DGK subfamily. Members of this family are involved in regulating the intracellular concentrations of diacylglycerol and phosphatidic acid. Two transcript variants encoding distinct isoforms have been identified for this gene.
18788	
18789	This gene encodes a protein with ankyrin repeats, which are associated with protein-protein interactions. Studies in mice suggest that this protein is involved in liver development. Two transcript variants encoding different isoforms have been found for this gene.
18790	
18791	
18792	
18793	This gene encodes a member of the aldo/keto reductase superfamily, which consists of more than 40 known enzymes and proteins. These enzymes catalyze the conversion of aldehydes and ketones to their corresponding alcohols by utilizing NADH and/or NADPH as cofactors. The enzymes display overlapping but distinct substrate specificity. This enzyme binds bile acid with high affinity, and shows minimal 3-alpha-hydroxysteroid dehydrogenase activity. This gene shares high sequence identity with three other gene members and is clustered with those three genes at chromosome 10p15-p14.
18794	
18795	
18796	
18797	
18798	
18799	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the ALD subfamily, which is involved in peroxisomal import of fatty acids and/or fatty acyl-CoAs in the organelle. All known peroxisomal ABC transporters are half transporters which require a partner half transporter molecule to form a functional homodimeric or heterodimeric transporter. This peroxisomal membrane protein likely plays an important role in peroxisome biogenesis. Mutations have been associated with some forms of Zellweger syndrome, a heterogeneous group of peroxisome assembly disorders.
18800	
18801	
18802	
18803	
18804	
18805	
18806	
18807	
18808	Islet, or insulinoma, amyloid polypeptide (IAPP, or amylin) is commonly found in pancreatic islets of patients suffering diabetes mellitus type II, or harboring an insulinoma.  While the assosciation of amylin with the development of type II diabetes has been known for some time, a direct causative role for amylin has been harder to establish.  Recent results suggest that amylin, like the related beta-amyloid (Abeta) assosciated with Alzheimer@#$%&s disease, can induce apoptotic cell-death in particular cultured cells, an effect that may be relevant to the development of type II diabetes.
18809	
18810	
18811	
18812	
18813	
18814	
18815	
18816	
18817	
18818	This gene encodes a protein possessing an RNA helicase motif containing a DEXH box in its amino terminus and an RNA motif in the carboxy terminus. The encoded protein functions as a ribonuclease and is required by the RNA interference and small temporal RNA (stRNA) pathways to produce the active small RNA component that represses gene expression. Two transcript variants encoding the same protein have been identified for this gene.
18819	
18820	
18821	
18822	
18823	
18824	
18825	
18826	
18827	
18828	
18829	
18830	This gene was isolated using a selection system for genes encoding nuclear-targeted proteins. The gene product contains a pleckstrin homology (PH) domain and a coiled-coil region. PH domains occurs in a wide range of proteins involved in intracellular signaling. The domain family represents multiple functions and PH domains have been found to possess inserted domains and to be inserted within other domains. Alternative splicing occurs at this gene locus and two variants, encoding distinct isoforms, have been described. Additional splicing has been observed but the full-length nature of these variants has not been determined.
18831	
18832	
18833	
18834	
18835	
18836	
18837	
18838	
18839	This gene encodes a protein containing a polynucleotide kinase domain (PNK) near the N-terminal region, and a Small MutS Related (Smr) domain near the C-terminal region. The encoded protein can bind to both B-cell leukemia/lymphoma 3 (BCL-3) and neural precursor cell expressed, developmentally downregulated 4, (Nedd4) proteins. This protein binds and hydrolyzes ATP, may function as a 5@#$%&-polynucleotide kinase, and has the capacity to be a ubiquitylation substrate. This protein may play a role in transcription-coupled DNA repair or genetic recombination.
18840	
18841	
18842	
18843	
18844	
18845	
18846	
18847	
18848	
18849	Hexosaminidase A is the alpha subunit of the lysosomal enzyme beta-hexosaminidase that, together with the cofactor GM2 activator protein, catalyzes the degradation of the ganglioside GM2, and other molecules containing terminal N-acetyl hexosamines. Beta-hexosaminidase is composed of two subunits, alpha and beta, which are encoded by separate genes.  Both beta-hexosaminidase alpha and beta subunits are members of family 20 of glycosyl hydrolases.  Mutations in the alpha or beta subunit genes lead to an accumulation of GM2 ganglioside in neurons and neurodegenerative disorders termed the GM2 gangliosidoses.  Alpha subunit gene mutations lead to Tay-Sachs disease (GM2-gangliosidosis type I).
18850	
18851	
18852	This gene encodes a major constituent of the human complement subcomponent C1q. C1q associates with C1r and C1s in order to yield the first component of the serum complement system. Deficiency of C1q has been associated with lupus erythematosus and glomerulonephritis. C1q is composed of 18 polypeptide chains: six A-chains, six B-chains, and six C-chains. Each chain contains a collagen-like region located near the N terminus and a C-terminal globular region. The A-, B-, and C-chains are arranged in the order A-C-B on chromosome 1. This gene encodes the A-chain polypeptide of human complement subcomponent C1q.
18853	
18854	This gene encodes a member of the IL10 family of cytokines. It was identified as a gene induced during terminal differentiation in melanoma cells. The protein encoded by this gene can induce apoptosis selectively in various cancer cells. Overexpression of this gene leads to elevated expression of several GADD family genes, which correlates with the induction of apoptosis. The phosphorylation of mitogen-activated protein kinase 14 (MAPK7/P38), and heat shock 27kDa protein 1 (HSPB2/HSP27) are found to be induced by this gene in melanoma cells, but not in normal immortal melanocytes. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
18855	Apoptosis is a cell death process that removes toxic and/or useless cells during mammalian development. The apoptotic process is accompanied by shrinkage and fragmentation of the cells and nuclei and degradation of the chromosomal DNA into nucleosomal units. DNA fragmentation factor (DFF) is a heterodimeric protein of 40-kD (DFFB) and 45-kD (DFFA) subunits. DFFA is the substrate for caspase-3 and triggers DNA fragmentation during apoptosis. DFF becomes activated when DFFA is cleaved by caspase-3. The cleaved fragments of DFFA dissociate from DFFB, the active component of DFF. DFFB has been found to trigger both DNA fragmentation and chromatin condensation during apoptosis. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
18856	
18857	
18858	
18859	
18860	
18861	
18862	
18863	
18864	
18865	
18866	
18867	
18868	
18869	
18870	This gene is a member of the regulatory factor X gene family, which encodes transcription factors that contain a highly-conserved winged helix DNA binding domain. The protein encoded by this gene is structurally related to regulatory factors X1, X2, X3, and X5. It has been shown to interact with itself as well as with regulatory factors X2 and X3, but it does not interact with regulatory factor X1. This protein may be a transcriptional repressor rather than a transcriptional activator. Three transcript variants encoding different isoforms have been described for this gene.
18871	The protein encoded by this gene is a component of the mammalian counterpart of the yeast Exocyst complex, which is essential for post-Golgi traffic.
18872	
18873	This gene encodes the major vault protein which is a lung resistance-related protein. Vaults are multi-subunit structures that may be involved in nucleo-cytoplasmic transport. This protein mediates drug resistance, perhaps via a transport process. It is widely distributed in normal tissues, and overexpressed in multidrug-resistant cancer cells. The protein overexpression is a potentially useful marker of clinical drug resistance. This gene produces two transcripts by using two alternative exon 2 sequences; however, the open reading frames are the same in both transcripts.
18874	This gene product belongs to a family of nonselective cation channels that function in a variety of processes, including temperature sensation and vasoregulation. The thermosensitive members of this family are expressed in subsets of sensory neurons that terminate in the skin, and are activated at distinct physiological temperatures. This channel is activated at temperatures between 22 and 40 degrees C. This gene lies in close proximity to another family member (TRPV1) gene on chromosome 17, and the two encoded proteins are thought to associate with each other to form heteromeric channels.
18875	This gene encodes the microtubule-associated protein tau (MAPT) whose transcript undergoes complex, regulated alternative splicing, giving rise to several mRNA species. MAPT transcripts are differentially expressed in the nervous system, depending on stage of neuronal maturation and neuron type. MAPT gene mutations result in several neurodegenerative disorders such as Alzheimer@#$%&s disease, Pick@#$%&s disease, frontotemporal dementia, cortico-basal degeneration and progressive supranuclear palsy.
18876	
18877	
18878	Sm-like proteins were identified in a variety of organisms based on sequence homology with the Sm protein family (see SNRPD2; MIM 601061). Sm-like proteins contain the Sm sequence motif, which consists of 2 regions separated by a linker of variable length that folds as a loop. The Sm-like proteins are thought to form a stable heteromer present in tri-snRNP particles, which are important for pre-mRNA splicing.[supplied by OMIM]
18879	This gene encodes a cytoplasmic protein which contains a regulation of G-protein signaling (RGS) domain and a dishevelled and axin (DIX) domain. The encoded protein interacts with adenomatosis polyposis coli, catenin (cadherin-associated protein), beta 1, 88kDa, glycogen synthase kinase 3 beta, protein phosphate 2, and itself. This protein functions as a negative regulator of the wingless-type MMTV integration site family, member 1 (WNT) signaling pathway and can induce apoptosis. The crystal structure of a portion of this protein, alone and in a complex with other proteins, has been resolved. Mutations in this gene have been associated with hepatocellular carcinoma, hepatoblastomas, ovarian endometriod adenocarcinomas, and medullablastomas. Two transcript variants encoding distinct isoforms have been identified for this gene.
18880	This gene is a member of the fringe gene family which also includes Radical and Lunatic fringe. They all encode evolutionarily conserved secreted proteins that act in Notch receptor pathway to demarcate boundaries during embryonic development. The genomic structure of each fringe gene consists of eight exons. Fringe proteins are related to a family of glycosyltransferase, but their genomic structure is distinct from the glycosyltransferases.
18881	Chloride channel Kb (CLCNKB) is a member of the CLC family of voltage-gated chloride channels, which comprises at least 9 mammalian chloride channels. Each is believed to have 12 transmembrane domains and intracellular N and C termini.   Mutations in CLCNKB result in the autosomal recessive Type III Bartter Syndrome.  CLCNKA and CLCNKB are closely related (94% sequence identity), tightly linked (separated by 11 kb of genomic sequence) and  are both expressed in mammalian kidney.
18882	
18883	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S19P family of ribosomal proteins. It is located in the cytoplasm. This gene has been found to be activated in various tumors, such as insulinomas, esophageal cancers, and colon cancers. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
18884	
18885	This gene encodes a protein with sequence similarity to retinoblastoma-binding protein-1. The gene product is recognized by IgG antibody isolated from a breast cancer patient. Transcript variants encoding different isoforms and utilizing alternative polyA sites have been observed.
18886	
18887	The protein encoded by this gene is similar to the yeast Mob1 protein. Yeast Mob1 binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
18888	CSRP1 is a member of the CSRP family of genes encoding a group of LIM domain proteins, which may be involved in regulatory processes important for development and cellular differentiation.  The LIM/double zinc-finger motif found in CRP1 is found in a group of proteins with critical functions in gene regulation, cell growth, and somatic differentiation  Other genes in the family include CSRP2 and CSRP3.
18889	
18890	
18891	
18892	The protein encoded by this gene is a member of the rhomboid family of integral membrane proteins. This family contains proteins that are related to Drosophila rhomboid protein. Members of this family are found in both prokaryotes and eukaryotes and are thought to function as intramembrane serine proteases. The encoded protein is thought to release soluble growth factors by proteolytic cleavage of certain membrane-bound substrates, including ephrin B2 and ephrin B3.
18893	
18894	Ribonucleotide reductase catalyzes the formation of deoxyribonucleotides from ribonucleotides. It is composed of 2 non-identical subunits, proteins M1 and M2.  Synthesis of M2 is regulated in a cell-cycle dependent fashion.
18895	This gene is thought to be involved in apoptosis.
18896	
18897	
18898	This gene was identified based on its similarity with the mouse counterpart. Studies of the mouse counterpart suggest that the expression of this gene may be regulated during oocyte maturation and preimplantation following zygotic gene activation.  Alternatively spliced transcript variants encoding distinct isoforms have been observed.
18899	The product of this gene belongs to the family of G-protein coupled receptors. This family has several receptor subtypes with different pharmacological selectivity, which overlaps in some cases, for various adenosine and uridine nucleotides. This receptor functions as a receptor for extracellular ATP and ADP. In platelets binding to ADP leads to mobilization of intracellular calcium ions via activation of phospholipase C, a change in platelet shape, and probably to platelet aggregation.
18900	
18901	
18902	Dystroglycan is a laminin binding component of the dystrophin-glycoprotein complex which provides a linkage between the subsarcolemmal cytoskeleton and the extracellular matrix. Dystroglycan 1 is a candidate gene for the site of the mutation in autosomal recessive muscular dystrophies. The dramatic reduction of dystroglycan 1 in Duchenne muscular dystrophy leads to a loss of linkage between the sarcolemma and extracellular matrix, rendering muscle fibers more susceptible to necrosis. Dystroglycan also functions as dual receptor for agrin and laminin-2 in the Schwann cell membrane. The muscle and nonmuscle isoforms of dystroglycan differ by carbohydrate moieties but not protein sequence.
18903	
18904	
18905	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are nucleic acid binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It localizes to both the nucleus and the cytoplasm. This protein is implicated in the regulation of mRNA stability. Alternative splicing of this gene results in four transcript variants.
18906	KSP37 is constitutively secreted by Th1-type CD4 (MIM 186940)-positive T lymphocytes and by lymphocytes with cytotoxic potential that usually coexpress perforin (PRF1; MIM 170280).[supplied by OMIM]
18907	
18908	
18909	
18910	
18911	GABA is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA-A receptors, which are ligand-gated chloride channels. Chloride conductance of these channels can be modulated by agents such as benzodiazepines that bind to the GABA-A receptor. At least 16 distinct subunits of GABA-A receptors have been identified.
18912	
18913	
18914	
18915	
18916	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
18917	Beta actin is one of six different actin isoforms which have been identified.  ACTB is one of the two nonmuscle cytoskeletal actins.  Actins are highly conserved proteins that are involved in cell motility, structure and integrity.  Alpha actins are a major constituent of the contractile apparatus.
18918	
18919	
18920	Some high-mobility group (HMG) box proteins (e.g., LEF1; MIM 153245) contain a single HMG box motif and bind DNA in a sequence-specific manner, while other members of this family (e.g., HMG1; MIM 163905) have multiple HMG boxes and bind DNA in a sequence-independent but structure-dependent manner. All HMG box proteins are able to induce a sharp bend in DNA. TOX contains a single HMG box motif.[supplied by OMIM]
18921	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. The protein may play a role in tumorigenesis. A similar protein in mice is involved in the regulation of the wingless-type MMTV integration site family (Wnt) pathway.
18922	
18923	
18924	
18925	
18926	This gene encodes a member of the RIEG/PITX homeobox family, which is in the bicoid class of homeodomain proteins. Members of this family act as transcription factors. Mutations of this gene have been associated with anterior segment mesenchymal dysgenesis (ASMD) and congenital cataracts. This protein is involved in lens formation during eye development.
18927	Glycerol-3-phosphate acyltransferase (GPAT; EC 2.3.1.15), which catalyzes the initial and committing step in glycerolipid biosynthesis, is predicted to play a pivotal role in the regulation of cellular triacylglycerol and phospholipid levels. Two mammalian forms of GPAT have been identified on the basis of localization to either the endoplasmic reticulum or mitochondria.[supplied by OMIM]
18928	This gene encodes a motor protein that belongs to the kinesin-like protein family. Members of this protein family are known to be involved in various kinds of spindle dynamics. The function of this gene product includes chromosome positioning, centrosome separation and establishing a bipolar spindle during cell mitosis.
18929	
18930	The protein encoded by this gene is a member of an epithelium-specific subclass of the Ets transcritpion factor family. In addition to its role in regulating the later stages of terminal differentiation of keratinocytes, it appears to regulate a number of epithelium-specific genes found in tissues containing glandular epithelium such as salivary gland and prostate. It has very low affinity to DNA due to its negative regulatory domain at the amino terminus. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
18931	
18932	
18933	
18934	
18935	This gene encodes a member of the kinesin family of proteins. Members of this family are part of a multisubunit complex that functions as a microtubule motor in intracellular organelle transport. Mutations in this gene cause autosomal dominant spastic paraplegia 10.
18936	
18937	
18938	
18939	
18940	
18941	The protein encoded by this gene is a cytokine receptor for interleukin 21 (IL21). It belongs to the type I cytokine receptors, and has been shown to form a heterodimeric receptor complex with the common gamma-chain, a receptor subunit also shared by the receptors for interleukin 2 (IL2) and interleukin 5 (IL5). This receptor transduces the growth promoting signal of IL21, and is important for the proliferation and differentiation of T cells, B cells, and natural killer (NK) cells. The ligand binding of this receptor leads to the activation of multiple downstream signaling molecules, including JAK1, JAK3, STAT1, and STAT3. Knockout studies of a similar gene in mouse suggest a role for this gene in regulating immunoglobulin production. Three alternatively spliced transcript variants encoding the same protein have been described.
18942	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. Four alternatively spliced transcript variants encoding the same protein have been found for this gene.
18943	
18944	
18945	
18946	This gene is one of several cytokine genes clustered on the q-arm of chromosome 17. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. This protein binds to several chemokine receptors including chemokine binding protein 2 and chemokine (C-C motif) receptor 5 (CCR5). CCR5 is a co-receptor for HIV, and binding of this protein to CCR5 inhibits HIV entry. The copy number of this gene varies among individuals; most individuals have 1-6 copies in the diploid genome, although rare individuals have zero or more than six copies. The human genome reference assembly contains two full copies of the gene and a partial pseudogene. This record represents the more telomeric full-length gene.
18947	KRT17 encodes the type I intermediate filament chain keratin 17, expressed in nail bed, hair follicle, sebaceous glands, and other epidermal appendages. Mutations in this gene lead to Jackson-Lawler type pachyonychia congenita and steatocystoma multiplex.
18948	
18949	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. The enzyme encoded by this gene is a lactosylceramide synthase important for glycolipid biosynthesis.
18950	
18951	
18952	Coagulation factor II (thrombin) receptor-like 3 (F2RL3) is a member of the large family of 7-transmembrane-region receptors that couple to guanosine-nucleotide-binding proteins. F2RL3 is also a member of the protease-activated receptor family. F2RL3 is activated by proteolytic cleavage of its extracellular amino terminus. The new amino terminus functions as a tethered ligand and activates the receptor. F2RL3 is activated by thrombin and trypsin.
18953	
18954	
18955	
18956	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Deletions in this gene may contribute to the etiology of velo-cardio-facial syndrome and DiGeorge syndrome.
18957	The protein encoded by this gene is a GTPase which belongs to the RAS superfamily of small GTP-binding proteins. Members of this superfamily appear to regulate a diverse array of cellular events, including the control of cell growth, cytoskeletal reorganization, and the activation of protein kinases.
18958	
18959	The protein encoded by this gene is one of two human homologs of Saccharomyces cerevisiae Rad23, a protein involved in nucleotide excision repair (NER). This protein was shown to interact with, and elevate the nucleotide excision activity of 3-methyladenine-DNA glycosylase (MPG), which suggested a role in DNA damage recognition in base excision repair. This protein contains an N-terminal ubiquitin-like domain, which was reported to interact with 26S proteasome, as well as with ubiquitin protein ligase E6AP, and thus suggests that this protein may be involved in the ubiquitin mediated proteolytic pathway in cells.
18960	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein belongs to a family of cytoplasmic proteins that activate the Ras-like family of Rho proteins by exchanging bound GDP for GTP. It may form a complex with G proteins and stimulate Rho-dependent signals. This protein is activated by PI3-kinase. Mutations in this gene can cause X-chromosomal non-specific mental retardation.
18961	
18962	Glutamate receptors are the predominant excitatory neurotransmitter receptors in the mammalian brain and are activated in a variety of normal neurophysiologic processes. These receptors are heteromeric protein complexes with multiple subunits, each possessing transmembrane regions, and all arranged to form a ligand-gated ion channel. The classification of glutamate receptors is based on their activation by different pharmacologic agonists. This gene belongs to a family of alpha-amino-3-hydroxy-5-methyl-4-isoxazole propionate (AMPA) receptors. Alternative splicing at this locus results in several different isoforms which may vary in their signal transduction properties.
18963	
18964	
18965	
18966	
18967	
18968	
18969	
18970	The protein encoded by this gene contains a formin homology 2 domain and has high sequence identity to the mouse Wbp3 protein. Two alternative transcripts encoding different isoforms have been described.
18971	
18972	
18973	
18974	
18975	
18976	
18977	
18978	
18979	
18980	
18981	
18982	
18983	
18984	
18985	
18986	
18987	
18988	
18989	
18990	
18991	
18992	
18993	
18994	
18995	
18996	MHC2TA encodes a non-DNA binding transactivator that functions both in constitutive and inducible MHC Class II expression. Mutations in MHC2TA result in the complementation group A of Bare Lymphocyte Syndrome.
18997	
18998	
18999	
19000	
19001	
19002	
19003	
19004	
19005	
19006	
19007	
19008	
19009	
19010	This gene belongs to the VCX/Y gene family, which has multiple members on both X and Y chromosomes, and all are expressed exclusively in male germ cells. The X-linked members are clustered on chromosome Xp22 and Y-linked members are two identical copies of the gene within a palindromic region on Yq11. The family members share a high degree of sequence identity, with the exception that a 30-bp unit is tandemly repeated in X-linked members but occurs only once in Y-linked members. The VCX gene cluster is polymorphic in terms of copy number; different individuals may have a different number of VCX genes. VCX/Y genes encode small and highly charged proteins of unknown function. The presence of a putative bipartite nuclear localization signal suggests that VCX/Y members are nuclear proteins. This gene contains 2 repeats of the 30-bp unit.
19011	
19012	
19013	
19014	
19015	
19016	
19017	
19018	
19019	
19020	
19021	
19022	
19023	The interleukin 2 (IL2) receptor alpha (IL2RA) and beta (IL2RB) chains, together with the common gamma chain (IL2RG), constitute the high-affinity IL2 receptor. Homodimeric alpha chains (IL2RA) result in low-affinity receptor, while homodimeric beta (IL2RB) chains produce a medium-affinity receptor. Normally an integral-membrane protein, soluble IL2RA has been isolated and determined to result from extracellular proteolyisis.  Alternately-spliced IL2RA mRNAs have been isolated, but the significance of each is presently unknown.
19024	The forkhead family of transcription factors belongs to the winged helix class of DNA-binding proteins. The protein encoded by this intronless gene contains a forkhead domain and is found predominantly in aorta and kidney. The function of the encoded protein is unknown.
19025	
19026	
19027	
19028	This gene encodes a member of the STAT-induced STAT inhibitor (SSI), also known as suppressor of cytokine signaling (SOCS), family. SSI family members are cytokine-inducible negative regulators of cytokine signaling. The expression of this gene can be induced by a subset of cytokines, including erythropoietin, GM-CSF, IL10 and interferon (IFN)-gamma. The protein encoded by this gene is found to interact with the cytoplasmic domain of insulin-like growth factor-1 receptor (IGF1R), and thus is thought to be involved in the regulation of IGF1R mediated cell signaling. Knockout studies in mice also suggested a regulatory role of this gene in IGF-1 related growth control.
19029	
19030	
19031	
19032	
19033	
19034	This gene encodes a member of the trypsin family of serine proteases. Studies in mouse suggest that the encoded enzyme may be involved in structural reorganizations associated with learning and memory. The enzyme is also expressed in Leydig cells in the testis, but its function in this tissue is unknown.
19035	
19036	
19037	
19038	
19039	
19040	This gene is a member of the Period family of genes and is expressed in a circadian pattern in the suprachiasmatic nucleus, the primary circadian pacemaker in the mammalian brain. Genes in this family encode components of the circadian rhythms of locomotor activity, metabolism, and behavior. Circadian expression in the suprachiasmatic nucleus continues in constant darkness, and a shift in the light/dark cycle evokes a proportional shift of gene expression in the suprachiasmatic nucleus. The specific function of this gene is not yet known.
19041	
19042	This gene encodes an orphan nuclear receptor which is a member of the nuclear hormone receptor family. Its expression pattern suggests that it may be involved in neurogenesis and germ cell development. The protein can homodimerize and bind DNA, but in vivo targets have not been identified. The gene expresses three alternatively spliced transcript variants.
19043	
19044	
19045	This gene is similar to a mouse gene that encodes a putative RNA helicase and shows testis-specific expression.
19046	
19047	
19048	
19049	Leptin (LEP; MIM 164160), an adipocyte-specific hormone that regulates adipose-tissue mass through hypothalamic effects on satiety and energy expenditure, acts through the leptin receptor (LEPR), a single-transmembrane-domain receptor of the cytokine receptor family.[supplied by OMIM]
19050	Activins are dimeric growth and differentiation factors which belong to the transforming growth factor-beta (TGF-beta) superfamily of structurally related signaling proteins. Activins signal through a heteromeric complex of receptor serine kinases which include at least two type I (I and IB) and two type II (II and IIB) receptors. These receptors are all transmembrane proteins, composed of a ligand-binding extracellular domain with cysteine-rich region, a transmembrane domain, and a cytoplasmic domain with predicted serine/threonine specificity. Type I receptors are essential for signaling; and type II receptors are required for binding ligands and for expression of type I receptors. Type I and II receptors form a stable complex after ligand binding, resulting in phosphorylation of type I receptors by type II receptors. Type II receptors are considered to be constitutively active kinases. This gene encodes activin A type IIB receptor, which displays a 3- to 4-fold higher affinity for the ligand than activin A type II receptor.
19051	
19052	Clathrin is a major protein component of the cytoplasmic face of intracellular organelles, called coated vesicles and coated pits. These specialized organelles are involved in the intracellular trafficking of receptors and endocytosis of a variety of macromolecules.  The basic subunit of the clathrin coat is composed of three heavy chains and three light chains.
19053	
19054	This gene encodes a selenoprotein containing selenocysteine at the active site. The selenocysteine is encoded by the usual stop codon UGA. Mutations in this gene cause the classical phenotype of multiminicore disease and congenital muscular dystrophy with spinal rigidity and restrictive respiratory syndrome. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
19055	Peptidoglycan recognition proteins, such as PGRPI-beta, are part of the innate immune system and recognize peptidoglycan, a ubiquitous component of bacterial cell walls.[supplied by OMIM]
19056	
19057	The protein encoded by this gene is an aminopeptidase which prefers acidic amino acids, and specifically favors aspartic acid over glutamic acid. It is thought to be a cytosolic protein involved in general metabolism of intracellular proteins.
19058	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 non-identical kinase catalytic domains and phosphorylates various substrates, including members of the mitogen-activated kinase (MAPK) signalling pathway. The activity of this protein has been implicated in controlling cell growth and differentiation. Mutations in this gene have been associated with Coffin-Lowry syndrome (CLS).
19059	
19060	Homocysteine is a sulfur-containing amino acid that plays a crucial role in methylation reactions. Transfer of the methyl group from betaine to homocysteine creates methionine, which donates the methyl group to methylate DNA, proteins, lipids, and other intracellular metabolites. The protein encoded by this gene is one of two methyl transferases that can catalyze the transfer of the methyl group from betaine to homocysteine. Anomalies in homocysteine metabolism have been implicated in disorders ranging from vascular disease to neural tube birth defects such as spina bifida.
19061	
19062	
19063	This gene encodes a macrophage scavenger receptor-like protein. This protein has been shown to deplete reactive oxygen species, and thus play an important role in protecting cells from oxidative stress. The expression of this gene is induced by oxidative stress. Alternatively spliced transcript variants encoding distinct isoforms have been described.
19064	
19065	
19066	
19067	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. This gene and the gene for the HRD1 protein use in their respective 3@#$%& UTRs some of the same genomic sequence. Pseudogenes corresponding to this gene are found on chromosomes 5q and 8p.
19068	
19069	
19070	The protein encoded by this gene is a member of the SWI/SNF family of proteins, whose members display helicase and ATPase activities and which are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI and has sequence similarity to the yeast Swp73 protein. Two transcript variants encoding different isoforms have been found for this gene.
19071	
19072	
19073	This gene encodes a member of the bicoid subfamily of the paired (PRD) homeobox family of proteins. The encoded protein acts as a transcription factor and may be autoregulatory. A similar protein in mice plays a role in craniofacial and rib cage development during embryogenesis.
19074	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L6P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
19075	
19076	
19077	
19078	
19079	
19080	Leukaemia inhibitory factor is a cytokine that induces macrophage differentiation. Neurotransmitters and neuropeptides, well known for their role in the communication between neurons, are also capable of activating monocytes and macrophages and inducing chemotaxis in immune cells. LIF signals through different receptors and transcription factors. LIF in conjunction with BMP2 acts in synergy on primary fetal neural progenitor cells to induce astrocytes.
19081	
19082	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit. This gene is tightly linked to the TBP (TATA-binding protein) gene in human and in mouse, and is transcribed in the opposite orientation in both species.
19083	The protein encoded by this gene belongs to the cdc2/cdkx subfamily of the ser/thr family of protein kinases. It may play a role in signal transduction cascades in terminally differentiated cells. This gene is thought to escape X inactivation. Two transcript variants encoding the same protein have been found for this gene.
19084	
19085	
19086	Complex II of the respiratory chain, which is specifically involved in the oxidation of succinate, carries electrons from FADH to CoQ.  The complex is composed of four nuclear-encoded subunits and is localized in the mitochondrial inner membrane.  The 15.5 kDa subunit C protein is one of two integral membrane proteins anchoring the complex to the matrix side of the membrane.  Multiple SDH3 pseudogenes in various genomic locations have been identified.
19087	This gene encodes a protein that is one of more than 20 subunits of the mediator complex, first identified in S. cerevisiae, that is required for activation of transcription. The product of this gene also interacts with elongins B and C, and CUL2 and RBX1, to reconstitute a ubiquitin ligase. Five alternative transcripts encoding four isoforms have been described.
19088	
19089	Cholinephosphotransferase catalyses the final step in the synthesis of phosphatidylcholine by the transfer of phosphocholine from CDP-choline to diacylglycerol. The synthesis of phosphatidylethanolamine by ethanolaminephosphotransferase occurs using an analogous reaction. This gene codes for a choline/ethanolaminephosphotransferase. The protein can synthesize either choline- or ethanolamine- containing phospholipids. Two alternatively spliced transcripts encoding the same isoform have been identified.
19090	
19091	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. The encoded protein also contains a Ras association (RalGDS/AF-6) domain.
19092	
19093	
19094	This gene encodes a member of the ALEX family of proteins which may play a role in tumor suppression. The encoded protein contains a potential N-terminal transmembrane domain and a single Armadillo (arm) repeat. Other proteins containing the arm repeat are involved in development, maintenance of tissue integrity, and tumorigenesis. This gene is closely localized with other family members on the X chromosome. Three transcript variants encoding the same protein have been identified for this gene.
19095	
19096	Ribonucleotide reductase catalyzes the formation of deoxyribonucleotides from ribonucleotides. It is composed of 2 non-identical subunits, proteins M1 and M2.  Synthesis of M2 is regulated in a cell-cycle dependent fashion.
19097	
19098	
19099	
19100	
19101	
19102	
19103	
19104	
19105	This gene is a member of the MAD gene family . The MAD genes encode basic helix-loop-helix-leucine zipper proteins that heterodimerize with MAX protein, forming a transcriptional repression complex. The MAD proteins compete for MAX binding with MYC, which heterodimerizes with MAX forming a transcriptional activation complex. Studies in rodents suggest that the MAD genes are tumor suppressors and contribute to the regulation of cell growth in differentiating tissues.
19106	
19107	
19108	
19109	
19110	The immunosuppressant drug cyclosporin A blocks a calcium-dependent signal from the T-cell receptor (TCR) that normally leads to T-cell activation. When bound to cyclophilin B, cyclosporin A binds and inactivates the key signaling intermediate calcineurin. The protein encoded by this gene functions similarly to cyclosporin A, binding to cyclophilin B and acting downstream of the TCR and upstream of calcineurin by causing an influx of calcium. This integral membrane protein appears to be a new participant in the calcium signal transduction pathway, implicating cyclophilin B in calcium signaling, even in the absence of cyclosporin.
19111	
19112	
19113	
19114	
19115	
19116	This gene encodes a cytoplasmic protein which specifically colocalizes and interacts with the serine/threonine protein kinase, receptor-interacting protein (RIP). Zinc finger domains of the encoded protein are required for its interaction with RIP and for inhibition of TNF- and IL1-induced NF-kappa B activation pathways. The encoded protein may also function as an E3 ubiquitin-protein ligase which accepts ubiquitin from E2 ubiquitin-conjugating enzymes and transfers it to substrates. Several alternatively spliced transcript variants have been described for this locus but the full-length natures of only some are known.
19117	
19118	
19119	The product of this gene belongs to a small family of adapter proteins that are known to interact with a number of receptor tyrosine kinases and signaling molecules. This gene encodes a growth factor receptor-binding protein that interacts with insulin receptors and insulin-like growth-factor receptors. Overexpression of some isoforms of the encoded protein inhibits tyrosine kinase activity and results in growth suppression. This gene is imprinted in a highly isoform- and tissue-specific manner. Alternatively spliced transcript variants encoding different isoforms have been identified.
19120	
19121	
19122	
19123	The structure of the protein encoded by this gene is highly related to both the protein ankyrin and transmembrane proteins.  The specific function of this protein has not yet been determined; however, studies indicate the function may involve a role in signal transduction and growth control.
19124	
19125	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intracellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the ABC1 subfamily.  Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes.  With cholesterol as its substrate, this protein functions as a cholesteral efflux pump in the cellular lipid removal pathway.  Mutations in this gene have been associated with Tangier@#$%&s disease and familial high-density lipoprotein deficiency.
19126	
19127	
19128	
19129	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
19130	
19131	The protein encoded by this gene contains a RING finger, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions. Multiple alternatively spliced transcript variants encoding distinct isoforms have been reported. Three of them are supported by at least two independent transcripts or ESTs, the full length natures of others are not clear.
19132	
19133	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene is located on chromosome 12 and encodes a variant H2A histone. The protein is divergent at the C-terminus compared to the consensus H2A histone family member. Two transcript variants encoding different isoforms have been found for this gene.
19134	
19135	This gene encodes one of the three enolase isoenzymes found in mammals. This isoenzyme, a homodimer, is found in mature neurons and cells of neuronal origin. A switch from alpha enolase to gamma enolase occurs in neural tissue during development in rats and primates.
19136	The CLCNKA gene, a member of the chloride channel (ClC) family, encodes a kidney-specific chloride channel that mediates transepithelial chloride transport in the thin ascending limb of the Henle loop (tAL) in the inner medulla (Matsumura et al., 1999 [PubMed 9916798]). Transport of NaCl in the tAL is thought to be a component of urinary concentration in a passive model of the countercurrent multiplication system. Each member of the ClC family is thought to have 12 transmembrane domains and intracellular N and C termini (Jentsch and Gunther, 1997).[supplied by OMIM]
19137	
19138	Heparan sulfate biosynthetic enzymes are key components in generating a myriad of distinct heparan sulfate fine structures that carry out multiple biologic activities. The enzyme encoded by this gene is a member of the heparan sulfate biosynthetic enzyme family. It is a type II integral membrane protein and possesses heparan sulfate glucosaminyl 3-O-sulfotransferase activity. The sulfotransferase domain of this enzyme is highly similar to the same domain of heparan sulfate D-glucosaminyl 3-O-sulfotransferase 3A1, and these two enzymes sulfate an identical disaccharide. This gene is widely expressed, with the most abundant expression in liver and placenta.
19139	
19140	Synaptic vesicles dock and fuse in the active zone of the plasma membrane at chemical synapses. The presynaptic cytoskeletal matrix (PCM), which is associated with the active zone and is situated between synaptic vesicles, is thought to be involved in maintaining the neurotransmitter release site in register with the postsynaptic reception apparatus. The cycling of synaptic vesicles is a multistep process involving a number of proteins (see MIM 603215). Among the components of the PCM that orchestrate these events are Bassoon (BSN; MIM 604020), RIM (RBBP8; MIM 604124), Oboe, and Piccolo (PCLO).[supplied by OMIM]
19141	This gene encodes a protein containing an RNA-binding motif in the N-terminus and four SRGY (serine, arginine, glycine, tyrosine) boxes in the C-terminus. Multiple copies of this gene are found in the AZFb azoospermia factor region of chromosome Y and the encoded protein is thought to be involved in spermatogenesis. Most copies of this locus are pseudogenes, although six highly similar copies have full-length ORFs and are considered functional. Four functional copies of this gene are found within inverted repeat IR2; two functional copies of this gene are found in palindrome P3, along with two copies of PTPN13-like, Y-linked. Alternative splicing of transcripts results in two transcript variants that encode different proteins.
19142	
19143	
19144	
19145	
19146	There are at least two protein isoforms of the Double C2 protein,  namely alpha (DOC2A) and beta (DOC2B), which contain two C2-like domains.  DOC2A and DOC2B are encoded by different genes; these genes are at times confused with the unrelated DAB2 gene which was initially named DOC-2. DOC2B is expressed ubiquitously and is suggested to be involved in Ca(2+)-dependent intracellular vesicle trafficking in various types of cells.
19147	
19148	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. The encoded enzyme is regulated by steroid hormones. In tissue culture, the enzyme has been found to generate amyloidogenic fragments from the amyloid precursor protein, suggesting a potential for involvement in Alzheimer@#$%&s disease. Multiple alternatively spliced transcript variants that encode different isoforms have been identified for this gene.
19149	The protein encoded by this gene is an evolutionarily conserved protein similar to yeast SSF1 as well as to the gene product of the Drosophila gene peter pan (ppan). SSF1 is known to be involved in the second step of mRNA splicing. Both SSF1 and ppan are essential for cell growth and proliferation. This gene was found to cotranscript with P2RY11/P2Y(11), an immediate downstream gene on the chromosome that encodes a ATP receptor. The chimeric transcripts of this gene and P2RY11 were found to be ubiquitously present and regulated during granulocytic differentiation. Exogenous expression of this gene was reported to reduce the anchorage-independent growth of some tumor cells.
19150	
19151	The protein encoded by this gene is a member of the tumor necrosis factor (TNF) ligand family. This protein is a ligand for TNFRSF14, which is a member of the tumor necrosis factor receptor superfamily, and which is also known as a herpesvirus entry mediator (HVEM). This protein may function as a costimulatory factor for the activation of lymphoid cells and as a deterrent to infection by herpesvirus. This protein has been shown to stimulate the proliferation of T cells, and trigger apoptosis of various tumor cells. This protein is also reported to prevent tumor necrosis factor alpha mediated apoptosis in primary hepatocyte. Two alternatively spliced transcript variant encoding distinct isoforms have been reported.
19152	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily G. This gene is abundantly expressed in skeletal muscle. Alternative splicing results in at least two transcript variants encoding distinct isoforms.
19153	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and the kininogens. The type 2 cystatin proteins are a class of cysteine proteinase inhibitors found in a variety of human fluids and secretions. The cystatin locus on chromosome 20 contains the majority of the type 2 cystatin genes and pseudogenes. This gene is located in the cystatin locus and encodes a type 2 salivary cysteine peptidase inhibitor. The protein is an S-type cystatin, based on its high level of expression in saliva, tears and seminal plasma. The specific role in these fluids is unclear but antibacterial and antiviral activity is present, consistent with a protective function.
19154	
19155	The protein encoded by this gene is a member of the MAP kinase family. MAP kinases, also known as extracellular signal-regulated kinases (ERKs), act as an integration point for multiple biochemical signals, and are involved in a wide variety of cellular processes such as proliferation, differentiation, transcription regulation and development. The activation of this kinase requires its phosphorylation by upstream kinases. Upon activation, this kinase translocates to the nucleus of the stimulated cells, where it phosphorylates nuclear targets. Two alternatively spliced transcript variants encoding the same protein, but differing in the UTRs, have been reported for this gene.
19156	
19157	NIPSNAP1 protein has a strong sequence similarity to the central portion of a hypothetical protein encoded by C. elegans chromosome III between a 4-nitrophenylphosphatase (NIP) domain and non-neuronal SNAP25-like protein.  The NIPSNAP1 gene is located between the NF2 and pK1.3 genes on 22q12. Its function is unknown.
19158	The protein encoded by this gene is found exclusively in the mitochondrion, where it catalyzes the transfer of a malonyl group from malonyl-CoA to the mitochondrial acyl carrier protein. The encoded protein may be part of a fatty acid synthase complex that is more like the type II prokaryotic and plastid complexes rather than the type I human cytosolic complex. Two transcript variants encoding different isoforms have been found for this gene.
19159	
19160	This gene encodes an essential eukaryotic translation initiation factor. The protein is required for the binding of the 43S complex (a 40S subunit, eIF2/GTP/Met-tRNAi and eIF3) to the 5@#$%& end of capped RNA.
19161	
19162	SLY contains an Src homology-3 (SH3) domain and a sterile alpha motif (SAM), both of which are found in proteins involved in cell signaling (Beer et al., 2001).[supplied by OMIM]
19163	DNA methylation is the major modification of eukaryotic genomes and plays an essential role in mammalian development. Human proteins MECP2, MBD1, MBD2, MBD3, and MBD4 comprise a family of nuclear proteins related by the presence in each of a methyl-CpG binding domain (MBD). Each of these proteins, with the exception of MBD3, is capable of binding specifically to methylated DNA. MECP2, MBD1 and MBD2 can also repress transcription from methylated gene promoters. In contrast to other MBD family members, MECP2 is X-linked and subject to X inactivation.  MECP2 is dispensible in stem cells, but is essential for embryonic development. MECP2 gene mutations are the cause of some cases of Rett syndrome, a progressive neurologic developmental disorder and one of the most common causes of mental retardation in females.
19164	This gene encodes a protein similar to eukaryotic translation initiation factor 1A (EIF1A). EIF1A is required for the binding of the 43S complex (a 40S subunit, eIF2/GTP/Met-tRNAi and eIF3) to the 5@#$%& end of capped RNA.
19165	This gene encodes a zinc-dependent aminopeptidase that cleaves vasopressin, oxytocin, lys-bradykinin, met-enkephalin, dynorphin A and other peptide hormones. The protein can be secreted in maternal serum, reside in intracellular vesicles with the insulin-responsive glucose transporter GLUT4, or form a type II integral membrane glycoprotein. The protein catalyzes the final step in the conversion of angiotensinogen to angiotensin IV (AT4) and is also a receptor for AT4. Alternative splicing results in multiple transcript variants encoding different isoforms.
19166	
19167	This gene encodes a member of the WAP-type four-disulfide core (WFDC) domain family. The WFDC domain, or WAP signature motif, contains eight cysteines forming four disulfide bonds at the core of the protein, and functions as a protease inhibitor. The encoded protein contains a Kunitz-inhibitor domain, in addition to three WFDC domains. Most WFDC genes are localized to chromosome 20q12-q13 in two clusters: centromeric and telomeric. This gene belongs to the telomeric cluster. Two alternatively spliced transcript variants have been found for this gene, and they encode the same protein.
19168	
19169	
19170	
19171	
19172	
19173	This gene encodes a protein that mediates ligand-independent transcription repression of thyroid-hormone and retinoic-acid receptors by promoting chromatin condensation and preventing access of the transcription machinery. It is part of a complex which also includes histone deacetylases and transcriptional regulators similar to the yeast protein Sin3p. This gene is located between the Charcot-Marie-Tooth and Smith-Magenis syndrome critical regions on chromosome 17. An alternatively spliced transcript variant has been described, but its full length sequence has not been determined.
19174	
19175	
19176	The protein encoded by this gene belongs to the SEC14 cytosolic factor family. It has similarity to yeast SEC14 and to Japanese flying squid RALBP which suggests a possible role of the gene product in an intracellular transport system.
19177	
19178	
19179	
19180	
19181	
19182	
19183	
19184	
19185	
19186	
19187	
19188	
19189	
19190	
19191	The galectins are a family of beta-galactoside-binding proteins implicated in modulating cell-cell and cell-matrix interactions. Differential and in situ hybridizations indicate that this lectin is specifically expressed in keratinocytes. It is expressed at all stages of epidermal differentiation (i.e., in basal and suprabasal layers). It is moderately repressed by retinoic acid. The protein was found mainly in stratified squamous epithelium. The antigen localized to basal keratinocytes, although it was also found, albeit at lower levels, in the suprabasal layers where it concentrated to areas of cell-to-cell contact. The cellular localization and its striking down-regulation in cultured keratinocytes imply a role in cell-cell and/or cell-matrix interactions necessary for normal growth control.
19192	
19193	
19194	
19195	
19196	
19197	
19198	
19199	
19200	This gene encodes a member of the NCK family of adaptor proteins. The protein contains three SH3 domains and one SH2 domain. The protein has no known catalytic function but has been shown to bind and recruit various proteins involved in the regulation of receptor protein tyrosine kinases. It is through these regulatory activities that this protein is believed to be involved in cytoskeletal reorganization. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
19201	
19202	
19203	
19204	
19205	
19206	
19207	
19208	
19209	
19210	
19211	
19212	
19213	
19214	
19215	
19216	
19217	
19218	
19219	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which shows little similarity to any of the other known human DEAD box proteins, but shows a high similarity to mouse Ddx24 at the amino acid level.
19220	
19221	
19222	
19223	
19224	The preB cell receptor is found on the surface of proB and preB cells, where it is involved in transduction of signals for cellular proliferation, differentiation from the proB cell to the preB cell stage, allelic exclusion at the Ig heavy chain gene locus, and promotion of Ig light chain gene rearrangements. The preB cell receptor is composed of a membrane-bound Ig mu heavy chain in association with a heterodimeric surrogate light chain. This gene encodes one of the surrogate light chain subunits and is a member of the immunoglobulin gene superfamily. This gene does not undergo rearrangement. Mutations in this gene can result in B cell deficiency and agammaglobulinemia, an autosomal recessive disease in which few or no gamma globulins or antibodies are made. Two transcript variants encoding different isoforms have been found for this gene.
19225	The major constituents of mucus, the viscous secretion that covers epithelial surfaces such as those in the trachea, colon, and cervix, are highly glycosylated proteins called mucins. These glycoproteins play important roles in the protection of the epithelial cells and have been implicated in epithelial renewal and differentiation. This gene encodes an integral membrane glycoprotein found on the cell surface, although secreted isoforms exist. At least two dozen transcript variants of this gene have been found, although for many of them the full-length transcript has not been determined or they are found only in tumor tissues. This gene contains a region in the coding sequence which has a variable number (>100) of 48 nt tandem repeats, although only two repeats are included in these sequences.
19226	The major constituents of mucus, the viscous secretion that covers epithelial surfaces such as those in the trachea, colon, and cervix, are highly glycosylated proteins called mucins. These glycoproteins play important roles in the protection of the epithelial cells and have been implicated in epithelial renewal and differentiation. This gene encodes an integral membrane glycoprotein found on the cell surface, although secreted isoforms exist. At least two dozen transcript variants of this gene have been found, although for many of them the full-length transcript has not been determined or they are found only in tumor tissues. This gene contains a region in the coding sequence which has a variable number (>100) of 48 nt tandem repeats, although only two repeats are included in these sequences.
19227	The major constituents of mucus, the viscous secretion that covers epithelial surfaces such as those in the trachea, colon, and cervix, are highly glycosylated proteins called mucins. These glycoproteins play important roles in the protection of the epithelial cells and have been implicated in epithelial renewal and differentiation. This gene encodes an integral membrane glycoprotein found on the cell surface, although secreted isoforms exist. At least two dozen transcript variants of this gene have been found, although for many of them the full-length transcript has not been determined or they are found only in tumor tissues. This gene contains a region in the coding sequence which has a variable number (>100) of 48 nt tandem repeats, although only two repeats are included in these sequences.
19228	
19229	This gene encodes a protein associated with the cytoplasmic surface of synaptic vesicles. A subset of patients with stiff-man syndrome who were also affected by breast cancer are positive for autoantibodies against this protein. Alternate splicing of this gene results in two transcript variants encoding different isoforms. Additional splice variants have been described, but their full length sequences have not been determined.
19230	
19231	
19232	This gene encodes a protein containing multiple SH3 domains. It has been shown to bind many components of the JNK apoptosis signaling pathway including MLKs, MKKs and JNKs.
19233	
19234	This gene encodes a dual-specificity phosphatase that associates with polyribosomes. The encoded protein may be involved in the regulation of glycogen metabolism. Mutations in this gene have been associated with myoclonic epilepsy of Lafora. Alternative splicing results in multiple transcript variants.
19235	
19236	The ZFYVE16 gene encodes endofin, an endosomal protein implicated in regulating membrane trafficking. It is characterized by the presence of a phosphatidylinositol 3-phosphate-binding FYVE domain positioned in the middle of the molecule (Seet et al., 2004).[supplied by OMIM]
19237	
19238	
19239	This gene is a member of a group of genes whose transcript levels are increased following stressful growth arrest conditions and treatment with DNA-damaging agents. The genes in this group respond to environmental stresses by mediating activation of the p38/JNK pathway.  This activation is mediated via their proteins binding and activating MTK1/MEKK4 kinase, which is an upstream activator of both p38 and JNK MAPKs. The function of these genes or their protein products is involved in the regulation of growth and apoptosis. These genes are regulated by different mechanisms, but they are often coordinately expressed and can function cooperatively in inhibiting cell growth.
19240	Branched-chain keto acid dehydrogenase is a multienzyme complex associated with the inner membrane of mitochondria, and functions in the catabolism of branched-chain amino acids. The complex consists of multiple copies of 3 components: branched-chain alpha-keto acid decarboxylase (E1), lipoamide acyltransferase (E2) and lipoamide dehydrogenase (E3). This gene encodes the E1 beta subunit, and mutations therein have been associated with maple syrup urine disease (MSUD), type 1B. Alternative splicing at this locus results in transcript variants with different 3@#$%& noncoding regions, but encoding the same isoform.
19241	HLA-DQB1 belongs to the HLA class II beta chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DQA) and a beta chain (DQB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa and it contains 6 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DQ molecule both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to 4 different molecules. Typing for these polymorphisms is routinely done for bone marrow transplantation.
19242	
19243	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. The function of the enzyme encoded by this gene is not clear. This gene was previously designated as B4GALT4 but was renamed to B4GALT5. In the literature it is also referred to as beta4GalT2.
19244	This gene belongs to the VCX/Y gene family, which has multiple members on both X and Y chromosomes, and all are expressed exclusively in male germ cells. The X-linked members are clustered on chromosome Xp22 and Y-linked members are two identical copies of the gene within a palindromic region on Yq11. The family members share a high degree of sequence identity, with the exception that a 30-bp unit is tandemly repeated in X-linked members but occurs only once in Y-linked members. The VCX gene cluster is polymorphic in terms of copy number; different individuals may have a different number of VCX genes. VCX/Y genes encode small and highly charged proteins of unknown function. The presence of a putative bipartite nuclear localization signal suggests that VCX/Y members are nuclear proteins. This gene contains 8 repeats of the 30-bp unit.
19245	
19246	
19247	
19248	This gene encodes a protein that acts in the sorting of transmembrane proteins into lysosomes/vacuoles via the multivesicular body (MVB) pathway. This protein, along with other soluble coiled-coil containing proteins, forms part of the ESCRT-III protein complex that binds to the endosomal membrane and recruits additional cofactors for protein sorting into the MVB. This protein may also co-immunoprecipitate with a member of the IFG-binding protein superfamily. Alternative splicing results in multiple transcript variants encoding different isoforms.
19249	
19250	
19251	
19252	Centromere protein C 1 is a centromere autoantigen and a component of the inner kinetochore plate. The protein is required for maintaining proper kinetochore size and a timely transition to anaphase. A putative pseudogene exists on chromosome 12.
19253	
19254	
19255	
19256	
19257	
19258	
19259	This gene encodes a member of the natural killer cell receptor C-type lectin family. The encoded protein inhibits osteoclast formation and contains a transmembrane domain near the N-terminus as well as the C-type lectin-like extracellular domain. Several alternatively spliced transcript variants have been identified, but the full-length nature of every transcript has not been defined.
19260	
19261	This gene encodes a WD repeat-containing protein. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
19262	This gene encodes a protein that was identified in a screen for genes expressed in metastatic cells, specifically, mammary adenocarcinoma cell lines. Expression of this gene has been correlated with the metastatic potential of at least two types of carcinomas although it is also expressed in many normal tissues. The role it plays in metastasis is unclear. It was initially thought to be the 70kD component of a nucleosome remodeling deacetylase complex, NuRD, but it is more likely that this component is a different but very similar protein. These two proteins are so closely related, though, that they share the same types of domains. These domains include two DNA binding domains, a dimerization domain, and a domain commonly found in proteins that methylate DNA. The profile and activity of this gene product suggest that it is involved in regulating transcription and that this may be accomplished by chromatin remodeling.
19263	This gene encodes a protein that was identified in a screen for genes expressed in metastatic cells, specifically, mammary adenocarcinoma cell lines. Expression of this gene has been correlated with the metastatic potential of at least two types of carcinomas although it is also expressed in many normal tissues. The role it plays in metastasis is unclear. It was initially thought to be the 70kD component of a nucleosome remodeling deacetylase complex, NuRD, but it is more likely that this component is a different but very similar protein. These two proteins are so closely related, though, that they share the same types of domains. These domains include two DNA binding domains, a dimerization domain, and a domain commonly found in proteins that methylate DNA. The profile and activity of this gene product suggest that it is involved in regulating transcription and that this may be accomplished by chromatin remodeling.
19264	
19265	
19266	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, and White). This encoded protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. This gene is clustered among 4 other ABC1 family members on 17q24 and may play a role in monocyte differentiation and macrophage lipid homeostasis. Alternative splicing of this gene results in multiple transcript variants; however, not all variants have been fully described.
19267	
19268	
19269	
19270	The RAN GTPase plays an essential role in the transport of macromolecules between the cytoplasm and the nucleus. The nucleotide-bound state of RAN is regulated by RANGAP1, a RAN-specific GTPase activating protein, and by RCC1, a specific guanine nucleotide exchange factor. RANBP3 (RAN binding protein 3) contains FXFG motifs characteristic of a subgroup of nucleoporins, and a C-terminal domain showing similarity to RANBP1. RANBP3 exhibits several alternatively spliced transcript variants containing differences in the N-terminus, two internal exons, and the 3@#$%&UTR.
19271	The protein encoded by this gene is one of at least forty-one subunits that make up the NADH-ubiquinone oxidoreductase complex. This complex is part of the mitochondrial respiratory chain and serves to catalyze the rotenone-sensitive oxidation of NADH and the reduction of ubiquinone. The encoded protein is one of three proteins found in the flavoprotein fraction of the complex. The specific function of the encoded protein is unknown. Two transcript variants encoding different isoforms have been found for this gene.
19272	
19273	
19274	
19275	
19276	
19277	
19278	This gene is a member of the septin family of GTPases. Members of this family are required for cytokinesis. One version of pediatric acute myeloid leukemia is the result of a reciprocal translocation between chromosomes 11 and X, with the breakpoint associated with the genes encoding the mixed-lineage leukemia and septin 2 proteins. This gene encodes four transcript variants encoding three distinct isoforms. An additional transcript variant has been identified, but its biological validity has not been determined.
19279	
19280	
19281	
19282	Heparan sulfate biosynthetic enzymes are key components in generating a myriad of distinct heparan sulfate fine structures that carry out multiple biologic activities. This gene encodes heparan sulfate 2-O-sulfotransferase, a member of the heparan sulfate biosynthetic enzyme family. This family member transfers sulfate to the 2 position of the iduronic acid residue of heparan sulfate. The disruption of this gene resulted in no kidney formation in knockout embryonic mice, indicating that the absence of this enzyme may interfere with the signaling required for kidney formation.
19283	Heparan sulfate biosynthetic enzymes are key components in generating a myriad of distinct heparan sulfate fine structures that carry out multiple biologic activities. This gene encodes heparan sulfate 2-O-sulfotransferase, a member of the heparan sulfate biosynthetic enzyme family. This family member transfers sulfate to the 2 position of the iduronic acid residue of heparan sulfate. The disruption of this gene resulted in no kidney formation in knockout embryonic mice, indicating that the absence of this enzyme may interfere with the signaling required for kidney formation.
19284	
19285	Dentatorubral pallidoluysian atrophy is a rare neurodegenerative disorder characterized by cerebellar ataxia, myoclonic epilepsy, choreoathetosis, and dementia. The disorder is related to the expansion of a trinucleotide repeat within this gene. The encoded protein includes a serine repeat and a region of alternating acidic and basic amino acids, as well as the variable glutamine repeat. Alternative splicing results in two transcripts variants that encode the same protein.
19286	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. The enzyme encoded by this gene synthesizes N-acetyllactosamine in glycolipids and glycoproteins. Its substrate specificity is affected by alpha-lactalbumin but it is not expressed in lactating mammary tissue. Two transcript variants encoding the same protein have been found for this gene.
19287	
19288	
19289	The protein encoded by this gene is a LIM domain protein. LIM domains are cysteine-rich double zinc fingers composed of 50 to 60 amino acids that are involved in protein-protein interactions. LIM domain-containing proteins are scaffolds for the formation of multiprotein complexes. The proteins are involved in cytoskeleton organization, cell lineage specification, organ development, and oncogenesis. The encoded protein is also a member of the Enigma class of proteins, a family of proteins that possess a 100-amino acid PDZ domain in the N terminus and 1 to 3 LIM domains in the C terminus. Multiple transcript variants encoding different isoforms have been found for this gene, although not all of them have been fully characterized.
19290	The protein encoded by this gene, a member of the peptidase C1 family, is a lysosomal cysteine proteinase that may participate in the degradation of antigenic proteins to peptides for presentation on MHC class II molecules. The encoded protein can function as an elastase over a broad pH range in alveolar macrophages. Transcript variants utilizing alternative polyadenylation signals exist for this gene.
19291	The HADHA and HADHB (MIM 143450) genes encode the alpha and beta subunits of the mitochondrial trifunctional protein, respectively. The heterocomplex contains 4 alpha and 4 beta subunits and catalyzes 3 steps in mitochondrial beta-oxidation of fatty acids, including the long-chain 3-hydroxyl-CoA dehydrogenase (LCHAD) step. The alpha subunit harbors the 3-hydroxyacyl-CoA dehydrogenase and enoyl-CoA hydratase activities (Kamijo et al., 1994).[supplied by OMIM]
19292	
19293	
19294	This gene encodes a DNA-binding protein which specifically recognizes conserved target sequences at the breakpoint junction of chromosomal translocations. Translin polypeptides form a multimeric structure that is responsible for its DNA-binding activity. Recombination-associated motifs and translin-binding sites are present at recombination hotspots and may serve as indicators of breakpoints in genes which are fused by translocations. These binding activities may play a crucial role in chromosomal translocation in lymphoid neoplasms.
19295	
19296	DAP gene encodes a basic, proline-rich, 15-kD protein.  Death-associated protein acts as a positive mediator of programmed cell death that is induced by interferon-gamma.
19297	
19298	ITGA2 encodes integrin alpha chain 2. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. The I-domain containing alpha integrin 2 combines with beta integrin 1 to form a collagen-binding integrin referred to as glycoprotein Ia/IIa when expressed on on platelets, and very late (activation) antigen 2 (@#$%&VLA-2@#$%&) when found on T-cells. In addition to adhesion, integrins are known to participate in cell-surface mediated signalling.
19299	The protein encoded by this gene belongs to the methionine adenosyltransferase (MAT) family. MAT catalyzes the biosynthesis of S-adenosylmethionine from methionine and ATP. This protein is the regulatory beta subunit of MAT. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
19300	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections. The transcript for this particular family member uses more than one polyadenylation site.
19301	This gene encodes a nucleolar protein with a LIS1 homology domain. The protein is involved in ribosomal DNA gene transcription through its interaction with upstream binding factor (UBF). Mutations in this gene have been associated with Treacher Collins syndrome, a disorder which includes abnormal craniofacial development. Alternate transcriptional splice variants encoding different isoforms have been found for this gene, but only three of them have been characterized to date.
19302	
19303	HLA-DPA1 belongs to the HLA class II alpha chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DPA) and a beta (DPB) chain, both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The alpha chain is approximately 33-35 kDa and its gene contains 5 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and the cytoplasmic tail. Within the DP molecule both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to 4 different molecules.
19304	The protein encoded by this gene is involved in the sodium-independent transport and excretion of organic anions, some of which are potentially toxic. The encoded protein is an integral membrane protein and appears to be localized to the basolateral membrane of the kidney. Two transcript variants encoding different isoforms have been found for this gene.
19305	
19306	
19307	
19308	
19309	
19310	
19311	
19312	This gene encodes the protein elongin C, which is a subunit of the transcription factor B (SIII) complex. The SIII complex is composed of elongins A/A2, B and C. It activates elongation by RNA polymerase II by suppressing transient pausing of the polymerase at many sites within transcription units. Elongin A functions as the transcriptionally active component of the SIII complex, whereas elongins B and C are regulatory subunits. Elongin A2 is specifically expressed in the testis, and capable of forming a stable complex with elongins B and C. The von Hippel-Lindau tumor suppressor protein binds to elongins B and C, and thereby inhibits transcription elongation.
19313	DNAJB12 belongs to the evolutionarily conserved DNAJ/HSP40 family of proteins, which regulate molecular chaperone activity by stimulating ATPase activity. DNAJ proteins may have up to 3 distinct domains: a conserved 70-amino acid J domain, usually at the N terminus, a glycine/phenylalanine (G/F)-rich region, and a cysteine-rich domain containing 4 motifs resembling a zinc-finger domain (Ohtsuka and Hata, 2000).[supplied by OMIM]
19314	DNAJB12 belongs to the evolutionarily conserved DNAJ/HSP40 family of proteins, which regulate molecular chaperone activity by stimulating ATPase activity. DNAJ proteins may have up to 3 distinct domains: a conserved 70-amino acid J domain, usually at the N terminus, a glycine/phenylalanine (G/F)-rich region, and a cysteine-rich domain containing 4 motifs resembling a zinc-finger domain (Ohtsuka and Hata, 2000).[supplied by OMIM]
19315	
19316	
19317	Alternative splicing has been observed at this gene locus and two transcript variants, encoding distinct isoforms, have been identified. The function of this protein has not been determined.
19318	
19319	
19320	
19321	
19322	
19323	
19324	
19325	
19326	
19327	
19328	
19329	This gene encodes a protein involved in nonsense-mediated mRNA decay (NMD) as part of the mRNA surveillance complex. The protein has kinase activity and is thought to function in NMD by phosphorylating the regulator of nonsense transcripts 1 protein. Alternative spliced transcript variants have been described, but their full-length natures have not been determined.
19330	
19331	
19332	The protein encoded by this gene is a member of the tumor necrosis factor (TNF) ligand family. This protein is a ligand for TNFRSF17/BCMA, a member of the TNF receptor family. This protein and its receptor are both found to be important for B cell development. In vitro experiments suggested that this protein may be able to induce apoptosis through its interaction with other TNF receptor family proteins such as TNFRSF6/FAS and TNFRSF14/HVEM. Three alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
19333	cAMP is a signaling molecule important for a variety of cellular functions. cAMP exerts its effects by activating the cAMP-dependent protein kinase (AMPK), which transduces the signal through phosphorylation of different target proteins. The inactive holoenzyme of AMPK is a tetramer composed of two regulatory and two catalytic subunits. cAMP causes the dissociation of the inactive holoenzyme into a dimer of regulatory subunits bound to four cAMP and two free monomeric catalytic subunits. Four different regulatory subunits and three catalytic subunits of AMPK have been identified in humans. The protein encoded by this gene is one of the regulatory subunits. This protein was found to be a tissue-specific extinguisher that down-regulates the expression of seven liver genes in hepatoma x fibroblast hybrids. Functional null mutations in this gene cause Carney complex (CNC), an autosomal dominant multiple neoplasia syndrome. This gene can fuse to the RET protooncogene by gene rearrangement and form the thyroid tumor-specific chimeric oncogene known as PTC2. Three alternatively spliced transcript variants encoding the same protein have been observed.
19334	
19335	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel consists of three main subunits (a, b, c). This gene encodes the alpha subunit of the catalytic core. Alternatively spliced transcript variants encoding different isoforms have been identified. The three pseudogenes of this gene are located on chromosomes 9, 2 and 16.
19336	
19337	
19338	
19339	
19340	SUMO proteins, such as SUMO3, and ubiquitin (see MIM 191339) posttranslationally modify numerous cellular proteins and affect their metabolism and function. However, unlike ubiquitination, which targets proteins for degradation, sumoylation participates in a number of cellular processes, such as nuclear transport, transcriptional regulation, apoptosis, and protein stability (Su and Li, 2002).[supplied by OMIM]
19341	
19342	
19343	
19344	This gene belongs to the VCX/Y gene family, which has multiple members on both X and Y chromosomes, and all are expressed exclusively in male germ cells. The X-linked members are clustered on chromosome Xp22 and Y-linked members are two identical copies of the gene within a palindromic region on Yq11. The family members share a high degree of sequence identity, with the exception that a 30-bp unit is tandemly repeated in X-linked members but occurs only once in Y-linked members. The VCX gene cluster is polymorphic in terms of copy number; different individuals may have a different number of VCX genes. VCX/Y genes encode small and highly charged proteins of unknown function. The presence of a putative bipartite nuclear localization signal suggests that VCX/Y members are nuclear proteins. This gene contains 10 repeats of the 30-bp unit.
19345	This gene encodes a protein belonging to the pantothenate kinase family. Pantothenate kinase is a key regulatory enzyme in the biosynthesis of coenzyme A (CoA) in bacteria and mammalian cells. It catalyzes the first committed step in the universal biosynthetic pathway leading to CoA and is itself subject to regulation through feedback inhibition by CoA. Alternative splicing has been observed at this locus and three variants, each encoding a distinct isoform, have been identified.
19346	
19347	
19348	
19349	This gene encodes a type II integral membrane protein that is 77% identical to CD209 antigen, a HIV gp120-binding protein. This protein, like CD209, efficiently binds both intercellular adhesion molecule 3 (ICAM3) and HIV-1 gp120, and enhances HIV-1 infection of T cells. This gene is mapped to 19p13.3, in a cluster with the CD209 and CD23/FCER2 genes. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of some variants has not been determined.
19350	
19351	
19352	
19353	This gene encodes a protein that contains four domains present in other developmental proteins: a PWWP domain, an HMG box, a SET domain, and a PHD-type zinc finger. It is expressed ubiquitously in early development. Wolf-Hirschhorn syndrome (WHS) is a malformation syndrome associated with a hemizygous deletion of the distal short arm of chromosome 4. This gene maps to the 165 kb WHS critical region and has also been involved in the chromosomal translocation t(4;14)(p16.3;q32.3) in multiple myelomas. This gene contains 25 exons. Alternative splicing of this gene generates nine transcript variants, which encode four different isoforms.
19354	
19355	This gene encodes a member of the IQGAP family. The protein contains three IQ domains, one calponin homology domain, one Ras-GAP domain and one WW domain. It interacts with components of the cytoskeleton, with cell adhesion molecules, and with several signaling molecules to regulate cell morphology and motility.
19356	UNG2 is a glycosylase that removes uracil from both double- and single-stranded DNA in nuclear chromatin, thus contributing to base excision repair.[supplied by OMIM]
19357	
19358	This gene encodes a transcription factor that is a member of the leucine zipper family of DNA binding proteins. This protein binds to the cAMP-responsive element (CRE), an octameric palindrome. The protein forms a homodimer or heterodimer with c-Jun and stimulates CRE-dependent transcription. The protein is also a histone acetyltransferase (HAT) that specifically acetylates histones H2B and H4 in vitro; thus it may represent a class of sequence-specific factors that activate transcription by direct effects on chromatin components. Additional transcript variants have been identified but their biological validity has not been determined.
19359	
19360	RHD11, a member of the short-chain dehydrogenase/reductase (SDR) superfamily of oxidoreductases, is expressed at high levels in prostate epithelium, and its expression is regulated by androgens.[supplied by OMIM]
19361	
19362	
19363	
19364	
19365	
19366	
19367	HBXAP is involved in transcription repression, transcription coactivation when associated with hepatitis B virus X protein (HBX), and chromatin remodeling and spacing when associated with SNF2H (MIM 603375).[supplied by OMIM]
19368	
19369	
19370	
19371	This gene encodes a transcriptional repressor which represses neuronal genes in non-neuronal tissues. It is a member of the Kruppel-type zinc finger transcription factor family. It represses transcription by binding a DNA sequence element called the neuron-restrictive silencer element. The protein is also found in undifferentiated neuronal progenitor cells, and it is thought that this repressor may act as a master negative regular of neurogenesis. Alternatively spliced transcript variants have been described; however, their full length nature has not been determined.
19372	
19373	
19374	
19375	
19376	This gene belongs to a family of glycohydrolytic enzymes which remove sialic acid residues from glycoproteins and glycolipids.
19377	This gene was identified in the neuroblastoma tumour suppressor candidate region on chromosome 1p36. It contains a TFIID-31 domain, similar to that found in TATA box-binding protein-associated factor, TAF(II)31, which is required for p53-mediated transcription activation. This gene was expressed at very low levels in neuroblastoma tumours, and was shown to reduce cell growth in neuroblastoma cells, suggesting that it may have a role in a cell death pathway. Two alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
19378	
19379	
19380	
19381	
19382	
19383	
19384	
19385	Initiation of transcription by RNA polymerase I requires the formation of a complex composed of the TATA-binding protein (TBP) and three TBP-associated factors (TAFs) specific for RNA polymerase I. This complex, known as SL1, binds to the core promoter of ribosomal RNA genes to position the polymerase properly and acts as a channel for regulatory signals. This gene encodes the largest SL1-specific TAF. Two transcripts encoding different isoforms have been identified.
19386	
19387	
19388	
19389	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S25E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
19390	
19391	
19392	
19393	
19394	
19395	
19396	
19397	
19398	
19399	
19400	
19401	
19402	
19403	
19404	
19405	
19406	
19407	
19408	
19409	
19410	Three classes of protein carboxyl methyltransferases, distinguished by their methyl-acceptor substrate specificity, have been found in prokaryotic and eukaryotic cells. The type II enzyme catalyzes the transfer of a methyl group from S-adenosyl-L-methionine to the free carboxyl groups of D-aspartyl and L-isoaspartyl residues. These methyl-accepting residues result from the spontaneous deamidation, isomerization, and racemization of normal L-aspartyl and L-asparaginyl residues and represent sites of covalent damage to aging proteins PCMT1 (EC 2.1.1.77) is a protein repair enzyme that initiates the conversion of abnormal D-aspartyl and L-isoaspartyl residues to the normal L-aspartyl form.[supplied by OMIM]
19411	
19412	
19413	
19414	Membrane mucins, such as MUC17, function in epithelial cells to provide cytoprotection, maintain luminal structure, provide signal transduction, and confer antiadhesive properties upon cancer cells that lose their apical/basal polarization.[supplied by OMIM]
19415	
19416	
19417	
19418	
19419	
19420	
19421	
19422	
19423	
19424	
19425	
19426	
19427	Synaptic vesicles dock and fuse in the active zone of the plasma membrane at chemical synapses. The presynaptic cytoskeletal matrix (PCM), which is associated with the active zone and is situated between synaptic vesicles, is thought to be involved in maintaining the neurotransmitter release site in register with the postsynaptic reception apparatus. The cycling of synaptic vesicles is a multistep process involving a number of proteins (see MIM 603215). Among the components of the PCM that orchestrate these events are Bassoon (BSN; MIM 604020), RIM (RBBP8; MIM 604124), Oboe, and Piccolo (PCLO).[supplied by OMIM]
19428	
19429	Thioredoxin reductase (TR) is a dimeric NADPH-dependent FAD containing enzyme that catalyzes the reduction of the active site disulfide of thioredoxin and other substrates. TR is a member of a family of pyridine nucleotide-disulfide oxidoreductases and is a key enzyme in the regulation of the intracellular redox environment. Three thioredoxin reductase genes have been found that encode selenocysteine containing proteins. Alternative splicing of this gene results in three distinct transcripts encoding different isoforms, one of which has been shown to be located in the mitochondria. This TR gene partially overlaps the COMT gene on chromosome 22.
19430	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Three alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
19431	
19432	
19433	
19434	
19435	This gene encodes a mitochondrial DNA topoisomerase, an enzyme that controls and alters the topologic states of DNA during transcription. This enzyme catalyzes the transient breaking and rejoining of DNA which allows the strands to pass through one another and relieves the stress introduced by a circular mitochondrial genome during replication and transcription.
19436	
19437	
19438	
19439	The vitamin A metabolite, all-trans retinoic acid (atRA), plays an important role in neuronal development, including neurite outgrowth. RAINB1 is an atRA-responsive gene.[supplied by OMIM]
19440	Agrin is a neuronal aggregating factor that induces the aggregation of acetylcholine receptors and other postsynaptic proteins on muscle fibers and is crucial for the formation of the neuromuscular junction.[supplied by OMIM]
19441	
19442	The protein encoded by this gene is a member of the SWI/SNF family of proteins. Members of this family have helicase and ATPase activities and are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein contains a RING finger DNA binding motif. Two transcript variants encoding the same protein have been found for this gene. However, use of an alternative translation start site produces an isoform which is truncated at the N-terminus as compared to the full-length protein.
19443	
19444	
19445	
19446	
19447	
19448	The protein encoded by this gene is an isozyme of the long-chain fatty-acid-coenzyme A ligase family. Although differing in substrate specificity, subcellular localization, and tissue distribution, all isozymes of this family convert free long-chain fatty acids into fatty acyl-CoA esters, and thereby play a key role in lipid biosynthesis and fatty acid degradation. This isozyme is highly expressed in brain, and preferentially utilizes myristate, arachidonate, and eicosapentaenoate as substrates. The amino acid sequence of this isozyme is 92% identical to that of rat homolog. Two transcript variants encoding the same protein have been found for this gene.
19449	
19450	
19451	
19452	
19453	
19454	DIRAS2 belongs to a distinct branch of the functionally diverse Ras (see HRAS; MIM 190020) superfamily of monomeric GTPases.[supplied by OMIM]
19455	Butyrophilin is the major protein associated with fat droplets in the milk. This gene is a member of the BTN2 subfamily of genes, which encode proteins belonging to the butyrophilin protein family. The gene is located in a cluster on chromosome 6, consisting of seven genes belonging to the expanding B7/butyrophilin-like group, a subset of the immunoglobulin gene superfamily. The encoded protein is a type 1 receptor glycoprotein involved in lipid, fatty-acid and sterol metabolism. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some variants has not been determined.
19456	
19457	
19458	
19459	
19460	
19461	
19462	
19463	
19464	DYRK2 belongs to a family of protein kinases whose members are presumed to be involved in cellular growth and/or development.  The family is defined by structural similarity of their kinase domains and their capability to autophosphorylate on tyrosine residues.  DYRK2 has demonstrated tyrosine autophosphorylation and catalyzed phosphorylation of histones H3 and H2B in vitro.  Two isoforms of DYRK2 have been isolated.  The predominant isoform, isoform 1, lacks a 5@#$%& terminal insert.
19465	The product of this gene belongs to the family of basic helix-loop-helix leucine zipper (bHLH-Zip) transcription factors. These factors form heterodimers with Mad proteins and play a role in proliferation, determination and differentiation. This gene product may act to diversify Mad family function by its restricted association with a subset of the Mad family of transcriptional repressors, namely, Mad1 and Mad4. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
19466	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Sequence analysis identified two transcript variants that encode different isoforms. A pseudogene corresponding to this gene is found on chromosome 5q.
19467	
19468	
19469	
19470	
19471	
19472	
19473	This gene encodes a member of the WAP-type four-disulfide core (WFDC) domain family. The WFDC domain, or WAP signature motif, contains eight cysteines forming four disulfide bonds at the core of the protein, and functions as a protease inhibitor. Most WFDC gene members are localized to chromosome 20q12-q13 in two clusters: centromeric and telomeric. This gene belongs to the telomeric cluster.
19474	
19475	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
19476	
19477	
19478	The RABL2B protein is a member of the RAB gene family which belongs to the RAS GTPase superfamily. RABL2B is located within a subtelomeric region of 22q13.3. Multiple alternatively spliced transcript variants have been found, but the biological validity of some variants has not been determined.
19479	
19480	The autosomal dominant cerebellar ataxias (ADCA) are a heterogeneous group of neurodegenerative disorders characterized by progressive degeneration of the cerebellum, brain stem and spinal cord. Clinically, ADCA has been divided into three groups: ADCA types I-III. ADCAI is genetically heterogeneous, with five genetic loci, designated spinocerebellar ataxia (SCA) 1, 2, 3, 4 and 6, being assigned to five different chromosomes. ADCAII, which always presents with retinal degeneration (SCA7), and ADCAIII often referred to as the `pure@#$%& cerebellar syndrome (SCA5), are most likely homogeneous disorders. Several SCA genes have been cloned and shown to contain CAG repeats in their coding regions. ADCA is caused by the expansion of the CAG repeats, producing an elongated polyglutamine tract in the corresponding protein. The expanded repeats are variable in size and unstable, usually increasing in size when transmitted to successive generations. The function of the ataxins is not known. This locus has been mapped to chromosome 3, and it has been determined that the diseased allele contains 38-130 CAG repeats (near the N-terminus), compared to 7-17 in the normal allele. Because it contains an NLS consensus sequence and is found to be localized in the nucleus, it has been postulated to be a potential transcription factor.
19481	
19482	
19483	
19484	PEA15 is a death effector domain (DED)-containing protein predominantly expressed in the central nervous system, particularly in astrocytes.[supplied by OMIM]
19485	
19486	
19487	The product of this gene belongs to the phosphatase 2A regulatory subunit B family. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes an alpha isoform of the regulatory subunit B56 subfamily.
19488	
19489	
19490	The muscarinic cholinergic receptors belong to a larger family of G protein-coupled receptors. The functional diversity of these receptors is defined by the binding of acetylcholine and includes cellular responses such as adenylate cyclase inhibition, phosphoinositide degeneration, and potassium channel mediation. Muscarinic receptors influence many effects of acetylcholine in the central and peripheral nervous system. The muscarinic cholinergic receptor 3 controls smooth muscle contraction and its stimulation causes secretion of glandular tissue.
19491	
19492	
19493	This gene encodes a caveolin family member, which functions as a component of the caveolae plasma membranes found in most cell types. Caveolin proteins are proposed to be scaffolding proteins for organizing and concentrating certain caveolin-interacting molecules. Mutations identified in this gene lead to interference with protein oligomerization or intra-cellular routing, disrupting caveolae formation and resulting in Limb-Girdle muscular dystrophy type-1C (LGMD-1C), hyperCKemia or rippling muscle disease (RMD). Alternative splicing has been identified for this locus, with inclusion or exclusion of a differentially spliced intron. In addition, transcripts utilize multiple polyA sites and contain two potential translation initiation sites.
19494	
19495	
19496	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. The protein encoded by this gene and SEC63 protein are found to be associated with ribosome-free SEC61 complex. It is speculated that Sec61-Sec62-Sec63 may perform post-translational protein translocation into the ER. The Sec61-Sec62-Sec63 complex might also perform the backward transport of ER proteins that are subject to the ubiquitin-proteasome-dependent degradation pathway. The encoded protein is an integral membrane protein located in the rough ER.
19497	
19498	
19499	
19500	
19501	
19502	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
19503	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This protein functions as a modifier of endothelial cell migration and proliferation, as well as an angiogenic factor. It acts as a mitogen for a variety of mesoderm- and neuroectoderm-derived cells in vitro, thus is thought to be involved in organogenesis. Three alternatively spliced variants encoding different isoforms have been described.
19504	
19505	
19506	The protein encoded by this gene belongs to a small class of prenylated protein tyrosine phosphatases (PTPs), which contains a PTP domain and a characteristic C-terminal prenylation motif. PTPs are cell signaling molecules that play regulatory roles in a variety of cellular processes. This tyrosine phosphatase is a nuclear protein, but may primarily associate with plasma membrane. The surface membrane association of this protein depends on its C-terminal prenylation. Overexpression of this gene in mammalian cells conferred a transformed phenotype, which implicated its role in the tumorigenesis. Studies in rat suggested that this gene may be an immediate-early gene in mitogen-stimulated cells.
19507	
19508	
19509	
19510	
19511	
19512	This gene encodes a protein that contains two WD domains and an FYVE zinc finger region. The function of this gene is unknown. An alternatively spliced transcript variant of this gene may exist.
19513	The cell adhesion molecules (CAMs) are a subset of the immunoglobin (Ig) superfamily found in the nervous systems of both vertebrates and invertebrates. They are usually surface membrane proteins with multiple Ig domains at their N termini followed by several fibronectin type III repeats and either a transmembrane intracellular domain or a glycophosphatidylinositol-linked membrane anchor at the C terminus (Lane et al., 1996).[supplied by OMIM]
19514	
19515	
19516	
19517	
19518	This gene is a member of the neuronal calcium sensor gene family, which encode calcium-binding proteins expressed predominantly in neurons. The protein encoded by this gene regulates G protein-coupled receptor phosphorylation in a calcium-dependent manner and can substitute for calmodulin.  This protein is thought to be associated with secretory granules and may be involved in the regulation of neurosecretion.
19519	Copper is an element essential for life, but excessive copper can be toxic or even lethal to the cell. Therefore, cells have developed sophisticated ways to maintain a critical copper balance, with the intake, export, and intracellular compartmentalization or buffering of copper strictly regulated. The 2 related genes ATP7A (MIM 300011) and ATP7B (MIM 606882), responsible for the human diseases Menkes syndrome (MIM 309400) and Wilson disease (MIM 277900), respectively, are involved in copper export. In S. cerevisiae, the copper uptake genes CTR1, CTR2, and CTR3 have been identified, and in human the CTR1 and CTR2 (MIM 603088) genes have been identified.[supplied by OMIM]
19520	
19521	
19522	
19523	
19524	
19525	
19526	
19527	The protein encoded by this gene is a peripheral membrane protein that is a component of tight junctions or TJs. TJs form an apical junctional structure and act to control paracellular permeability and maintain cell polarity. This protein is related to angiomotin, an angiostatin binding protein that regulates endothelial cell migration and capillary formation.
19528	
19529	Apoliprotein (apo) A-IV gene contains 3 exons separated by two introns. A sequence polymorphism has been identified in the 3@#$%&UTR of the third exon.  The primary translation product is a 396-residue preprotein which after proteolytic processing is secreted its primary site of synthesis, the intestine, in association with chylomicron particles.  Although its precise function is not known, apo A-IV is a potent activator of lecithin-cholesterol acyltransferase in vitro.
19530	This gene was identified as a gene specifically expressed in Burkitt@#$%&s lymphoma and lymphatic tissues. The protein encoded by this gene is predicted to be a seven transmembrane G protein- coupled receptor and belongs to the CXC chemokine receptor family. BLC, a B-lymphocyte chemoattractant, was identified to be a specific ligand for this receptor. Studies of this gene and its mouse conterpart strongly suggest the essential function of this gene in B cell migration and localization within specific anatomic compartments, such as follicles in lymph nodes as well as in spleen. Two alternatively spliced variants of this gene exist.
19531	
19532	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It encodes a protein which shows 97%, 85%, and 63% amino acid identity with mouse, chicken, and Xenopus Wnt11 protein, respectively. This gene may play roles in the development of skeleton, kidney and lung, and is considered to be a plausible candidate gene for High Bone Mass Syndrome.
19533	
19534	
19535	
19536	
19537	
19538	
19539	
19540	
19541	
19542	
19543	The function of this gene has not yet been determined; however, it is speculated that it may represent a breast cancer candidate gene.  It is located close to FGFR1 on a region of chromosome 8 that is amplified in some breast cancers.
19544	
19545	
19546	
19547	This gene encodes an enzyme that catalyzes the second step of glycosylphosphatidylinositol (GPI) biosynthesis, which is the de-N-acetylation of N-acetylglucosaminylphosphatidylinositol (GlcNAc-PI). Study of a similar rat enzyme suggests that this protein localizes to the endoplasmic reticulum.
19548	This gene encodes a regulator of the CDC7-like 1 protein, a serine-threonine kinase which links cell cycle regulation to genome duplication. This protein localizes to the nucleus and its expression is cell cycle-regulated. Alternative splicing of this gene results in two transcript variants encoding different isoforms. Additional transcript variants have been described, but their full length sequences have not been determined.
19549	Caveolae are small domains on the inner cell membrane involved in vesicular trafficking and signal transduction.  FLOT2 encodes a caveolae-associated, integral membrane protein.  Flotillin 2 is though to function in neuronal signaling.
19550	The beta adaptin subunit is part of the clathrin coat assembly complex which links clathrin to receptors in coated pits and vesicles. These vesicles are involved in endocytosis and Golgi processing. The beta 1 subunit is one of the assembly proteins which binds to clathrin and initiates coat formation.
19551	The protein encoded by this gene is a member of the intercellular adhesion molecule (ICAM) family. All ICAM proteins are type I transmembrane glycoproteins, contain 2-9 immunoglobulin-like C2-type domains, and bind to the leukocyte adhesion LFA-1 protein. This protein is expressed on the surface of telencephalic neurons and displays two types of adhesion activity, homophilic binding between neurons and heterophilic binding between neurons and leukocytes. It may be a critical component in neuron-microglial cell interactions in the course of normal development or as part of neurodegenerative diseases.
19552	
19553	
19554	
19555	
19556	
19557	
19558	The absorption of vitamin C into the body and its distribution to organs requires two sodium-dependent vitamin C transporters. This gene encodes one of the two required transporters and the encoded protein accounts for tissue-specific uptake of vitamin C. Previously, this gene had an official symbol of SLC23A1.
19559	
19560	The region of chromosome 21 between genes CBR and ERG (CBR-ERG region), which spans 2.5 Mb on 21q22.2, has been defined by analysis of patients with partial trisomy 21. It contributes significantly to the pathogenesis of many characteristics of Down syndrome, including morphological features, hypotonia, and mental retardation. The DSCR3 (Down syndrome critical region gene 3) gene is found in this region and is predictated to contain eight exons. DSCR3 is expressed in most tissues examined.
19561	
19562	
19563	This gene is a member of the GAGE family. The GAGE genes are expressed in a variety of tumors and in some fetal and reproductive tissues. This gene is strongly expressed in prostate and prostate cancer, but is also expressed in other male and female reproductive tissues including testis, fallopian tube, uterus, and placenta, as well as in testicular cancer and uterine cancer. The protein encoded by this gene shares sequence similarity with other GAGE/PAGE proteins, and also belongs to a family of CT (cancer-testis) antigens.
19564	
19565	
19566	ARHGEF9 belongs to a family of Rho-like GTPases that act as molecular switches by cycling from the active GTP-bound state to the inactive GDP-bound state. These proteins are key regulators of the actin cytoskeleton and are involved in cell signaling.[supplied by OMIM]
19567	
19568	
19569	Homeobox genes encode DNA-binding proteins, many of which are thought to be involved in early embryonic development. Homeobox genes encode a DNA-binding domain of 60 to 63 amino acids referred to as the homeodomain. This pseudogene is a member of the Vent homeobox gene family.
19570	
19571	This gene encodes a trypsinogen, which is a member of the trypsin family of serine proteases. This enzyme is secreted by the pancreas and cleaved to its active form in the small intestine. It is active on peptide linkages involving the carboxyl group of lysine or arginine. Mutations in this gene are associated with hereditary pancreatitis. This gene and several other trypsinogen genes are localized to the T cell receptor beta locus on chromosome 7.
19572	
19573	This gene encodes a protein containing domains that associate it with the peptidase family C19, also known as family 2 of ubiquitin carboxyl-terminal hydrolases. Family members function as deubiquitinating enzymes, recognizing and hydrolyzing the peptide bond at the C-terminal glycine of ubiquitin. Enzymes in peptidase family C19 are involved in the processing of poly-ubiquitin precursors as well as that of ubiquitinated proteins.
19574	
19575	
19576	
19577	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to class I of the histone deacetylase/acuc/apha family. It has histone deacetylase activity and represses transcription when tethered to a promoter.
19578	
19579	
19580	
19581	
19582	This gene is a member of the N-acetylated alpha-linked acidic dipeptidase (NAALADase) gene family. The representative member of this family is the gene encoding human prostate-specific membrane antigen (PSM), which is a marker of prostatic carcinomas and is the first to be shown to possess NAALADase activity. NAALADase cleaves N-acetyl-L-aspartate-L-glutamate (NAAG), which is a neuropeptide expressed both in the central nervous systems and in the periphery and is thought to function as a neurotransmitter. The product of this gene is a type II integral membrane protein. Transient transfection of this gene confers both NAALADase and dipetidyl peptidase IV activities to mammalian cells. This gene is highly expressed in ovary and testis as well as within discrete brain areas.
19583	
19584	
19585	
19586	N-methyl-D-aspartate (NMDA) receptors are a class of ionotropic glutamate receptors.  NMDA channel has been shown to be involved in long-term potentiation, an activity-dependent increase in the efficiency of synaptic transmission thought to underlie certain kinds of memory and learning.  NMDA receptor channels are heteromers composed of the key receptor subunit NMDAR1 (GRIN1) and 1 or more of the 4 NMDAR2 subunits: NMDAR2A (GRIN2A), NMDAR2B (GRIN2B), NMDAR2C (GRIN2C), and NMDAR2D (GRIN2D).
19587	
19588	
19589	
19590	
19591	
19592	
19593	
19594	
19595	
19596	
19597	
19598	
19599	
19600	
19601	
19602	
19603	
19604	The protein encoded by this gene is similar to the Maf1, a Saccharomyces cerevisiae protein highly conserved in eukaryotic cells. Yeast Maf1 is a negative effector of RNA polymerase III (Pol III). It responses to the changes in the cellular environment and represses the pol III transcription. Biochemical studies identified the initiation factor TFIIIB as a target for Maf1-dependent repression.
19605	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains an N-terminal domain, similar to cytoskeletal- associated proteins including band 4.1, ezrin, merlin, and radixin. This PTP was shown to specially interact with BMX/ETK, a member of Tec tyrosine kinase family characterized by a multimodular structures including PH, SH3, and SH2 domains. The interaction of this PTP with BMX kinase was found to increase the activation of STAT3, but not STAT2 kinase. Studies of the similar gene in mice suggested the possible roles of this PTP in liver regeneration and spermatogenesis.
19606	
19607	
19608	
19609	
19610	
19611	
19612	
19613	
19614	
19615	
19616	
19617	
19618	
19619	
19620	
19621	
19622	
19623	
19624	
19625	The product of this gene belongs to the family of highly homologous synovial sarcoma X (SSX) breakpoint proteins. These proteins may function as transcriptional repressors. They are also capable of eliciting spontaneously humoral and cellular immune responses in cancer patients, and are potentially useful targets in cancer vaccine-based immunotherapy. SSX1, SSX2 and SSX4 genes have been involved in the t(X;18) translocation characteristically found in all synovial sarcomas. This translocation results in the fusion of the synovial sarcoma translocation gene on chromosome 18 to one of the SSX genes on chromosome X. The encoded hybrid proteins are probably responsible for transforming activity. Two transcript variants encoding distinct isoforms have been identified for this gene.
19626	The RAS oncogene (MIM 190020) is mutated in nearly one-third of all human cancers. Members of the RAS superfamily are plasma membrane GTP-binding proteins that modulate intracellular signal transduction pathways. A subfamily of RAS effectors, including RASSF3, share a RAS association (RA) domain.[supplied by OMIM]
19627	ADP-ribosylation factor 3 (ARF3) is a member of the human ARF gene family. These genes encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking and as activators of phospholipase D.  The gene products include 6 ARF proteins and 11 ARF-like proteins and constitute 1 family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2,and ARF3), class II (ARF4 and ARF5) and class III (ARF6) and members of each class share a common gene organization. The ARF3 gene contains five exons and four introns.
19628	REV1L is the human homolog of the S. cerevisiae mutagenesis protein Rev1. The REV1 protein is conserved from yeast to humans. All REV1 proteins contain a BRCT domain, which is important in protein-protein interactions.[supplied by OMIM]
19629	
19630	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and a single intracellular catalytic domain, and thus represents a receptor-type PTP. The catalytic domain of this PTP is most closely related to PTPRN/IA-2beta. This PTP and PTPRN are both found to be major autoantigens associated with insulin-dependent diabetes mellitus. Three alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported.
19631	
19632	
19633	
19634	
19635	
19636	The VG5Q gene encodes a potent angiogenic factor that contains a forkhead-associated domain and a G-patch domain.[supplied by OMIM]
19637	
19638	
19639	CSTF3 is one of three (including CSTF1 and CSTF2) cleavage stimulation factors that combine to form CSTF.  CSTF is involved in the polyadenylation and 3@#$%&end cleavage of pre-mRNAs.
19640	
19641	
19642	
19643	
19644	This gene belongs to the sulfotransferase gene family. Sulfotransferases generate sulfated glycosaminoglycan (GAG) moities during chondroitin sulfate biosynthesis. They create considerable structural diversity among chondroitin sulfates by transferring sulfate with remarkable specificity for the underlying oligosaccharide substrate. This gene product mainly transfers sulfate to N-acetylgalactosamine. The regulated expression of each member of this gene family may be an important determinant of sulfated GAGs expression and the associated function of chondroitin sulfates as regulators of many biologic processes. This gene is part of a gene cluster on chromosome Xp11.23.
19645	
19646	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form complex with G proteins and stimulate Rho-dependent signals.
19647	This gene encodes the D4 subtype of the dopamine receptor. The D4 subtype is a G-protein coupled receptor which inhibits adenylyl cyclase. It is a target for drugs which treat schizophrenia and Parkinson disease. Mutations in this gene have been associated with various behavioral phenotypes, including autonomic nervous system dysfunction, attention deficit/hyperactivity disorder, and the personality trait of novelty seeking. This gene contains a polymorphic number (2-10 copies) of tandem 48 nt repeats; the sequence shown contains four repeats.
19648	
19649	
19650	
19651	
19652	
19653	
19654	
19655	The protein encoded by this gene is the tropomyosin-binding subunit of the troponin complex, which is located on the thin filament of striated muscles and regulates muscle contraction in response to alterations in intracellular calcium ion concentration. Mutations in this gene have been associated with familial hypertrophic cardiomyopathy as well as with dilated cardiomyopathy. Transcripts for this gene undergo alternative splicing that results in many tissue-specific isoforms, however, the full length nature of some of these variants has not yet been determined.
19656	
19657	The TRAF (tumor necrosis factor receptor-associated factor) family of proteins associate with and transduce signals from members of the tumor necrosis factor receptor superfamily. The protein encoded by this gene is found in the cytoplasm and can bind to TRAF1, TRAF2, or TRAF3, thereby inhibiting TRAF function by sequestering the TRAFs in a latent state in the cytoplasm. For example, the protein encoded by this gene can block TRAF2 binding to LMP1, the Epstein-Barr virus transforming protein, and inhibit LMP1-mediated NF-kappa-B activation. Two transcript variants encoding different isoforms have been found for this gene.
19658	
19659	
19660	
19661	
19662	Expression of this gene is affected by vitamin A. The encoded protein of this gene may be associated with the cytoskeleton. A similar protein in rats may play a role in the regulation of cell differentiation. The rat protein binds and inhibits the cell membrane glutamate transporter EAAC1. The expression of the rat gene is upregulated by retinoic acid, which results in a specific reduction in EAAC1-mediated glutamate transport.
19663	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L15E family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with the yeast ribosomal protein YL10 gene. Although this gene has been referred to as RPL10, its official symbol is RPL15. This gene has been shown to be overexpressed in some esophageal tumors compared to normal matched tissues. Transcript variants utilizing alternative polyA signals exist. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
19664	
19665	
19666	
19667	
19668	
19669	
19670	
19671	Twinkle is a mitochondrial protein with structural similarity to the phage T7 primase/helicase (GP4) and other hexameric ring helicases. The twinkle protein colocalizes with mtDNA in mitochondrial nucleoids, and the protein@#$%&s name derives from the unusual localization pattern reminiscent of twinkling stars (Spelbrink et al., 2001).[supplied by OMIM]
19672	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
19673	
19674	
19675	
19676	
19677	
19678	The product of this gene is a component of the nuclear factor of activated T cells DNA-binding transcription complex. This complex consists of at least two components: a preexisting cytosolic component that translocates to the nucleus upon T cell receptor (TCR) stimulation, and an inducible nuclear component. Proteins belonging to this family of transcription factors play a central role in inducible gene transcription during immune response. The product of this gene is an inducible nuclear component. It functions as a major molecular target for the immunosuppressive drugs such as cyclosporin A. Five transcript variants encoding distinct isoforms have been identified for this gene. Different isoforms of this protein may regulate inducible expression of different cytokine genes.
19679	
19680	
19681	
19682	
19683	
19684	ADP-ribosylation factor 1 (ARF1) is a member of the human ARF gene family. The family members encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking as activators of phospholipase D. The gene products, including 6 ARF proteins and 11 ARF-like proteins, constitute a family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2 and ARF3), class II (ARF4 and ARF5) and class III (ARF6), and members of each class share a common gene organization. The ARF1 protein is localized to the Golgi apparatus and has a central role in intra-Golgi transport. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene.
19685	
19686	
19687	
19688	This gene encodes a loosely associated peripheral membrane protein related to the LanC family of bacterial membrane-associated proteins involved in the biosynthesis of antimicrobial peptides. This protein may play a role as a peptide-modifying enzyme component in eukaryotic cells. Previously considered a member of the G-protein-coupled receptor superfamily, this protein is now in the LanC family.
19689	
19690	
19691	
19692	The coenzyme NAD and its derivatives are involved in hundreds of metabolic redox reactions and are utilized in protein ADP-ribosylation, histone deacetylation, and in some Ca(2+) signaling pathways. NMNAT (EC 2.7.7.1) is a central enzyme in NAD biosynthesis, catalyzing the condensation of nicotinamide mononucleotide (NMN) or nicotinic acid mononucleotide (NaMN) with the AMP moiety of ATP to form NAD or NaAD (Zhang et al., 2003).[supplied by OMIM]
19693	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two RRM domains that bind to RNAs. Two alternatively spliced transcript variants have been described for this gene. The variants encode the same protein but have different 3@#$%& UTRs. This protein is similar to its family member HNRPD.
19694	The protein encoded by this gene is a microsomal enzyme that catalyzes the esterification of all-trans-retinol into all-trans-retinyl ester, an essential reaction for the retinoid cycle in visual system and vitamin A status in liver. Mutations in this gene have been associated with early-onset severe retinal dystrophy.
19695	This gene encodes glutathione-dependent formaldehyde dehydrogenase or class III alcohol dehydrogenase chi subunit, which is a member of the alcohol dehydrogenase family. Members of this family metabolize a wide variety of substrates, including ethanol, retinol, other aliphatic alcohols, hydroxysteroids, and lipid peroxidation products. Class III alcohol dehydrogenase is a homodimer composed of 2 chi subunits. It has virtually no activity for ethanol oxidation, but exhibits high activity for oxidation of long-chain primary alcohols and for oxidation of S-hydroxymethyl-glutathione, a spontaneous adduct between formaldehyde and glutathione. This enzyme is an important component of cellular metabolism for the elimination of formaldehyde, a potent irritant and sensitizing agent that causes lacrymation, rhinitis, pharyngitis, and contact dermatitis.
19696	
19697	This gene encodes the beta subunit of the karyopherin receptor complex which interacts with nuclear localization signals to target nuclear proteins to the nucleus. The karyopherin receptor complex is a heterodimer of an alpha subunit which recognizes the nuclear localization signal and a beta subunit which docks the complex at nucleoporins. Alternate splicing of this gene results in two transcript variants encoding the same protein.
19698	The protein encoded by this gene belongs to the thrombospondin protein family. Thrombospondin family members are adhesive glycoproteins that mediate cell-to-cell and cell-to-matrix interactions. This protein forms a pentamer and can bind to heparin and calcium. Studies of the rat counterpart suggest that this protein may be involved in local signaling in the developing and adult nervous system.
19699	
19700	
19701	
19702	The protein encoded by this gene contains a RING finger motif and a FHA domain. This protein has been shown to interact with several class II ubiquitin-conjugating enzymes (E2), including UBE2E1/UBCH6, UBE2E2, and UBE2E3, and may act as an ubiquitin ligase (E3) in the ubiquitination of certain nuclear proteins. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
19703	
19704	
19705	RDH10 generates all-trans retinal from all-trans retinol and may plan an important role in the photic visual cycle. All-trans retinal is isomerized to 11-cis retinal by the retinal G protein-coupled receptor (RGR; MIM 600342) when the retinal pigment epithelium (RPE) is illuminated.[supplied by OMIM]
19706	
19707	This gene encodes a myotubularin-related protein that is atypical to most other members of the myotubularin-related protein family because it has no dual-specificity phosphatase domain. The encoded protein contains a double-helical motif similar to the SET interaction domain, which is thought to have a role in the control of cell proliferation. In mouse, a protein similar to the encoded protein binds with MTMR7, and together they dephosphorylate phosphatidylinositol 3-phosphate and inositol 1,3-bisphosphate.
19708	
19709	
19710	
19711	
19712	
19713	This gene encodes a 42.6 kD subunit of dynactin, a macromolecular complex consisting of 10-11 subunits ranging in size from 22 to 150 kD. Dynactin binds to both microtubules and cytoplasmic dynein. It is involved in a diverse array of cellular functions, including ER-to-Golgi transport, the centripetal movement of lysosomes and endosomes, spindle formation, chromosome movement, nuclear positioning, and axonogenesis. This subunit is present in 8-13 copies per dynactin molecule, and is the most abundant molecule in the dynactin complex. It is an actin-related protein, and is approximately 60% identical at the amino acid level to conventional actin.
19714	
19715	
19716	Neurexins are a family of proteins that function in the vertebrate nervous system as cell adhesion molecules and receptors. They are encoded by several unlinked genes of which two, NRXN1 and NRXN3, are among the largest known human genes. Three of the genes (NRXN1-3) utilize two alternate promoters and include numerous alternatively spliced exons to generate thousands of distinct mRNA transcripts and protein isoforms. The majority of transcripts are produced from the upstream promoter and encode alpha-neurexin isoforms; a much smaller number of transcripts are produced from the downstream promoter and encode beta-neurexin isoforms. The alpha-neurexins contain epidermal growth factor-like (EGF-like) sequences and laminin G domains, and have been shown to interact with neurexophilins. The beta-neurexins lack EGF-like sequences and contain fewer laminin G domains than alpha-neurexins.
19717	
19718	
19719	
19720	
19721	The protein encoded by this gene contains a ring finger motif, which is known to be involved in protein-protein interactions. The specific function of this protein has not yet been determined. EST data suggests the existence of multiple alternatively spliced transcript variants, however, their full length nature is not known.
19722	
19723	Natural killer (NK) cells are lymphocytes that can mediate lysis of certain tumor cells and virus-infected cells without previous activation. They can also regulate specific humoral and cell-mediated immunity. The protein encoded by this gene belongs to the killer cell lectin-like receptor (KLR) family, which is a group of transmembrane proteins preferentially expressed in NK cells. Studies in mice suggested that the expression of this gene may be regulated by MHC class I molecules. Alternatively spliced transcript variants have been reported, but their full-length natures have not yet been determined.
19724	
19725	Retinitis pigmentosa is an inherited progressive disease which is a major cause of blindness in western communities. It can be inherited as an autosomal dominant, autosomal recessive, or X-linked recessive disorder. In the autosomal dominant form,which comprises about 25% of total cases, approximately 30% of families have mutations in the gene encoding the rod photoreceptor-specific protein rhodopsin. This is the transmembrane protein which, when photoexcited, initiates the visual transduction cascade. Defects in this gene are also one of the causes of congenital stationary night blindness.
19726	
19727	
19728	
19729	
19730	In a study identifying genes in rat that are upregulated in response to nerve damage, a gene which is highly expressed in ganglia and in the central nervous system was found. The protein encoded by the rat gene contains both a zinc finger and a RING finger motif and is localized in the endosome/lysosome compartment, indicating that it may be involved in ubiquitin-mediated protein modification. The protein encoded by this human gene is highly similar in sequence to that encoded by the rat gene.
19731	
19732	Prohibitin is an evolutionarily conserved gene that is ubiquitously expressed.  It is thought to be a negative regulator of cell proliferation and may be a tumor suppressor.  Mutations in PHB have been linked to sporadic breast cancer.  Prohibitin is expressed as two transcripts with varying lengths of 3@#$%& untranslated region.  The longer transcript is present at higher levels in proliferating tissues and cells, suggesting that this longer 3@#$%& untranslated region may function as a trans-acting regulatory RNA.
19733	The protein encoded by this gene was identified on the basis of its interaction with MAP3K12/DLK, a protein kinase known to be involved in the induction of cell apoptosis. This gene product contains a leucine zipper, which is a characteristic motif of transcription factors, and was shown to exhibit strong transactivation activity when fused to Gal4 DNA binding domain. Overexpression of this gene interfered with MAP3K12 induced apoptosis.
19734	The protein encoded by this gene is a nuclear hormone receptor for triiodothyronine. It is one of the several receptors for thyroid hormone, and has been shown to mediate the biological activities of thyroid hormone. Knockout studies in mice suggest that the different receptors, while having certain extent of redundancy, may mediate different functions of thyroid hormone. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
19735	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. This channel protein, considered an open rectifier, is widely expressed. It is stimulated by arachidonic acid, and inhibited by internal acidification and volatile anaesthetics.
19736	The protein encoded by this gene is an inhibitory receptor found on peripheral mononuclear cells, including NK cells, T cells, and B cells. Inhibitory receptors regulate the immune response to prevent lysis of cells recognized as self. The gene is a member of both the immunoglobulin superfamily and the leukocyte-associated inhibitory receptor family. The gene maps to a region of 19q13.4 called the leukocyte receptor cluster, which contains at least 29 genes encoding leukocyte-expressed receptors of the immunoglobulin superfamily.
19737	
19738	
19739	This protein belongs to the aldehyde dehydrogenase family of proteins. This enzyme is a mitochondrial matrix NAD-dependent dehydrogenase which catalyzes the second step of the proline degradation pathway, converting pyrroline-5-carboxylate to glutamate. Deficiency of this enzyme is associated with type II hyperprolinemia, an autosomal recessive disorder characterized by accumulation of delta-1-pyrroline-5-carboxylate (P5C) and proline. Two transcript variants encoding the same protein have been identified for this gene.
19740	
19741	
19742	
19743	
19744	
19745	The protein encoded by this gene is a member of the thioredoxin family. The exact function of this protein is not known but it is associated with cell differentiation.
19746	
19747	
19748	
19749	The DOCK2 gene encodes a hematopoietic cell-specific CDM family protein that is indispensable for lymphocyte chemotaxis.[supplied by OMIM]
19750	
19751	
19752	
19753	
19754	
19755	The ficolin family of proteins are characterized by the presence of a leader peptide, a short N-terminal segment, followed by a collagen-like region, and a C-terminal fibrinogen-like domain. The collagen-like and the fibrinogen-like domains are also found separately in other proteins such as complement protein C1q, C-type lectins known as collectins, and tenascins.  However, all these proteins recognize different targets, and are functionally distinct.  Ficolin 1 encoded by FCN1 is predominantly expressed in the peripheral blood leukocytes, and has been postulated to function as a plasma protein with elastin-binding activity.
19756	
19757	
19758	ABL2 is a cytoplasmic tyrosine kinase which is closely related to but distinct from ABL1.  The similarity of the proteins includes the tyrosine kinase domains and extends amino-terminal to include the SH2 and SH3 domains.  ABL2 is expressed in both normal and tumor cells.  The ABL2 gene product is expressed as two variants bearing different amino termini, both approximately 12-kb in length.
19759	
19760	
19761	
19762	
19763	
19764	
19765	Cell surface heparan sulfate proteoglycans are composed of a membrane-associated protein core substituted with a variable number of heparan sulfate chains. Members of the glypican-related integral membrane proteoglycan family (GRIPS) contain a core protein anchored to the cytoplasmic membrane via a glycosyl phosphatidylinositol linkage.  These proteins may play a role in the control of cell division and growth regulation.
19766	
19767	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L23P family of ribosomal proteins. It is located in the cytoplasm. The protein may be one of the target molecules involved in mediating growth inhibition by interferon. In yeast, the corresponding protein binds to a specific site on the 26S rRNA. This gene is co-transcribed with the U42A, U42B, U101A, and U101B small nucleolar RNA genes, which are located in its third, first, second, and fourth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
19768	
19769	
19770	
19771	
19772	
19773	
19774	
19775	
19776	
19777	
19778	
19779	
19780	
19781	
19782	
19783	The CHD family of proteins is characterized by the presence of chromo (chromatin organization modifier) domains and SNF2-related helicase/ATPase domains.  CHD genes alter gene expression possibly by modification of chromatin structure thus altering access of the transcriptional apparatus to its chromosomal DNA template.
19784	
19785	
19786	
19787	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains an N-terminal noncatalytic domain similar to that of band 4.1 superfamily cytoskeleton-associated proteins, which suggested the membrane or cytoskeleton localization of this protein. The specific function of this PTP has not yet been determined.
19788	
19789	
19790	
19791	
19792	
19793	The protein encoded by this gene is a noncollagenous extracellular matrix (ECM) protein. It consists of five identical glycoprotein subunits, each with EGF-like and calcium-binding (thrombospondin-like) domains. Oligomerization results from formation of a five-stranded coiled coil and disulfides. Binding to other ECM proteins such as collagen appears to depend on divalent cations. Mutations can cause the osteochondrodysplasias pseudochondroplasia (PSACH) and multiple epiphyseal dysplasia (MED).
19794	This gene encodes a member of the Notch family. Members of this Type 1 transmembrane protein family share structural characteristics including an extracellular domain consisting of multiple epidermal growth factor-like (EGF) repeats, and an intracellular domain consisting of multiple, different domain types. Notch family members play a role in a variety of developmental processes by controlling cell fate decisions. The Notch signaling network is an evolutionarily conserved intercellular signaling pathway which regulates interactions between physically adjacent cells. In Drosophilia, notch interaction with its cell-bound ligands (delta, serrate) establishes an intercellular signaling pathway that plays a key role in development. Homologues of the notch-ligands have also been identified in human, but precise interactions between these ligands and the human notch homologues remain to be determined. This protein is cleaved in the trans-Golgi network, and presented on the cell surface as a heterodimer. This protein functions as a receptor for membrane bound ligands, and may play a role in vascular, renal and hepatic development. This gene may be associated with susceptibility to schizophrenia in a small portion of cases. An alternative splice variant has been described but its biological nature has not been determined.
19795	
19796	This gene encodes the fibrillin protein, an important component of extracellular microfibrils. The encoded protein is found in elastic and nonelastic connective tissues of the body. Mutations in this gene cause Marfan syndrome.
19797	
19798	
19799	
19800	
19801	
19802	
19803	
19804	
19805	In Drosophila melanogaster, the @#$%&Polycomb@#$%& group (PcG) of genes are part of a cellular memory system that is responsible for the stable inheritance of gene activity. PcG proteins form a large multimeric, chromatin-associated protein complex. The protein encoded by this gene has homology to the Drosophila PcG protein @#$%&polyhomeotic@#$%& (Ph) and is known to heterodimerize with EDR1 and colocalize with BMI1 in interphase nuclei of human cells. The specific function in human cells has not yet been determined. Two transcript variants encoding different isoforms have been found for this gene.
19806	
19807	
19808	Tumor protein p53, a nuclear protein, plays an essential role in the regulation of cell cycle, specifically in the transition from G0 to G1.  It is found in very low levels in normal cells, however, in a variety of transformed cell lines, it is expressed in high amounts, and believed to contribute to transformation and malignancy.  p53 is a DNA-binding protein containing DNA-binding, oligomerization and transcription activation domains.  It is postulated to bind as a tetramer to a p53-binding site and activate expression of downstream genes that inhibit growth and/or invasion, and thus function as a tumor suppressor.  Mutants of p53 that frequently occur in a number of different human cancers fail to bind the consensus DNA binding site, and hence cause the loss of tumor suppressor activity.  Alterations of the TP53 gene occur not only as somatic mutations in human malignancies, but also as germline mutations in some cancer-prone families with Li-Fraumeni syndrome.
19809	KRT17 encodes the type I intermediate filament chain keratin 17, expressed in nail bed, hair follicle, sebaceous glands, and other epidermal appendages. Mutations in this gene lead to Jackson-Lawler type pachyonychia congenita and steatocystoma multiplex.
19810	
19811	
19812	This gene encodes a membrane glycoprotein found on peripheral blood cells, glomerular podocytes, and follicular dendritic cells. The protein is a receptor for complement components C3b and C4b and regulates the activity of the complement cascade. Variation in this protein is the basis of the Knops blood group system. The two most common alleles, F and S, differ by 8 exons and are thought to be the result of an unequal crossover event. A secreted form of the protein present in plasma has been described, but its full length nature has not been determined.
19813	
19814	
19815	
19816	
19817	
19818	
19819	
19820	The nuclear envelope creates distinct nuclear and cytoplasmic compartments in eukaryotic cells. It consists of two concentric membranes perforated by nuclear pores, large protein complexes that form aqueous channels to regulate the flow of macromolecules between the nucleus and the cytoplasm. These complexes are composed of at least 100 different polypeptide subunits, many of which belong to the nucleoporin family. This gene encodes a member of the FG-repeat-containing nucleoporins. The protein encoded by this gene is an integral membrane protein that localizes to the central spoke ring complex and participates in anchoring the nuclear pore complex to the nuclear envelope. Alternatively spliced variants that encode different protein isoforms have been described but the full-length nature of only one has been determined.
19821	
19822	
19823	
19824	
19825	
19826	
19827	
19828	
19829	
19830	
19831	The protein encoded by this gene is a member of the signal-regulatory-protein (SIRP) family, and also belongs to the immunoglobulin superfamily. SIRP family members are receptor-type transmembrane glycoproteins known to be involved in the negative regulation of receptor tyrosine kinase-coupled signaling processes. This protein can be phosphorylated by tyrosine kinases. The phospho-tyrosine residues of this PTP have been shown to recruit SH2 domain containing tyrosine phosphatases (PTP), and serve as substrates of PTPs. This protein was found to participate in signal transduction mediated by various growth factor receptors. CD47 has been demonstrated to be a ligand for this receptor protein. This gene and its product share very high similarity with several other members of the SIRP family. These related genes are located in close proximity to each other on chromosome 20p13.
19832	
19833	
19834	
19835	
19836	
19837	
19838	
19839	
19840	
19841	
19842	
19843	
19844	
19845	DEC1 encodes a basic helix-loop-helix protein expressed in various tissues.  Expression in the chondrocytes is responsive to the addition of Bt2cAMP.  Differentiated embryo chondrocyte expressed gene 1 is believed to be involved in the control of cell differentiation.
19846	
19847	
19848	
19849	
19850	
19851	The protein encoded by this gene is a membrane-associated type I inositol 1,4,5-trisphosphate (InsP3) 5-phosphatase. InsP3 5-phosphatases hydrolyze Ins(1,4,5)P3, which mobilizes intracellular calcium and acts as a second messenger mediating cell responses to various stimulation.
19852	
19853	Antiquitin is a member of subfamily 7 in the aldehyde dehydrogenase gene family. These enzymes are thought to play a major role in the detoxification of aldehydes generated by alcohol metabolism and lipid peroxidation. This particular member has homology to a previously described protein from the green garden pea, the 26g pea turgor protein. Four human antiquitin-like sequences, possibly pseudogenes, have also been identified.
19854	SPRY4 is an inhibitor of the receptor-transduced mitogen-activated protein kinase (MAPK) signaling pathway. It is positioned upstream of RAS (see HRAS; MIM 190020) activation and impairs the formation of active GTP-RAS (Leeksma et al., 2002).[supplied by OMIM]
19855	
19856	
19857	
19858	
19859	
19860	
19861	
19862	The protein encoded by this gene is a homolog of yeast Bet1 protein which is a golgi-associated protein that participates in vesicular transport from the endoplasmic reticulum (ER) to the Golgi apparatus. Studies with the rat homolog suggest that it is localized in the vesicular structures, and may be involved in the docking process of ER-derived vesicles with the cis-Golgi membrane.
19863	
19864	
19865	This gene encodes an accessory enzyme which participates in the beta-oxidation and metabolism of unsaturated fatty enoyl-CoA esters.
19866	
19867	The protein encoded by this gene is a member of the AAA family of adenosine triphosphatases (ATPases), is related to the Clp protease/heat shock family and is expressed prominently in the substantia nigra pars compacta. Mutations in this gene result in the autosomal dominant disorder, torsion dystonia 1.
19868	
19869	
19870	This gene encodes a 42.6 kD subunit of dynactin, a macromolecular complex consisting of 10-11 subunits ranging in size from 22 to 150 kD. Dynactin binds to both microtubules and cytoplasmic dynein. It is involved in a diverse array of cellular functions, including ER-to-Golgi transport, the centripetal movement of lysosomes and endosomes, spindle formation, chromosome movement, nuclear positioning, and axonogenesis. This subunit is present in 8-13 copies per dynactin molecule, and is the most abundant molecule in the dynactin complex. It is an actin-related protein, and is approximately 60% identical at the amino acid level to conventional actin.
19871	
19872	
19873	
19874	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein that is known to complex with integrins. It may function as a blood platelet activation marker. Deficiency of this protein is associated with Hermansky-Pudlak syndrome. Also this gene has been associated with tumor progression. The use of alternate polyadenylation sites has been found for this gene.
19875	This gene encodes a protein that is highly expressed in osteosarcomas. This protein binds to the hypoxia-inducible factor 1 (HIF-1), a key regulator of the hypoxic response and angiogenesis, and promotes the degradation of one of its subunits. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
19876	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin forms a complex with and functions as a regulatory subunit of CDK4 or CDK6, whose activtiy is required for cell cycle G1/S transition. This protein has been shown to interact with and be involved in the phosphorylation of tumor suppressor protein Rb. Knockout studies of the homologous gene in mouse suggest the essential roles of this gene in ovarian granulosa and germ cell proliferation. High level expression of this gene was observed in ovarian and testicular tumors.
19877	
19878	This gene encodes a member of the ZIC family of C2H2-type zinc finger proteins. Members of this family are important during development, and have been associated X-linked visceral heterotaxy and holoprosencephaly type 5. This gene is closely linked to a gene encoding zinc finger protein of the cerebellum 2, a related family member on chromosome 13. This gene encodes a protein of unknown function.
19879	The neuronal ceroid lipofuscinoses (CLN or NCL) are a group of autosomal recessive, progressive encephalopathies in children. They are characterized by psychomotor deterioration, visual failure, and the accumulation of autofluorescent lipopigment in neurons and other cell types. The main childhood forms are the infantile type (Santavuori-Haltia disease; MIM 256730), the late infantile type (Jansky-Bielschowsky disease; MIM 204500), and the juvenile type (Batten disease; MIM 204200) based on the age of onset, clinical course, neurologic and ophthalmologic findings, and ultrastructural analysis (Carpenter et al., 1977 [PubMed 193610]).[supplied by OMIM]
19880	
19881	
19882	
19883	
19884	
19885	
19886	
19887	
19888	
19889	This gene belongs to a group of vacuolar protein sorting (VPS) genes. The encoded protein is a component of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. The close structural similarity between the yeast and human proteins that make up this complex suggests a similarity in function. Expression studies in yeast and mammalian cells indicate that this protein interacts directly with VPS35, which serves as the core of the retromer complex.
19890	This gene encodes a member of an adapter protein family that binds to several tyrosine-phosphorylated proteins. The product of this gene has several SH2 and SH3 domains (src-homology domains) and is involved in several signaling pathways, recruiting cytoplasmic proteins in the vicinity of tyrosine kinase through SH2-phosphotyrosine interaction. The N-terminal SH2 domain of this protein functions as a positive regulator of transformation whereas the C-terminal SH3 domain functions as a negative regulator of transformation. Two alternative transcripts encoding different isoforms with distinct biological activity have been described.
19891	Glycoprotein Ib (GP Ib) is a platelet surface membrane glycoprotein composed of a heterodimer, an alpha chain and a beta chain, that are linked by disulfide bonds. The Gp Ib functions as a receptor for von Willebrand factor (VWF). The complete receptor complex includes noncovalent association of the alpha and beta subunits with platelet glycoprotein IX and platelet glycoprotein V. The binding of the GP Ib-IX-V complex to VWF facilitates initial platelet adhesion to vascular subendothelium after vascular injury, and also initiates signaling events within the platelet that lead to enhanced platelet activation, thrombosis, and hemostasis. This gene encodes the alpha subunit. Several polymorphisms and mutations have been described in this gene, some of which are the cause of Bernard-Soulier syndromes and platelet-type von Willebrand disease.
19892	This gene encodes a protein that may modulate the transcriptional repression activities of death-associated protein 6 (DAXX), which interacts with histone deacetylase, core histones, and other histone-associated proteins. In mouse, the encoded protein binds to the putative leucine zipper domain of macroH2A1.2, a variant H2A histone that is enriched on inactivated X chromosomes. The BTB/POZ domain of this protein has been shown in other proteins to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes. Alternative splicing of this gene results in multiple transcript variants encoding the same protein.
19893	
19894	
19895	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L22P family of ribosomal proteins. It is located in the cytoplasm. This gene has been referred to as rpL23 because the encoded protein shares amino acid identity with ribosomal protein L23 from Halobacterium marismortui; however, its official symbol is RPL17. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
19896	
19897	
19898	SLC2A10 is a member of the facilitative glucose transporter family, which plays a significant role in maintaining glucose homeostasis.[supplied by OMIM]
19899	Breakdown products of phosphoinositides are ubiquitous second messengers that function downstream of many G protein-coupled receptors and tyrosine kinases regulating cell growth, calcium metabolism, and protein kinase C activity. This gene encodes an enzyme which regulates the amount of phosphatidylinositol available for signaling by catalyzing the conversion of phosphatidic acid to CDP-diacylglycerol. This enzyme is an integral membrane protein localized to two subcellular domains, the matrix side of the inner mitochondrial membrane where it is thought to be involved in the synthesis of phosphatidylglycerol and cardiolipin and the cytoplasmic side of the endoplasmic reticulum where it functions in phosphatidylinositol biosynthesis. Two genes encoding this enzyme have been identified in humans, one mapping to human chromosome 4q21 and a second to 20p13.
19900	The protein encoded by this gene is secreted via the Golgi apparatus and may function in cartilage development and maintenance. A frequent polymorphism in the translation start codon of this gene can abolish translation and may be associated with forms of deafness. This gene is a member of the melanoma-inhibiting activity gene family. In addition, alternate polyA sites exist for this gene.
19901	
19902	This gene is a member of the XAGE subfamily, which belongs to the GAGE family. The GAGE genes are expressed in a variety of tumors and in some fetal and reproductive tissues. This gene is strongly expressed in Ewing@#$%&s sarcoma, alveolar rhabdomyosarcoma and normal testis. The protein encoded by this gene contains a nuclear localization signal and shares a sequence similarity with other GAGE/PAGE proteins. Because of the expression pattern and the sequence similarity, this protein also belongs to a family of CT (cancer-testis) antigens. Alternative splicing of this gene generates 3 transcript variants, and one of which includes 2 transcripts generated from alternate transcription initiation sites.
19903	
19904	
19905	
19906	
19907	
19908	
19909	
19910	
19911	
19912	
19913	
19914	
19915	
19916	
19917	
19918	
19919	
19920	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. The encoded protein may play a role in chondrogenesis. A pseudogene of this gene is located on chromosome 8. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
19921	
19922	
19923	
19924	
19925	
19926	
19927	
19928	
19929	
19930	
19931	
19932	
19933	
19934	
19935	
19936	
19937	
19938	
19939	
19940	
19941	
19942	
19943	
19944	
19945	
19946	
19947	
19948	
19949	
19950	The obscurin gene spans more than 150 kb, contains over 80 exons and encodes a protein of approximately 720 kDa. The encoded protein contains 68 Ig domains, 2 fibronectin domains, 1 calcium/calmodulin-binding domain, 1 RhoGEF domain with an associated PH domain, and 2 serine-threonine kinase domains. This protein belongs to the family of giant sacromeric signaling proteins that includes titin and nebulin, and may have a role in the organization of myofibrils during assembly and may mediate interactions between the sarcoplasmic reticulum and myofibrils. While several transcript variants likely exist for this gene, the full-length nature of only one has been described to date.
19951	
19952	
19953	
19954	
19955	
19956	
19957	The product of this gene belongs to the Serine/Threonine protein kinase family, and to the Ca(2+)/calmodulin-dependent protein kinase subfamily. This protein plays a role in the calcium/calmodulin-dependent (CaM) kinase cascade by phosphorylating the downstream kinases CaMK1 and CaMK4. Seven transcript variants encoding six distinct isoforms have been identified for this gene. Additional splice variants have been described but their full-length nature has not been determined. The identified isoforms exhibit a distinct ability to  undergo autophosphorylation and to phosphorylate the downstream kinases.
19958	
19959	
19960	
19961	
19962	There are 2 forms of glucose-6-phosphate dehydrogenase. G form is X-linked and H form, encoded by this gene, is autosomally linked. This H form shows activity with other hexose-6-phosphates, especially galactose-6-phosphate, whereas the G form is specific for glucose-6-phosphate. Both forms are present in most tissues, but H form is not found in red cells.
19963	
19964	
19965	HEL308 is a single-stranded DNA-dependent ATPase and DNA helicase.[supplied by OMIM]
19966	
19967	
19968	This gene encodes the human ortholog of chicken YAP protein which binds to the SH3 domain of the Yes proto-oncogene product. This protein contains a WW domain that is found in various structural, regulatory and signaling molecules in yeast, nematode, and mammals, and may be involved in protein-protein interaction.
19969	
19970	
19971	
19972	
19973	The protein encoded by this gene may play a role in intracellular vesicle trafficking. It interacts with Syntaxin 13 which mediates intracellular membrane fusion. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
19974	
19975	
19976	
19977	
19978	
19979	
19980	
19981	The BMP1 locus encodes a protein that is capable of inducing formation of cartilage in vivo.  Although other bone morphogenetic proteins are members of the TGF-beta superfamily, BMP1 encodes a protein that is not closely related to other known growth factors. BMP1 protein and procollagen C proteinase (PCP), a secreted metalloprotease requiring calcium and needed for cartilage and bone formation, are identical. PCP or BMP1 protein cleaves the C-terminal propeptides of procollagen I, II, and III and its activity is increased by the procollagen C-endopeptidase enhancer protein.  The BMP1 gene is expressed as alternatively spliced variants that share an N-terminal protease domain but differ in their C-terminal region
19982	This gene is a member of the Regulator of Complement Activation (RCA) gene cluster and encodes a protein with twenty short concensus repeat (SCR) domains. This protein is secreted into the bloodstream and has an essential role in the regulation of complement activation, restricting this innate defense mechanism to microbial infections. Mutations in this gene have been associated with hemolytic-uremic syndrome (HUS) and chronic hypocomplementemic nephropathy. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
19983	This gene encodes a protein with multiple PDZ domains. PDZ domains mediate protein-protein interactions, and proteins with multiple PDZ domains often organize multimeric complexes at the plasma membrane. This protein localizes to tight junctions and to the apical membrane of epithelial cells. A similar protein in Drosophila is a scaffolding protein which tethers several members of a multimeric signaling complex in photoreceptors. Alternative splicing results in four transcript variants encoding different isoforms. Additional transcript variants have been described but their full-length sequences have not been determined.
19984	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intra-cellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the MDR/TAP subfamily.  Members of the MDR/TAP subfamily are involved in multidrug resistance.  The function of this mitochondrial protein is unknown.
19985	
19986	The nuclear import of the spliceosomal snRNPs U1, U2, U4 and U5, is dependent on the presence of a complex nuclear localization signal. The latter is composed of the 5@#$%&-2,2,7-terminal trimethylguanosine (m3G) cap structure of the U snRNA and the Sm core domain. The protein encoded by this gene interacts specifically with m3G-cap and functions as an snRNP-specific nuclear import receptor.
19987	CD222 is a 250kDa transmembrane protein with a short cytoplasmic tail containing an internalization signal. CD222 was originally identified as a receptor for IGFII and M6P-containing proteins (e.g. lysosomal hydrolases). Lysosomal enzymes are sorted to lysosomes via CD222 either from the Golgi, where the enzymes acquire M6P, or from the extracellular space. The majority of CD222 molecules (approximately 90-95%) are located intracellularly, only 5-10% is present on the cell membrane. The internalization rate seems to be enhanced by ligand induced dimerization of CD222 as well as by phosphorylation of its cytoplasmic serine. CD222 is also a receptor for TGFbeta latency associated peptide (LAP), proliferin and may bind several molecules independently of M6P, including plasminogen, CD87 or retinoic acid. It is involved in activation of latent TGFbeta [PROW]
19988	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands.
19989	The protein encoded by this gene is a member of the STAT protein family. In response to cytokines and growth factors, STAT family members are phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. This protein can be activated by various ligands including interferon-alpha, interferon-gamma, EGF, PDGF and IL6. This protein mediates the expression of a variety of genes, which is thought to be important for cell viability in response to different cell stimuli and pathogens. Two alternatively spliced transcript variants encoding distinct isoforms have been described.
19990	
19991	
19992	
19993	This gene is a human homolog of rat DLP11 and a member of the dynein heavy chain family. It is a microtubule-dependent motor ATPase and has been reported to be involved in the movement of respiratory cilia. Mutations in this gene have been implicated in causing Kartagener Syndrome (a combination of situs inversus totalis and Primary Ciliary Dyskinesia (PCD), also called Immotile Cilia Syndrome 1 (ICS1)) and male sterility.
19994	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. However, this protein is considered a member of the cytochrome P450 superfamily on the basis of sequence similarity rather than functional similarity. This endoplasmic reticulum membrane protein catalyzes the conversion of prostglandin H2 to thromboxane A2, a potent vasoconstrictor and inducer of platelet aggregation. The enzyme plays a role in several pathophysiological processes including hemostasis, cardiovascular disease, and stroke. The gene expresses two transcript variants.
19995	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
19996	
19997	
19998	
19999	
20000	This gene product belongs to the neurexin family, members of which function in the vertebrate nervous system as cell adhesion molecules and receptors. This protein, like other neurexin proteins, contains epidermal growth factor repeats and laminin G domains. In addition, it includes an F5/8 type C domain, discoidin/neuropilin- and fibrinogen-like domains, and thrombospondin N-terminal-like domains. Alternative splicing of this gene results in 2 transcript variants encoding different isoforms.
20001	
20002	
20003	
20004	
20005	
20006	Cofactor D is one of four proteins (cofactors A, D, E, and C) involved in the pathway leading to correctly folded beta-tubulin from folding intermediates. Cofactors A and D are believed to play a role in capturing and stabilizing beta-tubulin intermediates in a quasi-native confirmation. Cofactor E binds to the cofactor D/beta-tubulin complex; interaction with cofactor C then causes the release of beta-tubulin polypeptides that are committed to the native state.
20007	The protein encoded by this gene is a member of the bone morphogenetic protein (BMP) family and the TGF-beta superfamily. This group of proteins is characterized by a polybasic proteolytic processing site which is cleaved to produce a mature protein containing seven conserved cysteine residues. The members of this family are regulators of cell growth and differentiation in both embryonic and adult tissues. Studies in yeast suggest that the protein is involved in aging. This protein is transcribed from both a monocistronic mRNA and a bicistronic mRNA that also encodes growth differentiation factor 1.
20008	The muscle acetylcholine receptor is composed of five subunits: two alpha subunits and one beta, one gamma, and one delta subunit. This gene encodes the beta subunit of the acetylcholine receptor. The acetylcholine receptor changes conformation upon acetylcholine binding leading to the opening of an ion-conducting channel across the plasma membrane. Mutations in this gene are associated with slow-channel congenital myasthenic syndrome.
20009	
20010	
20011	Lysophospholipases are enzymes that act on biological membranes to regulate the multifunctional lysophospholipids. There are alternatively spliced transcript variants described for this gene but the full length nature is not known yet.
20012	
20013	
20014	Receptors for the Fc fragment of IgG, or FCGRs (see MIM 146790), are cell surface glycoproteins of the Ig superfamily (IgSF). These receptors mediate phagocytosis of IgG-coated pathogens and promote activation of effector cells, leading to inflammatory responses and antibody-mediated cellular cytotoxicity. All FCGR genes map to human chromosome 1. Additional genes in this region, including FREB, encode FCGR homologs that are selectively expressed in B cells and may be implicated in B-cell development and lymphomagenesis.[supplied by OMIM]
20015	
20016	Palmitoyl-protein thioesterase (PPT) is a small glycoprotein that removes palmitate groups from cysteine residues in lipid-modified proteins. PPT is thought to be involved in the catabolism of lipid-modified proteins (Camp et al., 1994).[supplied by OMIM]
20017	HIV-1, the causative agent of acquired immunodeficiency syndrome (AIDS), contains an RNA genome that produces a chromosomally integrated DNA during the replicative cycle. Activation of HIV-1 gene expression by the transactivator Tat is dependent on an RNA regulatory element (TAR) located downstream of the transcription initiation site. The protein encoded by this gene is a transcriptional repressor that binds to chromosomally integrated TAR DNA and represses HIV-1 transcription. In addition, this protein regulates alternate splicing of the CFTR gene. A similar pseudogene is present on chromosome 20.
20018	
20019	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This gene is preferentially expressed in a variety of hematopoietic cells, and is an early response gene in lymphokine stimulated cells. The noncatalytic N-terminus of this PTP can interact with MAP kinases and suppress the MAP kinase activities. This PTP was shown to be involved in the regulation of T cell antigen receptor (TCR) signaling, which was thought to function through dephosphorylating the molecules related to MAP kinase pathway. Three alternatively spliced transcript variants of this gene, which encode two distinct isoforms, are reported.
20020	
20021	
20022	The specific function of this gene has not yet been determined; however, the protein it encodes is known to be a major constituent of the ARP2/3 complex. This complex is located at the cell surface and is essential to cell shape and motility through lamellipodial actin assembly and protrusion. Two transcript variants encoding different isoforms have been found for this gene.
20023	
20024	
20025	The coenzyme NAD and its derivatives are involved in hundreds of metabolic redox reactions and are utilized in protein ADP-ribosylation, histone deacetylation, and in some Ca(2+) signaling pathways. NMNAT (EC 2.7.7.1) is a central enzyme in NAD biosynthesis, catalyzing the condensation of nicotinamide mononucleotide (NMN) or nicotinic acid mononucleotide (NaMN) with the AMP moiety of ATP to form NAD or NaAD (Zhang et al., 2003).[supplied by OMIM]
20026	Histamine is a ubiquitous messenger molecule released from mast cells, enterochromaffin-like cells, and neurons. Its various actions are mediated by histamine receptors H1, H2, H3 and H4. This gene was thought to be intronless until recently. The protein encoded by this gene is an integral membrane protein and belongs to the family 1 of G protein-coupled receptors. It mediates the contraction of smooth muscles, increase in capillary permeability due to contraction of terminal venules, catecholamine release from adrenal medulla, and mediates neurotransmission in the central nervous system.
20027	
20028	
20029	The protein encoded by this gene is similar to yeast Sec31 protein. Yeast Sec31 protein is known to be a component of the COPII protein complex which is responsible for vesicle budding from endoplasmic reticulum (ER). This protein was found to colocalize with SEC13, one of the other components of COPII , in the subcellular structures corresponding to the vesicle transport function. An immunodepletion experiment confirmed that this protein is required for ER-Golgi transport. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
20030	The protein encoded by this gene is an inflammatory chemokine that belongs to the CXC chemokine family. This chemokine is produced concomitantly with interleukin-8 (IL8) in response to stimulation with either interleukin-1 (IL1) or tumor necrosis factor-alpha (TNFA). This chemokine is a potent chemotaxin involved in neutrophil activation.
20031	
20032	
20033	
20034	
20035	
20036	
20037	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like most family members. This protein interacts with phosphatidylinositol-3-phosphate, and is involved in protein trafficking. This gene results in three transcript variants encoding three distinct isoforms.
20038	
20039	
20040	This gene encodes one of the chains for type I collagen, the fibrillar collagen found in most connective tissues. Mutations in this gene are associated with osteogenesis imperfecta, Ehlers-Danlos syndrome, idiopathic osteoporosis, and atypical Marfan syndrome. Symptoms associated with mutations in this gene, however, tend to be less severe than mutations in the gene for alpha-1 type I collagen since alpha-2 is less abundant. Multiple messages for this gene result from multiple polyadenylation signals, a feature shared by most of the other collagen genes.
20041	The balance between fission and fusion regulates the morphology of mitochondria. TTC11 is a component of a mitochondrial complex that promotes mitochondrial fission (James et al., 2003).[supplied by OMIM]
20042	The protein encoded by this gene contains an RGD (Arg-Gly-Asp) motif in the N-terminal region, which confers adhesive properties to macromolecular proteins like fibronectin. It shares a high degree of sequence similarity with the mouse homolog, which has been suggested to play a role in mitochondrial biogenesis. The exact biological function of this gene is not known.
20043	Mutations in this gene are associated with Nijmegen breakage syndrome, an autosomal recessive chromosomal instability syndrome characterized by microcephaly, growth retardation, immunodeficiency, and cancer predisposition. The encoded protein is a member of the MRE11/RAD50 double-strand break repair complex which consists of 5 proteins. This gene product is thought to be involved in DNA double-strand break repair and DNA damage-induced checkpoint activation.
20044	
20045	
20046	
20047	
20048	
20049	
20050	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and a SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation. Multiple alternatively spliced transcript variants have been described for this gene, but their full-length sequences are not known.
20051	
20052	
20053	Thymus development depends on a complex series of interactions between thymocytes and the stromal component of the organ. Epithelial V-like antigen (EVA) is expressed in thymus epithelium and strongly downregulated by thymocyte developmental progression. This gene is expressed in the thymus and in several epithelial structures early in embryogenesis. It is highly homologous to the myelin protein zero and, in thymus-derived epithelial cell lines, is poorly soluble in nonionic detergents, strongly suggesting an association to the cytoskeleton. Its capacity to mediate cell adhesion through a homophilic interaction and its selective regulation by T cell maturation might imply the participation of EVA in the earliest phases of thymus organogenesis. The protein bears a characteristic V-type domain and two potential N-glycosylation sites in the extracellular domain; a putative serine phosphorylation site for casein kinase 2 is also present in the cytoplasmic tail. Two transcript variants encoding the same protein have been found for this gene.
20054	
20055	
20056	
20057	The protein encoded by this gene is an adaptor molecule that interacts with various cell surface receptors and mediates cell apoptotic signals. Through its C-terminal death domain, this protein can be recruited by TNFRSF6/Fas-receptor, tumor necrosis factor receptor, TNFRSF25, and TNFSF10/TRAIL-receptor, and thus it participates in the death signaling initiated by these receptors. Interaction of this protein with the receptors unmasks the N-terminal effector domain of this protein, which allows it to recruit caspase-8, and thereby activate the cysteine protease cascade. Knockout studies in mice also suggest the importance of this protein in early T cell development.
20058	
20059	Apoptosis is a cell death process that removes toxic and/or useless cells during mammalian development. The apoptotic process is accompanied by shrinkage and fragmentation of the cells and nuclei and degradation of the chromosomal DNA into nucleosomal units. DNA fragmentation factor (DFF) is a heterodimeric protein of 40-kD (DFFB) and 45-kD (DFFA) subunits. DFFA is the substrate for caspase-3 and triggers DNA fragmentation during apoptosis. DFF becomes activated when DFFA is cleaved by caspase-3. The cleaved fragments of DFFA dissociate from DFFB, the active component of DFF. DFFB has been found to trigger both DNA fragmentation and chromatin condensation during apoptosis. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants has not been determined.
20060	The protein encoded by this gene belongs to the highly conserved cyclin family, whose members are characterized by a dramatic periodicity in protein abundance through the cell cycle. Cyclins function as regulators of CDK kinases. Different cyclins exhibit distinct expression and degradation patterns which contribute to the temporal coordination of each mitotic event. This cyclin forms a complex with and functions as a regulatory subunit of CDK4 or CDK6, whose activtiy is required for cell cycle G1/S transition. This protein has been shown to interact with and be involved in the phosphorylation of tumor suppressor protein Rb. Knockout studies of the homologous gene in mouse suggest the essential roles of this gene in ovarian granulosa and germ cell proliferation. High level expression of this gene was observed in ovarian and testicular tumors.
20061	
20062	
20063	
20064	
20065	This gene includes 11 exons spanning 25 kb and maps to a region of chromosome 16 that is sometimes deleted in breast and prostrate cancer. The second intron contains an apparently intronless gene, C16orf3, that is transcribed in the opposite orientation. This gene is a putative tumor suppressor gene.
20066	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is part of the V0 domain. This gene had the previous symbols of ATP6C and ATP6L.
20067	Fibrosin is a lymphokine secreted by activated lymphocytes that induces fibroblast proliferation (Prakash and Robbins, 1998).[supplied by OMIM]
20068	This gene encodes a protein with 5-phosphatase activity toward polyphosphate inositol. The protein localizes to the cytosol in regions lacking actin stress fibers. It is thought that this protein may negatively regulate the actin cytoskeleton. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
20069	
20070	This gene is located within the Smith-Magenis syndrome region on chromosome 17. It encodes a protein of unknown function.
20071	The protein encoded by this gene is a type I membrane protein and member of the immunoglobulin superfamily. It is thought to be involved in the process of myelination. It is a lectin that binds to sialylated glycoconjugates and mediates certain myelin-neuron cell-cell interactions. Two alternatively spliced transcripts encoding different isoforms have been described for this gene.
20072	
20073	
20074	
20075	
20076	
20077	The protein encoded by this gene is a preproprotein that is proteolytically processed to form the secreted opioid peptides beta-neoendorphin, dynorphin, leu-enkephalin, rimorphin, and leumorphin. These peptides are ligands for the kappa-type of opioid receptor. Dynorphin is involved in modulating responses to several psychoactive substances, including cocaine.
20078	Similarity to a transcriptional repressor suggests that this gene@#$%&s protein product is involved in the regulation of transcription. Alternative splicing of this gene results in four transcript variants encoding different isoforms. Additional transcript variants have been described, but their full length sequences have not been determined.
20079	
20080	The protein encoded by this gene is a subunit of the interleukin 13 receptor. This subunit forms a receptor complex with IL4 receptor alpha, a subunit shared by IL13 and IL4 receptors. This subunit serves as a primary IL13-binding subunit of the IL13 receptor, and may also be a component of IL4 receptors. This protein has been shown to bind tyrosine kinase TYK2, and thus may mediate the signaling processes that lead to the activation of JAK1, STAT3 and STAT6 induced by IL13 and IL4.
20081	
20082	There are two copies of this gene within the P1 palindrome on chromosome Y. This record represents the more telomeric copy.
20083	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the high sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
20084	
20085	
20086	
20087	
20088	This gene encodes a potential transmembrane protein that may function in vesicle-mediated transport and sorting of proteins within the cell. This protein may play a role in the development and the function of the eye, hematological system, and central nervous system. Mutations in this gene have been associated with Cohen syndrome. Multiple splice variants encoding distinct isoforms have been identified for this gene.
20089	
20090	
20091	Members of the F-box protein family, such as FBXO40, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
20092	
20093	
20094	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S21E family of ribosomal proteins. It is located in the cytoplasm. Alternative splice variants that encode different protein isoforms have been described, but their existence has not been verified. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
20095	
20096	
20097	
20098	
20099	
20100	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
20101	
20102	
20103	
20104	
20105	
20106	
20107	
20108	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
20109	A reciprocal translocation between chromosomes 22 and 9 produces the Philadelphia chromosome, which is often found in patients with chronic myelogenous leukemia. The chromosome 22 breakpoint for this translocation is located within the BCR gene. The translocation produces a fusion protein which is encoded by sequence from both BCR and ABL, the gene at the chromosome 9 breakpoint. Although the BCR-ABL fusion protein has been extensively studied, the function of the normal BCR gene product is not clear. The protein has serine/threonine kinase activity and is a GTPase-activating protein for p21rac.
20110	
20111	
20112	
20113	
20114	
20115	
20116	
20117	
20118	
20119	
20120	
20121	
20122	
20123	
20124	
20125	
20126	
20127	
20128	
20129	
20130	
20131	
20132	
20133	
20134	
20135	
20136	
20137	
20138	
20139	
20140	Dihydrolipoamide acetyltransferase (DLAT; EC 2.3.1.12) is 1 of 6 components of the mammalian pyruvate dehydrogenase complex (PDC) of the inner mitochondrial membrane. Patients with primary biliary cirrhosis (PBC; MIM 109720) show autoantibodies to DLAT.[supplied by OMIM]
20141	
20142	
20143	The nuclear pore complex is a massive structure that extends across the nuclear envelope, forming a gateway that regulates the flow of macromolecules between the nucleus and the cytoplasm. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells. The protein encoded by this gene is a membrane-spanning glycoprotein that is a major component of the nuclear pore complex.
20144	
20145	
20146	
20147	
20148	
20149	
20150	
20151	TRIP11 was first identified through its ability to interact functionally with thyroid hormone receptor-beta (THRB; MIM 190160). It has also been found in association with the Golgi apparatus and microtubules.[supplied by OMIM]
20152	This gene encodes a protein containing zinc-finger-like repeats and a calpain-like protease domain. The encoded protein may function as a transcription factor, RNA-binding protein, or in protein-protein interactions during visual system development.
20153	
20154	The protein encoded by this gene is the alpha component of fibrinogen, a blood-borne glycoprotein comprised of three pairs of nonidentical polypeptide chains. Following vascular injury, fibrinogen is cleaved by thrombin to form fibrin which is the most abundant component of blood clots. In addition, various cleavage products of fibrinogen and fibrin regulate cell adhesion and spreading, display vasoconstrictor and chemotactic activities, and are mitogens for several cell types. Mutations in this gene lead to several disorders, including dysfibrinogenemia, hypofibrinogenemia, afibrinogenemia and renal amyloidosis. Alternative splicing results in two isoforms which vary in the carboxy-terminus.
20155	
20156	
20157	
20158	Mutations in dysferlin, a protein associated with the plasma membrane, can cause muscle weakness that affects both proximal and distal muscles. The protein encoded by this gene is a type II membrane protein that is structurally similar to dysferlin. It is a member of the ferlin family and associates with both plasma and nuclear membranes. The protein contains C2 domains that play a role in calcium-mediated membrane fusion events, suggesting that it may be involved in membrane regeneration and repair. Two transcript variants encoding different isoforms have been found for this gene. Other possible variants have been detected, but their full-length natures have not been determined.
20159	
20160	
20161	MYH7 encodes the cardiac muscle beta (or slow) isoform of myosin. Changes in the relative abundance of MYH7 and MYH6 (the alpha, or fast, isoform of cardiac myosin heavy chain) correlate with the contractile velocity of cardiac muscle. Mutations in MYH7 are associated with familial hypertrophic cardiomyopathy.
20162	The ABL1 protooncogene encodes a cytoplasmic and nuclear protein tyrosine kinase that has been implicated in processes of cell differentiation, cell division, cell adhesion, and stress response.  Activity of c-Abl protein is negatively regulated by its SH3 domain, and deletion of the SH3 domain turns ABL1 into an oncogene.  The t(9;22) translocation results in the head-to-tail fusion of the BCR (MIM:151410) and ABL1 genes present in many cases of chronic myelogeneous leukemia.  The DNA-binding activity of the ubiquitously expressed ABL1 tyrosine kinase is regulated by CDC2-mediated phosphorylation, suggesting a cell cycle function for ABL1.  The ABL1 gene is expressed as either a 6- or 7-kb mRNA transcript, with alternatively spliced first exons spliced to the common exons 2-11.
20163	
20164	
20165	The signal sequence receptor (SSR) is a glycosylated endoplasmic reticulum (ER) membrane receptor associated with protein translocation across the ER membrane. The SSR consists of 2 subunits, a 34-kD glycoprotein (alpha-SSR or SSR1) and a 22-kD glycoprotein (beta-SSR or SSR2). The human beta-signal sequence receptor gene (SSR2) maps to chromosome bands 1q21-q23.
20166	
20167	
20168	
20169	This gene is a member of a multi-gene family which shares strong similarity with the Drosophila dishevelled gene, dsh. The Drosophila dishevelled gene encodes a cytoplasmic phosphoprotein that regulates cell proliferation.
20170	CD14 is a surface protein preferentially expressed on monocytes/macrophages.  It binds lipopolysaccharide binding protein and recently has been shown to bind apoptotic cells.
20171	This gene is part of a 500 kb inverted duplication on chromosome 5q13. This duplicated region contains at least four genes and repetitive elements which make it prone to rearrangements and deletions. The repetitiveness and complexity of the sequence have also caused difficulty in determining the organization of this genomic region. This copy of the gene is full length; additional copies with truncations and internal deletions are also present in this region of chromosome 5q13. It is thought that this gene is a modifier of spinal muscular atrophy caused by mutations in a neighboring gene, SMN1. The protein encoded by this gene contains regions of homology to two baculovirus inhibitor of apoptosis proteins, and it is able to suppress apoptosis induced by various signals. A second transcript for this gene has been described, but its full length nature has not been determined.
20172	Myosin VI is a molecular motor involved in intracellular vesicle and organelle transport. It is one of the so-called unconventional myosins.[supplied by OMIM]
20173	
20174	Three classes of protein carboxyl methyltransferases, distinguished by their methyl-acceptor substrate specificity, have been found in prokaryotic and eukaryotic cells. The type II enzyme catalyzes the transfer of a methyl group from S-adenosyl-L-methionine to the free carboxyl groups of D-aspartyl and L-isoaspartyl residues. These methyl-accepting residues result from the spontaneous deamidation, isomerization, and racemization of normal L-aspartyl and L-asparaginyl residues and represent sites of covalent damage to aging proteins PCMT1 (EC 2.1.1.77) is a protein repair enzyme that initiates the conversion of abnormal D-aspartyl and L-isoaspartyl residues to the normal L-aspartyl form.[supplied by OMIM]
20175	This protein belongs to the aldehyde dehydrogenases family of proteins. It plays a role in a pathway of 9-cis-retinoic acid biosynthesis in vivo. This enzyme converts 9-cis-retinal into the retinoid X receptor ligand 9-cis-retinoic acid, and has approximately 40-fold higher activity with 9-cis-retinal than with all-trans-retinal. Therefore, it is the first known aldehyde dehydrogenase to show a preference for 9-cis-retinal relative to all-trans-retinal. Two transcript variants encoding distinct isoforms have been identified for this gene.
20176	
20177	
20178	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form complex with G proteins and stimulate Rho-dependent signals.
20179	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
20180	This gene encodes a secreted protein resembling members of the ADAMTS (a disintegrin and metalloproteinase with thrombospondin motif) family. This protein lacks the propeptide region and the metalloproteinase and disintegrin-like domains, which are typical of the ADAMTS family, but contains other ADAMTS domains, including the thrombospondin type 1 motif. This protein may have important functions in the extracellular matrix. Alternative splicing of this gene results in 3 transcript variants encoding different isoforms.
20181	In eukaryotic cells, protein transport between the endoplasmic reticulum and Golgi compartments is mediated in part by non-clathrin-coated vesicular coat proteins (COPs).  Seven coat proteins have been identified, and they represent subunits of a complex known as coatomer. The subunits are designated alpha-COP, beta-COP, beta-prime-COP, gamma-COP, delta-COP, epsilon-COP, and zeta-COP.  The alpha-COP, encoded by COPA, shares high sequence similarity with RET1P, the alpha subunit of the coatomer complex in yeast.  Also, the N-terminal 25 amino acids of alpha-COP encode the bioactive peptide, xenin, which stimulates exocrine pancreatic secretion and may act as a gastrointestinal hormone.
20182	
20183	This gene encodes the D2 subtype of the dopamine receptor. This G-protein coupled receptor inhibits adenylyl cyclase activity. A missense mutation in this gene causes myoclonus dystonia; other mutations have been associated with schizophrenia. Alternative splicing of this gene results in two transcript variants encoding different isoforms. A third variant has been described, but it has not been determined whether this form is normal or due to aberrant splicing.
20184	
20185	The increased expression level of this gene is associated with HER-2/neu proto-oncogene overexpression. Amplification and resulting overexpression of this proto-oncogene are found in approximately 30% of human breast and 20% of human ovarian cancers. Further characterization of the product of this gene may yield insight into the fundamental biology and pathogenetic effects of HER-2/neu overexpression in human breast and ovarian cancer cells. Alternatively spliced variants encoding different isoforms have been identified for this gene.
20186	
20187	
20188	
20189	This gene encodes a member of the epidermal growth factor (EGF) receptor family of receptor tyrosine kinases. This protein has no ligand binding domain of its own and therefore cannot bind growth factors. However, it does bind tightly to other ligand-bound EGF receptor family members to form a heterodimer, stabilizing ligand binding and enhancing kinase-mediated activation of downstream signalling pathways, such as those involving mitogen-activated protein kinase and phosphatidylinositol-3 kinase. Allelic variations at amino acid positions 654 and 655 of isoform a (positions 624 and 625 of isoform b) have been reported, with the most common allele, Ile654/Ile655, shown here. Amplification and/or overexpression of this gene has been reported in numerous cancers, including breast and ovarian tumors. Alternative splicing results in several additional transcript variants, some encoding different isoforms and others that have not been fully characterized.
20190	
20191	
20192	This gene is a member of the cyclophilin family of peptidylprolyl isomerases. The cyclophilins are a highly conserved ubiquitous family, members of which play an important role in protein folding, immunosuppression by cyclosporin A, and infection of HIV-1 virions. This protein interacts with the proteinase inhibitor eglin c and is localized in the nucleus. This gene encodes three transcript variants encoding two distinct isoforms.
20193	
20194	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit.
20195	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. This gene encodes a protein which contains 6 extracellular cadherin domains, a transmembrane domain and a cytoplasmic tail differing from those of the classical cadherins. Although its specific function is undetermined, the cadherin-related neuronal receptor is thought to play a role in the establishment and function of specific cell-cell connections in the brain.
20196	Iduronate-2-sulfatase is required for the lysosomal degradation of heparan sulfate and dermatan sulfate. Mutations in this X-chromosome gene that result in enzymatic deficiency lead to the sex-linked Mucopolysaccharidosis Type II, also known as Hunter Syndrome. Iduronate-2-sulfatase has a strong sequence homology with human arylsulfatases A, B, and C, and human glucosamine-6-sulfatase. A splice variant of this gene has been described.
20197	FCRL2 belongs to the Fc receptor family. Fc receptors are involved in phagocytosis, antibody-dependent cell cytotoxicity, immediate hypersensitivity, and transcytosis of immunoglobulins via their ability to bind immunoglobulin (Ig) constant regions (Chikaev et al., 2005).[supplied by OMIM]
20198	The exact function of the protein encoded by this gene is not known. However, this gene product contains a KRAB domain (which is involved in protein-protein interactions) at the N-terminus, and a transposase domain at the C-terminus, suggesting that it may belong to the family of DNA-mediated transposons in human.
20199	
20200	The second major step in the catabolism of the branched-chain amino acids, isoleucine, leucine, and valine, is catalyzed by the branched-chain alpha-keto acid dehydrogenase complex (BCKD; EC 1.2.4.4), an inner-mitochondrial enzyme complex that consists of 3 catalytic components: a heterotetrameric (alpha2, beta2) branched-chain alpha-ketoacid decarboxylase (E1-alpha; MIM 608348 and E1-beta; MIM 248611), a homo-24-meric dihydrolipoyl transacylase (E2), and a homodimeric dihydrolipoamide dehydrogenase (E3; MIM 246900). The reaction is irreversible and constitutes the first committed step in BCAA oxidation. The complex also contains 2 regulatory enzymes, a kinase and a phosphorylase. The DBT gene encodes the E2 component.[supplied by OMIM]
20201	
20202	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Sequence analysis identified two transcript variants.
20203	
20204	
20205	Metabolic N-oxidation of the diet-derived amino-trimethylamine (TMA) is mediated by flavin-containing monooxygenase and is subject to an inherited FMO3 polymorphism in man resulting in a small subpopulation with reduced TMA N-oxidation capacity resulting in fish odor syndrome Trimethylaminuria. Three forms of the enzyme, FMO1 found in fetal liver, FMO2 found in adult liver, and FMO3 are encoded by genes clustered in the 1q23-q25 region.   Flavin-containing monooxygenases are NADPH-dependent flavoenzymes that catalyzes the oxidation of soft nucleophilic heteroatom centers in drugs, pesticides, and xenobiotics.
20206	
20207	This gene encodes a protein that is part of the EGFR pathway. The protein is present at clatherin-coated pits and is involved in receptor-mediated endocytosis of EGF. Notably, this gene is rearranged with the HRX/ALL/MLL gene in acute myelogeneous leukemias. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
20208	The protein encoded by this gene is a subunit of the multisubunit NADH:ubiquinone oxidoreductase (complex I). Mammalian complex I is composed of 45 different subunits. It locates at the mitochondrial inner membrane. This protein has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone.
20209	The protein encoded by this gene belongs to the ubiquitin-conjugating enzyme family. It binds selectively to a large region at the N terminus of huntingtin. This interaction is not influenced by the length of the huntingtin polyglutamine tract. This protein has been implicated in the degradation of huntingtin and suppression of apoptosis.
20210	The ZFYVE16 gene encodes endofin, an endosomal protein implicated in regulating membrane trafficking. It is characterized by the presence of a phosphatidylinositol 3-phosphate-binding FYVE domain positioned in the middle of the molecule (Seet et al., 2004).[supplied by OMIM]
20211	
20212	
20213	
20214	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1, a B-box type 2, and a coiled-coil region. The protein localizes to both the nucleus and the cytoplasm, and may represent a participant in the initiation of glycogen synthesis. Multiple transcript variants have been found for this gene, and some of them encode the same isoform.
20215	
20216	
20217	Calcitonin gene-related peptide (CGRP; MIM 114130), a 37-amino acid neuropeptide, induces increased intracellular cAMP levels. This occurs when CGRP binds to a CGRP receptor (CRLR, or CGRPR; MIM 114190) after the latter is transported to the cell surface by RAMP1 (MIM 605153). CGRPR component protein, or CGRPRCP, modulates CGRP responsiveness in a variety of cell types.[supplied by OMIM]
20218	This gene product is a sequence-specific, single-stranded DNA-binding protein. It binds preferentially to the single strand of the purine-rich element termed PUR, which is present at origins of replication and in gene flanking regions in a variety of eukaryotes from yeasts through humans. Thus, it is implicated in the control of both DNA replication and transcription. Deletion of this gene has been associated with myelodysplastic syndrome and acute myelogenous leukemia.
20219	
20220	
20221	
20222	
20223	
20224	
20225	
20226	The protein encoded by this gene is a member of the serine/threonine protein kinase family. Although this kinase is found in many tissues, its expression in lymphoid follicles is restricted to the cells of germinal centre, where it may participate in B-cell differentiation. This kinase can be activated by TNF-alpha, and has been shown to specifically activate MAP kinases. This kinase is also found to interact with TNF receptor-associated factor 2 (TRAF2), which is involved in the activation of MAP3K1/MEKK1.
20227	The product encoded by this gene belongs to the heat shock protein 70 family which contains both heat-inducible and constitutively expressed members. The latter are called heat-shock cognate proteins. This gene encodes a heat-shock cognate protein. This protein binds to nascent polypeptides to facilitate correct folding. It also functions as an ATPase in the disassembly of clathrin-coated vesicles during transport of membrane components through the cell. Two alternatively spliced variants have been characterized to date.
20228	
20229	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit contains an N-terminal glutathione transferase domain, which may be involved in regulating the assembly of multisubunit complexes containing this elongation factor and aminoacyl-tRNA synthetases.
20230	
20231	
20232	
20233	This gene was identified upon genomic analysis of a gene-dense region at human chromosome 12p13. It appears to be mainly expressed in the brain, however, its function is not known. Alternatively spliced transcript variants encoding different isoforms have been identified.
20234	
20235	WW domain-containing proteins are found in all eukaryotes and play an important role in the regulation of a wide variety of cellular functions such as protein degradation, transcription, and RNA splicing. This gene encodes a protein which contains 2 WW domains and a coiled-coil region. It is ubiquitously expressed in adult tissues. The encoded protein is 94% identical to the mouse protein at the amino acid level.
20236	The FYVE domain mediates the recruitment of proteins involved in membrane trafficking and cell signaling to phosphatidylinositol 3-phosphate (PtdIns(3)P)-containing membranes. This gene encodes a protein which contains two zinc-binding FYVE domains in tandem. This protein displays a predominantly Golgi, endoplasmic reticulum and vesicular distribution. Alternatively spliced transcript variants have been found for this gene, and they encode two isoforms with different sizes.
20237	
20238	
20239	
20240	
20241	The protein encoded by this gene is one subunit of a cleavage factor required for 3@#$%& RNA cleavage and polyadenylation processing. The interaction of the protein with the RNA is one of the earliest steps in the assembly of the 3@#$%& end processing complex and facilitates the recruitement of other processing factors. This gene encodes the 25kD subunit of the protein complex, which is composed of four polypeptides.
20242	
20243	
20244	
20245	The product of this gene belongs to the family of basic helix-loop-helix leucine zipper (bHLH-Zip) transcription factors. These factors form heterodimers with Mad proteins and play a role in proliferation, determination and differentiation. This gene product may act to diversify Mad family function by its restricted association with a subset of the Mad family of transcriptional repressors, namely, Mad1 and Mad4. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
20246	
20247	
20248	
20249	
20250	
20251	
20252	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel consists of three main subunits (a, b, c). This gene encodes the epsilon subunit of the catalytic core. Two pseudogenes of this gene are located on chromosomes 4 and 13.
20253	This gene encodes a protein that contains a Ras association domain. Similar to its cattle and sheep counterparts, this gene is located near the prion gene. The specific function of this gene has not yet been determined. Three alternatively spliced transcript variants of this gene encoding two distinct isoforms have been reported.
20254	The protein encoded by this gene is essential for meiotic homologous recombination. Genetic recombination in meiosis plays an important role in generating diversity of genetic information. The product of this gene is structurally and evolutionary related to the products of the yeast RAD51 and E. coli RecA genes. Alternative splice variants of this gene have been described but their full-length nature has not been determined.
20255	The protein encoded by this gene is a subunit of the interleukin 13 receptor. This subunit forms a receptor complex with IL4 receptor alpha, a subunit shared by IL13 and IL4 receptors. This subunit serves as a primary IL13-binding subunit of the IL13 receptor, and may also be a component of IL4 receptors. This protein has been shown to bind tyrosine kinase TYK2, and thus may mediate the signaling processes that lead to the activation of JAK1, STAT3 and STAT6 induced by IL13 and IL4.
20256	
20257	This gene encodes a protein required for synthesis of N-acetylglucosaminyl phosphatidylinositol (GlcNAc-PI), the first intermediate in the biosynthetic pathway of GPI anchor. The GPI anchor is a glycolipid found on many blood cells and which serves to anchor proteins to the cell surface. Paroxysmal nocturnal hemoglobinuria, an acquired hematologic disorder, has been shown to result from mutations in this gene. Three mRNA species have been described that result from alternative splicing of exon 2.
20258	This gene is a member of the XAGE subfamily, which belongs to the GAGE family. The GAGE genes are expressed in a variety of tumors and in some fetal and reproductive tissues. The protein encoded by this gene shares a sequence similarity with other GAGE/PAGE proteins. Because of the expression pattern and the sequence similarity, this protein also belongs to a family of CT (cancer-testis) antigens.
20259	
20260	
20261	
20262	
20263	
20264	
20265	
20266	
20267	
20268	
20269	CBL (MIM 165360) constitutively interacts with SH3 domain-containing proteins and, upon tyrosine phosphorylation, with SH2 domain-containing proteins. The SH3KBP1 gene encodes an 85-kD CBL-interacting protein that enhances tumor necrosis factor (MIM 191160)-mediated apoptotic cell death (Narita et al., 2001).[supplied by OMIM]
20270	
20271	
20272	
20273	Members of the trefoil family are characterized by having at least one copy of the trefoil motif, a 40-amino acid domain that contains three conserved disulfides. They are stable secretory proteins expressed in gastrointestinal mucosa. Their functions are not defined, but they may protect the mucosa from insults, stabilize the mucus layer and affect healing of the epithelium. This gene is expressed in goblet cells of the intestines and colon. This gene and two other related trefoil family member genes are found in a cluster on chromosome 21.
20274	
20275	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intracellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the ABC1 subfamily.  Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes.  With cholesterol as its substrate, this protein functions as a cholesteral efflux pump in the cellular lipid removal pathway.  Mutations in this gene have been associated with Tangier@#$%&s disease and familial high-density lipoprotein deficiency.
20276	This gene encodes a putative retinal G-protein coupled receptor. The gene is a member of the opsin subfamily of the 7 transmembrane, G-protein coupled receptor 1 family. Like other opsins which bind retinaldehyde, it contains a conserved lysine residue in the seventh transmembrane domain. The protein acts as a photoisomerase to catalyze the conversion of all-trans-retinal to 11-cis-retinal. The reverse isomerization occurs with rhodopsin in retinal photoreceptor cells. The protein is exclusively expressed in tissue adjacent to retinal photoreceptor cells, the retinal pigment epithelium and Mueller cells. This gene may be associated with autosomal recessive and autosomal dominant retinitis pigmentosa (arRP and adRP, respectively). Alternative splicing results in multiple transcript variants encoding different isoforms.
20277	
20278	
20279	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S26E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
20280	
20281	
20282	This ribonuclease gene is a novel member of the Rh/T2/S-glycoprotein class of extracellular ribonucleases. It is a single copy gene that maps to 6q27, a region associated with human malignancies and chromosomal rearrangement.
20283	
20284	
20285	
20286	
20287	
20288	
20289	
20290	
20291	The gamma globin genes (HBG1 and HBG2) are normally expressed in the fetal liver, spleen and bone marrow. Two gamma chains together with two alpha chains constitute fetal hemoglobin (HbF) which is normally replaced by adult hemoglobin (HbA) at birth. In some beta-thalassemias and related conditions, gamma chain production continues into adulthood. The two types of gamma chains differ at residue 136 where glycine is found in the G-gamma product (HBG2) and alanine is found in the A-gamma product (HBG1). The former is predominant at birth. The order of the genes in the beta-globin cluster is: 5@#$%&-epsilon -- gamma-G -- gamma-A -- delta -- beta--3@#$%&
20292	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the non-ATPase subunits of the 19S regulator lid.
20293	
20294	
20295	
20296	
20297	
20298	
20299	
20300	
20301	
20302	
20303	
20304	
20305	This gene encodes a microtubule-associated protein. The encoded protein is a calmodulin-binding and calmodulin-regulated protein that is involved in microtubule stabilization. Two transcript variants encoding distinct isoforms have been identified for this gene.
20306	Sm-like proteins were identified in a variety of organisms based on sequence homology with the Sm protein family (see SNRPD2; MIM 601061). Sm-like proteins contain the Sm sequence motif, which consists of 2 regions separated by a linker of variable length that folds as a loop. The Sm-like proteins are thought to form a stable heteromer present in tri-snRNP particles, which are important for pre-mRNA splicing.[supplied by OMIM]
20307	
20308	
20309	
20310	
20311	The protein encoded by this gene is a member of the immunoglobulin superfamily. It is a glycosylphosphatidylinositol (GPI)-anchored neuronal membrane protein that functions as a cell adhesion molecule. It may play a role in the formation of axon connections in the developing nervous system. It may also be involved in glial tumorigenesis and may provide a potential target for therapeutic intervention.
20312	
20313	
20314	HLA-E belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-E binds a restricted subset of peptides derived from the leader peptides of other class I molecules. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail.
20315	This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. Mutations in this gene are associated with X-linked Alport syndrome, also known as hereditary nephritis. Like the other members of the type IV collagen gene family, this gene is organized in a head-to-head conformation with another type IV collagen gene so that each gene pair shares a common promoter. Three transcript variants have been identified for this gene.
20316	
20317	
20318	
20319	
20320	
20321	
20322	This gene encodes a component of prolyl 4-hydroxylase, a key enzyme in collagen synthesis composed of two identical alpha subunits and two beta subunits. The encoded protein is one of several different types of alpha subunits and provides the major part of the catalytic site of the active enzyme. In collagen and related proteins, prolyl 4-hydroxylase catalyzes the formation of 4-hydroxyproline that is essential to the proper three-dimensional folding of newly synthesized procollagen chains. Alternatively spliced transcript variants have been observed, but their full-length nature has not been determined.
20323	
20324	
20325	
20326	
20327	
20328	The cytoplasmic peripheral membrane protein encoded by this gene functions as a protein-tyrosine kinase substrate in microvilli.  As a member of the ERM protein family, this protein serves as an intermediate between the plasma membrane and the actin cytoskeleton.  It plays a key role in cell surface structure adhesion, migration, and organization.
20329	The protein encoded by this gene is a peripheral membrane protein localized to the trans-Golgi network. It is sensitive to brefeldin A. This encoded protein contains a GRIP domain which is thought to be used in targeting. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
20330	
20331	
20332	
20333	
20334	Cancer-associated chromosomal changes often involve regions containing fragile sites. This gene maps to a commom fragile site on chromosome 7q31.2 designated FRA7G. This gene is similar to mouse Testin, a testosterone-responsive gene encoding a Sertoli cell secretory protein containing three LIM domains. LIM domains are double zinc-finger motifs that mediate protein-protein interactions between transcription factors, cytoskeletal proteins and signaling proteins. Multiple protein isoforms are encoded by transcript variants of this gene.
20335	Studies in Drosophila have identified this gene as encoding a positive regulator of the Notch-signaling pathway. The human gene encodes a protein of unknown function; however, it may play a role in basic helix-loop-helix transcription factor activity.
20336	
20337	
20338	
20339	
20340	
20341	
20342	This gene is a member of a family of sulfate/anion transporter genes. Family members are well conserved in their genomic (number and size of exons) and protein (aa length among species) structures, but have markedly different tissue expression patterns. This gene is primarily expressed in the liver, pancreas, and brain. Three splice variants that encode different isoforms have been identified.
20343	This gene is a member of the Period family of genes and is expressed in a circadian pattern in the suprachiasmatic nucleus, the primary circadian pacemaker in the mammalian brain. Genes in this family encode components of the circadian rhythms of locomotor activity, metabolism, and behavior. Circadian expression in the suprachiasmatic nucleus continues in constant darkness, and a shift in the light/dark cycle evokes a proportional shift of gene expression in the suprachiasmatic nucleus. The specific function of this gene is not yet known.
20344	
20345	
20346	REV1L is the human homolog of the S. cerevisiae mutagenesis protein Rev1. The REV1 protein is conserved from yeast to humans. All REV1 proteins contain a BRCT domain, which is important in protein-protein interactions.[supplied by OMIM]
20347	This gene is a member of a multi-gene family which shares strong similarity with the Drosophila dishevelled gene, dsh. The Drosophila dishevelled gene encodes a cytoplasmic phosphoprotein that regulates cell proliferation.
20348	
20349	This gene encodes class I alcohol dehydrogenase, gamma subunit, which is a member of the alcohol dehydrogenase family. Members of this enzyme family metabolize a wide variety of substrates, including ethanol, retinol, other aliphatic alcohols, hydroxysteroids, and lipid peroxidation products. Class I alcohol dehydrogenase, consisting of several homo- and heterodimers of alpha, beta, and gamma subunits, exhibits high activity for ethanol oxidation and plays a major role in ethanol catabolism. Three genes encoding alpha, beta and gamma subunits are tandemly organized in a genomic segment as a gene cluster.
20350	This gene encodes the alpha chain of type XII collagen, a member of the FACIT (fibril-associated collagens with interrupted triple helices) collagen family. Type XII collagen is a homotrimer found in association with type I collagen, an association that is thought to modify the interactions between collagen I fibrils and the surrounding matrix. Two transcript variants encoding different isoforms have been identified for this gene.
20351	This gene encodes a protein that is highly similar to the CDC10 protein of Saccharomyces cerevisiae. The protein also shares similarity with Diff 6 of Drosophila and with H5 of mouse. Each of these similar proteins, including the yeast CDC10, contains a GTP-binding motif. The yeast CDC10 protein is a structural component of the 10 nm filament which lies inside the cytoplasmic membrane and is essential for cytokinesis. Although the exact function of this gene has not yet been determined, its high similarity to yeast CDC10 and the high conservative nature of eukaryotic cell cycle machinery suggest a similar role to that of its yeast counterpart. Alternative splicing results in two transcript variants encoding different isoforms.
20352	
20353	Similarity to a related locus suggests that this gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. This subunit exists as a heterodimer with another polymerase subunit; together they form a core subassembly unit of the polymerase. Two similar genes are located downstream of this locus. Alternative splicing of this gene results in three transcript variants encoding different isoforms.
20354	Mitochondrial uncoupling proteins (UCP) are members of the larger family of mitochondrial anion carrier proteins (MACP). UCPs separate oxidative phosphorylation from ATP synthesis with energy dissipated as heat, also referred to as the mitochondrial proton leak. UCPs facilitate the transfer of anions from the inner to the outer mitochondrial membrane and the return transfer of protons from the outer to the inner mitochondrial membrane. They also reduce the mitochondrial membrane potential in mammalian cells. Tissue specificity occurs for the different UCPs and the exact methods of how UCPs transfer H+/OH- are not known. UCPs contain the three homologous protein domains of MACPs. This gene has tissue-specific transcription initiation with other transcription initiation sites upstream of SM-1 (major skeletal muscle site). Chromosomal order is 5@#$%&-UCP3-UCP2-3@#$%&. Two splice variants have been found for this gene.
20355	This gene encodes a sodium-independent transporter which mediates cellular uptake of organic ions in the liver. Its substrates include bile acids, bromosulphophthalein, and some steroidal compounds. The protein is a member of the SLC21A family of solute carriers. Alternate splicing of this gene results in three transcript variants encoding two different isoforms.
20356	Branching structures in complex N-glycans are synthesized on a common core structure of Man3GlcNAc2Asn in the Golgi apparatus by the N-acetylglucosaminyltransferases. The mannosyl (alpha-1,3-)-glycoprotein beta-1,4-N-acetylglucosaminyltransferases, which include isoenzyme A (MGAT4A) and isoenzyme B (MGAT4B), are key N-acetylglucosaminyltransferases regulating formation of tri- and other multiantennary structures. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
20357	
20358	This gene encodes a kinase that specifically phosphorylates the N-terminal region of the non-collagenous domain of the alpha 3 chain of type IV collagen, known as the Goodpasture antigen. Goodpasture disease is the result of an autoimmune response directed at this antigen. One isoform of this protein is also involved in ceramide intracellular transport. Two transcripts exist for this gene.
20359	
20360	
20361	The product of this gene belongs to the Serine/Threonine protein kinase family, and to the Ca(2+)/calmodulin-dependent protein kinase subfamily. This protein plays a role in the calcium/calmodulin-dependent (CaM) kinase cascade. Three transcript variants encoding two distinct isoforms have been identified for this gene.
20362	Neuronal CDC2-like kinase, which is involved in the regulation of neuronal differentiation, is composed of a catalytic subunit, CDK5, and an activating subunit, p25NCK5A. The protein encoded by this gene binds to p25NCK5A and therefore may be involved in neuronal differentiation. The encoded protein, which may be a substrate of neuronal CDC2-like kinase, has also been found in vascular endothelial cells, where it mediates cell proliferation. Three transcript variants encoding two different isoforms have been found for this gene.
20363	
20364	
20365	
20366	This gene encodes a member of a family of small membrane proteins that share a 35-amino acid signature sequence domain, beginning with the sequence PFXYD and containing 7 invariant and 6 highly conserved amino acids. The approved human gene nomenclature for the family is FXYD-domain containing ion transport regulator. Mouse FXYD5 has been termed RIC (Related to Ion Channel). FXYD2, also known as the gamma subunit of the Na,K-ATPase, regulates the properties of that enzyme. FXYD1 (phospholemman), FXYD2 (gamma), FXYD3 (MAT-8), FXYD4 (CHIF), and FXYD5 (RIC) have been shown to induce channel activity in experimental expression systems. Transmembrane topology has been established for two family members (FXYD1 and FXYD2), with the N-terminus extracellular and the C-terminus on the cytoplasmic side of the membrane. The protein encoded by this gene may function as a chloride channel or as a chloride channel regulator. Two transcript variants encode two different isoforms of the protein; in addition, transcripts utilizing alternative polyA signals have been described in the literature.
20367	
20368	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. The expression pattern of a similar rat protein suggests that this protein may be involved in neurovascular function in the brain.
20369	
20370	
20371	
20372	This gene encodes a GTP-binding protein that belongs to the small GTPase superfamily and Ras family of proteins. GTP-binding proteins mediate the transmembrane signaling initiated by the occupancy of certain cell surface receptors.
20373	
20374	
20375	Phosphatidylserine-specific phospholipase A1-alpha (PLA1A) acts specifically on phosphatidylserine (PS) and 1-acyl-2-lysophosphatidylserine (lyso-PS) to hydrolyze fatty acids at the sn-1 position of these phospholipids.[supplied by OMIM]
20376	The semaphorin/collapsin family of molecules plays a critical role in the guidance of growth cones during neuronal development. The secreted protein encoded by this gene family member is important in axonal guidance and has been shown to act as a tumor suppressor by inducing apoptosis.
20377	
20378	
20379	The protein encoded by this gene is highly expressed in peripheral blood of patients with atopic dermatitis (AD), compared to normal individuals. It may play a role in regulating the resistance to apoptosis that is observed in T cells and eosinophils of AD patients.
20380	
20381	
20382	
20383	The protein encoded by this gene belongs to a small class of prenylated protein tyrosine phosphatases (PTPs), which contains a PTP domain and a characteristic C-terminal prenylation motif. PTPs are cell signaling molecules that play regulatory roles in a variety of cellular processes. This tyrosine phosphatase is a nuclear protein, but may primarily associate with plasma membrane. The surface membrane association of this protein depends on its C-terminal prenylation. Overexpression of this gene in mammalian cells conferred a transformed phenotype, which implicated its role in the tumorigenesis. Studies in rat suggested that this gene may be an immediate-early gene in mitogen-stimulated cells.
20384	The autosomal dominant cerebellar ataxias (ADCA) are a heterogeneous group of neurodegenerative disorders characterized by progressive degeneration of the cerebellum, brain stem and spinal cord. Clinically, ADCA has been divided into three groups: ADCA types I-III. ADCAI is genetically heterogeneous, with five genetic loci, designated spinocerebellar ataxia (SCA) 1, 2, 3, 4 and 6, being assigned to five different chromosomes. ADCAII, which always presents with retinal degeneration (SCA7), and ADCAIII often referred to as the `pure@#$%& cerebellar syndrome (SCA5), are most likely homogeneous disorders. Several SCA genes have been cloned and shown to contain CAG repeats in their coding regions. ADCA is caused by the expansion of the CAG repeats, producing an elongated polyglutamine tract in the corresponding protein. The expanded repeats are variable in size and unstable, usually increasing in size when transmitted to successive generations. The function of the ataxins is not known. This locus has been mapped to chromosome 6, and it has been determined that the diseased allele contains 41-81 CAG repeats, compared to 6-39 in the normal allele. Several transcript variants in the 5@#$%& UTR have been described; however, their full-length nature is not known.
20385	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The function of this protein has not been identified.
20386	The product of this gene belongs to the phosphatase 2A regulatory subunit B family. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes a delta isoform of the regulatory subunit B56 subfamily. Alternatively spliced transcript variants encoding different isoforms have been identified.
20387	
20388	
20389	
20390	This gene is thought to play an important role in calcium homeostasis. Alternative splicing of this gene results in five transcript variants which vary in protein translation, the coding of catalytic domains, and tissue expression. Variation among these transcripts impacts their functions which involve roles in the calcium storage and release process in the endoplasmic and sarcoplasmic reticulum as well as hydroxylation of aspartic acid and asparagine in epidermal growth factor-like domains of various proteins.
20391	
20392	Valosin-containing protein (VCP) is a member of a family that includes putative ATP-binding proteins involved in vesicle transport and fusion, 26S proteasome function, and assembly of peroxisomes. VCP, as a structural protein, is associated with clathrin, and heat-shock protein Hsc70, to form a complex. VCP has been implicated in a number of cellular events that are regulated during mitosis, including homotypic membrane fusion, spindle pole body function, and ubiquitin-dependent protein degradation.
20393	
20394	
20395	The function of this gene has not yet been determined but may involve a role in tumor suppression. Alternative splicing of this gene results in several transcript variants; however, not all variants have been fully described.
20396	Coagulation factor II is proteolytically cleaved to form thrombin in the first step of the coagulation cascade which ultimately results in the stemming of blood loss.  F2 also plays a role in maintaining vascular integrity during development and postnatal life.  Mutations in F2 leads to various forms of thrombosis and dysprothrombinemia.
20397	FUT4 catalyzes the synthesis of the non-sialylated antigen, Lewis x (CD15). It has been detected in human embryos (5-10 weeks) suggesting a role in development.
20398	The protein encoded by this gene is similar to Saccharomyces cerevisiae LYS5, which is required for the activation of the alpha-aminoadipate dehydrogenase in the biosynthetic pathway of lysine. Yeast alpha-aminoadipate dehydrogenase converts alpha-biosynthetic-aminoadipate semialdehyde to alpha-aminoadipate. It has been suggested that defects in the human gene result in pipecolic acidemia.
20399	
20400	
20401	The product encoded by this gene belongs to the heat shock protein 70 family which contains both heat-inducible and constitutively expressed members. The latter are called heat-shock cognate proteins. This gene encodes a heat-shock cognate protein. This protein binds to nascent polypeptides to facilitate correct folding. It also functions as an ATPase in the disassembly of clathrin-coated vesicles during transport of membrane components through the cell. Two alternatively spliced variants have been characterized to date.
20402	Acyl-CoA:diacylglycerol acyltransferase, or DGAT (EC 2.3.1.20), is responsible for the synthesis of triglycerides. It catalyzes a reaction in which diacylglycerol is covalently joined to long chain fatty acyl-CoAs.[supplied by OMIM]
20403	
20404	
20405	
20406	Syntaxin 1 is a component of the 7S and 20S SNARE complexes which are involved in docking and fusion of synaptic vesicles with the presynaptic plasma membrane. This gene encodes a syntaxin 1 binding protein. In rat, a similar protein dissociates syntaxin 1 from the Munc18/n-Sec1/rbSec1 complex to form a 10S complex, an intermediate which can be converted to the 7S SNARE complex. Thus this protein is thought to be involved in neurotransmitter release by stimulating SNARE complex formation. Alternatively spliced variants have been identified, but their biological validity has not been determined.
20407	
20408	
20409	The protein encoded by this gene has similarity to a yeast protein. It seems to be part of a peripheral membrane protein complex localized on cis/medial Golgi cisternae where it may participate in tethering intra-Golgi transport vesicles.
20410	
20411	
20412	
20413	
20414	This gene shares high sequence similarity with the genes WBSCR20A and WBSCR20B; these three genes are the products of gene duplication during evolution. Alternative splicing of this gene results in four transcript variants encoding different isoforms. This gene is deleted in Williams syndrome, a multisystem developmental disorder caused by the deletion of contiguous genes at 7q11.23.
20415	
20416	
20417	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. The encoded protein shares 98-100% sequence identity with the zebrafish, frog, rat and mouse counterparts, which indicates that this enzyme is highly conserved in eukaryotes. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
20418	Heterochromatin protein-1 (HP1) is localized at heterochromatin sites, where it mediates gene silencing.[supplied by OMIM]
20419	Many vertebrate homeo box-containing genes have been identified on the basis of their sequence similarity with Drosophila developmental genes. Members of the Dlx gene family contain a homeobox that is related to that of Distal-less (Dll), a gene expressed in the head and limbs of the developing fruit fly.  The Distal-less (Dlx) family of genes comprises at least 6 different members, DLX1-DLX6. Trichodentoosseous syndrome (TDO), an autosomal dominant condition, has been correlated with DLX3 gene mutation.
20420	This gene encodes cytochrome c, a component of the electron transport chain in mitochondria. The heme group of cytochrome c accepts electrons from the b-c1 complex and transfers electrons to the cytochrome oxidase complex. Cytochrome c is also involved in initiation of apoptosis. Upon release of cytochrome c to the cytoplasm, the protein binds apoptotic protease activating factor which activates the apoptotic initiator procaspase 9. Many cytochrome c pseudogenes exist, scattered throughout the human genome.
20421	
20422	The protein encoded by this gene is a dual specificity protein kinase that belongs to the MAP kinase kinase family. This kinase is activated by mitogenic and environmental stress, and participates in the MAP kinase-mediated signaling cascade. It phosphorylates and thus activates MAPK14/p38-MAPK. This kinase can be activated by insulin, and is necessary for the expression of glucose transporter. Expression of RAS oncogene is found to result in the accumulation of the active form of this kinase, which thus leads to the constitutive activation of MAPK14, and confers oncogenic transformation of primary cells. The inhibition of this kinase is involved in the pathogenesis of Yersina pseudotuberculosis. Three alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
20423	PPP1R14A is a phosphorylation-dependent inhibitor of smooth muscle myosin phosphatase (see MIM 603768). Inhibition leads to increased myosin phosphorylation and enhances smooth muscle contraction in the absence of increased intracellular Ca(2+) concentration.[supplied by OMIM]
20424	
20425	
20426	
20427	
20428	The Makorin ring finger protein-1 gene (MKRN1) is a highly transcribed, intron-containing source for a family of intronless mammalian genes encoding a novel class of zinc finger proteins. Phylogenetic analyses indicate that the MKRN1 gene is the ancestral founder of this gene family.[supplied by OMIM]
20429	
20430	
20431	
20432	This gene is a member of the septin gene family which encodes nucleotide binding proteins, originally described in yeast as cell division cycle regulatory proteins. Septins are highly conserved in yeast, Drosophila and mouse, and appear to regulate cytoskeletal organization. Disruption of septin function disturbs cytokinesis and results in large multinucleate or polyploid cells. This gene is mapped to 22q11, the region frequently deleted in DiGeorge and velocardiofacial syndromes, and a translocation involving MLL gene and this gene has also been reported in patients with acute myeloid leukemia. Two transcripts, a major one of 2.2 kb and a minor one of 3.5 kb, have been observed in the gene region. The 2.2 kb form which encodes this septin (369 amino acids) protein, results from the utilization of a non-consensus polyA_signal (AACAAT). In the absence of polyadenylation from this imperfect site, this gene uses the consensus polyA_signal within its 3@#$%& neighbour gene (GP1BB; platelet glycoprotein Ib), resulting in the 3.5 kb read-through transcript. A transcript variant with a different 5@#$%& end arising by alternative splicing, and possibly accompanied by alternative promoter usage, has also been identified.
20433	Gamma-glutamyltransferase-like activity 1 (GGTLA1) is a member of a gene family with at least 4 members (GGTLA1, GGTLA2, GGTLA3 and GGTLA4). The enzyme encoded by GGTLA1 is related to, but distinct from, gamma-glutamyl transpeptidase (GGT). The GGTLA1 enzyme consists of a heavy and a light chain and is able to hydrolyze the gamma-glutamyl moiety of glutathione. It converts leukotriene C4 to leukotriene D4, however, it doesn@#$%&t convert synthetic substrates that are commonly used to assay GGT. Its amino acid sequence shows an overall similarity of 39.5% with human GGT.
20434	
20435	
20436	
20437	
20438	
20439	
20440	
20441	
20442	
20443	
20444	This gene encodes a member of the DEAD box protein family. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The protein encoded by this gene shows ATPase activity in the presence of polynucleotides and associates with nucleoplasmic 65S preribosomal particles. This gene may be involved in ribosome synthesis, most likely during assembly of the large 60S ribosomal subunit.
20445	This gene encodes an enzyme that plays a major role in polyamine metabolism and is important for the salvage of both adenine and methionine. The encoded enzyme is deficient in many cancers because this gene and the tumor suppressor p16 gene are co-deleted. Multiple alternatively spliced transcript variants have been described for this gene, but their full-length natures remain unknown.
20446	
20447	
20448	
20449	
20450	Heterochromatin protein-1 (HP1) is localized at heterochromatin sites, where it mediates gene silencing.[supplied by OMIM]
20451	
20452	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L30E family of ribosomal proteins. It is located in the cytoplasm. This gene is co-transcribed with the U72 small nucleolar RNA gene, which is located in its fourth intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
20453	
20454	
20455	
20456	
20457	The protein encoded by this gene is a member of the interleukin 1 receptor family. This protein is similar to the interleukin 1 accessory proteins, and is most closely related to interleukin 1 receptor accessory protein-like 1 (IL1RAPL1). This gene and IL1RAPL1 are located at a region on chromosome X that is associated with X-linked non-syndromic mental retardation.
20458	
20459	HLA-B belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exon 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-B alleles have been described
20460	
20461	
20462	
20463	
20464	
20465	
20466	
20467	
20468	
20469	
20470	
20471	
20472	
20473	This gene encodes a member of the 1-acylglycerol-3-phosphate O-acyltransferase family. This integral membrane protein converts lysophosphatidic acid to phosphatidic acid, the second step in de novo phospholipid biosynthesis.
20474	
20475	
20476	
20477	
20478	
20479	
20480	
20481	
20482	
20483	
20484	
20485	
20486	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form a complex with G proteins and stimulate Rho-dependent signals. This protein is observed to form myeloid/lymphoid fusion partner in acute myeloid leukemia.
20487	
20488	
20489	This gene encodes a transcription factor that belongs to the cap@#$%&n@#$%&collar type of basic region leucine zipper factor family (CNC-bZip). The encoded protein contains broad complex, tramtrack, bric-a-brac/poxvirus and zinc finger (BTB/POZ) domains, which is atypical of CNC-bZip family members. These BTB/POZ domains facilitate protein-protein interactions and formation of homo- and/or hetero-oligomers. When this encoded protein forms a heterodimer with MafK, it functions as a repressor of Maf recognition element (MARE) and transcription is repressed. Multiple alternatively spliced transcript variants have been identified for this gene. Some exons of this gene overlap with some exons from the C21orf41 gene, which is transcribed in an opposite orientation to this gene but does not seem to encode a protein.
20490	
20491	
20492	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem catalytic domains, and thus represents a receptor-type PTP. The extracellular region contains a meprin-A5 antigen-PTP mu (MAM) domain, an Ig-like domain and four fibronectin type III-like repeats. This PTP has been shown to mediate cell-cell aggregation through the interaction with another molecule of this PTP on an adjacent cell. This PTP can interact with scaffolding protein RACK1/GNB2L1, which may be necessary for the downstream signaling in response to cell-cell adhesion.
20493	
20494	
20495	The protein encoded by this gene belongs to the Ser/Thr family of protein kinases SNF1 subfamily. There is strong evidence that this protein is the endogenous mu 2 kinase and plays a regulatory role in clathrin-mediated endocytosis, including the regulation of fatty acid synthesis and cholesterol synthesis.
20496	
20497	
20498	This gene encodes a protein which interacts with clathrin and adaptor-related protein complex 2, alpha 1 subunit. The protein is found in a brain-derived clathrin-coated vesicle fraction and localizes to the peri-Golgi region and the cell periphery. The protein is thought to be involved in clathrin-mediated endocytosis. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
20499	The protein encoded by this gene is a pleiotropic cytokine that functions as a chemoattractant, a modulator of T cell activation, and an inhibitor of HIV replication. The signaling process of this cytokine is mediated by CD4. The product of this gene undergoes proteolytic processing, which is found to yield two functional proteins. The cytokine function is exclusively attributed to the secreted C-terminal peptide, while the N-terminal product may play a role in cell cycle control. Caspase 3 is reported to be involved in the proteolytic processing of this protein. Two alternatively spliced transcript variants encoding distinct isoforms have been reported.
20500	
20501	
20502	The protein encoded by this gene is one subunit of a trimeric complex, forming a highly conserved transcription factor that binds with high specificity to CCAAT motifs in the promoter regions in a variety of genes. This gene product, subunit C, forms a tight dimer with the B subunit, a prerequisite for subunit A association. The resulting trimer binds to DNA with high specificity and affinity. Subunits B and C each contain a histone-like motif. Observation of the histone nature of these subunits is supported by two types of evidence; protein sequence alignments and experiments with mutants. Additional regulation, preliminarily supported by the EST database, may be represented by alternative splicing in this subunit.
20503	This gene belongs to a group of vacuolar protein sorting (VPS) genes. The encoded protein is a component of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. The close structural similarity between the yeast and human proteins that make up this complex suggests a similarity in function. Expression studies in yeast and mammalian cells indicate that this protein interacts directly with VPS35, which serves as the core of the retromer complex.
20504	This gene encodes a transcriptional repressor which represses neuronal genes in non-neuronal tissues. It is a member of the Kruppel-type zinc finger transcription factor family. It represses transcription by binding a DNA sequence element called the neuron-restrictive silencer element. The protein is also found in undifferentiated neuronal progenitor cells, and it is thought that this repressor may act as a master negative regular of neurogenesis. Alternatively spliced transcript variants have been described; however, their full length nature has not been determined.
20505	
20506	
20507	
20508	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies. The protein has also been localized to the nucleus, where it interacts with the activation domain of the HIV-1 Tat protein. The Tat protein activates transcription of HIV-1 genes.
20509	
20510	
20511	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
20512	PAS domains regulate the function of many intracellular signaling pathways in response to both extrinsic and intrinsic stimuli. PAS kinase, or PASK, is an evolutionarily conserved protein present in yeast, flies, and mammals.[supplied by OMIM]
20513	
20514	
20515	
20516	
20517	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. These enzymes function in the detoxification of electrophilic compounds, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress, by conjugation with glutathione. The genes encoding these enzymes are known to be highly polymorphic. These genetic variations can change an individual@#$%&s susceptibility to carcinogens and toxins as well as affect the toxicity and efficacy of some drugs. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-tranferase belonging to the alpha class. The alpha class genes, located in a cluster mapped to chromosome 6, are the most abundantly expressed glutathione S-transferases in liver. In addition to metabolizing bilirubin and certain anti-cancer drugs in the liver, the alpha class of these enzymes exhibit glutathione peroxidase activity thereby protecting the cells from reactive oxygen species and the products of peroxidation.
20518	
20519	Members of ADAM family are cell surface proteins with a unique structure possessing both potential adhesion and protease domains. The extracellular region of ADAM8 shows significant amino acid sequence homology to hemorrhagic snake venom proteins, including the metalloprotease and disintegrin domains.
20520	
20521	
20522	This gene encodes a protein containing a calponin homology (CH) domain, a PDZ domain, and a LIM domain. An F-box (FBX) domain is present in alternative splice variants. Members of the LIM protein family carry the LIM domain, a unique cysteine-rich zinc-binding domain. Members of the FBX protein family are involved in protein-protein interactions. The encoded protein may be involved in protein-protein interactions. Multiple alternative splice variants have been described but their full-length sequences have not been determined.
20523	
20524	
20525	
20526	
20527	The protein encoded by this gene was originally identified as an ovarian tumor antigen monitored in ovarian cancer. The encoded protein contains a B-box/coiled coil motif, which is present in many genes with transformation potential, but the function of this protein is unknown. This gene is located on a region of chromosome 17q21.1 that is in close proximity to tumor suppressor gene BRCA1. Three alternatively spliced variants encoding the same protein have been identified for this gene.
20528	
20529	Mouse studies suggest that this gene functions during normal adipose tissue development and may play a role in human triglyceride metabolism. This gene represents a candidate gene for human lipodystrophy, characterized by loss of body fat, fatty liver, hypertriglyceridemia, and insulin resistance.
20530	This gene encodes a multidomain protein containing an N-terminal alpha-helical region with a coiled-coil motif, followed by a pleckstrin homology (PH) domain, an Arf-GAP domain, an ankyrin homology region, a proline-rich region, and a C-terminal Src homology 3 (SH3) domain. The protein localizes in the Golgi apparatus and at the plasma membrane, where it colocalizes with protein tyrosine kinase 2-beta (PYK2). The encoded protein forms a stable complex with PYK2 in vivo. This interaction appears to be mediated by binding of its SH3 domain to the C-terminal proline-rich domain of PYK2. The encoded protein is tyrosine phosphorylated by activated PYK2. In vitro it shows strong GTPase-activating protein (GAP) activity towards the small GTPases ADP-ribosylation factor (ARF) 1 and ARF5 and weak activity towards ARF6. The encoded protein is believed to function as an ARF GAP that controls ARF-mediated vesicle budding when recruited to Golgi membranes. In addition, it functions as a substrate and downstream target for PYK2 and SRC, a pathway that may be involved in the regulation of vesicular transport.
20531	
20532	
20533	Histamine is a ubiquitous messenger molecule released from mast cells, enterochromaffin-like cells, and neurons. Its various actions are mediated by histamine receptors H1, H2, H3 and H4. Histamine receptor H3 belongs to the family 1 of G protein-coupled receptors. It is an integral membrane protein and can regulate neurotransmitter release. This receptor can also increase voltage-dependent calcium current in smooth muscles and innervates the blood vessels and the heart in cardiovascular system.
20534	The protein encoded by this gene is a trifunctional polypeptide. It has phosphoribosylglycinamide formyltransferase, phosphoribosylglycinamide synthetase, phosphoribosylaminoimidazole synthetase activity which is required for de novo purine biosynthesis. This enzyme is highly conserved in vertebrates. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
20535	
20536	
20537	
20538	This gene product was isolated on the basis of its interaction with BRCA2 and p21 proteins. It is an evolutionarily conserved nuclear protein with multiple interacting domains. The N-terminal half shares moderate homology with regions of calmodulin and M-calpain, suggesting that it may also bind calcium. Functional studies indicate that this protein may be an important cofactor for BRCA2 in tumor suppression, and a modulator of CDK2 kinase activity via p21. Several transcript variants encoding different isoforms have been described for this gene.
20539	
20540	The protein encoded by this gene is member of the G-protein coupled receptor family. This protein is a receptor for prostaglandin F2-alpha (PGF2-alpha), which is known to be a potent luteolytic agent, and may also be involved in modulating intraocular pressure and smooth muscle contraction in uterus. Knockout studies in mice suggest that the interaction of PGF2-alpha with this receptor may initiate parturition in ovarian luteal cells and thus induce luteolysis.
20541	Apolipoprotein (Apo-) A-II is the second most abundant protein of the high density lipoprotein particles. The apo-A-II gene consists of 4 exons and 3 introns.  The four exons encode the 5@#$%& untranslated region, pre-peptide, a short N-terminal domain and a C-terminal domain composed of a variable number of lipid-binding amphipathic helices.  Familial apo-A-II deficiency may result from a splice-junction alteration which blocks splicing of intron 3 from the primary transcript and result the formation of a non-functional mRNA.
20542	
20543	
20544	
20545	This gene product belongs to the basic helix-loop-helix family of transcription factors and the myogenic factors subfamily. It is involved in muscle cell differentiation, and is essential for repair of damaged tissue. It activates its own transcription which may stabilize commitment to myogenesis.
20546	
20547	This intronless gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. The protein may function in the developing nervous system and play a role in tumorigenesis.
20548	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form complex with G proteins and stimulate Rho-dependent signals. This protein is similar to rat collybistin protein. Alternative splicing of this gene generates two transcript variants which encode different isoforms. Also there is possibility for the usage of multiple polyadenylation sites for this gene.
20549	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Sequence analysis identified at least three transcript variants. Pseudogenes corresponding to this gene are found on chromosomes 6p and 12p.
20550	
20551	
20552	
20553	Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. It is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, F0, which comprises the proton channel. The F1 complex consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled in a ratio of 3 alpha, 3 beta, and a single representative of the other 3. The F0 seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the e subunit of the F0 complex.
20554	The protein encoded by this gene is a transcriptional coactivator that regulates the genes involved in energy metabolism. This protein interacts with PPARgamma, which permits the interaction of this protein with multiple transcription factors. This protein can interact with, and regulate the activities of, cAMP response element binding protein (CREB) and nuclear respiratory factors (NRFs). It provides a direct link between external physiological stimuli and the regulation of mitochondrial biogenesis, and is a major factor that regulates muscle fiber type determination. This protein may be also involved in controlling blood pressure, regulating cellular cholesterol homoeostasis, and the development of obesity.
20555	
20556	This gene encodes the interferon (gamma)-induced protein of 10kDa, a chemokine of the CXC subfamily that is one of the ligands for the receptor CXCR3. The binding of this protein to CXCR3 causes pleiotropic effects, including stimulation of monocytes, natural killer and T-cell migration, and modulation of adhesion molecule expression.
20557	
20558	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L34E family of ribosomal proteins. It is located in the cytoplasm. This gene originally was thought to be located at 17q21, but it has been mapped to 4q. Transcript variants derived from alternative splicing, alternative transcription initiation sites, and/or alternative polyadenylation exist; these variants encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
20559	
20560	This gene encodes a multipass transmembrane protein that is expressed in joints and other tissues and controls pyrophosphate levels in cultured cells. Mutation at the mouse @#$%&progressive ankylosis@#$%& (ank) locus causes a generalized, progressive form of arthritis accompanied by mineral deposition, formation of bony outgrowths, and joint destruction. The human homolog is virtually identical to the mouse protein and ANKH-mediated control of pyrophosphate levels has been suggested as a possible mechanism regulating tissue calcification and susceptibility to arthritis in higher animals.
20561	
20562	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the histone microcluster on chromosome 6p21.33.
20563	A cluster of genes, BAT1-BAT5, has been localized in the vicinity of the genes for TNF alpha and TNF beta. This gene is found near this cluster; it was mapped near the gene for C2 within a 120-kb region that included a HSP70 gene pair. These genes are all within the human major histocompatibility complex class III region. This gene was thought to be two different genes, NG36 and G9a, adjacent to each other but a recent publication shows that there is only a single gene. The protein encoded by this gene is thought to be involved in intracellular protein-protein interaction. There are three alternatively spliced transcript variants of this gene but only two are fully described.
20564	
20565	
20566	
20567	
20568	
20569	
20570	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
20571	
20572	
20573	
20574	
20575	This gene encodes a nuclear protein which is involved in cell-cycle regulation. The encoded protein is a ubiquitin-ligase capable of ubiquinating PCNP (PEST-containing nuclear protein), and together they may play a role in tumorigenesis. Alternative splicing of this gene results in several transcripts; however, not all variants have been fully described.
20576	Guanine nucleotide-binding proteins are a family of heterotrimeric proteins that couple cell surface, 7-transmembrane domain receptors to intracellular signaling pathways. Receptor activation catalyzes the exchange of GTP for GDP bound to the inactive G protein alpha subunit resulting in a conformational change and dissociation of the complex. The G protein alpha and beta-gamma subunits are capable of regulating various cellular effectors. Activation is terminated by a GTPase intrinsic to the G-alpha subunit. G-alpha-q is the alpha subunit of one of the heterotrimeric GTP-binding proteins that mediates stimulation of phospholipase C-beta (MIM 600230).[supplied by OMIM]
20577	
20578	
20579	
20580	
20581	
20582	This gene encodes a member of the theta class glutathione S-transferase-like (GSTTL) protein family. In mouse, the encoded protein acts as a small stress response protein, likely involved in cellular redox homeostasis.
20583	
20584	
20585	The protein encoded by this gene is involved in the sodium-independent transport and excretion of organic anions, some of which are potentially toxic. The encoded protein is an integral membrane protein and appears to be localized to the basolateral membrane of the kidney.
20586	The somatomedins, or insulin-like growth factors (IGFs), comprise a family of peptides that play important roles in mammalian growth and development. IGF1 mediates many of the growth-promoting effects of growth hormone (GH; MIM 139250). Early studies showed that growth hormone did not directly stimulate the incorporation of sulfate into cartilage, but rather acted through a serum factor, termed @#$%&sulfation factor,@#$%& which later became known as @#$%&somatomedin@#$%& (Daughaday et al., 1972). Three main somatomedins have been characterized: somatomedin C (IGF1), somatomedin A (IGF2; MIM 147470), and somatomedin B (MIM 193190) (Rotwein, 1986; Rosenfeld, 2003).[supplied by OMIM]
20587	This gene encodes one of three enzymes that are involved in metabolizing serine and glycine. L-serine dehydratase converts L-serine to pyruvate and ammonia and requires pyridoxal phosphate as a cofactor. The encoded protein can also metabolize threonine to NH4+ and 2-ketobutyrate. The encoded protein is found predominantly in the liver.
20588	
20589	This gene encodes a member of the nucleosome assembly protein (NAP) family. This protein participates in DNA replication and may play a role in modulating chromatin formation and contribute to the regulation of cell proliferation. Alternative splicing of this gene results in several transcript variants; however, not all have been fully described.
20590	
20591	This gene encodes a protein which is a member of the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays a central role in the execution-phase of cell apoptosis. Caspases exist as inactive proenzymes which undergo proteolytic processing at conserved aspartic residues to produce 2 subunits, large and small, that dimerize to form the active enzyme. This protein was shown to cleave and activate caspases 3 and 7, and itself could be processed by caspase 8.  Mutations in this gene are associated with apoptosis defects seen in type II autoimmune lymphoproliferative syndrome. Four alternatively spliced transcript variants encoding different isoforms have been described for this gene.
20592	The protein encoded by this gene is a member of a family of membrane glycoproteins.  This glycoprotein provides selectins with carbohydrate ligands.  It may also play a role in tumor cell metastasis.
20593	
20594	
20595	The protein encoded by this gene is a cyclosporine-binding protein and is mainly located within the endoplasmic reticulum. It is associated with the secretory pathway and released in biological fluids. This protein can bind to cells derived from T- and B-lymphocytes, and may regulate cyclosporine A-mediated immunosuppression.
20596	
20597	In the mouse, Nkd is a Dishevelled (see DVL1; MIM 601365)-binding protein that functions as a negative regulator of the Wnt (see WNT1; MIM 164820)-beta-catenin (see MIM 116806)-Tcf (see MIM 602272) signaling pathway.[supplied by OMIM]
20598	
20599	
20600	
20601	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXB genes located in a cluster on chromosome 17. The exact role of this gene has yet to be determined.
20602	
20603	
20604	The protein encoded by this gene is a putative protein kinase that is induced by the transcription factor NF-kappaB. The encoded protein is a negative regulator of NF-kappaB and can also sensitize cells to TNF- and TRAIL-induced apoptosis. In addition, this protein can negatively regulate the cell survival serine-threonine kinase AKT1.
20605	This gene is a member of a phylogenetically conserved family of genes that share a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. This gene is closely linked to related family member T-box 3 (ulnar mammary syndrome) on human chromosome 12. The encoded protein may play a role in heart development and specification of limb identity. Mutations in this gene have been associated with Holt-Oram syndrome, a developmental disorder affecting the heart and upper limbs. Several transcript variants encoding different isoforms have been described for this gene.
20606	The RAB5 protein is a small GTPase involved in membrane trafficking in the early endocytic pathway. The protein encoded by this gene binds the GTP-bound form of the RAB5 protein preferentially over the GDP-bound form, and functions as a guanine nucleotide exchange factor for RAB5. The encoded protein is found primarily as a tetramer in the cytoplasm and does not bind other members of the RAB family.
20607	The protein encoded by this gene is an integral plasma membrane protein which promotes cell adhesion. The encoded protein is thought to undergo O-linked glycosylation. Expression of this gene has been shown to be induced by gamma interferon in some cancer cells. Two transcript variants encoding the same protein have been found for this gene.
20608	
20609	
20610	
20611	The protein encoded by this gene is found in the cytoplasm, where it catalyzes the transfer of phosphatidylinositol and phosphatidylcholine between membranes.
20612	The protein encoded by this gene belongs to the spermidine/spermine synthases family. This gene encodes an ubiquitous enzyme of polyamine metabolism.
20613	Temporally regulated transcription and translation of several testis-specific genes is required to initiate the series of molecular and morphological changes in the male germ cell lineage necessary for the formation of mature spermatozoa. This gene is a member of the SPANX family of cancer/testis-associated genes, which are located in a cluster on chromosome X. The SPANX genes encode differentially expressed testis-specific proteins that localize to various subcellular compartments. This particular gene maps to chromosome X in a head-to-tail orientation with SPANX family member B2, which appears to be a duplication of the B1 locus. The SPANXB genes are unique members of this gene family, since they contain an additional 18 nt in their coding region compared to the majority of family members. Although the protein encoded by this gene contains consensus nuclear localization signals, the major site for subcellular localization of expressed protein is in the cytoplasmic droplets of ejaculated spermatozoa. This protein provides a biochemical marker for studying the unique structures in spermatazoa, while attempting to further define its role in spermatogenesis.
20614	
20615	
20616	
20617	
20618	
20619	
20620	
20621	
20622	
20623	
20624	
20625	
20626	
20627	
20628	
20629	
20630	
20631	
20632	
20633	
20634	
20635	This gene encodes WW domain-containing binding protein 4. The WW domain represents a small and compact globular structure that interacts with proline-rich ligands. This encoded protein is a general spliceosomal protein that may play a role in cross-intron bridging of U1 and U2 snRNPs in the spliceosomal complex A.
20636	
20637	
20638	
20639	
20640	
20641	
20642	
20643	
20644	
20645	
20646	
20647	
20648	
20649	
20650	
20651	
20652	
20653	
20654	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit functions as guanine nucleotide exchange factor. It is reported that this subunit interacts with HIV-1 Tat, and thus it represses the translation of host-cell, but not HIV-1, mRNAs. Several alternatively spliced transcript variants have been found for this gene, however, the full length nature of only two variants has been determined.
20655	
20656	This gene encodes a mitogen-activated kinase which localizes to the nucleus. The gene maps to the major histocompatability complex (MHC) class II region on chromosome 6p21.3 but sequence comparison suggests that the protein is not involved in the immune response. Homology to the Drosophila gene female sterile homeotic suggests that this human gene may be part of a signal transduction pathway involved in growth control.
20657	
20658	
20659	
20660	
20661	
20662	
20663	This gene encodes a member of the tenascin family of extracellular matrix proteins. The tenascins have anti-adhesive effects, as opposed to fibronectin which is adhesive. The specific function of this protein is unknown, but its deficiency has been associated with the connective tissue disorder Ehlers-Danlos syndrome. This gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6. It is one of four genes in this cluster which have been duplicated. The duplicated copy of this gene is incomplete and is a pseudogene which is transcribed but does not encode a protein. The structure of this gene is unusual in that it overlaps the CREBL1 and CYP21A2 genes at its 5@#$%& and 3@#$%& ends, respectively. The gene expresses two transcripts.
20664	
20665	
20666	
20667	
20668	
20669	
20670	
20671	
20672	
20673	
20674	
20675	
20676	
20677	
20678	
20679	
20680	
20681	
20682	
20683	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of Na+/K+ -ATPases. Na+/K+ -ATPase is an integral membrane protein responsible for establishing and maintaining the electrochemical gradients of Na and K ions across the plasma membrane. These gradients are essential for osmoregulation, for sodium-coupled transport of a variety of organic and inorganic molecules, and for electrical excitability of nerve and muscle. This enzyme is composed of two subunits, a large catalytic subunit (alpha) and a smaller glycoprotein subunit (beta). The catalytic subunit of Na+/K+ -ATPase is encoded by multiple genes. This gene encodes an alpha 4 subunit. Alternatively spliced transcript variants encoding different isoforms have been identified.
20684	V7-3 shows structural characteristics of an Na(+) and Cl(-)-dependent neurotransmitter transporter, including 12 transmembrane (TM) domains, intracellular N and C termini, and large extracellular loops containing multiple N-glycosylation sites.[supplied by OMIM]
20685	
20686	
20687	
20688	
20689	
20690	
20691	
20692	
20693	
20694	
20695	Members of the F-box protein family, such as FBXL18, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
20696	The lethal (2) giant larvae protein of Drosophila plays a role in asymmetric cell division, epithelial cell polarity, and cell migration. This human gene encodes a protein similar to lethal (2) giant larvae of Drosophila. In fly, the protein@#$%&s ability to localize cell fate determinants is regulated by the atypical protein kinase C (aPKC). In human, this protein interacts with aPKC-containing complexes and is cortically localized in mitotic cells. Alternative splicing results in multiple transcript variants encoding different isoforms.
20697	
20698	
20699	
20700	
20701	
20702	
20703	
20704	JAK3 encodes Janus kinase 3, a tyrosine kinase that belongs to the Janus family. JAK3 functions in signal transduction and interacts with members of the STAT (signal transduction and activators of transcription) family. JAK3 is predominantly expressed in immune cells and transduces a signal in response to its activation via tyrosine phosphorylation by interleukin receptors. Mutations that abrogate Janus kinase 3 function cause an autosomal SCID (severe combined immunodeficiency disease).
20705	
20706	
20707	
20708	This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. In the Goodpasture syndrome, autoantibodies bind to the collagen molecules in the basement membranes of alveoli and glomeruli. The epitopes that elicit these autoantibodies are localized largely to the non-collagenous C-terminal domain of the protein. A specific kinase phosphorylates amino acids in this same C-terminal region and the expression of this kinase is upregulated during pathogenesis. There are six alternate transcripts that appear to be unique to this human subunit gene and alternate splicing is restricted to the six exons that encode this domain. This gene is also linked to an autosomal recessive form of Alport syndrome. The mutations contributing to this syndrome are also located within the exons that encode this C-terminal region. Like the other members of the type IV collagen gene family, this gene is organized in a head-to-head conformation with another type IV collagen gene so that each gene pair shares a common promoter.
20709	The protein encoded by this gene is highly similar to the CML1 gene product, which is a kidney and liver protein with homology to bacterial acetyltransferases involved in drug resistance. This gene is localized on chromosome 2 in the vicinity of the CML1 gene. It appears to be an intronless gene and may represent a pseudogene of CML1, however, its exact function is not known.
20710	
20711	
20712	This gene is a member of the protocadherin gamma gene cluster, one of three related clusters tandemly linked on chromosome five. These gene clusters have an immunoglobulin-like organization, suggesting that a novel mechanism may be involved in their regulation and expression. The gamma gene cluster includes 22 genes divided into 3 subfamilies. Subfamily A contains 12 genes, subfamily B contains 7 genes and 2 pseudogenes, and the more distantly related subfamily C contains 3 genes. The tandem array of 22 large, variable region exons are followed by a constant region, containing 3 exons shared by all genes in the cluster. Each variable region exon encodes the extracellular region, which includes 6 cadherin ectodomains and a transmembrane region. The constant region exons encode the common cytoplasmic region. These neural cadherin-like cell adhesion proteins most likely play a critical role in the establishment and function of specific cell-cell connections in the brain. Alternative splicing has been described for the gamma cluster genes.
20713	This gene encodes a large secreted extracellular matrix protein thought to control cell-cell interactions critical for cell positioning and neuronal migration during brain development. This protein may be involved in schizophrenia, autism, bipolar disorder, major depression and in migration defects associated with temporal lobe epilepsy. Mutations of this gene are associated with autosomal recessive lissencephaly with cerebellar hypoplasia. Two transcript variants encoding distinct isoforms have been identified for this gene. Other transcript variants have been described but their full length nature has not been determined.
20714	The POT1 gene encodes a protein conserved across widely diverged eukaryotes that binds the G-rich strand of its own telomeric repeat sequence, consistent with a direct role in protecting chromosome ends.[supplied by OMIM]
20715	
20716	
20717	Prosaposin is a 511-amino acid glycoprotein which is a precursor for 4 saposins that are designated A, B, C, and D (Morimoto et al., 1989). Each domain of the precursor protein is approximately 80 amino acid residues long; has nearly identical placement of cysteine residues, glycosylation sites, and helicoregions; and is flanked by proteolytic cleavage sites. Proteolytic cleavage of prosaposin at each domain boundary is predicted to give rise to 4 saposin proteins. The PSAP gene can be said to be polycistronic.[supplied by OMIM]
20718	
20719	PORIMIN (pro-oncosis receptor inducing membrane injury), a member of the cell membrane-associated mucin family, mediates oncosis-like cell death.[supplied by OMIM]
20720	
20721	
20722	
20723	
20724	
20725	The hepatotrophic factor designated augmenter of liver regeneration (ALR) is thought to be one of the factors responsible for the extraordinary regenerative capacity of mammalian liver. It has also been called hepatic regenerative stimulation substance (HSS). The gene resides on chromosome 16 in the interval containing the locus for polycystic kidney disease (PKD1). The putative gene product is 42% similar to the scERV1 protein of yeast. The yeast scERV1 gene had been found to be essential for oxidative phosphorylation, the maintenance of mitochondrial genomes, and the cell division cycle. The human gene is both the structural and functional homolog of the yeast scERV1 gene.
20726	
20727	
20728	
20729	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. The encoded protein is one of two functional homologs to yeast Upf3p. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein binds to the mRNA and remains bound after nuclear export, acting as a nucleocytoplasmic shuttling protein. It forms with Y14 a complex that binds specifically 20 nt upstream of exon-exon junctions. This gene is located on the long arm of chromosome X. Two splice variants encoding different isoforms have been found for this gene.
20730	Members of the F-box protein family, such as FBXO28, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
20731	
20732	CAPZA1 is a member of the F-actin capping protein alpha subunit family. This gene encodes the alpha subunit of the barbed-end actin binding protein.  The protein regulates growth of the actin filament by capping the barbed end of growing actin filaments.
20733	Multiprotein complexes are key determinants of Golgi apparatus structure and its capacity for intracellular transport and glycoprotein modification. Several complexes have been identified, including the Golgi transport complex (GTC), the LDLC complex, which is involved in glycosylation reactions, and the SEC34 complex, which is involved in vesicular transport. These 3 complexes are identical and have been termed the conserved oligomeric Golgi (COG) complex, which includes COG2 (Ungar et al., 2002).[supplied by OMIM]
20734	Myogenin is a muscle-specific transcription factor that can induce myogenesis in a variety of cell types in tissue culture. It is a member of a large family of proteins related by sequence homology, the helix-loop-helix (HLH) proteins. It is essential for the development of functional skeletal muscle.
20735	Transcriptional repression is a general mechanism for regulating transcriptional initiation in organisms ranging from yeast to humans.  Accurate initiation of transcription from eukaryotic protein-encoding genes requires the assembly of a large multiprotein complex consisting of RNA polymerase II and general transcription factors such as TFIIA, TFIIB, and TFIID. DR1 is a repressor that interacts with the TATA-binding protein (TBP) of TFIID and prevents the formation of an active transcription complex by precluding the entry of TFIIA and/or TFIIB into the preinitiation complex. The protein encoded by this gene is a corepressor of transcription that interacts with DR1 to enhance DR1-mediated repression. The interaction between this corepressor and DR1 is required for corepressor function and appears to stabilize the TBP-DR1-DNA complex.
20736	
20737	
20738	This gene encodes one of the major intermediate filament proteins of mature astrocytes. It is used as a marker to distinguish astrocytes from other glial cells during development. Mutations in this gene cause Alexander disease, a rare disorder of astrocytes in the central nervous system. An additional transcript variant has been described, but its full length sequence has not been determined.
20739	
20740	The MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) family includes a number of human proteins, several of which are involved the production of leukotrienes. This gene encodes an enzyme that catalyzes the first step in the biosynthesis of cysteinyl leukotrienes, potent biological compounds derived from arachidonic acid. Leukotrienes have been implicated as mediators of anaphylaxis and inflammatory conditions such as human bronchial asthma. This protein localizes to the nuclear envelope and adjacent endoplasmic reticulum. Multiple protein isoforms are encoded by transcript variants of this gene.
20741	
20742	This gene encodes one of the three alpha chains of type IX collagen, the major collagen component of hyaline cartilage. Type IX collagen is usually found in tissues containing type II collagen, a fibrillar collagen. Studies in knockout mice have shown that synthesis of the alpha 1 chain is essential for assembly of type IX collagen molecules, a heterotrimeric molecule, and that lack of type IX collagen is associated with early onset osteoarthritis. Mutations in this gene may be associated with multiple epiphyseal dysplasia. Two transcript variants have been identified for this gene.
20743	
20744	HLA-G belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-G is expressed on fetal derived placental cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domain, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exon 6 encodes the cytoplasmic tail.
20745	This gene encodes a C2H2 zinc finger protein with transactivation and DNA-binding activity. This gene has been shown to exhibit antiproliferative activities and is a tumor suppressor gene candidate. Two transcript variants encoding different isoforms have been found for this gene.
20746	
20747	
20748	
20749	
20750	
20751	
20752	
20753	
20754	
20755	This gene encodes a member of the aldo/keto reductase superfamily, which consists of more than 40 known enzymes and proteins. These enzymes catalyze the conversion of aldehydes and ketones to their corresponding alcohols by utilizing NADH and/or NADPH as cofactors. The enzymes display overlapping but distinct substrate specificity. This enzyme binds bile acid with high affinity, and shows minimal 3-alpha-hydroxysteroid dehydrogenase activity. This gene shares high sequence identity with three other gene members and is clustered with those three genes at chromosome 10p15-p14.
20756	
20757	
20758	
20759	The protein encoded by this gene belongs to the family of Na+/K+ and H+/K+ ATPases beta chain proteins, and to the subfamily of Na+/K+ -ATPases. Na+/K+ -ATPase is an integral membrane protein responsible for establishing and maintaining the electrochemical gradients of Na and K ions across the plasma membrane. These gradients are essential for osmoregulation, for sodium-coupled transport of a variety of organic and inorganic molecules, and for electrical excitability of nerve and muscle. This enzyme is composed of two subunits, a large catalytic subunit (alpha) and a smaller glycoprotein subunit (beta). The beta subunit regulates, through assembly of alpha/beta heterodimers, the number of sodium pumps transported to the plasma membrane. The glycoprotein subunit of Na+/K+ -ATPase is encoded by multiple genes. This gene encodes a beta 2 subunit.
20760	The protein encoded by this gene is a neuron-specific protein. The similar protein in pig enhances neuroblastoma cell differentiation in vitro and may be involved in neuronal differentiation in vivo.
20761	
20762	
20763	
20764	
20765	
20766	The protein encoded by this gene is a type I transmembrane protein. It is thought to modulate bone morphogenetic protein (BMP) receptor function by serving as an accessory or coreceptor, and thus facilitates or hinders BMP binding. It is known that the mouse AMN gene is expressed in the extraembryonic visceral endoderm layer during gastrulation, but it is found to be mutated in amnionless mouse. The encoded protein has sequence similarity to short gastrulation (Sog) and procollagen IIA proteins in Drosophila.
20767	
20768	
20769	The protein encoded by this gene is a cytosolic factor that facilitates protein transport into the nucleus. It interacts with the nuclear pore complex glycoprotein p62. This encoded protein acts at a relative late stage of nuclear protein import, subsequent to the initial docking of nuclear import ligand at the nuclear envelope. It is thought to be part of a multicomponent system of cytosolic factors that assemble at the pore complex during nuclear import.
20770	Precerebellin is similar to the noncollagen domain of complement component C1qB. It gives rise to several truncated derivatives, including the hexadecapeptide cerebellin which is highly enriched in postsynaptic structures of cerebellar Purkinje cells in cartwheel neurons of the dorsal cochlear nucleus.
20771	
20772	
20773	
20774	Synaptobrevins/VAMPs, syntaxins, and the 25-kD synaptosomal-associated protein SNAP25 are the main components of a protein complex involved in the docking and/or fusion of synaptic vesicles with the presynaptic membrane. VAMP2 is a member of the vesicle-associated membrane protein (VAMP)/synaptobrevin family. VAMP2 is a likely candidate gene for familial infantile myasthenia (FIMG) because of its map location and because it encodes a synaptic vesicle protein of the type that has been implicated in the pathogenesis of FIMG. VAMP2 is thought to participate in neurotransmitter release at a step between docking and fusion. The protein forms a stable complex with syntaxin, synaptosomal-associated protein, 25 kD, and synaptotagmin. It also forms a distinct complex with synaptophysin.
20775	
20776	
20777	
20778	
20779	PAK proteins, a family of serine/threonine p21-activating kinases, include PAK1, PAK2, PAK3 and PAK4. PAK proteins are critical effectors that link Rho GTPases to cytoskeleton reorganization and nuclear signaling. They serve as targets for the small GTP binding proteins Cdc42 and Rac and have been implicated in a wide range of biological activities. PAK4 interacts specifically with the GTP-bound form of Cdc42Hs and weakly activates the JNK family of MAP kinases. PAK4 is a mediator of filopodia formation and may play a role in the reorganization of the actin cytoskeleton. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
20780	
20781	This gene encodes a calcium-binding protein, which may play a role in the regulation of ion transport. A similar protein was first described as a potentially important regulatory protein in the dog thyroid and was termed as R2D5 antigen in rabbit. Alternative splicing of this gene generates two transcript variants.
20782	The protein encoded by this gene is a nucleoplasmic protein. It binds cytidine-rich sequences in double-stranded DNA. This protein has three types of domains: MH1, MH2 (repeated three times) and MH3. It is associated with packaging, transferring, or processing transcripts. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of some variants has not been determined.
20783	This gene encodes a cell surface receptor and transmembrane precursor protein that is cleaved by secretases to form a number of peptides. Some of these peptides are secreted and can bind to the acetyltransferase complex APBB1/TIP60 to promote transcriptional activation, while others form the protein basis of the amyloid plaques found in the brains of patients with Alzheimer disease. Mutations in this gene have been implicated in autosomal dominant Alzheimer disease and cerebroarterial amyloidosis (cerebral amyloid angiopathy). Multiple transcript variants encoding several different isoforms have been found for this gene.
20784	
20785	
20786	
20787	This gene belongs to a family of genes that are expressed in a variety of tumors but not in normal tissues, except for the testis. Unlike the other gene family members, this gene does not encode an antigenic peptide. Nothing is presently known about the function of this protein.
20788	
20789	This gene encodes coagulation factor VIII, which participates in the intrinsic pathway of blood coagulation; factor VIII is a cofactor for factor IXa which, in the presence of Ca+2 and phospholipids, converts factor X to the activated form Xa.  This gene produces two alternatively spliced transcripts. Transcript variant 1 encodes a large glycoprotein, isoform a, which circulates in plasma and associates with von Willebrand factor in a noncovalent complex. This protein undergoes multiple cleavage events. Transcript variant 2 encodes a putative small protein, isoform b, which consists primarily of the phospholipid binding domain of factor VIIIc. This binding domain is essential for coagulant activity. Defects in this gene results in hemophilia A, a common recessive X-linked coagulation disorder.
20790	
20791	
20792	
20793	
20794	
20795	This gene encodes a member of the polycystin protein family. Members of this protein family may function in renal tubular development, morphology, and function, and may modulate intracellular calcium homoeostasis and other signal transduction pathways. Two transcript variants encoding distinct isoforms have been identified for this gene.
20796	
20797	
20798	This gene lies within the major histocompatibility complex class I region on chromosome 6. Studies of a similar rat protein suggest that this gene encodes a protein that plays a role in an early phase of synaptic plasticity. Alternative splicing results in three transcript variants encoding different isoforms.
20799	The protein encoded by this gene is a member of the SEC24 subfamily of the SEC23/SEC24 family, which is involved in vesicle trafficking. The encoded protein has similarity to yeast Sec24p component of COPII. COPII is the coat protein complex responsible for vesicle budding from the ER. The product of this gene may play a role in shaping the vesicle, as well as in cargo selection and concentration. Alternatively spliced transcript variants encoding the same protein have been identified.
20800	
20801	
20802	
20803	
20804	
20805	
20806	
20807	
20808	
20809	
20810	
20811	
20812	
20813	
20814	
20815	SPRED2 is a member of the Sprouty (see SPRY1; MIM 602465)/SPRED family of proteins that regulate growth factor-induced activation of the MAP kinase cascade (see MAPK1; MIM 176948) (Nonami et al., 2004).[supplied by OMIM]
20816	The protein encoded by this gene has been identified as a brain-specific angiogenesis inhibitor (BAI1)-binding protein.  This interaction at the cytoplasmic membrane is crucial to the function of this protein, which may be involved in neuronal growth-cone guidance.  This protein functions as an insulin receptor tyrosine kinase substrate and suggests a role for insulin in the central nervous system.  This protein has also been identified as interacting with the dentatorubral-pallidoluysian atrophy gene, which is associated with an autosomal dominant neurodegenerative disease.  It also associates with a downstream effector of Rho small G proteins, which is associated with the formation of stress fibers and cytokinesis.  Alternative splicing of the 3@#$%&-end of this gene results in three products of undetermined function.
20817	
20818	
20819	
20820	
20821	
20822	
20823	This gene contains two ankyrin repeat-encoding regions. Ankyrin repeats are tandemly repeated modules of about 33 amino acids described as L-shaped structures consisting of a beta-hairpin and two alpha-helices. Ankyrin repeats occur in a large number of functionally diverse proteins, mainly from eukaryotes, and are known to function as protein-protein interaction domains. Alternative splicing has been observed for this gene but the full-length nature of additional variants has not been determined.
20824	
20825	
20826	
20827	
20828	
20829	
20830	
20831	
20832	
20833	
20834	
20835	
20836	
20837	
20838	
20839	
20840	The protein encoded by this gene is a component of the exocyst complex, a multiple protein complex essential for targeting exocytic vesicles to specific docking sites on the plasma membrane. Though best characterized in yeast, the component proteins and functions of exocyst complex have been demonstrated to be highly conserved in higher eukaryotes. At least eight components of the exocyst complex, including this protein, are found to interact with the v-ral simian leukemia viral oncogene homolog A (RALA), a Ras-related small GTPase involved in controlling actin cytoskeletal remodeling and vesicle transport. The protein complex containing the similar protein in rat was reported to be involved in synaptic vesicle docking and fusion at the presynaptic terminal.
20841	
20842	
20843	
20844	
20845	
20846	Phosphatidylinositol-4,5-bisphosphate, the precursor to second messengers of the phosphoinositide signal transduction pathways, is thought to be involved in the regulation of secretion, cell proliferation, differentiation, and motility. The protein encoded by this gene is one of a family of enzymes capable of catalyzing the phosphorylation of phosphatidylinositol-4-phosphate on the fifth hydroxyl of the myo-inositol ring to form phosphatidylinositol-4,5-bisphosphate. The amino acid sequence of this enzyme does not show homology to other kinases, but the recombinant protein does exhibit kinase activity. This gene is a member of the phosphatidylinositol-4-phosphate 5-kinase family.
20847	This gene is involved in the first step in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is a glycolipid found on many blood cells and serves to anchor proteins to the cell surface. This gene encodes a N-acetylglucosaminyl transferase component that is part of the complex that catalyzes transfer of N-acetylglucosamine (GlcNAc) from UDP-GlcNAc to phosphatidylinositol (PI).
20848	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. Four sequence-related potassium channel genes - shaker, shaw, shab, and shal - have been identified in Drosophila, and each has been shown to have human homolog(s). This gene encodes a member of the potassium channel, voltage-gated, shab-related subfamily. This member is a delayed rectifier potassium channel and its activity is modulated by some other family members.
20849	
20850	
20851	Nucleases are components of DNA and RNA metabolism that carry out functions in DNA repair, replication, and recombination and in RNA processing and degradation. SFN is a homolog of Orn, a 3-prime-to-5-prime exoribonuclease of E. coli that attacks the free 3-prime hydroxyl group on single-stranded RNA, releasing 5-prime mononucleotides in a sequential manner.[supplied by OMIM]
20852	
20853	
20854	Ecto-5-prime-nucleotidase (5-prime-ribonucleotide phosphohydrolase; EC 3.1.3.5) catalyzes the conversion at neutral pH of purine 5-prime mononucleotides to nucleosides, the preferred substrate being AMP. The enzyme consists of a dimer of 2 identical 70-kD subunits bound by a glycosyl phosphatidyl inositol linkage to the external face of the plasma membrane. The enzyme is used as a marker of lymphocyte differentiation. Consequently, a deficiency of NT5 occurs in a variety of immunodeficiency diseases (e.g., see MIM 102700, MIM 300300). Other forms of 5-prime nucleotidase exist in the cytoplasm and lysosomes and can be distinguished from ecto-NT5 by their substrate affinities, requirement for divalent magnesium ion, activation by ATP, and inhibition by inorganic phosphate.[supplied by OMIM]
20855	
20856	This gene encodes a member of the SWI2/SNF2 family of proteins, which play a critical role in altering protein-DNA interactions. The encoded protein has been shown to have dsDNA-dependent ATPase activity and RNA polymerase II termination activity. This protein interacts with cell division cycle 5-like, associates with human splicing complexes, and plays a role in pre-mRNA splicing.
20857	The protein encoded by this gene belongs to the complex I 42kDA subunit family. Mammalian complex I is the first enzyme complex in the electron transport chain of mitochondria. It is composed of 45 different subunits. This protein is a component of the hydrophobic protein fraction and has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain.
20858	
20859	The protein encoded by this gene is a glycoprotein and a member of the NADPH oxidase family. The synthesis of thyroid hormone is catalyzed by a protein complex located at the apical membrane of thyroid follicular cells. This complex contains an iodide transporter, thyroperoxidase, and a peroxide generating system that includes this encoded protein and DUOX2. This protein is known as dual oxidase because it has both a peroxidase homology domain and a gp91phox domain. Two alternatively spliced transcript variants encoding the same protein have been described for this gene.
20860	This gene encodes a member of the EDG family of receptors, which are G protein-coupled receptors. This protein has been identified as a functional receptor for sphingosine 1-phosphate and likely contributes to the regulation of angiogenesis and vascular endothelial cell function.
20861	
20862	
20863	In infectious mononucleosis, anti-EBNA-1 antibodies are produced which cross-react with multiple normal human proteins. The cross-reactivity is due to anti-gly/ala antibodies that cross-react with host proteins containing configurations like those in the EBNA-1 repeat. One such antigen is RALY which is a member of the heterogeneous nuclear ribonucleoprotein gene family.
20864	
20865	Insulinoma-associated 1 (INSM1) gene is intronless and encodes a protein containing both a zinc finger DNA-binding domain and a putative prohormone domain. This gene is a sensitive marker for neuroendocrine differentiation of human lung tumors.
20866	
20867	
20868	
20869	
20870	
20871	
20872	
20873	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
20874	This gene encodes a zinc finger protein of the Kruppel family. The protein contains a SCAN box and a KRAB A domain. A similar protein in mouse is differentially expressed in spermatogenesis. Alternative splicing of this gene results in two transcript variants differing only in the 5@#$%& UTR. Additional variants have been described, but their full length sequences have not been determined.
20875	RDHE2 belongs to a family of short-chain alcohol dehydrogenases/reductases that catalyze the first and rate-limiting step that generates retinaldehyde from retinol (Matsuzaka et al., 2002).[supplied by OMIM]
20876	This gene is related to the Wolf-Hirschhorn syndrome candidate-1 gene and encodes a protein with PWWP (proline-tryptophan-tryptophan-proline) domains. The function of the protein has not been determined. Two alternatively spliced variants have been described.
20877	
20878	
20879	The protein encoded by this gene is an inositol 1,4,5-trisphosphate (InsP3) 5-phosphatase. InsP3 5-phosphatases hydrolyze Ins(1,4,5)P3, which mobilizes intracellular calcium and acts as a second messenger mediating cell responses to various stimulation. Studies of the mouse counterpart suggest that this protein may hydrolyze phosphatidylinositol 3,4, 5-trisphosphate and phosphatidylinositol 3,5-bisphosphate on the cytoplasmic Golgi membrane and thereby regulate Golgi-vesicular trafficking.
20880	The protein encoded by this gene is a nuclear protein that forms intranuclear foci during proliferation and is redistributed in the nucleoplasm during the cell cycle. Short-wave ultraviolet light provokes the relocalization of the protein, suggesting its participation in the cellular response to DNA damage. Originally selected based on protein-binding with RecA antibodies, the mouse protein presents a limited similarity with a functional domain of the bacterial RecA protein, a characteristic shared by this human ortholog.
20881	
20882	
20883	
20884	
20885	
20886	
20887	The protein encoded by this gene is a member of the E2F family of transcription factors. The E2F family plays a crucial role in the control of cell cycle and action of tumor suppressor proteins and is also a target of the transforming proteins of small DNA tumor viruses. The E2F proteins contain several evolutionally conserved domains found in most members of the family. These domains include a DNA binding domain, a dimerization domain which determines interaction with the differentiation regulated transcription factor proteins (DP), a transactivation domain enriched in acidic amino acids, and a tumor suppressor protein association domain which is embedded within the transactivation domain. This protein binds to all three of the  tumor suppressor proteins pRB, p107 and p130, but with higher affinity to the last two. It plays an important role in the suppression of proliferation-associated genes, and its gene mutation and increased expression may be associated with human cancer.
20888	This gene is a member of the casein kinase I (CKI) gene family whose members have been implicated in the control of cytoplasmic and nuclear processes, including DNA replication and repair. The encoded protein is highly similar to the mouse and rat CK1 delta homologs. Two transcript variants encoding different isoforms have been found for this gene.
20889	
20890	The ITGB3 protein product is the integrin beta chain beta 3. Integrins are integral cell-surface proteins composed of an alpha chain and a beta chain. A given chain may combine with multiple partners resulting in different integrins. Integrin beta 3 is found along with the alpha IIb chain in platelets. Integrins are known to participate in cell adhesion as well as cell-surface mediated signalling.
20891	
20892	
20893	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members. The specific function of this protein has not been determined. Two transcript variants encoding distinct isoforms have been identified for this gene.
20894	
20895	The protein encoded by this gene phosphorylates vitamin B6, a step required for the conversion of vitamin B6 to pyridoxal-5-phosphate, an important cofactor in intermediary metabolism. The encoded protein is cytoplasmic and probably acts as a homodimer.
20896	Immunoglobulins recognize foreign antigens and initiate immune responses such as phagocytosis and the complement system. Each immunoglobulin molecule consists of two identical heavy chains and two identical light chains. There are two classes of light chains, kappa and lambda. This region represents the germline organization of the lambda light chain locus. The locus includes V (variable), J (joining), and C (constant) segments. During B cell development, a recombination event at the DNA level joins a single V segment with a J segment; the C segment is later joined by splicing at the RNA level. Recombination of many different V segments with several J segments provides a wide range of antigen recognition. Additional diversity is attained by junctional diversity, resulting from the random additional of nucleotides by terminal deoxynucleotidyltransferase, and by somatic hypermutation, which occurs during B cell maturation in the spleen and lymph nodes. Several V segments and three C segments are known to be incapable of encoding a protein and are considered pseudogenes. The locus also includes several non-immunoglobulin genes, many of which are pseudogenes or are predicted by automated computational analysis or homology to other species.
20897	Glutathione S-transferase (GST) theta 1 (GSTT1) is a member of a superfamily of proteins that catalyze the conjugation of reduced glutathione to a variety of electrophilic and hydrophobic compounds. Human GSTs can be divided into five main classes: alpha, mu, pi, theta, and zeta. The theta class includes GSTT1 and GSTT2. The GSTT1 and GSTT2 share 55% amino acid sequence identity and both of them were claimed to have an important role in human carcinogenesis. The GSTT1 gene is located approximately 50kb away from the GSTT2 gene. The GSTT1 and GSTT2 genes have a similar structure, being composed of five exons with identical exon/intron boundaries.
20898	This gene is one of several cytokine genes clustered on the q-arm of chromosome 17. Cytokines are a family of secreted proteins involved in immunoregulatory and inflammatory processes. The CC cytokines are proteins characterized by two adjacent cysteines. The cytokine encoded by this gene displays chemotactic activity on resting T lymphocytes and monocytes, lower activity on neutrophils and no activity on activated T lymphocytes. The protein is also a strong suppressor of colony formation by a multipotential hematopoietic progenitor cell line. In addition, the product of this gene is a potent agonist at CC chemokine receptor 1. Two alternatively spliced variants encoding different active isoforms have been identified.
20899	The product of this gene belongs to the family of purinoceptors for ATP. This receptor functions as a ligand-gated ion channel and is responsible for ATP-dependent lysis of macrophages through the formation of membrane pores permeable to large molecules. Activation of this nuclear receptor by ATP in the cytoplasm may be a mechanism by which cellular activity can be coupled to changes in gene expression. Alternatively spliced variants encoding different isoforms have been identified.
20900	
20901	This gene encodes a protein that is a member of the SUMO (small ubiquitin-like modifier) protein family. It functions in a manner similar to ubiquitin in that it is bound to target proteins as part of a post-translational modification system. However, unlike ubiquitin which targets proteins for degradation, this protein is involved in a variety of cellular processes, such as nuclear transport, transcriptional regulation, apoptosis, and protein stability. It is not active until the last four amino acids of the carboxy-terminus have been cleaved off. Several pseudogenes have been reported for this gene. Alternate transcriptional splice variants encoding different isoforms have been characterized.
20902	
20903	
20904	The glycoprotein encoded by this gene is a cell surface antigen that is expressed in greater than 95% of human colon cancers. The open reading frame encodes a 319-amino acid polypeptide having a putative secretory signal sequence and 3 potential glycosylation sites. The predicted mature protein has a 213-amino acid extracellular region, a single transmembrane domain, and a 62-amino acid intracellular tail. The sequence of the extracellular region contains 2 domains characteristic of the CD2 subgroup of the immunoglobulin (Ig) superfamily.
20905	
20906	The protein encoded by this gene belongs to the Bcl-2 family. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified. The longer gene product (isoform 1) enhances cell survival by inhibiting apoptosis while the alternatively spliced shorter gene product (isoform 2) promotes apoptosis and is death-inducing.
20907	
20908	
20909	This gene encodes one of several forms of glutamic acid decarboxylase, identified as a major autoantigen in insulin-dependent diabetes. The enzyme encoded is responsible for catalyzing the production of gamma-aminobutyric acid from L-glutamic acid. A pathogenic role for this enzyme has been identified in the human pancreas since it has been identified as an autoantigen and an autoreactive T cell target in insulin-dependent diabetes. This gene may also play a role in the stiff man syndrome. Deficiency in this enzyme has been shown to lead to pyridoxine dependency with seizures. Alternative splicing of this gene results in two products, the predominant 67-kD form and a less-frequent 25-kD form.
20910	
20911	Homeobox genes, of which the most well-characterized category is represented by the HOX genes, play a crucial role in normal development. In addition, several homeoproteins are involved in neoplasia. MEIS1 encodes a homeobox protein belonging to the TALE (@#$%&three amino acid loop extension@#$%&) family of homeodomain-containing proteins.
20912	This gene, a muscle member of the immunoglobulin gene superfamily, encodes myosin light chain kinase which is a calcium/calmodulin dependent enzyme. This kinase phosphorylates myosin regulatory light chains to facilitate myosin interaction with actin filaments to produce contractile activity. This gene encodes both smooth muscle and nonmuscle isoforms. In addition, using a separate promoter in an intron in the 3@#$%& region, it encodes telokin, a small protein identical in sequence to the C-terminus of myosin light chain kinase, that is independently expressed in smooth muscle and functions to stabilize unphosphorylated myosin filaments. A pseudogene is located on the p arm of chromosome 3. Seven transcript variants that produce seven isoforms of the calcium/calmodulin dependent enzyme have been identified as well as two transcripts that produce two isoforms of telokin.
20913	The signal sequence receptor (SSR) is a glycosylated endoplasmic reticulum (ER) membrane receptor associated with protein translocation across the ER membrane. The SSR is comprised of four membrane proteins/subunits: alpha, beta, gamma, and delta. The first two are glycosylated subunits and the latter two are non-glycosylated subunits. The protein encoded by this gene is the gamma subunit and is predicted to span the membrane four times.
20914	This gene encodes a caveolin family member, which functions as a component of the caveolae plasma membranes found in most cell types. Caveolin proteins are proposed to be scaffolding proteins for organizing and concentrating certain caveolin-interacting molecules. Mutations identified in this gene lead to interference with protein oligomerization or intra-cellular routing, disrupting caveolae formation and resulting in Limb-Girdle muscular dystrophy type-1C (LGMD-1C), hyperCKemia or rippling muscle disease (RMD). Alternative splicing has been identified for this locus, with inclusion or exclusion of a differentially spliced intron. In addition, transcripts utilize multiple polyA sites and contain two potential translation initiation sites.
20915	Transducin is a 3-subunit guanine nucleotide-binding protein (G protein) which stimulates the coupling of rhodopsin and cGMP-phoshodiesterase during visual impulses. The transducin alpha subunits in rods and cones are encoded by separate genes. This gene encodes the alpha subunit in rods. Alternative splicing of this gene results in two transcript variants.
20916	
20917	
20918	
20919	
20920	
20921	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has three repeats of quasi-RRM domains that bind to RNAs. It is very similar to the family member HNRPF. This gene is thought to be potentially involved in hereditary lymphedema type I phenotype.
20922	Dihydrofolate reductase converts dihydrofolate into tetrahydrofolate, a methyl group shuttle required for the de novo synthesis of purines, thymidylic acid, and certain amino acids.  While the functional dihydrofolate reductase gene has been mapped to chromosome 5, four intronless processed pseudogenes have been identified on separate chromosomes.  Dihydrofolate reductase deficiency has been linked to megaloblastic anemia.
20923	
20924	
20925	
20926	
20927	Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. It is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, F0, which comprises the proton channel. The F1 complex consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled in a ratio of 3 alpha, 3 beta, and a single representative of the other 3. The F0 seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the f subunit of the F0 complex. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
20928	
20929	This gene is a member of the insulin-like growth factor binding protein (IGFBP) family and encodes a protein with an IGFBP domain and a thyroglobulin type-I domain. The protein forms a ternary complex with insulin-like growth factor acid-labile subunit (IGFALS) and either insulin-like growth factor (IGF) I or II. In this form, it circulates in the plasma, prolonging the half-life of IGFs and altering their interaction with cell surface receptors. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
20930	
20931	
20932	
20933	The protein encoded by this gene shares similarity with the yeast Mob1 protein. Yeast Mob1 binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. This gene is located on the opposite strand as the interferon kappa precursor (IFNK) gene.
20934	
20935	
20936	Proteoglycans, which consist of a core protein and covalently linked glycosaminoglycans, are components of the extracellular matrix. SPOCK2 encodes a member of a novel Ca(2+)-binding proteoglycan family.[supplied by OMIM]
20937	
20938	
20939	
20940	
20941	The product of this gene belongs to the ligand-gated ion channel receptor superfamily. This gene encodes a subunit E of the type 3 receptor for 5-hydroxytryptamine (serotonin), a biogenic hormone that functions as a neurotransmitter, a hormone, and a mitogen. This receptor causes fast, depolarizing responses in neurons after activation. Genes encoding subunits C, D and E form a cluster on chromosome 3. An alternative splice variant has been described but its full length sequence has not been determined.
20942	
20943	
20944	
20945	
20946	
20947	
20948	
20949	
20950	
20951	
20952	
20953	
20954	
20955	
20956	
20957	
20958	The protein encoded by this gene is a subunit of nuclear domain 10 (ND10) bodies. ND10 bodies are nuclear domains appearing immunohistochemically as ten dots per nucleus. They are believed to be associated with the nuclear matrix on the basis of their resistance to nuclease digestion and salt extraction. ND10 proteins are removed from the nucleus by herpes simplex virus-1 infection and may have a role in viral life cycles.
20959	
20960	
20961	The protein encoded by this gene is the ER to nucleus signalling 1 protein, a human homologue of the yeast Ire1 gene product. This protein possesses intrinsic kinase activity and an endoribonuclease activity and it is important in altering gene expression as a response to endoplasmic reticulum-based stress signals. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
20962	
20963	
20964	The protein encoded by this gene is a member of the AAA protein family (ATPases associated with diverse cellular activities), and is the homolog of the yeast Vps4 protein. In humans, two paralogs of the yeast protein have been identified. The former share a high degree of aa sequence similarity with each other, and also with yeast Vps4 and mouse Skd1 proteins. The mouse Skd1 (suppressor of K+ transport defect 1) has been shown to be really an yeast Vps4 ortholog. Functional studies indicate that both human paralogs associate with the endosomal compartments, and are involved in intracellular protein trafficking, similar to Vps4 protein in yeast. The gene encoding this paralog has been mapped to chromosome 18; the gene for the other resides on chromosome 16. Usage of multiple polyadenylation sites have been noted for this gene.
20965	The hypoxia inducible factor (HIF) is a transcriptional complex which is involved in oxygen homeostasis. At normal oxygen levels, the alpha subunit of HIF is targeted for degration by prolyl hydroxylation. This gene encodes an enzyme responsible for this posttranslational modification. Alternative splicing of this gene results in three transcript variants encoding different isoforms.
20966	
20967	
20968	
20969	
20970	Apoptosis, a major form of cell death, is an efficient mechanism for eliminating unwanted cells and is of central importance for development and homeostasis in metazoan animals. In mice, the death inducer-obliterator-1 gene is upregulated by apoptotic signals and encodes a cytoplasmic protein that translocates to the nucleus upon apoptotic signal activation. When overexpressed, the mouse protein induced apoptosis in cell lines growing in vitro. This gene is similar to the mouse gene and therefore is thought to be involved in apoptosis. At least four alternatively spliced transcripts have been found for this gene, but the full-length nature of some variants has not been determined.
20971	The protein encoded by this gene interacts with the dedicator of cyto-kinesis 1 protein. Similarity to a C. elegans protein suggests that this protein may function in phagocytosis of apoptotic cells and in cell migration. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
20972	
20973	Dystrophin is a large, rod-like cytoskeletal protein found at the inner surface of muscle fibers. Dystrophin is missing in Duchenne Muscular Dystrophy patients and is present in reduced amounts in Becker Muscular Dystrophy patients. The protein encoded by this gene is a peripheral membrane protein found associated with dystrophin and dystrophin-related proteins. This gene is a member of the syntrophin gene family, which contains at least two other structurally-related genes.
20974	The protein encoded by this gene belongs to the cytokine receptor family. It is an accessory chain essential for the active interleukin 10 receptor complex. Coexpression of this and IL10RA proteins has been shown to be required for IL10-induced signal transduction. This gene and three other interferon receptor genes, IFAR2, IFNAR1, and IFNGR2, form a class II cytokine receptor gene cluster located in a small region on chromosome 21.
20975	
20976	Members of the trefoil family are characterized by having at least one copy of the trefoil motif, a 40-amino acid domain that contains three conserved disulfides. They are stable secretory proteins expressed in gastrointestinal mucosa. Their functions are not defined, but they may protect the mucosa from insults, stabilize the mucus layer, and affect healing of the epithelium. This gene, which is expressed in the gastric mucosa, has also been studied because of its expression in human tumors. This gene and two other related trefoil family member genes are found in a cluster on chromosome 21.
20977	
20978	The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which is associated with cellular proliferation and differentiation. Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. This gene product inactivates SAPK/JNK and p38, is expressed predominantly in the adult brain, heart, and skeletal muscle, is localized in the cytoplasm, and is induced by nerve growth factor and insulin. An intronless pseudogene for DUSP8 is present on chromosome 10q11.2.
20979	
20980	
20981	
20982	The protein encoded by this gene is a preproprotein that is cleaved to form a secreted 10 aa peptide hormone. The secreted decapeptide regulates reproduction in females by stimulating the secretion of both luteinizing- and follicle-stimulating hormones. Three transcript variants that encode unique proproteins but the same peptide hormone have been found for this gene.
20983	
20984	This gene is a homolog of the Drosophila diaphanous gene, and has been linked to autosomal dominant, fully penetrant, nonsyndromic sensorineural progressive low-frequency hearing loss. Actin polymerization involves proteins known to interact with diaphanous protein in Drosophila and mouse. It has therefore been speculated that this gene may have a role in the regulation of actin polymerization in hair cells of the inner ear.
20985	
20986	
20987	
20988	This gene encodes a protein with multiple ankyrin repeat domains and a single KH-domain. Co-transcription of this gene and the neighboring downstream gene (EIF4EBP3) generates a transcript (MASK-BP3) which encodes a fusion protein comprised of the MASK protein sequence for the majority of the protein and a different C-terminus due to an alternate reading frame for the EIF4EBP3 segments.
20989	
20990	
20991	
20992	This gene contains two ankyrin repeat-encoding regions. Ankyrin repeats are tandemly repeated modules of about 33 amino acids described as L-shaped structures consisting of a beta-hairpin and two alpha-helices. Ankyrin repeats occur in a large number of functionally diverse proteins, mainly from eukaryotes, and are known to function as protein-protein interaction domains. Alternative splicing has been observed for this gene but the full-length nature of additional variants has not been determined.
20993	
20994	
20995	MYH14 is a member of the nonmuscle myosin II family of ATP-dependent molecular motors, which interact with cytoskeletal actin and regulate cytokinesis, cell motility, and cell polarity (Golomb et al., 2004).[supplied by OMIM]
20996	This gene encodes a protein that may have acetyltransferase activity and may be required for the establishment of sister chromatid cohesion during the S phase of mitosis. Mutations in this gene have been associated with Roberts syndrome.
20997	This gene belongs to the ADAMTS (A Disintegrin And Metalloproteinase domain, with ThromboSpondin type-1 modules) family of zinc-dependent proteases. ADAMTS proteases are complex secreted enzymes containing a prometalloprotease domain of the reprolysin type attached to an ancillary domain with a highly conserved structure that includes at least one thrombospondin type 1 repeat. They have been demonstrated to have important roles in connective tissue organization, coagulation, inflammation, arthritis, angiogenesis and cell migration. The product of this gene plays a major role in growth and in skin, lens, and heart development. It is also a candidate gene for autosomal recessive Weill-Marchesani syndrome.
20998	
20999	
21000	
21001	
21002	
21003	
21004	
21005	
21006	Ataxin-2 binding protein 1 has an RNP motif that is highly conserved among RNA-binding proteins. This protein binds to the C-terminus of ataxin-2 and may contribute to the restricted pathology of spinocerebellar ataxia type 2 (SCA2). Ataxin-2 is the gene product of the SCA2 gene which causes familial neurodegenerative diseases. Ataxin-2 binding protein 1 and ataxin-2 are both localized in the trans-Golgi network. Four alternatively spliced transcript variants have been found for this gene. Additional transcript variants have been found but their full length nature has not been determined.
21007	
21008	
21009	
21010	
21011	
21012	
21013	
21014	
21015	
21016	
21017	
21018	Homeobox genes encode DNA-binding proteins, many of which are thought to be involved in early embryonic development. Homeobox genes encode a DNA-binding domain of 60 to 63 amino acids referred to as the homeodomain. This gene is a member of the TPRX homeobox gene family.
21019	
21020	
21021	
21022	
21023	
21024	
21025	
21026	
21027	
21028	
21029	
21030	
21031	
21032	
21033	
21034	
21035	
21036	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intra-cellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the MDR/TAP subfamily.  Members of the MDR/TAP subfamily are involved in multidrug resistance.  The function of this mitochondrial protein is unknown.
21037	
21038	
21039	
21040	
21041	
21042	
21043	
21044	
21045	
21046	
21047	
21048	
21049	The family of myotubularin-related proteins includes lipid phosphatases, such as MTM1 (MIM 600415), and pseudophosphatases, such as SBF1 (MIM 603560) and SBF2. Pseudophosphatases contain inactivating substitutions at the catalytic cysteine.[supplied by OMIM]
21050	
21051	
21052	
21053	
21054	
21055	Activin A receptor, type II-like 1 (also called activin receptor-like kinase 1) is a type I cell-surface receptor for the TGF-beta superfamily of ligands. It shares with other type I receptors a high degree of similarity in serine-threonine kinase subdomains, a glycine- and serine-rich region (called the GS domain) preceding the kinase domain, and a short C-terminal tail.  The protein (sometimes termed ALK1) shares similar domain structures with 3 other closely related ALK proteins (ALK2, ALK3, and ALK4) that together form a subfamily of receptor serine/threonine kinases.  ACVRL1 is most highly expressed in human placenta and lung. ACVRL1 deficiency causes hemorrhagic telangiectasia type 2 (HHT2; MIM 600376) also known as Rendu-Osler-Weber syndrome 2 (ORW2).
21056	
21057	
21058	
21059	
21060	This gene encodes retinoic acid receptor beta, a member of the thyroid-steroid hormone receptor superfamily of nuclear transcriptional regulators. This receptor localizes to the cytoplasm and to subnuclear compartments. It binds retinoic acid, the biologically active form of vitamin A which mediates cellular signalling in embryonic morphogenesis, cell growth and differentiation. It is thought that this protein limits growth of many cell types by regulating gene expression. The gene was first identified in a hepatocellular carcinoma where it flanks a hepatitis B virus integration site. The gene expresses at least two transcript variants; one additional transcript has been described, but its full length nature has not been determined.
21061	This gene encodes a WD repeat protein, which interacts with Cockayne syndrome type B (CSB) protein and with p44 protein, a subunit of the RNA polymerase II transcription factor IIH. Mutations in this gene have been identified in patients with hereditary disease Cockayne syndrome (CS). The CS cells are abnormally sensitive to ultraviolet radiation and are defective in the repair of transcriptionally active genes. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
21062	
21063	
21064	
21065	
21066	
21067	The protein encoded by this gene belongs to the RGS (regulator of G protein signaling) family. Members of the RGS family act as GTPase-activating proteins on the alpha subunits of heterotrimeric, signal-transducing G proteins. This protein inhibits signal transduction by increasing the GTPase activity of G protein alpha subunits, thereby driving them into their inactive GDP-bound form. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
21068	
21069	
21070	
21071	
21072	The central core of each mitogen-activated protein kinase (MAPK) pathway is a conserved cascade of 3 protein kinases: an activated MAPK kinase kinase (MAPKKK) phosphorylates and activates a specific MAPK kinase (MAPKK), which then activates a specific MAPK. While the ERK MAPKs are activated by mitogenic stimulation, the CSBP2 and JNK MAPKs are activated by environmental stresses such as osmotic shock, UV irradiation, wound stress, and inflammatory factors. This gene encodes a MAPKKK, the MEKK4 protein, also called MTK1. This protein contains a protein kinase catalytic domain at the C terminus. The N-terminal nonkinase domain may contain a regulatory domain. Expression of MEKK4 in mammalian cells activated the CSBP2 and JNK MAPK pathways, but not the ERK pathway. In vitro kinase studies indicated that recombinant MEKK4 can specifically phosphorylate and activate PRKMK6 and SERK1, MAPKKs that activate CSBP2 and JNK, respectively but cannot phosphorylate PRKMK1, an MAPKK that activates ERKs. MEKK4 is a major mediator of environmental stresses that activate the CSBP2 MAPK pathway, and a minor mediator of the JNK pathway. Two alternatively spliced transcripts encoding distinct isoforms have been described.
21073	
21074	This gene encodes a protein that is a multi-ligand type-1 receptor with similarity to the yeast carboxypeptidase Y sorting receptor Vps10 protein. The encoded protein, a trans-Golgi network (TGN) transmembrane protein, binds a number of unrelated ligands that participate in a wide range of cellular processes; however, it lacks the typical features of a signalling receptor. In the TGN, furin mediates the activation of the mature binding form. The encoded protein consists of a large luminal domain, a single transmembrane segment and short C-terminal cytoplasmic tail. The luminal domain contains a cysteine-rich region similar to two corresponding segments in the yeast Vps10p; the cytoplasmic tail is similar to the corresponding segment of the cation-independent mannose 6-phosphate receptor and the tail also interacts with the VHS domains of GGA (Golgi-associated, gamma-adaptin homologous, ARF-interacting) proteins.
21075	
21076	
21077	
21078	
21079	The leukemia inhibitory factor is a polyfunctional cytokine that affects the differentiation, survival, and proliferation of a wide variety of cells in the adult and the embryo. LIF action appears to be mediated through a high-affinity receptor complex composed of a low-affinity LIF binding chain (LIF receptor) and a high-affinity converter subunit, gp130. Both LIFR and gp130 are members of a family of cytokine receptors that includes components of the receptors for the majority of hematopoietic cytokines and for cytokines that affect other systems, including the ciliary neurotrophic factor, growth hormone and prolactin.
21080	HLA-E belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-E binds a restricted subset of peptides derived from the leader peptides of other class I molecules. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail.
21081	HLA-DQA1 belongs to the HLA class II alpha chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DQA) and a beta chain (DQB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B Lymphocytes, dendritic cells, macrophages). The alpha chain is approximately 33-35 kDa. It is encoded by 5 exons; exon 1 encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, and exon 4 encodes the transmembrane domain and the cytoplasmic tail. Within the DQ molecule both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to four different molecules. Typing for these polymorphisms is routinely done for bone marrow transplantation.
21082	WD40 repeat proteins are key components of many essential biologic functions. They regulate the assembly of multiprotein complexes by presenting a beta-propeller platform for simultaneous and reversible protein-protein interactions. Members of the WIPI subfamily of WD40 repeat proteins, such as WIPI2, have a 7-bladed propeller structure and contain a conserved motif for interaction with phospholipids (Proikas-Cezanne et al., 2004).[supplied by OMIM]
21083	
21084	
21085	
21086	
21087	
21088	Beta-adrenergic receptor kinase (ADRBK1) phosphorylates the beta-2-adrenergic receptor and appears to mediate agonist-specific desensitization observed at high agonist concentrations. ADRBK1 is an ubiquitous cytosolic enzyme that specifically phosphorylates the activated form of the beta-adrenergic and related G-protein-coupled receptors. The ADRBK1 gene spans approximately 23 kb and is composed of 21 exons.  Heart failure is accompanied by severely impaired beta-adrenergic receptor (beta-AR) function. An important mechanism for the rapid desensitization of beta-AR function is agonist-stimulated receptor phosphorylation by the beta-AR kinase (beta-ARK1), an enzyme known to be elevated in failing human heart tissue. Abnormal coupling of beta-adrenergic receptor to G protein is involved in the pathogenesis of the failing heart. Inhibition of ADRBK1 is a novel mode of therapy.
21089	
21090	
21091	The protein encoded by this gene is a mitochondrial phosphate-activated glutaminase that catalyzes the hydrolysis of glutamine to stoichiometric amounts of glutamate and ammonia. This protein is functionally similar to the kidney glutaminase but is a little smaller in size. Originally thought to be liver-specific, this protein has been found in other tissues as well. At least one transcribed pseudogene and three transcript variants have been found for this gene, although the full-length nature of one of the variants has not been determined.
21092	
21093	This gene encodes a WD repeat-containing protein. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
21094	
21095	
21096	
21097	
21098	
21099	The WNK4 gene encodes a serine-threonine kinase expressed in distal nephron.[supplied by OMIM]
21100	
21101	
21102	
21103	
21104	
21105	
21106	The Golgi complex plays a key role in the sorting and modification of proteins exported from the endoplasmic reticulum. The protein encoded by this gene is involved in establishing the stacked structure of the Golgi apparatus.
21107	
21108	
21109	
21110	The protein encoded by this gene contains a SH2 domain and a SOCS BOX domain. The protein thus belongs to the suppressor of cytokine signaling (SOCS) family, also known as STAT-induced STAT inhibitor (SSI) protein family. SOCS family members are known to be cytokine-inducible negative regulators of cytokine signaling. The specific function of this protein has not yet been determined. Two alternatively spliced transcript variants encoding an identical protein have been reported.
21111	
21112	Huntington@#$%&s disease (HD), a neurodegenerative disorder characterized by loss of striatal neurons, is caused by an expansion of a polyglutamine tract in the HD protein huntingtin. This gene encodes a protein that interacts with the huntingtin-associated protein 1, which is a huntingtin binding protein that may function in vesicle trafficking. An alternatively spliced variant has been described but its full-length sequence has not been determined.
21113	
21114	
21115	
21116	
21117	This gene was identified based on its homology to the gene encoding the RING3 protein, a serine/threonine kinase. The gene localizes to 9q34, a region which contains several major histocompatibility complex (MHC) genes. The function of the encoded protein is not known.
21118	
21119	
21120	
21121	
21122	
21123	
21124	
21125	
21126	
21127	
21128	
21129	This gene encodes an ubiquitin-like protein (ubiquilin) that shares high degree of similarity with related products in yeast, rat and frog. Ubiquilins contain a N-terminal ubiquitin-like domain and a C-terminal ubiquitin-associated domain. They physically associate with both proteasomes and ubiquitin ligases, and thus are thought to functionally link the ubiquitination machinery to the proteasome to affect in vivo protein degradation. This ubiquilin has also been shown to modulate accumulation of presenilin proteins, and is found in lesions associated with Alzheimer@#$%&s and Parkinson@#$%&s disease. Two transcript variants encoding different isoforms have been found for this gene.
21130	This gene encodes an ubiquitin-like protein (ubiquilin) that shares high degree of similarity with related products in yeast, rat and frog. Ubiquilins contain a N-terminal ubiquitin-like domain and a C-terminal ubiquitin-associated domain. They physically associate with both proteasomes and ubiquitin ligases, and thus are thought to functionally link the ubiquitination machinery to the proteasome to affect in vivo protein degradation. This ubiquilin has also been shown to modulate accumulation of presenilin proteins, and is found in lesions associated with Alzheimer@#$%&s and Parkinson@#$%&s disease. Two transcript variants encoding different isoforms have been found for this gene.
21131	
21132	
21133	The expression of this gene in adipose tissue is induced by fasting. The function of protein encoded by this gene is currently unknown.
21134	The protein encoded by this gene is a peripheral membrane protein that is a component of tight junctions or TJs. TJs form an apical junctional structure and act to control paracellular permeability and maintain cell polarity. This protein is related to angiomotin, an angiostatin binding protein that regulates endothelial cell migration and capillary formation.
21135	
21136	
21137	Vesicle mediated protein sorting plays an important role in segregation of intracellular molecules into distinct organelles. Genetic studies in yeast have identified more than 40 vacuolar protein sorting (VPS) genes involved in vesicle transport to vacuoles. This gene encodes the human homolog of yeast class C Vps11 protein. The mammalian class C Vps proteins are predominantly associated with late endosomes/lysosomes, and like their yeast counterparts, may mediate vesicle trafficking steps in the endosome/lysosome pathway.
21138	This gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. The product of this gene contains four conserved cysteines characteristic of an atypical zinc-binding domain. Like its counterpart in yeast, this subunit may be shared by the other two DNA-directed RNA polymerases.
21139	
21140	
21141	
21142	
21143	Endothelin receptor type B is a G protein-coupled receptor which activates a phosphatidylinoitol-calcium second messenger system. Its ligand, endothelin, consists of a family of three potent vasoactive peptides: ET1, ET2, and ET3. Studies suggest that the multigenic disorder, Hirschsprung disease type 2, is due to mutation in endothelin receptor type B gene.  A splice variant, named SVR, has been described; the sequence of the ETB-SVR receptor is identical to ETRB except for the intracellular C-terminal domain.  While both splice variants bind ET1, they exhibit different responses upon binding which suggests that they may be functionally distinct.
21144	
21145	The RCOR gene encodes a functional corepressor required for regulation of neural-specific gene expression.[supplied by OMIM]
21146	
21147	
21148	
21149	
21150	
21151	This nuclear gene encodes a mitochondrial protein tyrosine aminotransferase which is present in the liver and catalyzes the conversion of L-tyrosine into p-hydroxyphenylpyruvate. Mutations in this gene cause tyrosinemia (type II, Richner-Hanhart syndrome), a disorder accompanied by major skin and corneal lesions, with possible mental retardation. A regulator gene for tyrosine aminotransferase is X-linked.
21152	
21153	4-aminobutyrate aminotransferase (ABAT) is responsible for catabolism of gamma-aminobutyric acid (GABA), an important, mostly inhibitory neurotransmitter in the central nervous system, into succinic semialdehyde. The active enzyme is a homodimer of 50-kD subunits complexed to pyridoxal-5-phosphate. The protein sequence is over 95% similar to the pig protein. GABA is estimated to be present in nearly one-third of human synapses. ABAT in liver and brain is controlled by 2 codominant alleles with a frequency in a Caucasian population of 0.56 and 0.44. The ABAT deficiency phenotype includes psychomotor retardation, hypotonia, hyperreflexia, lethargy, refractory seizures, and EEG abnormalities. Two alternatively spliced transcript variants encoding the same protein isoform have been found for this gene.
21154	This gene encodes a protein that is a member of the SUMO (small ubiquitin-like modifier) protein family. It functions in a manner similar to ubiquitin in that it is bound to target proteins as part of a post-translational modification system. However, unlike ubiquitin which targets proteins for degradation, this protein is involved in a variety of cellular processes, such as nuclear transport, transcriptional regulation, apoptosis, and protein stability. It is not active until the last two amino acids of the carboxy-terminus have been cleaved off. Numerous pseudogenes have been reported for this gene. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
21155	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXB genes located in a cluster on chromosome 17. The product of the Hoxb2 mouse gene is essential for motor neuron development in the @#$%&ventral@#$%& basal plate of rhombomere 4. The exact role of the human gene has yet to be determined.
21156	The protein encoded by this gene is a subunit of nuclear domain 10 (ND10) bodies. ND10 bodies are nuclear domains appearing immunohistochemically as ten dots per nucleus. They are believed to be associated with the nuclear matrix on the basis of their resistance to nuclease digestion and salt extraction. ND10 proteins are removed from the nucleus by herpes simplex virus-1 infection and may have a role in viral life cycles.
21157	TCF2 encodes transcription factor 2, a liver-specific factor of the homeobox-containing basic helix-turn-helix family. The TCF2 protein is believed to form heterodimers with another liver-specific member of this transcription factor family, TCF1; depending on the TCF2 isoform, the result may be to activate or inhibit transcription of target genes. Mutation of TCF2 that disrupts normal function has been identified as the cause of MODY5 (Maturity-Onset of Diabetes, Type 5). A third human transcript variant is believed to exist based on such a variant in the rat: however, to date such an mRNA species has not been isolated.
21158	
21159	
21160	
21161	
21162	
21163	
21164	
21165	This gene belongs to the TIMP gene family. The proteins encoded by this gene family are natural inhibitors of the matrix metalloproteinases, a group of peptidases involved in degradation of the extracellular matrix. The expression of this gene is induced in response to mitogenic stimulation and subject to cell cycle regulation. Mutations in this gene cause the autosomal dominant disorder Sorsby@#$%&s fundus dystrophy.
21166	This gene is a member of the MAGEA gene family. The members of this family encode proteins with 50 to 80% sequence identity to each other. The promoters and first exons of the MAGEA genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. The MAGEA genes are clustered at chromosomal location Xq28. They have been implicated in some hereditary disorders, such as dyskeratosis congenita. Two transcript variants encoding the same protein have been found for this gene.
21167	The protein encoded by this gene catalyzes the conversion of sulfated steroid precursors to estrogens during pregnancy. The encoded protein is found in the endoplasmic reticulum, where it acts as a homodimer. Mutations in this gene are known to cause X-linked ichthyosis (XLI).
21168	
21169	
21170	
21171	
21172	
21173	TIRP is a Toll/interleukin-1 receptor (IL1R; MIM 147810) (TIR) domain-containing adaptor protein involved in Toll receptor signaling (see TLR4; MIM 603030).[supplied by OMIM]
21174	
21175	
21176	
21177	
21178	
21179	The collagens are a superfamily of proteins that play a role in maintaining the integrity of various tissues. Collagens are extracellular matrix proteins and have a triple-helical domain as their common structural element. Collagen VI is a major structural component of microfibrils. The basic structural unit of collagen VI is a heterotrimer of the alpha1(VI), alpha2(VI), and alpha3(VI) chains. The alpha2(VI) and alpha3(VI) chains are encoded by the COL6A2 and COL6A3 genes, respectively. The protein encoded by this gene is the alpha 1 subunit of type VI collagen (alpha1(VI) chain). Mutations in the genes that code for the collagen VI subunits result in the autosomal dominant disorder, Bethlem myopathy.
21180	
21181	
21182	
21183	
21184	
21185	
21186	
21187	
21188	
21189	
21190	
21191	
21192	
21193	
21194	
21195	
21196	
21197	
21198	
21199	
21200	
21201	
21202	
21203	The ADF (actin-depolymerizing factor)/cofilin family (see MIM 601442) is composed of stimulus-responsive mediators of actin dynamics. ADF/cofilin proteins are inactivated by kinases such as LIM domain kinase-1 (LIMK1; MIM 601329). The SSH family appears to play a role in actin dynamics by reactivating ADF/cofilin proteins in vivo (Niwa et al., 2002).[supplied by OMIM]
21204	
21205	
21206	
21207	
21208	
21209	
21210	
21211	
21212	
21213	
21214	
21215	
21216	
21217	
21218	HLA-DOA belongs to the HLA class II alpha chain paralogues. HLA-DOA forms a heterodimer with HLA-DOB. The heterodimer, HLA-DO, is found in lysosomes in B cells and regulates HLA-DM-mediated peptide loading on MHC class II molecules. In comparison with classical HLA class II molecules, this gene exhibits very little sequence variation, especially at the protein level.
21219	The protein encoded by this gene contains similarity to the Drosophila goliath protein and thus may function as a transcription factor.
21220	
21221	
21222	
21223	
21224	
21225	
21226	
21227	
21228	This gene encodes a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, namely a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region.
21229	
21230	
21231	
21232	
21233	
21234	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
21235	This gene encodes a trypsinogen, which is a member of the trypsin family of serine proteases. This enzyme is secreted by the pancreas and cleaved to its active form in the small intestine. It is active on peptide linkages involving the carboxyl group of lysine or arginine. Mutations in this gene are associated with hereditary pancreatitis. This gene and several other trypsinogen genes are localized to the T cell receptor beta locus on chromosome 7.
21236	
21237	
21238	
21239	
21240	
21241	Zinc is an essential cofactor for hundreds of enzymes. It is involved in protein, nucleic acid, carbohydrate, and lipid metabolism, as well as in the control of gene transcription, growth, development, and differentiation. SLC39A14 belongs to a subfamily of proteins that show structural characteristics of zinc transporters (Taylor and Nicholson, 2003).[supplied by OMIM]
21242	
21243	Members of the F-box protein family, such as FBXO42, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
21244	
21245	
21246	The protein encoded by this gene is a member of a superfamily of WD repeat proteins.  The function of this protein is not yet determined; however, other related family members perform functions involved in regulatory processes.
21247	
21248	
21249	
21250	
21251	
21252	The synaptonemal complex is a proteinaceous structure that links homologous chromosomes during the prophase of meiosis. The protein encoded by this gene is a major component of the synaptonemal complex and may bind DNA at scaffold attachment regions. The encoded protein requires synaptonemal complex protein 3, but not 1, for inclusion in the synaptonemal complex.
21253	The protein encoded by this gene is a member of the Ser/Thr protein kinase family, and is most closely related to mitogen-activated protein kinases (MAP kinases). MAP kinases also known as extracellular signal-regulated kinases (ERKs), are activated through protein phosphorylation cascades and act as integration points for multiple biochemical signals. This kinase is localized in the nucleus, and has been reported to be activated in fibroblasts upon treatment with serum or phorbol esters.
21254	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. Expression of this gene is regulated by steroid hormones and may be useful as a marker for breast cancer. An additional transcript variant has been identified, but its full length sequence has not been determined.
21255	
21256	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This gene encodes one of three A subunit proteins and the encoded protein is associated with clathrin-coated vesicles. The occurrence of splice variants encoding different protein products has been reported, but the full-length natures of these transcripts have not been determined.
21257	
21258	
21259	
21260	
21261	
21262	
21263	This gene encodes fibronectin, a glycoprotein present in a soluble dimeric form in plasma, and in a dimeric or multimeric form at the cell surface and in extracellular matrix. Fibronectin is involved in cell adhesion and migration processes including embryogenesis, wound healing, blood coagulation, host defense, and metastasis. The gene has three regions subject to alternative splicing, with the potential to produce 20 different transcript variants. However, the full-length nature of some variants has not been determined.
21264	
21265	
21266	
21267	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth, and invasion. This growth factor lacks the N-terminal signal sequence present in most of the FGF family members, but it contains clusters of basic residues that have been demonstrated to act as a nuclear localization signal. When transfected into mammalian cells, this protein accumulated in the nucleus, but was not secreted. The specific function of this gene has not yet been determined. Two alternatively spliced transcript variants encoding distinct isoforms have been reported.
21268	
21269	
21270	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 100% identical to the mouse ortholog. It interacts with CDC25 phosphatases, RAF1 and IRS1 proteins, suggesting its role in diverse biochemical activities related to signal transduction, such as cell division and regulation of insulin sensitivity. It has also been implicated in the pathogenesis of small cell lung cancer.
21271	This gene product mediates the attachment of erythroblasts to macrophages. This attachment promotes terminal maturation and enucleation of erythroblasts, presumably by suppressing apoptosis. This protein is an integral membrane protein with the N-terminus on the extracellular side and the C-terminus on the cytoplasmic side of the cell. Two immunologically related isoforms of erythroblast macrophage protein with apparent molecular weights of 33 kD and 36 kD were detected in macrophage membranes; this gene encodes the larger isoform. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants has not been determined.
21272	This gene encodes a disintegrin and metalloproteinase with thrombospondin motifs-6 (ADAMTS6), which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of some variants has not been determined.
21273	
21274	
21275	
21276	GPR141 is a member of the rhodopsin family of G protein-coupled receptors (GPRs) (Fredriksson et al., 2003).[supplied by OMIM]
21277	
21278	
21279	
21280	
21281	
21282	
21283	
21284	
21285	The protein encoded by this gene regulates the turnover of diphosphoinositol polyphosphates. The turnover of these high-energy diphosphoinositol polyphosphates represents a molecular switching activity with important regulatory consequences. Molecular switching by diphosphoinositol polyphosphates may contribute to regulating intracellular trafficking. Several alternatively spliced transcript variants have been described, but the full-length nature of some variants has not been determined. Isoforms DIPP2alpha and DIPP2beta are distinguishable from each other solely by DIPP2beta possessing one additional amino acid due to intron boundary skidding in alternate splicing.
21286	PUS1 converts uridine into pseudouridine after the nucleotide has been incorporated into RNA. Pseudouridine may have a functional role in tRNAs and may assist in the peptidyl transfer reaction of rRNAs (Chen and Patton, 1999).[supplied by OMIM]
21287	
21288	
21289	This gene encodes a transmembrane and secreted protein with characteristics of a type 1a transmembrane protein. It is found in a perinuclear Golgi-like pattern and thought to be involved in hematopoietic and/or immune system processes.
21290	
21291	This gene encodes an unconventional myosin. This protein differs from other myosins in that it has a long N-terminal extension preceding the conserved motor domain. Studies in mice suggest that this protein is necessary for actin organization in the hair cells of the cochlea. Mutations in this gene have been associated with profound, congenital, neurosensory, nonsyndromal deafness. This gene is located within the Smith-Magenis syndrome region on chromosome 17. Read-through transcripts containing an upstream gene and this gene have been identified, but they are not thought to encode a fusion protein. Several alternatively spliced transcript variants have been described, but their full length sequences have not been determined.
21292	This gene encodes adenylate cyclase activating polypeptide 1. Mediated by adenylate cyclase activating polypeptide 1 receptors, this polypeptide stimulates adenylate cyclase and subsequently increases the cAMP level in target cells. Adenylate cyclase activating polypeptide 1 is not only a hypophysiotropic hormone, but also functions as a neurotransmitter and neuromodulator. In addition, it plays a role in paracrine and autocrine regulation of certain types of cells. This gene is composed of five exons. Exons 1 and 2 encode the 5@#$%& UTR and signal peptide, respectively; exon 4 encodes an adenylate cyclase activating polypeptide 1-related peptide; and exon 5 encodes the mature peptide and 3@#$%& UTR. This gene encodes three different mature peptides, including two isotypes: a shorter form and a longer form.
21293	
21294	
21295	
21296	Calcineurin plays an important role in the T-cell receptor-mediated signal transduction pathway. The protein encoded by this gene binds specifically to the activated form of calcineurin and inhibits calcineurin-mediated signal transduction. The encoded protein is found in the nucleus and contains a leucine zipper domain as well as several PEST motifs, sequences which confer targeted degradation to those proteins which contain them. At least four alternatively spliced transcripts have been found for this gene, but the full-length nature of most of them has not been determined.
21297	
21298	
21299	The helix-loop-helix (HLH) proteins are a family of putative transcription factors, some of which have been shown to play an important role in growth and development of a wide variety of tissues and species. Four members of this family have been clearly implicated in tumorigenesis via their involvement in chromosomal translocations in lymphoid tumors: MYC (MIM 190080), LYL1 (MIM 151440), E2A (MIM 147141), and SCL (MIM 187040).[supplied by OMIM]
21300	
21301	
21302	
21303	The RAS oncogene (MIM 190020) is mutated in nearly one-third of all human cancers. Members of the RAS superfamily are plasma membrane GTP-binding proteins that modulate intracellular signal transduction pathways. A subfamily of RAS effectors, including NORE1, share a RAS association (RA) domain.[supplied by OMIM]
21304	Sperm mitochondria differ in morphology and subcellular localization from those of somatic cells. They are elongated, flattened, and arranged circumferentially to form a helical coiled sheath in the midpiece of the sperm flagellum. The protein encoded by this gene localizes to the capsule associated with the mitochondrial outer membranes and is thought to function in the organization and stabilization of the helical structure of the sperm@#$%&s mitochondrial sheath.
21305	
21306	
21307	The Sec4/Rab-related small GTP-binding proteins are involved in the regulation of intracellular vesicular transport. Mss4 stimulates GTP-GDP exchange in Sec4 and Rab and binds to a subset of genetically related Rab proteins.[supplied by OMIM]
21308	
21309	The protein encoded by this gene belongs to the Bcl-2 family. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified. The longer gene product (isoform 1) enhances cell survival by inhibiting apoptosis while the alternatively spliced shorter gene product (isoform 2) promotes apoptosis and is death-inducing.
21310	The protein encoded by this gene belongs to the Bcl-2 family. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified. The longer gene product (isoform 1) enhances cell survival by inhibiting apoptosis while the alternatively spliced shorter gene product (isoform 2) promotes apoptosis and is death-inducing.
21311	
21312	
21313	
21314	This gene encodes a serine protease. The protein has been localized in the endoplasmic reticulum and interacts with an alternatively spliced form of mitogen-activated protein kinase 14. The protein has also been localized to the mitochondria with release to the cytosol following apoptotic stimulus. The protein is thought to induce apoptosis by binding the apoptosis inhibitory protein baculoviral IAP repeat-containing 4. Nuclear localization of this protein has also been observed. Alternate splicing of this gene results in two transcript variants encoding different isoforms. Additional transcript variants have been described, but their full-length sequences have not been determined.
21315	
21316	
21317	
21318	This gene encodes a member of the @#$%&regulator of G protein signaling@#$%& (RGS) gene family. The encoded protein may function as a guanosine triphosphatase (GTPase)-activating protein as well as a transcriptional repressor. This protein may play a role in tumorigenesis. Multiple transcript variants encoding distinct isoforms have been identified for this gene. Other alternative splice variants have been described but their biological nature has not been determined.
21319	
21320	The protein encoded by this gene belongs to the G-protein coupled receptor family and acts as a receptor for oxytocin. Its activity is mediated by G proteins which activate a phosphatidylinositol-calcium second messenger system. The oxytocin-oxytocin receptor system plays an important role in the uterus during parturition.
21321	
21322	
21323	The protein encoded by this gene catalyzes the deamination of dCMP to dUMP, the nucleotide substrate for thymidylate synthase. The encoded protein is allosterically activated by dCTP and inhibited by dTTP, and is found as a homohexamer. This protein uses zinc as a cofactor for its activity. Two transcript variants encoding different isoforms have been found for this gene.
21324	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has three repeats of quasi-RRM domains that bind to RNAs. It is very similar to the family member HNRPF. This gene is thought to be potentially involved in hereditary lymphedema type I phenotype.
21325	
21326	
21327	
21328	
21329	Butyrophilin is the major protein associated with fat droplets in the milk. This gene is a member of the BTN2 subfamily of genes, which encode proteins belonging to the butyrophilin protein family. The gene is located in a cluster on chromosome 6, consisting of seven genes belonging to the expanding B7/butyrophilin-like group, a subset of the immunoglobulin gene superfamily. The encoded protein is a type 1 receptor glycoprotein involved in lipid, fatty-acid and sterol metabolism. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some variants has not been determined.
21330	This gene encodes a ubiquitous nuclear protein that stimulates both cell proliferation and differentiation. It is a member of the MADS (MCM1, Agamous, Deficiens, and SRF) box superfamily of transcription factors. This protein binds to the serum response element (SRE) in the promoter region of target genes. This protein regulates the activity of many immediate-early genes, for example c-fos, and thereby participates in cell cycle regulation, apoptosis, cell growth, and cell differentiation. This gene is the downstream target of many pathways; for example, the mitogen-activated protein kinase pathway (MAPK) that acts through the ternary complex factors (TCFs).
21331	
21332	
21333	This gene product is a member of the Vanin family of proteins which share extensive sequence similarity with each other, and also with biotinidase. The family includes secreted and membrane-associated proteins, a few of which have been reported to participate in hematopoietic cell trafficking. No biotinidase activity has been demonstrated for any of the vanin proteins, however, they possess pantetheinase activity, which may play a role in oxidative-stress response. This protein, like its mouse homolog, is likely a GPI-anchored cell surface molecule. The mouse protein is expressed by the perivascular thymic stromal cells and regulates migration of T-cell progenitors to the thymus. This gene lies in close proximity to, and in same transcriptional orientation as two other vanin genes on chromosome 6q23-q24.
21334	
21335	
21336	
21337	
21338	
21339	
21340	
21341	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is localized in nuclear bodies of the nucleus. This protein is involved in the splicing process and it also participates in early heat shock-induced splicing arrest by transiently leaving the hnRNP complexes. Multiple alternative transcript variants seem to be present for this gene and some appear to have intronic regions in the mRNA. Presently, only two transcript variants are fully described.
21342	
21343	
21344	
21345	The protein encoded by this gene belongs to the ARL (ADP-ribosylation factor-like) family of proteins, which are structurally related to ADP-ribosylation factors (ARFs). ARFs, described as activators of cholera toxin (CT) ADP-ribosyltransferase activity, regulate intracellular vesicular membrane trafficking, and stimulate a phospholipase D (PLD) isoform. Although, ARL proteins were initially thought not to activate CT or PLD, later work showed that they are weak stimulators of PLD and CT in a phospholipid dependent manner.
21346	
21347	
21348	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which shows little similarity to any of the other known human DEAD box proteins, but shows a high similarity to mouse Ddx24 at the amino acid level.
21349	
21350	Mitochondrial creatine (MtCK) kinase is responsible for the transfer of high energy phosphate from mitochondria to the cytosolic carrier, creatine. It belongs to the creatine kinase isoenzyme family. It exists as two isoenzymes, sarcomeric MtCK and ubiquitous MtCK, encoded by separate genes. Mitochondrial creatine kinase occurs in two different oligomeric forms: dimers and octamers, in contrast to the exclusively dimeric cytosolic creatine kinase isoenzymes. Many malignant cancers with poor prognosis have shown overexpression of ubiquitous mitochondrial creatine kinase; this may be related to high energy turnover and failure to eliminate cancer cells via apoptosis. Ubiquitous mitochondrial creatine kinase has 80% homology with the coding exons of sarcomeric mitochondrial creatine kinase. Two genes located near each other on chromosome 15 have been identified which encode identical mitochondrial creatine kinase proteins.
21351	
21352	
21353	This gene encodes a cell-surface anchored serine protease, which is a member of the trypsin family of serine proteases. It is predicted to be active on peptide linkages involving the carboxyl group of lysine or arginine. The protein localizes to the cytoplasm and the plasma membrane of premeiotic testicular germ cells and it may be involved in progression of testicular tumors of germ cell origin. Alternative splicing of this gene results in three transcript variants encoding three different isoforms.
21354	
21355	
21356	
21357	
21358	
21359	This gene encodes an inwardly rectifying K+ channel which may be blocked by divalent cations. This protein is thought to be one of multiple inwardly rectifying channels which contribute to the cardiac inward rectifier current (IK1). The gene is located within the Smith-Magenis syndrome region on chromosome 17.
21360	
21361	
21362	
21363	The nuclear pore complex is a massive structure that extends across the nuclear envelope, forming a gateway that regulates the flow of macromolecules between the nucleus and the cytoplasm. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells. The protein encoded by this gene is a member of the FG-repeat containing nucleoporins and is localized to the nuclear pore central plug. This protein associates with the importin alpha/beta complex which is involved in the import of proteins containing nuclear localization signals. Multiple transcript variants of this gene encode a single protein isoform.
21364	This gene encodes a protein expressed in tumor cells during apoptosis independent of the apoptosis-inducing stimuli. Prior to apoptosis induction, this gene product is distributed in both the nucleus and cytoplasm. Once apoptosis is induced, the level of this protein increases and by relocation from the cytoplasm, it accumulates in the nucleus. Although its exact function is not defined, this protein is thought to play an early and universal role in apoptosis.
21365	
21366	
21367	
21368	
21369	The protein encoded by this gene is a member of the platelet-derived growth factor family. The four members of this family are mitogenic factors for cells of mesenchymal origin and are characterized by a motif of eight cysteines. This gene product can exist either as a homodimer or as a heterodimer with the platelet-derived growth factor alpha polypeptide, where the dimers are connected by disulfide bonds. Mutations in this gene are associated with meningioma. Reciprocal translocations between chromosomes 22 and 7, at sites where this gene and that for COL1A1 are located, are associated with a particular type of skin tumor called dermatofibrosarcoma protuberans resulting from unregulated expression of growth factor. Two splice variants have been identified for this gene.
21370	
21371	The protein encoded by this gene is a type I transmembrane protein that localizes to the endocytic pathway. This protein contains a RING zinc-finger motif and has been shown to possess E3 ubiquitin ligase activity. Expression of this gene in retrovirally transduced T cell hybridoma significantly inhibits activation-induced IL2 and IL4 cytokine production. Induced expression of this gene was observed in anergic CD4(+) T cells, which suggested a role in the induction of anergic phenotype. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
21372	
21373	
21374	
21375	The protein encoded by this gene belongs to the family of G-protein coupled receptors, that are preferentially activated by adenosine and uridine nucleotides. This gene is moderately expressed in undifferentiated HL60 cells, and is located on both chromosomes X and Y.
21376	
21377	The sperm-specific protein Izumo, named for a Japanese shrine dedicated to marriage, is essential for sperm-egg plasma membrane binding and fusion (Inoue et al., 2005).[supplied by OMIM]
21378	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities and are involved in a variety of biological processes including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. The mouse homolog of this gene was found to be preferentially expressed in the inner root sheath of the hair follicle, which suggested a role in hair development.
21379	
21380	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands.
21381	The protein encoded by this gene is highly similar to cytoplasmic polyadenylation element binding protein (CPEB), an mRNA-binding protein that regulates cytoplasmic polyadenylation of mRNA as a trans factor in oogenesis and spermatogenesis. Studies of the similar gene in mice suggested a possible role of this protein in transcriptionally inactive haploid spermatids. Alternatively spliced transcript variants encoding distinct isoforms have been described.
21382	P-type ATPases, such as ATP11B, are phosphorylated in their intermediate state and drive uphill transport of ions across membranes. Several subfamilies of P-type ATPases have been identified. One subfamily transports heavy metal ions, such as Cu(2+) or Cd(2+). Another subfamily transports non-heavy metal ions, such as H(+), Na(+), K(+), or Ca(+). A third subfamily transports amphipaths, such as phosphatidylserine.[supplied by OMIM]
21383	
21384	
21385	
21386	
21387	
21388	
21389	
21390	
21391	
21392	
21393	
21394	
21395	
21396	
21397	
21398	
21399	
21400	
21401	
21402	
21403	
21404	
21405	
21406	
21407	
21408	
21409	
21410	
21411	
21412	
21413	
21414	
21415	
21416	
21417	
21418	
21419	
21420	
21421	
21422	
21423	The product of this gene belongs to the phosphatase 2 regulatory subunit B family. Protein phosphatase 2 is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes a gamma isoform of the regulatory subunit B55 subfamily. Alternatively spliced transcript variants encoding different isoforms have been identified.
21424	
21425	Spire proteins, such as SPIRE1, are highly conserved between species. They belong to the family of Wiskott-Aldrich homology region-2 (WH2) proteins, which are involved in actin organization (Kerkhoff et al., 2001).[supplied by OMIM]
21426	
21427	
21428	
21429	Core binding factor (CBF) is a heterodimeric transcription factor that binds to the core element of many enhancers and promoters. The protein encoded by this gene represents the alpha subunit of CBF and is thought to be involved in the development of normal hematopoiesis. Chromosomal translocations involving this gene are well-documented and have been associated with several types of leukemia. Two transcript variants encoding different isoforms have been found for this gene.
21430	
21431	
21432	
21433	
21434	
21435	Type II iodothyronine deiodinase activates thyroid hormone by converting the prohormone thyroxine (T4) by outer ring deiodination (ORD) to bioactive 3,3@#$%&,5-triiodothyronine (T3). Thyroid hormone appears to have important regulatory effects in some mammalian tissues, such as the developing brain, the anterior pituitary gland, and brown adipose tissue. A relatively high proportion of the receptor-bound triiodothyronine is found within the tissue itself rather than in plasma. The expression in these tissues of type II iodothyronine deiodinase, suggests that deiodinase II is responsible for this @#$%&local@#$%& production of T3 and is thus important in influencing thyroid hormone action in these tissues. In addition, activity of this protein is markedly elevated in the hypothyroid state and appears to be responsible for catalyzing the production of a large proportion of the circulating T3 under such conditions. Dioiodinase 2 may contribute significantly to the relative increase in thyroidal T3 production in patients with Graves disease, toxic adenomas, and, perhaps, iodine deficiency. Alternative splicing results in multiple transcript variants encoding different isoforms.
21436	
21437	
21438	
21439	
21440	
21441	Selenoproteins, such as SELO, contain the rare twenty-first amino acid, selenocysteine (sec). These proteins lack common amino acid sequence motifs, but the 3-prime untranslated regions of selenoprotein genes have a common stem-loop structure, the sec insertion sequence (SECIS), that is necessary for the recognition of UGA as a sec codon rather than as a stop signal.[supplied by OMIM]
21442	
21443	
21444	This gene product is a serine/threonine protein kinase that contains an N-terminal protein kinase domain and a C-terminal proline-rich domain. Its protein kinase domain is most closely related to those of the LIM motif-containing protein kinases (LIMKs). The encoded protein can phosphorylate myelin basic protein and histone in vitro. The testicular germ cell-specific expression and developmental pattern of expression of the mouse gene suggests that this gene plays an important role at and after the meiotic phase of spermatogenesis.
21445	Activins are dimeric growth and differentiation factors which belong to the transforming growth factor-beta (TGF-beta) superfamily of structurally related signaling proteins. Activins signal through a heteromeric complex of receptor serine kinases which include at least two type I (I and IB) and two type II (II and IIB) receptors. These receptors are all transmembrane proteins, composed of a ligand-binding extracellular domain with a cysteine-rich region, a transmembrane domain, and a cytoplasmic domain with predicted serine/threonine specificity. Type I receptors are essential for signaling, and type II receptors are required for binding ligands and for expression of type I receptors. Type I and II receptors form a stable complex after ligand binding, resulting in phosphorylation of type I receptors by type II receptors. This gene encodes activin A type IB receptor, composed of 11 exons. Alternative splicing and alternative polyadenylation result in 3 fully described transcript variants. The mRNA expression of variants 1, 2, and 3 is confirmed, and a potential fourth variant contains an alternative exon 8 and lacks exons 9 through 11, but its mRNA expression has not been confirmed.
21446	
21447	The SIMP protein contains a highly immunogenic minor histocompatibility antigen epitope of 9 amino acids, B6(dom1). Like ITM1 (MIM 601134), SIMP is homologous to yeast STT3, an oligosaccharyltransferase essential for cell proliferation (McBride et al., 2002 [PubMed 12439619]).[supplied by OMIM]
21448	
21449	
21450	
21451	
21452	
21453	This gene encodes subunit 3 of the splicing factor 3a protein complex. The splicing factor 3a heterotrimer includes subunits 1, 2 and 3 and is necessary for the in vitro conversion of 15S U2 snRNP into an active 17S particle that performs pre-mRNA splicing. Subunit 3 interacts with subunit 1 through its amino-terminus while the zinc finger domain of subunit 3 plays a role in its binding to the 15S U2 snRNP. This gene has a pseudogene on chromosome 20.
21454	
21455	
21456	This gene encodes the GA-binding protein transcription factor, beta subunit. This protein forms a tetrameric complex with the alpha subunit, and stimulates transcription of target genes. The encoded protein may be involved in activation of cytochrome oxidase expression and nuclear control of mitochondrial function. The crystal structure of a similar protein in mouse has been resolved as a ternary protein complex. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
21457	This gene is a type II classical cadherin from the cadherin superfamily and one of three cadherin 7-like genes located in a cluster on chromosome 18. The encoded membrane protein is a calcium dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. Type II (atypical) cadherins are defined based on their lack of a HAV cell adhesion recognition sequence specific to type I cadherins. Since disturbance of intracellular adhesion is a prerequisite for invasion and metastasis of tumor cells, cadherins are considered prime candidates for tumor suppressor genes.
21458	The protein encoded by this gene is a member of the STAT family of transcription factors. In response to cytokines and growth factors, STAT family members are phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. This protein mediates the signal transduction triggered by various cell ligands, such as IL2, IL4, CSF1, and different growth hormones. It has been shown to be involved in diverse biological processes, such as TCR signaling, apoptosis, adult mammary gland development, and sexual dimorphism of liver gene expression. This gene was found to fuse to retinoic acid receptor-alpha (RARA) gene in a small subset of acute promyelocytic leukemias (APLL). The dysregulation of the signaling pathways mediated by this protein may be the cause of the APLL.
21459	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of aminophospholipid-transporting ATPases. The aminophospholipid translocases transport phosphatidylserine and phosphatidylethanolamine from one side of a bilayer to another. This gene encodes the member 3 of the phospholipid-transporting ATPase 8B.
21460	Calcium-permeable channels, such as TRPV5, participate in neurotransmission, muscle contraction, and exocytosis by providing calcium as an intracellular second messenger. Depending on the tissue, transcellular calcium transport may be regulated by vitamin D, parathyroid hormone (PTH; MIM 168450), or calcitonin (CALCA; MIM 114130).[supplied by OMIM]
21461	
21462	
21463	
21464	
21465	
21466	This gene encodes a cytosolic, NADP-dependent enzyme that generates NADPH for fatty acid biosynthesis. The activity of this enzyme, the reversible oxidative decarboxylation of malate, links the glycolytic and citric acid cycles. The regulation of expression for this gene is complex. Increased expression can result from elevated levels of thyroid hormones or by higher proportions of carbohydrates in the diet.
21467	
21468	
21469	
21470	
21471	This gene encodes a member of the adenylate kinase family, which is involved in regulating the adenine nucleotide composition within a cell by catalyzing the reversible transfer of phosphate groups among adenine nucleotides. This member is related to the UMP/CMP kinase of several species. It is located in the cytosol and expressed exclusively in brain. Alternatively spliced transcript variants encoding distinct isoforms have been identified for this gene.
21472	
21473	
21474	
21475	Dihydrofolate reductase converts dihydrofolate into tetrahydrofolate, a methyl group shuttle required for the de novo synthesis of purines, thymidylic acid, and certain amino acids.  While the functional dihydrofolate reductase gene has been mapped to chromosome 5, four intronless processed pseudogenes have been identified on separate chromosomes.  Dihydrofolate reductase deficiency has been linked to megaloblastic anemia.
21476	HLA-DRB1 belongs to the HLA class II beta chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DRA) and a beta chain (DRB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa. It is encoded by 6 exons, exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DR molecule the beta chain contains all the polymorphisms specifying the peptide binding specificities. Hundreds of DRB1 alleles have been described and typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. DRB1 is expressed at a level five times higher than its paralogues DRB3, DRB4 and DRB5. DRB1 is present in all individuals. Allelic variants of DRB1 are linked with either none or one of the genes DRB3, DRB4 and DRB5. There are 4 related pseudogenes: DRB2, DRB6, DRB7, DRB8 and DRB9.
21477	The protein encoded by this gene contains six repeated kelch motifs that are structurally similar to recombination activating gene 2 (RAG2), a protein involved in the activation of the V(D)J recombination. This gene is found to express specifically in testis. Its expression in pachytene spermatocytes is localized to cytoplasma and meiotic chromatin, which suggests that this gene may be involved in meiotic recombination.
21478	
21479	This gene encodes a protein that is one of the components of the Mediator coactivator complex. The Mediator complex is a multiprotein complex required for transcriptional activation by DNA binding transcription factors of genes transcribed by RNA polymerase II. The protein encoded by this gene is similar to cyclin-dependent kinase 8 which can also be a component of the Mediator complex.
21480	
21481	The protein encoded by this gene is thought to be a transcriptional corepressor. However, molecules that interact with this protein have not yet been identified. The protein is a member of the tripartite motif family. This motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. Three alternatively spliced transcript variants for this gene have been described, however, the full length nature of one variant has not been determined.
21482	The product of this gene belongs to the family of transient receptor potential (TRP) channels. TRP channels are cation-selective channels important for cellular calcium signaling and homeostasis. The protein encoded by this gene mediates calcium entry, and this entry is potentiated by calcium store depletion. Alternatively spliced transcript variants encoding different isoforms have been identified.
21483	This gene encodes a member of the neurotrophic tyrosine receptor kinase (NTRK) family. This kinase is a membrane-bound receptor that, upon neurotrophin binding, phosphorylates itself and members of the MAPK pathway. Signalling through this kinase leads to cell differentiation. Mutations in this gene have been associated with obesity and mood disorders. Alternate transcriptional splice variants encoding different isoforms have been found for this gene, but only two of them have been characterized to date.
21484	
21485	
21486	
21487	
21488	
21489	This gene encodes an enzyme included in the 2@#$%&, 5@#$%& oligoadenylate synthase family.  This enzyme is induced by interferons and catalyzes the 2@#$%&, 5@#$%& oligomers of adenosine in order to bind and activate RNase L.  This enzyme family plays a significant role in the inhibition of cellular protein synthesis and viral infection resistance.  Alternative splicing of this gene produces two isoforms which share a common 5@#$%& region but diverge in the 3@#$%& region.
21490	
21491	
21492	REC8L1 is the human homolog of yeast Rec8, a meiosis-specific phosphoprotein involved in recombination events (Parisi et al., 1999).[supplied by OMIM]
21493	This gene encodes one of the three subunits of the RNA polymerase III transcription factor complex. This complex plays a central role in transcription initiation by RNA polymerase III on genes encoding tRNA, 5S rRNA, and other small structural RNAs. The gene product belongs to the TF2B family. Three alternatively spliced variants encoding three different isoforms, that function at different promoters transcribed by RNA polymerase III, have been identified. A transcript encoding a fourth isoform has not yet been completely characterized.
21494	This gene encodes the integrin alpha X chain protein. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. This protein combines with the beta 2 chain (ITGB2) to form a leukocyte-specific integrin referred to as inactivated-C3b (iC3b) receptor 4 (CR4). The alpha X beta 2 complex seems to overlap the properties of the alpha M beta 2 integrin in the adherence of neutrophils and monocytes to stimulated endothelium cells, and in the phagocytosis of complement coated particles.
21495	
21496	
21497	The protein encoded by this gene is a calcium-dependent glycoprotein that is a member of the desmocollin subfamily of the cadherin superfamily. These desmosomal family members, along with the desmogleins, are found primarily in epithelial cells where they constitute the adhesive proteins of the desmosome cell-cell junction and are required for cell adhesion and desmosome formation. The desmosomal family members are arranged in two clusters on chromosome 18, occupying less than 650 kb combined. Alternative splicing results in two transcript variants encoding distinct isoforms.
21498	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. Its protein product is a protease present in seminal plasma. It is thought to function normally in the liquefaction of seminal coagulum, presumably by hydrolysis of the high molecular mass seminal vesicle protein. Serum level of this protein, called PSA in the clinical setting, is useful in the diagnosis and monitoring of prostatic carcinoma. Alternate splicing of this gene generates two transcript variants encoding different isoforms. Additional transcript variants have been described, but it is unclear if these transcripts are normally expressed or if they are specific to benign or malignant tumors.
21499	
21500	
21501	The protein encoded by this gene is an acyltransferase that converts lysophosphatidic acid into phosphatidic acid, which is the second step in the de novo phospholipid biosynthetic pathway. The encoded protein may be an integral membrane protein.
21502	
21503	
21504	
21505	
21506	NUDT11 belongs to a subgroup of phosphohydrolases that preferentially attack diphosphoinositol polyphosphates (Hidaka et al., 2002).[supplied by OMIM]
21507	
21508	
21509	
21510	
21511	This gene product belongs to the DYRK family of dual-specificity protein kinases that catalyze autophosphorylation on serine/threonine and tyrosine residues. The members of this family share structural similarity, however, differ in their substrate specificity, suggesting their involvement in different cellular functions. The encoded protein has been shown to autophosphorylate on tyrosine residue and catalyze phosphorylation of histones H3 and H2B in vitro. Alternatively spliced transcript variants encoding different isoforms have been identified.
21512	
21513	
21514	The Sec4/Rab-related small GTP-binding proteins are involved in the regulation of intracellular vesicular transport. Mss4 stimulates GTP-GDP exchange in Sec4 and Rab and binds to a subset of genetically related Rab proteins.[supplied by OMIM]
21515	
21516	The Golgi complex plays a key role in the sorting and modification of proteins exported from the endoplasmic reticulum. The protein encoded by this gene is localized at the Golgi stack and may have a regulatory role in Golgi trafficking.
21517	The protein encoded by this gene is similar to ING1, a tumor suppressor protein that can interact with TP53, inhibit cell growth, and induce apoptosis. This protein contains a PHD-finger, which is a common motif in proteins involved in chromatin remodeling. This protein can bind TP53 and EP300/p300, a component of the histone acetyl transferase complex, suggesting its involvement in TP53-dependent regulatory pathway.
21518	
21519	The protein encoded by this gene is the 80-kilodalton subunit of the Ku heterodimer protein which is also known as ATP-dependant DNA helicase II or DNA repair protein XRCC5. Ku is the DNA-binding component of the DNA-dependent protein kinase, and it functions together with the DNA ligase IV-XRCC4 complex in the repair of DNA double-strand break by non-homologous end joining and the completion of V(D)J recombination events.  This gene functionally complements Chinese hamster xrs-6, a mutant defective in DNA double-strand break repair and in ability to undergo V(D)J recombination. A rare microsatellite polymorphism in this gene is associated with cancer in patients of varying radiosensitivity.
21520	
21521	
21522	
21523	
21524	Slightly proximal to the Huntington disease locus, the human MSX1 gene is deleted in patients with Wolf-Hirschhorn syndrome. This gene is also called HOX7. The Msx family of vertebrate HOX genes was originally isolated by homology to the Drosophila msh (muscle segment homeo box) gene. This is a candidate gene for human cleft palate.
21525	
21526	
21527	The protein encoded by this gene catalyzes the deamination of dCMP to dUMP, the nucleotide substrate for thymidylate synthase. The encoded protein is allosterically activated by dCTP and inhibited by dTTP, and is found as a homohexamer. This protein uses zinc as a cofactor for its activity. Two transcript variants encoding different isoforms have been found for this gene.
21528	
21529	
21530	
21531	
21532	
21533	
21534	Butyrophilin is the major protein associated with fat droplets in the milk. This gene is a member of the BTN2 subfamily of genes, which encode proteins belonging to the butyrophilin protein family. The gene is located in a cluster on chromosome 6, consisting of seven genes belonging to the expanding B7/butyrophilin-like group, a subset of the immunoglobulin gene superfamily. The encoded protein is a type 1 receptor glycoprotein involved in lipid, fatty-acid and sterol metabolism. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some variants has not been determined.
21535	
21536	This gene encodes a ubiquitous nuclear protein that stimulates both cell proliferation and differentiation. It is a member of the MADS (MCM1, Agamous, Deficiens, and SRF) box superfamily of transcription factors. This protein binds to the serum response element (SRE) in the promoter region of target genes. This protein regulates the activity of many immediate-early genes, for example c-fos, and thereby participates in cell cycle regulation, apoptosis, cell growth, and cell differentiation. This gene is the downstream target of many pathways; for example, the mitogen-activated protein kinase pathway (MAPK) that acts through the ternary complex factors (TCFs).
21537	
21538	
21539	
21540	
21541	
21542	
21543	
21544	The product of this gene belongs to the phosphatase 2 regulatory subunit B family. Protein phosphatase 2 is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes an alpha isoform of the regulatory subunit B55 subfamily.
21545	
21546	
21547	HERC4 belongs to the HERC family of ubiquitin ligases, all of which contain a HECT domain and at least 1 RCC1 (MIM 179710)-like domain (RLD). The 350-amino acid HECT domain is predicted to catalyze the formation of a thioester with ubiquitin before transferring it to a substrate, and the RLD is predicted to act as a guanine nucleotide exchange factor for small G proteins (Hochrainer et al., 2005).[supplied by OMIM]
21548	
21549	
21550	
21551	
21552	The mitochondrial oxidation of long-chain fatty acids is initiated by the sequential action of carnitine palmitoyltransferase I (which is located in the outer membrane and is detergent-labile) and carnitine palmitoyltransferase II (which is located in the inner membrane and is detergent-stable), together with a carnitine-acylcarnitine translocase. CPT I is the key enzyme in the carnitine-dependent transport across the mitochondrial inner membrane and its deficiency results in a decreased rate of fatty acid beta-oxidation.
21553	
21554	FMFRamide-related protein precursor plays a role in the regulation of heart rate and blood pressure and the modulation of morphine-induced antinociception.  FMRFAL encodes a preproprotein which is cleaved to form two active peptides with similar function.
21555	
21556	This gene encodes a protein with protease activity. Its specific function has not been determined. The protein may be useful as a tumor marker.
21557	The Shaker gene family of Drosophila encodes components of voltage-gated potassium channels and is comprised of four subfamilies. Based on sequence similarity, this gene is similar to one of these subfamilies, namely the Shaw subfamily. The protein encoded by this gene belongs to the delayed rectifier class of channel proteins and is an integral membrane protein that mediates the voltage-dependent potassium ion permeability of excitable membranes. Three transcript variants encoding three different isoforms have been found for this gene.
21558	This gene encodes a protein which, with 5-lipoxygenase, is required for leukotriene synthesis. Leukotrienes are arachidonic acid metabolites which have been implicated in various types of inflammatory responses, including asthma, arthritis and psoriasis. This protein localizes to the plasma membrane. Inhibitors of its function impede translocation of 5-lipoxygenase from the cytoplasm to the cell membrane and inhibit 5-lipoxygenase activation.
21559	
21560	
21561	Functions of the cell cortex, including motility, adhesion, and cytokinesis, are mediated by the reorganization of the actin cytoskeleton and recent evidence suggests a role for the Formin homology (FH) proteins in these processes. The protein encoded by this gene contains FH domains and belongs to a novel FH protein subfamily implicated in cell polarity. Wnt/Fz signaling activates the small GTPase Rho, a key regulator of cytoskeleton architecture, to control cell polarity and movement during development. Activation requires Dvl-Rho complex formation, an assembly mediated by this gene product, which is thought to function as a scaffolding protein. Evidence of alternative splicing has been observed for this gene but the full-length nature of these variants has not been determined.
21562	Members of the peripheral membrane-associated guanylate kinase (MAGUK) family function in tumor suppression and receptor clustering by forming multiprotein complexes containing distinct sets of transmembrane, cytoskeletal, and cytoplasmic signaling proteins. All MAGUKs contain a PDZ-SH3-GUK core and are divided into 4 subfamilies, DLG-like (see DLG1; MIM 601014), ZO1-like (see TJP1; MIM 601009), p55-like (see MPP1; MIM 305360), and LIN2-like (see CASK; MIM 300172), based on their size and the presence of additional domains (Tseng et al., 2001). MPP5 is a member of the p55-like MAGUK subfamily.[supplied by OMIM]
21563	
21564	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the ATPase subunits, a member of the triple-A family of ATPases which have a chaperone-like activity. This subunit and a 20S core alpha subunit interact specifically with the hepatitis B virus X protein, a protein critical to viral replication. This subunit also interacts with the adenovirus E1A protein and this interaction alters the activity of the proteasome. Finally, this subunit interacts with ataxin-7, suggesting a role for the proteasome in the development of spinocerebellar ataxia type 7, a progressive neurodegenerative disorder.
21565	This gene encodes a protein related to talin 1, a cytoskeletal protein that plays a significant role in the assembly of actin filaments and in spreading and migration of various cell types, including fibroblasts and osteoclasts. This protein has a different pattern of expression compared to talin 1 but, like talin 1, is thought to associate with unique transmembrane receptors to form novel linkages between extracellular matrices and the actin cytoskeleton.
21566	
21567	This gene is a member of the BORIS + CTCF gene family and encodes a transcriptional regulator protein with 11 highly conserved zinc finger (ZF) domains. This nuclear protein is able to use different combinations of the ZF domains to bind different DNA target sequences and proteins. Depending upon the context of the site, the protein can bind a histone acetyltransferase (HAT)-containing complex and function as a transcriptional activator or bind a histone deacetylase (HDAC)-containing complex and function as a transcriptional repressor. If the protein is bound to a transcriptional insulator element, it can block communication between enhancers and upstream promoters, thereby regulating imprinted expression. Mutations in this gene have been associated with invasive breast cancers, prostate cancers, and Wilms@#$%& tumors.
21568	This gene encodes an integral membrane protein containing four transmembrane regions and a C-terminal cytoplasmic tail that is tyrosine phosphorylated. The exact function of this protein is unclear, but studies of a similar rat protein suggest that it may play a role in regulating membrane traffic in non-neuronal cells. The gene belongs to the synaptogyrin gene family.
21569	The leukotrienes constitute a group of arachidonic acid-derived compounds with biologic activities suggesting important roles in inflammation and immediate hypersensitivity. These compounds are catabolized, in part, by lipoxygenases. Epidermis-type lipoxygenases, a distinct subclass within the multigene family of mammalian lipoxygenases, are novel isoenzymes isolated from human and mouse skin including human ALOX15B (MIM 603697), human and mouse ALOX12B (MIM 603741), mouse 8S-LOX, and mouse e-LOX-3.[supplied by OMIM]
21570	
21571	This gene encodes a Ras-related protein that is stimulated by dexamethasone. The exact function of this gene is unknown, but it may play a role in dexamethasone-induced alterations in cell morphology, growth and cell-extracellular matrix interactions.  In addition, studies of a similar rat protein suggest that it functions as as a novel physiologic nitric oxide (NO) effector. The gene product belongs to the Ras superfamily of small GTPases.
21572	
21573	This gene encodes a mitochondrial NAD-dependent malic enzyme, a homotetrameric protein, that catalyzes the oxidative decarboxylation of malate to pyruvate. It had previously been weakly linked to a syndrome known as Friedreich ataxia that has since been shown to be the result of mutation in a completely different gene.
21574	The galectins are a family of beta-galactoside-binding proteins implicated in modulating cell-cell and cell-matrix interactions. Differential and in situ hybridizations indicate that this lectin is specifically expressed in keratinocytes. It is expressed at all stages of epidermal differentiation (i.e., in basal and suprabasal layers). It is moderately repressed by retinoic acid. The protein was found mainly in stratified squamous epithelium. The antigen localized to basal keratinocytes, although it was also found, albeit at lower levels, in the suprabasal layers where it concentrated to areas of cell-to-cell contact. The cellular localization and its striking down-regulation in cultured keratinocytes imply a role in cell-cell and/or cell-matrix interactions necessary for normal growth control.
21575	
21576	
21577	
21578	
21579	
21580	
21581	The human NDUFA1 gene codes for an essential component of complex I of the respiratory chain, which transfers electrons from NADH to ubiquinone. It has been noted that the N-terminal hydrophobic domain has the potential to be folded into an alpha-helix spanning the inner mitochondrial membrane with a C-terminal hydrophilic domain interacting with globular subunits of complex I. The highly conserved two-domain structure suggests that this feature is critical for the protein function and might act as an anchor for the NADH:ubiquinone oxidoreductase complex at the inner mitochondrial membrane. However, the NDUFA1 peptide is one of about 31 components of the @#$%!hydrophobic protein@#$%! (HP) fraction of complex I which is involved in proton translocation. Thus the NDUFA1 peptide may also participate in that function.
21582	
21583	The protein encoded by this gene contains an ATPase/helicase domain, and thus it belongs to the SWI/SNF family of chromatin remodeling proteins. The mutations of this gene are associated with an X-linked mental retardation (XLMR) syndrome most often accompanied by alpha-thalassemia (ATRX) syndrome. Theses mutations have been shown to cause diverse changes in the pattern of DNA methylation, which may provide a link between chromatin remodeling, DNA methylation, and gene expression in developmental processes. This protein is found to undergo cell cycle-dependent phosphorylation, which regulates its nuclear matrix and chromatin association, and suggests its involvement in the gene regulation at interphase and chromosomal segregation in mitosis. Three alternatively spliced transcript variants encoding distinct isoforms have been reported.
21584	Cytoplasmic ribosomes, organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein, which shares sequence similarity with yeast ribosomal protein L44, belongs to the L44E (L36AE) family of ribosomal proteins. Although this gene has been referred to as ribosomal protein L44 (RPL44), its official name is ribosomal protein L36a (RPL36A). This gene and the human gene officially named ribosomal protein L36a-like (RPL36AL) encode nearly identical proteins; however, they are distinct genes. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
21585	
21586	
21587	
21588	
21589	
21590	
21591	The nuclear envelope creates distinct nuclear and cytoplasmic compartments in eukaryotic cells. It consists of two concentric membranes perforated by nuclear pores, large protein complexes that form aqueous channels to regulate the flow of macromolecules between the nucleus and the cytoplasm. These complexes are composed of at least 100 different polypeptide subunits, many of which belong to the nucleoporin family. This gene encodes a member of the FG-repeat-containing nucleoporins. The protein encoded by this gene is an integral membrane protein that localizes to the central spoke ring complex and participates in anchoring the nuclear pore complex to the nuclear envelope. Alternatively spliced variants that encode different protein isoforms have been described but the full-length nature of only one has been determined.
21592	
21593	
21594	
21595	
21596	
21597	
21598	
21599	
21600	This gene is a member of the protocadherin gamma gene cluster, one of three related clusters tandemly linked on chromosome five. These gene clusters have an immunoglobulin-like organization, suggesting that a novel mechanism may be involved in their regulation and expression. The gamma gene cluster includes 22 genes divided into 3 subfamilies. Subfamily A contains 12 genes, subfamily B contains 7 genes and 2 pseudogenes, and the more distantly related subfamily C contains 3 genes. The tandem array of 22 large, variable region exons are followed by a constant region, containing 3 exons shared by all genes in the cluster. Each variable region exon encodes the extracellular region, which includes 6 cadherin ectodomains and a transmembrane region. The constant region exons encode the common cytoplasmic region. These neural cadherin-like cell adhesion proteins most likely play a critical role in the establishment and function of specific cell-cell connections in the brain. Alternative splicing has been described for the gamma cluster genes.
21601	
21602	
21603	
21604	
21605	
21606	
21607	
21608	
21609	Upstream binding factor (UBF) is a transcription factor required for expression of the 18S, 5.8S, and 28S ribosomal RNAs, along with SL1 (a complex of TBP (MIM 600075) and multiple TBP-associated factors or @#$%&TAFs@#$%&). Two UBF polypeptides, of 94 and 97 kD, exist in the human (Bell et al., 1988). UBF is a nucleolar phosphoprotein with both DNA binding and transactivation domains. Sequence-specific DNA binding to the core and upstream control elements of the human rRNA promoter is mediated through several HMG boxes (Jantzen et al., 1990).[supplied by OMIM]
21610	
21611	
21612	
21613	
21614	
21615	
21616	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes. They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis. Many of the effects of laminin are mediated through interactions with cell surface receptors. These receptors include members of the integrin family, as well as non-integrin laminin-binding proteins. This gene encodes a high-affinity, non-integrin family, laminin receptor 1. This receptor has been variously called 67 kD laminin receptor, 37 kD laminin receptor precursor (37LRP) and p40 ribosome-associated protein. The amino acid sequence of laminin receptor 1 is highly conserved through evolution, suggesting a key biological function. It has been observed that the level of the laminin receptor transcript is higher in colon carcinoma tissue and lung cancer cell line than their normal counterparts. Also, there is a correlation between the upregulation of this polypeptide in cancer cells and their invasive and metastatic phenotype. Multiple copies of this gene exist, however, most of them are pseudogenes thought to have arisen from retropositional events. Two alternatively spliced transcript variants encoding the same protein have been found for this gene.
21617	
21618	
21619	
21620	
21621	
21622	
21623	The protein encoded by this gene is a mitochondria protein highly similar to the peptide chain release factors (RFs) discovered in bacteria and yeast. The peptide chain release factors direct the termination of translation in response to the peptide chain termination codons. This protein contains the signature sequences of class I peptide chain release factors, suggesting its role in the termination of mitochondria protein synthesis. Multiple alternatively spliced transcript variants have been suggested by mRNA and EST data. Their full length natures, however, are not clear.
21624	
21625	
21626	
21627	
21628	
21629	
21630	
21631	
21632	
21633	
21634	
21635	
21636	
21637	
21638	
21639	
21640	
21641	
21642	
21643	
21644	
21645	
21646	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth, and invasion. This gene is located to a region associated with Borjeson-Forssman-Lehmann syndrome (BFLS), a syndromal X-linked mental retardation, which suggests it may be a candidate gene for familial cases of the BFL syndrome. The function of this gene has not yet been determined. Two alternatively spliced transcripts encoding different isoforms have been described for this gene.
21647	
21648	
21649	This gene encodes a bipartite protein with distinct amino- and carboxy-terminal domains. The amino-terminus contains nuclear localization signals and the carboxy-terminus contains numerous consecutive sequences with extensive similarity to proteins in the gelsolin family of actin-binding proteins, which cap, nucleate, and/or sever actin filaments. The gene product is tightly associated with both actin filaments and plasma membranes, suggesting a role as a high-affinity link between the actin cytoskeleton and the membrane. Its function may include recruitment of actin and other cytoskeletal proteins into specialized structures at the plasma membrane and in the nuclei of growing cells. Two transcript variants encoding different isoforms of supervillin have been described.
21650	
21651	
21652	
21653	
21654	
21655	Neurofascin is an L1 family immunoglobulin cell adhesion molecule (see L1CAM; MIM 308840) involved in axon subcellular targeting and synapse formation during neural development (Ango et al., 2004).[supplied by OMIM]
21656	
21657	
21658	
21659	
21660	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
21661	The protein encoded by this gene is a component of connective tissue microfibrils and be involved in elastic fiber assembly. Mutations in this gene are a cause of congenital contractural arachnodactyly.
21662	RAN is a small GTP-binding protein of the RAS superfamily that is associated with the nuclear membrane and is thought to control a variety of cellular functions through its interactions with other proteins. This gene encodes a very large RAN-binding protein that immunolocalizes to the nuclear pore complex. The protein is a giant scaffold and mosaic cyclophilin-related nucleoporin implicated in the Ran-GTPase cycle. The encoded protein directly interacts with the E2 enzyme UBC9 and strongly enhances SUMO1 transfer from UBC9 to the SUMO1 target SP100. These findings place sumoylation at the cytoplasmic filaments of the nuclear pore complex and suggest that, for some substrates, modification and nuclear import are linked events. This gene is partially duplicated in a gene cluster that lies in a hot spot for recombination on chromosome 2q.
21663	Deoxycytidine kinase (DCK) is required for the phosphorylation of several deoxyribonucleosides and their nucleoside analogs.  Deficiency of DCK is associated with resistance to antiviral and anticancer chemotherapeutic agents.  Conversely, increased deoxycytidine kinase activity is associated with increased activation of these compounds to cytotoxic nucleoside triphosphate derivatives.  DCK is clinically important because of its relationship to drug resistance and sensitivity.
21664	The Myc/Max/Mad network comprises a group of transcription factors that co-interact to regulate gene-specific transcriptional activation or repression. This gene encodes a protein member of the Myc/Max/Mad network. This protein has a basic-Helix-Loop-Helix-zipper domain (bHLHzip) with which it binds the canonical DNA sequence CANNTG, known as the E box, following heterodimerization with Max proteins. This protein is likely a transcriptional repressor and an antagonist of Myc-dependent transcriptional activation and cell growth. This protein represses transcription by binding to DNA binding proteins at its N-terminal Sin3-interaction domain.
21665	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
21666	
21667	
21668	
21669	
21670	
21671	
21672	
21673	
21674	
21675	
21676	
21677	
21678	TFCP2L3 is a member of a family of transcription factor genes whose archetype is TFCP2 (MIM 189889), a mammalian homolog of the Drosophila gene @#$%&grainyhead@#$%& (grh).[supplied by OMIM]
21679	
21680	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a protein identified as belonging to both the 28S and the 39S subunits. Further experiments will be needed to identify the specific subunit localization. Sequence analysis identified three transcript variants that encode two different isoforms. Pseudogenes corresponding to this gene are found on chromosomes 4q, 6p, 6q, 7p, and 15q.
21681	Galactosylceramidase (GALC) is a lysosomal enzyme that hydrolyzes several galactolipids. GALC deficiency is associated with Krabbe disease (globoid cell leukodystrophy). The gene is about 60 kb in length and consists of 17 exons. This gene contains ten GC-box-like sequences within the promoter region but no typical TATA box. This feature is also characteristic of other lysosomal protein encoding genes.
21682	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. The immunoproteasome contains an alternate regulator, referred to as the 11S regulator or PA28, that replaces the 19S regulator. Three subunits (alpha, beta and gamma) of the 11S regulator have been identified. This gene encodes the gamma subunit of the 11S regulator. Six gamma subunits combine to form a homohexameric ring. Two transcript variants encoding different isoforms have been identified.
21683	
21684	The protein encoded by this gene is a member of the signal-regulatory-protein (SIRP) family, and also belongs to the immunoglobulin superfamily. SIRP family members are receptor-type transmembrane glycoproteins known to be involved in the negative regulation of receptor tyrosine kinase-coupled signaling processes. The function of this protein has not yet been determined. Two alternatively spliced transcript variants of this gene, which encode distinct isoforms, have been reported.
21685	
21686	
21687	This gene encodes a member of the kelch family of proteins, which are characterized by kelch repeat motifs and a POZ/BTB protein-binding domain. It is thought that kelch repeats are actin binding domains. However, the specific function of this protein has not been determined. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
21688	There are at least four distinct but related alkaline phosphatases: intestinal, placental, placental-like, and liver/bone/kidney (tissue non-specific). The first three are located together on chromosome 2 while the tissue non-specific form is located on chromosome 1. The product of this gene is a membrane bound glycosylated enzyme that is not expressed in any particular tissue and is, therefore, referred to as the tissue-nonspecific form of the enzyme. The exact physiological function of the alkaline phosphatases is not known. A proposed function of this form of the enzyme is matrix mineralization, however, mice that lack a functional form of this enzyme show normal skeletal development. This enzyme has been linked directly to a disorder known as hypophosphatasia, a disorder that is characterized by hypercalcemia and includes skeletal defects. The character of this disorder can vary, however, depending on the specific mutation since this determines age of onset and severity of symptoms.
21689	
21690	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. The enzyme encoded by this gene synthesizes N-acetyllactosamine in glycolipids and glycoproteins. Its substrate specificity is affected by alpha-lactalbumin but it is not expressed in lactating mammary tissue. Two transcript variants encoding the same protein have been found for this gene.
21691	This gene encodes a hormone inducible transcriptional repressor. Repression of transcription by this gene product can occur through interactions with other repressors, by the recruitment of proteins involved in histone deacetylation, or through sequestration of transcriptional activators. The product of this gene contains a carboxy-terminal domain that permits binding to other corepressor proteins. This domain also permits interaction with members of the NuRD complex, a nucleosome remodeling protein complex that contains deacetylase activity. In addition, this repressor contains several RNA recognition motifs that confer binding to a steroid receptor RNA coactivator; this binding can modulate the activity of both liganded and nonliganded steroid receptors.
21692	
21693	This gene encodes polypeptide N-acetylgalactosaminyltransferase 2, a member of the GalNAc-transferases family. This family transfers an N-acetyl galactosamine to the hydroxyl group of a serine or threonine residue in the first step of O-linked oligosaccharide biosynthesis. Individual GalNAc-transferases have distinct activities and initiation of O-glycosylation in a cell is regulated by a repertoire of GalNAc-transferases.
21694	
21695	
21696	The HADHA and HADHB (MIM 143450) genes encode the alpha and beta subunits of the mitochondrial trifunctional protein, respectively. The heterocomplex contains 4 alpha and 4 beta subunits and catalyzes 3 steps in mitochondrial beta-oxidation of fatty acids, including the long-chain 3-hydroxyl-CoA dehydrogenase (LCHAD) step. The alpha subunit harbors the 3-hydroxyacyl-CoA dehydrogenase and enoyl-CoA hydratase activities (Kamijo et al., 1994).[supplied by OMIM]
21697	The protein encoded by this gene is member of the GAP1 family of GTPase-activating proteins. The gene product stimulates the GTPase activity of normal RAS p21 but not its oncogenic counterpart. Acting as a suppressor of RAS function, the protein enhances the weak intrinsic GTPase activity of RAS proteins resulting in the inactive GDP-bound form of RAS, thereby allowing control of cellular proliferation and differentiation. This particular family member has a perinuclear localization and is an inositol 1,3,4,5-tetrakisphosphate-binding protein; a compound suggested to function as a second messenger.
21698	
21699	
21700	
21701	
21702	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene encodes a replication-independent member of the histone H2A family that is distinct from other members of the family. Studies in mice have shown that this particular histone is required for embryonic development and indicate that lack of functional histone H2A leads to embryonic lethality.
21703	
21704	
21705	
21706	
21707	This gene encodes the protein core of a seminal plasma proteoglycan containing chondroitin- and heparan-sulfate chains. The protein@#$%&s function is unknown, although similarity to thyropin-type cysteine protease-inhibitors suggests its function may be related to protease inhibition.
21708	Ecto-5-prime-nucleotidase (5-prime-ribonucleotide phosphohydrolase; EC 3.1.3.5) catalyzes the conversion at neutral pH of purine 5-prime mononucleotides to nucleosides, the preferred substrate being AMP. The enzyme consists of a dimer of 2 identical 70-kD subunits bound by a glycosyl phosphatidyl inositol linkage to the external face of the plasma membrane. The enzyme is used as a marker of lymphocyte differentiation. Consequently, a deficiency of NT5 occurs in a variety of immunodeficiency diseases (e.g., see MIM 102700, MIM 300300). Other forms of 5-prime nucleotidase exist in the cytoplasm and lysosomes and can be distinguished from ecto-NT5 by their substrate affinities, requirement for divalent magnesium ion, activation by ATP, and inhibition by inorganic phosphate.[supplied by OMIM]
21709	
21710	PARD proteins, which were first identified in C. elegans, are essential for asymmetric cell division and polarized growth, whereas CDC42 (MIM 116952) mediates the establishment of cell polarity. The CDC42 GTPase, which is controlled by nucleotide exchange factors (GEFs; see MIM 606057) and GTPase-activating proteins (GAPs; see MIM 604980), interacts with a large set of effector proteins that typically contain a CDC42/RAC (MIM 602048)-interactive binding (CRIB) domain.[supplied by OMIM]
21711	HLA-DOA belongs to the HLA class II alpha chain paralogues. HLA-DOA forms a heterodimer with HLA-DOB. The heterodimer, HLA-DO, is found in lysosomes in B cells and regulates HLA-DM-mediated peptide loading on MHC class II molecules. In comparison with classical HLA class II molecules, this gene exhibits very little sequence variation, especially at the protein level.
21712	
21713	
21714	
21715	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
21716	
21717	
21718	
21719	The protein encoded by this gene belongs to the subtilisin-like proprotein convertase family. The members of this family are proprotein convertases that process latent precursor proteins into their biologically active products. This encoded protein mediates posttranslational endoproteolytic processing for several integrin alpha subunits. It is thought to process prorenin, pro-membrane type-1 matrix metalloproteinase and HIV-1 glycoprotein gp160. Two alternatively spliced transcripts are described for this gene but only one has its full length nature known.
21720	
21721	
21722	The protein encoded by this gene is a member of the bone morphogenetic protein (BMP) family and the TGF-beta superfamily. This group of proteins is characterized by a polybasic proteolytic processing site which is cleaved to produce a mature protein containing seven conserved cysteine residues. The members of this family are regulators of cell growth and differentiation in both embryonic and adult tissues. Studies in mice suggest that the protein encoded by this gene plays a role in skeletal morphogenesis.
21723	
21724	The epsilon globin gene (HBE) is normally expressed in the embryonic yolk sac: two epsilon chains together with two zeta chains (an alpha-like globin) constitute the embryonic hemoglobin Hb Gower I; two epsilon chains together with two alpha chains form the embryonic Hb Gower II. Both of these embryonic hemoglobins are normally supplanted by fetal, and later, adult hemoglobin. The five beta-like globin genes are found within a 45 kb cluster on chromosome 11 in the following order:  5@#$%&-epsilon - G-gamma - A-gamma - delta - beta-3@#$%&
21725	
21726	
21727	
21728	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities and are involved in a variety of biological processes including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This gene was identified by its oncogenic transforming activity. This gene and FGF3, another oncogenic growth factor, are located closely on chromosome 11. Co-amplification of both genes was found in various kinds of human tumors. Studies on the mouse homolog suggested a function in bone morphogenesis and limb development through the sonic hedgehog (SHH) signaling pathway.
21729	This gene encodes a member of the p34Cdc2 protein kinase family. p34Cdc2 kinase family members are known to be essential for eukaryotic cell cycle control. This gene is in close proximity to CDC2L1, a nearly identical gene in the same chromosomal region. The gene loci including this gene, CDC2L1, as well as metalloprotease MMP21/22, consist of two identical, tandemly linked genomic regions, which are thought to be a part of the larger region that has been duplicated. This gene and CDC2L1 were shown to be deleted or altered frequently in neuroblastoma with amplified MYCN genes. The protein kinase encoded by this gene could be cleaved by caspases and was demonstrated to play roles in cell apoptosis. A large number of the alternatively spliced variants of this gene have been reported.
21730	
21731	
21732	
21733	
21734	The translation initiation factor eIF2 catalyzes the first regulated step of protein synthesis initiation, promoting the binding of the initiator tRNA to 40S ribosomal subunits. Binding occurs as a ternary complex of methionyl-tRNA, eIF2, and GTP. eIF2 is composed of 3 nonidentical subunits, alpha (36 kD), beta (38 kD, MIM 603908), and gamma (52 kD, MIM 300161). The rate of formation of the ternary complex is modulated by the phosphorylation state of eIF2-alpha (Ernst et al., 1987).[supplied by OMIM]
21735	
21736	
21737	This gene is a member of the BORIS + CTCF gene family and encodes a transcriptional regulator protein with 11 highly conserved zinc finger (ZF) domains. This nuclear protein is able to use different combinations of the ZF domains to bind different DNA target sequences and proteins. Depending upon the context of the site, the protein can bind a histone acetyltransferase (HAT)-containing complex and function as a transcriptional activator or bind a histone deacetylase (HDAC)-containing complex and function as a transcriptional repressor. If the protein is bound to a transcriptional insulator element, it can block communication between enhancers and upstream promoters, thereby regulating imprinted expression. Mutations in this gene have been associated with invasive breast cancers, prostate cancers, and Wilms@#$%& tumors.
21738	
21739	
21740	
21741	PAFAH1B1 was identified as encoding a gene that when mutated or lost caused the lissencephaly associated with Miller-Dieker lissencephaly syndrome. PAFAH1B1 encodes the non-catalytic alpha subunit of the intracellular Ib isoform of platelet-activating factor acteylhydrolase, a heterotrimeric enzyme that specifically catalyzes the removal of the acetyl group at the SN-2 position of platelet-activating factor (identified as 1-O-alkyl-2-acetyl-sn-glyceryl-3-phosphorylcholine). Two other isoforms of intracellular platelet-activating factor acetylhydrolase exist: one composed of multiple subunits, the other, a single subunit. In addition, a single-subunit isoform of this enzyme is found in serum.
21742	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It encodes a protein which shows 97%, 85%, and 63% amino acid identity with mouse, chicken, and Xenopus Wnt11 protein, respectively. This gene may play roles in the development of skeleton, kidney and lung, and is considered to be a plausible candidate gene for High Bone Mass Syndrome.
21743	
21744	cAMP is a signaling molecule important for a variety of cellular functions. cAMP exerts its effects by activating the cAMP-dependent protein kinase (AMPK), which transduces the signal through phosphorylation of different target proteins. The inactive holoenzyme of AMPK is a tetramer composed of two regulatory and two catalytic subunits. cAMP causes the dissociation of the inactive holoenzyme into a dimer of regulatory subunits bound to four cAMP and two free monomeric catalytic subunits. Four different regulatory subunits and three catalytic subunits of AMPK have been identified in humans. The protein encoded by this gene is one of the regulatory subunits. This protein was found to be a tissue-specific extinguisher that down-regulates the expression of seven liver genes in hepatoma x fibroblast hybrids. Functional null mutations in this gene cause Carney complex (CNC), an autosomal dominant multiple neoplasia syndrome. This gene can fuse to the RET protooncogene by gene rearrangement and form the thyroid tumor-specific chimeric oncogene known as PTC2. Three alternatively spliced transcript variants encoding the same protein have been observed.
21745	Expression of this gene is induced by hexamethylene-bis-acetamide in vascular smooth muscle cells. The function of its protein product is not yet known.
21746	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L18E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
21747	
21748	Mammalian sodium-dicarboxylate cotransporters transport succinate and other Krebs cycle intermediates. They fall into 2 categories based on their substrate affinity: low affinity and high affinity. Both the low- and high-affinity transporters play an important role in the handling of citrate by the kidneys. The protein encoded by this gene represents the high-affinity form. Alternatively spliced transcript variants encoding different isoforms have been found for this gene, although the full-length nature of some of them have not been characterized yet.
21749	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This mitochondrial protein initiates the degradation of 1,25-dihydroxyvitamin D3, the physiologically active form of vitamin D3, by hydroxylation of the side chain. In regulating the level of vitamin D3, this enzyme plays a role in calcium homeostasis and the vitamin D endocrine system.
21750	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. The encoded protein shares 100% sequence identity with the mouse counterpart. This gene is ubiquitously expressed, with high expression seen in adult muscle. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
21751	
21752	This gene encodes a Ras-related protein that enriched in striatum. The product of this gene binds to GTP and possesses intrinsic GTPase activity. The gene belongs to the Ras superfamily of small GTPases. The exact function of this gene is unknown, but most striatum-specific mRNAs characterized to date encode components of signal transduction cascades.
21753	
21754	The protein encoded by this gene is a member of the S-adenosylmethionine-binding protein family. It is a nucleolar protein and may be involved in the processing and modification of rRNA. Three alternatively spliced transcript variants encoding different isoforms have been described for this gene.
21755	
21756	
21757	
21758	This gene encodes a protein containing an RNA-binding motif in the N-terminus and four SRGY (serine, arginine, glycine, tyrosine) boxes in the C-terminus. Multiple copies of this gene are found in the AZFb azoospermia factor region of chromosome Y and the encoded protein is thought to be involved in spermatogenesis. Most copies of this locus are pseudogenes, although six highly similar copies have full-length ORFs and are considered functional. Four functional copies of this gene are found within inverted repeat IR2; two functional copies of this gene are found in palindrome P3, along with two copies of PTPN13-like, Y-linked. Alternative splicing of transcripts results in two transcript variants that encode different proteins.
21759	
21760	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
21761	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Alternatively spliced transcript variants encoding distinct isoforms have been identified for this gene.
21762	This gene encodes a protein which contains a zinc finger-like PHD (plant homeodomain) finger, distinct from other classes of zinc finger motifs, and a hydrophobic and highly conserved domain. The PHD finger shows the typical Cys4-His-Cys3 arrangement. PHD finger genes are thought to belong to a diverse group of transcriptional regulators possibly affecting eukaryotic gene expression by influencing chromatin structure.
21763	
21764	This gene encodes the homolog of the Drosophila melanogaster Nipped-B gene product and fungal Scc2-type sister chromatid cohesion proteins. The Drosophila protein facilitates enhancer-promoter communication of remote enhancers and plays a role in developmental regulation. It is also homologous to a family of chromosomal adherins with broad roles in sister chromatid cohesion, chromosome condensation, and DNA repair. The human protein has a bipartite nuclear targeting sequence and a putative HEAT repeat. Condensins, cohesins and other complexes with chromosome-related functions also contain HEAT repeats. Mutations in this gene result in Cornelia de Lange syndrome.
21765	
21766	
21767	
21768	
21769	
21770	
21771	
21772	
21773	
21774	
21775	
21776	
21777	
21778	
21779	This gene encodes one of the DNA-unwinding enzymes which prefers partially unwound 3@#$%&-tailed substrates and can also unwind partial RNA/DNA and RNA/RNA duplexes in an ATP-dependent fashion. This enzyme is a member of the heterogeneous nuclear RNA-binding proteins and is also an element of the Ras signal transduction pathway. It binds specifically to the Ras-GTPase-activating protein by associating with its SH3 domain. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
21780	
21781	
21782	
21783	
21784	
21785	
21786	
21787	
21788	
21789	
21790	
21791	
21792	
21793	
21794	
21795	
21796	
21797	
21798	
21799	
21800	
21801	
21802	
21803	Agrin is a neuronal aggregating factor that induces the aggregation of acetylcholine receptors and other postsynaptic proteins on muscle fibers and is crucial for the formation of the neuromuscular junction.[supplied by OMIM]
21804	
21805	
21806	
21807	
21808	
21809	
21810	
21811	
21812	
21813	N-methyl-D-aspartate (NMDA) receptors are a class of ionotropic glutamate receptors.  NMDA channel has been shown to be involved in long-term potentiation, an activity-dependent increase in the efficiency of synaptic transmission thought to underlie certain kinds of memory and learning.  NMDA receptor channels are heteromers composed of the key receptor subunit NMDAR1 (GRIN1) and 1 or more of the 4 NMDAR2 subunits: NMDAR2A (GRIN2A), NMDAR2B (GRIN2B), NMDAR2C (GRIN2C), and NMDAR2D (GRIN2D).
21814	
21815	AATK is a novel gene which is induced during apoptosis.  It encodes a protein with a tyrosine kinase domain at the N-terminal end and a proline-rich domain at the C-terminal end.  AATK expression may be a necessary pre-requisite for the induction of growth arrest and/or apoptosis of myeloid precursor cells
21816	
21817	This gene is a member of the cadherin superfamily of calcium-dependent cell-cell adhesion molecules. The encoded protein has a signal peptide, six cadherin repeat domains and a unique cytoplasmic region. This non-classical cadherin appears to be exclusively expressed in the mitral and tufted cells in the main and accessory olfactory bulbs of the brain, suggesting a possible role in the formation and maintenance of neuronal networks.
21818	
21819	
21820	Membrane mucins, such as MUC17, function in epithelial cells to provide cytoprotection, maintain luminal structure, provide signal transduction, and confer antiadhesive properties upon cancer cells that lose their apical/basal polarization.[supplied by OMIM]
21821	
21822	
21823	
21824	
21825	The protein encoded by this gene is a glycoprotein and a member of the NADPH oxidase family. The synthesis of thyroid hormone is catalyzed by a protein complex located at the apical membrane of thyroid follicular cells. This complex contains an iodide transporter, thyroperoxidase, and a peroxide generating system that includes this encoded protein and DUOX1. This protein is known as dual oxidase because it has both a peroxidase homology domain and a gp91phox domain.
21826	This gene encodes a cytoplasmic protein tyrosine kinase which is involved in calcium-induced regulation of ion channels and activation of the map kinase signaling pathway. The encoded protein may represent an important signaling intermediate between neuropeptide-activated receptors or neurotransmitters that increase calcium flux and the downstream signals that regulate neuronal activity. The encoded protein undergoes rapid tyrosine phosphorylation and activation in response to increases in the intracellular calcium concentration, nicotinic acetylcholine receptor activation, membrane depolarization, or protein kinase C activation. This protein has been shown to bind CRK-associated substrate, nephrocystin, GTPase regulator associated with FAK, and the SH2 domain of GRB2. The encoded protein is a member of the FAK subfamily of protein tyrosine kinases but lacks significant sequence similarity to kinases from other subfamilies. Four transcript variants encoding two different isoforms have been found for this gene.
21827	
21828	
21829	CAPZB is a member of the F-actin capping protein family. This gene encodes the beta subunit of the barbed-end actin binding protein.  The protein regulates growth of the actin filament by capping the barbed end of growing actin filaments.
21830	
21831	
21832	
21833	
21834	
21835	This gene is involved in the first step in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is a glycolipid found on many blood cells and serves to anchor proteins to the cell surface. This gene encodes a N-acetylglucosaminyl transferase component that is part of the complex that catalyzes transfer of N-acetylglucosamine (GlcNAc) from UDP-GlcNAc to phosphatidylinositol (PI).
21836	
21837	
21838	AKT2 is a putative oncogene encoding a protein belonging to a subfamily of serine/threonine kinases containing SH2-like (Src homology 2-like) domains. Furthermore, AKT2 was shown to be amplified and overexpressed in 2 of 8 ovarian carcinoma cell lines and 2 of 15 primary ovarian tumors. Overexpression of AKT2 contributes to the malignant phenotype of a subset of human ductal pancreatic cancers. AKT2 is a general protein kinase capable of phophorylating several known proteins.
21839	Calcineurin plays an important role in the T-cell receptor-mediated signal transduction pathway. The protein encoded by this gene binds specifically to the activated form of calcineurin and inhibits calcineurin-mediated signal transduction. The encoded protein is found in the nucleus and contains a leucine zipper domain as well as several PEST motifs, sequences which confer targeted degradation to those proteins which contain them. At least four alternatively spliced transcripts have been found for this gene, but the full-length nature of most of them has not been determined.
21840	GM2 and GD2 gangliosides are sialic acid-containing glycosphingolipids. GalNAc-T is the enzyme involved in the biosynthesis of G(M2) and G(D2) glycosphingolipids. GalNAc-T catalyzes the transfer of GalNAc into G(M3) and G(D3) by a beta-1,4 linkage, resulting in the synthesis of G(M2) and G(D2), respectively.
21841	
21842	
21843	
21844	
21845	
21846	
21847	
21848	
21849	Netrin G1 (NTNG1) belongs to a conserved family of proteins that act as axon guidance cues during vertebrate nervous system development (Nakashiba et al., 2000).[supplied by OMIM]
21850	
21851	This gene encodes a member of the bicoid sub-family of homeodomain-containing transcription factors. The encoded protein acts as a transcription factor and may play a role in brain and sensory organ development. A similar protein in mice is required for proper brain and sensory organ development and can cause epilepsy.
21852	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to class II of the histone deacetylase/acuc/apha family. It possesses histone deacetylase activity and represses transcription when tethered to a promoter. This protein does not bind DNA directly, but through transcription factors MEF2C and MEF2D. It seems to interact in a multiprotein complex with RbAp48 and HDAC3.
21853	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to class II of the histone deacetylase/acuc/apha family. It possesses histone deacetylase activity and represses transcription when tethered to a promoter. This protein does not bind DNA directly, but through transcription factors MEF2C and MEF2D. It seems to interact in a multiprotein complex with RbAp48 and HDAC3.
21854	
21855	Patients with Chediak-Higashi syndrome (CHS1; MIM 214500) suffer from a systemic immunodeficiency involving defects in polarized trafficking of vesicles in a number of immune system cell types. In mouse, this syndrome is reproduced in strains with a mutation in the @#$%&beige@#$%& gene that results in proteins lacking the BEACH (beige and CHS1) domain and C-terminal WD repeats. LRBA contains key features of both beige/CHS1 and A kinase anchor proteins (AKAPs; see MIM 602449).[supplied by OMIM]
21856	
21857	This ribonuclease gene is a novel member of the Rh/T2/S-glycoprotein class of extracellular ribonucleases. It is a single copy gene that maps to 6q27, a region associated with human malignancies and chromosomal rearrangement.
21858	
21859	
21860	
21861	This gene lies adjacent to the tumor suppressor gene CDKN2A in a region that is frequently mutated and deleted in a wide variety of tumors. This gene encodes a cyclin-dependent kinase inhibitor, which forms a complex with CDK4 or CDK6, and prevents the activation of the CDK kinases, thus the encoded protein functions as a cell growth regulator that controls cell cycle G1 progression. The expression of this gene was found to be dramatically induced by TGF beta, which suggested its role in the TGF beta induced growth inhibition. Two alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported.
21862	
21863	This gene encodes a bZIP transcription factor that binds to the cAMP responsive element found in many viral and cellular promoters. It is an important component of cAMP-mediated signal transduction during the spermatogenetic cycle, as well as other complex processes. Alternative promoter and translation initiation site usage allows this gene to exert spatial and temporal specificity to cAMP responsiveness. Multiple alternatively spliced transcript variants encoding several different isoforms have been found for this gene, with some of them functioning as activators and some as repressors of transcription.
21864	
21865	
21866	
21867	
21868	
21869	
21870	
21871	
21872	
21873	
21874	
21875	
21876	This gene encodes the enzyme responsible for RNA editing by site-specific deamination of adenosines.  This enzyme destabilizes double stranded RNA through conversion of adenosine to inosine.  Alternative splicing of this gene results in several transcript variants, three which are characterized by deletion of exons 6 and/or 7.
21877	Vesicle mediated protein sorting plays an important role in segregation of intracellular molecules into distinct organelles. Genetic studies in yeast have identified more than 40 vacuolar protein sorting (VPS) genes involved in vesicle transport to vacuoles. This gene is a member of the Sec1 domain family, and shows a high degree of sequence similarity to mouse, rat and yeast Vps45. The exact function of this gene is not known, but its high expression in peripheral blood mononuclear cells suggests a role in trafficking proteins, including inflammatory mediators.
21878	The human MDM4 gene, which plays a role in apoptosis, encodes a 490-amino acid protein containing a RING finger domain and a putative nuclear localization signal. The MDM4 putative nuclear localization signal, which all Mdm proteins contain, is located in the C-terminal region of the protein. The mRNA is expressed at a high level in thymus and at lower levels in all other tissues tested. MDM4 protein produced by in vitro translation interacts with p53 via a binding domain located in the N-terminal region of the MDM4 protein. MDM4 shows significant structural similarity to p53-binding protein MDM2
21879	This gene encodes a multidomain protein containing an N-terminal alpha-helical region with a coiled-coil motif, followed by a pleckstrin homology (PH) domain, an Arf-GAP domain, an ankyrin homology region, a proline-rich region, and a C-terminal Src homology 3 (SH3) domain. The protein localizes in the Golgi apparatus and at the plasma membrane, where it colocalizes with protein tyrosine kinase 2-beta (PYK2). The encoded protein forms a stable complex with PYK2 in vivo. This interaction appears to be mediated by binding of its SH3 domain to the C-terminal proline-rich domain of PYK2. The encoded protein is tyrosine phosphorylated by activated PYK2. In vitro it shows strong GTPase-activating protein (GAP) activity towards the small GTPases ADP-ribosylation factor (ARF) 1 and ARF5 and weak activity towards ARF6. The encoded protein is believed to function as an ARF GAP that controls ARF-mediated vesicle budding when recruited to Golgi membranes. In addition, it functions as a substrate and downstream target for PYK2 and SRC, a pathway that may be involved in the regulation of vesicular transport.
21880	
21881	This gene is a member of the paired box (PAX) family of transcription factors. Members of the PAX family typically contain a paired box domain and a paired-type homeodomain. These genes play critical roles during fetal development. Mutations in paired box gene 3 are associated with Waardenburg syndrome, craniofacial-deafness-hand syndrome, and alveolar rhabdomyosarcoma. The translocation t(2;13)(q35;q14), which represents a fusion between PAX3 and the forkhead gene, is a frequent finding in alveolar rhabdomyosarcoma. Alternative splicing results in transcripts encoding isoforms with different C-termini.
21882	The protein encoded by this gene is a member of the thioredoxin family. The exact function of this protein is not known but it is associated with cell differentiation.
21883	
21884	This gene belongs to subfamily P of the forkhead box (FOX) transcription factor family. Forkhead box transcription factors play important roles in the regulation of tissue- and cell type-specific gene transcription during both development and adulthood. Forkhead box P1 protein contains both DNA-binding- and protein-protein binding-domains. This gene may act as a tumor suppressor as it is lost in several tumor types and maps to a chromosomal region (3p14.1) reported to contain a tumor suppressor gene(s). Alternative splicing results in multiple transcript variants encoding different isoforms.
21885	
21886	
21887	
21888	
21889	
21890	
21891	
21892	
21893	The human glucocorticoid receptor DNA binding factor, which associates with the promoter region of the glucocorticoid receptor gene (hGR gene), is a repressor of glucocorticoid receptor transcription. The amino acid sequence deduced from the cDNA sequences show the presence of three sequence motifs characteristic of a zinc finger and one motif suggestive of a leucine zipper in which 1 cysteine is found instead of all leucines. The GRLF1 enhances the homologous down-regulation of wild-type hGR gene expression. Biochemical analysis suggests that GRLF1 interaction is sequence specific and that transcriptional efficacy of GRLF1 is regulated through its interaction with specific sequence motif. The level of expression is regulated by glucocorticoids. Two transcript variants encoding different isoforms have been found for this gene.
21894	
21895	
21896	
21897	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. Multiple alternatively spliced transcript variants that encode different protein isoforms have been described for this gene.
21898	This gene encodes a protein that is one of two cannabinoid receptors. The cannabinoids, principally delta-9-tetrahydrocannabinol and synthetic analogs, are psychoactive ingredients of marijuana. The cannabinoid receptors are members of the family of guanine-nucleotide-binding protein (G-protein) coupled receptors which inhibit adenylate cyclase activity in a dose-dependent, stereoselective and pertussis toxin-sensitive manner. The two receptors have been found to be involved in the cannabinoid-induced CNS effects (including alterations in mood and cognition) experienced by users of marijuana. Two transcript variants encoding different isoforms have been described for this gene.
21899	This gene encodes a specific inhibitor of protein phosphatase-1 (PP1) with a differential sensitivity toward the metal-independent and metal-dependent forms of PP1. The gene is located within the major histocompatibility complex class I region on chromosome 6. Alternative splicing results in two transcript variants encoding different isoforms. Other alternatively spliced transcripts have been described, but their full length sequences have not been determined.
21900	The HEY2 gene encodes the homolog of the zebrafish @#$%&gridlock@#$%& gene, which is expressed in lateral plate mesoderm before vessel formation and is involved in the arterial-venous cell fate decision. For background information on hairy/enhancer of split-related proteins, see HEY1 (MIM 602953).[supplied by OMIM]
21901	
21902	
21903	
21904	
21905	
21906	
21907	
21908	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator base.
21909	Fibrillar collagens, such as COL27A1, compose one of the most ancient families of extracellular matrix molecules. They form major structural elements in extracellular matrices of cartilage, skin, and tendon (Boot-Handford et al., 2003).[supplied by OMIM]
21910	
21911	
21912	
21913	
21914	
21915	
21916	
21917	
21918	
21919	
21920	
21921	This gene encodes a member of the natural killer cell receptor C-type lectin family. The encoded protein inhibits osteoclast formation and contains a transmembrane domain near the N-terminus as well as the C-type lectin-like extracellular domain. Several alternatively spliced transcript variants have been identified, but the full-length nature of every transcript has not been defined.
21922	The protein encoded by this gene is a member of the TNF-receptor superfamily. This protein is one of the major receptors for the tumor necrosis factor-alpha. This receptor can activate NF-kappaB, mediate apoptosis, and function as a regulator of inflammation. Antiapoptotic protein BCL2-associated athanogene 4 (BAG4/SODD) and adaptor proteins TRADD and TRAF2 have been shown to interact with this receptor, and thus play regulatory roles in the signal transduction mediated by the receptor. Germline mutations of the extracellular domains of this receptor were found to be associated with the autosomal dominant periodic fever syndrome. The impaired receptor clearance is thought to be a mechanism of the disease.
21923	The MMAB gene encodes cob(I)alamin adenosyltransferase (EC 2.5.1.17), which catalyzes the final step in the synthesis of the coenzyme adenosylcobalamin (AdoCbl). AdoCbl is a vitamin B12-containing coenzyme for methylmalonyl-CoA mutase (MIM 609058).[supplied by OMIM]
21924	Myosin phosphatase target subunit 1, which is also called the myosin-binding subunit of myosin phosphatase, is one of the subunits of myosin phosphatase. Myosin phosphatase regulates the interaction of actin and myosin downstream of the guanosine triphosphatase Rho. The small guanosine triphosphatase Rho is implicated in myosin light chain (MLC) phosphorylation, which results in contraction of smooth muscle and interaction of actin and myosin in nonmuscle cells. The guanosine triphosphate (GTP)-bound, active form of RhoA (GTP.RhoA) specifically interacted with the myosin-binding subunit (MBS) of myosin phosphatase, which regulates the extent of phosphorylation of MLC. Rho-associated kinase (Rho-kinase), which is activated by GTP. RhoA, phosphorylated MBS and consequently inactivated myosin phosphatase. Overexpression of RhoA or activated RhoA in NIH 3T3 cells increased phosphorylation of MBS and MLC. Thus, Rho appears to inhibit myosin phosphatase through the action of Rho-kinase.
21925	This gene encodes a protein with an N-terminal RCC1 domain and a C-terminal BTB (broad complex, tramtrack and bric-a-brac) domain. In rat, over-expression of this gene in vascular smooth muscle cells induced cellular hypertrophy. In rat, the C-terminus of RCBTB1 interacts with the angiotensin II receptor-1A. In humans, this gene maps to a region of chromosome 13q that is frequently deleted in B-cell chronic lymphocytic leukemia and other lymphoid malignancies.
21926	
21927	The protein encoded by this gene is a secreted, extracellular matrix protein containing an Arg-Gly-Asp (RGD) motif and calcium-binding EGF-like domains. It promotes adhesion of endothelial cells through interaction of integrins and the RGD motif. It is prominently expressed in developing arteries but less so in adult vessels. However, its expression is reinduced in balloon-injured vessels and atherosclerotic lesions, notably in intimal vascular smooth muscle cells and endothelial cells. Therefore, the protein encoded by this gene may play a role in vascular development and remodeling.
21928	
21929	
21930	NHE3 (SLC9A3; MIM 182307) is a sodium/hydrogen exchanger in the brush border membrane of the proximal tubule, small intestine, and colon that plays a major role in transepithelial sodium absorption. SLC9A3R2, as well as SLC9A3R1 (MIM 604990) and protein kinase A phosphorylation, may play a role in NHE3 regulation.[supplied by OMIM]
21931	C2H2 zinc finger proteins, such as ZNF213, have bipartite structures in which one domain binds DNA or RNA and the other modulates target gene expression.[supplied by OMIM]
21932	
21933	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this DEAD box protein family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a member of this family. The function of this member has not been determined.
21934	
21935	Extracellular signals are transduced into cells through mitogen-activated protein kinases. The structural organization of these kinases into specific signaling domains is facilitated by scaffolding proteins involved in closely tethering different kinases so that successive phosphorylation events can occur. The protein encoded by this gene is a scaffolding protein that brings together mitogen-activated protein kinases and their transcription factor targets for the activation of specific signaling pathways. This gene which is abundantly expressed in testicular haploid germ cells encodes a protein that is recognized by sperm-agglutinating antibodies and implicated in infertility. Alternatively spliced variants that encode different protein isoforms have been described but the full-length sequences of only two have been determined.
21936	
21937	
21938	
21939	
21940	
21941	
21942	
21943	The protein encoded by this gene is a nuclear transcription factor involved in the activation of the solute carrier family 2 member 4 gene. The encoded protein interacts with another transcription factor, myocyte enhancer factor 2, to activate transcription of this gene.
21944	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. The function of the enzyme encoded by this gene is not clear. This gene was previously designated as B4GALT4 but was renamed to B4GALT5. In the literature it is also referred to as beta4GalT2.
21945	The protein encoded by this gene is a member of the GDNF receptor family. It is a glycosylphosphatidylinositol(GPI)-linked cell surface receptor for persephin, and mediates activation of the RET tyrosine kinase receptor. This gene is a candidate gene for RET-associated diseases. Three alternatively spliced transcript variants encoding different isoforms have been described for this gene.
21946	
21947	
21948	This gene encodes a member of the forkhead class of DNA-binding proteins. These hepatocyte nuclear factors are transcriptional activators for liver-specific transcripts such as albumin and transthyretin, and they also interact with chromatin. Similar family members in mice have roles in the regulation of metabolism and in the differentiation of the pancreas and liver. This gene has been linked to sporadic cases of maturity-onset diabetes of the young. Two transcript variants encoding the same protein have been identified for this gene.
21949	The protein encoded by this gene is a protein-tyrosine kinase that is predominantly expressed in hemopoietic cell types. The encoded protein may help couple the Fc receptor to the activation of the respiratory burst. In addition, it may play a role in neutrophil migration and in the degranulation of neutrophils. Alternate translation initiation site usage, including a non-AUG (CUG) codon, results in the production of two different isoforms, that have different subcellular localization.
21950	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S6P family. Pseudogenes corresponding to this gene are found on chromosomes 1p and 12q.
21951	
21952	SLC6A14 is a member of the Na(+)- and Cl(-)-dependent neurotransmitter transporter family and transports both neutral and cationic amino acids in an Na(+)- and Cl(-)-dependent manner.[supplied by OMIM]
21953	
21954	Cytochrome b (-245) is composed of cytochrome b alpha (CYBA) and beta (CYBB) chain. It has been proposed as a primary component of the microbicidal oxidase system of phagocytes. CYBB deficiency is one of five described biochemical defects associated with chronic granulomatous disease (CGD). In this disorder, there is decreased activity of phagocyte NADPH oxidase; neutrophils are able to phagocytize bacteria but cannot kill them in the phagocytic vacuoles. The cause of the killing defect is an inability to increase the cell@#$%&s respiration and consequent failure to deliver activated oxygen into the phagocytic vacuole.
21955	This gene encodes a member of the ephrin family. The encoded protein is a type I membrane protein and a ligand of Eph-related receptor tyrosine kinases. It may play a role in cell adhesion and function in the development or maintenance of the nervous system.
21956	
21957	
21958	The protein encoded by this gene contains a THAP domain, which is a conserved DNA-binding domain that has striking similarity to the site-specific DNA-binding domain (DBD) of Drosophila P element transposases.
21959	
21960	
21961	
21962	
21963	
21964	
21965	KIF21A belongs to a family of plus end-directed kinesin (see MIM 600025) motor proteins. Neurons use kinesin and dynein (see MIM 600112) microtubule-dependent motor proteins to transport essential cellular components along axonal and dendritic microtubules.[supplied by OMIM]
21966	
21967	
21968	
21969	
21970	
21971	
21972	
21973	This gene encodes an integral membrane protein containing four transmembrane regions and a C-terminal cytoplasmic tail that is tyrosine phosphorylated. The exact function of this protein is unclear, but studies of a similar rat protein suggest that it may play a role in regulating membrane traffic in non-neuronal cells. The gene belongs to the synaptogyrin gene family.
21974	
21975	
21976	
21977	
21978	This gene encodes the integrin alpha X chain protein. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. This protein combines with the beta 2 chain (ITGB2) to form a leukocyte-specific integrin referred to as inactivated-C3b (iC3b) receptor 4 (CR4). The alpha X beta 2 complex seems to overlap the properties of the alpha M beta 2 integrin in the adherence of neutrophils and monocytes to stimulated endothelium cells, and in the phagocytosis of complement coated particles.
21979	
21980	Lymphoid nuclear protein related to AF4 (LAF4) is related to MLLT2 which is also referred to as AF4. LAF4 is a tissue-restricted nuclear transcriptional activator that is preferentially expressed in lymphoid tissue.  Isolation of LAF4 has defined a highly conserved LAF4/MLLT2 gene family of nuclear transcription factors that may function in lymphoid development and oncogenesis.
21981	
21982	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of aminophospholipid-transporting ATPases. The aminophospholipid translocases transport phosphatidylserine and phosphatidylethanolamine from one side of a bilayer to another. Alternatively spliced transcript variants encoding different isoforms have been identified.
21983	
21984	
21985	
21986	
21987	
21988	
21989	
21990	
21991	
21992	
21993	
21994	
21995	This gene is a member of the Snail family of C2H2-type zinc finger transcription factors. It codes for a neural-specific transcriptional repressor that binds to E-box motifs. Scrt1 may promote neural differention and may be involved in cancers with neuroendocrine features.
21996	
21997	
21998	
21999	
22000	
22001	
22002	
22003	
22004	
22005	
22006	
22007	Receptor tyrosine kinases (RTKs) play a key role in the communication of cells with their microenvironment. These molecules are involved in the regulation of cell growth, differentiation and metabolism. The protein encoded by this gene is a RTK that is widely expressed in normal and transformed epithelial cells and is activated by various types of collagen. This protein belongs to a subfamily of tyrosine kinase receptors with a homology region to the Dictyostelium discoideum protein discoidin I in their extracellular domain. Its autophosphorylation is achieved by all collagens so far tested (type I to type VI). In situ studies and Northern-blot analysis showed that expression of this encoded protein is restricted to epithelial cells, particularly in the kidney, lung, gastrointestinal tract, and brain. In addition, this protein is significantly over-expressed in several human tumors from breast, ovarian, esophageal, and pediatric brain. This gene is located on chromosome 6p21.3 in proximity to several HLA class I genes. Three isoforms of this gene are generated by alternative splicing. [PROW]
22008	
22009	
22010	
22011	
22012	
22013	
22014	The gamma-aminobutyric acid (GABA) A receptor is a multisubunit chloride channel that mediates the fastest inhibitory synaptic transmission in the central nervous system. This gene encodes GABA A receptor, theta subunit. It is mapped to chromosome Xq28 in a cluster including the genes encoding the alpha 3 and epsilon subunits of the same receptor. This gene location is also the candidate region of 2 different neurologic diseases: early-onset parkinsonism (Waisman syndrome) and X-linked mental retardation (MRX3).
22015	
22016	
22017	
22018	
22019	The SMADIP1 gene (also known as SIP1) is a member of the delta-EF1 (TCF8; MIM 189909)/Zfh1 family of 2-handed zinc finger/homeodomain proteins. SMADIP1 interacts with receptor-mediated, activated full-length SMADs (see MIM 605568).[supplied by OMIM]
22020	
22021	The protein encoded by this gene is a receptor for C-C type chemokines. It belongs to family 1 of the G protein-coupled receptors. This receptor binds and responds to a variety of chemokines, including eotaxin (CCL11), eotaxin-3 (CCL26), MCP-3 (CCL7), MCP-4 (CCL13), and RANTES (CCL5). It is highly expressed in eosinophils and basophils, and is also detected in TH1 and TH2 cells, as well as in airway epithelial cells. This receptor may contribute to the accumulation and activation of eosinophils and other inflammatory cells in the allergic airway. It is also known to be an entry co-receptor for HIV-1. This gene and seven other chemokine receptor genes form a chemokine receptor gene cluster on the chromosomal region 3p21. Alternatively spliced transcript variants encoding the same protein have been described.
22022	
22023	
22024	This gene encodes a member of the RCC1-related GEF family. The N-terminal half of the encoded amino acid sequence shows similarity to the regulator of chromosome condensation RCC1, which acts as a guanine nucleotide exchange factor (GEF) protein for the Ras-related GTPase Ran.
22025	
22026	This gene encodes a member of the Rho family of GTPases. This protein can activate PAK1 and JNK1, and can induce filopodium formation and stress fiber dissolution. It may also mediate the effects of WNT1 signaling in the regulation of cell morphology, cytoskeletal organization, and cell proliferation.
22027	
22028	
22029	
22030	
22031	
22032	
22033	
22034	
22035	Naturally occurring antagonist for both ANGPT1 and TIE2; expressed only at the sites of vascular remodeling; similar to angiopoietin-1
22036	Hexokinases phosphorylate glucose to produce glucose-6-phosphate, thus committing glucose to the glycolytic pathway. This gene encodes a ubiquitous form of hexokinase which localizes to the outer membrane of mitochondria. Mutations in this gene have been associated with hemolytic anemia due to hexokinase deficiency. Alternative splicing of this gene results in five transcript variants which encode different isoforms, some of which are tissue-specific. Each isoform has a distinct N-terminus; the remainder of the protein is identical among all the isoforms. A sixth transcript variant has been described, but due to the presence of several stop codons, it is not thought to encode a protein.
22037	
22038	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene includes 2 alternatively spliced transcripts encoding 2 different isoforms. The larger isoform is a DEAD box protein with RNA helicase activity. It may participate in melting of DNA:RNA hybrids, such as those that occur during transcription, and may play a role in X-linked gene expression. It contains 2 copies of a double-stranded RNA-binding domain, a DEXH core domain and an RGG box. The RNA-binding domains and RGG box influence and regulate RNA helicase activity. The smaller isoform is a lymphocyte granule protein. It lacks RNA-binding domains and DEXH core domain, but contains an RGG box, which may render this isoform RNA binding function.
22039	
22040	
22041	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
22042	
22043	
22044	This gene encodes a protein that is a member of the vertebrate eve-related homeo box family. This protein acts as a transcriptional repressor. A similar protein in mice is critical for early embryogenesis.
22045	
22046	
22047	
22048	
22049	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies. Its function has not been identified. Alternate splicing of this gene generates two transcript variants encoding different isoforms.
22050	
22051	The product of this gene belongs to the phosphatase 2A regulatory subunit B family. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes a gamma isoform of the regulatory subunit B56 subfamily. Alternatively spliced transcript variants encoding different isoforms have been identified.
22052	
22053	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
22054	This gene encodes an enzyme that hydrolyzes terminal, non-reducing alpha-D-mannose residues in alpha-D-mannosides. Its activity is necessary for the catabolism of N-linked carbohydrates released during glycoprotein turnover and it is member of family 38 of glycosyl hydrolases. The full length protein is processed in two steps. First, a 49 aa leader sequence is cleaved off and the remainer of the protein is processed into 3 peptides of 70 kDa, 42 kDa (D) and 13/15 kDa (E). Next, the 70 kDa peptide is further processed into three peptides (A, B and C). The A, B and C peptides are disulfide-linked. Defects in this gene have been shown to be the cause of lysosomal alpha-mannosidosis (AM), a lysosomal storage disease characterized by the accumulation of unbranched oligosaccharide chains.
22055	
22056	
22057	This gene was identified by the induction of its expression in B lymphocytes by Epstein-Barr virus infection. The protein encoded by this gene is a secreted glycoprotein, which is a member of the hematopoietin receptor family related to the p40 subunit of interleukin 12 (IL-12). It might play a role in regulating cell-mediated immune responses.
22058	
22059	
22060	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
22061	HLA-DRB1 belongs to the HLA class II beta chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DRA) and a beta chain (DRB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa. It is encoded by 6 exons, exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DR molecule the beta chain contains all the polymorphisms specifying the peptide binding specificities. Hundreds of DRB1 alleles have been described and typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. DRB1 is expressed at a level five times higher than its paralogues DRB3, DRB4 and DRB5. DRB1 is present in all individuals. Allelic variants of DRB1 are linked with either none or one of the genes DRB3, DRB4 and DRB5. There are 4 related pseudogenes: DRB2, DRB6, DRB7, DRB8 and DRB9.
22062	
22063	
22064	Apoptosis is a cell death process that removes toxic and/or useless cells during mammalian development. The apoptotic process is accompanied by shrinkage and fragmentation of the cells and nuclei and degradation of the chromosomal DNA into nucleosomal units. DNA fragmentation factor (DFF) is a heterodimeric protein of 40-kD (DFFB) and 45-kD (DFFA) subunits. DFFA is the substrate for caspase-3 and triggers DNA fragmentation during apoptosis. DFF becomes activated when DFFA is cleaved by caspase-3. The cleaved fragments of DFFA dissociate from DFFB, the active component of DFF. DFFB has been found to trigger both DNA fragmentation and chromatin condensation during apoptosis. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants has not been determined.
22065	
22066	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein localizes to the nucleus and the cytoplasm. Its function has not been identified.
22067	Guanylate binding protein expression is induced by interferon.  Guanylate binding proteins are characterized by their ability to specifically bind guanine nucleotides (GMP, GDP, and GTP) and are distinguished from the GTP-binding proteins by the presence of 2 binding motifs rather than 3
22068	Synaptobrevins/VAMPs, syntaxins, and the 25-kD synaptosomal-associated protein are the main components of a protein complex involved in the docking and/or fusion of synaptic vesicles with the presynaptic membrane. This gene is a member of the vesicle-associated membrane protein (VAMP)/synaptobrevin family. Because of its high homology to other known VAMPs, its broad tissue distribution, and its subcellular localization, the protein encoded by this gene was shown to be the human equivalent of the rodent cellubrevin. In platelets the protein resides on a compartment that is not mobilized to the plasma membrane on calcium or thrombin stimulation.
22069	CYR61 is a secreted, cysteine-rich, heparin-binding protein encoded by a growth factor-inducible immediate-early gene. Acting as an extracellular, matrix-associated signaling molecule, CYR61 promotes the adhesion of endothelial cells through interaction with integrin and augments growth factor-induced DNA synthesis in the same cell type.[supplied by OMIM]
22070	
22071	
22072	
22073	
22074	
22075	
22076	
22077	
22078	This gene encodes a multipass transmembrane protein that is expressed in joints and other tissues and controls pyrophosphate levels in cultured cells. Mutation at the mouse @#$%&progressive ankylosis@#$%& (ank) locus causes a generalized, progressive form of arthritis accompanied by mineral deposition, formation of bony outgrowths, and joint destruction. The human homolog is virtually identical to the mouse protein and ANKH-mediated control of pyrophosphate levels has been suggested as a possible mechanism regulating tissue calcification and susceptibility to arthritis in higher animals.
22079	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind RNAs, followed by a glycine-rich C-terminus.
22080	
22081	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L37E family of ribosomal proteins. It is located in the cytoplasm. The protein contains a C2C2-type zinc finger-like motif. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
22082	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. A pseudogene corresponding to this gene is found on chromosome 2p.
22083	The four human glycoprotein hormones chorionic gonadotropin (CG), luteinizing hormone (LH), follicle stimulating hormone (FSH), and thyroid stimulating hormone (TSH) are dimers consisting of alpha and beta subunits that are associated noncovalently. The alpha subunits of these hormones are identical, however, their beta chains are unique and confer biological specificity. The protein encoded by this gene is the alpha subunit and belongs to the glycoprotein hormones alpha chain family.
22084	This gene belongs to subfamily P of the forkhead box (FOX) transcription factor family. Forkhead box transcription factors play important roles in the regulation of tissue- and cell type-specific gene transcription during both development and adulthood. Many members of the forkhead box gene family, including members of subfamily P, have roles in mammalian oncogenesis. This gene may play a role in the development of tumors of the kidney and larynx. Alternative splicing of this gene produces multiple transcript variants, some encoding different isoforms.
22085	
22086	
22087	
22088	
22089	
22090	
22091	
22092	
22093	
22094	
22095	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S15P family of ribosomal proteins. It is located in the cytoplasm. The protein has been shown to bind to the 5.8S rRNA in rat. The gene product of the E. coli ortholog (ribosomal protein S15) functions at early steps in ribosome assembly. This gene is co-transcribed with two U14 small nucleolar RNA genes, which are located in its third and fifth introns. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
22096	The protein encoded by this gene is a member of the Fe65 protein family. It is an adaptor protein localized in the nucleus. It interacts with the Alzheimer@#$%&s disease amyloid precursor protein (APP), transcription factor CP2/LSF/LBP1 and the low-density lipoprotein receptor-related protein. APP functions as a cytosolic anchoring site that can prevent the gene product@#$%&s nuclear translocation. This encoded protein could play an important role in the pathogenesis of Alzheimer@#$%&s disease. It is thought to regulate transcription. Also it is observed to block cell cycle progression by downregulating thymidylate synthase expression. Multiple alternatively spliced transcript variants have been described for this gene but some of their full length sequence is not known.
22097	
22098	The protein encoded by this protein contains a RING finger, a motif known to be involved in protein-DNA and protein-protein interactions. The mouse counterpart of this protein has been shown to interact with Ube2l3/UbcM4, which is an ubiquitin-conjugating enzyme involved in embryonic development.
22099	
22100	
22101	
22102	This gene encodes an ETS family transcription factor. The product of this gene contains two functional domains: a N-terminal pointed (PNT) domain that is involved in the protein-protein interactions with itself and other proteins, and a C-terminal DNA-binding domain. Gene knockout studies in mice suggest that it is required for hematopoiesis and maintenance of the developing vascular network. This gene is known to be involved in a large number of chromosomal rearrangements associated with leukemia and congenital fibrosarcoma.
22103	MVK encodes the peroxisomal enzyme mevalonate kinase. Mevalonate is a key intermediate, and mevalonate kinase a key early enzyme, in isoprenoid and sterol synthesis. Mevalonate kinase deficiency caused by mutation of MVK results in mevalonic aciduria.
22104	
22105	
22106	
22107	
22108	
22109	
22110	
22111	The protein encoded by this gene is an isozyme of the long-chain fatty-acid-coenzyme A ligase family. Although differing in substrate specificity, subcellular localization, and tissue distribution, all isozymes of this family convert free long-chain fatty acids into fatty acyl-CoA esters, and thereby play a key role in lipid biosynthesis and fatty acid degradation. This isozyme is highly expressed in brain, and preferentially utilizes myristate, arachidonate, and eicosapentaenoate as substrates. The amino acid sequence of this isozyme is 92% identical to that of rat homolog. Two transcript variants encoding the same protein have been found for this gene.
22112	
22113	
22114	
22115	
22116	The ASPA gene product, aspartoacylase, catalyzes the conversion of N-acetyl_L-aspartic acid (NAA) to aspartate and acetate.  NAA is abundant in the brain where hydrolysis by aspartoacylase is thought to help maintain white matter.  The ASPA gene product is an NAA scavenger in other tissues.  Defects in ASPA cause Canavan disease.
22117	
22118	
22119	
22120	
22121	
22122	
22123	
22124	The protein encoded by this gene is a transmembrane (type I) heparan sulfate proteoglycan that functions as a receptor in intracellular signaling. The encoded protein is found as a homodimer and is a member of the syndecan proteoglycan family. This gene is found on chromosome 20, while a pseudogene has been found on chromosome 22.
22125	The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily.  These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues.  They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and differentiation.  Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli.  This gene product inactivates ERK1 and ERK2, is predominantly expressed in hematopoietic tissues, and is localized in the nucleus.
22126	
22127	
22128	
22129	Pyruvate dehydrogenase (PDH) is a mitochondrial multienzyme complex that catalyzes the oxidative decarboxylation of pyruvate and is one of the major enzymes responsible for the regulation of homeostasis of carbohydrate fuels in mammals. The enzymatic activity is regulated by a phosphorylation/dephosphorylation cycle. Phosphorylation of PDH by a specific pyruvate dehydrogenase kinase (PDK) results in inactivation.
22130	
22131	
22132	
22133	
22134	
22135	
22136	DISCONTINUED: LocusID:79914 was defined by AK023045.1 which contains repeat sequence and does not appear to represent a protein coding gene.
22137	
22138	Staufen homolog 2 is a member of the family of double-stranded RNA (dsRNA)-binding proteins involved in the transport and/or localization of mRNAs to different subcellular compartments and/or organelles.  These proteins are characterized by the presence of multiple dsRNA-binding domains which are required to bind RNAs having double-stranded secondary structures.  Staufen homolog 2 shares 48.5% and 59.9% similarity with drosophila and human staufen, respectively.  The exact function of Staufen homolog 2 is not known, but since it contains 3 copies of conserved dsRNA binding domain, it could be involved in double-stranded RNA binding events.
22139	
22140	
22141	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind RNAs, followed by a glycine-rich C-terminus.
22142	
22143	
22144	
22145	Pleckstrin homology, Sec7 and coiled/coil domains 2 (PSCD2) is a member of the PSCD family. Members of this family have identical structural organization that consists of an N-terminal coiled-coil motif, a central Sec7 domain, and a C-terminal pleckstrin homology (PH) domain. The coiled-coil motif is involved in homodimerization, the Sec7 domain contains guanine-nucleotide exchange protein (GEP) activity, and the PH domain interacts with phospholipids and is responsible for association of PSCDs with membranes. Members of this family appear to mediate the regulation of protein sorting and membrane trafficking. PSCD2 exhibits GEP activity in vitro with ARF1, ARF3, and ARF6. PSCD2 protein is 83% homologous to PSCD1. Two transcript variants encoding different isoforms have been found for this gene.
22146	The protein encoded by this gene is a type I membrane receptor for group B coxsackieviruses and subgroup C adenoviruses. Pseudogenes of this gene are found on chromosomes 15, 18, and 21.
22147	
22148	
22149	
22150	
22151	
22152	
22153	
22154	
22155	
22156	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L11P family of ribosomal proteins. It is located in the cytoplasm. The protein binds directly to the 26S rRNA. This gene is co-transcribed with the U65 snoRNA, which is located in its fourth intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
22157	
22158	
22159	
22160	
22161	
22162	
22163	
22164	
22165	
22166	
22167	
22168	
22169	
22170	
22171	
22172	
22173	
22174	
22175	
22176	
22177	Cytoplasmic ribosomes, organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L7AE family of ribosomal proteins. It can interact with a subclass of nuclear hormone receptors, including thyroid hormone receptor, and inhibit their ability to transactivate by preventing their binding to their DNA response elements. This gene is included in the surfeit gene cluster, a group of very tightly linked genes that do not share sequence similarity. It is co-transcribed with the U24, U36a, U36b, and U36c small nucleolar RNA genes, which are located in its second, fifth, fourth, and sixth introns, respectively. This gene rearranges with the trk proto-oncogene to form the chimeric oncogene trk-2h, which encodes an oncoprotein consisting of the N terminus of ribosomal protein L7a fused to the receptor tyrosine kinase domain of trk. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
22178	
22179	
22180	
22181	
22182	
22183	
22184	
22185	
22186	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. In some tissues its expression is hormonally regulated. The expression pattern of a similar mouse protein in murine developing teeth supports a role for the protein in the degradation of enamel proteins. Alternate splice variants for this gene have been described, but their biological validity has not been determined.
22187	
22188	This protein belongs to the aldehyde dehydrogenase family of proteins. This gene encodes a mitochondrial NAD(+)-dependent succinic semialdehyde dehydrogenase. A deficiency of this enzyme, known as 4-hydroxybutyricaciduria, is a rare inborn error in the metabolism of the neurotransmitter 4-aminobutyric acid (GABA). In response to the defect, physiologic fluids from patients accumulate GHB, a compound with numerous neuromodulatory properties. Two transcript variants encoding distinct isoforms have been identified for this gene.
22189	In rat brain, Cip98 interacts with a calmodulin-dependent serine kinase, CASK (MIM 300172), and may be involved in the formation of scaffolding protein complexes that facilitate synaptic transmission in the central nervous system (CNS) (Yap et al., 2003 [PubMed 12641734]). Mutations in this gene, also known as WHRN, cause DFNB31 (MIM 607084).[supplied by OMIM]
22190	
22191	
22192	
22193	
22194	
22195	
22196	
22197	
22198	
22199	Spinophilin is a regulatory subunit of protein phosphatase-1 catalytic subunit (PP1; see MIM 176875) and is highly enriched in dendritic spines, specialized protrusions from dendritic shafts that receive most of the excitatory input in the central nervous system.[supplied by OMIM]
22200	
22201	
22202	
22203	Similarity to a related locus suggests that this gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. This subunit exists as a heterodimer with another polymerase subunit; together they form a core subassembly unit of the polymerase. Two similar genes are located downstream of this locus. Alternative splicing of this gene results in three transcript variants encoding different isoforms.
22204	This gene encodes a highly conserved neurotrophic factor. The recombinant form of this protein was shown to promote the survival and differentiation of dopaminergic neurons in culture, and was able to prevent apoptosis of motor neurons induced by axotomy. The encoded protein is processed to a mature secreted form that exists as a homodimer. The mature form of the protein is a ligand for the product of the RET (rearranged during transfection) protooncogene. In addition to the transcript encoding GDNF, two additional alternative transcripts encoding distinct proteins, referred to as astrocyte-derived trophic factors, have also been described. Mutations in this gene may be associated with Hirschsprung disease.
22205	HLA-A belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-A alleles have been described.
22206	
22207	This gene encodes cytochrome c, a component of the electron transport chain in mitochondria. The heme group of cytochrome c accepts electrons from the b-c1 complex and transfers electrons to the cytochrome oxidase complex. Cytochrome c is also involved in initiation of apoptosis. Upon release of cytochrome c to the cytoplasm, the protein binds apoptotic protease activating factor which activates the apoptotic initiator procaspase 9. Many cytochrome c pseudogenes exist, scattered throughout the human genome.
22208	
22209	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, and White). This encoded protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. This gene is clustered among 4 other ABC1 family members on 17q24, but neither the substrate nor the function of this gene is known.
22210	This gene is a member of a family of sulfate/anion transporter genes. Family members are well conserved in their genomic (number and size of exons) and protein (aa length among species) structures, but have markedly different tissue expression patterns. This gene is primarily expressed in the liver, pancreas, and brain. Three splice variants that encode different isoforms have been identified.
22211	
22212	CA6 is one of several isozymes of carbonic anhydrase. CA6 is found only in salivary glands and saliva.  The protein may play a role in reversible hydratation of carbon dioxide though its function in saliva is unknown.
22213	
22214	
22215	
22216	
22217	The protein encoded by this gene belongs to the Tec family, which is a subfamily of non-receptor protein-tyrosine kinases containing a pleckstrin homology domain. Tec family kinases are involved in the intracellular signaling mechanisms of cytokine receptors, lymphocyte surface antigens, heterotrimeric G-protein coupled receptors, and integrin molecules. They are also key players in the regulation of the immune functions. Tec kinase is an integral component of T cell signaling and has a distinct role in T cell activation. This gene may be associated with myelodysplastic syndrome.
22218	Megakaryocytopoiesis is the cellular development process that leads to platelet production. The protein encoded by this gene is a humoral growth factor that is necessary for megakaryocyte proliferation and maturation, as well as for thrombopoiesis. This protein is the ligand for MLP/C_MPL, the product of myeloproliferative leukemia virus oncogene. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
22219	
22220	
22221	
22222	
22223	Pre-mRNA splicing occurs in two sequential transesterification steps. The protein encoded by this gene is a splicing factor that has been found to be essential during the second catalytic step in the pre-mRNA splicing process. It associates with the spliceosome and contains a zinc knuckle motif that is found in other splicing factors and is involved in protein-nucleic acid and protein-protein interactions.
22224	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies. Although the function of the protein is unknown, the RING domain suggests that the protein may have DNA-binding activity. The gene localizes to the major histocompatibility complex (MHC) class I region on chromosome 6.
22225	
22226	
22227	This gene encodes a member of the regulator of G-protein signaling (RGS) family. This protein is a GTP-ase activating protein which inhibits G-protein mediated signal transduction. The protein is largely cytosolic, but G-protein activation leads to translocation of this protein to the plasma membrane. A nuclear form of this protein has also been described, but its sequence has not been identified. Multiple alternatively spliced transcript variants have been described for this gene but the full-length nature of some transcripts is not yet known.
22228	
22229	
22230	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H2B family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3.
22231	
22232	The encoded protein has a central role in the regulation of lipid metabolism in cells. It is a sterol-regulated subtilisin-like serine protease that cleaves ER membrane-bound sterol regulatory element-binding proteins (SREBPs), a reaction that initiates the two-step proteolytic process by which transcriptionally active fragments of SREBPs are released from the membrane for translocation to the nucleus. The gene product is an integral membrane ER protein, with the bulk located in the ER lumen. It is synthesized as an inactive preproprotein that is self-activated by an intramolecular cleavage that generates the mature protein. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
22233	
22234	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, and White). This encoded protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. This gene is clustered among 4 other ABC1 family members on 17q24 and may play a role in macrophage lipid homeostasis. Alternative splicing of this gene results in two transcript variants.
22235	This gene encodes a secreted protein which is mainly expressed in the endothelial cells in human lung and kidney tissues. The expression of this gene is regulated by cytokines, suggesting that it may play a role in endothelium-dependent pathological disorders. The transcript contains multiple polyadenylation and mRNA instability signals.
22236	
22237	
22238	Neurotensin receptor 1 belongs to the large superfamily of G-protein coupled receptors. NTSR1 mediates the multiple functions of neurotensin, such as hypotension, hyperglycemia, hypothermia, antinociception, and regulation of intestinal motility and secretion.
22239	
22240	
22241	The beta-adrenergic receptor kinase specifically phosphorylates the agonist-occupied form of the beta-adrenergic and related G protein-coupled receptors. Overall, the ADRBK2 enzyme has 85% amino acid similarity with ADRBK1, with the protein kinase catalytic domain having 95% similarity. The ADRBK2 mRNA is approximately 8 kilobases with a distribution similar to that of ADRBK1. These data suggest the existence of a family of receptor kinases which may serve broadly to regulate receptor function
22242	
22243	
22244	
22245	
22246	
22247	This gene encodes a protein that is similar to other immediate early response proteins. In the mouse, a similar gene may play an important role in mediating the cellular response to mitogenic signals. Studies in rats found the expression of a similar gene to be increased after waking and sleep deprivation.
22248	
22249	
22250	CAPZA1 is a member of the F-actin capping protein alpha subunit family. This gene encodes the alpha subunit of the barbed-end actin binding protein.  The protein regulates growth of the actin filament by capping the barbed end of growing actin filaments.
22251	
22252	Interleukin 6 (IL6) is a potent pleiotropic cytokine that regulates cell growth and differentiation and plays an important role in immune response. The protein encoded by this gene is a subunit of the receptor complex for IL6. The IL6 receptor is a protein complex consisting of this protein and interleukin 6 signal transducer (IL6ST/GP130/IL6-beta), a receptor subunit also shared by many other cytokines. Dysregulated production of IL6 and this receptor are implicated in the pathogenesis of many diseases, such as multiple myeloma, autoimmune diseases and prostate cancer. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
22253	
22254	
22255	This gene encodes a member of the cysteine protease family C13 that is involved in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is a glycolipid found on many blood cells and serves to anchor proteins to the cell surface. This protein is a member of the multisubunit enzyme, GPI transamidase and is thought to be its enzymatic component. GPI transamidase mediates GPI anchoring in the endoplasmic reticulum, by catalyzing the transfer of fully assembled GPI units to proteins.
22256	Hexokinases phosphorylate glucose to produce glucose-6-phosphate, thus committing glucose to the glycolytic pathway. This gene encodes hexokinase 3. Similar to hexokinases 1 and 2, this allosteric enzyme is inhibited by its product glucose-6-phosphate.
22257	
22258	Spectrin is a tetrameric cytoskeletal protein essential for determination of cell shape, resilience of membranes to mechanical stress, positioning of transmembrane proteins, and organization of organelles and molecular traffic. Alpha- and beta-spectrin subunits form antiparallel dimers that self associate to give the spectrin tetramer. Beta subunits, such as SPTBN1, contain most of the spectrin binding activity (Hayes et al., 2000).[supplied by OMIM]
22259	The protein encoded by this gene is an aminopeptidase which prefers acidic amino acids, and specifically favors aspartic acid over glutamic acid. It is thought to be a cytosolic protein involved in general metabolism of intracellular proteins.
22260	
22261	MAX dimerization protein belongs to a subfamily of MAX-interacting proteins. This protein competes with MYC for binding to MAX to form a sequence-specific DNA-binding complex, acts as a transcriptional repressor (while MYC appears to function as an activator) and is a candidate tumor suppressor.
22262	
22263	
22264	Acetyl-Coenzyme A acyltransferase (ACAA1) is an enzyme operative in the beta-oxidation system of the peroxisomes.  Deficiency of this enzyme leads to pseudo-Zellweger syndrome
22265	
22266	This intronless gene encodes a member of the SRY-related HMG-box (SOX) family of transcription factors involved in the regulation of embryonic development and in the determination of cell fate. The encoded protein may act as a transcriptional activator after forming a protein complex with other proteins. Mutations in this gene have been associated with bilateral anophthalmia, a severe form of structural eye malformation. This gene lies within an intron of another gene called SOX2 overlapping transcript (SOX2OT).
22267	
22268	
22269	
22270	
22271	
22272	
22273	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
22274	This gene encodes a protein belonging to the pantothenate kinase family. Pantothenate kinase is a key regulatory enzyme in the biosynthesis of coenzyme A (CoA) in bacteria and mammalian cells. It catalyzes the first committed step in the universal biosynthetic pathway leading to CoA and is itself subject to regulation through feedback inhibition by CoA. This family member is expressed most abundantly in the liver.
22275	The protein encoded by this gene is similar to the mouse stromal membrane-associated protein-1. This similarity suggests that this human gene product is also a type II membrane glycoprotein involved in the erythropoietic stimulatory activity of stromal cells.
22276	
22277	
22278	
22279	
22280	
22281	The protein encoded by this gene is a transmembrane (type I) heparan sulfate proteoglycan and is a member of the syndecan proteoglycan family. The syndecans mediate cell binding, cell signaling, and cytoskeletal organization and syndecan receptors are required for internalization of the HIV-1 tat protein. The syndecan-2 protein functions as an integral membrane protein and participates in cell proliferation, cell migration and cell-matrix interactions via its receptor for extracellular matrix proteins. Altered syndecan-2 expression has been detected in several different tumor types.
22282	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds to the immunosuppressants FK506 and rapamycin. It is thought to mediate calcineurin inhibition. It also interacts functionally with mature hetero-oligomeric progesterone receptor complexes along with the 90 kDa heat shock protein and P23 protein. This gene has been found to have multiple polyadenylation sites.
22283	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene is located in the nucleoplasm and has three repeats of KH domains that binds to RNAs. It is distinct among other hnRNP proteins in its binding preference; it binds tenaciously to poly(C). This protein is also thought to have a role during cell cycle progession. Multiple alternatively spliced transcript variants have been described for this gene but only three variants have been fully described.
22284	
22285	
22286	
22287	
22288	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. Expression of this gene is up-regulated by interferon. This gene is mappped to chromosome 11p15, where it resides within a TRIM gene cluster. Alternate splicing of this gene generates four transcript variants. Additionally, a read-through transcript transcribed from this gene and TRIM6 has been observed.
22289	
22290	
22291	
22292	
22293	
22294	
22295	
22296	Histone acetylation plays a key role in the regulation of eukaryotic gene expression. Histone acetylation and deacetylation are catalyzed by multisubunit complexes. The protein encoded by this gene is a component of the histone deacetylase complex which includes SIN3, SAP30, HDAC1, HDAC2, RbAp46, RbAp48 and other polypeptides. This protein directly interacts with SIN3 and enhances SIN3-mediated transcriptional repression when tethered to the promoter.
22297	This protein belongs to the aldehyde dehydrogenases family of proteins. This enzyme plays a role in the valine and pyrimidine catabolic pathways. The product of this gene, a mitochondrial methylmalonate semialdehyde dehydrogenase, catalyzes the irreversible oxidative decarboxylation of malonate and methylmalonate semialdehydes to acetyl- and propionyl-CoA. Methylmalonate semialdehyde dehydrogenase deficiency is characterized by elevated beta-alanine, 3-hydroxypropionic acid, and both isomers of 3-amino and 3-hydroxyisobutyric acids in urine organic acids.
22298	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the ATPase subunits, a member of the triple-A family of ATPases which have a chaperone-like activity. Pseudogenes have been identified on chromosomes 8 and 12.
22299	This gene encodes an E3 ubiquitin-protein ligase, part of the ubiquitin protein degradation system. This imprinted gene is maternally expressed in brain and biallelically expressed in other tissues. Maternally inherited deletion of this gene causes Angelman Syndrome, characterized by severe motor and intellectual retardation, ataxia, hypotonia, epilepsy, absence of speech, and characteristic facies. The protein also interacts with the E6 protein of human papillomavirus types 16 and 18, resulting in ubiquitination and proteolysis of tumor protein p53. Alternative splicing of this gene results in three transcript variants encoding three isoforms with different N-termini. Additional transcript variants have been described, but their full length nature has not been determined.
22300	
22301	
22302	Heme oxygenase, an essential enzyme in heme catabolism, cleaves heme to form biliverdin, which is subsequently converted to bilirubin by biliverdin reductase, and carbon monoxide, a putative neurotransmitter. Heme oxygenase activity is induced by its substrate heme and by various nonheme substances. Heme oxygenase occurs as 2 isozymes, an inducible heme oxygenase-1 and a constitutive heme oxygenase-2. HMOX1 and HMOX2 belong to the heme oxygenase family.
22303	Adaptins are important components of clathrin-coated vesicles transporting ligand-receptor complexes from the plasma membrane or from the trans-Golgi network to lysosomes. The adaptin family of proteins is composed of four classes of molecules named alpha, beta-, beta prime- and gamma- adaptins. Adaptins, together with medium and small subunits, form a heterotetrameric complex called an adaptor, whose role is to promote the formation of clathrin-coated pits and vesicles. The protein encoded by this gene is a gamma-adaptin protein and it belongs to the adaptor complexes large subunits family.
22304	
22305	
22306	
22307	
22308	
22309	
22310	This gene is a member of the insulin-like growth factor binding protein (IGFBP) family and encodes a protein with an IGFBP domain and a thyroglobulin type-I domain. The protein binds both insulin-like growth factors (IGFs) I and II and circulates in the plasma in both glycosylated and non-glycosylated forms. Binding of this protein prolongs the half-life of the IGFs and alters their interaction with cell surface receptors.
22311	
22312	This gene is a type II classical cadherin from the cadherin superfamily. The encoded membrane protein is a calcium dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. Type II (atypical) cadherins are defined based on their lack of a HAV cell adhesion recognition sequence specific to type I cadherins. Cadherins mediate cell-cell binding in a homophilic manner, contributing to the sorting of heterogeneous cell types and the maintenance of orderly structures. Alternative splicing in the 5@#$%& UTR of this gene yields variant transcripts encoding the same protein.
22313	
22314	
22315	
22316	
22317	The protein encoded by this gene is a proteolipid that may be involved in the regulation of ion channels during brain development. The encoded protein may also play a role in forming and maintaining the structure of the nervous system. This gene is found within an intron of the BLCAP gene, but on the opposite strand. This gene is imprinted and is expressed only from the paternal allele, while BLCAP is not imprinted. Two transcript variants encoding two different isoforms have been found for this gene.
22318	There are two proteins encoded by this gene, oxytocin and neurophysin I. Oxytocin is posterior pituitary hormone which is synthesized as an inactive precursor in the hypothalamus along with its carrier protein neurophysin I. Together with neurophysin, it is packaged into neurosecretory vesicles and transported axonally to the nerve endings in the neurohypophysis, where it is either stored or secreted into the bloodstream. The precursor seems to be activated while it is being transported along the axon to the posterior pituitary. This hormone contracts smooth muscle during parturition and lactation. It is also involved in cognition, tolerance, adaptation and complex sexual and maternal behaviour, as well as in the regulation of water excretion and cardiovascular functions.
22319	
22320	
22321	
22322	Members of the @#$%&frizzled@#$%& gene family encode 7-transmembrane domain proteins that are receptors for Wnt signaling proteins.  The FZD1 protein contains a signal peptide, a cysteine-rich domain in the N-terminal extracellular region, 7 transmembrane domains, and a C-terminal PDZ domain-binding motif.  The FZD1 transcript is expressed in various tissues.
22323	Originally named because of its sequence similarity to the Saccharomyces cerevisiae NHP2 (non-histone protein 2), this protein appears to be a highly conserved nuclear protein that is a component of the [U4/U6.U5] tri-snRNP. It binds to the 5@#$%& stem-loop of U4 snRNA. Two transcript variants encoding the same protein have been found for this gene.
22324	
22325	Hair keratins and hair keratin-associated proteins (KAPs), such as KRTAP13-1, are the main structural proteins of hair fibers (Rogers et al., 2002).[supplied by OMIM]
22326	
22327	
22328	
22329	
22330	
22331	
22332	
22333	
22334	
22335	Zinc is an essential cofactor for hundreds of enzymes. It is involved in protein, nucleic acid, carbohydrate, and lipid metabolism, as well as in the control of gene transcription, growth, development, and differentiation. SLC39A14 belongs to a subfamily of proteins that show structural characteristics of zinc transporters (Taylor and Nicholson, 2003).[supplied by OMIM]
22336	
22337	
22338	
22339	The estrogen receptor (ESR) is a ligand-activated transcription factor composed of several domains important for hormone binding, DNA binding, and activation of transcription. Alternative splicing results in several ESR1 mRNA transcripts, which differ primarily in their 5-prime untranslated regions. The translated receptors show less variability.[supplied by OMIM]
22340	
22341	
22342	
22343	
22344	
22345	
22346	
22347	
22348	
22349	This gene encodes a member of the C-type lectin/C-type lectin-like domain (CTL/CTLD) superfamily. Members of this family share a common protein fold and have diverse functions, such as cell adhesion, cell-cell signalling, glycoprotein turnover, and roles in inflammation and immune response. The encoded type 2 transmembrane protein may play a role in inflammatory and immune response. Multiple transcript variants encoding distinct isoforms have been identified for this gene. This gene is closely linked to other CTL/CTLD superfamily members on chromosome 12p13 in the natural killer gene complex region.
22350	
22351	
22352	
22353	
22354	
22355	
22356	
22357	
22358	
22359	
22360	This gene is a member of the N-myc downregulated gene family which belongs to the alpha/beta hydrolase superfamily. The protein encoded by this gene is a cytoplasmic protein involved in stress responses, hormone responses, cell growth, and differentiation. Mutation in this gene has been reported to be causative for hereditary motor and sensory neuropathy-Lom.
22361	
22362	
22363	
22364	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S3AE family of ribosomal proteins. It is located in the cytoplasm. Disruption of the gene encoding rat ribosomal protein S3a, also named v-fos transformation effector protein, in v-fos-transformed rat cells results in reversion of the transformed phenotype. Transcript variants utilizing alternative transcription start sites have been described. This gene is co-transcribed with the U73A and U73B small nucleolar RNA genes, which are located in its fourth and third introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
22365	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
22366	
22367	
22368	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
22369	
22370	
22371	
22372	
22373	
22374	
22375	
22376	
22377	The protein encoded by this gene is a member of the intercellular adhesion molecule (ICAM) family. All ICAM proteins are type I transmembrane glycoproteins, contain 2-9 immunoglobulin-like C2-type domains, and bind to the leukocyte adhesion LFA-1 protein. This protein is constitutively and abundantly expressed by all leucocytes and may be the most important ligand for LFA-1 in the initiation of the immune response. It functions not only as an adhesion molecule, but also as a potent signalling molecule.
22378	
22379	
22380	
22381	
22382	
22383	This gene encodes a putative RNA binding protein of unknown function.  Mutations in this gene, specifically a t(11;22)(q24;q12) translocation, are known to cause Ewing sarcoma as well as neuroectodermal and various other tumors.  Alternative splicing of this gene results in two products.
22384	
22385	This gene encodes a putative RNA binding protein of unknown function.  Mutations in this gene, specifically a t(11;22)(q24;q12) translocation, are known to cause Ewing sarcoma as well as neuroectodermal and various other tumors.  Alternative splicing of this gene results in two products.
22386	
22387	
22388	
22389	
22390	
22391	The protein encoded by this gene catalyzes the conversion of (S)-2,3 oxidosqualene to lanosterol. The encoded protein is a member of the terpene cyclase/mutase family and catalyzes the first step in the biosynthesis of cholesterol, steroid hormones, and vitamin D. Two transcript variants encoding the same protein have been found for this gene.
22392	
22393	
22394	
22395	
22396	
22397	
22398	
22399	
22400	
22401	
22402	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes one of the larger subunits of TFIID that is stably associated with the TFIID complex. It contributes to interactions at and downstream of the transcription initiation site, interactions that help determine transcription complex response to activators.
22403	Stearoyl-CoA desaturases catalyze the committed step in the biosynthesis of monounsaturated fatty acids from saturated fatty acids. This reaction involves the introduction of a cis-double bond between carbons 9 and 10 in a spectrum of methylene-interrupted fatty acyl-CoAs.[supplied by OMIM]
22404	This gene product belongs to the zinc finger protein superfamily, members of which are regulatory proteins characterized by nucleic acid-binding zinc finger domains. The encoded protein contains 20 tandemly arrayed C2H2-type zinc fingers, a Kruppel-associated box (KRAB) domain, and a SCAN box. This transcript turns over rapidly and contains 3@#$%& UTR AUUUA motifs, which are often a hallmark of rapid turnover. It is overexpressed in some thyroid papillary carcinomas. This gene is located in a cluster of zinc finger genes at 3p21.
22405	
22406	The CD233 gene is located on chromosome 17q21-q22 and is part of the anion exchanger (AE) family. CD233 is expressed in the erythrocyte plasma membrane where it functions as a chloride/bicarbonate exchanger involved in carbon dioxide transport from tissues to lungs. The protein comprises two domains that are structurally and functionally distinct. The N-terminal 40kDa domain is located in the cytoplasm and acts as an attachment site for the red cell skeleton by binding ankyrin. The glycosylated C-terminal membrane-associated domain contains 12-14 membrane spanning segments and carries out the stilbene disulphonate-sensitive exchange transport of anions. The cytoplasmic tail at the extreme C-terminus of the membrane domain binds carbonic anhydrase II. CD233 associates with the red cell membrane protein glycophorin A and this association promotes the correct folding and translocation of CD233. CD233 is predominantly dimeric but forms tetramers in the presence of ankyrin. Many CD233 mutations are known in man and these mutations can lead to two types of disease; destabilization of red cell membrane leading to hereditary spherocytosis, and defective kidney acid secretion leading to distal renal tubular acidosis. Other CD233 mutations that do not give rise to disease result in novel blood group antigens, which form the Diego blood group system. Southeast Asian ovalocytosis (SAO, Melanesian ovalocytosis) results from the heterozygous presence of a deletion in the CD233 protein and is common in areas where Plasmodium falciparum malaria is endemic. One CD233 null human is known also with very severe anemia and nephrocalcinosis [PROW]
22407	This protein belongs to the aldehyde dehydrogenase family of proteins. It has a high activity for oxidation of gamma-aminobutyraldehyde and other amino aldehydes. The enzyme catalyzes the dehydrogenation of gamma-aminobutyraldehyde to gamma-aminobutyric acid (GABA). This isozyme is a tetramer of identical 54-kD subunits.
22408	
22409	
22410	The protein encoded by this gene belongs to the Ser/Thr family of protein kinases and HIPK subfamily. It phosphorylates homeodomain transcription factors and may also function as a co-repressor for homeodomain transcription factors. Alternative splicing results in four transcript variants encoding four distinct isoforms.
22411	
22412	
22413	
22414	The nucleolus is a dense subnuclear membraneless organelle that assembles around clusters of rRNA genes and functions in ribosome biogenesis. This gene encodes a nucleolar RNA-associated protein that is highly conserved between species. RNase treatment of permeabilized cells indicates that the nucleolar localization is RNA dependent. Further studies suggest that the protein is associated with ribosome biogenesis through an interaction with pre-rRNA primary transcripts. Alternative splicing has been observed at this locus and three splice variants, each encoding a distinct isoform, have been identified.
22415	
22416	
22417	
22418	The protein encoded by this gene is a member of the SEC23 subfamily of the SEC23/SEC24 family, which is involved in vesicle trafficking. The encoded protein has similarity to yeast Sec23p component of COPII. COPII is the coat protein complex responsible for vesicle budding from the ER. The function of this gene product has been implicated in cargo selection and concentration. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene.
22419	
22420	Members of arrestin/beta-arrestin protein family are thought to participate in agonist-mediated desensitization of G-protein-coupled receptors and cause specific dampening of cellular responses to stimuli such as hormones, neurotransmitters, or sensory signals.  Arrestin beta 1 is a cytosolic protein and acts as a cofactor in the beta-adrenergic receptor kinase (BARK) mediated desensitization of beta-adrenergic receptors.  Besides the central nervous system, it is expressed at high levels in  peripheral blood leukocytes, and thus the BARK/beta-arrestin system is believed to play a major role in regulating receptor-mediated immune functions.  Alternatively spliced transcripts encoding different isoforms of arrestin beta 1 have been described, however, their exact functions are not known.
22421	
22422	
22423	
22424	NKX2C is a member of the NKX family of homeodomain-containing transcription factors, which are implicated in many aspects of cell type specification and maintenance of differentiated tissue functions. See Harvey (1996) for a review of the structure, regulation, function, and evolution of NK2 homeobox genes with an emphasis on their roles in heart development.[supplied by OMIM]
22425	The protein encoded by this gene is a heat stable, nuclear protein and functions as a molecular chaperone. It is thought to regulate dimerization, DNA binding, and transcriptional activity of basic region-leucine zipper (bZIP) proteins.
22426	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 non-identical kinase catalytic domains and phosphorylates several residues of the S6 ribosomal protein. The kinase activity of this protein leads to an increase in protein synthesis and cell proliferation. Amplification of the region of DNA encoding this gene and overexpression of this kinase are seen in some breast cancer cell lines. Alternate translational start sites have been described and alternate transcriptional splice variants have been observed but have not been thoroughly characterized.
22427	
22428	
22429	CCCTC-binding factor (CTCF), an 11-zinc-finger factor involved in gene regulation, utilizes different zinc fingers to bind varying DNA target sites. CTCF forms methylation-sensitive insulators that regulate X-chromosome inactivation. This gene is a paralog of CTCF and appears to be expressed primarily in the cytoplasm of spermatocytes, unlike CTCF which is expressed primarily in the nucleus of somatic cells. CTCF and the protein encoded by this gene are normally expressed in a mutually exclusive pattern that correlates with resetting of methylation marks during male germ cell differentiation.
22430	
22431	
22432	
22433	This gene encodes a member of the membrane fatty acid desaturase family which is responsible for inserting double bonds into specific positions in fatty acids. This protein contains three His-containing consensus motifs that are characteristic of a group of membrane fatty acid desaturases. It is predicted to be a multiple membrane-spanning protein localized to the endoplasmic reticulum. Overexpression of this gene inhibited biosynthesis of the EGF receptor, suggesting a possible role of a fatty acid desaturase in regulating biosynthetic processing of the EGF receptor. Two splice variants have been identified.
22434	Potassium channels are present in most mammalian cells, where they participate in a wide range of physiologic responses. The protein encoded by this gene is an integral membrane protein and inward-rectifier type potassium channel. It is activated by internal ATP and may be responsible for the potassium buffering action of glial cells in the brain. The encoded protein, which may form a heterodimer with another potassium channel protein, has a greater tendency to allow potassium to flow into a cell rather than out of a cell.
22435	
22436	
22437	
22438	
22439	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. The gene product is an outwardly rectifying channel that is sensitive to changes in extracellular pH and is inhibited by extracellular acidification. Also referred to as an acid-sensitive potassium channel, it is activated by the anesthetics halothane and isoflurane. Although three transcripts are detected in northern blots, there is currently no sequence available to confirm transcript variants for this gene.
22440	
22441	
22442	The protein encoded by this gene is a type II membrane protein that catalyzes the transfer of sialic acid from CMP-sialic acid to galactose-containing substrates. The encoded protein, which is normally found in the Golgi but which can be proteolytically processed to a soluble form, is involved in the generation of the cell-surface carbohydrate determinants and differentiation antigens HB-6, CDw75, and CD76. This protein is a member of glycosyltransferase family 29. Three transcript variants encoding two different isoforms have been found for this gene.
22443	
22444	
22445	
22446	The protein encoded by this gene contains a C3HC4 type RING finger, which is a motif known to be involved in protein-protein interactions. The specific function of this protein has not yet been determined.
22447	AIP may play a positive role in AHR-mediated signalling possibly by influencing its receptivity for ligand and/or its nuclear targeting. AIP is the cellular negative regulator of the HBV X protein.
22448	
22449	
22450	
22451	
22452	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene encodes a member of the histone H2A family, and generates two transcripts through the use of the conserved stem-loop termination motif, and the polyA addition motif.
22453	
22454	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form complex with G proteins and stimulate Rho-dependent signals. This protein may be involved in the control of cytoskeletal organization. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
22455	This gene, CYP3A4, encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and its expression is induced by glucocorticoids and some pharmacological agents. This enzyme is involved in the metabolism of approximately half the drugs which are are used today, including acetaminophen, codeine, cyclosporin A, diazepam and erythromycin. The enzyme also metabolizes some steroids and carcinogens. This gene is part of a cluster of cytochrome P450 genes on chromosome 7q21.1. Previously another CYP3A gene, CYP3A3, was thought to exist; however, it is now thought that this sequence represents a transcript variant of CYP3A4.
22456	
22457	
22458	The BMP1 locus encodes a protein that is capable of inducing formation of cartilage in vivo.  Although other bone morphogenetic proteins are members of the TGF-beta superfamily, BMP1 encodes a protein that is not closely related to other known growth factors. BMP1 protein and procollagen C proteinase (PCP), a secreted metalloprotease requiring calcium and needed for cartilage and bone formation, are identical. PCP or BMP1 protein cleaves the C-terminal propeptides of procollagen I, II, and III and its activity is increased by the procollagen C-endopeptidase enhancer protein.  The BMP1 gene is expressed as alternatively spliced variants that share an N-terminal protease domain but differ in their C-terminal region
22459	
22460	
22461	
22462	
22463	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene is located in the nucleoplasm and has three repeats of KH domains that binds to RNAs. It is distinct among other hnRNP proteins in its binding preference; it binds tenaciously to poly(C). This protein is also thought to have a role during cell cycle progession. Multiple alternatively spliced transcript variants have been described for this gene but only three variants have been fully described.
22464	
22465	
22466	
22467	GARP encodes a type I membrane protein which contains 20 leucine-rich repeats. Alterations in the chromosomal region 11q13-11q14 are involved in several pathologies. The FGF and TYR genes are also located in this region.
22468	
22469	This gene encodes a member of the p34Cdc2 protein kinase family. p34Cdc2 kinase family members are known to be essential for eukaryotic cell cycle control. This gene is in close proximity to CDC2L1, a nearly identical gene in the same chromosomal region. The gene loci including this gene, CDC2L1, as well as metalloprotease MMP21/22, consist of two identical, tandemly linked genomic regions, which are thought to be a part of the larger region that has been duplicated. This gene and CDC2L1 were shown to be deleted or altered frequently in neuroblastoma with amplified MYCN genes. The protein kinase encoded by this gene could be cleaved by caspases and was demonstrated to play roles in cell apoptosis. A large number of the alternatively spliced variants of this gene have been reported.
22470	
22471	
22472	
22473	
22474	
22475	
22476	
22477	
22478	
22479	This gene encodes a transcription factor that binds to the sterol regulatory element-1 (SRE1), which is a decamer flanking the low density lipoprotein receptor gene and some genes involved in sterol biosynthesis. The protein is synthesized as a precursor that is attached to the nuclear membrane and endoplasmic reticulum. Following cleavage, the mature protein translocates to the nucleus and activates transcription by binding to the SRE1. Sterols inhibit the cleavage of the precursor, and the mature nuclear form is rapidly catabolized, thereby reducing transcription. The protein is a member of the basic helix-loop-helix-leucine zipper (bHLH-Zip) transcription factor family. This gene is located within the Smith-Magenis syndrome region on chromosome 17. Two transcript variants encoding different isoforms have been found for this gene.
22480	
22481	
22482	The PRX gene encodes L- and S-periaxin, proteins of myelinating Schwann cells, and is mutated in Dejerine-Sottas syndrome (MIM 145900) and Charcot-Marie-Tooth disease type 4F (MIM 145900).[supplied by OMIM]
22483	
22484	
22485	
22486	This gene is a member of the regulatory factor X gene family, which encodes transcription factors that contain a highly-conserved winged helix DNA binding domain. The protein encoded by this gene is structurally related to regulatory factors X2, X3, X4, and X5. It is a transcriptional activator that can bind DNA as a monomer or as a heterodimer with RFX family members X2, X3, and X5, but not with X4. This protein binds to the X-boxes of MHC class II genes and is essential for their expression.  Also, it can bind to an inverted repeat that is required for expression of hepatitis B virus genes.
22487	
22488	
22489	The encoded protein, which directly binds to pituitary tumor-transforming gene 1 protein (PTTG1), facilitates the nuclear translocation of PTTG1 and potentiates the transcriptional activation of basic fibroblast growth factor by PTTG1. The gene product localizes to both the cytoplasm and nucleus. Its NLS is required for its own nuclear localization, the nuclear localization of PTTG1, and its interaction with PTTG1.
22490	The autosomal dominant cerebellar ataxias (ADCAs) are a clinically and genetically heterogeneous group of disorders characterized by ataxia, dysarthria, dysmetria, and intention tremor. All ADCAs involve some degree of cerebellar dysfunction and a varying degree of signs from other components of the nervous system. A commonly accepted clinical classification (Harding, 1993) divides ADCAs into 3 different groups based on the presence or absence of associated symptoms such as brainstem signs or retinopathy. The presence of pyramidal and extrapyramidal symptoms and ophthalmoplegia makes the diagnosis of ADCA I, the presence of retinopathy points to ADCA II, and the absence of associated signs to ADCA III. Genetic linkage and molecular analyses revealed that ADCAs are genetically heterogeneous even within the various subtypes.[supplied by OMIM]
22491	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes a protein that is similar to one of the small subunits of TFIID, TBP-associated factor 9.
22492	The protein encoded by this gene is a growth-related cell surface protein. It was identifed because it reacts with the monoclonal antibody KI in cells, such as the ovarian carcinoma line OVCAR-3, also expressing the CAKI surface glycoprotein. The encoded protein has two enzymatic activities: catalysis of hydroquinone or NADH oxidation, and protein disulfide interchange. The two activities alternate with a period length of about 24 minutes. The encoded protein also displays prion-like properties. Two transcript variants encoding different isoforms have been found for this gene.
22493	
22494	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes a protein which is not a structural subunit, but may be a heme A biosynthetic enzyme involved in COX formation, according to the yeast mutant studies. However, the studies in Rhodobacter sphaeroides suggest that this gene is not required for heme A biosynthesis, but required for stable formation of the Cu(B) and magnesium centers of COX. This human protein is predicted to contain a transmembrane domain localized in the mitochondrial inner membrane. A pseudogene COX11P has been found on chromosome 6.
22495	The aryl hydrocarbon (Ah) receptor is involved in the induction of several enzymes that participate in xenobiotic metabolism. The ligand-free, cytosolic form of the Ah receptor is complexed to heat shock protein 90. Binding of ligand, which includes dioxin and polycyclic aromatic hydrocarbons, results in translocation of the ligand-binding subunit only to the nucleus. Induction of enzymes involved in xenobiotic metabolism occurs through binding of the ligand-bound Ah receptor to xenobiotic responsive elements in the promoters of genes for these enzymes. This gene encodes a protein that forms a complex with the ligand-bound Ah receptor, and is required for receptor function. The encoded protein has also been identified as the beta subunit of a heterodimeric transcription factor, hypoxia-inducible factor 1 (HIF1). A t(1;12)(q21;p13) translocation, which results in a TEL-ARNT fusion protein, is associated with acute myeloblastic leukemia. Three alternatively spliced variants encoding different isoforms have been described for this gene.
22496	
22497	
22498	
22499	
22500	
22501	
22502	
22503	
22504	
22505	
22506	Proteins encoded by the complexin/synaphin gene family are cytosolic proteins that function in synaptic vesicle exocytosis. These proteins bind syntaxin, part of the SNAP receptor. The protein product of this gene binds to the SNAP receptor complex and disrupts it, allowing transmitter release. Two transcript variants encoding the same protein have been found for this gene.
22507	G protein-coupled receptors (GPCRs, or GPRs) contain 7 transmembrane domains and transduce extracellular signals through heterotrimeric G proteins.[supplied by OMIM]
22508	
22509	
22510	
22511	
22512	
22513	
22514	
22515	
22516	
22517	
22518	
22519	
22520	
22521	This gene encodes a bifunctional enzyme that catalyzes the first two steps in the mammalian lysine degradation pathway. The N-terminal and the C-terminal portions of this enzyme contain lysine-ketoglutarate reductase and saccharopine dehydrogenase activity, respectively, resulting in the conversion of lysine to alpha-aminoadipic semialdehyde. Mutations in this gene are associated with familial hyperlysinemia.
22522	This gene encodes a bifunctional enzyme that catalyzes the first two steps in the mammalian lysine degradation pathway. The N-terminal and the C-terminal portions of this enzyme contain lysine-ketoglutarate reductase and saccharopine dehydrogenase activity, respectively, resulting in the conversion of lysine to alpha-aminoadipic semialdehyde. Mutations in this gene are associated with familial hyperlysinemia.
22523	The protein encoded by this gene is a component of the mammalian counterpart of the yeast Exocyst complex, which is essential for post-Golgi traffic.
22524	
22525	
22526	
22527	
22528	
22529	
22530	
22531	This gene was identified by the reactivity of its encoded protein to a monoclonal antibody prepared against brain homogenates from patients with Alzheimer@#$%&s disease. Analysis of the original protein (fetal Alz-50 reactive clone 1, or FAC1), identified as an 810 aa protein containing a DNA-binding domain and a zinc finger motif, suggested it might play a role in the regulation of transcription. High levels of FAC1 were detected in fetal brain and in patients with neurodegenerative diseases. The protein encoded by this gene is actually much larger than originally thought, and it also contains a C-terminal bromodomain characteristic of proteins that regulate transcription during proliferation. The encoded protein is highly similar to the largest subunit of the Drosophila NURF (nucleosome remodeling factor) complex. In Drosophila, the NURF complex, which catalyzes nucleosome sliding on DNA and interacts with sequence-specific transcription factors, is necessary for the chromatin remodeling required for transcription. Two alternative transcripts encoding different isoforms have been described completely.
22532	
22533	
22534	
22535	
22536	
22537	
22538	The product of this gene belongs to the family of transient receptor potential (TRP) channels. TRP channels are cation-selective channels important for cellular calcium signaling and homeostasis. The protein encoded by this gene mediates calcium entry, and this entry is potentiated by calcium store depletion. Alternatively spliced transcript variants encoding different isoforms have been identified.
22539	
22540	
22541	
22542	
22543	
22544	
22545	
22546	
22547	
22548	
22549	
22550	
22551	
22552	
22553	
22554	KRT17 encodes the type I intermediate filament chain keratin 17, expressed in nail bed, hair follicle, sebaceous glands, and other epidermal appendages. Mutations in this gene lead to Jackson-Lawler type pachyonychia congenita and steatocystoma multiplex.
22555	
22556	
22557	
22558	
22559	HLA-DQA1 belongs to the HLA class II alpha chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DQA) and a beta chain (DQB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B Lymphocytes, dendritic cells, macrophages). The alpha chain is approximately 33-35 kDa. It is encoded by 5 exons; exon 1 encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, and exon 4 encodes the transmembrane domain and the cytoplasmic tail. Within the DQ molecule both the alpha chain and the beta chain contain the polymorphisms specifying the peptide binding specificities, resulting in up to four different molecules. Typing for these polymorphisms is routinely done for bone marrow transplantation.
22560	
22561	NEPH1 is a member of the nephrin-like protein family, which includes NEPH2 (MIM 607761) and NEPH3 (MIM 607762). The cytoplasmic domains of these proteins interact with the C terminus of podocin (NPHS2; MIM 604766), and the genes are expressed in kidney podocytes, cells involved in ensuring size- and charge-selective ultrafiltration (Sellin et al., 2003).[supplied by OMIM]
22562	
22563	
22564	SEP1 localizes to cytoplasmic foci containing a complex of mRNA-degrading enzymes. In addition to mRNA metabolism, yeast Sep1 has been implicated in a variety of nuclear and cytoplasmic functions, including homologous recombination, meiosis, telomere maintenance, and microtubule assembly.[supplied by OMIM]
22565	
22566	This gene encodes the homolog of a protein identified in Drosophila as a suppressor of malignant transformation of neuroblasts and ganglion-mother cells in the optic centers of the brain. This gene product is localized to condensed chromosomes in mitotic cells. Overexpression of this gene in a glioma cell line results in improper nuclear segregation and cytokinesis producing multinucleated cells. Two transcripts have been identified for this gene.
22567	This gene belongs to the TIMP gene family. The proteins encoded by this gene family are natural inhibitors of the matrix metalloproteinases, a group of peptidases involved in degradation of the extracellular matrix. The expression of this gene is induced in response to mitogenic stimulation and subject to cell cycle regulation. Mutations in this gene cause the autosomal dominant disorder Sorsby@#$%&s fundus dystrophy.
22568	Death-associated protein kinase 1 is a positive mediator of gamma-interferon induced programmed cell death.  DAPK1 encodes a structurally unique 160-kD calmodulin dependent serine-threonine kinase that carries 8 ankyrin repeats and 2 putative P-loop consensus sites. It is a tumor suppressor candidate.
22569	
22570	
22571	
22572	
22573	
22574	The protein encoded by this gene is an enzyme that catalyzes the hydrolytic deamination of guanine, producing xanthine and ammonia. It is also known as a cytosolic regulator of PSD-95 postsynaptic targeting.
22575	
22576	
22577	
22578	Glycophorins A (GYPA) and B (GYPB) are major sialoglycoproteins of the human erythrocyte membrane which bear the antigenic determinants for the MN and Ss blood groups. GYPB gene consists of 5 exons and has 97% sequence homology with GYPA from the 5@#$%& UTR to the coding sequence encoding the first 45 amino acids. In addition to the M or N and S or s antigens, that commonly occur in all populations, about 40 related variant phenotypes have been identified. These variants include all the variants of the Miltenberger complex and several isoforms of Sta; also, Dantu, Sat, He, Mg, and deletion variants Ena, S-s-U- and Mk. Most of the variants are resulted from gene recombinations between GYPA and GYPB.
22579	
22580	The ABL1 protooncogene encodes a cytoplasmic and nuclear protein tyrosine kinase that has been implicated in processes of cell differentiation, cell division, cell adhesion, and stress response.  Activity of c-Abl protein is negatively regulated by its SH3 domain, and deletion of the SH3 domain turns ABL1 into an oncogene.  The t(9;22) translocation results in the head-to-tail fusion of the BCR (MIM:151410) and ABL1 genes present in many cases of chronic myelogeneous leukemia.  The DNA-binding activity of the ubiquitously expressed ABL1 tyrosine kinase is regulated by CDC2-mediated phosphorylation, suggesting a cell cycle function for ABL1.  The ABL1 gene is expressed as either a 6- or 7-kb mRNA transcript, with alternatively spliced first exons spliced to the common exons 2-11.
22581	
22582	
22583	
22584	
22585	
22586	
22587	DPH2L2 encodes diphthamide biosynthesis-like protein 2, one of two human genes similar to the yeast gene dph2. The yeast gene was identified by its ability to complement a diphthamide mutant strain, and thus probably functions in diphthamide biosynthesis. Diphthamide is a post-translationally modified histidine residue present in elongation factor 2 (EF2) that is the target of diphtheria toxin ADP-ribosylation. Two transcript variants encoding different isoforms have been found for this gene.
22588	
22589	
22590	
22591	
22592	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein contains six WD repeats, a clathrin heavy-chain repeat, and three transmembrane domains. This gene is conserved from C. elegans to human. It may participate in androgen-regulated signaling mechanisms or in the vesicular trafficking of androgen-regulated secretory processes. Alternatively spliced transcript variants encoding distinct isoforms have been reported but the full-length nature of one of these variants has not been defined.
22593	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein contains six WD repeats, a clathrin heavy-chain repeat, and three transmembrane domains. This gene is conserved from C. elegans to human. It may participate in androgen-regulated signaling mechanisms or in the vesicular trafficking of androgen-regulated secretory processes. Alternatively spliced transcript variants encoding distinct isoforms have been reported but the full-length nature of one of these variants has not been defined.
22594	
22595	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3.
22596	The protein encoded by this gene encodes the lysosomal enzyme, which cleaves terminal sialic acid residues from substrates such as glycoproteins and glycolipids. In the lysosome, this enzyme is part of a heterotrimeric complex together with beta-galactosidase and cathepsin A (the latter also referred to as @#$%&protective protein@#$%&). Mutations in this gene can lead to sialidosis.
22597	
22598	The origin recognition complex (ORC) is a highly conserved six subunit protein complex essential for the initiation of the DNA replication in eukaryotic cells. Studies in yeast demonstrated that ORC binds specifically to origins of replication and serves as a platform for the assembly of additional initiation factors such as Cdc6 and Mcm proteins. The protein encoded by this gene is a subunit of the ORC complex. It has been shown to form a core complex with ORC2L, -3L, and 4L. Alternatively spliced transcript variants encoding distinct isoforms have been described.
22599	ADP-ribosylation factor 5 (ARF5) is a member of the human ARF gene family. These genes encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking and as activators of phospholipase D.  The gene products include 6 ARF proteins and 11 ARF-like proteins and constitute 1 family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2,and ARF3), class II (ARF4 and ARF5) and class III (ARF6). The members of each class share a common gene organization. The ARF5 gene spans approximately 3.2kb of genomic DNA and contains six exons and five introns.
22600	
22601	The Golgi complex plays a key role in the sorting and modification of proteins exported from the endoplasmic reticulum. The protein encoded by this gene is a type II Golgi transmembrane protein. It processes protein synthesized in the rough endoplasmic reticulum and assists in the transport of protein cargo through the Golgi apparatus. The expression of this encoded protein has been observed to be upregulated in response to viral infection. Two alternatively spliced transcript variants encoding the same protein have been described for this gene.
22602	
22603	
22604	
22605	
22606	Growth arrest-specific 7 is expressed primarily in terminally differentiated brain cells and predominantly in mature cerebellar Purkinje neurons. GAS7 plays a putative role in neuronal development. Several transcript variants encoding proteins which vary in the N-terminus have been described.
22607	
22608	
22609	DLM1 encodes a Z-DNA binding protein. Z-DNA formation is a dynamic process, largely controlled by the amount of supercoiling.[supplied by OMIM]
22610	This gene encodes a protein that belongs to the inositol phosphokinase (IPK) family. This protein is likely responsible for the conversion of inositol hexakisphosphate (InsP6) to diphosphoinositol pentakisphosphate (InsP7/PP-InsP5). It may also convert 1,3,4,5,6-pentakisphosphate (InsP5) to PP-InsP4. Alternative splicing occurs for this gene; however, the full-length nature of all transcript variants has not yet been described.
22611	The protein encoded by this gene, a member of the peptidase C1 family, is a cysteine proteinase that may have a specific function in the mechanism or regulation of T-cell cytolytic activity. The encoded protein is found associated with the membrane inside the endoplasmic reticulum of natural killer and cytotoxic T-cells. Expression of this gene is up-regulated by interleukin-2.
22612	This gene encodes the beta-subunit of glucosidase II, an N-linked glycan-processing enzyme in the endoplasmic reticulum (ER). This protein is an acidic phospho-protein known to be a substrate for protein kinase C. Mutations in this gene have been associated with the autosomal dominant polycystic liver disease (PCLD). Alternatively spliced transcript variants encoding distinct isoforms have been observed.
22613	
22614	
22615	Endothelin-converting enzyme-1 is involved in the proteolytic processing of endothelin-1 (EDN1; MIM 131240), -2 (EDN2; MIM 131241), and -3 (EDN3; MIM 131242) to biologically active peptides.[supplied by OMIM]
22616	
22617	The protein encoded by this gene binds the epidermal growth factor receptor and contains one SH2 domain and two SH3 domains. Its two SH3 domains direct complex formation with proline-rich regions of other proteins, and its SH2 domain binds tyrosine phosphorylated sequences. This gene is similar to the Sem5 gene of C.elegans, which is involved in the signal transduction pathway. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
22618	
22619	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 nonidentical kinase catalytic domains and phosphorylates various substrates, including members of the mitogen-activated kinase (MAPK) signalling pathway. The activity of this protein has been implicated in controlling cell growth and differentiation. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
22620	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-transferase that belongs to the mu class. The mu class of enzymes functions in the detoxification of electrophilic compounds, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress, by conjugation with glutathione. The genes encoding the mu class of enzymes are organized in a gene cluster on chromosome 1p13.3 and are known to be highly polymorphic. These genetic variations can change an individual@#$%&s susceptibility to carcinogens and toxins as well as affect the toxicity and efficacy of certain drugs. Diversification of these genes has occurred in regions encoding substrate-binding domains, as well as in tissue expression patterns, to accommodate an increasing number of foreign compounds. Multiple transcript variants, each encoding a distinct protein isoform, have been identified.
22621	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-transferase that belongs to the mu class. The mu class of enzymes functions in the detoxification of electrophilic compounds, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress, by conjugation with glutathione. The genes encoding the mu class of enzymes are organized in a gene cluster on chromosome 1p13.3 and are known to be highly polymorphic. These genetic variations can change an individual@#$%&s susceptibility to carcinogens and toxins as well as affect the toxicity and efficacy of certain drugs. Diversification of these genes has occurred in regions encoding substrate-binding domains, as well as in tissue expression patterns, to accommodate an increasing number of foreign compounds. Multiple transcript variants, each encoding a distinct protein isoform, have been identified.
22622	
22623	
22624	
22625	This gene is thought to encode a GDP-mannose pyrophosphorylase. This enzyme catalyzes the reaction which converts mannose-1-phosphate and GTP to GDP-mannose which is involved in the production of N-linked oligosaccharides.
22626	
22627	The protein encoded by this gene belongs to the complex I 42kDA subunit family. Mammalian complex I is the first enzyme complex in the electron transport chain of mitochondria. It is composed of 45 different subunits. This protein is a component of the hydrophobic protein fraction and has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain.
22628	The protein encoded by this gene is a ubiquitous actin monomer-binding protein belonging to the profilin family. It is thought to regulate actin polymerization in response to extracellular signals. There are two alternatively spliced transcript variants encoding different isoforms described for this gene.
22629	This gene encodes a component of vacuolar ATPase (V-ATPase), a multisubunit enzyme that mediates acidification of eukaryotic intracellular organelles. V-ATPase dependent organelle acidification is necessary for such intracellular processes as protein sorting, zymogen activation, receptor-mediated endocytosis, and synaptic vesicle proton gradient generation. V-ATPase is composed of a cytosolic V1 domain and a transmembrane V0 domain. The V1 domain consists of three A and three B subunits, two G subunits plus the C, D, E, F, and H subunits. The V1 domain contains the ATP catalytic site. The V0 domain consists of five different subunits: a, c, c@#$%&, c@#$%!, and d. Additional isoforms of many of the V1 and V0 subunit proteins are encoded by multiple genes or alternatively spliced transcript variants. This encoded protein is one of two V1 domain A subunit isoforms and is found in all tissues. Transcript variants derived from alternative polyadenylation exist.
22630	The protein encoded by this gene is a type II membrane protein that catalyzes the transfer of sialic acid from CMP-sialic acid to galactose-containing substrates. The encoded protein, which is normally found in the Golgi but which can be proteolytically processed to a soluble form, is involved in the generation of the cell-surface carbohydrate determinants and differentiation antigens HB-6, CDw75, and CD76. This protein is a member of glycosyltransferase family 29. Three transcript variants encoding two different isoforms have been found for this gene.
22631	
22632	In all eukaryotes, the cell cycle is governed by cyclin-dependent protein kinases (CDKs), whose activities are regulated by cyclins and CDK inhibitors in a diverse array of mechanisms that involve the control of phosphorylation and dephosphorylation of Ser, Thr or Tyr residues. Cyclins are molecules that possess a consensus domain called the @#$%&cyclin box.@#$%& In mammalian cells, 9 cyclin species have been identified, and they are referred to as cyclins A through I. Cyclin G is a direct transcriptional target of the p53 tumor suppressor gene product and thus functions downstream of p53. GAK is an association partner of cyclin G and CDK5.
22633	
22634	
22635	
22636	The MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) family includes a number of human proteins, several of which are involved the production of leukotrienes. This gene encodes an enzyme that catalyzes the first step in the biosynthesis of cysteinyl leukotrienes, potent biological compounds derived from arachidonic acid. Leukotrienes have been implicated as mediators of anaphylaxis and inflammatory conditions such as human bronchial asthma. This protein localizes to the nuclear envelope and adjacent endoplasmic reticulum. Multiple protein isoforms are encoded by transcript variants of this gene.
22637	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. This gene has been described to generate two alternatively spliced transcript variants which encode different isoforms.
22638	The leukemia inhibitory factor is a polyfunctional cytokine that affects the differentiation, survival, and proliferation of a wide variety of cells in the adult and the embryo. LIF action appears to be mediated through a high-affinity receptor complex composed of a low-affinity LIF binding chain (LIF receptor) and a high-affinity converter subunit, gp130. Both LIFR and gp130 are members of a family of cytokine receptors that includes components of the receptors for the majority of hematopoietic cytokines and for cytokines that affect other systems, including the ciliary neurotrophic factor, growth hormone and prolactin.
22639	
22640	The enzyme encoded by this gene is responsible for the formation of the blood group I antigen. The i and I antigens are determined by linear and branched poly-N-acetyllactosaminoglycans, respectively. During embryonic development in human erythrocytes, the fetal i antigen is replaced by the adult I antigen as the result of the appearance of a beta-1,6-N-acetylglucosaminyltransferase, the I-branching enzyme. This gene encodes the I-branching enzyme that converts the linear form into the branched form. Defects in this gene have been associated with adult i blood group phenotype. Several transcript variants encoding different isoforms have been described.
22641	This gene encodes the enzyme that metabolizes thiopurine drugs via S-adenosyl-L-methionine as the S-methyl donor and S-adenosyl-L-homocysteine as a byproduct. Thiopurine drugs such as 6-mercaptopurine are used as chemotherapeutic agents. Genetic polymorphisms that affect this enzymatic activity are correlated with variations in sensitivity and toxicity to such drugs within individuals. A pseudogene for this locus is located on chromosome 18q.
22642	
22643	Cofactor C is one of four proteins (cofactors A, D, E, and C) involved in the pathway leading to correctly folded beta-tubulin from folding intermediates. Cofactors A and D are believed to play a role in capturing and stabilizing beta-tubulin intermediates in a quasi-native confirmation. Cofactor E binds to the cofactor D/beta-tubulin complex; interaction with cofactor C then causes the release of beta-tubulin polypeptides that are committed to the native state.
22644	The protein encoded by this gene contains a RING finger domain, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions. This gene is located in a chromosomal region known to be frequently deleted in patients with neurofibromatosis. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
22645	
22646	This gene is similar to the mouse obesity gene (ob). The protein encoded by this gene is secreted by white adipocytes. In the mouse study, mutations in this gene are linked to severe and morbid obesity.
22647	
22648	The protein encoded by this gene is a lysosomal cysteine proteinase composed of a dimer of disulfide-linked heavy and light chains, both produced from a single protein precursor. It is also known as amyloid precursor protein secretase and is involved in the proteolytic processing of amyloid precursor protein (APP). Incomplete proteolytic processing of APP has been suggested to be a causative factor in Alzheimer disease, the most common cause of dementia. Overexpression of the encoded protein, which is a member of the peptidase C1 family, has been associated with esophageal adenocarcinoma and other tumors. At least five transcript variants encoding the same protein have been found for this gene.
22649	
22650	
22651	
22652	
22653	
22654	
22655	
22656	The protein encoded by this gene is a transformation and shape-change sensitive actin cross-linking/gelling protein found in fibroblasts and smooth muscle. Its expression is down-regulated in many cell lines, and this down-regulation may be an early and sensitive marker for the onset of transformation. A functional role of this protein is unclear. Two transcript variants encoding the same protein have been found for this gene.
22657	
22658	The protein encoded by this gene contains ARF-GAP, RHO-GAP, ankyrin repeat, RAS-associating, and pleckstrin homology domains. In vitro, this protein displays RHO-GAP and phosphatidylinositol (3,4,5) trisphosphate (PIP3)-dependent ARF-GAP activity. The encoded protein associates with the Golgi, and the ARF-GAP activity mediates changes in the Golgi and the formation of filopodia. The RHO-GAP activity may mediate cell rounding and loss of stress fibers. At least three transcript variants encoding different isoforms have been found for this gene, but the full-length nature of one variant has not been determined.
22659	The somatomedins, or insulin-like growth factors (IGFs), comprise a family of peptides that play important roles in mammalian growth and development. IGF1 mediates many of the growth-promoting effects of growth hormone (GH; MIM 139250). Early studies showed that growth hormone did not directly stimulate the incorporation of sulfate into cartilage, but rather acted through a serum factor, termed @#$%&sulfation factor,@#$%& which later became known as @#$%&somatomedin@#$%& (Daughaday et al., 1972). Three main somatomedins have been characterized: somatomedin C (IGF1), somatomedin A (IGF2; MIM 147470), and somatomedin B (MIM 193190) (Rotwein, 1986; Rosenfeld, 2003).[supplied by OMIM]
22660	This gene encodes one of several uracil-DNA glycosylases. One important function of uracil-DNA glycosylases is to prevent mutagenesis by eliminating uracil from DNA molecules by cleaving the N-glycosylic bond and initiating the base-excision repair (BER) pathway. Uracil bases occur from cytosine deamination or misincorporation of dUMP residues. Alternative promoter usage and splicing of this gene leads to two different isoforms: the mitochondrial UNG1 and the nuclear UNG2.
22661	
22662	
22663	
22664	
22665	
22666	
22667	
22668	The increased expression level of this gene is associated with HER-2/neu proto-oncogene overexpression. Amplification and resulting overexpression of this proto-oncogene are found in approximately 30% of human breast and 20% of human ovarian cancers. Further characterization of the product of this gene may yield insight into the fundamental biology and pathogenetic effects of HER-2/neu overexpression in human breast and ovarian cancer cells. Alternatively spliced variants encoding different isoforms have been identified for this gene.
22669	
22670	The protein encoded by this gene is similar to a mitochondrial di-iron containing hydroxylase in Saccharomyces cerevisiae that is involved with ubiquinone biosynthesis. Mutations in the yeast gene lead to slower development and longer life span. Transcript variants have been described for this gene, but their full-length sequences have not been determined.
22671	
22672	
22673	
22674	
22675	
22676	
22677	The WFIKKN1 protein contains a WAP domain, follistatin domain, immunoglobulin domain, two tandem Kunitz domains, and an NTR domain. This gene encodes a WFIKKN1-related protein which has the same domain organization as the WFIKKN1 protein. The WAP-type, follistatin type, Kunitz-type, and NTR-type protease inhibitory domains may control the action of multiple types of proteases.
22678	
22679	
22680	
22681	
22682	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. Its encoded protein is secreted and may play a role in suppression of tumorigenesis in breast and prostate cancers. Alternate splicing of this gene results in two transcript variants encoding the same protein.
22683	
22684	This gene encodes a protein which is capable of interacting with the major nonstructural protein of parvovirus H-1 and 70-kDa heat shock cognate protein; however, its function is not known. Since this transcript is expressed ubiquitously in various tissues, this protein may serve a housekeeping function.
22685	This gene encodes the third discovered human homologue of the Drosophilia melanogaster type I membrane protein notch. In Drosophilia, notch interaction with its cell-bound ligands (delta, serrate) establishes an intercellular signalling pathway that plays a key role in neural development. Homologues of the notch-ligands have also been identified in human, but precise interactions between these ligands and the human notch homologues remains to be determined. Mutations in NOTCH3 have been identified as the underlying cause of cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL).
22686	The product of this gene is an epsilon subunit of coatomer protein complex. Coatomer is a cytosolic protein complex that binds to dilysine motifs and reversibly associates with Golgi non-clathrin-coated vesicles. It is required for budding from Golgi membranes, and is essential for the retrograde Golgi-to-ER transport of dilysine-tagged proteins. Coatomer complex consists of at least the alpha, beta, beta@#$%&, gamma, delta, epsilon and zeta subunits. Alternatively spliced transcript variants encoding different isoforms have been identified.
22687	cAMP is a signaling molecule important for a variety of cellular functions. cAMP exerts its effects by activating the cAMP-dependent protein kinase (AMPK), which transduces the signal through phosphorylation of different target proteins. The inactive holoenzyme of AMPK is a tetramer composed of two regulatory and two catalytic subunits. cAMP causes the dissociation of the inactive holoenzyme into a dimer of regulatory subunits bound to four cAMP and two free monomeric catalytic subunits. Four different regulatory subunits and three catalytic subunits of AMPK have been identified in humans. The protein encoded by this gene is a member of the Ser/Thr protein kinase family and is a catalytic subunit of AMPK. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
22688	
22689	
22690	Voltage-dependent calcium channels control the rapid entry of Ca(2+) into a variety of cell types and are therefore involved in both electrical and cellular signaling. T-type channels, such as CACNA1I, are activated by small membrane depolarizations and can generate burst firing and pacemaker activity.[supplied by OMIM]
22691	
22692	
22693	
22694	
22695	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S28E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
22696	
22697	Iron-sulfur clusters are required for the function of many cellular enzymes. The protein encoded by this gene supplies inorganic sulfur to these clusters by removing the sulfur from cysteine, creating alanine in the process. This gene uses alternate in-frame translation initiation sites to generate mitochondrial forms and cytoplasmic/nuclear forms. Selection of the alternative initiation sites is determined by the cytosolic pH. The encoded protein belongs to the class-V family of pyridoxal phosphate-dependent aminotransferases. Two transcript variants encoding two different isoforms have been found for this gene.
22698	
22699	NFE2 (MIM 601490), NFE2L1 (MIM 163260), and NFE2L2 comprise a family of human genes encoding basic leucine zipper (bZIP) transcription factors. They share highly conserved regions that are distinct from other bZIP families, such as JUN (MIM 165160) and FOS (MIM 164810), although remaining regions have diverged considerably from each other (Chan et al., 1995).[supplied by OMIM]
22700	This gene encodes a member of the sodium bicarbonate cotransporter (NBC) family, part of the bicarbonate transporter superfamily. Sodium bicarbonate cotransporters are involved in intracellular pH regulation and electroneural or electrogenic sodium bicarbonate transport. This protein is thought to be an integral membrane protein. The gene expresses at least four alternatively spliced transcript variants.
22701	
22702	
22703	
22704	
22705	
22706	
22707	
22708	
22709	
22710	This gene encodes a protein containing an RNA-binding motif in the N-terminus and four SRGY (serine, arginine, glycine, tyrosine) boxes in the C-terminus. Multiple copies of this gene are found in the AZFb azoospermia factor region of chromosome Y and the encoded protein is thought to be involved in spermatogenesis. Most copies of this locus are pseudogenes, although six highly similar copies have full-length ORFs and are considered functional. Four functional copies of this gene are found within inverted repeat IR2; two functional copies of this gene are found in palindrome P3, along with two copies of PTPN13-like, Y-linked.
22711	
22712	
22713	
22714	
22715	
22716	
22717	
22718	
22719	
22720	
22721	This gene encodes a neuron-restricted protein that contains a CRAL-TRIO motif common to proteins that bind small lipophilic molecules. Mutations in this gene are associated with cerebellar ataxia, Cayman type.
22722	
22723	
22724	
22725	
22726	
22727	
22728	
22729	
22730	
22731	
22732	
22733	
22734	
22735	
22736	
22737	
22738	
22739	
22740	
22741	
22742	This gene encodes a protein that was identified in a screen for genes expressed in metastatic cells, specifically, mammary adenocarcinoma cell lines. Expression of this gene has been correlated with the metastatic potential of at least two types of carcinomas although it is also expressed in many normal tissues. The role it plays in metastasis is unclear. It was initially thought to be the 70kD component of a nucleosome remodeling deacetylase complex, NuRD, but it is more likely that this component is a different but very similar protein. These two proteins are so closely related, though, that they share the same types of domains. These domains include two DNA binding domains, a dimerization domain, and a domain commonly found in proteins that methylate DNA. The profile and activity of this gene product suggest that it is involved in regulating transcription and that this may be accomplished by chromatin remodeling.
22743	Endoglin is a homodimeric transmembrane glycoprotein highly expressed by endothelial cells. It is a component of the transforming growth factor beta receptor complex as it binds TGFB1 and TGFB3 with high affinity. Mutations in the endoglin gene produce hereditary hemorrhagic telangiectasia.
22744	
22745	
22746	
22747	
22748	
22749	The heterotetrameric adaptor protein (AP) complexes sort integral membrane proteins at various stages of the endocytic and secretory pathways. AP4 is composed of 2 large chains, beta-4 (AP4B1; MIM 607245) and epsilon-4 (AP4E1; MIM 607244), a medium chain, mu-4 (AP4M1; MIM 602296), and a small chain, sigma-4 (AP4S1).[supplied by OMIM]
22750	
22751	The protein encoded by this gene is a member of the peptidyl-prolyl cis-trans isomerase (PPIase) family. PPIases catalyze the cis-trans isomerization of proline imidic peptide bonds in oligopeptides and accelerate the folding of proteins. This protein is a specific component of the complex that includes pre-mRNA processing factors PRPF3, PRPF4, and PRPF18, as well as U4/U5/U6 tri-snRNP. This protein has been shown to possess PPIase activity and may act as a protein chaperone that mediates the interactions between different proteins inside the spliceosome.
22752	
22753	
22754	
22755	
22756	The protein encoded by this gene is a peripheral membrane protein that is a component of tight junctions or TJs. TJs form an apical junctional structure and act to control paracellular permeability and maintain cell polarity. This protein is related to angiomotin, an angiostatin binding protein that regulates endothelial cell migration and capillary formation.
22757	Tryptophan hydroxylase (TPH; EC 1.14.16.4) is the rate-limiting enzyme in the synthesis of serotonin (5-hydroxytryptamine, or 5HT). 5HT is causally involved in numerous central nervous activities, and it has several functions in peripheral tissues, including the maintenance of vascular tone and gut motility.[supplied by OMIM]
22758	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This gene was identified as an oncogene, which confers transforming potential when transfected into mammalian cells. Targeted disruption of the homolog of this gene in mouse resulted in the phenotype of abnormally long hair, which suggested a function as an inhibitor of hair elongation. Two alternatively spliced variants encoding different isoforms have been described.
22759	
22760	
22761	
22762	
22763	
22764	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. The protein encoded by this gene and SEC62 protein are found to be associated with ribosome-free SEC61 complex. It is speculated that Sec61-Sec62-Sec63 may perform post-translational protein translocation into the ER. The Sec61-Sec62-Sec63 complex might also perform the backward transport of ER proteins that are subject to the ubiquitin-proteasome-dependent degradation pathway. The encoded protein is an integral membrane protein located in the rough ER.
22765	
22766	
22767	
22768	
22769	HLA-DRB1 belongs to the HLA class II beta chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DRA) and a beta chain (DRB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa. It is encoded by 6 exons, exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DR molecule the beta chain contains all the polymorphisms specifying the peptide binding specificities. Hundreds of DRB1 alleles have been described and typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. DRB1 is expressed at a level five times higher than its paralogues DRB3, DRB4 and DRB5. DRB1 is present in all individuals. Allelic variants of DRB1 are linked with either none or one of the genes DRB3, DRB4 and DRB5. There are 4 related pseudogenes: DRB2, DRB6, DRB7, DRB8 and DRB9.
22770	
22771	This gene encodes a fibrillar collagen that is found in extensible connective tissues such as skin, lung, and the vascular system, frequently in association with type I collagen. Mutations in this gene are associated with Ehlers-Danlos syndrome type IV, and with aortic and arterial aneurysms. Although alternate transcripts have been detected for this gene, they are the result of mutations; these mutations alter splicing, often leading to the exclusion of multiple exons.
22772	
22773	The crooked neck (crn) gene of Drosophila is essential for embryogenesis and is thought to be involved in cell cycle progression and pre-mRNA splicing. This gene is similar in sequence to crn and encodes a protein which can localize to pre-mRNA splicing complexes in the nucleus. The encoded protein, which contains many tetratricopeptide repeats, is required for pre-mRNA splicing.
22774	Exonucleolytic degradation of the poly(A) tail is often the first step in the decay of eukaryotic mRNAs.  The amino acid sequence of poly(A)-specific ribonuclease shows homology to the RNase D family of 3@#$%&-exonucleases. The protein appears to be localized in both the nucleus and the cytoplasm. It is not stably associated with polysomes or ribosomal subunits.
22775	
22776	
22777	
22778	
22779	
22780	
22781	
22782	This gene encodes a member of the alpha-2/delta subunit family, a protein in the voltage-dependent calcium channel complex. Calcium channels mediate the influx of calcium ions into the cell upon membrane polarization and consist of a complex of alpha-1, alpha-2/delta, beta, and gamma subunits in a 1:1:1:1 ratio. Various versions of each of these subunits exist, either expressed from similar genes or the result of alternative splicing. Research on a highly similar protein in rabbit suggests the protein described in this record is cleaved into alpha-2 and delta subunits. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
22783	
22784	
22785	
22786	
22787	
22788	Pre-mRNA splicing occurs in two sequential transesterification steps, and the protein encoded by this gene is thought to be involved in pre-mRNA splicing and in signal transduction. This protein belongs to a kinase family that includes serine/arginine-rich protein-specific kinases and cyclin-dependent kinases (CDKs). This protein is regarded as a CDK-like kinase (Clk) with homology to mitogen-activated protein kinases (MAPKs). Sequence analysis identified two transcript variants that encode the same protein.
22789	Cell signaling pathways rely on a dynamic interaction between activating and inhibiting processes. SHP-1-mediated dephosphorylation of protein tyrosine residues is central to the regulation of several cell signaling pathways. Two types of inhibitory receptor superfamily members are immunoreceptor tyrosine-based inhibitory motif (ITIM)-bearing receptors and their non-ITIM-bearing, activating counterparts. Control of cell signaling via SHP-1 is thought to occur through a balance between PILRalpha-mediated inhibition and PILRbeta-mediated activation. These paired immunoglobulin-like receptor genes are located in a tandem head-to-tail orientation on chromosome 7. This particular gene encodes the non-ITIM-bearing member of the receptor pair, which has a truncated cytoplasmic tail relative to its ITIM-bearing partner and functions in the activating role. Alternative splicing has been observed at this locus and three variants, encoding two distinct isoforms, are described. Additional transcript variants have been identified but their full-length nature has not been determined.
22790	
22791	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L29P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
22792	The protein encoded by this gene is a glutathione-dependent prostaglandin E synthase. The expression of this gene has been shown to be induced by proinflammatory cytokine interleukin 1 beta (IL1B). Its expression can also be induced by tumor suppressor protein TP53, and may be involved in TP53 induced apoptosis. Knockout studies in mice suggest that this gene may contribute to the pathogenesis of collagen-induced arthritis and mediate acute pain during inflammatory responses. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
22793	
22794	
22795	
22796	Zinc finger proteins, such as ZNF385, are regulatory proteins that act as transcription factors, bind single- or double-stranded RNA, or interact with other proteins (Sharma et al., 2004).[supplied by OMIM]
22797	
22798	
22799	
22800	
22801	X-linked mental retardation associated with marXq28, or fragile X syndrome, is characterized by moderate to severe mental retardation, macroorchidism, large ears, prominent jaw, and high-pitched jocular speech. Expression is variable, with mental retardation being the most common feature. This phenotype is associated with mutations in the FMR1 gene. McCabe et al. (1999) [PubMed 10398250] summarized the proceedings of a workshop on the fragile X syndrome held in December 1998.[supplied by OMIM]
22802	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes a protein which is not a structural subunit, but may be a heme A biosynthetic enzyme involved in COX formation, according to the yeast mutant studies. However, the studies in Rhodobacter sphaeroides suggest that this gene is not required for heme A biosynthesis, but required for stable formation of the Cu(B) and magnesium centers of COX. This human protein is predicted to contain a transmembrane domain localized in the mitochondrial inner membrane. A pseudogene COX11P has been found on chromosome 6.
22803	
22804	
22805	
22806	
22807	This gene encodes a member of a subfamily of the adenosine deaminase protein family. The encoded protein may act as a growth factor and have adenosine deaminase activity. It may be responsible for some of the phenotypic features associated with cat eye syndrome. Two transcript variants encoding distinct isoforms have been identified for this gene.
22808	
22809	
22810	
22811	Apelin is a neuropeptide expressed in the supraoptic and paraventricular nuclei that acts on specific receptors located on vasopressinergic neurons (De Mota et al., 2004).[supplied by OMIM]
22812	
22813	
22814	
22815	
22816	
22817	
22818	The protein encoded by this gene belongs to the family of P-type primary ion transport ATPases characterized by the formation of an aspartyl phosphate intermediate during the reaction cycle. These enzymes remove bivalent calcium ions from eukaryotic cells against very large concentration gradients and play a critical role in intracellular calcium homeostasis. The mammalian plasma membrane calcium ATPase isoforms are encoded by at least four separate genes and the diversity of these enzymes is further increased by alternative splicing of transcripts. The expression of different isoforms and splice variants is regulated in a developmental, tissue- and cell type-specific manner, suggesting that these pumps are functionally adapted to the physiological needs of particular cells and tissues. This gene encodes the plasma membrane calcium ATPase isoform 4. Alternatively spliced transcript variants encoding different isoforms have been identified.
22819	The protein encoded by this gene is one of the three catalytic subunits of protein phosphatase 1 (PP1). PP1 is a serine/threonine specific protein phosphatase known to be involved in the regulation of a variety of cellular processes, such as cell division, glycogen metabolism, muscle contractility, protein synthesis, and HIV-1 viral transcription. Mouse studies suggest that PP1 functions as a suppressor of learning and memory. At least three alternatively spliced transcript variants encoding two distinct isoforms have been observed.
22820	This gene belongs to the superfamily of acid-sensing ion channels, which are proton-gated, amiloride-sensitive sodium channels. These channels have been implicated in synaptic transmission, pain perception as well as mechanoperception. This gene is predominantly expressed in the pituitary gland, and was considered a candidate for paroxysmal dystonic choreoathetosis (PDC), a movement disorder, however, no correlation was found between mutations in this gene and PDC. Alternative splicing at this locus results in two transcript variants encoding different isoforms.
22821	
22822	The product encoded by this gene is a member of a RNA-binding protein family and possesses nucleolytic activity against cytotoxic lymphocyte (CTL) target cells. It has been suggested that this protein may be involved in the induction of apoptosis as it preferentially recognizes poly(A) homopolymers and induces DNA fragmentation in CTL targets. The major granule-associated species is a 15-kDa protein that is thought to be derived from the carboxyl terminus of the 40-kDa product by proteolytic processing. Alternative splicing resulting in different isoforms of this gene product has been described in the literature.
22823	
22824	
22825	
22826	
22827	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
22828	Synaptopodin is an actin-associated protein that may play a role in actin-based cell shape and motility. The name synaptopodin derives from the protein@#$%&s associations with postsynaptic densities and dendritic spines and with renal podocytes (Mundel et al., 1997).[supplied by OMIM]
22829	
22830	
22831	
22832	The protein encoded by this gene is a member of the MAP kinase family. MAP kinases act as an integration point for multiple biochemical signals, and are involved in a wide variety of cellular processes such as proliferation, differentiation, transcription regulation and development. This kinase is closely related to p38 MAP kinase, both of which can be activated by proinflammatory cytokines and cellular stress. MAP kinase kinases 3, and 6 can phosphorylate and activate this kinase. Transcription factor ATF2, and microtubule dynamics regulator stathmin have been shown to be the substrates of this kinase.
22833	
22834	Bone morphogenic proteins (see BMP2, MIM 112261) are important for body patterning and morphogenesis. PRDC is one of several BMP antagonists that regulate the functions of BMPs during embryonic development and tissue differentiation (Sudo et al., 2004).[supplied by OMIM]
22835	The protein encoded by this gene is a subunit of the multisubunit NADH:ubiquinone oxidoreductase (complex I). Mammalian complex I is composed of 45 different subunits. This protein has NADH dehydrogenase activity and oxidoreductase activity. It plays a important role in transfering electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone. Hydropathy analysis revealed that this subunit and 4 other subunits have an overall hydrophilic pattern, even though they are found within the hydrophobic protein (HP) fraction of complex I.
22836	This gene is predominantly expressed in prostate tissue, and is found to be upregulated in multiple cancer cell lines. The gene product is predicted to be a six-transmembrane protein, and was shown to be a cell surface antigen significantly expressed at cell-cell junctions.
22837	
22838	Ubiquitin-conjugating enzyme E2 variant proteins constitute a distinct subfamily within the E2 protein family. They have sequence similarity to other ubiquitin-conjugating enzymes but lack the conserved cysteine residue that is critical for the catalytic activity of E2s. The protein encoded by this gene also shares homology with ubiquitin-conjugating enzyme E2 variant 1 and yeast MMS2 gene product. It may be involved in the differentiation of monocytes and enterocytes.
22839	
22840	
22841	This gene product shares extensive sequence similarity with the rat neuronal olfactomedin-related ER localized protein. While the exact function of the encoded protein is not known, its abundant expression in brain suggests that it may have an essential role in nerve tissue. Several alternatively spliced transcripts encoding different isoforms have been found for this gene.
22842	The protein encoded by this gene is one of two human homologs of Saccharomyces cerevisiae Rad23, a protein involved in the nucleotide excision repair (NER). This protein was found to be a component of the protein complex that specifically complements the NER defect of xeroderma pigmentosum group C (XP-c) cell extracts in vitro. This protein was also shown to interact with, and elevate the nucleotide excision activity of 3-methyladenine-DNA glycosylase (MPG), which suggested a role in DNA damage recognition in base excision repair. This protein contains an N-terminal ubiquitin-like domain, which was reported to interact with 26S proteasome, and thus this protein may be involved in the ubiquitin mediated proteolytic pathway in cells.
22843	This gene encodes a component of prolyl 4-hydroxylase, a key enzyme in collagen synthesis composed of two identical alpha subunits and two beta subunits. The encoded protein is one of several different types of alpha subunits and provides the major part of the catalytic site of the active enzyme. In collagen and related proteins, prolyl 4-hydroxylase catalyzes the formation of 4-hydroxyproline that is essential to the proper three-dimensional folding of newly synthesized procollagen chains. Alternatively spliced transcript variants encoding different isoforms have been described.
22844	LOXL4 is a member of the lysyl oxidase (LOX; MIM 153455) family of amine oxidases that catalyze the final enzymatic step in the lysine-derived cross-links of collagen fibrils and insoluble elastic fibers within the extracellular matrix. These enzymes may also contribute to developmental regulation, senescence, tumor suppression, cell growth control, and chemotaxis.[supplied by OMIM]
22845	
22846	This gene is a member of the myotubularin family and encodes a putative tyrosine phosphatase. Mutations in this gene are a cause of Charcot-Marie-Tooth disease type 4B, an autosomal recessive demyelinating neuropathy. Multiple alternatively spliced transcript variants have been found, but the biological validity of some variants has not been determined.
22847	Transduction of extracellular matrix signals through integrins influences intracellular and extracellular functions, and appears to require interaction of integrin cytoplasmic domains with cellular proteins. Integrin-linked kinase (ILK), interacts with the cytoplasmic domain of beta-1 integrin. This gene encodes a serine/threonine protein kinase with 4 ankyrin-like repeats, which associates with the cytoplasmic domain of beta integrins and acts as a proximal receptor kinase regulating integrin-mediated signal transduction. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene.
22848	
22849	
22850	
22851	
22852	
22853	
22854	The function of this protein has not yet been defined; however, this protein may play a role which is similar to or complementary to other T-cell leukemia/lymphoma proteins during early embrogenesis. The association of this gene with T-cell leukemia chromosome translocations implicates this gene as a candidate for leukemogenesis. Complex alternative splicing of this gene results in multiple transcript variants, six of which have been fully described. Among these described variants, four different open reading frames have been identified; three of the six variants code for an identical product.
22855	
22856	
22857	Sialophorin (leukosialin) is a major sialoglycoprotein on the surface of human T lymphocytes, monocytes, granulocytes, and some B lymphocytes, which appears to be important for immune function and may be part of a physiologic ligand-receptor complex involved in T-cell activation.[supplied by OMIM]
22858	
22859	The protein encoded by this gene binds the epidermal growth factor receptor and contains one SH2 domain and two SH3 domains. Its two SH3 domains direct complex formation with proline-rich regions of other proteins, and its SH2 domain binds tyrosine phosphorylated sequences. This gene is similar to the Sem5 gene of C.elegans, which is involved in the signal transduction pathway. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
22860	
22861	
22862	This nucleolar protein was first characterized because it was an autoantigen in cases on interstitial cystitis. The protein, with a predicted molecular weight of 50 kDa, appears to be localized in the particulate compartment of the interphase nucleolus, with a distribution distinct from that of nucleolar protein B23. During mitosis it is associated with chromosomes.
22863	The protein encoded by this gene is part of a nuclear pore complex, Nup107-160. This protein contains WD repeats and shares 34% amino acid identity with yeast Seh1 and 30% identity with yeast Sec13. All constituents of the Nup107-160 complex, including this protein, specifically localize to kinetochores in mitosis. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
22864	
22865	
22866	DNA (cytosine-5-)-methyltransferase 1 has a role in the establishment and regulation of tissue-specific patterns of methylated cytosine residues. Aberrant methylation patterns are associated with certain human tumors and developmental abnormalities.
22867	cAMP is a signaling molecule important for a variety of cellular functions. cAMP exerts its effects by activating the cAMP-dependent protein kinase (AMPK), which transduces the signal through phosphorylation of different target proteins. The inactive holoenzyme of AMPK is a tetramer composed of two regulatory and two catalytic subunits. cAMP causes the dissociation of the inactive holoenzyme into a dimer of regulatory subunits bound to four cAMP and two free monomeric catalytic subunits. Four different regulatory subunits and three catalytic subunits of AMPK have been identified in humans. The protein encoded by this gene is a member of the Ser/Thr protein kinase family and is a catalytic subunit of AMPK. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
22868	The protein encoded by this gene is a member of the RAD51 protein family. RAD51 family members are highly similar to bacterial RecA and Saccharomyces cerevisiae Rad51, which are known to be involved in the homologous recombination and repair of DNA. This protein forms a complex with several other members of the RAD51 family, including RAD51L1, RAD51L2, and XRCC2. The protein complex formed with this protein has been shown to catalyze homologous pairing between single- and double-stranded DNA, and is thought to play a role in the early stage of recombinational repair of DNA. Multiple alternatively spliced transcript variants, which encode at least four distinct proteins, have been reported. Transcript variants utilizing alternative polyA sites exist.
22869	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Most members contain an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain, although some members contain only the sterol-binding domain. This encoded protein contains only the sterol-binding domain. In vitro studies have shown that the encoded protein can bind strongly to phosphatic acid and weakly to phosphatidylinositol 3-phosphate, but cannot bind to 25-hydroxycholesterol. The protein associates with the Golgi apparatus. Transcript variants encoding different isoforms have been described.
22870	
22871	
22872	
22873	NAGA encodes the lysosomal enzyme alpha-N-acetylgalactosaminidase, which cleaves alpha-N-acetylgalactosaminyl moieties from glycoconjugates. Mutations in NAGA have been identified as the cause of  Schindler disease types I and II (type II also known as Kanzaki disease).
22874	This gene encodes a member of the Rab family of proteins. Rab proteins are small GTPases that are involved in vesicular trafficking.
22875	The DOCK2 gene encodes a hematopoietic cell-specific CDM family protein that is indispensable for lymphocyte chemotaxis.[supplied by OMIM]
22876	
22877	
22878	
22879	
22880	
22881	
22882	
22883	
22884	
22885	
22886	
22887	
22888	
22889	
22890	
22891	
22892	
22893	
22894	This gene encodes the puromycin-sensitive aminopeptidase, a zinc metallopeptidase which hydrolyzes amino acids from the N-terminus of its substrate. The protein has been localized to both the cytoplasm and to cellular membranes. This enzyme degrades enkaphalins in the brain, and studies in mouse suggest that it is involved in proteolytic events regulating the cell cycle.
22895	
22896	
22897	
22898	
22899	
22900	
22901	
22902	
22903	
22904	
22905	
22906	
22907	
22908	
22909	
22910	The protein encoded by this gene is a regulator of the pancreatic beta-cell function. It is highly similar to JIP-1, a mouse protein known to be a regulator of c-Jun amino-terminal kinase (Mapk8). This protein has been shown to prevent MAPK8 mediated activation of transcription factors, and decrease IL-1 beta and MAP kinase kinase 1 (MEKK1) induced apoptosis in pancreatic beta cells. This protein also functions as a DNA-binding transactivator of the glucose transporter GLUT2. RE1-silencing transcription factor (REST) is reported to repress the expression of this gene in insulin-secreting beta cells. This gene is found to be mutated in a type 2 diabetes family, and thus is thought to be a susceptibility gene for type 2 diabetes.
22911	
22912	
22913	
22914	
22915	
22916	
22917	
22918	
22919	
22920	
22921	
22922	
22923	
22924	
22925	Chromosomal protein HMG14 and its close analog HMG17 (MIM 163910) bind to the inner side of the nucleosomal DNA, potentially altering the interaction between the DNA and the histone octamer. The 2 proteins may be involved in the process that maintains transcribable genes in a unique chromatin conformation. Their ubiquitous distribution and relative abundance, as well as the high evolutionary conservation of the DNA-binding domain of the HMG14 family of proteins, suggest that they may be involved in an important cellular function.[supplied by OMIM]
22926	
22927	
22928	
22929	
22930	
22931	
22932	
22933	
22934	
22935	
22936	
22937	
22938	
22939	
22940	
22941	Members of the F-box protein family, such as FBXO33, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
22942	
22943	
22944	
22945	
22946	
22947	
22948	The protein encoded by this gene is a member of the keratin gene family. As a type II hair keratin, it is a basic protein which heterodimerizes with type I keratins to form hair and nails. The type II hair keratins are clustered in a region of chromosome 12q13 and are grouped into two distinct subfamilies based on structure similarity. One subfamily, consisting of KRTHB1, KRTHB3, and KRTHB6, is highly related. The other less-related subfamily includes KRTHB2, KRTHB4, and KRTHB5. All hair keratins are expressed in the hair follicle; this hair keratin, as well as KRTHB3 and KRTHB6, is found primarily in the hair cortex. Mutations in this gene and KRTHB6 have been observed in patients with a rare dominant hair disease, monilethrix.
22949	
22950	
22951	
22952	PLCD3 belongs to a large family of enzymes that catalyze the hydrolysis of phosphatidylinositol 4,5-bisphosphate (PtdIns(4,5)P2) to generate the second messengers Ins(1,4,5)P3 and diacylglycerol, which mediate a wide variety of cellular stimuli.[supplied by OMIM]
22953	
22954	
22955	The protein encoded by this gene interacts with the transcription factor myocardin, a key regulator of smooth muscle cell differentiation. The encoded protein is predominantly nuclear and may help transduce signals from the cytoskeleton to the nucleus. This gene is involved in a specific translocation event that creates a fusion of this gene and the RNA-binding motif protein-15 gene. This translocation has been associated with acute megakaryocytic leukemia.
22956	
22957	
22958	
22959	
22960	Dihydropyrimidine dehydrogenase  (DPYD) is a pyrimidine  catabolic enzyme and the initial and rate-limiting step in the pathway of  uracil and thymidine catabolism and also in the pathway leading to the  formation of beta-alanine. The DPYD gene is a large gene of approximately 150  kb consisting of 23 exons encoding a protein of approximately 111-kDa. Genetic  deficiency of DPYD enzyme results in an error in pyrimidine metabolism  associated with thymine-uraciluria and an increased risk of toxicity in  cancer patients receiving 5-flourouracil chemotherapy.
22961	The Makorin ring finger protein-1 gene (MKRN1) is a highly transcribed, intron-containing source for a family of intronless mammalian genes encoding a novel class of zinc finger proteins. Phylogenetic analyses indicate that the MKRN1 gene is the ancestral founder of this gene family.[supplied by OMIM]
22962	
22963	
22964	Janus kinase 1 (JAK1), is a member of a new class of protein-tyrosine kinases (PTK) characterized by the presence of a second phosphotransferase-related domain immediately N-terminal to the PTK domain. The second phosphotransferase domain bears all the hallmarks of a protein kinase, although its structure differs significantly from that of the PTK and threonine/serine kinase family members. JAK1 is a large, widely expressed membrane-associated phosphoprotein.  JAK1 is involved in the interferon-alpha/beta and -gamma signal transduction pathways. The reciprocal interdependence between JAK1 and TYK2 activities in the interferon-alpha pathway, and between JAK1 and JAK2 in the interferon-gamma pathway, may reflect a requirement for these kinases in the correct assembly of interferon receptor complexes. These kinases couple cytokine ligand binding to tyrosine phosphorylation of various known signaling proteins and of a unique family of transcription factors termed the signal transducers and activators of transcription, or STATs.
22965	
22966	
22967	
22968	
22969	
22970	Huntington@#$%&s disease (HD), a neurodegenerative disorder characterized by loss of striatal neurons, is caused by an expansion of a polyglutamine tract in the HD protein huntingtin. This gene encodes a protein that interacts with the huntingtin-associated protein 1, which is a huntingtin binding protein that may function in vesicle trafficking. An alternatively spliced variant has been described but its full-length sequence has not been determined.
22971	
22972	
22973	
22974	
22975	PEA15 is a death effector domain (DED)-containing protein predominantly expressed in the central nervous system, particularly in astrocytes.[supplied by OMIM]
22976	
22977	This gene encodes a C2H2 type zinc-finger protein by its similarity to the mouse Bcl11a/Evi9 protein. The corresponding mouse gene is a common site of retroviral integration in myeloid leukemia, and may function as a leukemia disease gene, in part, through its interaction with BCL6. During hematopoietic cell differentiation, this gene is down-regulated. It is possibly involved in lymphoma pathogenesis since translocations associated with B-cell malignancies also deregulates its expression. Five alternatively spliced transcript variants of this gene, which encode distinct isoforms, have been reported.
22978	
22979	CD4 (MIM 186940)-positive T helper lymphocytes can be divided into types 1 (Th1) and 2 (Th2) on the basis of their cytokine secretion patterns. Th1 cells and their associated cytokines are involved in cell-mediated immunity to intracellular pathogens and delayed-type hypersensitivity reactions, whereas Th2 cells are involved in the control of extracellular helminthic infections and the promotion of atopic and allergic diseases. The 2 types of cells also cross-regulate the functions of the other. TIM3 is a Th1-specific cell surface protein that regulates macrophage activation and enhances the severity of experimental autoimmune encephalomyelitis in mice.[supplied by OMIM]
22980	
22981	
22982	
22983	
22984	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S7P family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
22985	
22986	
22987	The protein encoded this gene contains two RING-finger motifs and an IBR (in between RING fingers) motif. This protein is an E3 ubiquintin ligase that is localized in Lewy bodies (LBs), a characteristic neuronal inclusion in Parkinson@#$%&s disease (PD) brains. This protein interacts with UBE2L3/UBCH7 and UBE2E2/UBCH8, but not other ubiquitin-conjugating enzymes. This protein is found to bind and ubiquitylate synphilin 1 (SNCAIP), which is a interacting protein of alpha synuclein in neurons, and a major component of LB. Alternatively spliced transcript variants encoding the same protein have been reported.
22988	The correlation of anti-sperm antibodies with cases of unexplained infertility implicates a role for these antibodies in blocking fertilization. Improved diagnosis and treatment of immunologic infertility, as well as identification of proteins for targeted contraception, are dependent on the identification and characterization of relevant sperm antigens. The protein encoded by this gene is recognized by sperm agglutinating antibodies from an infertile woman. This protein is localized in germ cells of the testis at all stages of spermatogenesis and is localized to the acrosomal region of mature spermatozoa. Alternatively spliced variants that encode different protein isoforms have been described but the full-length sequences of only two have been determined.
22989	The protein encoded by this gene is one of the highly conserved mini-chromosome maintenance proteins (MCM) that are involved in the initiation of eukaryotic genome replication. The hexameric protein complex formed by MCM proteins is a key component of the pre-replication complex (pre-RC) and it may be involved in the formation of replication forks and in the recruitment of other DNA replication related proteins. This protein can interact with MCM2 and MCM6, as well as with the origin recognition protein ORC2. It is regulated by proteolysis and phosphorylation in a cell cycle-dependent manner. Studies of a similar protein in Xenopus suggest that the chromatin binding of this protein at the onset of DNA replication is after pre-RC assembly and before origin unwinding. Alternatively spliced transcript variants encoding distinct isoforms have been identified.
22990	For background information on chemokines, see CXCL11 (SCYB11; MIM 604852). Stromal cell-derived factors 1-alpha and 1-beta are small cytokines that belong to the intercrine family, members of which activate leukocytes and are often induced by proinflammatory stimuli such as lipopolysaccharide, TNF (see MIM 191160), or IL1 (see MIM 147760). The intercrines are characterized by the presence of 4 conserved cysteines which form 2 disulfide bonds. They can be classified into 2 subfamilies. In the CC subfamily, which includes beta chemokine, the cysteine residues are adjacent to each other. In the CXC subfamily, which includes alpha chemokine, they are separated by an intervening amino acid. The SDF1 proteins belong to the latter group.[supplied by OMIM]
22991	
22992	
22993	
22994	
22995	
22996	
22997	
22998	
22999	
23000	
23001	
23002	
23003	
23004	
23005	
23006	
23007	
23008	The protein encoded by this gene belongs to the class II cytokine receptor family, and has been shown to be a receptor for interleukin 22 (IL22). IL22 receptor is a protein complex that consists of this protein and interleukin 10 receptor, beta (IL10BR/CRFB4), a subunit also shared by the receptor complex for interleukin 10 (IL10). This gene and interleukin 28 receptor, alpha (IL28RA) form a cytokine receptor gene cluster in the chromosomal region 1p36.
23009	The protein encoded by this gene is involved in both the synthesis and degradation of fructose-2,6-bisphosphate, a regulatory molecule that controls glycolysis in eukaryotes. The encoded protein has a 6-phosphofructo-2-kinase activity that catalyzes the synthesis of fructose-2,6-bisphosphate, and a fructose-2,6-biphosphatase activity that catalyzes the degradation of fructose-2,6-bisphosphate. This protein regulates fructose-2,6-bisphosphate levels in the heart, while a related enzyme encoded by a different gene regulates fructose-2,6-bisphosphate levels in the liver and muscle. This enzyme functions as a homodimer. Two transcript variants encoding two different isoforms have been found for this gene.
23010	
23011	
23012	This gene encodes a ssDNA binding protein that activates the far upstream element (FUSE) of c-myc and stimulates expression of c-myc in undifferentiated cells. Regulation of FUSE by FUBP occurs through single-strand binding of FUBP to the non-coding strand. This protein has been shown to function as an ATP-dependent DNA helicase.
23013	
23014	Centromeres are the differentiated chromosomal domains that specify the mitotic behavior of chromosomes.  CENPA encodes a centromere protein which contains a histone H3 related histone fold domain that is required for targeting to the centromere.  CENPA is proposed to be a component of a modified nucleosome or nucleosome-like structure in which it replaces 1 or both copies of conventional histone H3 in the (H3-H4)2 tetrameric core of the nucleosome particle.
23015	
23016	
23017	
23018	
23019	Glucosidase I is the first enzyme in the N-linked oligosaccharide processing pathway. GCS1 cleaves the distal alpha-1,2-linked glucose residue from the Glc(3)-Man(9)-GlcNAc(2) oligosaccharide precursor.  GCS1 is located in the lumen of the endoplasmic reticulum.
23020	MAD1L1 is a component of the mitotic spindle-assembly checkpoint that prevents the onset of anaphase until all chromosome are properly aligned at the metaphase plate. MAD1L1 functions as a homodimer and interacts with MAD2L1. MAD1L1 may play a role in cell cycle control and tumor suppression. Three transcript variants encoding the same protein have been found for this gene.
23021	
23022	This gene encodes the enzyme responsible for the excision of 8-oxoguanine, a mutagenic base byproduct which occurs as a result of exposure to reactive oxygen.  The action of this enzyme includes lyase activity for chain cleavage.  Alternative splicing of the C-terminal region of this gene classifies splice variants into two major groups, type 1 and type 2, depending on the last exon of the sequence.  Type 1 alternative splice variants end with exon 7 and type 2 end with exon 8.  All variants share the N-terminal region in common.  Many alternative splice variants for this gene have been described, but the full-length nature for every variant has not been determined.  The N-terminus of this gene contains a mitochondrial targetting signal, essential for mitochondrial localization.
23023	
23024	Crystallins are separated into two classes: taxon-specific, or enzyme, and ubiquitous. The latter class constitutes the major proteins of vertebrate eye lens and maintains the transparency and refractive index of the lens. Since lens central fiber cells lose their nuclei during development, these crystallins are made and then retained throughout life, making them extremely stable proteins. Mammalian lens crystallins are divided into alpha, beta, and gamma families; beta and gamma crystallins are also considered as a superfamily. Alpha and beta families are further divided into acidic and basic groups. Seven protein regions exist in crystallins: four homologous motifs, a connecting peptide, and N- and C-terminal extensions. Gamma-crystallins are a homogeneous group of highly symmetrical, monomeric proteins typically lacking connecting peptides and terminal extensions. They are differentially regulated after early development. This gene encodes a protein initially considered to be a beta-crystallin but the encoded protein is monomeric and has greater sequence similarity to other gamma-crystallins. This gene encodes the most significant gamma-crystallin in adult eye lens tissue. Whether due to aging or mutations in specific genes, gamma-crystallins have been involved in cataract formation.
23025	
23026	The product of this gene belongs to the ALG3 family. This enzyme adds the first Dol-P-Man derived mannose in an alpha 1,3 linkage to Man5GlcNAc2-PP-Dol. Defects in this gene are the cause of congenital disorder of glycosylation type Id (CDG-Id). This disease is biochemically characterized by abnormal N-glycosylation. Alternatively spliced transcript variants encoding different isoforms have been identified.
23027	
23028	DISCONTINUED: The record for PBM1 has been withdrawn by HGNC.
23029	
23030	
23031	Short-chain L-3-hydroxyacyl-CoA dehydrogenase (SCHAD; EC 1.1.1.35) plays an essential role in the mitochondrial beta-oxidation of short chain fatty acids. It catalyzes the reversible dehydrogenation of 3-hydroxyacyl-CoAs to their corresponding 3-ketoacyl-CoAs with concomitant reduction of NAD to NADH and exerts it highest activity toward 3-hydroxybutyryl-CoA.[supplied by OMIM]
23032	
23033	
23034	The function of gene is not known, but it encodes an osmosis responsive factor. Identification of a conserved mouse homolog of similar size and representation in the EST database indicates that this transcript does represent a gene.
23035	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the histone microcluster on chromosome 6p21.33.
23036	
23037	This gene was identified on the basis of its stimulation by c-Myc protein. The latter is a transcription factor that participates in the regulation of cell proliferation, differentiation, and apoptosis. The exact function of this gene is not known but studies in rat suggest a role in cellular proliferation and c-Myc-mediated transformation. Two alternative transcripts encoding different proteins have been described.
23038	Pre-mRNA splicing occurs in two sequential transesterification steps. The protein encoded by this gene is found to be essential for the catalytic step II in pre-mRNA splicing process. It is found in the spliceosome, and contains seven WD repeats, which function in protein-protein interactions. This protein has a sequence similarity to yeast Prp17 protein, which functions in two different cellular processes: pre-mRNA splicing and cell cycle progression. It suggests that this protein may play a role in cell cycle progression.
23039	The protein encoded by this gene is found as a pentamer and is a major substrate for the cAMP-dependent protein kinase in cardiac muscle. The encoded protein is an inhibitor of cardiac muscle sarcoplasmic reticulum Ca(2+)-ATPase in the unphosphorylated state, but inhibition is relieved upon phosphorylation of the protein. The subsequent activation of the Ca(2+) pump leads to enhanced muscle relaxation rates, thereby contributing to the inotropic response elicited in heart by beta-agonists. The encoded protein is a key regulator of cardiac diastolic function. Mutations in this gene are a cause of inherited human dilated cardiomyopathy with refractory congestive heart failure.
23040	This gene is located at chromosome 7p13 which is close to the Williams Beuren syndrome chromosome region 7q11.23.
23041	
23042	
23043	
23044	This gene was identified by its differential expression in cancer cells. The product of this gene is predicted to be a transmembrane protein. The level of this protein was found to be lower in tumor derived cell lines compared to normal cells. This gene was thus proposed to be a candidate tumor suppressor gene.
23045	
23046	
23047	
23048	This gene encodes a member of the neurotrophic tyrosine receptor kinase (NTRK) family. This kinase is a membrane-bound receptor that, upon neurotrophin binding, phosphorylates itself and members of the MAPK pathway. Signalling through this kinase leads to cell differentiation. Mutations in this gene have been associated with obesity and mood disorders. Alternate transcriptional splice variants encoding different isoforms have been found for this gene, but only two of them have been characterized to date.
23049	The protein encoded by this gene is a cysteine-rich, extracellular protein with protease inhibitor-like domains whose expression is suppressed strongly in many tumors and cells transformed by various kinds of oncogenes. In normal cells, this membrane-anchored glycoprotein may serve as a negative regulator for matrix metalloproteinase-9, a key enzyme involved in tumor invasion and metastasis.
23050	BHC80 is a component of a BRAF35 (MIM 605535)/histone deacetylase (HDAC; see MIM 601241) complex (BHC) that mediates repression of neuron-specific genes through the cis-regulatory element known as repressor element-1 (RE1) or neural restrictive silencer (NRS) (Hakimi et al., 2002).[supplied by OMIM]
23051	
23052	
23053	This gene encodes a protein that may play a role in organelle biogenesis associated with melanosomes, platelet dense granules, and lysosomes. The encoded protein is a component of three different protein complexes termed biogenesis of lysosome-related organelles complex (BLOC)-3, BLOC4, and BLOC5. Mutations in this gene are associated with Hermansky-Pudlak syndrome type 1. Multiple transcript variants encoding distinct isoforms have been identified for this gene; the full-length sequences of some of these have not been determined yet.
23054	
23055	
23056	
23057	The muscarinic cholinergic receptors belong to a larger family of G protein-coupled receptors. The functional diversity of these receptors is defined by the binding of acetylcholine and includes cellular responses such as adenylate cyclase inhibition, phosphoinositide degeneration, and potassium channel mediation. Muscarinic receptors influence many effects of acetylcholine in the central and peripheral nervous system. The muscarinic cholinergic receptor 1 is involved in mediation of vagally-induced bronchoconstriction and in the acid secretion of the gastrointestinal tract. The gene encoding this receptor is localized to 11q13.
23058	MONDOA forms heterodimers with MLX (MIM 602976) that can bind to and activate transcription from CACGTG E boxes (Billin et al., 2000 [PubMed 11073985]).[supplied by OMIM]
23059	MONDOA forms heterodimers with MLX (MIM 602976) that can bind to and activate transcription from CACGTG E boxes (Billin et al., 2000 [PubMed 11073985]).[supplied by OMIM]
23060	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Nucleosomes consist of approximately 146 bp of DNA wrapped around a histone octamer composed of pairs of each of the four core histones (H2A, H2B, H3, and H4). The chromatin fiber is further compacted through the interaction of a linker histone, H1, with the DNA between the nucleosomes to form higher order chromatin structures. This gene encodes a replication-independent member of the histone H2A family that is distinct from other members of the family. Studies in mice have shown that this particular histone is required for embryonic development and indicate that lack of functional histone H2A leads to embryonic lethality.
23061	The protein encoded by this gene is a DNA ligase that joins single-strand breaks in a double-stranded polydeoxynucleotide in an ATP-dependent reaction. This protein is essential for V(D)J recombination and DNA double-strand break (DSB) repair through nonhomologous end joining (NHEJ). This protein forms a complex with the X-ray repair cross complementing protein 4 (XRCC4), and further interacts with the DNA-dependent protein kinase (DNA-PK). Both XRCC4 and DNA-PK are known to be required for NHEJ. The crystal structure of the complex formed by this protein and XRCC4 has been resolved. Defects in this gene are the cause of LIG4 syndrome. Alternatively spliced transcript variants encoding the same protein have been observed.
23062	The protein encoded by this gene is a DNA ligase that joins single-strand breaks in a double-stranded polydeoxynucleotide in an ATP-dependent reaction. This protein is essential for V(D)J recombination and DNA double-strand break (DSB) repair through nonhomologous end joining (NHEJ). This protein forms a complex with the X-ray repair cross complementing protein 4 (XRCC4), and further interacts with the DNA-dependent protein kinase (DNA-PK). Both XRCC4 and DNA-PK are known to be required for NHEJ. The crystal structure of the complex formed by this protein and XRCC4 has been resolved. Defects in this gene are the cause of LIG4 syndrome. Alternatively spliced transcript variants encoding the same protein have been observed.
23063	This gene encodes a protein with similarity to the estrogen receptor. Its function is unknown; however, a similar protein in mouse plays an essential role in placental development.
23064	
23065	
23066	This gene encodes a protein sharing a low level of sequence similarity with human ribosomal protein L24. Although this gene has been referred to as RPL24, L30, and 60S ribosomal protein L30 isolog in the sequence databases, it is distinct from the human genes officially named RPL24 (which itself has been referred to as ribosomal protein L30) and RPL30. The function of this gene is currently unknown. This gene utilizes alternative polyadenylation signals.
23067	
23068	
23069	
23070	
23071	
23072	
23073	This gene encodes a membrane-bound adenylate cyclase that catalyses the formation of cyclic AMP from ATP and is inhibitable by calcium. The product of this gene is a member of the adenylyl cyclase class-4/guanylyl cyclase enzyme family that is characterized by the presence of twelve membrane-spanning domains in its sequences.
23074	This gene encodes a member of the Golgi-localized, gamma adaptin ear-containing, ARF-binding (GGA) family. This family includes ubiquitous coat proteins that regulate the trafficking of proteins between the trans-Golgi network and the lysosome. These proteins share an amino-terminal VHS domain which mediates sorting of the mannose 6-phosphate receptors at the trans-Golgi network. They also contain a carboxy-terminal region with homology to the ear domain of gamma-adaptins. This family member may play a significant role in cargo molecules regulation and clathrin-coated vesicle assembly. Alternative splicing of this gene results in two transcript variants.
23075	
23076	
23077	The protein encoded by this gene contains a fork head DNA binding domain. This protein can bind to the purine-rich motifs of the HIV long terminal repeat (LTR), and to the similar purine-rich motif in the interleukin 2 (IL2) promoter. It may be involved in the regulation of viral and cellular promoter elements. Three alternatively spliced transcript variants encoding distinct isoforms have been described.
23078	
23079	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXB genes located in a cluster on chromosome 17. The exact role of this gene has yet to be determined.
23080	The peroxisome biogenesis disorders (PBDs; MIM 601539) are a group of genetically heterogeneous diseases that are usually lethal in early infancy. Although the clinical features of PBD patients vary, cells from all PBD patients exhibit a defect in the import of one or more classes of peroxisomal matrix proteins into the organelle. This cellular phenotype is shared by yeast @#$%&pex@#$%& mutants, and human orthologs of yeast PEX genes defective in some PBD complementation groups (CGs).[supplied by OMIM]
23081	This gene encodes  a member of the dual specificity protein kinase family, which functions as a mitogen-activated protein (MAP) kinase kinase. MAP kinases, also known as extracellular signal-regulated kinases (ERKs), act as an integration point for multiple biochemical signals. This protein phosphorylates and activates p38 MAP kinase in response to inflammatory cytokines or environmental stress. As an essential component of p38 MAP kinase mediated signal transduction pathway, this gene is involved in  many cellular processes such as stress induced cell cycle arrest, transcription activation and apoptosis. Two transcript variants of this gene exist.
23082	
23083	
23084	Kallikreins are a subgroup of serine proteases having diverse physiological functions. Growing evidence suggests that many kallikreins are implicated in carcinogenesis and some have potential as novel cancer and other disease biomarkers. This gene is one of the fifteen kallikrein subfamily members located in a cluster on chromosome 19. Expression of this gene is regulated by steroid hormones and may be useful as a marker for breast cancer. An additional transcript variant has been identified, but its full length sequence has not been determined.
23085	The protein encoded by this gene is a member of the intercellular adhesion molecule (ICAM) family. All ICAM proteins are type I transmembrane glycoproteins, contain 2-9 immunoglobulin-like C2-type domains, and bind to the leukocyte adhesion LFA-1 protein. This protein is constitutively and abundantly expressed by all leucocytes and may be the most important ligand for LFA-1 in the initiation of the immune response. It functions not only as an adhesion molecule, but also as a potent signalling molecule.
23086	The protein encoded by this gene is a member of the intercellular adhesion molecule (ICAM) family. All ICAM proteins are type I transmembrane glycoproteins, contain 2-9 immunoglobulin-like C2-type domains, and bind to the leukocyte adhesion LFA-1 protein. This protein is constitutively and abundantly expressed by all leucocytes and may be the most important ligand for LFA-1 in the initiation of the immune response. It functions not only as an adhesion molecule, but also as a potent signalling molecule.
23087	The protein encoded by this gene is similar in sequence to the amino terminus of Drosophila enhancer of split groucho, a protein involved in neurogenesis during embryonic development. The encoded protein, which belongs to the groucho/TLE family of proteins, can function as a homooligomer or as a heteroologimer with other family members to dominantly repress the expression of other family member genes. Three transcript variants encoding different isoforms have been found for this gene.
23088	
23089	The protein encoded by this gene is one of at least forty-one subunits that make up the NADH-ubiquinone oxidoreductase complex. This complex is part of the mitochondrial respiratory chain and serves to catalyze the rotenone-sensitive oxidation of NADH and the reduction of ubiquinone. The encoded protein is one of three proteins found in the flavoprotein fraction of the complex. The specific function of the encoded protein is unknown. Two transcript variants encoding different isoforms have been found for this gene.
23090	Catechol-O-methyltransferase catalyzes the transfer of a methyl group from S-adenosylmethionine to catecholamines, including the neurotransmitters dopamine, epinephrine, and norepinephrine.  This O-methylation results in one of the major degradative pathways of the catecholamine transmitters.  In addition to its role in the metabolism of endogenous substances, COMT is important in the metabolism of catechol drugs used in the treatment of hypertension, asthma, and Parkinson disease.  COMT is found in two forms in tissues, a soluble form (S-COMT) and a membrane-bound form (MB-COMT).  The differences between S-COMT and MB-COMT reside within the N-termini.  The transcript variants are formed through the use of alternative translation initiation sites and promoters.
23091	
23092	
23093	Peroxisome proliferators include hypolipidemic drugs, herbicides, leukotriene antagonists, and plasticizers; this term arises because they induce an increase in the size and number of peroxisomes. Peroxisomes are subcellular organelles found in plants and animals which contain enzymes for respiration and for cholesterol and lipid metabolism. The action of peroxisome proliferators is thought to be mediated via specific receptors, called PPARs, which belong to the steroid hormone receptor superfamily. PPARs affect the expression of target genes involved in cell proliferation, cell differentiation and in immune and inflammation responses. Three closely related subtypes (alpha, beta/delta, and gamma) have been identified. This gene encodes the subtype PPAR-alpha, which is a nuclear transcription factor. Multiple alternatively spliced transcript variants have been described for this gene, although the full-nature of only some have been defined.
23094	This gene encodes a member of the neurexin family which functions in the vertebrate nervous system as cell adhesion molecules and receptors. This protein, like other neurexin proteins, contains epidermal growth factor repeats and laminin G domains. In addition, it includes an F5/8 type C domain, discoidin/neuropilin- and fibrinogen-like domains, thrombospondin N-terminal-like domains and a putative PDZ binding site. This protein is localized at the juxtaparanodes of myelinated axons and associated with potassium channels. It may play a role in the local differentiation of the axon into distinct functional subdomains. This gene encompasses almost 1.5% of chromosome 7 and is one of the largest genes in the human genome. It may represent a positional candidate gene for the DFNB13 form of nonsyndromic deafness.
23095	
23096	
23097	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation. This encoded protein is found as a subunit in corepressor SMRT (silencing mediator for retinoid and thyroid receptors) complex along with histone deacetylase 3 protein. This gene is located adjacent to the ocular albinism gene and it is thought to be involved in the pathogenesis of the ocular albinism with late-onset sensorineural deafness phenotype. This gene is highly similar to the Y chromosome TBL1Y gene.
23098	
23099	
23100	
23101	
23102	
23103	
23104	
23105	
23106	The product encoded by this gene is a member of a RNA-binding protein family and possesses nucleolytic activity against cytotoxic lymphocyte (CTL) target cells. It has been suggested that this protein may be involved in the induction of apoptosis as it preferentially recognizes poly(A) homopolymers and induces DNA fragmentation in CTL targets. The major granule-associated species is a 15-kDa protein that is thought to be derived from the carboxyl terminus of the 40-kDa product by proteolytic processing. Alternative splicing resulting in different isoforms of this gene product has been described in the literature.
23107	
23108	Stearoyl-CoA desaturases catalyze the committed step in the biosynthesis of monounsaturated fatty acids from saturated fatty acids. This reaction involves the introduction of a cis-double bond between carbons 9 and 10 in a spectrum of methylene-interrupted fatty acyl-CoAs.[supplied by OMIM]
23109	
23110	
23111	
23112	
23113	
23114	The nuclear envelope creates distinct nuclear and cytoplasmic compartments in eukaryotic cells. It consists of two concentric membranes perforated by nuclear pores, large protein complexes that form aqueous channels to regulate the flow of macromolecules between the nucleus and the cytoplasm. These complexes are composed of at least 100 different polypeptide subunits, many of which belong to the nucleoporin family. This gene encodes a member of the FG-repeat-containing nucleoporins. The protein encoded by this gene is an integral membrane protein that localizes to the central spoke ring complex and participates in anchoring the nuclear pore complex to the nuclear envelope. Alternatively spliced variants that encode different protein isoforms have been described but the full-length nature of only one has been determined.
23115	
23116	
23117	
23118	
23119	
23120	
23121	
23122	
23123	
23124	
23125	
23126	HLA-A belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-A alleles have been described.
23127	
23128	
23129	
23130	
23131	
23132	The protein encoded by this gene belongs to the NCP family of cell-recognition molecules. This family represents a distinct subgroup of the neurexins. NCP proteins mediate neuron-glial interactions in vertebrates and glial-glial contact in invertebrates. The protein encoded by this gene may play a role in cell recognition within the nervous system. Alternatively spliced transcript variants encoding different isoforms have been described but their biological nature has not been determined.
23133	
23134	
23135	
23136	
23137	
23138	
23139	
23140	
23141	The Ra-reactive factor (RARF) is a complement-dependent bactericidal factor that binds to the Ra and R2 polysaccharides expressed by certain enterobacteria. Alternate splicing of this gene results in two transcript variants encoding two RARF components that are involved in the mannan-binding lectin (MBL) pathway of complement activation. Each isoform is cleaved into two chains which form a heterodimer linked by a disulfide bond. The encoded proteins are members of the trypsin family of peptidases.
23142	
23143	
23144	
23145	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
23146	The protein encoded by this gene belongs to the plakin family of cytoskeletal linker proteins. This protein family forms bridges between different cytoskeletal elements through specialized modular domains. The encoded protein is one of the largest size proteins identified in human cytoskeletal proteins. It has functional actin and microtubule binding domains, and it appears to stabilize actin at sites where microtubules and microfilaments meet. It may function in microtubule dynamics to facilitate actin-microtubule interactions at the cell periphery and to couple the microtubule network to cellular junctions. Alternatively spliced transcript variants encoding distinct isoforms have been described.
23147	
23148	
23149	
23150	The bone morphogenetic protein (BMP) receptors are a family of transmembrane serine/threonine kinases that include the type I receptors BMPR1A and BMPR1B and the type II receptor BMPR2. These receptors are also closely related to the activin receptors, ACVR1 and ACVR2. The ligands of these receptors are members of the TGF-beta superfamily. TGF-betas and activins transduce their signals through the formation of heteromeric complexes with 2 different types of serine (threonine) kinase receptors: type I receptors of about 50-55 kD and type II receptors of about 70-80 kD. Type II receptors bind ligands in the absence of type I receptors, but they require their respective type I receptors for signaling, whereas type I receptors require their respective type II receptors for ligand binding.
23151	
23152	This gene encodes a protein containing multiple SH3 domains. It has been shown to bind many components of the JNK apoptosis signaling pathway including MLKs, MKKs and JNKs.
23153	
23154	
23155	
23156	
23157	
23158	
23159	
23160	
23161	
23162	The protein encoded by this gene is a member of the gelsolin/villin family of actin regulatory proteins. This protein has structural similarity to villin. It binds actin and may play a role in the development of neuronal cells that form ganglia.
23163	
23164	This gene encodes a member of the tenascin family of extracellular matrix proteins. The tenascins have anti-adhesive effects, as opposed to fibronectin which is adhesive. The specific function of this protein is unknown, but its deficiency has been associated with the connective tissue disorder Ehlers-Danlos syndrome. This gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6. It is one of four genes in this cluster which have been duplicated. The duplicated copy of this gene is incomplete and is a pseudogene which is transcribed but does not encode a protein. The structure of this gene is unusual in that it overlaps the CREBL1 and CYP21A2 genes at its 5@#$%& and 3@#$%& ends, respectively. The gene expresses two transcripts.
23165	This gene is one of two human homologs of Drosophila bicaudal-D and a member of the Bicoid family. It has been implicated in dynein-mediated, minus end-directed motility along microtubules. It has also been reported to be a phosphorylation target of NIMA related kinase 8. Two alternative splice variants have been described.
23166	
23167	The protein encoded by this gene is a nucleoplasmic protein. It binds cytidine-rich sequences in double-stranded DNA. This protein has three types of domains: MH1, MH2 (repeated three times) and MH3. It is associated with packaging, transferring, or processing transcripts. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of some variants has not been determined.
23168	
23169	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. The protein encoded by this gene functions as a receptor for ephrin A2, A3 and A5 and plays a role in short-range contact-mediated axonal guidance during development of the mammalian nervous system.
23170	This gene encodes a protein that contains the GAP-related domain (GRD), a characteristic domain of GTPase-activating proteins (GAPs). GAPs function as activators of Ras superfamily of small GTPases. The protein encoded by this gene is able to complement the defective RasGAP function in a yeast system. Two alternatively spliced transcript variants of this gene encoding distinct isoforms have been reported.
23171	
23172	The protein encoded by this gene is a component of the exocyst complex, a multiple protein complex essential for targeting exocytic vesicles to specific docking sites on the plasma membrane. Though best characterized in yeast, the component proteins and functions of exocyst complex have been demonstrated to be highly conserved in higher eukaryotes. At least eight components of the exocyst complex, including this protein, are found to interact with the v-ral simian leukemia viral oncogene homolog A (RALA), a Ras-related small GTPase involved in controlling actin cytoskeletal remodeling and vesicle transport. The protein complex containing the similar protein in rat was reported to be involved in synaptic vesicle docking and fusion at the presynaptic terminal.
23173	SPF38 is a component of the U5 small nuclear ribonucleoprotein (snRNP) particle. The U5 snRNP is part of the spliceosome, a multiprotein complex involved in the removal of introns from nuclear pre-mRNAs.[supplied by OMIM]
23174	
23175	
23176	
23177	
23178	
23179	
23180	
23181	
23182	
23183	
23184	Heterogeneous nuclear RNAs (hnRNAs) which include mRNA precursors and mature mRNAs are associated with specific proteins to form heterogenous ribonucleoprotein (hnRNP) complexes. Heterogeneous nuclear ribonucleoprotein L is among the proteins that are stably associated with hnRNP complexes and along with other hnRNP proteins is likely to play a major role in the formation, packaging, processing, and function of mRNA. Heterogeneous nuclear ribonucleoprotein L is present in the nucleoplasm as part of the HNRP complex. HNRP proteins have also been identified outside of the nucleoplasm. Exchange of hnRNP for mRNA-binding proteins accompanies transport of mRNA from the nucleus to the cytoplasm. Since HNRP proteins have been shown to shuttle between the nucleus and the cytoplasm, it is possible that they also have cytoplasmic functions. Two transcript variants encoding different isoforms have been found for this gene.
23185	KRT18 encodes the type I intermediate filament chain keratin 18. Keratin 18, together with its filament partner keratin 8, are perhaps the most commonly found members of the intermediate filament gene family. They are expressed in single layer epithelial tissues of the body. Mutations in this gene have been linked to cryptogenic cirrhosis. Two transcript variants encoding the same protein have been found for this gene.
23186	
23187	LYZ encodes human lysozyme, whose natural substrate is the bacterial cell wall peptidoglycan (cleaving the beta[1-4]glycosidic linkages between N-acetylmuramic acid and N-acetylglucosamine). Lysozyme is one of the anti-microbial agents found in human milk, and is also present in spleen, lung, kidney, white blood cells, plasma, aliva, and tears.  Missense mutations in LYZ have been identified in heritable renal anyloidosis.
23188	
23189	
23190	
23191	
23192	
23193	
23194	RICS is a neuron-associated GTPase-activating protein that may regulate dendritic spine morphology and strength by modulating Rho GTPase (see RHOA; MIM 165390) activity (Okabe et al., 2003 [PubMed 12531901]).[supplied by OMIM]
23195	The exact function of this gene is not known, however, its encoded product is highly similar to purine-rich element binding protein A. The latter is a DNA-binding protein which binds preferentially to the single strand of the purine-rich element termed PUR, and has been implicated in the control of both DNA replication and transcription. This gene lies in close proximity to the Werner syndrome gene, but on the opposite strand, on chromosome 8p11. Two transcript variants encoding different isoforms have been found for this gene.
23196	
23197	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
23198	
23199	Retention of resident soluble proteins in the lumen of the endoplasmic reticulum (ER) is achieved in both yeast and animal cells by their continual retrieval from the cis-Golgi, or a pre-Golgi compartment.  Sorting of these proteins is dependent on a C-terminal tetrapeptide signal, usually lys-asp-glu-leu (KDEL) in animal cells, and his-asp-glu-leu (HDEL) in S. cerevisiae.  This process is mediated by a receptor that recognizes, and binds the tetrapeptide-containing protein, and returns it to the ER.  In yeast, the sorting receptor encoded by a single gene, ERD2, is a seven-transmembrane protein.  Unlike yeast, several human homologs of the ERD2 gene, constituting the KDEL receptor gene family, have been described. KDELR2 was the second member of the family to be identified, and it encodes a protein which is 83% identical to the KDELR1 gene product.
23200	This gene product belongs to the SCAMP family of proteins which are secretory carrier membrane proteins. They function as carriers to the cell surface in post-golgi recycling pathways. Different family members are highly related products of distinct genes, and are usually expressed together. These findings suggest that the SCAMPs may function at the same site during vesicular transport rather than in separate pathways. Multiple transcript variants encoding different isoforms have been found for this gene, but the full-length nature of some variants has not been defined.
23201	
23202	
23203	The activation of gene transcription is a multistep process that is triggered by factors that recognize transcriptional enhancer sites in DNA. These factors work with co-activators to direct transcriptional initiation by the RNA polymerase II apparatus. The protein encoded by this gene is a subunit of the CRSP (cofactor required for SP1 activation) complex, which, along with TFIID, is required for efficient activation by SP1. This protein is also a component of other multisubunit complexes e.g. thyroid hormone receptor-(TR-) associated proteins which interact with TR and facilitate TR function on DNA templates in conjunction with initiation factors and cofactors.
23204	
23205	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Like most members, the encoded protein contains an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain.
23206	
23207	The protein encoded by this gene is a small GTPase of the Rho-subfamily, which regulates signaling pathways that control diverse cellular functions including cell morphology, migration, endocytosis and cell cycle progression. This protein is highly similar to Saccharomyces cerevisiae Cdc 42, and is able to complement the yeast cdc42-1 mutant. The product of oncogene Dbl was reported to specifically catalyze the dissociation of GDP from this protein. This protein could regulate actin polymerization through its direct binding to Neural Wiskott-Aldrich syndrome protein (N-WASP), which subsequently activates Arp2/3 complex. Alternative splicing of this gene results in at least two transcript variants.
23208	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the non-ATPase subunits of the 19S regulator lid. In addition to participation in proteasome function, this subunit may also participate in the TNF signalling pathway since it interacts with the tumor necrosis factor type 1 receptor. A pseudogene has been identified on chromosome 1.
23209	
23210	Oncogene FLT belongs to the src gene family and is related to oncogene ROS (MIM 165020). Like other members of this family, it shows tyrosine protein kinase activity that is important for the control of cell proliferation and differentiation. The sequence structure of the FLT gene resembles that of the FMS gene (MIM 164770); hence, Yoshida et al. (1987) proposed the name FLT as an acronym for FMS-like tyrosine kinase.[supplied by OMIM]
23211	
23212	
23213	
23214	
23215	
23216	The protein encoded by this gene was identified as a binding protein of the MAD2 mitotic arrest deficient-like 1 (MAD2/MAD2L1). MAD2 is a key component of the spindle checkpoint that delays the onset of anaphase until all the kinetochores are attached to the spindle. This protein may interact with the spindle checkpoint and coordinate cell cycle events in late mitosis. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
23217	
23218	
23219	
23220	
23221	
23222	
23223	This gene encodes a protein that is highly similar to the yeast SIN1 protein, a stress-activated protein kinase. Alternatively spliced transcript variants encoding distinct isoforms have been described. Alternate polyadenylation sites as well as alternate 3@#$%& UTRs have been identified for transcripts of this gene.
23224	
23225	The protein encoded by this gene is an inositol 1,4,5-trisphosphate (InsP3) 5-phosphatase and contains a Sac domain. The activity of this protein is specific for phosphatidylinositol 4,5-bisphosphate and phosphatidylinositol 3,4,5-trisphosphate. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
23226	This gene encodes a member of the F-box protein family, members of which are characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into three classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene contains a WD-40 domain, in addition to an F-box motif, so it belongs to the Fbw class. Alternatively spliced transcript variants encoding distinct isoforms have been identified for this gene.
23227	
23228	
23229	This gene is a member of the tropomyosin family of highly conserved, widely distributed actin-binding proteins involved in the contractile system of striated and smooth muscles and the cytoskeleton of non-muscle cells. Tropomyosin is composed of two alpha-helical chains arranged as a coiled-coil. It is polymerized end to end along the two grooves of actin filaments and provides stability to the filaments. The encoded protein is one type of alpha helical chain that forms the predominant tropomyosin of striated muscle, where it also functions in association with the troponin complex to regulate the calcium-dependent interaction of actin and myosin during muscle contraction. In smooth muscle and non-muscle cells, alternatively spliced transcript variants encoding a range of isoforms have been described. Mutations in this gene are associated with type 3 familial hypertrophic cardiomyopathy.
23230	The protein encoded by this gene is a member of the calcium-binding protein family.  The specific function of this protein has not yet been determined; however this protein is known to interact with DNA-dependent protein kinase and may play a role in kinase-phosphatase regulation of DNA end joining.  This protein also interacts with integrin alpha(IIb)beta(3), which may implicate this protein as a regulatory molecule for alpha(IIb)beta(3).
23231	
23232	
23233	
23234	CDC42, a small Rho GTPase, regulates the formation of F-actin-containing structures through its interaction with the downstream effector proteins. The protein encoded by this gene is a member of the Borg family of CDC42 effector proteins. Borg family proteins contain a CRIB (Cdc42/Rac interactive-binding) domain. They bind to, and negatively regulate the function of, CDC42. The specific function of this protein has not yet been determined.
23235	
23236	
23237	
23238	
23239	
23240	
23241	
23242	
23243	
23244	
23245	
23246	
23247	
23248	
23249	
23250	GRASP1 is a neuron-specific guanine nucleotide exchange factor for the Ras family of small G proteins (RasGEF) and is associated with the GRIP/AMPA receptor complex in brain.[supplied by OMIM]
23251	ATF6 is an endoplasmic reticulum (ER) stress-regulated transmembrane transcription factor that activates the transcription of ER molecules.[supplied by OMIM]
23252	
23253	
23254	
23255	
23256	The protein encoded by this gene is a calcyclin binding protein. It may be involved in calcium-dependent ubiquitination and subsequent proteosomal degradation of target proteins. It probably serves as a molecular bridge in ubiquitin E3 complexes and participates in the ubiquitin-mediated degradation of beta-catenin. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
23257	
23258	Moesin (for membrane-organizing extension spike protein) is a member of the ERM family which includes ezrin and radixin. ERM proteins appear to function as cross-linkers between plasma membranes and actin-based cytoskeletons. Moesin is localized to filopodia and other membranous protrusions that are important for cell-cell recognition and signaling and for cell movement.
23259	
23260	
23261	
23262	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
23263	
23264	The protein encoded by this gene is a type I transmembrane protein, belonging to the CD2 subfamily of the immunoglobulin superfamily. This encoded protein is expressed on Natural killer (NK), T, and B lymphocytes. It undergoes tyrosine phosphorylation and associates with the Src homology 2 domain-containing protein (SH2D1A) as well as with SH2 domain-containing phosphatases (SHPs). It may function as a coreceptor in the process of NK cell activation. It can also mediate inhibitory signals in NK cells from X-linked lymphoproliferative patients.
23265	
23266	The protein encoded by this gene is a member of the AKT, also called PKB,  serine/threonine protein kinase family. AKT kinases are known to be regulators of cell signaling in response to insulin and growth factors. They are involved in a wide variety of biological processes including cell proliferation, differentiation, apoptosis, tumorigenesis, as well as glycogen synthesis and glucose uptake. This kinase has been shown to be stimulated by platelet-derived growth factor (PDGF), insulin, and insulin-like growth factor 1 (IGF1). Alternatively splice transcript variants encoding distinct isoforms have been described.
23267	
23268	
23269	
23270	
23271	
23272	This gene encodes a mature peptide that is an active cyclic heptapeptide absolutely conserved from lamprey to human. The active peptide acts as a vasoconstrictor and is expressed only in brain tissue. Despite the gene family name similarity, this gene is not homologous to urocortin, a member of the sauvagine/corticotropin-releasing factor/urotensin I family. Most of the proprotein is cleaved to make the mature peptide. Transcript variants encoding different preproprotein isoforms have been described for this gene.
23273	
23274	
23275	
23276	
23277	
23278	
23279	The protein encoded by this gene is a member of the cyclin family of proteins. The encoded protein interacts with cyclin-dependent kinase 8 and induces the phophorylation of the carboxy-terminal domain of the large subunit of RNA polymerase II. The level of mRNAs for this gene peaks in the G1 phase of the cell cycle. Two transcript variants encoding different isoforms have been found for this gene.
23280	
23281	
23282	
23283	
23284	
23285	
23286	
23287	
23288	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. The protein encoded by this gene and SEC62 protein are found to be associated with ribosome-free SEC61 complex. It is speculated that Sec61-Sec62-Sec63 may perform post-translational protein translocation into the ER. The Sec61-Sec62-Sec63 complex might also perform the backward transport of ER proteins that are subject to the ubiquitin-proteasome-dependent degradation pathway. The encoded protein is an integral membrane protein located in the rough ER.
23289	
23290	
23291	
23292	
23293	
23294	Many vertebrate homeo box-containing genes have been identified on the basis of their sequence similarity with Drosophila developmental genes. Members of the Dlx gene family contain a homeobox that is related to that of Distal-less (Dll), a gene expressed in the head and limbs of the developing fruit fly. The Distal-less (Dlx) family of genes comprises at least 6 different members, DLX1-DLX6. The DLX proteins are postulated to play a role in forebrain and craniofacial development
23295	The protein encoded by this gene is a cell-surface glycoprotein found on melanoma cells. The protein shares sequence similarity and iron-binding properties with members of the transferrin superfamily. The importance of the iron binding function has not yet been identified. This gene resides in the same region of chromosome 3 as members of the transferrin superfamily. Alternative splicing results in two transcript variants.
23296	
23297	
23298	
23299	
23300	
23301	
23302	
23303	This gene belongs to subfamily P of the forkhead box (FOX) transcription factor family. Forkhead box transcription factors play important roles in the regulation of tissue- and cell type-specific gene transcription during both development and adulthood. Forkhead box P1 protein contains both DNA-binding- and protein-protein binding-domains. This gene may act as a tumor suppressor as it is lost in several tumor types and maps to a chromosomal region (3p14.1) reported to contain a tumor suppressor gene(s). Alternative splicing results in multiple transcript variants encoding different isoforms.
23304	
23305	
23306	The protein encoded by this gene belongs to the protein kinase C (PKC) family of serine/threonine protein kinases. The PKC family comprises at least eight members, which are differentially expressed and are involved in a wide variety of cellular processes. This protein kinase is calcium-independent and phospholipid-dependent. It is not activated by phorbolesters or diacylglycerol. This kinase can be recruited to vesicle tubular clusters (VTCs) by direct interaction with the small GTPase RAB2, where this kinase phosphorylates glyceraldehydes-3-phosphate dehydrogenase (GAPD/GAPDH) and plays a role in microtubule dynamics in the early secretory pathway. This kinase is found to be necessary for BCL-ABL-mediated resistance to drug-induced apoptosis and therefore protects leukemia cells against drug-induced apoptosis. There is a single exon pseudogene mapped on chromosome X.
23307	Prostatic acid phosphatase precursor catalyzes the conversion of orthophosphoric monoester to alcohol and orthophosphate.  ACPP is synthesized under androgen regulation and secreted by the epithelial cells of the prostrate gland.  Sequence comparison has indicated the presence of several polymorphisms.  ACPP undergoes alternative splicing in the 3@#$%& UTR.
23308	
23309	
23310	
23311	
23312	
23313	
23314	
23315	
23316	The protein encoded by this gene is a member of the fibroblast growth factor receptor (FGFR) family, where amino acid sequence is highly conserved between members and throughout evolution. FGFR family members differ from one another in their ligand affinities and tissue distribution. A full-length representative protein would consist of an extracellular region, composed of three immunoglobulin-like domains, a single hydrophobic membrane-spanning segment and a cytoplasmic tyrosine kinase domain. The extracellular portion of the protein interacts with fibroblast growth factors, setting in motion a cascade of downstream signals, ultimately influencing mitogenesis and differentiation. A marked difference between this gene product and the other family members is its lack of a cytoplasmic tyrosine kinase domain. The result is a transmembrane receptor that could interact with other family members and potentially inhibit signaling. Multiple alternatively spliced transcript variants encoding the same isoform have been found for this gene.
23317	
23318	
23319	Transforming acidic coiled-coil proteins are a conserved family of centrosome- and microtubule-interacting proteins that are implicated in cancer. This gene encodes a protein that concentrates at centrosomes throughout the cell cycle. This gene lies within a chromosomal region associated with tumorigenesis. Expression of this gene is thought to affect the progression of breast tumors. Expression of this gene is also induced by erythropoietin.
23320	
23321	The protein encoded by this gene is a member of the Ser/Thr portein kinase family. This protein contains a Cdc42/Rac-binding p21 binding domain resembling that of PAK kinase. The kinase domain of this protein is most closely related to that of myotonic dystrophy kinase-related ROK. Studies of the similar gene in rat suggested that this kinase may act as a downstream effector of Cdc42 in cytoskeletal reorganization.
23322	
23323	
23324	
23325	
23326	
23327	
23328	
23329	
23330	
23331	
23332	
23333	
23334	
23335	
23336	
23337	
23338	
23339	
23340	
23341	
23342	
23343	
23344	This gene encodes a protein with similarity to the Saccharomyces cerevisiae Rpa12p subunit of RNA polymerase I. Alternate splicing of this gene results in two transcript variants encoding the same protein. Additional splice variants have been described, but their full length sequences have not been determined.
23345	
23346	Immunoglobulins recognize foreign antigens and initiate immune responses such as phagocytosis and the complement system. Each immunoglobulin molecule consists of two identical heavy chains and two identical light chains. There are two classes of light chains, kappa and lambda. This region represents the germline organization of the lambda light chain locus. The locus includes V (variable), J (joining), and C (constant) segments. During B cell development, a recombination event at the DNA level joins a single V segment with a J segment; the C segment is later joined by splicing at the RNA level. Recombination of many different V segments with several J segments provides a wide range of antigen recognition. Additional diversity is attained by junctional diversity, resulting from the random additional of nucleotides by terminal deoxynucleotidyltransferase, and by somatic hypermutation, which occurs during B cell maturation in the spleen and lymph nodes. Several V segments and three C segments are known to be incapable of encoding a protein and are considered pseudogenes. The locus also includes several non-immunoglobulin genes, many of which are pseudogenes or are predicted by automated computational analysis or homology to other species.
23347	This gene has higher expression in p53-expressing cells than in control cells and is an immediate-early induction target of p53-mediated apoptosis. The protein encoded by this gene contains six putative transmembrane domains and may suppress cell growth by inducing apoptotic cell death through the caspase 9 and mitochondrial pathways. This gene is located on human chromosome 11q24, a region frequently altered in cancers. Alternative splicing results in two transcript variants encoding different isoforms.
23348	
23349	
23350	This gene encodes a member of the eyes absent (EYA) family of proteins. The encoded protein may act as a transcriptional activator and have a role during development. A similar protein in mice can act as a transcriptional activator. Two transcript variants encoding distinct isoforms have been identified for this gene.
23351	
23352	
23353	
23354	
23355	
23356	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit contains an N-terminal glutathione transferase domain, which may be involved in regulating the assembly of multisubunit complexes containing this elongation factor and aminoacyl-tRNA synthetases.
23357	
23358	
23359	
23360	Malate dehydrogenase catalyzes the reversible oxidation of malate to oxaloacetate, utilizing the NAD/NADH cofactor system in the citric acid cycle. The protein encoded by this gene is localized to the mitochondria and may play pivotal roles in the malate-aspartate shuttle that operates in the metabolic coordination between cytosol and mitochondria.
23361	
23362	The protein encoded by this gene is a member of the NR1 subfamily of nuclear hormone receptors. It can bind as a monomer or as a homodimer to hormone response elements upstream of several genes to enhance the expression of those genes. The specific functions of this protein are not known, but it has been shown to interact with NM23-2, a nucleoside diphosphate kinase involved in organogenesis and differentiation, as well as with NM23-1, the product of a tumor metastasis suppressor candidate gene. Four transcript variants encoding different isoforms have been described for this gene.
23363	
23364	
23365	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins.
23366	
23367	This gene encodes a cytoplasmic protein which specifically colocalizes and interacts with the serine/threonine protein kinase, receptor-interacting protein (RIP). Zinc finger domains of the encoded protein are required for its interaction with RIP and for inhibition of TNF- and IL1-induced NF-kappa B activation pathways. The encoded protein may also function as an E3 ubiquitin-protein ligase which accepts ubiquitin from E2 ubiquitin-conjugating enzymes and transfers it to substrates. Several alternatively spliced transcript variants have been described for this locus but the full-length natures of only some are known.
23368	
23369	
23370	
23371	
23372	
23373	
23374	
23375	
23376	
23377	
23378	Vinculin is a cytoskeletal protein associated with cell-cell and cell-matrix junctions, where it is thought to function as one of several interacting proteins involved in anchoring F-actin to the membrane. Multiple alternatively spliced transcript variants have been found for this gene, but the biological validity of some variants has not been determined. Human vinculin protein exhibits a greater than 95% sequence identity to the chicken vinculin protein.
23379	The protein encoded by this gene contains a RING finger domain, a motif present in a variety of functionally distinct proteins and known to be involved in protein-protein and protein-DNA interactions.
23380	
23381	
23382	ADP-ribosylation factor 6 (ARF6) is a member of the human ARF gene family. These genes encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking and as activators of phospholipase D.  The gene products include 6 ARF proteins and 11 ARF-like proteins and constitute 1 family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2,and ARF3), class II (ARF4 and ARF5) and class III (ARF6). The ARF6 protein is uniquely localized to the plasma membrane.
23383	
23384	
23385	
23386	ODF3 is a component of sperm flagella outer dense fibers, which add stiffness, elastic recoil, and protection against shearing forces during sperm movement.[supplied by OMIM]
23387	
23388	
23389	
23390	
23391	
23392	
23393	
23394	RAN (ras-related nuclear protein) is a small GTP binding protein belonging to the RAS superfamily that is essential for the translocation of RNA and proteins through the nuclear pore complex. The RAN protein is also involved in control of DNA synthesis and cell cycle progression. Nuclear localization of RAN requires the presence of regulator of chromosome condensation 1 (RCC1). Mutations in RAN disrupt DNA synthesis. Because of its many functions, it is likely that RAN interacts with several other proteins. RAN regulates formation and organization of the microtubule network independently of its role in the nucleus-cytosol exchange of macromolecules. RAN could be a key signaling molecule regulating microtubule polymerization during mitosis. RCC1 generates a high local concentration of RAN-GTP around chromatin which, in turn, induces the local nucleation of microtubules.  RAN is an androgen receptor (AR) coactivator that binds differentially with different lengths of polyglutamine within the androgen receptor. Polyglutamine repeat expansion in the AR is linked to Kennedy@#$%&s disease (X-linked spinal and bulbar muscular atrophy). RAN coactivation of the AR diminishes with polyglutamine expansion within the AR, and this weak coactivation may lead to partial androgen insensitivity during the development of Kennedy@#$%&s disease.
23395	
23396	
23397	
23398	
23399	
23400	
23401	
23402	
23403	
23404	
23405	
23406	
23407	
23408	
23409	
23410	
23411	
23412	
23413	
23414	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. The encoded protein interacts with both the type I and type II regulatory subunits of PKA; therefore, it is a dual-specific AKAP. This protein is highly enriched in mitochondria. It contains RGS (regulator of G protein signalling) domains, in addition to a PKA-RII subunit-binding domain. The mitochondrial localization and the presence of RGS domains may have important implications for the function of this protein in PKA and G protein signal transduction.
23415	
23416	
23417	
23418	
23419	
23420	
23421	DAD1, the defender against apoptotic cell death, was initially identified as a negative regulator of programmed cell death in the temperature sensitive tsBN7 cell line.  The DAD1 protein disappeared in temperature-sensitive cells following a shift to the nonpermissive temperature, suggesting that loss of the DAD1 protein triggered apoptosis.  DAD1 is believed to be a tightly associated subunit of oligosaccharyltransferase both in the intact membrane and in the purified enzyme, thus reflecting the essential nature of N-linked glycosylation in eukaryotes.
23422	
23423	
23424	
23425	
23426	
23427	
23428	This gene product is a member of the serine-arginine (SR) family of proteins, which is involved in constitutive and regulated RNA splicing. Members of this family are characterized by N-terminal RNP1 and RNP2 motifs, which are required for binding to RNA, and multiple C-terminal SR/RS repeats, which are important in mediating association with other cellular proteins. This protein can influence splice site selection of adenovirus E1A pre-mRNA. It interacts with the oncoprotein TLS, and abrogates the influence of TLS on E1A pre-mRNA splicing. Alternative splicing of this gene results in at least two transcript variants encoding different isoforms. In addition, transcript variants utilizing alternative polyA sites exist.
23429	
23430	
23431	
23432	
23433	This gene encodes a member of the STAT-induced STAT inhibitor (SSI), also known as suppressor of cytokine signaling (SOCS), family. SSI family members are cytokine-inducible negative regulators of cytokine signaling. The expression of this gene can be induced by a subset of cytokines, including IL2, IL3 erythropoietin (EPO), CSF2/GM-CSF, and interferon (IFN)-gamma. The protein encoded by this gene functions downstream of cytokine receptors, and takes part in a negative feedback loop to attenuate cytokine signaling. Knockout studies in mice suggested the role of this gene as a modulator of IFN-gamma action, which is required for normal postnatal growth and survival.
23434	
23435	
23436	IRX5 is a member of the Iroquois homeobox gene family. Members of this family appear to play multiple roles during pattern formation of vertebrate embryos.[supplied by OMIM]
23437	
23438	
23439	
23440	
23441	
23442	
23443	
23444	This gene is a member of the ADAR (adenosine deaminase acting on RNA) family.  Using site-specific adenosine modification, proteins encoded by these genes participate in the pre-mRNA editing of nuclear transcripts.  The protein encoded by this gene, tRNA-specific adenosine deaminase 1, is responsible for the deamination of adenosine 37 to inosine in eukaryotic tRNA.
23445	
23446	The Golgi apparatus, which participates in glycosylation and transport of proteins and lipids in the secretory pathway, consists of a series of stacked cisternae (flattened membrane sacs). Interactions between the Golgi and microtubules are thought to be important for the reorganization of the Golgi after it fragments during mitosis. The golgins are a family of proteins, of which the protein encoded by this gene is a member, that are localized to the Golgi. This encoded protein is associated with Sjogren@#$%&s syndrome.
23447	
23448	
23449	
23450	
23451	The protein encoded by this gene is a type II membrane protein that catalyzes the transfer of sialic acid from CMP-sialic acid to galactose-containing substrates. The encoded protein is normally found in the Golgi but can be proteolytically processed to a soluble form. This protein, which is a member of glycosyltransferase family 29, can use the same acceptor substrates as does sialyltransferase 4A.
23452	
23453	
23454	
23455	
23456	
23457	
23458	
23459	This gene encodes a transmembrane and secreted protein with characteristics of a type 1a transmembrane protein. It is found in a perinuclear Golgi-like pattern and thought to be involved in hematopoietic and/or immune system processes.
23460	
23461	
23462	
23463	
23464	
23465	The E2F transcription factor family (see MIM 189971) regulates the expression of various cellular promoters, particularly those involved in the cell cycle. E2F factors bind to DNA as homodimers or heterodimers in association with dimerization partner DP1. TFDP1 may be the first example of a family of related transcription factors; see TFDP2 (MIM 602160).[supplied by OMIM]
23466	
23467	
23468	
23469	
23470	
23471	
23472	This gene encodes a protein that is similar to the S. cerevisiae suppressor of actin mutations 2 gene. The rat, mouse, and human proteins share a high percentage of identity to each other (95-98%). This gene lies in a head-to-head orientation with the gene encoding ribosomal protein S18. Two transcript variants have been identified for this gene.
23473	This gene is the cellular homolog of the Yamaguchi sarcoma virus oncogene. The encoded protein has tyrosine kinase activity and belongs to the src family of proteins. This gene lies in close proximity to thymidylate synthase gene on chromosome 18, and a corresponding pseudogene has been found on chromosome 22.
23474	
23475	
23476	
23477	
23478	
23479	The Fanconi anemia complementation group (FANC) currently includes FANCA, FANCB, FANCC, FANCD1 (also called BRCA2), FANCD2, FANCE, FANCF, FANCG, and FANCL. Fanconi anemia is a genetically heterogeneous recessive disorder characterized by cytogenetic instability, hypersensitivity to DNA crosslinking agents, increased chromosomal breakage, and defective DNA repair. The members of the Fanconi anemia complementation group do not share sequence similarity; they are related by their assembly into a common nuclear protein complex. This gene encodes the protein for complementation group D2. This protein is monoubiquinated in response to DNA damage, resulting in its localization to nuclear foci with other proteins (BRCA1 AND BRCA2) involved in homology-directed DNA repair. Alternative splicing results in two transcript variants encoding different isoforms.
23480	
23481	
23482	
23483	
23484	
23485	
23486	
23487	
23488	
23489	
23490	Presenilins, which are components of the gamma-secretase protein complex, are required for intramembranous processing of some type I transmembrane proteins, such as the Notch proteins and the beta-amyloid precursor protein. Signaling by Notch receptors mediates a wide range of developmental cell fates. Processing of the beta-amyloid precursor protein generates neurotoxic amyloid beta peptides, the major component of senile plaques associated with Alzheimer@#$%&s disease. This gene encodes a protein that is required for Notch pathway signaling, and for the activity and accumulation of gamma-secretase.
23491	
23492	
23493	
23494	
23495	
23496	
23497	
23498	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. However, this protein is considered a member of the cytochrome P450 superfamily on the basis of sequence similarity rather than functional similarity. This endoplasmic reticulum membrane protein catalyzes the conversion of prostglandin H2 to prostacyclin (prostaglandin I2), a potent vasodilator and inhibitor of platelet aggregation. An imbalance of prostacyclin and its physiological antagonist thromboxane A2 contribute to the development of myocardial infarction, stroke, and atherosclerosis.
23499	
23500	Pre-mRNA splicing is catalyzed by the spliceosome. U12-type spliceosome binds U12-type pre-mRNAs and recognizes the 5@#$%& splice site and branch-point sequence. U11 and U12 snRNPs are components of U12-type spliceosome and function as a molecular bridge connecting both ends of the intron. The protein encoded by this gene contains a RNA recognition motif. It was identified as one of the protein components of U11/U12 snRNPs. This protein and many other U11/U12 snRNP proteins are highly conserved in organisms known to contain U12-type introns. These proteins have been shown to be essential for cell viability, suggesting the key roles in U12-type splicing.
23501	
23502	
23503	
23504	
23505	Eukaryotic translation initiation factor 2 (EIF-2) functions in the early steps of protein synthesis by forming a ternary complex with GTP and initiator tRNA and binding to a 40S ribosomal subunit. EIF-2 is composed of three subunits, alpha, beta, and gamma, with the protein encoded by this gene representing the beta subunit. The beta subunit catalyzes the exchange of GDP for GTP, which recycles the EIF-2 complex for another round of initiation.
23506	
23507	This gene encodes an ethanolamine kinase, which functions in the first committed step of the phosphatidylethanolamine synthesis pathway. This cytosolic enzyme is specific for ethanolamine and exhibits negligible kinase activity on choline.
23508	
23509	
23510	
23511	The protein encoded by this gene is involved in the transsulfuration pathway. The first step of this pathway, from homocysteine to cystathionine, is catalyzed by this protein. CBS deficiency can cause homocystinuria which affects many organs and tissues, including the eyes and the skeletal, vascular and central nervous systems.
23512	
23513	
23514	
23515	
23516	This gene is a member of the apolipoprotein L gene family. The encoded protein is found in the cytoplasm, where it may affect the movement of lipids or allow the binding of lipids to organelles. Two transcript variants encoding the same protein have been found for this gene.
23517	
23518	
23519	
23520	
23521	
23522	
23523	
23524	
23525	
23526	
23527	
23528	
23529	
23530	Alpha actinins belong to the spectrin gene superfamily which represents a diverse group of cytoskeletal proteins, including the alpha and beta spectrins and dystrophins.  Alpha actinin is an actin-binding protein with multiple roles in different cell types.  In nonmuscle cells, the cytoskeletal isoform is found along microfilament bundles and adherens-type junctions, where it is involved in binding actin to the membrane.  In contrast, skeletal, cardiac, and smooth muscle isoforms are localized to the Z-disc and analogous dense bodies, where they help anchor the myofibrillar actin filaments.  This gene encodes a nonmuscle, alpha actinin isoform which is concentrated in the cytoplasm, and thought to be involved in metastatic processes.  Mutations in this gene have been associated with focal and segmental glomerulosclerosis.
23531	
23532	
23533	
23534	
23535	
23536	
23537	
23538	
23539	
23540	
23541	
23542	
23543	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 99% identical to the mouse, rat and sheep orthologs. The encoded protein interacts with IRS1 protein, suggesting a role in regulating insulin sensitivity. Two transcript variants differing in the 5@#$%& UTR, but encoding the same protein, have been identified for this gene.
23544	
23545	
23546	
23547	
23548	
23549	
23550	
23551	
23552	
23553	Selenoproteins, such as SELO, contain the rare twenty-first amino acid, selenocysteine (sec). These proteins lack common amino acid sequence motifs, but the 3-prime untranslated regions of selenoprotein genes have a common stem-loop structure, the sec insertion sequence (SECIS), that is necessary for the recognition of UGA as a sec codon rather than as a stop signal.[supplied by OMIM]
23554	
23555	
23556	
23557	
23558	
23559	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum and catalyzes the last steps of estrogen biosynthesis, three successive hydroxylations of the A ring of androgens. Mutations in this gene can result in either increased or decreased aromatase activity; the associated phenotypes suggest that estrogen functions both as a sex steroid hormone and in growth or differentiation. The gene expresses two transcript variants.
23560	
23561	
23562	
23563	
23564	
23565	
23566	
23567	This gene encodes a protein that is part of the EGFR pathway. The protein is present at clatherin-coated pits and is involved in receptor-mediated endocytosis of EGF. Notably, this gene is rearranged with the HRX/ALL/MLL gene in acute myelogeneous leukemias. Alternate transcriptional splice variants of this gene have been observed but have not been thoroughly characterized.
23568	
23569	Vasoactive intestinal peptide (VIP) is a neuroprotective factor that has been shown to have a stimulatory effect on the growth of some tumor cells and an inhibitory effect on others. The protein encoded by this gene is upregulated by VIP and may be involved in the stimulatory effect of VIP on certain tumor cells. The encoded protein contains one homeobox and nine zinc finger domains, suggesting that it functions as a transcription factor. This gene is also upregulated in normal proliferative tissues. Finally, the encoded protein may increase the viability of certain cell types through modulation of p53 activity. Two transcript variants encoding the same protein have been found for this gene.
23570	
23571	
23572	
23573	
23574	
23575	
23576	
23577	
23578	
23579	
23580	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. The encoded protein is one of two functional homologs to yeast Upf3p. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein binds to the mRNA and remains bound after nuclear export, acting as a nucleocytoplasmic shuttling protein. It forms with Y14 a complex that binds specifically 20 nt upstream of exon-exon junctions. This gene is located on the long arm of chromosome 13. Two splice variants encoding different isoforms have been found for this gene.
23581	
23582	
23583	
23584	
23585	
23586	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 non-identical kinase catalytic domains and phosphorylates several residues of the S6 ribosomal protein. The kinase activity of this protein leads to an increase in protein synthesis and cell proliferation. Amplification of the region of DNA encoding this gene and overexpression of this kinase are seen in some breast cancer cell lines. Alternate translational start sites have been described and alternate transcriptional splice variants have been observed but have not been thoroughly characterized.
23587	
23588	
23589	The protein encoded by this gene is one subunit of a trimeric complex, forming a highly conserved transcription factor that binds with high specificity to CCAAT motifs in the promoter regions in a variety of genes. This gene product, subunit B, forms a tight dimer with the C subunit, a prerequisite for subunit A association. The resulting trimer binds to DNA with high specificity and affinity. Subunits B and C each contain a histone-like motif. Observation of the histone nature of these subunits is supported by two types of evidence; protein sequence alignments and experiments with mutants.
23590	
23591	
23592	
23593	
23594	
23595	
23596	
23597	
23598	
23599	
23600	
23601	
23602	
23603	
23604	
23605	
23606	
23607	
23608	X inactivation is an early developmental process in mammalian females that transcriptionally silences one of the pair of X chromosomes, thus providing dosage equivalence between males and females. The process is regulated by several factors, including a region of chromosome X called the X inactivation center (XIC). The XIST gene is expressed exclusively from the XIC of the inactive X chromosome. The transcript is spliced but apparently does not encode a protein. The transcript remains in the nucleus where it coats the inactive X chromosome. Alternatively spliced transcript variants have been identified, but their full length sequences have not been determined. Mutations in the XIST promoter cause familial skewed X inactivation.
23609	HIPK2 is a conserved serine/threonine nuclear kinase that interacts with homeodomain transcription factors.[supplied by OMIM]
23610	
23611	
23612	Adenylate kinases are involved in regulating the adenine nucleotide composition within a cell by catalyzing the reversible transfer of phosphate groups among adenine nucleotides. Three isozymes of adenylate kinase, namely 1, 2, and 3, have been identified in vertebrates; this gene encodes isozyme 2. Expression of these isozymes is tissue-specific and developmentally regulated. Isozyme 2 is localized in the mitochondrial intermembrane space and may play a role in apoptosis. Transcript variants encoding distinct isoforms have been identified for this gene.
23613	
23614	
23615	
23616	
23617	
23618	
23619	
23620	
23621	HLA-DMB belongs to the HLA class II beta chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DMA) and a beta (DMB) chain, both anchored in the membrane. It is located in intracellular vesicles. DM plays a central role in the peptide loading of MHC class II molecules by helping to release the CLIP (class II-associated invariant chain peptide) molecule from the peptide binding site. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa and its gene contains 6 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail.
23622	
23623	
23624	
23625	
23626	
23627	
23628	
23629	
23630	
23631	
23632	
23633	Immunoglobulins recognize foreign antigens and initiate immune responses such as phagocytosis and the complement system. Each immunoglobulin molecule consists of two identical heavy chains and two identical light chains. There are two classes of light chains, kappa and lambda. This region represents the germline organization of the lambda light chain locus. The locus includes V (variable), J (joining), and C (constant) segments. During B cell development, a recombination event at the DNA level joins a single V segment with a J segment; the C segment is later joined by splicing at the RNA level. Recombination of many different V segments with several J segments provides a wide range of antigen recognition. Additional diversity is attained by junctional diversity, resulting from the random additional of nucleotides by terminal deoxynucleotidyltransferase, and by somatic hypermutation, which occurs during B cell maturation in the spleen and lymph nodes. Several V segments and three C segments are known to be incapable of encoding a protein and are considered pseudogenes. The locus also includes several non-immunoglobulin genes, many of which are pseudogenes or are predicted by automated computational analysis or homology to other species.
23634	
23635	This gene encodes a fibrillar collagen that is found in extensible connective tissues such as skin, lung, and the vascular system, frequently in association with type I collagen. Mutations in this gene are associated with Ehlers-Danlos syndrome type IV, and with aortic and arterial aneurysms. Although alternate transcripts have been detected for this gene, they are the result of mutations; these mutations alter splicing, often leading to the exclusion of multiple exons.
23636	WD40 repeat proteins are key components of many essential biologic functions. They regulate the assembly of multiprotein complexes by presenting a beta-propeller platform for simultaneous and reversible protein-protein interactions. Members of the WIPI subfamily of WD40 repeat proteins, such as WIPI4, have a 7-bladed propeller structure and contain a conserved motif for interaction with phospholipids (Proikas-Cezanne et al., 2004).[supplied by OMIM]
23637	
23638	
23639	
23640	
23641	
23642	
23643	This gene encodes a telomere specific protein which is a component of the telomere nucleoprotein complex. This protein is present at telomeres throughout the cell cycle and functions as an inhibitor of telomerase, acting in cis to limit the elongation of individual chromosome ends. The protein structure contains a C-terminal Myb motif, a dimerization domain near its N-terminus and an acidic N-terminus. Two transcripts of this gene are alternatively spliced products.
23644	
23645	
23646	
23647	
23648	
23649	
23650	This gene is specifically expressed in germ cells at round spermatid stages during spermatogenesis. The protein encoded by this gene contains an N-terminal RING finger motif, a B-box, and a C-terminal B-30.2-like domain. When overexpressed in COS-7 and HEK-293 cells, this protein was shown to induce cell apoptosis. Alternatively spliced transcript variants encoding distinct isoforms have been described.
23651	
23652	
23653	
23654	
23655	
23656	
23657	
23658	
23659	
23660	
23661	
23662	
23663	
23664	
23665	
23666	The protein encoded by this gene regulates the turnover of diphosphoinositol polyphosphates. The turnover of these high-energy diphosphoinositol polyphosphates represents a molecular switching activity with important regulatory consequences. Molecular switching by diphosphoinositol polyphosphates may contribute to regulating intracellular trafficking. Several alternatively spliced transcript variants have been described, but the full-length nature of some variants has not been determined. Isoforms DIPP2alpha and DIPP2beta are distinguishable from each other solely by DIPP2beta possessing one additional amino acid due to intron boundary skidding in alternate splicing.
23667	This gene encodes a multifunctional proinflammatory cytokine that belongs to the tumor necrosis factor (TNF) superfamily. This cytokine is mainly secreted by macrophages. It can bind to, and thus functions through its receptors TNFRSF1A/TNFR1 and TNFRSF1B/TNFBR. This cytokine is involved in the regulation of a wide spectrum of biological processes including cell proliferation, differentiation, apoptosis, lipid metabolism, and coagulation. This cytokine has been implicated in a variety of diseases, including autoimmune diseases, insulin resistance, and cancer. Knockout studies in mice also suggested the neuroprotective function of this cytokine.
23668	The protein encoded by this gene is a member of the serine/threonine protein kinase family. Although this kinase is found in many tissues, its expression in lymphoid follicles is restricted to the cells of germinal centre, where it may participate in B-cell differentiation. This kinase can be activated by TNF-alpha, and has been shown to specifically activate MAP kinases. This kinase is also found to interact with TNF receptor-associated factor 2 (TRAF2), which is involved in the activation of MAP3K1/MEKK1.
23669	This gene encodes a member of the neuropilin family of receptor proteins. The encoded transmembrane protein binds to SEMA3C protein {sema domain, immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin) 3C} and SEMA3F protein {sema domain, immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin) 3F}, and interacts with vascular endothelial growth factor (VEGF). This protein may play a role in cardiovascular development, axon guidance, and tumorigenesis. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
23670	
23671	TRPCs, mammalian homologs of the Drosophila transient receptor potential (trp) protein, are ion channels that are thought to mediate capacitative calcium entry into the cell. TRP-PLIK is a protein that is both an ion channel and a kinase. As a channel, it conducts calcium and monovalent cations to depolarize cells and increase intracellular calcium. As a kinase, it is capable of phosphorylating itself and other substrates. The kinase activity is necessary for channel function, as shown by its dependence on intracellular ATP and by the kinase mutants.[supplied by OMIM]
23672	
23673	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes. They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis. Many of the effects of laminin are mediated through interactions with cell surface receptors. These receptors include members of the integrin family, as well as non-integrin laminin-binding proteins. This gene encodes a high-affinity, non-integrin family, laminin receptor 1. This receptor has been variously called 67 kD laminin receptor, 37 kD laminin receptor precursor (37LRP) and p40 ribosome-associated protein. The amino acid sequence of laminin receptor 1 is highly conserved through evolution, suggesting a key biological function. It has been observed that the level of the laminin receptor transcript is higher in colon carcinoma tissue and lung cancer cell line than their normal counterparts. Also, there is a correlation between the upregulation of this polypeptide in cancer cells and their invasive and metastatic phenotype. Multiple copies of this gene exist, however, most of them are pseudogenes thought to have arisen from retropositional events. Two alternatively spliced transcript variants encoding the same protein have been found for this gene.
23674	
23675	This gene encodes a protein with multiple ankyrin repeat domains and a single KH-domain. Co-transcription of this gene and the neighboring downstream gene (EIF4EBP3) generates a transcript (MASK-BP3) which encodes a fusion protein comprised of the MASK protein sequence for the majority of the protein and a different C-terminus due to an alternate reading frame for the EIF4EBP3 segments.
23676	
23677	
23678	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein localizes to the nucleus and the cytoplasm. Its function has not been identified.
23679	
23680	
23681	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme is located in the membrane of the endoplasmic reticulum. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
23682	
23683	
23684	
23685	
23686	
23687	This gene encodes a protein with ankyrin repeats, which are associated with protein-protein interactions. Studies in mice suggest that this protein is involved in liver development. Two transcript variants encoding different isoforms have been found for this gene.
23688	
23689	Immunoglobulins recognize foreign antigens and initiate immune responses such as phagocytosis and the complement system. Each immunoglobulin molecule consists of two identical heavy chains and two identical light chains. There are two classes of light chains, kappa and lambda. This region represents the germline organization of the lambda light chain locus. The locus includes V (variable), J (joining), and C (constant) segments. During B cell development, a recombination event at the DNA level joins a single V segment with a J segment; the C segment is later joined by splicing at the RNA level. Recombination of many different V segments with several J segments provides a wide range of antigen recognition. Additional diversity is attained by junctional diversity, resulting from the random additional of nucleotides by terminal deoxynucleotidyltransferase, and by somatic hypermutation, which occurs during B cell maturation in the spleen and lymph nodes. Several V segments and three C segments are known to be incapable of encoding a protein and are considered pseudogenes. The locus also includes several non-immunoglobulin genes, many of which are pseudogenes or are predicted by automated computational analysis or homology to other species.
23690	The first mRNA transcript isolated for this gene was part of an artificial chimera derived from two distinct gene transcripts and a primer used in the cloning process (see Genbank accession M29204). A positively charged amino terminus present only in the chimera was determined to bind GC-rich DNA, thus mistakenly thought to identify a transcription factor gene.
23691	
23692	
23693	Phosphatidylinositol transfer protein is a member of a diverse set of cytosolic phospholipid transfer proteins that are distinguished by their ability to transfer phospholipids between membranes in vitro (Wirtz, 1991).[supplied by OMIM]
23694	
23695	
23696	
23697	
23698	This gene encodes an enzyme that plays a major role in polyamine metabolism and is important for the salvage of both adenine and methionine. The encoded enzyme is deficient in many cancers because this gene and the tumor suppressor p16 gene are co-deleted. Multiple alternatively spliced transcript variants have been described for this gene, but their full-length natures remain unknown.
23699	
23700	
23701	
23702	
23703	
23704	
23705	
23706	
23707	
23708	
23709	
23710	
23711	
23712	
23713	
23714	
23715	
23716	
23717	
23718	
23719	
23720	
23721	
23722	
23723	
23724	The protein encoded by this gene is a neuron-specific activator of CDK5 kinase. It associates with CDK5 to form an active kinase. This protein and neuron-specific CDK5 activator CDK5R1/p39NCK5A both share limited similarity to cyclins, and thus may define a distinct family of cyclin-dependent kinase activating proteins.
23725	
23726	Cytoglobin is a ubiquitously expressed hexacoordinate hemoglobin that may facilitate diffusion of oxygen through tissues, scavenge nitric oxide or other reactive oxygen species, or serve a protective function during oxidative stress (Trent and Hargrove, 2002).[supplied by OMIM]
23727	
23728	
23729	
23730	
23731	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
23732	
23733	
23734	
23735	
23736	
23737	
23738	
23739	
23740	Proteins encoded by the complexin/synaphin gene family are cytosolic proteins that function in synaptic vesicle exocytosis.  These proteins bind syntaxin, part of the SNAP receptor.  The protein product of this gene binds to the SNAP receptor complex and disrupts it, allowing transmitter release.
23741	
23742	Immunoglobulins recognize foreign antigens and initiate immune responses such as phagocytosis and the complement system. Each immunoglobulin molecule consists of two identical heavy chains and two identical light chains. There are two classes of light chains, kappa and lambda. This region represents the germline organization of the lambda light chain locus. The locus includes V (variable), J (joining), and C (constant) segments. During B cell development, a recombination event at the DNA level joins a single V segment with a J segment; the C segment is later joined by splicing at the RNA level. Recombination of many different V segments with several J segments provides a wide range of antigen recognition. Additional diversity is attained by junctional diversity, resulting from the random additional of nucleotides by terminal deoxynucleotidyltransferase, and by somatic hypermutation, which occurs during B cell maturation in the spleen and lymph nodes. Several V segments and three C segments are known to be incapable of encoding a protein and are considered pseudogenes. The locus also includes several non-immunoglobulin genes, many of which are pseudogenes or are predicted by automated computational analysis or homology to other species.
23743	This gene encodes a member of a large protein family which contains the LIM domain, a unique cysteine-rich zinc-binding domain. The encoded protein may function as a transcriptional regulator and be involved in control of differentiation and development of the pituitary gland. Mutations in this gene are associated with syndromic short stature and pituitary and hindbrain defects. An alternative splice variant has been described but its biological nature has not been determined.
23744	
23745	
23746	
23747	
23748	
23749	
23750	
23751	IGSF11 is an immunoglobulin (Ig) superfamily member that is preferentially expressed in brain and testis. It shares significant homology with coxsackievirus and adenovirus receptor (CXADR; MIM 602621) and endothelial cell-selective adhesion molecule (ESAM).[supplied by OMIM]
23752	
23753	
23754	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 non-identical kinase catalytic domains and phosphorylates several residues of the S6 ribosomal protein. The kinase activity of this protein leads to an increase in protein synthesis and cell proliferation. Amplification of the region of DNA encoding this gene and overexpression of this kinase are seen in some breast cancer cell lines. Alternate translational start sites have been described and alternate transcriptional splice variants have been observed but have not been thoroughly characterized.
23755	
23756	
23757	
23758	
23759	
23760	
23761	
23762	
23763	
23764	
23765	
23766	
23767	
23768	
23769	
23770	
23771	
23772	
23773	
23774	
23775	The protein encoded by this gene belongs to the plakin family of cytoskeletal linker proteins. This protein family forms bridges between different cytoskeletal elements through specialized modular domains. The encoded protein is one of the largest size proteins identified in human cytoskeletal proteins. It has functional actin and microtubule binding domains, and it appears to stabilize actin at sites where microtubules and microfilaments meet. It may function in microtubule dynamics to facilitate actin-microtubule interactions at the cell periphery and to couple the microtubule network to cellular junctions. Alternatively spliced transcript variants encoding distinct isoforms have been described.
23776	
23777	
23778	
23779	
23780	
23781	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum. It has both 17alpha-hydroxylase and 17,20-lyase activities and is a key enzyme in the steroidogenic pathway that produces progestins, mineralocorticoids, glucocorticoids, androgens, and estrogens. Mutations in this gene are associated with isolated steroid-17 alpha-hydroxylase deficiency, 17-alpha-hydroxylase/17,20-lyase deficiency, pseudohermaphroditism, and adrenal hyperplasia.
23782	
23783	
23784	
23785	This gene encodes a protein that may function as a ribonuclease. An alternative splice variant has been described but its biological nature has not been determined. A pseudogene for this gene has been identified on chromosome Y, near the sex determining region Y (SRY) gene.
23786	
23787	The protein encoded by this gene is a receptor for interleukin 10. This protein is structurally related to interferon receptors. It has been shown to mediate the immunosuppressive signal of interleukin 10, and thus inhibits the synthesis of proinflammatory cytokines. This receptor is reported to promote survival of progenitor myeloid cells through the insulin receptor substrate-2/PI 3-kinase/AKT pathway. Activation of this receptor leads to tyrosine phosphorylation of JAK1 and TYK2 kinases.
23788	
23789	Otubains are deubiquitylating cysteine proteases (DUBs; see MIM 602519) that belong to the ovarian tumor (OTU) protein superfamily. Like other DUBs, otubains cleave proteins precisely at the ubiquitin (UB; see MIM 191339)-protein bond.[supplied by OMIM]
23790	
23791	
23792	
23793	Activator of apoptosis Hrk regulates apoptosis through interaction with death-repressor proteins Bcl-2 and Bcl-X(L). The HRK protein lacks significant homology to other BCL2 family members except for an 8-amino acid region that was similar to the BCL2 homology domain-3 (BH3) motif of BIK.  HRK interacts with BCL2 and BCLXL via the BH3 domain, but not with the death-promoting BCL2-related proteins BAX, BAK, or BCLXS. HRK localizes to membranes of intracellular organelles in a pattern similar to that previously reported for BCL2 and BCLXL.
23794	
23795	
23796	This gene encodes a protein with protease activity. Its specific function has not been determined. The protein may be useful as a tumor marker.
23797	
23798	
23799	
23800	There are 2 forms of glucose-6-phosphate dehydrogenase. G form is X-linked and H form, encoded by this gene, is autosomally linked. This H form shows activity with other hexose-6-phosphates, especially galactose-6-phosphate, whereas the G form is specific for glucose-6-phosphate. Both forms are present in most tissues, but H form is not found in red cells.
23801	
23802	
23803	
23804	
23805	ETFA participates in catalyzing the initial step of the mitochondrial fatty acid beta-oxidation.  It shuttles electrons between primary flavoprotein dehydrogenases and the membrane-bound electron transfer flavoprotein ubiquinone oxidoreductase. Defects in electron-transfer-flavoprotein have been implicated in type II glutaricaciduria in which multiple acyl-CoA dehydrogenase deficiencies result in large excretion of glutaric, lactic, ethylmalonic, butyric, isobutyric, 2-methyl-butyric, and isovaleric acids.
23806	
23807	
23808	
23809	
23810	
23811	
23812	Tumor necrosis factor (TNF; see MIM 191160) receptor-associated factors, such as TRAF7, are signal transducers for members of the TNF receptor superfamily (see MIM 191190). TRAFs are composed of an N-terminal cysteine/histidine-rich region containing zinc RING and/or zinc finger motifs; a coiled-coil (leucine zipper) motif; and a homologous region that defines the TRAF family, the TRAF domain, which is involved in self-association and receptor binding.[supplied by OMIM]
23813	
23814	
23815	
23816	
23817	
23818	
23819	The product of this gene belongs to the CRE (cAMP response element)-binding protein family. Members of this family contain zinc-finger and bZIP DNA-binding domains. The encoded protein specifically binds to CRE as a homodimer or a heterodimer with c-Jun or CRE-BP1, and functions as a CRE-dependent trans-activator. Alternatively spliced transcript variants encoding different isoforms have been identified.
23820	
23821	Myocardin is a smooth and cardiac muscle-specific transcriptional coactivator of serum response factor. When expressed ectopically in nonmuscle cells, myocardin can induce smooth muscle differentiation by its association with serum response factor (SRF; MIM 600589).[supplied by OMIM]
23822	
23823	
23824	
23825	
23826	
23827	
23828	The metallocarboxypeptidase family of enzymes is divided into 2 subfamilies based on sequence similarities. The pancreatic carboxypeptidase-like and the regulatory B-type carboxypeptidase subfamilies. Carboxypeptidase D has been identified as a regulatory B-type carboxypeptidase. CPD is a homolog of duck gp180, a hepatitis B virus-binding protein. Transcript variants utilizing alternative polyadenylation signals exist for this gene.
23829	
23830	
23831	
23832	
23833	
23834	
23835	
23836	
23837	
23838	This gene contains a predicted ORF that encodes a protein with two zinc finger domains. The function of the encoded protein is not known. Sequence analysis suggests that multiple alternatively spliced transcript variants are derived from this gene but the full-length nature of only two of them is known. These two splice variants encode different isoforms. A pseudogene for this gene is located on Xq28.
23839	
23840	The Wiskott-Aldrich syndrome (WAS) family of proteins share similar domain structure, and are involved in transduction of signals from receptors on the cell surface to the actin cytoskeleton. The presence of a number of different motifs suggests that they are regulated by a number of different stimuli, and interact with multiple proteins. Recent studies have demonstrated that these proteins, directly or indirectly, associate with the small GTPase, Cdc42, known to regulate formation of actin filaments, and the cytoskeletal organizing complex, Arp2/3. The WASL gene product is a homolog of WAS protein, however, unlike the latter, it is ubiquitously expressed and shows highest expression in neural tissues. It has been shown to bind Cdc42 directly, and induce formation of long actin microspikes.
23841	
23842	This gene has a highly complex imprinted expression pattern. It encodes maternally, paternally, and biallelically expressed proteins which are derived from alternatively spliced transcripts with alternate 5@#$%& exons. Each of the upstream exons is within a differentially methylated region, commonly found in imprinted genes. However, the close proximity (14 kb) of two oppositely expressed promoter regions is unusual. In addition, one of the alternate 5@#$%& exons introduces a frameshift relative to the other transcripts, resulting in one isoform which is structurally unrelated to the others. An antisense transcript exists, and may regulate imprinting in this region. Mutations in this gene result in pseudohypoparathyroidism type 1a (PHP1a), which has an atypical autosomal dominant inheritance pattern requiring maternal transmission for full penetrance. There are RefSeqs representing four transcript variants of this gene. Other transcript variants including four additional exons have been described; however, their full length sequences have not been determined.
23843	Multiple variants exist for this gene, three of which have been characterized and found to encode different isoforms. One of the isoforms is a membrane-bound protein with sequence similarity to the mouse mahogany protein, a receptor involved in controlling obesity. The other two characterized isoforms are secreted proteins involved in the initial immune cell clustering during inflammatory responses that may regulate the chemotactic activity of chemokines.
23844	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. The encoded protein may play a role in chondrogenesis. A pseudogene of this gene is located on chromosome 8. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
23845	
23846	
23847	
23848	
23849	
23850	
23851	
23852	
23853	ROCK1 is a protein serine/threonine kinase that is activated when bound to the GTP-bound form of Rho. The small GTPase Rho regulates formation of focal adhesions and stress fibers of fibroblasts, as well as adhesion and aggregation of platelets and lymphocytes by shuttling between the inactive GDP-bound form and the active GTP-bound form. Rho is also essential in cytokinesis and plays a role in transcriptional activation by serum response factor. ROCK, a downstream effector of Rho, phosphorylates and activates LIM kinase, which in turn, phosphorylates cofilin, inhibiting its actin-depolymerizing activity.
23854	
23855	
23856	
23857	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L15E family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with the yeast ribosomal protein YL10 gene. Although this gene has been referred to as RPL10, its official symbol is RPL15. This gene has been shown to be overexpressed in some esophageal tumors compared to normal matched tissues. Transcript variants utilizing alternative polyA signals exist. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
23858	Glyceraldehyde 3-phosphate dehydrogenases (EC 1.2.1.12), such as GAPDS, are NAD-dependent enzymes that remove hydrogen and add phosphate to glyceraldehyde 3-phosphate to form 1,3-diphosphoglycerate.[supplied by OMIM]
23859	
23860	
23861	
23862	
23863	
23864	
23865	
23866	
23867	Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide.  They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid.  They show extensive diversity in tissue distribution and in their subcellular localization.  CA VA is localized in the mitochondria and expressed primarily in the liver.  It may play an important role in ureagenesis and gluconeogenesis.  CA5A gene maps to chromosome 16q24.3 and an unprocessed pseudogene has been assigned to 16p12-p11.2.
23868	
23869	
23870	The protein encoded by this gene is a member of the peptidyl-prolyl cis-trans isomerase (PPIase) family. PPIases catalyze the cis-trans isomerization of proline imidic peptide bonds in oligopeptides and accelerate the folding of proteins. This protein contains a highly conserved cyclophilin (CYP) domain as well as an RNA-binding domain. It was shown to possess PPIase and protein folding activities and also exhibit RNA-binding activity. Three alternatively spliced transcript variants encoding distinct isoforms have been observed.
23871	
23872	
23873	
23874	
23875	
23876	
23877	
23878	The protein encoded by this gene is a member of the TNF-receptor superfamily. This protein and TNF-receptor 1 form a heterocomplex that mediates the recruitment of two anti-apoptotic proteins, c-IAP1 and c-IAP2, which possess E3 ubiquitin ligase activity. The function of IAPs in TNF-receptor signalling is unknown, however, c-IAP1 is thought to potentiate TNF-induced apoptosis by the ubiquitination and degradation of TNF-receptor-associated factor 2, which mediates anti-apoptotic signals. Knockout studies in mice also suggest a role of this protein in protecting neurons from apoptosis by stimulating antioxidative pathways.
23879	
23880	
23881	
23882	
23883	The protein encoded by this gene is the ligand for FAS. Both are transmembrane proteins. Interaction of FAS with this ligand is critical in triggering apoptosis of some types of cells such as lymphocytes. Defects in this gene may be related to some cases of systemic lupus erythematosus (SLE).
23884	
23885	
23886	
23887	
23888	
23889	
23890	
23891	
23892	
23893	
23894	
23895	
23896	The calpains, calcium-activated neutral proteases, are nonlysosomal, intracellular cysteine proteases. The mammalian calpains include ubiquitous, stomach-specific, and muscle-specific proteins. The ubiquitous enzymes consist of heterodimers with distinct large, catalytic subunits associated with a common small, regulatory subunit. This gene encodes the large subunit of the ubiquitous enzyme, calpain 1.
23897	The signal sequence receptor (SSR) is a glycosylated endoplasmic reticulum (ER) membrane receptor associated with protein translocation across the ER membrane. The SSR consists of 2 subunits, a 34-kD glycoprotein encoded by this gene and a 22-kD glycoprotein. This gene generates several mRNA species as a result of complex alternative polyadenylation. This gene is unusual in that it utilizes arrays of polyA signal sequences that are exclusively non-canonical.
23898	
23899	
23900	
23901	
23902	
23903	
23904	
23905	
23906	
23907	
23908	The heterotetrameric adaptor protein (AP) complexes sort integral membrane proteins at various stages of the endocytic and secretory pathways. AP4 is composed of 2 large chains, beta-4 (AP4B1; MIM 607245) and epsilon-4 (AP4E1; MIM 607244), a medium chain, mu-4 (AP4M1; MIM 602296), and a small chain, sigma-4 (AP4S1).[supplied by OMIM]
23909	
23910	
23911	This gene is an ortholog of the mouse jumonji gene, which encodes a nuclear protein essential for mouse embryogenesis, including neural tube formation. Overexpression of mouse jumonji negatively regulates cell proliferation. The jumonji proteins contain a DNA-binding domain, called an AT-rich interaction domain (ARID), and share regions of similarity with human retinoblastoma-binding protein-2 and the human SMCX protein.
23912	
23913	
23914	The enzyme encoded by this gene is an important intermediary in mammalian carbohydrate interconversions. It transfers a glucose moiety from glucose-1-phosphate to MgUTP and forms UDP-glucose and MgPPi. In liver and muscle tissue, UDP-glucose is a direct precursor of glycogen; in lactating mammary gland it is converted to UDP-galactose which is then converted to lactose. The eukaryotic enzyme has no significant sequence similarity to the prokaryotic enzyme. Two transcript variants encoding different isoforms have been found for this gene.
23915	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit and a member of the S14P family of ribosomal proteins. The protein, which contains a C2-C2 zinc finger-like domain that can bind to zinc, can enhance the tumor suppressor activity of Ras-related protein 1A (KREV1). It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
23916	
23917	
23918	
23919	
23920	
23921	The protein encoded by this gene is a type III glycoprotein that is located primarily in limiting membranes of lysosomes and endosomes. Studies of the similar protein in mice and rat suggested that this protein may participate in membrane transportation and the reorganization of endosomal/lysosomal compartment. Deficiency of the similar protein in mice was reported to impair cell membrane transport processes and cause pelvic junction obstruction, deafness, and peripheral neuropathy.
23922	
23923	
23924	
23925	
23926	
23927	
23928	
23929	
23930	
23931	
23932	
23933	
23934	CD109 is a GPI-linked cell surface antigen expressed by CD34+ acute myeloid leukemia cell lines, T-cell lines, activated T lymphoblasts, endothelial cells, and activated platelets (Lin et al., 2002). In addition, the platelet-specific Gov antigen system, implicated in refractoriness to platelet transfusion, neonatal alloimmune thrombocytopenia, and posttransfusion purpura, is carried by CD109 (Kelton et al., 1990; Lin et al., 2002).[supplied by OMIM]
23935	
23936	
23937	
23938	
23939	
23940	
23941	
23942	
23943	
23944	
23945	
23946	
23947	
23948	
23949	Gap junction protein, alpha 1 is a member of the connexin gene family and a component of gap junctions. Gap junctions are composed of arrays of intercellular channels and provide a route for the diffusion of materials of low molecular weight from cell to cell. Connexin 43 is the major protein of gap junctions in the heart, and gap junctions are thought to have a crucial role in the synchronized contraction of the heart and in embryonic development. Connexin 43 is targeted by several protein kinases that regulate myocardial cell-cell coupling. A related intron-less connexin 43 pseudogene, GJA1P, has been mapped to chromosome 5.
23950	
23951	
23952	
23953	
23954	
23955	
23956	Ephrin receptors and their ligands, the ephrins, mediate numerous developmental processes, particularly in the nervous system. Based on their structures and sequence relationships, ephrins are divided into the ephrin-A (EFNA) class, which are anchored to the membrane by a glycosylphosphatidylinositol linkage, and the ephrin-B (EFNB) class, which are transmembrane proteins. The Eph family of receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands. Ephrin receptors make up the largest subgroup of the receptor tyrosine kinase (RTK) family. The protein encoded by this gene is a receptor for ephrin-B family members.
23957	Cancer-associated chromosomal changes often involve regions containing fragile sites. This gene maps to a commom fragile site on chromosome 7q31.2 designated FRA7G. This gene is similar to mouse Testin, a testosterone-responsive gene encoding a Sertoli cell secretory protein containing three LIM domains. LIM domains are double zinc-finger motifs that mediate protein-protein interactions between transcription factors, cytoskeletal proteins and signaling proteins. Multiple protein isoforms are encoded by transcript variants of this gene.
23958	
23959	
23960	
23961	
23962	Gamma-aminobutyric acid A receptors [GABA(A) receptors] are ligand-gated chloride channels that mediate inhibitory neurotransmission. This gene encodes GABA(A) receptor-associated protein, which is highly positively charged in its N-terminus and shares sequence similarity with light chain-3 of microtubule-associated proteins 1A and 1B. This protein clusters neurotransmitter receptors by mediating interaction with the cytoskeleton.
23963	
23964	
23965	
23966	
23967	
23968	
23969	
23970	
23971	
23972	
23973	
23974	
23975	
23976	This gene is located in the surfeit gene cluster, a group of very tightly linked housekeeping genes that do not share sequence similarity. The gene is oriented in a head-to-head fashion with RPL7A (SURF3) and the two genes share a bidirectional promoter. The encoded proteins are localized to the cytoplasm. Multiple alternative transcript variants encoding multiple isoforms have been identified for this gene.
23977	
23978	
23979	X-linked inhibitor of apoptosis (XIAP; MIM 300079) is a potent member of the IAP family. All members of this family possess baculoviral IAP (BIR) repeats, cysteine-rich domains of approximately 80 amino acids that bind and inhibit caspases (e.g., CASP3; MIM 600636). XIAP has 3 BIR domains and a C-terminal RING zinc finger that possesses E3 ubiquitin ligase (see MIM 601623) activity. XAF1 antagonizes the anticaspase activity of XIAP and may be important in mediating apoptosis resistance in cancer cells (Liston et al., 2001 [PubMed 11175744]).[supplied by OMIM]
23980	
23981	
23982	
23983	
23984	
23985	
23986	
23987	
23988	
23989	
23990	
23991	
23992	The protein encoded by this gene is a member of the zinc finger family of proteins. The gene product contains C2H2 type domains, which are the classical zinc finger domains found in numerous nucleic acid-binding proteins. In addition to the monocistronic transcript originating from this locus, a co-transcribed variant composed of ZFP91 and CNTF sequence has been identified. The monocistronic and co-transcribed variants encode distinct isoforms. The co-transcription of ZFP91 and CNTF has also been observed in mouse.
23993	
23994	
23995	
23996	
23997	
23998	
23999	
24000	
24001	This gene encodes a protein with a proline-rich domain. This gene is located in a region of chromosome 22 reported to contain a tumor suppressor gene that may be involved in breast and colorectal tumorigenesis. Rare read-through transcripts, containing exons from the ARHGAP8 gene which is located immediately downstream, led to the original description of PRR5 and ARHGAP8 as a single gene. Alternative splicing and the use of alternative promoters results in transcripts encoding different isoforms.
24002	cAMP response element (CRE)-binding protein-like-2 (CREBL2) was identified in a search to find genes in a commonly deleted region on chromosome 12p13 flanked by ETV6 and CDKN1B genes, frequently associated with hematopoietic malignancies, as well as breast, non-small-cell lung and ovarian cancers. CREBL2 shares a 41% identity with CRE-binding protein (CREB) over a 48-base long region which encodes the bZip domain of CREB. The bZip domain consists of about 30 amino acids rich in basic residues involved in DNA binding, followed by a leucine zipper motif involved in protein dimerization. This suggests that CREBL2 encodes a protein with DNA binding capabilities. The occurance of CREBL2 deletion in malignancy suggests that CREBL2 may act as a tumor suppressor gene.
24003	The MMAB gene encodes cob(I)alamin adenosyltransferase (EC 2.5.1.17), which catalyzes the final step in the synthesis of the coenzyme adenosylcobalamin (AdoCbl). AdoCbl is a vitamin B12-containing coenzyme for methylmalonyl-CoA mutase (MIM 609058).[supplied by OMIM]
24004	
24005	Calcium-dependent membrane-binding proteins may regulate molecular events at the interface of the cell membrane and cytoplasm. This gene is one of several genes that encode a calcium-dependent protein containing two N-terminal type II C2 domains and an integrin A domain-like sequence in the C-terminus.
24006	Myosin phosphatase target subunit 1, which is also called the myosin-binding subunit of myosin phosphatase, is one of the subunits of myosin phosphatase. Myosin phosphatase regulates the interaction of actin and myosin downstream of the guanosine triphosphatase Rho. The small guanosine triphosphatase Rho is implicated in myosin light chain (MLC) phosphorylation, which results in contraction of smooth muscle and interaction of actin and myosin in nonmuscle cells. The guanosine triphosphate (GTP)-bound, active form of RhoA (GTP.RhoA) specifically interacted with the myosin-binding subunit (MBS) of myosin phosphatase, which regulates the extent of phosphorylation of MLC. Rho-associated kinase (Rho-kinase), which is activated by GTP. RhoA, phosphorylated MBS and consequently inactivated myosin phosphatase. Overexpression of RhoA or activated RhoA in NIH 3T3 cells increased phosphorylation of MBS and MLC. Thus, Rho appears to inhibit myosin phosphatase through the action of Rho-kinase.
24007	
24008	
24009	
24010	Fibrillar collagens, such as COL27A1, compose one of the most ancient families of extracellular matrix molecules. They form major structural elements in extracellular matrices of cartilage, skin, and tendon (Boot-Handford et al., 2003).[supplied by OMIM]
24011	This gene encodes a C3H-type zinc finger protein, which is similar to the Drosophila melanogaster muscleblind B protein. Drosophila muscleblind is a gene required for photoreceptor differentiation. Several alternatively spliced transcript variants have been described but the full-length natures of only some have been determined.
24012	
24013	
24014	
24015	
24016	This gene encodes a member of the SLC2A facilitative glucose transporter family. Members of this family play a significant role in maintaining glucose homeostasis. The encoded protein may play a role in the development and survival of chondrocytes in cartilage matrices. Two transcript variants encoding distinct isoforms have been identified for this gene.
24017	Hypoxia-inducible factor-1 (HIF1) is a transcription factor found in mammalian cells cultured under reduced oxygen tension that plays an essential role in cellular and systemic homeostatic responses to hypoxia. HIF1 is a heterodimer composed of an alpha subunit and a beta subunit. The beta subunit has been identified as the aryl hydrocarbon receptor nuclear translocator (ARNT). This gene encodes the alpha subunit of HIF-1. Overexpression of a natural antisense transcript (aHIF) of this gene has been shown to be associated with nonpapillary renal carcinomas. Two alternative transcripts encoding different isoforms have been identified.
24018	
24019	
24020	
24021	The protein encoded by this gene is a lysosomal cysteine proteinase composed of a dimer of disulfide-linked heavy and light chains, both produced from a single protein precursor. It is also known as amyloid precursor protein secretase and is involved in the proteolytic processing of amyloid precursor protein (APP). Incomplete proteolytic processing of APP has been suggested to be a causative factor in Alzheimer disease, the most common cause of dementia. Overexpression of the encoded protein, which is a member of the peptidase C1 family, has been associated with esophageal adenocarcinoma and other tumors. At least five transcript variants encoding the same protein have been found for this gene.
24022	
24023	
24024	DAD1, the defender against apoptotic cell death, was initially identified as a negative regulator of programmed cell death in the temperature sensitive tsBN7 cell line.  The DAD1 protein disappeared in temperature-sensitive cells following a shift to the nonpermissive temperature, suggesting that loss of the DAD1 protein triggered apoptosis.  DAD1 is believed to be a tightly associated subunit of oligosaccharyltransferase both in the intact membrane and in the purified enzyme, thus reflecting the essential nature of N-linked glycosylation in eukaryotes.
24025	
24026	
24027	
24028	
24029	
24030	This gene encodes a predicted transmembrane protein containing two extracellular CUB domains followed by a low-density lipoprotein class A (LDLa) domain. It also has an intracellular FXNPXY-like motif, which has been shown in other proteins to be essential for the internalization of clathrin coated pits during endocytosis. Alternatively spliced transcript variants encoding distinct proteins have been described.
24031	
24032	
24033	
24034	
24035	The biosynthesis of lipid-linked oligosaccharides is highly conserved among eukaryotes and is catalyzed by 14 glycosyltransferases in an ordered stepwise manner. Mannosyltransferase I (MT I) catalyzes the first mannosylation step in this process.[supplied by OMIM]
24036	
24037	
24038	
24039	
24040	
24041	This gene encodes an F-box protein which functions as a substrate recognition component of the SCF ubiquitin ligase complex. It binds to cyclin F, S-phase kinase-associated protein 2, and other regulatory proteins involved in ubiquitin proteolysis through an F-box motif. The encoded protein also collaborates with a network of proteins to control beta-catenin levels and affects the activity level of beta-catenin dependent TCF transcription factors. Studies have also characterized the protein as an RNA polymerase II elongation factor. Alternative splicing of this gene results in two transcript variants. A related pseudogene has been identified on chromosome 7.
24042	
24043	
24044	ADP-ribosylation factor 1 (ARF1) is a member of the human ARF gene family. The family members encode small guanine nucleotide-binding proteins that stimulate the ADP-ribosyltransferase activity of cholera toxin and play a role in vesicular trafficking as activators of phospholipase D. The gene products, including 6 ARF proteins and 11 ARF-like proteins, constitute a family of the RAS superfamily. The ARF proteins are categorized as class I (ARF1, ARF2 and ARF3), class II (ARF4 and ARF5) and class III (ARF6), and members of each class share a common gene organization. The ARF1 protein is localized to the Golgi apparatus and has a central role in intra-Golgi transport. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene.
24045	
24046	
24047	
24048	
24049	
24050	
24051	
24052	
24053	
24054	
24055	
24056	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and a SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation.
24057	
24058	
24059	
24060	
24061	
24062	
24063	
24064	Peroxisome proliferators include hypolipidemic drugs, herbicides, leukotriene antagonists, and plasticizers; this term arises because they induce an increase in the size and number of peroxisomes. Peroxisomes are subcellular organelles found in plants and animals which contain enzymes for respiration and for cholesterol and lipid metabolism. The action of peroxisome proliferators is thought to be mediated via specific receptors, called PPARs, which belong to the steroid hormone receptor superfamily. PPARs affect the expression of target genes involved in cell proliferation, cell differentiation and in immune and inflammation responses. Three closely related subtypes (alpha, beta/delta, and gamma) have been identified. This gene encodes the subtype PPAR-alpha, which is a nuclear transcription factor. Multiple alternatively spliced transcript variants have been described for this gene, although the full-nature of only some have been defined.
24065	This gene encodes a member of the delta protein ligand family. This family functions as Notch ligands that are characterized by a DSL domain, EGF repeats, and a transmembrane domain. Mutations in this gene cause autosomal recessive spondylocostal dysostosis 1. Two transcript variants encoding distinct isoforms have been identified for this gene.
24066	
24067	
24068	
24069	
24070	
24071	
24072	
24073	
24074	
24075	
24076	
24077	
24078	
24079	
24080	
24081	
24082	S-adenosylhomocysteine hydrolase catalyzes the reversible hydrolysis of S-adenosylhomocysteine (AdoHcy) to adenosine (Ado) and L-homocysteine (Hcy). Thus, it regulates the intracellular S-adenosylhomocysteine (SAH) concentration thought to be important for transmethylation reactions. Deficiency in this protein is one of the different causes of hypermethioninemia. S-adenosylhomocysteine hydrolase belongs to the adenosylhomocysteinase family.
24083	
24084	
24085	
24086	
24087	
24088	
24089	
24090	
24091	
24092	
24093	
24094	
24095	
24096	Decapping is a key step in general and regulated mRNA decay. The protein encoded by this gene is a decapping enzyme. This protein and another decapping enzyme form a decapping complex, which interacts with the nonsense-mediated decay factor hUpf1 and may be recruited to mRNAs containing premature termination codons. This protein also participates in the TGF-beta signaling pathway.
24097	Oncogene FLT belongs to the src gene family and is related to oncogene ROS (MIM 165020). Like other members of this family, it shows tyrosine protein kinase activity that is important for the control of cell proliferation and differentiation. The sequence structure of the FLT gene resembles that of the FMS gene (MIM 164770); hence, Yoshida et al. (1987) proposed the name FLT as an acronym for FMS-like tyrosine kinase.[supplied by OMIM]
24098	Cell surface heparan sulfate proteoglycans are composed of a membrane-associated protein core substituted with a variable number of heparan sulfate chains. Members of the glypican-related integral membrane proteoglycan family (GRIPS) contain a core protein anchored to the cytoplasmic membrane via a glycosyl phosphatidylinositol linkage. These proteins may play a role in the control of cell division and growth regulation. Deletion mutations in this gene are associated with Simpson-Golabi-Behmel syndrome.
24099	
24100	
24101	
24102	
24103	Albumin is a soluble, monomeric protein which comprises about one-half of the blood serum protein. Albumin functions primarily as a carrier protein for steroids, fatty acids, and thyroid hormones and plays a role in stabilizing extracellular fluid volume. Mutations in this gene on chromosome 4 result in various anomalous proteins. Albumin is a globular unglycosylated serum protein of molecular weight 65,000. The human albumin gene is 16,961 nucleotides long from the putative @#$%&cap@#$%& site to the first poly(A) addition site. It is split into 15 exons which are symmetrically placed within the 3 domains that are thought to have arisen by triplication of a single primordial domain. Albumin is synthesized in the liver as preproalbumin which has an N-terminal peptide that is removed before the nascent protein is released from the rough endoplasmic reticulum. The product, proalbumin, is in turn cleaved in the Golgi vesicles to produce the secreted albumin.
24104	
24105	The members of the zinc fingers and homeoboxes gene family are nuclear homodimeric transcriptional repressors that interact with the A subunit of nuclear factor-Y (NF-YA) and contain two C2H2-type zinc fingers and five homeobox DNA-binding domains. This gene encodes member 3 of this gene family. In addition to forming homodimers, this protein heterodimerizes with member 1 of the zinc fingers and homeoboxes family.
24106	
24107	
24108	This gene encodes a form of adenylate cyclase expressed in brain. A similar protein in mouse is involved in pattern formation of the brain.
24109	
24110	
24111	
24112	
24113	
24114	
24115	
24116	
24117	
24118	
24119	
24120	
24121	
24122	
24123	
24124	
24125	
24126	
24127	
24128	
24129	
24130	
24131	
24132	
24133	
24134	
24135	
24136	
24137	
24138	
24139	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
24140	
24141	
24142	This gene belongs to a family of Sp1 related genes that encode transcription factors that regulate transcription by binding to consensus GC- and GT-box regulatory elements in target genes. This protein contains a zinc finger DNA-binding domain and several transactivation domains, and has been reported to function as a bifunctional transcription factor that either stimulates or represses the transcription of numerous genes. Transcript variants encoding different isoforms have been described for this gene, and one has been reported to initiate translation from a non-AUG (AUA) start codon. Additional isoforms, resulting from the use of alternate downstream translation initiation sites, have also been noted.
24143	
24144	
24145	
24146	
24147	
24148	
24149	The biosynthesis of lipid-linked oligosaccharides is highly conserved among eukaryotes and is catalyzed by 14 glycosyltransferases in an ordered stepwise manner. Mannosyltransferase I (MT I) catalyzes the first mannosylation step in this process.[supplied by OMIM]
24150	This gene encodes the heavy subunit of the ferritin protein. Ferritin is the major intracellular iron storage protein in prokaryotes and eukaryotes. It is composed of 24 subunits of the heavy and light ferritin chains. Variation in ferritin subunit composition may affect the rates of iron uptake and release in different tissues. A major function of ferritin is the storage of iron in a soluble and nontoxic state. Defects in ferritin proteins are associated with several neurodegenerative diseases. This gene has multiple pseudogenes. Several alternatively spliced transcript variants have been observed, but their biological validity has not been determined.
24151	
24152	
24153	
24154	
24155	
24156	
24157	
24158	
24159	
24160	
24161	
24162	
24163	
24164	
24165	
24166	The protein encoded by this gene is a cytokine that belongs to the tumor necrosis factor (TNF) ligand family. This cytokine is a ligand for TNFRSF9/4-1BB, which is a costimulatory molecule in T lymphocytes. This cytokine and its receptor are involved in the antigen presentation process, and in the generation of cytotoxic T cells. This cytokine is also shown to be required for the optimal CD8 responses in CD8 T cells. This cytokine is found to be expressed in various carcinoma cell lines, and is thought to be involved in T cell-tumor cell interaction.
24167	
24168	
24169	
24170	
24171	Guanine nucleotide-binding proteins are a family of heterotrimeric proteins that couple cell surface, 7-transmembrane domain receptors to intracellular signaling pathways. Receptor activation catalyzes the exchange of GTP for GDP bound to the inactive G protein alpha subunit resulting in a conformational change and dissociation of the complex. The G protein alpha and beta-gamma subunits are capable of regulating various cellular effectors. Activation is terminated by a GTPase intrinsic to the G-alpha subunit. G-alpha-q is the alpha subunit of one of the heterotrimeric GTP-binding proteins that mediates stimulation of phospholipase C-beta (MIM 600230).[supplied by OMIM]
24172	
24173	
24174	
24175	
24176	
24177	The protein encoded by this gene is a cytokine that belongs to the tumor necrosis factor (TNF) ligand family. This cytokine is a ligand for TNFRSF8/CD30, which is a cell surface antigen and a marker for Hodgkin lymphoma and related hematologic malignancies. The engagement of this cytokine expressed on B cell surface plays an inhibitory role in modulating Ig class switch. This cytokine was shown to enhance cell proliferation of some lymphoma cell lines, while to induce cell death and reduce cell proliferation of other lymphoma cell lines. The pleiotropic biologic activities of this cytokine on different CD30+ lymphoma cell lines may play a pathophysiologic role in Hodgkin@#$%&s and some non-Hodgkin@#$%&s lymphomas.
24178	This gene encodes subunit 3 of the splicing factor 3a protein complex. The splicing factor 3a heterotrimer includes subunits 1, 2 and 3 and is necessary for the in vitro conversion of 15S U2 snRNP into an active 17S particle that performs pre-mRNA splicing. Subunit 3 interacts with subunit 1 through its amino-terminus while the zinc finger domain of subunit 3 plays a role in its binding to the 15S U2 snRNP. This gene has a pseudogene on chromosome 20.
24179	
24180	
24181	
24182	The biosynthesis of lipid-linked oligosaccharides is highly conserved among eukaryotes and is catalyzed by 14 glycosyltransferases in an ordered stepwise manner. Mannosyltransferase I (MT I) catalyzes the first mannosylation step in this process.[supplied by OMIM]
24183	
24184	
24185	
24186	
24187	
24188	
24189	
24190	
24191	
24192	
24193	
24194	
24195	
24196	
24197	
24198	
24199	
24200	
24201	
24202	
24203	
24204	
24205	
24206	
24207	
24208	
24209	
24210	
24211	
24212	
24213	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene contains introns and its mRNA is polyadenylated, unlike most histone genes. The protein encoded is a replication-independent member of the histone H3 family.
24214	
24215	
24216	
24217	
24218	The protein encoded by this gene is the human ortholog of bovine and rat dipeptidylpeptidase VI. They are transmembrane proteins and likely members of the dipeptidyl aminopeptidase family. Two transcript variants encoding different isoforms have been identified in all 3 species. In bovine and rat, the longer form has been shown to be expressed almost exclusively in brain, whereas the shorter form has a more widespread expression.
24219	
24220	
24221	
24222	
24223	
24224	
24225	
24226	
24227	
24228	
24229	
24230	
24231	
24232	
24233	This gene encodes a dual serine/threonine and tyrosine protein kinase which is expressed in multiple tissues. Multiple alternatively spliced transcript variants have been found, but the biological validity of some variants has not been determined.
24234	
24235	The protein encoded by this gene is a homolog of the protein transgelin, which is one of the earliest markers of differentiated smooth muscle. The function of this protein has not yet been determined.
24236	
24237	
24238	NRP1 is a membrane-bound coreceptor to a tyrosine kinase receptor for both vascular endothelial growth factor (VEGF; MIM 192240) and semaphorin (see SEMA3A; MIM 603961) family members. NRP1 plays versatile roles in angiogenesis, axon guidance, cell survival, migration, and invasion.[supplied by OMIM]
24239	
24240	
24241	
24242	
24243	
24244	The protein encoded by this gene is an integral membrane ATPase. The encoded protein is probably phosphorylated in its intermediate state and likely drives the transport of ions such as calcium across membranes. Two transcript variants encoding different isoforms have been found for this gene.
24245	
24246	
24247	
24248	
24249	
24250	
24251	
24252	
24253	
24254	
24255	
24256	
24257	
24258	
24259	
24260	
24261	Nucleobindin-2 is a calcium-binding EF-hand protein.[supplied by OMIM]
24262	
24263	
24264	
24265	
24266	
24267	
24268	
24269	
24270	This gene encodes a calcium-binding protein belonging to the penta-EF-hand protein family. Calcium binding is important for homodimerization and for conformational changes required for binding to other protein partners. This gene product participates in T cell receptor-,  Fas-, and glucocorticoid-induced programmed cell death. In mice deficient for this gene product, however, apoptosis was not blocked suggesting this gene product is functionally redundant.
24271	
24272	
24273	JAK3 encodes Janus kinase 3, a tyrosine kinase that belongs to the Janus family. JAK3 functions in signal transduction and interacts with members of the STAT (signal transduction and activators of transcription) family. JAK3 is predominantly expressed in immune cells and transduces a signal in response to its activation via tyrosine phosphorylation by interleukin receptors. Mutations that abrogate Janus kinase 3 function cause an autosomal SCID (severe combined immunodeficiency disease).
24274	
24275	This gene encodes a cytoplasmic protein which contains SH3 domains. This protein is a member of a family of proteins involved in clathrin-mediated endocytosis. Intersectin 2 is thought to regulate the formation of clathrin-coated vesicles and also may function in the induction of T cell antigen receptor (TCR) endocytosis. Alternatively spliced transcript variants have been found for this gene that encode three distinct isoforms. Additional variants have been found but their full length nature has not been determined.
24276	
24277	
24278	This gene encodes one of the major intermediate filament proteins of mature astrocytes. It is used as a marker to distinguish astrocytes from other glial cells during development. Mutations in this gene cause Alexander disease, a rare disorder of astrocytes in the central nervous system. An additional transcript variant has been described, but its full length sequence has not been determined.
24279	
24280	
24281	This gene encodes a nuclear protein which interacts with the tumor suppressor retinoblastoma 1. The two interacting proteins are thought to act as a transcriptional repressor for promoters which are activated by the E2F1 transcription factor. This protein contains a Kruppel-associated box (KRAB), which is a transcriptional repressor motif.
24282	
24283	In acute myeloid leukemia, especially in the M2 subtype, the t(8;21)(q22;q22) translocation is one of the most frequent karyotypic abnormalities. The translocation produces a chimeric gene made up of the 5@#$%&-region of the RUNX1 (AML1) gene fused to the 3@#$%&-region of the CBFA2T1 (MTG8) gene. The chimeric protein is thought to associate with the nuclear corepressor/histone deacetylase complex to block hematopoietic differentiation. The protein encoded by this gene binds to the AML1-MTG8 complex and may be important in promoting leukemogenesis. Several transcript variants are thought to exist for this gene, but the full-length natures of only two have been described.
24284	
24285	Integrins are heterodimeric proteins made up of alpha and beta subunits. At least 18 alpha and 8 beta subunits have been described in mammals. Integrin family members are membrane receptors involved in cell adhesion and recognition in a variety of processes including embryogenesis, hemostasis, tissue repair, immune response and metatastatic diffusion of tumor cells. The protein encoded by this gene is a beta subunit. Six alternatively spliced variants have been found for this gene which encode five proteins with alternate carboxy termini.
24286	
24287	
24288	
24289	
24290	
24291	
24292	
24293	
24294	
24295	
24296	
24297	
24298	
24299	
24300	
24301	
24302	
24303	
24304	
24305	
24306	
24307	
24308	
24309	
24310	
24311	
24312	
24313	
24314	
24315	
24316	
24317	
24318	
24319	
24320	
24321	
24322	
24323	Functions of the cell cortex, including motility, adhesion, and cytokinesis, are mediated by the reorganization of the actin cytoskeleton and recent evidence suggests a role for the Formin homology (FH) proteins in these processes. The protein encoded by this gene contains FH domains and belongs to a novel FH protein subfamily implicated in cell polarity. Wnt/Fz signaling activates the small GTPase Rho, a key regulator of cytoskeleton architecture, to control cell polarity and movement during development. Activation requires Dvl-Rho complex formation, an assembly mediated by this gene product, which is thought to function as a scaffolding protein. Evidence of alternative splicing has been observed for this gene but the full-length nature of these variants has not been determined.
24324	
24325	
24326	
24327	
24328	The protein encoded by this gene belongs to the thrombospondin family. It is a disulfide-linked homotrimeric glycoprotein that mediates cell-to-cell and cell-to-matrix interactions. This protein has been shown to function as a potent inhibitor of tumor growth and angiogenesis. Studies of the mouse counterpart suggest that this protein may modulate the cell surface properties of mesenchymal cells and be involved in cell adhesion and migration.
24329	
24330	
24331	
24332	
24333	
24334	
24335	
24336	
24337	
24338	This gene encodes an alpha integrin. Integrins are heterodimeric integral membrane glycoproteins composed of an alpha chain and a beta chain that mediate cell-cell and cell-matrix adhesion. The protein encoded by this gene, when bound to the beta 1 chain, forms an integrin that is a receptor for VCAM1, cytotactin and osteopontin. Expression of this gene has been found to be upregulated in small cell lung cancers.
24339	
24340	
24341	
24342	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein localizes to the endoplasmic reticulum; its substrate has not yet been determined. This gene, which produces two transcript variants, is part of a large cluster of cytochrome P450 genes from the CYP2A, CYP2B and CYP2F subfamilies on chromosome 19q.
24343	
24344	Beta-catenin is an adherens junction protein. Adherens junctions (AJs; also called the zonula adherens) are critical for the establishment and maintenance of epithelial layers, such as those lining organ surfaces. AJs mediate adhesion between cells, communicate a signal that neighboring cells are present, and anchor the actin cytoskeleton. In serving these roles, AJs regulate normal cell growth and behavior. At several stages of embryogenesis, wound healing, and tumor cell metastasis, cells form and leave epithelia. This process, which involves the disruption and reestablishment of epithelial cell-cell contacts, may be regulated by the disassembly and assembly of AJs. AJs may also function in the transmission of the @#$%&contact inhibition@#$%& signal, which instructs cells to stop dividing once an epithelial sheet is complete.[supplied by OMIM]
24345	
24346	Acetylcholinesterase hydrolyzes the neurotransmitter, acetylcholine at neuromuscular junctions and brain cholinergic synapses, and thus terminates signal transmission. It is also found on the red blood cell membranes, where it constitutes the Yt blood group antigen. Acetylcholinesterase exists in multiple molecular forms which possess similar catalytic properties, but differ in their oligomeric assembly and mode of cell attachment to the cell surface. It is encoded by the single ACHE gene, and the structural diversity in the gene products arises from alternative mRNA splicing, and post-translational associations of catalytic and structural subunits. The major form of acetylcholinesterase found in brain, muscle and other tissues is the hydrophilic species, which forms disulfide-linked oligomers with collagenous, or lipid-containing structural subunits. The other, alternatively spliced form, expressed primarily in the erythroid tissues, differs at the C-terminal end, and contains a cleavable hydrophobic peptide with a GPI-anchor site. It associates with the membranes through the phosphoinositide (PI) moieties added post-translationally.
24347	
24348	
24349	Angiopoietins are proteins with important roles in vascular development and angiogenesis. All angiopoietins bind with similar affinity to an endothelial cell-specific tyrosine-protein kinase receptor. The protein encoded by this gene is a secreted glycoprotein that activates the receptor by inducing its tyrosine phosphorylation. It plays a critical role in mediating reciprocal interactions between the endothelium and surrounding matrix and mesenchyme. The protein also contributes to blood vessel maturation and stability, and may be involved in early development of the heart. Two transcript variants encoding different isoforms have been found for this gene.
24350	
24351	The BMP1 locus encodes a protein that is capable of inducing formation of cartilage in vivo.  Although other bone morphogenetic proteins are members of the TGF-beta superfamily, BMP1 encodes a protein that is not closely related to other known growth factors. BMP1 protein and procollagen C proteinase (PCP), a secreted metalloprotease requiring calcium and needed for cartilage and bone formation, are identical. PCP or BMP1 protein cleaves the C-terminal propeptides of procollagen I, II, and III and its activity is increased by the procollagen C-endopeptidase enhancer protein.  The BMP1 gene is expressed as alternatively spliced variants that share an N-terminal protease domain but differ in their C-terminal region
24352	
24353	
24354	
24355	
24356	
24357	
24358	
24359	
24360	
24361	CENTG2 belongs to an ADP-ribosylation factor GTPase-activating (ARF-GAP) protein family involved in membrane traffic and actin cytoskeleton dynamics (Nie et al., 2002).[supplied by OMIM]
24362	MaxiK channels are large conductance, voltage and calcium-sensitive potassium channels which are fundamental to the control of smooth muscle tone and neuronal excitability. MaxiK channels can be formed by 2 subunits: the pore-forming alpha subunit, which is the product of this gene, and the modulatory beta subunit. Intracellular calcium regulates the physical association between the alpha and beta subunits. Alternatively spliced transcript variants encoding different isoforms have been identified.
24363	
24364	
24365	
24366	
24367	
24368	Several different potassium channels are known to be involved with electrical signaling in the nervous system. One class is activated by depolarization whereas a second class is not. The latter are referred to as inwardly rectifying K+ channels, and they have a greater tendency to allow potassium to flow into the cell rather than out of it. This asymmetry in potassium ion conductance plays a key role in the excitability of muscle cells and neurons. The protein encoded by this gene is an integral membrane protein and member of the inward rectifier potassium channel family. The encoded protein has a small unitary conductance compared to other members of this protein family. Two transcript variants encoding the same protein have been found for this gene.
24369	
24370	
24371	
24372	
24373	
24374	
24375	
24376	
24377	
24378	
24379	
24380	
24381	
24382	
24383	
24384	The protein encoded by this gene is a cytoplasmic enzyme involved in energy homeostasis. The encoded protein reversibly catalyzes the transfer of phosphate between ATP and various phosphogens such as creatine phosphate. It acts as a homodimer in brain as well as in other tissues, and as a heterodimer with a similar muscle isozyme in heart. The encoded protein is a member of the ATP:guanido phosphotransferase protein family. A pseudogene of this gene has been characterized.
24385	
24386	
24387	
24388	
24389	
24390	
24391	
24392	
24393	
24394	
24395	
24396	Mitochondrial creatine kinase (MtCK) is responsible for the transfer of high energy phosphate from mitochondria to the cytosolic carrier, creatine. It belongs to the creatine kinase isoenzyme family. It exists as two isoenzymes, sarcomeric MtCK and ubiquitous MtCK, encoded by separate genes. Mitochondrial creatine kinase occurs in two different oligomeric forms: dimers and octamers, in contrast to the exclusively dimeric cytosolic creatine kinase isoenzymes. Sarcomeric mitochondrial creatine kinase has 80% homology with the coding exons of ubiquitous mitochondrial creatine kinase. This gene contains sequences homologous to several motifs that are shared among some nuclear genes encoding mitochondrial proteins and thus may be essential for the coordinated activation of these genes during mitochondrial biogenesis.
24397	
24398	
24399	
24400	
24401	
24402	
24403	
24404	
24405	
24406	
24407	
24408	
24409	
24410	
24411	
24412	
24413	
24414	
24415	
24416	
24417	
24418	
24419	
24420	
24421	
24422	This gene encodes a progestin-induced protein, which belongs to the HECT (homology to E6-AP carboxyl terminus) family. The HECT family proteins function as E3 ubiquitin-protein ligases, targeting specific proteins for ubiquitin-mediated proteolysis. This gene is localized to chromosome 8q22 which is disrupted in a variety of cancers. This gene potentially has a role in regulation of cell proliferation or differentiation.
24423	The ITGB2 protein product is the integrin beta chain beta 2. Integrins are integral cell-surface proteins composed of an alpha chain and a beta chain.  A given chain may combine with multiple partners resulting in different integrins.  For example, beta 2 combines with the alpha L chain to form the integrin LFA-1, and combines with the alpha M chain to form the integrin Mac-1. Integrins are known to participate in cell adhesion as well as cell-surface mediated signalling.
24424	
24425	
24426	
24427	
24428	
24429	
24430	Alzheimer@#$%&s disease (AD) patients with an inherited form of the disease carry mutations in the presenilin proteins (PSEN1; PSEN2) or the amyloid precursor protein (APP).  These disease-linked mutations result in increased production of the    longer form of amyloid-beta (main component of amyloid deposits found in AD brains).  Presenilins are postulated to regulate APP processing through their effects on gamma-secretase, an enzyme that cleaves APP.  Also, it is thought that the presenilins are involved in the cleavage of the Notch receptor, such that that they either directly regulate gamma-secretase activity or themselves are protease enzymes.  Two alternative transcripts of PSEN2 have been identified.
24431	This gene encodes a member of the Rho family of GTPases. This protein can activate PAK1 and JNK1, and can induce filopodium formation and stress fiber dissolution. It may also mediate the effects of WNT1 signaling in the regulation of cell morphology, cytoskeletal organization, and cell proliferation.
24432	
24433	
24434	
24435	The protein encoded by this gene is related to the S. cerevisiae CAP protein, which is involved in the cyclic AMP pathway. The human protein is able to interact with other molecules of the same protein, as well as with CAP2 and actin.
24436	
24437	
24438	
24439	There are 2 forms of glucose-6-phosphate dehydrogenase. G form is X-linked and H form, encoded by this gene, is autosomally linked. This H form shows activity with other hexose-6-phosphates, especially galactose-6-phosphate, whereas the G form is specific for glucose-6-phosphate. Both forms are present in most tissues, but H form is not found in red cells.
24440	
24441	
24442	
24443	
24444	
24445	
24446	
24447	
24448	The intracellular fatty acid-binding proteins (FABPs) belongs to a multigene family. FABPs are divided into at least three distinct types, namely the hepatic-, intestinal- and cardiac-type. They form 14-15 kDa proteins and are thought to participate in the uptake, intracellular metabolism and/or transport of long-chain fatty acids. They may also be responsible in the modulation of cell growth and proliferation. Fatty acid-binding protein 3 gene contains four exons and its function is to arrest growth of mammary epithelial cells. This gene is a candidate tumor suppressor gene for human breast cancer.
24449	
24450	
24451	
24452	
24453	
24454	The protein encoded by this gene is closely related to STAM, an adaptor protein involved in the downstream signaling of cytokine receptors, both of which contain a SH3 domain and the immunoreceptor tyrosine-based activation motif (ITAM). Similar to STAM, this protein acts downstream of JAK kinases, and is phosphorylated in response to cytokine stimulation. This protein and STAM thus are thought to exhibit compensatory effects on the signaling pathway downstream of JAK kinases upon cytokine stimulation.
24455	
24456	TTL is a cytosolic enzyme involved in the posttranslational modification of alpha-tubulin (see MIM 602529). Alpha-tubulin within assembled microtubules is detyrosinated over time at the C terminus. After microtubule disassembly, TTL restores the tyrosine residues and consequently participates in a cycle of tubulin detyrosination and tyrosination.[supplied by OMIM]
24457	
24458	
24459	
24460	
24461	
24462	
24463	
24464	
24465	
24466	
24467	
24468	
24469	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L10E family of ribosomal proteins. It is located in the cytoplasm. In vitro studies have shown that the chicken protein can bind to c-Jun and can repress c-Jun-mediated transcriptional activation, but these activities have not been demonstrated in vivo. This gene was initially identified as a candidate for a Wilms tumor suppressor gene, but later studies determined that this gene is not involved in the suppression of Wilms tumor. This gene has been referred to as @#$%&laminin receptor homolog@#$%& because a chimeric transcript consisting of sequence from this gene and sequence from the laminin receptor gene was isolated; however, it is not believed that this gene encodes a laminin receptor. Transcript variants utilizing alternative polyA signals exist. The variant with the longest 3@#$%& UTR overlaps the deoxyribonuclease I-like 1 gene on the opposite strand. This gene is co-transcribed with the small nucleolar RNA gene U70, which is located in its fifth intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
24470	
24471	
24472	
24473	
24474	
24475	The product of this gene belongs to the family of G-protein coupled receptors. This family has several receptor subtypes with different pharmacological selectivity, which overlaps in some cases, for various adenosine and uridine nucleotides. This receptor is activated by ADP. Two transcript variants encoding the same protein have been identified for this gene.
24476	
24477	
24478	The protein encoded by this gene is a member of the L1 gene family of neural cell adhesion molecules. It is a neural recognition molecule that may be involved in signal transduction pathways. The deletion of one copy of this gene may be responsible for mental defects in patients with 3p- syndrome. Several alternatively spliced transcript variants of this gene have been described, but their full length nature is not known.
24479	
24480	
24481	
24482	
24483	
24484	This gene encodes a type I integral membrane protein found only in the rough endoplasmic reticulum. The encoded protein is part of an N-oligosaccharyl transferase complex that links high mannose oligosaccharides to asparagine residues found in the Asn-X-Ser/Thr consensus motif of nascent polypeptide chains. This protein forms part of the regulatory subunit of the 26S proteasome and may mediate binding of ubiquitin-like domains to this proteasome.
24485	
24486	
24487	
24488	
24489	
24490	
24491	
24492	
24493	This gene encodes a member of the bone morphogenetic protein (BMP) receptor family of transmembrane serine/threonine kinases. The ligands of this receptor are BMPs, which are members of the TGF-beta superfamily. BMPs are involved in endochondral bone formation and embryogenesis. These proteins transduce their signals through the formation of heteromeric complexes of 2 different types of serine (threonine) kinase receptors: type I receptors of about 50-55 kD and type II receptors of about 70-80 kD. Type II receptors bind ligands in the absence of type I receptors, but they require their respective type I receptors for signaling, whereas type I receptors require their respective type II receptors for ligand binding. Mutations in this gene have been associated with primary pulmonary hypertension. This gene expresses two transcript variants.
24494	
24495	
24496	
24497	
24498	
24499	
24500	
24501	
24502	
24503	
24504	Aminopeptidases play a role in the metabolism of several peptides that may be involved in blood pressure and the pathogenesis of essential hypertension (MIM 145500). Adipocyte-derived leucine aminopeptidase (ALAP) is a member of the M1 family of zinc metallopeptidases.[supplied by OMIM]
24505	
24506	
24507	This gene is a member of the H/ACA snoRNPs (small nucleolar ribonucleoproteins) gene family. snoRNPs are involved in various aspects of rRNA processing and modification and have been classified into two families: C/D and H/ACA. The H/ACA snoRNPs also include the DKC1, NOLA1 and NOLA3 proteins. These four H/ACA snoRNP proteins localize to the dense fibrillar components of nucleoli and to coiled (Cajal) bodies in the nucleus. Both 18S rRNA production and rRNA pseudouridylation are impaired if any one of the four proteins is depleted. The four H/ACA snoRNP proteins are also components of the telomerase complex. This gene encodes a protein related to Saccharomyces cerevisiae Nhp2p.
24508	
24509	
24510	
24511	
24512	
24513	
24514	The protein encoded by this gene is an embryonic nuclear transcription factor that binds to a specific DNA element, the palindromic T-site. It binds through a region in its N-terminus, called the T-box, and effects transcription of genes required for mesoderm formation and differentiation. The protein is localized to notochord-derived cells.
24515	
24516	
24517	
24518	
24519	
24520	
24521	
24522	
24523	
24524	
24525	This gene encodes a member of the DNAJ protein family. DNAJ family members are characterized by a highly conserved amino acid stretch called the @#$%&J-domain@#$%& and function as one of the two major classes of molecular chaperones involved in a wide range of cellular events, such as protein folding and oligomeric protein complex assembly. This family member may also play a role in polyglutamine aggregation in specific neurons. Alternative splicing of this gene results in multiple transcript variants; however, not all variants have been fully described.
24526	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S26E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
24527	
24528	
24529	
24530	
24531	The protein encoded by this gene is part of the clathrin coat assembly complex which links clathrin to receptors in coated vesicles. These vesicles are involved in endocytosis and Golgi processing. This protein, as well as beta-prime-adaptin, gamma-adaptin, and the medium (mu) chain AP47, form the AP-1 assembly protein complex located at the Golgi vesicle. Two alternatively spliced transcript variants of this gene, which encode distinct isoforms, have been reported.
24532	
24533	
24534	
24535	
24536	
24537	
24538	
24539	
24540	This gene encodes a subunit of the heterotetrameric adaptor-related protein comlex 3 (AP-3), which belongs to the adaptor complexes medium subunits family. The AP-3 complex plays a role in protein trafficking to lysosomes and specialized organelles.
24541	
24542	
24543	
24544	Calcium-dependent membrane-binding proteins may regulate molecular events at the interface of the cell membrane and cytoplasm.  Copine III contains two type II C2 domains in the amino-terminus and an A domain-like sequence in the carboxy-terminus.  The A domain mediates interactions between integrins and extracellular ligands.
24545	
24546	Prostaglandin-endoperoxide synthase (PTGS), also known as cyclooxygenase, is the key enzyme in prostaglandin biosynthesis, and acts both as a dioxygenase and as a peroxidase. There are two isozymes of PTGS: a constitutive PTGS1 and an inducible PTGS2, which differ in their regulation of expression and tissue distribution. This gene encodes PTGS1, which regulates angiogenesis in endothelial cells, and is inhibited by nonsteroidal anti-inflammatory drugs such as aspirin. PTGS1 is thought to be involved in cell-cell signaling and maintaining tissue homeostasis. Alternative splicing of this gene generates two transcript variants. The expression of these two transcripts is differentially regulated by relevant cytokines and growth factors.
24547	
24548	
24549	
24550	
24551	
24552	This protein belongs to the aldehyde dehydrogenases family of proteins. Aldehyde dehydrogenase is the second enzyme of the major oxidative pathway of alcohol metabolism. This gene does not contain introns in the coding sequence. The variation of this locus may affect the development of alcohol-related problems.
24553	
24554	
24555	
24556	
24557	The protein encoded by this gene is a member of the serine/threonine protein kinase family. The specific function of this kinase is not known.
24558	
24559	
24560	Linker or adaptor proteins provide mechanisms by which receptors can amplify and regulate downstream effector proteins. The B-cell linker protein is essential for normal B-cell development.[supplied by OMIM]
24561	
24562	
24563	
24564	
24565	
24566	
24567	
24568	This gene encodes a protein that belongs to an ETS transcription factor subfamily characterized by epithelial-specific expression (ESEs). The encoded protein acts as a transcriptional repressor and may be associated with asthma susceptibility. This protein may be involved in epithelial differentiation and carcinogenesis.
24569	
24570	
24571	
24572	
24573	
24574	
24575	
24576	Although the function of this gene is not known, its existence is supported by mRNA and EST data. The predicted gene product contains a region similar to the STAR-related lipid transfer (START) domain, which is often present in proteins involved in the cell signaling mediated by lipid binding. Two alternatively spliced transcript variants that encode an identical protein have been reported.
24577	
24578	
24579	
24580	
24581	
24582	
24583	
24584	
24585	
24586	
24587	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene has sequence homology to members of the histone deacetylase family. Two alternatively spliced transcript variants have been observed for the gene. This gene is orthologous to mouse HDAC7 gene whose protein promotes repression mediated via transcriptional corepressor SMRT.
24588	
24589	
24590	
24591	
24592	
24593	
24594	
24595	This gene appears to be a retrotransposed pseudogene based on its lack of exons compared to other family members, one of which is found on chromosome 19 (MEIS3). It does appear to be transcribed and has an ORF that may encode a protein of similar size and sequence to that encoded by MEIS3.
24596	
24597	
24598	
24599	
24600	
24601	
24602	
24603	
24604	
24605	
24606	
24607	
24608	
24609	
24610	
24611	
24612	
24613	
24614	
24615	
24616	
24617	
24618	
24619	
24620	
24621	
24622	
24623	
24624	
24625	The protein encoded by this gene belongs to the NCP family of cell-recognition molecules. This family represents a distinct subgroup of the neurexins. NCP proteins mediate neuron-glial interactions in vertebrates and glial-glial contact in invertebrates. The protein encoded by this gene may play a role in cell recognition within the nervous system. Alternatively spliced transcript variants encoding different isoforms have been described but their biological nature has not been determined.
24626	
24627	
24628	
24629	
24630	
24631	
24632	ITGA2B encodes integrin alpha chain 2b. Integrins are heterodimeric integral membrane proteins composed of an alpha chain and a beta chain. Alpha chain 2b undergoes post-translational cleavage to yield disulfide-linked light and heavy chains that join with beta 3 to form a fibronectin receptor expressed in platelets that plays a crucial role in coagulation. Mutations that interfere with this role result in thrombasthenia. In addition to adhesion, integrins are known to participate in cell-surface mediated signalling.
24633	
24634	
24635	
24636	
24637	
24638	
24639	
24640	The protein encoded by this gene is the human ortholog of bovine and rat dipeptidylpeptidase VI. They are transmembrane proteins and likely members of the dipeptidyl aminopeptidase family. Two transcript variants encoding different isoforms have been identified in all 3 species. In bovine and rat, the longer form has been shown to be expressed almost exclusively in brain, whereas the shorter form has a more widespread expression.
24641	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
24642	
24643	
24644	
24645	
24646	
24647	
24648	
24649	
24650	
24651	
24652	
24653	
24654	
24655	
24656	
24657	
24658	
24659	
24660	
24661	N-ethylmaleimide-sensitive factor (NSF) and valosin-containing protein (p97) are two ATPases known to be involved in transport vesicle/target membrane fusion and fusions between membrane compartments. A trimer of the protein encoded by this gene binds a hexamer of cytosolic p97 and is required for p97-mediated regrowth of Golgi cisternae from mitotic Golgi fragments. Multiple transcript variants encoding several different isoforms have been found for this gene.
24662	
24663	
24664	
24665	
24666	
24667	
24668	Core binding factor (CBF) is a heterodimeric transcription factor that binds to the core element of many enhancers and promoters. The protein encoded by this gene represents the alpha subunit of CBF and is thought to be involved in the development of normal hematopoiesis. Chromosomal translocations involving this gene are well-documented and have been associated with several types of leukemia. Two transcript variants encoding different isoforms have been found for this gene.
24669	The protein encoded by this gene phosphorylates vitamin B6, a step required for the conversion of vitamin B6 to pyridoxal-5-phosphate, an important cofactor in intermediary metabolism. The encoded protein is cytoplasmic and probably acts as a homodimer.
24670	
24671	
24672	
24673	
24674	
24675	
24676	
24677	
24678	
24679	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains an extracellular domain, a single transmembrane segment and one intracytoplasmic catalytic domain, thus belongs to receptor type PTP. The extracellular region of this PTP is composed of multiple fibronectin type_III repeats, which was shown to interact with neuronal receptor and cell adhesion molecules, such as contactin and tenascin C. This protein was also found to interact with sodium channels, and thus may regulate sodium channels by altering tyrosine phosphorylation status. The functions of the interaction partners of this protein implicate the roles of this PTP in cell adhesion, neurite growth, and neuronal differentiation.
24680	
24681	
24682	
24683	
24684	
24685	
24686	This gene encodes a protein which contains a conserved nuclear localization signal. The encoded protein may have a role in spermatogenesis, cell cycle control, and in meiotic cell division.
24687	Gopalakrishnan and Scarpulla (1995) noted that the electron transport chain and oxidative phosphorylation system rely on the functional interplay of gene products expressed from both nuclear and mitochondrial genomes. Because of the limited coding capacity of the mitochondrial chromosome, nuclear genes must provide most of the respiratory subunits and all of the gene products necessary for mitochondrial DNA transcription and replication. Nuclear respiratory factor-1 (NRF1) is a transcription factor that acts on nuclear genes encoding respiratory subunits and components of the mitochondrial transcription and replication machinery.[supplied by OMIM]
24688	
24689	
24690	
24691	
24692	
24693	
24694	
24695	
24696	
24697	
24698	
24699	
24700	The protein encoded by this gene is a type II membrane protein that is thought to catalyze the transfer of sialic acid from CMP-sialic acid to N-linked oligosaccharides and glycoproteins. The encoded protein may be found in the Golgi apparatus and may be involved in the production of polysialic acid, a modulator of the adhesive properties of neural cell adhesion molecule (NCAM1). This protein is a member of glycosyltransferase family 29.
24701	Cytochrome c oxidase (COX), the terminal enzyme of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. It is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in the electron transfer and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes polypeptide 1 (liver isoform) of subunit VIa, and polypeptide 1 is found in all non-muscle tissues. Polypeptide 2 (heart/muscle isoform) of subunit VIa is encoded by a different gene, and is present only in striated muscles. These two polypeptides share 66% amino acid sequence identity. It has been reported that there may be several pseudogenes on chromosomes 1, 6, 7q21, 7q31-32 and 12. However, only one pseudogene (COX6A1P) on chromosome 1p31.1 has been documented.
24702	
24703	
24704	
24705	
24706	
24707	
24708	
24709	
24710	
24711	
24712	
24713	
24714	
24715	
24716	
24717	
24718	
24719	
24720	
24721	
24722	
24723	
24724	
24725	
24726	
24727	
24728	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. The enzyme encoded by this gene is a lactosylceramide synthase important for glycolipid biosynthesis.
24729	
24730	
24731	
24732	
24733	
24734	
24735	
24736	
24737	
24738	
24739	
24740	
24741	
24742	
24743	The protein encoded by this gene is a member of the CLK family of dual specificity protein kinases. CLK family members have shown to interact with, and phosphorylate, serine- and arginine-rich (SR) proteins of the spliceosomal complex, which is a part of the regulatory mechanism that enables the SR proteins to control RNA splicing. This protein kinase is found to be involved in the regulation of several cellular processes and may serve as a link between cell cycle progression, apoptosis, and telomere length regulation. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
24744	
24745	
24746	
24747	
24748	
24749	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intracellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, and White). This encoded protein is a member of the ABC1 subfamily. Members of the ABC1 subfamily comprise the only major ABC subfamily found exclusively in multicellular eukaryotes. This gene is clustered among 4 other ABC1 family members on 17q24, but neither the substrate nor the function of this gene is known. Alternative splicing of this gene results in several transcript variants; however, not all variants have been fully described.
24750	
24751	
24752	The protein encoded by this gene binds to a specific DNA sequence upstream of the upstream regulatory sequence of the core promoter and second enhancer of human hepatitis B virus (HBV). Through this binding, it represses HBV core promoter activity, transcription of HBV genes, and production of HBV virions. The protein shows sequence similarities with other DNA-binding structural proteins such as gallin, oncoproteins of the MYC family, and the oncoprotein MOS. It may also be involved in protecting cells from apoptosis and in pre-mRNA splicing. Several transcript variants encoding different isoforms have been described for this gene, but the full-length nature of only six of them has been determined.
24753	The protein encoded by this gene is a member of a superfamily of WD repeat proteins.  The function of this protein is not yet determined; however, other related family members perform functions involved in regulatory processes.
24754	
24755	
24756	
24757	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. The function of this gene has not yet been determined. The expression pattern of the mouse homolog implies a role in nervous system development.
24758	
24759	
24760	
24761	
24762	
24763	
24764	This gene encodes the fatty acid binding protein found in epidermal cells, and was first identified as being upregulated in psoriasis tissue. Fatty acid binding proteins are a family of small, highly conserved, cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. It is thought that FABPs roles include fatty acid uptake, transport, and metabolism.
24765	
24766	
24767	
24768	The Rh blood group system is the second most clinically significant of the blood groups, second only to ABO. It is also the most polymorphic of the blood groups, with variations due to deletions, gene conversions, and missense mutations. The Rh blood group includes this gene which encodes the RhD protein and a second gene which encodes both the RhC and RhE antigens on a single polypeptide. The two genes are found in a cluster which includes a third unrelated gene on chromosome 1. The classification of Rh-positive and Rh-negative individuals is determined by the presence or absence of the highly immunogenic RhD protein on the surface of erythrocytes. Alternative splicing of this gene results in two transcript variants encoding two different isoforms.
24769	
24770	
24771	This gene encodes the enzyme that metabolizes thiopurine drugs via S-adenosyl-L-methionine as the S-methyl donor and S-adenosyl-L-homocysteine as a byproduct. Thiopurine drugs such as 6-mercaptopurine are used as chemotherapeutic agents. Genetic polymorphisms that affect this enzymatic activity are correlated with variations in sensitivity and toxicity to such drugs within individuals. A pseudogene for this locus is located on chromosome 18q.
24772	
24773	
24774	
24775	
24776	
24777	
24778	
24779	
24780	
24781	
24782	
24783	
24784	
24785	
24786	
24787	
24788	
24789	
24790	
24791	
24792	
24793	
24794	
24795	
24796	
24797	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L23P family of ribosomal proteins. It is located in the cytoplasm. The protein may be one of the target molecules involved in mediating growth inhibition by interferon. In yeast, the corresponding protein binds to a specific site on the 26S rRNA. This gene is co-transcribed with the U42A, U42B, U101A, and U101B small nucleolar RNA genes, which are located in its third, first, second, and fourth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
24798	
24799	
24800	
24801	
24802	
24803	
24804	
24805	
24806	
24807	
24808	
24809	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
24810	
24811	
24812	
24813	
24814	
24815	
24816	The protein encoded by this gene belongs to the stathmin/oncoprotein 18 family of microtubule-destabilizing phosphoproteins. It is similar to the SCG10 protein and is involved in signal transduction and regulation of microtubule dynamics.
24817	
24818	
24819	
24820	
24821	
24822	
24823	
24824	
24825	
24826	
24827	The protein encoded by this gene regulates the turnover of diphosphoinositol polyphosphates. The turnover of these high-energy diphosphoinositol polyphosphates represents a molecular switching activity with important regulatory consequences. Molecular switching by diphosphoinositol polyphosphates may contribute to regulating intracellular trafficking. Several alternatively spliced transcript variants have been described, but the full-length nature of some variants has not been determined. Isoforms DIPP2alpha and DIPP2beta are distinguishable from each other solely by DIPP2beta possessing one additional amino acid due to intron boundary skidding in alternate splicing.
24828	
24829	
24830	
24831	Studies in Drosophila have identified this gene as encoding a positive regulator of the Notch-signaling pathway. The human gene encodes a protein of unknown function; however, it may play a role in basic helix-loop-helix transcription factor activity.
24832	
24833	The high affinity IgE receptor is a key molecule involved in allergic reactions. It is a tetramer composed of 1 alpha, 1 beta, and 2 gamma chains. The gamma chains are also subunits of other Fc receptors.
24834	
24835	
24836	
24837	
24838	
24839	
24840	
24841	
24842	
24843	
24844	
24845	
24846	
24847	The protein encoded by this gene is a member of the somatotropin/prolactin family of hormones and plays an important role in growth control. The gene is located at the growth hormone locus on chromosome 17 along with four other related genes in the same transcriptional orientation; an arrangement which is thought to have evolved by a series of gene duplications. Although the five genes share a remarkably high degree of sequence identity, they are expressed selectively in different tissues. Alternative splicing generates additional isoforms of each of the five growth hormones, leading to further diversity and potential for specialization. This particular family member is expressed mainly in the placenta and utilizes multiple transcription initiation sites. Expression of the identical mature proteins for chorionic somatomammotropin hormones 1 and 2 is upregulated during development, although the ratio of 1 to 2 increases by term. Mutations in this gene result in placental lactogen deficiency and Silver-Russell syndrome.
24848	
24849	
24850	HIPK2 is a conserved serine/threonine nuclear kinase that interacts with homeodomain transcription factors.[supplied by OMIM]
24851	
24852	
24853	
24854	
24855	
24856	
24857	
24858	
24859	
24860	This gene encodes a member of a family of calcium-regulated actin-binding proteins.  This protein represents a dominant part of the brush border cytoskeleton which functions in the capping, severing, and bundling of actin filaments.  Two mRNAs of 2.7 kb and 3.5 kb have been observed; they result from utilization of alternate poly-adenylation signals present in the terminal exon.
24861	
24862	This intronless gene encodes a 70kDa heat shock protein which is a member of the heat shock protein 70 family. In conjuction with other heat shock proteins, this protein stabilizes existing proteins against aggregation and mediates the folding of newly translated proteins in the cytosol and in organelles. It is also involved in the ubiquitin-proteasome pathway through interaction with the AU-rich element RNA-binding protein 1. The gene is located in the major histocompatibility complex class III region, in a cluster with two closely related genes which encode similar proteins.
24863	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to cytoplasmic bodies. Its function has not been identified. Alternative splicing of this gene generates two transcript variants.
24864	
24865	
24866	
24867	
24868	
24869	
24870	
24871	
24872	
24873	This gene encodes the major type IV alpha collagen chain of basement membranes. Like the other members of the type IV collagen gene family, this gene is organized in a head-to-head conformation with another type IV collagen gene so that each gene pair shares a common promoter.
24874	
24875	
24876	
24877	
24878	
24879	
24880	
24881	
24882	
24883	
24884	
24885	
24886	
24887	
24888	
24889	
24890	The proliferation and differentiation of B cells is dependent upon a B-cell antigen receptor (BCR) complex. Binding of antigens to specific B-cell receptors results in a tyrosine phosphorylation reaction through the BCR complex and leads to multiple signal transduction pathways.
24891	
24892	
24893	
24894	
24895	
24896	
24897	
24898	
24899	
24900	
24901	
24902	Laminins are basement membrane components thought to mediate the attachment, migration and organization of cells into tissues during embryonic development by interacting with other extracellular matrix components. The protein encoded by this gene is the alpha-3 chain of laminin 5, which is a complex glycoprotein composed of three subunits (alpha, beta, and gamma). Laminin 5 is thought to be involved in cell adhesion, signal transduction and differentiation of keratinocytes. Mutations in this gene have been identified as the cause of Herlitz type junctional epidermolysis bullosa. Alternatively spliced transcript variants encoding different isoforms have been identified.
24903	
24904	
24905	
24906	
24907	The protein encoded by this gene is a member of the interleukin 1 cytokine family. This cytokine was shown to specifically inhibit the activation of NF-kappaB induced by interleukin 1 family, member 6 (IL1F6). This gene and eight other interleukin 1 family genes form a cytokine gene cluster on chromosome 2. Two alternatively spliced transcript variants encoding the same protein have been reported.
24908	
24909	
24910	
24911	
24912	
24913	
24914	
24915	
24916	
24917	
24918	
24919	
24920	
24921	
24922	
24923	This gene encodes a member of the ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) protein family. ADAMTS family members share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The protein encoded by this gene has a high sequence similarity to the protein encoded by ADAMTS18, another family member. The function of this protein has not been determined.
24924	The protein encoded by this gene is a member of the SEC23 subfamily of the SEC23/SEC24 family, which is involved in vesicle trafficking. The encoded protein has similarity to yeast Sec23p component of COPII. COPII is the coat protein complex responsible for vesicle budding from the ER. The function of this gene product has been implicated in cargo selection and concentration. Multiple alternatively spliced transcript variants encoding the same protein have been found for this gene.
24925	DNA damage and/or hyperproliferative signals activate wildtype p53 tumor suppressor protein (TP53; MIM 191170), inducing cell cycle arrest or apoptosis. Mutations that inactivate p53 occur in 50% of all tumors. Polyak et al. (1997) used serial analysis of gene expression (SAGE) to evaluate cellular mRNA levels in a colorectal cancer cell line transfected with p53. Of 7,202 transcripts identified, only 14 were expressed at levels more than 10-fold higher in p53-expressing cells than in control cells. Polyak et al. (1997) termed these genes @#$%&p53-induced genes,@#$%& or PIGs, several of which were predicted to encode redox-controlling proteins. They noted that reactive oxygen species (ROS) are potent inducers of apoptosis. Flow cytometric analysis showed that p53 expression induces ROS production, which increases as apoptosis progresses under some conditions. The authors stated that the PIG10 gene, also called ENC1, encodes an actin-binding protein.[supplied by OMIM]
24926	
24927	
24928	
24929	The protein encoded by this gene is a member of the homeodomain family of DNA binding proteins. It may regulate gene expression, morphogenesis, and differentiation and it may also play a role in the cell cycle progession. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
24930	
24931	
24932	
24933	
24934	
24935	
24936	
24937	This gene is a member of the RUNX family of transcription factors and encodes a nuclear protein with an Runt DNA-binding domain. This protein is essential for osteoblastic differentiation and skeletal morphogenesis, acting as a scaffold for nucleic acids and regulatory factors involved in skeletal gene expression. The protein can bind DNA both as a monomer or, with more affinity, as a subunit of a heterodimeric complex. Mutations in this gene have been associated with the bone development disorder cleidocranial dysplasia (CCD). Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
24938	
24939	
24940	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
24941	
24942	
24943	
24944	
24945	
24946	
24947	
24948	
24949	The protein encoded by this gene is a member of a family of RIM-binding proteins. RIMs are active zone proteins that regulate neurotransmitter release. This gene has been found fused to the receptor-type tyrosine kinase gene RET by gene rearrangement due to the translocation t(10;12)(q11;p13). Five transcript variants encoding different isoforms have been found for this gene.
24950	
24951	
24952	
24953	
24954	The protein encoded by this gene is a member of the dual specificity protein tyrosine phosphatase family. This protein is highly similar to Saccharomyces cerevisiae Cdc14, a protein tyrosine phosphatase involved in the exit of cell mitosis and initiation of DNA replication, which suggests the role in cell cycle control. This protein has been shown to interact with and dephosphorylates tumor suppressor protein p53, and is thought to regulate the function of p53. Alternative splice of this gene results in 3 transcript variants encoding distinct isoforms.
24955	
24956	
24957	
24958	
24959	
24960	
24961	
24962	This gene encodes a nuclear protein that has three zinc fingers at the end of its C-terminal domain, a serine/threonine-rich central region, and an acidic domain lying within the N-terminal region. The zinc fingers of this protein are responsible for the specific DNA binding with the guanine-rich core promoter elements. The central region might be involved in activation or posttranslational regulatory pathways, and the acidic N-terminal domain might play an important role in the process of transcriptional activation. It is capable of activating transcription approximately 4-fold either on homologous or heterologous promoters. The DNA binding and transcriptional activity of this protein, in conjunction with its expression pattern, suggests that this protein may participate in the regulation and/or maintenance of the basal expression of pregnancy-specific glycoprotein genes and possibly other TATA box-less genes. Two transcript variants encoding the same protein have been found for this gene.
24963	
24964	The protein encoded by this gene is a member of serine/threonine protein kinase family. This kinase contains a dual leucine-zipper motif, and has been shown to form dimers/oligomers through its leucine-zipper motif. This kinase can phosphorylate and activate MAPK8/JNK, MAP2K7/MKK7, which suggests a role in the JNK signaling pathway.
24965	
24966	
24967	
24968	
24969	
24970	This gene encodes a protein with a putative role in the localization of trans-Golgi network (TGN) membrane proteins. Mouse and rat homologs have been identified and studies of the homologous rat protein indicate a role in directing TGN localization of furin by binding to the protease@#$%&s phosphorylated cytosolic domain. In addition, the human protein plays a role in HIV-1 Nef-mediated downregulation of cell surface MHC-I molecules to the TGN, thereby enabling HIV-1 to escape immune surveillance.
24971	
24972	
24973	
24974	
24975	
24976	
24977	
24978	This gene belongs to the class II amino-acyl tRNA family. The encoded enzyme catalyzes the transfer of L-serine to tRNA (Ser) and is related to bacterial and yeast counterparts.
24979	
24980	
24981	
24982	
24983	
24984	
24985	
24986	
24987	
24988	Rho GTPases play a fundamental role in numerous cellular processes that are initiated by extracellular stimuli that work through G protein coupled receptors. The encoded protein may form complex with G proteins and stimulate Rho-dependent signals. This protein may be involved in the control of cytoskeletal organization. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
24989	
24990	The protein encoded by this gene is a smooth muscle myosin belonging to the myosin heavy chain family. The gene product is a subunit of a hexameric protein that consists of two heavy chain subunits and two pairs of non-identical light chain subunits. It functions as a major contractile protein, converting chemical energy into mechanical energy through the hydrolysis of ATP. The gene encoding a human ortholog of rat NUDE1 is transcribed from the reverse strand of this gene, and its 3@#$%& end overlaps with that of the latter. The pericentric inversion of chromosome 16 [inv(16)(p13q22)] produces a chimeric transcript that encodes a protein consisting of the first 165 residues from the N terminus of core-binding factor beta in a fusion with the C-terminal portion of the smooth muscle myosin heavy chain. This chromosomal rearrangement is associated with acute myeloid leukemia of the M4Eo subtype. Alternative splicing generates isoforms that are differentially expressed, with ratios changing during muscle cell maturation. Additional splice variants have been described, but their full-length natures have not been determined.
24991	
24992	
24993	
24994	Aminopeptidase N is located in the small-intestinal and renal microvillar membrane, and also in other plasma membranes. In the small intestine aminopeptidase N plays a role in the final digestion of peptides generated from hydrolysis of proteins by gastric and pancreatic proteases. Its function in proximal tubular epithelial cells and other cell types is less clear. The large extracellular carboxyterminal domain contains a pentapeptide consensus sequence characteristic of members of the zinc-binding metalloproteinase superfamily. Sequence comparisons with known enzymes of this class showed that CD13 and aminopeptidase N are identical. The latter enzyme was thought to be involved in the metabolism of regulatory peptides by diverse cell types, including small intestinal and renal tubular epithelial cells, macrophages, granulocytes, and synaptic membranes from the CNS. Human aminopeptidase N is a receptor for one strain of human coronavirus that is an important cause of upper respiratory tract infections. Defects in this gene appear to be a cause of various types of leukemia or lymphoma.
24995	
24996	
24997	
24998	L-type calcium channels are composed of five subunits. The protein encoded by this gene represents one of these subunits, gamma, and is one of several gamma subunit proteins. It is an integral membrane protein that is thought to stabilize the calcium channel in an inactive (closed) state. This gene is a member of the neuronal calcium channel gamma subunit gene subfamily of the PMP-22/EMP/MP20 family and is located in a cluster with two similar gamma subunit-encoding genes.
24999	Kynureninase is a pyridoxal-5@#$%&-phosphate (pyridoxal-P) dependent enzyme that catalyses the cleavage of L-kynurenine and L-3-hydroxykynurenine into anthranilic and 3-hydroxyanthranilic acids, respectively. Kynureninase is involved in the biosynthesis of NAD cofactors from tryptophan through the kynurenine pathway.
25000	
25001	
25002	
25003	
25004	
25005	
25006	
25007	
25008	
25009	
25010	This gene encodes a hematopoeitin/interferon-class receptor belonging to the cytokine superfamily of receptors. The encoded gene product represents the CNTF-specific alpha subunit of a heterotrimer forming the CNTF receptor complex, which also includes LIFR and gp130. The receptor is attached to the membrane by a glycosyl-phosphatidylinositol linkage and contains an immunoglobulin-like C2-type domain and a fibronectin type-III domain. Signal transduction requires that CNTF bind first to this alpha component, which permits the recruitment of gp130 and LIFR beta to form the tripartite receptor complex. Signal transduction stimulates gene expression, cell survival or differentiation in a variety of neuronal cell types. Alternative splicing has been observed at this locus and two variants, both encoding the same protein, have been identified.
25011	
25012	
25013	
25014	
25015	
25016	
25017	
25018	
25019	
25020	
25021	
25022	
25023	
25024	
25025	
25026	
25027	This gene is a member of the paired box (PAX) family of transcription factors.  Members of this gene family typically contain a paired box domain, an octapeptide, and a paired-type homeodomain.  These genes play critical roles during fetal development and cancer growth.  The specific function of the paired box gene 8 is unknown but it may involve kidney cell differentiation, thyroid development, or thyroid dysgenesis.   Alternative splicing in this gene by inclusion or exclusion of exons 7 and/or 8 has produced several known products but the biological significance of the variants is unknown.  Several other splice variants have been proposed but the full nature of these products has not been described.
25028	
25029	
25030	
25031	
25032	
25033	
25034	
25035	Heterotrimeric guanine nucleotide-binding proteins (G proteins), which integrate signals between receptors and effector proteins, are composed of an alpha, a beta, and a gamma subunit. These subunits are encoded by families of related genes. This gene encodes a beta subunit. Beta subunits are important regulators of alpha subunits, as well as of certain signal transduction receptors and effectors. This gene uses alternative polyadenylation signals.
25036	
25037	
25038	
25039	
25040	The protein encoded by this gene is a nuclear transcription factor which binds DNA as a homodimer. The encoded protein controls the expression of several genes, including hepatocyte nuclear factor 1 alpha, a transcription factor which regulates the expression of several hepatic genes. This gene may play a role in development of the liver, kidney, and intestines. Mutations in this gene have been associated with monogenic autosomal dominant non-insulin-dependent diabetes mellitus type I. At least three different transcript variants encoding three different isoforms have been found for this gene.
25041	
25042	
25043	
25044	
25045	
25046	
25047	
25048	
25049	The product of the CHML gene supports geranylgeranylation of most Rab proteins and may substitute for REP-1 in tissues other than retina. CHML is localized close to the gene for Usher syndrome type II.
25050	
25051	Phosphoinositide 3-kinases (PI3Ks) phosphorylate the 3-prime OH position of the inositol ring of inositol lipids. See MIM 602838. The class I PI3Ks display a broad phosphoinositide lipid substrate specificity and include p110-alpha (MIM 171834), p110-beta (MIM 602925), and p110-gamma (MIM 601232). p110-alpha and p110-beta interact with SH2/SH3-domain-containing p85 adaptor proteins (see MIM 171833) and with GTP-bound Ras.[supplied by OMIM]
25052	
25053	
25054	
25055	
25056	
25057	
25058	Metabolic N-oxidation of the diet-derived amino-trimethylamine (TMA) is mediated by flavin-containing monooxygenase and is subject to an inherited FMO3 polymorphism in man resulting in a small subpopulation with reduced TMA N-oxidation capacity resulting in fish odor syndrome Trimethylaminuria. Three forms of the enzyme, FMO1 found in fetal liver, FMO2 found in adult liver, and FMO3 are encoded by genes clustered in the 1q23-q25 region.   Flavin-containing monooxygenases are NADPH-dependent flavoenzymes that catalyzes the oxidation of soft nucleophilic heteroatom centers in drugs, pesticides, and xenobiotics.
25059	
25060	
25061	
25062	
25063	
25064	
25065	This gene belongs to a group of vacuolar protein sorting (VPS) genes. The encoded protein is a component of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. The close structural similarity between the yeast and human proteins that make up this complex suggests a similarity in function. Expression studies in yeast and mammalian cells indicate that this protein interacts directly with VPS35, which serves as the core of the retromer complex.
25066	
25067	
25068	This gene encodes human pituitary glutaminyl cyclase, which is responsible for the presence of pyroglutamyl residues in many neuroendocrine peptides. The amino acid sequence of this enzyme is 86% identical to that of bovine glutaminyl cyclase.
25069	
25070	
25071	
25072	
25073	
25074	
25075	
25076	
25077	Endocytosis of cell surface proteins is mediated by a complex molecular machinery that assembles on the inner surface of the plasma membrane. This gene encodes one of two human homologs of the Drosophila melanogaster stoned B protein. This protein is related to components of the endocytic machinery and exhibits a modular structure consisting of an N-terminal proline-rich domain, a central region of homology specific to the human stoned B-like proteins, and a C-terminal region homologous to the mu subunits of adaptor protein (AP) complexes. Co-transcription of this gene and the neighboring downstream gene generates a rare transcript (SALF), which encodes a fusion protein comprised of sequence sharing identity with each individual gene product.
25078	
25079	
25080	
25081	
25082	Alpha-synuclein is a member of the synuclein family, which also includes beta- and gamma-synuclein. Synucleins are abundantly expressed in the brain and alpha- and beta-synuclein inhibit phospholipase D2 selectively. SNCA may serve to integrate presynaptic signaling and membrane trafficking. Defects in SNCA have been implicated in the pathogenesis of Parkinson disease. SNCA peptides are a major component of amyloid plaques in the brains of patients with Alzheimer@#$%&s disease. Two alternatively spliced transcripts of SNCA have been identified. Additional splicing may be present but the full-length nature of these variants has not been determined.
25083	
25084	
25085	
25086	The protein encoded by this gene is similar to the chicken and mouse CRTAP genes. The encoded protein is a scaffolding protein that may influence the activity of at least one member of the cytohesin/ARNO family in response to specific cellular stimuli.
25087	
25088	
25089	
25090	
25091	
25092	
25093	SEP1 localizes to cytoplasmic foci containing a complex of mRNA-degrading enzymes. In addition to mRNA metabolism, yeast Sep1 has been implicated in a variety of nuclear and cytoplasmic functions, including homologous recombination, meiosis, telomere maintenance, and microtubule assembly.[supplied by OMIM]
25094	
25095	
25096	
25097	
25098	
25099	
25100	
25101	
25102	
25103	
25104	
25105	
25106	
25107	
25108	Nucleotides are involved in numerous biochemical reactions and pathways within the cell as substrates, cofactors, and effectors. Nudix hydrolases, such as NUDT12, regulate the concentrations of individual nucleotides and of nucleotide ratios in response to changing circumstances (Abdelraheim et al., 2003).[supplied by OMIM]
25109	
25110	
25111	
25112	
25113	This gene encodes a protein that is a transmembrane receptor for growth hormone. Binding of growth hormone to the receptor leads to receptor dimerization and the activation of an intra- and intercellular signal transduction pathway leading to growth. A common alternate allele of this gene, called GHRd3, lacks exon three and has been well-characterized. Mutations in this gene have been associated with Laron syndrome, also known as the growth hormone insensitivity syndrome (GHIS), a disorder characterized by short stature. Other splice variants, including one encoding a soluble form of the protein (GHRtr), have been observed but have not been thoroughly characterized.
25114	
25115	This gene is part of a 500 kb inverted duplication on chromosome 5q13. This duplicated region contains at least four genes and repetitive elements which make it prone to rearrangements and deletions. The repetitiveness and complexity of the sequence have also caused difficulty in determining the organization of this genomic region. This copy of the gene is full length; additional copies with truncations and internal deletions are also present in this region of chromosome 5q13. It is thought that this gene is a modifier of spinal muscular atrophy caused by mutations in a neighboring gene, SMN1. The protein encoded by this gene contains regions of homology to two baculovirus inhibitor of apoptosis proteins, and it is able to suppress apoptosis induced by various signals. A second transcript for this gene has been described, but its full length nature has not been determined.
25116	RAS genes (e.g., MIM 190020) encode membrane-bound guanine nucleotide-binding proteins that function in the transduction of signals that control cell growth and differentiation. Binding of GTP activates RAS proteins, and subsequent hydrolysis of the bound GTP to GDP and phosphate inactivates signaling by these proteins. GTP binding can be catalyzed by guanine nucleotide exchange factors for RAS, and GTP hydrolysis can be accelerated by GTPase-activating proteins (GAPs). The first exchange factor to be identified for RAS was the S. cerevisiae CDC25 gene product. Genetic analysis indicated that CDC25 is essential for activation of RAS proteins. In Drosophila, the protein encoded by the @#$%&son of sevenless@#$%& gene (Sos) contains a domain that shows sequence similarity with the catalytic domain of CDC25. Sos may act as a positive regulator of RAS by promoting guanine nucleotide exchange.[supplied by OMIM]
25117	The protein encoded by this gene is a plasma glycoprotein of unknown function. The protein shows sequence similarity to the variable regions of some immunoglobulin supergene family member proteins.
25118	Apolipoprotein(a) is the distinguishing protein moiety of lipoprotein(a), of which elevated plasma levels are correlated with an increased risk of atherosclerosis. The protein encoded by this gene is secreted and is structurally similar to apolipoprotein(a), although apolipoprotein(a) contains many kringle domain repeats while this protein contains only a single kringle domain. Kringle domains have been shown to mediate protein-protein interactions. Two transcript variants encoding the same protein have been found for this gene.
25119	The protein encoded by this gene belongs to the family of P-type cation transport ATPases. This gene encodes a polypeptide that acts as a membrane copper-transport protein. Defects in this gene are the cause of Wilson disease (WD). WD is an autosomal recessive disorder of copper metabolism in which copper cannot be incorporated into ceruloplasmin in liver, and cannot be excreted from the liver into the bile. Alternatively spliced transcript variants encoding different isoforms have been identified.
25120	
25121	
25122	
25123	
25124	This gene encodes a multifunctional protein. It has two enzymatically active domains with catalytic activities - peptidylglycine alpha-hydroxylating monooxygenase (PHM) and peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL). These catalytic domains work sequentially to catalyze neuroendocrine peptides to active alpha-amidated products. Multiple alternatively spliced transcript variants encoding different isoforms have been described for this gene but some of their full length sequences are not yet known.
25125	
25126	ETFA participates in catalyzing the initial step of the mitochondrial fatty acid beta-oxidation.  It shuttles electrons between primary flavoprotein dehydrogenases and the membrane-bound electron transfer flavoprotein ubiquinone oxidoreductase. Defects in electron-transfer-flavoprotein have been implicated in type II glutaricaciduria in which multiple acyl-CoA dehydrogenase deficiencies result in large excretion of glutaric, lactic, ethylmalonic, butyric, isobutyric, 2-methyl-butyric, and isovaleric acids.
25127	
25128	
25129	This gene encodes a protein involved in regulating responses to cytokines by members of the Rel/NF-kappaB transcription factor family. These factors play a central role in innate immunity in response to pathogens, inflammatory signals and stress. This gene product interacts with TRAF proteins (tumor necrosis factor receptor-associated factors) and either I-kappaB kinase or MAP kinase to activate either NF-kappaB or Jun kinase. Two alternative transcripts encoding different proteins have been identified. A third transcript, which does not encode a protein and is transcribed in the opposite orientation, has been identified. Overexpression of this transcript has been shown to reduce expression of at least one of the protein encoding transcripts, suggesting it has a regulatory role in the expression of this gene.
25130	
25131	
25132	
25133	
25134	The protein encoded by this gene associates with stem loop IV of U2 small nuclear ribonucleoprotein (U2 snRNP) in the presence of snRNP-A@#$%&. The encoded protein may play a role in pre-mRNA splicing. Autoantibodies from patients with systemic lupus erythematosus frequently recognize epitopes on the encoded protein. Two transcript variants encoding the same protein have been found for this gene.
25135	
25136	
25137	The protein encoded by this gene interacts with the dedicator of cyto-kinesis 1 protein to promote phagocytosis and effect cell shape changes. Similarity to a C. elegans protein suggests that this protein may function in apoptosis and in cell migration. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
25138	
25139	This gene encodes one of seven subunits of the human Arp2/3 protein complex. This subunit is a member of the SOP2 family of proteins and is most similar to the protein encoded by gene ARPC1B. The similarity between these two proteins suggests that they both may function as p41 subunit of the human Arp2/3 complex that has been implicated in the control of actin polymerization in cells. It is possible that the p41 subunit is involved in assembling and maintaining the structure of the Arp2/3 complex. Multiple versions of the p41 subunit may adapt the functions of the complex to different cell types or developmental stages.
25140	
25141	
25142	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has three repeats of quasi-RRM domains that bind to RNAs and also contains a nuclear localization motif.
25143	
25144	
25145	
25146	
25147	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein binds to the mRNA and remains bound after nuclear export, acting as a nucleocytoplasmic shuttling protein. This protein contains many serine residues. Two splice variants have been found for this gene; both variants encode the same protein.
25148	
25149	
25150	
25151	
25152	
25153	Bin/amphiphysin/Rvs (BAR) domain adaptor proteins, such as BIN3, are named for their unique N-terminal region and appear to integrate signal transduction pathways that regulate membrane dynamics, F-actin cytoskeleton, and nuclear processes.[supplied by OMIM]
25154	
25155	The protein encoded by this gene is one of the highly conserved mini-chromosome maintenance proteins (MCM) that are essential for the initiation of eukaryotic genome replication. The hexameric protein complex formed by MCM proteins is a key component of the pre-replication complex (pre_RC) and may be involved in the formation of replication forks and in the recruitment of other DNA replication related proteins. The MCM complex consisting of this protein and MCM2, 6 and 7 proteins possesses DNA helicase activity, and may act as a DNA unwinding enzyme. The phosphorylation of this protein by CDC2 kinase reduces the DNA helicase activity and chromatin binding of the MCM complex. This gene is mapped to a region on the chromosome 8 head-to-head next to the PRKDC/DNA-PK, a DNA-activated protein kinase involved in the repair of DNA double-strand breaks. Alternatively spliced transcript variants encoding the same protein have been reported.
25156	
25157	
25158	
25159	
25160	
25161	
25162	
25163	
25164	
25165	
25166	
25167	
25168	
25169	
25170	
25171	
25172	Annexin IV (ANX4) belongs to the annexin family of calcium-dependent phospholipid binding proteins. Although their functions are still not clearly defined, several members of the annexin family have been implicated in membrane-related events along exocytotic and endocytotic pathways.  ANX4 has 45 to 59% identity with other members of its family and shares a similar size and exon-intron organization.  Isolated from human placenta, ANX4 encodes a protein that has possible interactions with ATP, and has in vitro anticoagulant activity and also inhibits phospholipase A2 activity.  ANX4 is almost exclusively expressed in epithelial cells.
25173	
25174	
25175	The protein encoded by this gene removes 5@#$%& overhanging flaps in DNA repair and processes the 5@#$%& ends of Okazaki fragments in lagging strand DNA synthesis. Direct physical interaction between this protein and AP endonuclease 1 during long-patch base excision repair provides coordinated loading of the proteins onto the substrate, thus passing the substrate from one enzyme to another. The protein is a member of the XPG/RAD2 endonuclease family and is one of ten proteins essential for cell-free DNA replication. DNA secondary structure can inhibit flap processing at certain trinucleotide repeats in a length-dependent manner by concealing the 5@#$%& end of the flap that is necessary for both binding and cleavage by the protein encoded by this gene. Therefore, secondary structure can deter the protective function of this protein, leading to site-specific trinucleotide expansions.
25176	
25177	
25178	
25179	
25180	
25181	
25182	
25183	
25184	The activation of mitogen-activated protein kinase (MAPK) cascades transduces various extracellular signals to the nucleus to induce gene expression, cell proliferation, differentiation, cell cycle arrest, and apoptosis. For full activation of MAPKs, dual-specificity kinases phosphorylate both threonine and tyrosine residues in MAPK TXY motifs. MKPs are dual-specificity phosphatases that dephosphorylate the TXY motif, thereby negatively regulating MAPK activity.[supplied by OMIM]
25185	This gene encodes one of the SERCA Ca(2+)-ATPases, which are intracellular pumps located in the sarcoplasmic or endoplasmic reticula of muscle cells. This enzyme catalyzes the hydrolysis of ATP coupled with the translocation of calcium from the cytosol to the sarcoplasmic reticulum lumen, and is involved in regulation of the contraction/relaxation cycle. Mutations in this gene cause Darier-White disease, also known as keratosis follicularis, an autosomal dominant skin disorder characterized by loss of adhesion between epidermal cells and abnormal keratinization. Alternative splicing results in two transcript variants encoding different isoforms.
25186	
25187	
25188	
25189	
25190	
25191	
25192	
25193	
25194	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit functions as guanine nucleotide exchange factor. It is reported that this subunit interacts with HIV-1 Tat, and thus it represses the translation of host-cell, but not HIV-1, mRNAs. Several alternatively spliced transcript variants have been found for this gene, however, the full length nature of only two variants has been determined.
25195	
25196	
25197	
25198	
25199	
25200	
25201	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
25202	This protein belongs to the aldehyde dehydrogenase family of proteins. The product of this gene is an enzyme that catalyzes the synthesis of retinoic acid (RA) from retinaldehyde. Retinoic acid, the active derivative of vitamin A (retinol), is a hormonal signaling molecule that functions in developing and adult tissues. The studies of a similar mouse gene suggest that this enzyme and  the cytochrome CYP26A1, concurrently establish local embryonic retinoic acid levels which facilitate posterior organ development and prevent spina bifida. Three transcript variants encoding distinct isoforms have been identified for this gene.
25203	The protein encoded by this gene is a pleiotropic cytokine that functions as a chemoattractant, a modulator of T cell activation, and an inhibitor of HIV replication. The signaling process of this cytokine is mediated by CD4. The product of this gene undergoes proteolytic processing, which is found to yield two functional proteins. The cytokine function is exclusively attributed to the secreted C-terminal peptide, while the N-terminal product may play a role in cell cycle control. Caspase 3 is reported to be involved in the proteolytic processing of this protein. Two alternatively spliced transcript variants encoding distinct isoforms have been reported.
25204	
25205	
25206	
25207	
25208	
25209	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class. It shares 78% sequence identity with the mouse protein.
25210	
25211	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L13E family of ribosomal proteins. It is located in the cytoplasm. This gene is expressed at significantly higher levels in benign breast lesions than in breast carcinomas. Transcript variants derived from alternative splicing and/or alternative polyadenylation exist; these variants encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
25212	
25213	
25214	
25215	
25216	
25217	
25218	
25219	
25220	
25221	
25222	
25223	
25224	
25225	
25226	
25227	
25228	
25229	
25230	Oxygen sensing is essential for homeostasis in all aerobic organisms. A phagocyte-type oxidase, similar to that responsible for the production of large amounts of reactive oxygen species (ROS) in neutrophil granulocytes, with resultant antimicrobial activity, has been postulated to function in the kidney as an oxygen sensor that regulates the synthesis of erythropoietin (EPO; MIM 133170) in the renal cortex.[supplied by OMIM]
25231	
25232	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes a protein which is not a structural subunit, but may be a heme A biosynthetic enzyme involved in COX formation, according to the yeast mutant studies. However, the studies in Rhodobacter sphaeroides suggest that this gene is not required for heme A biosynthesis, but required for stable formation of the Cu(B) and magnesium centers of COX. This human protein is predicted to contain a transmembrane domain localized in the mitochondrial inner membrane. A pseudogene COX11P has been found on chromosome 6.
25233	LMO2 encodes a cysteine-rich, two LIM-domain protein that is required for yolk sac erythropoiesis.  The LMO2 protein has a central and crucial role in hematopoietic development and is highly conserved.  The LMO2 transcription start site is located approximately 25 kb downstream from the 11p13 T-cell translocation cluster (11p13 ttc), where a number T-cell acute lymphoblastic leukemia-specific translocations occur.
25234	
25235	
25236	
25237	
25238	
25239	
25240	
25241	
25242	
25243	
25244	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. The mRNA encodes a cadherin-related neuronal receptor that localizes to synaptic junctions and is putatively involved in specific neuronal connections and signal transduction. Sharing a characteristic with other protocadherin genes, this gene has a notably large exon that encodes 6 cadherin domains and a transmembrane region. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
25245	
25246	
25247	
25248	
25249	
25250	The protein encoded by this gene belongs to the small nuclear ribonucleoprotein core protein family. It is required for pre-mRNA splicing and small nuclear ribonucleoprotein biogenesis. Alternative splicing occurs at this locus and two transcript variants encoding the same protein have been identified.
25251	
25252	
25253	This gene encodes a member of the @#$%&regulator of G protein signaling@#$%& (RGS) gene family. The encoded protein may function as a guanosine triphosphatase (GTPase)-activating protein as well as a transcriptional repressor. This protein may play a role in tumorigenesis. Multiple transcript variants encoding distinct isoforms have been identified for this gene. Other alternative splice variants have been described but their biological nature has not been determined.
25254	
25255	
25256	
25257	This gene encodes a glycoprotein which associates with lysosomal enzymes beta-galactosidase and neuraminidase to form a complex of high molecular weight multimers.  The formation of this complex provides a protective role for stability and activity.  Deficiencies in this gene are linked to multiple forms of galactosialidosis.
25258	This gene encodes a glycoprotein which associates with lysosomal enzymes beta-galactosidase and neuraminidase to form a complex of high molecular weight multimers.  The formation of this complex provides a protective role for stability and activity.  Deficiencies in this gene are linked to multiple forms of galactosialidosis.
25259	
25260	
25261	
25262	
25263	
25264	
25265	This gene belongs to a family of Sp1 related genes that encode transcription factors that regulate transcription by binding to consensus GC- and GT-box regulatory elements in target genes. This protein contains a zinc finger DNA-binding domain and several transactivation domains, and has been reported to function as a bifunctional transcription factor that either stimulates or represses the transcription of numerous genes. Transcript variants encoding different isoforms have been described for this gene, and one has been reported to initiate translation from a non-AUG (AUA) start codon. Additional isoforms, resulting from the use of alternate downstream translation initiation sites, have also been noted.
25266	
25267	
25268	
25269	
25270	
25271	
25272	Pleckstrin homology, Sec7 and coiled/coil domains 4 (PSCD4) is a member of the PSCD family. Members of this family have identical structural organization that consists of an N-terminal coiled-coil motif, a central Sec7 domain, and a C-terminal pleckstrin homology (PH) domain. The coiled-coil motif is involved in homodimerization, the Sec7 domain contains guanine-nucleotide exchange protein (GEP) activity, and the PH domain interacts with phospholipids and is responsible for association of PSCDs with membranes. Members of this family appear to mediate the regulation of protein sorting and membrane trafficking. The PSCD4 exhibits GEP activity in vitro with both ARF1 and ARF5 but is inactive with ARF6. The PSCD4 and PSCD1 gene structures are very similar.
25273	
25274	Beta-catenin is a transcriptional activator and oncoprotein involved in the development of several cancers. The protein encoded by this gene interacts directly with the C-terminal region of beta-catenin, inhibiting oncogenic beta-catenin-mediated transcriptional activation by competing with transcription factors for binding to beta-catenin. Two transcript variants encoding different isoforms have been found for this gene.
25275	
25276	
25277	
25278	
25279	
25280	
25281	
25282	
25283	
25284	
25285	
25286	
25287	
25288	
25289	
25290	
25291	
25292	
25293	
25294	
25295	
25296	
25297	
25298	This gene encodes a nuclear protein containing a SWIRM domain, a FAD-binding motif, and an amine oxidase domain. This protein is a component of several histone deacetylase complexes, though it silences genes by functioning as a histone demethylase. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
25299	
25300	
25301	
25302	
25303	
25304	
25305	
25306	
25307	
25308	SMUG1 is a glycosylase that removes uracil from single- and double-stranded DNA in nuclear chromatin, thus contributing to base excision repair.[supplied by OMIM]
25309	
25310	
25311	
25312	
25313	
25314	
25315	
25316	
25317	
25318	
25319	The delta (HBD) and beta (HBB) genes are normally expressed in the adult: two alpha chains plus two beta chains constitute HbA, which in normal adult life comprises about 97% of the total hemoglobin. Two alpha chains plus two delta chains constitute HbA-2, which with HbF comprises the remaining 3% of adult hemoglobin. Five beta-like globin genes are found within a 45 kb cluster on chromosome 11 in the following order:      5@#$%&-epsilon--Ggamma--Agamma--delta--beta-3@#$%&. Mutations in the delta-globin gene are associated with beta-thalassemia.
25320	
25321	
25322	
25323	The activity of nonmuscle myosin II (see MYH9; MIM 160775) is regulated by phosphorylation of a regulatory light chain, such as MRLC2. This phosphorylation results in higher MgATPase activity and the assembly of myosin II filaments (Iwasaki et al., 2001).[supplied by OMIM]
25324	
25325	
25326	
25327	
25328	
25329	
25330	
25331	
25332	This gene encodes a member of the bone morphogenetic protein (BMP) receptor family of transmembrane serine/threonine kinases. The ligands of this receptor are BMPs, which are members of the TGF-beta superfamily. BMPs are involved in endochondral bone formation and embryogenesis. These proteins transduce their signals through the formation of heteromeric complexes of 2 different types of serine (threonine) kinase receptors: type I receptors of about 50-55 kD and type II receptors of about 70-80 kD. Type II receptors bind ligands in the absence of type I receptors, but they require their respective type I receptors for signaling, whereas type I receptors require their respective type II receptors for ligand binding. Mutations in this gene have been associated with primary pulmonary hypertension. This gene expresses two transcript variants.
25333	
25334	
25335	
25336	
25337	The protein encoded by this gene, a member of the peptidase C1 family, is a lysosomal cysteine proteinase that may play an important role in corneal physiology. This gene is expressed in colorectal and breast carcinomas but not in normal colon, mammary gland, or peritumoral tissues, suggesting a possible role for this gene in tumor processes.
25338	
25339	
25340	
25341	
25342	
25343	
25344	
25345	Nucleolin (NCL), a eukaryotic nucleolar phosphoprotein, is involved in the synthesis and maturation of ribosomes. It is located mainly in dense fibrillar regions of the nucleolus. Human NCL gene consists of 14 exons with 13 introns and spans approximately 11kb. The intron 11 of the NCL gene encodes a small nucleolar RNA, termed U20.
25346	
25347	
25348	
25349	
25350	
25351	
25352	
25353	
25354	
25355	
25356	
25357	
25358	
25359	
25360	
25361	
25362	This intronless gene encodes a member of the SRY-related HMG-box (SOX) family of transcription factors involved in the regulation of embryonic development and in the determination of cell fate. The encoded protein may act as a transcriptional activator after forming a protein complex with other proteins. Mutations in this gene have been associated with bilateral anophthalmia, a severe form of structural eye malformation. This gene lies within an intron of another gene called SOX2 overlapping transcript (SOX2OT).
25363	
25364	
25365	
25366	
25367	
25368	
25369	
25370	
25371	
25372	
25373	
25374	
25375	
25376	
25377	
25378	
25379	
25380	
25381	
25382	
25383	
25384	
25385	
25386	
25387	
25388	
25389	This gene is a member of the apolipoprotein L gene family. The encoded protein is found in the cytoplasm, where it may affect the movement of lipids or allow the binding of lipids to organelles.
25390	
25391	
25392	
25393	The mouse and human DIO3OS and DIO3 (MIM 601038) genes overlap and are transcribed in opposite directions. The mouse Dio3 gene is imprinted from the paternal allele during fetal development, suggesting that DIO3OS is a noncoding gene that may have a role in maintaining monoallelic expression of DIO3 (Hernandez et al., 2004).[supplied by OMIM]
25394	The region of chromosome 21 between genes CBR and ERG (CBR-ERG region), which spans 2.5 Mb on 21q22.2, has been defined by analysis of patients with partial trisomy 21. It contributes significantly to the pathogenesis of many characteristics of Down syndrome, including morphological features, hypotonia, and mental retardation. This gene is found in this region and multiple transcripts may exist. It is mainly expressed in the placenta.
25395	
25396	
25397	
25398	
25399	
25400	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains several tandem leucine-rich repeats.
25401	
25402	
25403	
25404	
25405	
25406	
25407	Mitogen-activated protein kinase (MAPK) signaling cascades include MAPK or extracellular signal-regulated kinase (ERK), MAPK kinase (MKK or MEK), and MAPK kinase kinase (MAPKKK or MEKK). MAPKK kinase/MEKK phosphorylates and activates its downstream protein kinase, MAPK kinase/MEK, which in turn activates MAPK. The kinases of these signaling cascades are highly conserved, and homologs exist in yeast, Drosophila, and mammalian cells. MAPKKK5 contains 1,374 amino acids with all 11 kinase subdomains. Northern blot analysis shows that MAPKKK5 transcript is abundantly expressed in human heart and pancreas. The MAPKKK5 protein phosphorylates and activates MKK4 (aliases SERK1, MAPKK4) in vitro, and activates c-Jun N-terminal kinase (JNK)/stress-activated protein kinase (SAPK) during transient expression in COS and 293 cells; MAPKKK5 does not activate MAPK/ERK.
25408	
25409	
25410	
25411	
25412	
25413	
25414	
25415	
25416	
25417	
25418	
25419	
25420	
25421	
25422	
25423	
25424	Human gamma-glutamyltransferase catalyzes the transfer of the glutamyl moiety of glutathione to a variety of amino acids and dipeptide acceptors. This heteroduplex enzyme is composed of a heavy chain and a light chain, which are derived from a single precursor protein, and is present in tissues involved in absorption and secretion. This enzyme is a member of the gamma-glutamyltransferase protein family, of which many members have not yet been fully characterized. This gene encodes several transcript variants; studies suggest that many transcripts of this gene family may be non-functional or represent pseudogenes. The functional transcripts which have been fully characterized have been grouped and classified as type I gamma-glutamyltransferase. Complex splicing events may take place in a tissue-specific manner, resulting in marked dissimilarity in the 5@#$%&UTRs. Several 5@#$%&UTR transcript variants of the type I gene have been identified in different tissues and cancer cells.
25425	
25426	
25427	
25428	
25429	
25430	
25431	This gene encodes an isoform of the alpha subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This isoform (alpha 1) is expressed in brain, placenta, lung, liver, kidney, and pancreas, and the other isoform (alpha 2) is expressed in brain, heart and skeletal muscle. This isoform is identified as an autoantigen in 66% of patients with Felty syndrome. This gene has been found to have multiple copies on many chromosomes, some of which, if not all, represent different pseudogenes.
25432	
25433	
25434	
25435	
25436	
25437	This gene encodes the fatty acid binding protein found in epidermal cells, and was first identified as being upregulated in psoriasis tissue. Fatty acid binding proteins are a family of small, highly conserved, cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. It is thought that FABPs roles include fatty acid uptake, transport, and metabolism.
25438	
25439	
25440	
25441	
25442	
25443	
25444	
25445	
25446	
25447	
25448	
25449	This gene encodes a member of the Wiskott-Aldrich syndrome protein family. The gene product is a protein that forms a multiprotein complex that links receptor kinases and actin. Binding to actin occurs through a C-terminal verprolin homology domain in all family members. The multiprotein complex serves to tranduce signals that involve changes in cell shape, motility or function. The published map location (PMID:10381382) has been changed based on recent genomic sequence comparisons, which indicate that the expressed gene is located on chromosome 1, and a pseudogene may be located on chromosome X.
25450	
25451	
25452	
25453	
25454	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
25455	
25456	
25457	Thyrotroph embryonic factor (TEF), a transcription factor, is a member of the PAR (proline and acidic amino acid-rich) subfamily of basic region/leucine zipper (bZIP) transcription factors.  It is expressed in a broad range of cells and tissues in adult animals, however, during embryonic development, TEF expression appears to be restricted to the developing anterior pituitary gland, coincident with the appearance of thyroid-stimulating hormone, beta (TSHB).  Indeed, TEF can bind to, and transactivate the TSHB promoter.  It shows homology (in the functional domains) with other members of the PAR-bZIP subfamily of transcription factors, which include albumin D box-binding protein (DBP), human hepatic leukemia factor (HLF) and chicken vitellogenin gene-binding protein (VBP); VBP is considered the chicken homologue of TEF.  Different members of the subfamily can readily form heterodimers, and share DNA-binding, and transcriptional regulatory properties.
25458	MAP3K, or MEK kinase, is a serine/threonine kinase that occupies a pivotal role in a network of phosphorylating enzymes integrating cellular responses to a number of mitogenic and metabolic stimuli, including insulin (MIM 176730) and many growth factors.[supplied by OMIM]
25459	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L3P family of ribosomal proteins. It is located in the cytoplasm. The protein can bind to the HIV-1 TAR mRNA, and it has been suggested that the protein contributes to tat-mediated transactivation. This gene is co-transcribed with the small nucleolar RNA genes U43, U86, U83a, and U83b, which are located in its first, third, fifth, and seventh introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
25460	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L3P family of ribosomal proteins. It is located in the cytoplasm. The protein can bind to the HIV-1 TAR mRNA, and it has been suggested that the protein contributes to tat-mediated transactivation. This gene is co-transcribed with the small nucleolar RNA genes U43, U86, U83a, and U83b, which are located in its first, third, fifth, and seventh introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
25461	
25462	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
25463	
25464	
25465	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein binds to the mRNA and remains bound after nuclear export, acting as a nucleocytoplasmic shuttling protein. This protein contains many serine residues. Two splice variants have been found for this gene; both variants encode the same protein.
25466	
25467	
25468	
25469	
25470	
25471	The function of this gene has not yet been determined. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of these variants has not been determined.
25472	
25473	
25474	
25475	
25476	
25477	
25478	
25479	Dehydrodolichyl diphosphate (dedol-PP) synthase catalyzes cis-prenyl chain elongation to produce the polyprenyl backbone of dolichol, a glycosyl carrier lipid required for the biosynthesis of several classes of glycoproteins.[supplied by OMIM]
25480	This gene encodes a receptor-type protein tyrosine phosphatase containing a single intracellular catalytic domain with a characteristic signature motif. The gene product, which has a transmembrane domain, is an integral membrane protein. Several alternatively spliced transcript variants, some of which encode different isoforms of the protein, have been described. These variants exhibit tissue-specific expression.
25481	
25482	
25483	
25484	
25485	
25486	The product of this gene belongs to the phosphatase 2A regulatory subunit B family. Protein phosphatase 2A is one of the four major Ser/Thr phosphatases, and it is implicated in the negative control of cell growth and division. It consists of a common heteromeric core enzyme, which is composed of a catalytic subunit and a constant regulatory subunit, that associates with a variety of regulatory subunits. The B regulatory subunit might modulate substrate selectivity and catalytic activity. This gene encodes a gamma isoform of the regulatory subunit B56 subfamily. Alternatively spliced transcript variants encoding different isoforms have been identified.
25487	
25488	The WNK1 gene encodes a cytoplasmic serine-threonine kinase expressed in distal nephron.[supplied by OMIM]
25489	
25490	
25491	
25492	
25493	Prostaglandin-endoperoxide synthase (PTGS), also known as cyclooxygenase, is the key enzyme in prostaglandin biosynthesis, and acts both as a dioxygenase and as a peroxidase. There are two isozymes of PTGS: a constitutive PTGS1 and an inducible PTGS2, which differ in their regulation of expression and tissue distribution. This gene encodes PTGS2, which shows 86% - 89% amino acid sequence identity with mouse, rat, sheep, bovine, horse and rabit PTGS2 proteins, respectively. Human PTGS2 is expressed in a limited number of cell types and regulated by specific stimulatory events, suggesting that it is responsible for the prostanoid biosynthesis involved in inflammation and mitogenesis. The expression of this gene is deregulated in epithelial tumors.
25494	
25495	
25496	
25497	L-type calcium channels are composed of five subunits. The protein encoded by this gene represents one of these subunits, gamma, and is one of several gamma subunit proteins. It is an integral membrane protein that is thought to stabilize the calcium channel in an inactive (closed) state. This protein is similar to the mouse stargazin protein, mutations in which have been associated with absence seizures, also known as petit-mal or spike-wave seizures. This gene is a member of the neuronal calcium channel gamma subunit gene subfamily of the PMP-22/EMP/MP20 family.
25498	Aminoacyl-tRNA synthetases catalyze the aminoacylation of tRNA by their cognate amino acid. Because of their central role in linking amino acids with nucleotide triplets contained in tRNAs, aminoacyl-tRNA synthetases are thought to be among the first proteins that appeared in evolution. Arginyl-tRNA synthetase belongs to the class-I aminoacyl-tRNA synthetase family.
25499	Opsins, including OPN3, are members of the superfamily of guanine nucleotide-binding protein (G protein)-coupled receptors, which function through the activation of a G protein and an effector enzyme. Opsin proteins consist of a single polypeptide chain of 340 to 500 amino acids that form 7 alpha-helical transmembrane regions connected by cytoplasmic and extracellular loops.[supplied by OMIM]
25500	
25501	
25502	
25503	
25504	The protein encoded by this gene is a member of the peptidyl-prolyl cis-trans isomerase (PPIase) family. PPIases catalyze the cis-trans isomerization of proline imidic peptide bonds in oligopeptides and accelerate the folding of proteins. This protein is a cyclosporin binding-protein. It may play a role in cyclosporin A-mediated immunosuppression. This protein can interact with several HIV proteins including p55 gag, Vpr, and capsid protein. It has been shown to be necessary for the formation of infectious HIV virions. Multiple pseudogenes that map to different chromosomes have been reported. Three alternatively spliced transcript variants encoding two distinct isoforms have been observed.
25505	
25506	
25507	
25508	
25509	
25510	
25511	
25512	
25513	
25514	
25515	
25516	
25517	This gene encodes a protein that contains laminin EGF motifs, a pentaxin domain, and many fibronectin type III motifs. The encoded basement membrane-associated protein may be important in development and homeostasis of the inner ear and retina. Mutations within this gene have been associated with Usher syndrome type IIa. Alternatively spliced transcript variants that encode different isoforms have been described.
25518	Bilateral symmetric nervous systems have special midline structures that establish a partition between the two mirror image halves. Some axons project toward and across the midline in response to long-range chemoattractants emanating from the midline. In Drosophila, the roundabout gene, a member of the immunoglobulin gene superfamily, encodes an integral membrane protein that is both an axon guidance receptor and a cell adhesion receptor. This receptor is involved in the decision by axons to cross the central nervous system midline. The protein encoded by this gene is structurally similar to the Drosophila roundabout protein. Two transcript variants encoding different isoforms have been found for this gene.
25519	
25520	
25521	
25522	The product encoded by this gene belongs to the heat shock protein 70 family which contains both heat-inducible and constitutively expressed members. The latter are called heat-shock cognate proteins. This gene encodes a heat-shock cognate protein. This protein plays a role in the control of cell proliferation. It may also act as a chaperone.
25523	
25524	The protein encoded by this gene interacts with poly(A)-binding protein and with the cap-binding complex eIF4A. It is involved in translational initiation and protein biosynthesis. Overexpression of this gene in COS7 cells stimulates translation. Alternative splicing occurs at this locus and three transcript variants encoding three distinct isoforms have been identified.
25525	
25526	
25527	
25528	
25529	
25530	
25531	This gene encodes a protein-disulfide isomerase. Its expression is induced by hypoxia and its role may be to protect hypoxic cells from apoptosis. This gene can be co-transcribed with the upstream gene MU.
25532	
25533	
25534	
25535	
25536	
25537	This gene encodes a member of the WAP-type four-disulfide core (WFDC) domain family. The WFDC domain, or WAP signature motif, contains eight cysteines forming four disulfide bonds at the core of the protein, and functions as a protease inhibitor. Most WFDC gene members are localized to chromosome 20q12-q13 in two clusters: centromeric and telomeric. This gene belongs to the telomeric cluster. Two alternatively spliced transcript variants have been found for this gene, and they encode distinct isoforms.
25538	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. More specifically, it may function in fertility, embryo viability, and regulation of hematopoietic lineage commitment. Two transcript variants encoding different isoforms have been found for this gene.
25539	
25540	
25541	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes. They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis. Many of the effects of laminin are mediated through interactions with cell surface receptors. These receptors include members of the integrin family, as well as non-integrin laminin-binding proteins. This gene encodes a high-affinity, non-integrin family, laminin receptor 1. This receptor has been variously called 67 kD laminin receptor, 37 kD laminin receptor precursor (37LRP) and p40 ribosome-associated protein. The amino acid sequence of laminin receptor 1 is highly conserved through evolution, suggesting a key biological function. It has been observed that the level of the laminin receptor transcript is higher in colon carcinoma tissue and lung cancer cell line than their normal counterparts. Also, there is a correlation between the upregulation of this polypeptide in cancer cells and their invasive and metastatic phenotype. Multiple copies of this gene exist, however, most of them are pseudogenes thought to have arisen from retropositional events. Two alternatively spliced transcript variants encoding the same protein have been found for this gene.
25542	
25543	
25544	
25545	
25546	
25547	
25548	
25549	
25550	
25551	
25552	
25553	
25554	
25555	
25556	
25557	
25558	The protein encoded by this gene belongs to the cytokine receptor family. It is an accessory chain essential for the active interleukin 10 receptor complex. Coexpression of this and IL10RA proteins has been shown to be required for IL10-induced signal transduction. This gene and three other interferon receptor genes, IFAR2, IFNAR1, and IFNGR2, form a class II cytokine receptor gene cluster located in a small region on chromosome 21.
25559	
25560	
25561	
25562	
25563	
25564	The protein encoded by this gene is a member of the interleukin 1 receptor family and is similar to the interleukin 1 accessory proteins. It is most closely related to interleukin 1 receptor accessory protein-like 2 (IL1RAPL2). This gene and IL1RAPL2 are located at a region on chromosome X that is associated with X-linked non-syndromic mental retardation. Deletions and mutations in this gene were found in patients with mental retardation. This gene is expressed at a high level in post-natal brain structures involved in the hippocampal memory system, which suggests a specialized role in the physiological processes underlying memory and learning abilities.
25565	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Two transcript variants encoding different isoforms exist for this gene.
25566	
25567	
25568	This gene encodes a member of the C-type lectin/C-type lectin-like domain (CTL/CTLD) superfamily. Members of this family share a common protein fold and have diverse functions, such as cell adhesion, cell-cell signalling, glycoprotein turnover, and roles in inflammation and immune response. The encoded type II transmembrane protein is a downstream target of CCAAT/enhancer binding protein (C/EBP), beta (CEBPB) and may play a role in an inflammation. Alternative splice variants have been described but their full-length sequence has not been determined. This gene is closely linked to other CTL/CTLD superfamily members on chromosome 12p13 in the natural killer gene complex region.
25569	
25570	
25571	
25572	
25573	
25574	
25575	
25576	
25577	
25578	
25579	
25580	
25581	The protein encoded by this gene is a receptor for glucocorticoids and can act as both a transcription factor and a regulator of other transcription factors. The encoded protein can bind DNA as a homodimer or as a heterodimer with another protein such as the retinoid X receptor. This protein can also be found in heteromeric cytoplasmic complexes along with heat shock factors and immunophilins. The protein is typically found in the cytoplasm until it binds a ligand, which induces transport into the nucleus. Mutations in this gene are a cause of glucocorticoid resistance, or cortisol resistance. Alternate splicing, the use of at least three different promoters, and alternate translation initiation sites result in several transcript variants encoding the same protein or different isoforms, but the full-length nature of some variants has not been determined.
25582	
25583	
25584	
25585	
25586	EP300 encodes the adenovirus E1A-associated cellular p300 transcriptional co-activator protein. p300 is related by sequence to CBP (CREB-binding protein [CREB: cyclic-AMP responsive element binding protein]), and like CBP can stimulate transcription through activation of CREB. This EP300 activity is specifically inhibited by the adenovirus oncoprotein E1A. EP300 has also been identified as a co-activator of HIF1A (hypoxia-inducible factor 1 alpha), and thus plays a role in the stimulation of hypoxia-induced genes such as VEGF.
25587	
25588	
25589	
25590	This gene encodes a member of the adenylate kinase family of enzymes. The encoded protein is localized to the mitochondrial matrix. Adenylate kinases regulate the adenine and guanine nucleotide compositions within a cell by catalyzing the reversible transfer of phosphate group among these nucleotides. Five isozymes of adenylate kinase have been identified in vertebrates. Expression of these isozymes is tissue-specific and developmentally regulated. A pseudogene for this gene has been located on chromosome 17. Three transcript variants encoding the same protein have been identified for this gene. Sequence alignment suggests that the gene defined by NM_013410, NM_203464, and NM_001005353 is located on chromosome 1.
25591	
25592	
25593	
25594	
25595	TGFB is a multifunctional peptide that controls proliferation, differentiation, and other functions in many cell types. TGFB acts synergistically with TGFA (MIM 190170) in inducing transformation. It also acts as a negative autocrine growth factor. Dysregulation of TGFB activation and signaling may result in apoptosis. Many cells synthesize TGFB and almost all of them have specific receptors for this peptide. TGFB1, TGFB2 (MIM 190220), and TGFB3 (MIM 190230) all function through the same receptor signaling systems.[supplied by OMIM]
25596	
25597	In mammalian cells, 2 broad groups of centromere-interacting proteins have been described: constitutively binding centromere proteins and @#$%&passenger,@#$%& or transiently interacting, proteins (reviewed by Choo, 1997). The constitutive proteins include CENPA (centromere protein A; MIM 117139), CENPB (MIM 117140), CENPC1 (MIM 117141), and CENPD (MIM 117142). The term @#$%&passenger proteins@#$%& encompasses a broad collection of proteins that localize to the centromere during specific stages of the cell cycle (Earnshaw and Mackay, 1994). These include CENPE (MIM 117143); MCAK (MIM 604538); KID (MIM 603213); cytoplasmic dynein (e.g., MIM 600112); CliPs (e.g., MIM 179838); and CENPF/mitosin (MIM 600236). The inner centromere proteins (INCENPs) (Earnshaw and Cooke, 1991), the initial members of the passenger protein group, display a broad localization along chromosomes in the early stages of mitosis but gradually become concentrated at centromeres as the cell cycle progresses into mid-metaphase. During telophase, the proteins are located within the midbody in the intercellular bridge, where they are discarded after cytokinesis (Cutts et al., 1999).[supplied by OMIM]
25598	
25599	F-box proteins are an expanding family of eukaryotic proteins characterized by an approximately 40 amino acid motif, the F box. Some F-box proteins have been shown to be critical for the ubiquitin-mediated degradation of cellular regulatory proteins. In fact, F-box proteins are one of the four subunits of ubiquitin protein ligases, called SCFs. SCF ligases bring ubiquitin conjugating enzymes to substrates that are specifically recruited by the different F-box proteins. A large family of mammalian F-box proteins has recently been identified and classified into three groups based on the presence of either the WD-40 repeats, the leucine-rich repeats, or the presence or absence of other protein-protein interacting domains. The FBXW2 gene product, the second identified member of the F-box gene family, contains multiple WD-40 repeats.
25600	
25601	
25602	
25603	
25604	
25605	
25606	The protein encoded by this gene is a nuclear transcriptional co-activator for peroxisome proliferator activated receptor alpha. The encoded protein contains a zinc finger and is a helicase that appears to be part of the peroxisome proliferator activated receptor alpha interacting complex. This gene is a member of the DNA2/NAM7 helicase gene family.
25607	
25608	
25609	
25610	
25611	
25612	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the ATPase subunits, a member of the triple-A family of ATPases which have a chaperone-like activity. Pseudogenes have been identified on chromosomes 8 and 12.
25613	
25614	
25615	This gene encodes a member of the claudin family. Claudins are integral membrane proteins and components of tight junction strands. Tight junction strands serve as a physical barrier to prevent solutes and water from passing freely through the paracellular space between epithelial or endothelial cell sheets. Mutations in this gene have been found in patients with velocardiofacial syndrome.
25616	
25617	
25618	
25619	
25620	
25621	
25622	
25623	
25624	
25625	
25626	The protein encoded by this gene is a component of the exocyst complex, a multiple protein complex essential for targeting exocytic vesicles to specific docking sites on the plasma membrane. Though best characterized in yeast, the component proteins and functions of exocyst complex have been demonstrated to be highly conserved in higher eukaryotes. At least eight components of the exocyst complex, including this protein, are found to interact with the v-ral simian leukemia viral oncogene homolog A (RALA), a Ras-related small GTPase involved in controlling actin cytoskeletal remodeling and vesicle transport. The exocyst complex containing this protein is required for targeting the glucose transporter GLUT4 to the plasma membrane by insulin. The protein complex containing the similar protein in rat was reported to be involved in synaptic vesicle docking and fusion at the presynaptic terminal.
25627	
25628	
25629	
25630	VAV3, a third representative of the Vav family, participates in signal transduction processes. It is also involved in the activation of several members of the Rho family. This activity, when deregulated, leads to marked cytoskeletal changes and to alterations in the process of cell division.
25631	
25632	
25633	
25634	
25635	
25636	
25637	
25638	
25639	
25640	
25641	
25642	
25643	
25644	
25645	
25646	
25647	
25648	
25649	This gene encodes a member of the aldo/keto reductase superfamily, which consists of more than 40 known enzymes and proteins. This member, also known as aldehyde reductase, is involved in the reduction of biogenic and xenobiotic aldehydes and is present in virtually every tissue. Alternative splicing of this gene results in two transcript variants encoding the same protein.
25650	This gene encodes a member of the aldo/keto reductase superfamily, which consists of more than 40 known enzymes and proteins. This member, also known as aldehyde reductase, is involved in the reduction of biogenic and xenobiotic aldehydes and is present in virtually every tissue. Alternative splicing of this gene results in two transcript variants encoding the same protein.
25651	
25652	
25653	
25654	
25655	
25656	
25657	
25658	
25659	
25660	
25661	
25662	
25663	
25664	
25665	
25666	
25667	FSHD region gene 1 maps 100 kb centromeric of the repeat units on chromosome 4q35 which are deleted in facioscapulohumeral muscular dystrophy. The FRG1 gene is evolutionarily conserved and has related sequences on multiple human chromosomes.  DNA sequence analysis did not reveal any homology to known genes.
25668	
25669	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains a PEST motif, which often serves as a protein-protein interaction domain, and may be related to protein intracellular half-live. This gene was found to be expressed in brain, colon tissues, and several different tumor-derived cell lines. The bilogical function of this PTP has not yet been determined.
25670	This gene encodes a member of the phosducin-like protein family and is a putative modulator of heterotrimeric G proteins. The protein shares extensive amino acid sequence homology with phosducin. Members of the phosducin-like protein family have been shown to bind to the beta-gamma subunits of G proteins.
25671	
25672	A cluster of genes, BAT1-BAT5, has been localized in the vicinity of the genes for TNF alpha and TNF beta. This gene is found near this cluster; it was mapped near the gene for C2 within a 120-kb region that included a HSP70 gene pair. These genes are all within the human major histocompatibility complex class III region. This gene was thought to be two different genes, NG36 and G9a, adjacent to each other but a recent publication shows that there is only a single gene. The protein encoded by this gene is thought to be involved in intracellular protein-protein interaction. There are three alternatively spliced transcript variants of this gene but only two are fully described.
25673	
25674	
25675	This gene encodes an element of the machinery for raft-mediated trafficking in endothelial cells. The encoded protein, a member of the MAL proteolipid family, predominantly localizes in glycolipid- and cholesterol-enriched membrane (GEM) rafts. It interacts with caveolin-1.
25676	
25677	
25678	
25679	
25680	
25681	
25682	
25683	
25684	
25685	
25686	
25687	This gene encodes the beta subunit of casein kinase II, a ubiquitous protein kinase which regulates metabolic pathways, signal transduction, transcription, translation, and replication. The enzyme is composed of three subunits, alpha, alpha prime and beta, which form a tetrameric holoenzyme. The alpha and alpha prime subunits are catalytic, while the beta subunit serves regulatory functions. The enzyme localizes to the endoplasmic reticulum and the Golgi apparatus.
25688	
25689	
25690	
25691	
25692	
25693	The eukaryotic cell cycle is governed by cyclin-dependent protein kinases (CDKs) whose activities are regulated by cyclins and CDK inhibitors. The 8 species of cyclins reported in mammals, cyclins A through H, share a conserved amino acid sequence of about 90 residues called the cyclin box. The amino acid sequence of cyclin G is well conserved among mammals. The nucleotide sequence of cyclin G1 and cyclin G2 are 53% identical. Unlike cyclin G1, cyclin G2 contains a C-terminal PEST protein destabilization motif, suggesting that cyclin G2 expression is tightly regulated through the cell cycle.
25694	
25695	
25696	
25697	
25698	
25699	
25700	
25701	
25702	
25703	Precerebellin is similar to the noncollagen domain of complement component C1qB. It gives rise to several truncated derivatives, including the hexadecapeptide cerebellin which is highly enriched in postsynaptic structures of cerebellar Purkinje cells in cartwheel neurons of the dorsal cochlear nucleus.
25704	
25705	
25706	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene includes 2 alternatively spliced transcripts encoding 2 different isoforms. The larger isoform is a DEAD box protein with RNA helicase activity. It may participate in melting of DNA:RNA hybrids, such as those that occur during transcription, and may play a role in X-linked gene expression. It contains 2 copies of a double-stranded RNA-binding domain, a DEXH core domain and an RGG box. The RNA-binding domains and RGG box influence and regulate RNA helicase activity. The smaller isoform is a lymphocyte granule protein. It lacks RNA-binding domains and DEXH core domain, but contains an RGG box, which may render this isoform RNA binding function.
25707	The protein encoded by this gene belongs to the ser/thr protein kinase family. It is the catalytic subunit of the 5@#$%&-prime-AMP-activated protein kinase (AMPK). AMPK is a cellular energy sensor conserved in all eukaryotic cells. The kinase activity of AMPK is activated by the stimuli that increase the cellular AMP/ATP ratio. AMPK regulates the activities of a number of key metabolic enzymes through phosphorylation. It protects cells from stresses that cause ATP depletion by switching off ATP-consuming biosynthetic pathways. Alternatively spliced transcript variants encoding distinct isoforms have been observed.
25708	This gene includes two alternatively spliced transcript variants, which encode different isoforms. The function of this gene has not been determined.
25709	
25710	
25711	This gene encodes a protein containing a SET domain, 2 LXXLL motifs, 3 nuclear translocation signals (NLSs), 4 plant homeodomain (PHD) finger regions, and a proline-rich region. The encoded protein enhances androgen receptor (AR) transactivation, and this enhancement can be increased further in the presence of other androgen receptor associated coregulators. This protein may act as a nucleus-localized, basic transcriptional factor and also as a bifunctional transcriptional regulator. Mutations of this gene have been associated with Sotos syndrome and Weaver syndrome. One version of childhood acute myeloid leukemia is the result of a cryptic translocation with the breakpoints occurring within nuclear receptor-binding Su-var, enhancer of zeste, and trithorac domain protein 1 on chromosome 5 and nucleoporin, 98-kd on chromosome 11. Two transcript variants encoding distinct isoforms have been identified for this gene.
25712	
25713	
25714	
25715	
25716	
25717	
25718	
25719	
25720	
25721	
25722	
25723	The protein encoded by this gene is a member of the LAR protein-tyrosine phosphatase-interacting protein (liprin) family. Liprins interact with members of LAR family of transmembrane protein tyrosine phosphatases, which are known to be important for axon guidance and mammary gland development. It has been proposed that liprins are multivalent proteins that form complex structures and act as scaffolds for the recruitment and anchoring of LAR family of tyrosine phosphatases. This protein was found to interact with S100A4, a calcium-binding protein related to tumor invasiveness and metastasis. In vitro experiment demonstrated that the interaction inhibited the phosphorylation of this protein by protein kinase C and protein kinase CK2. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
25724	
25725	
25726	Defensins are a family of microbicidal and cytotoxic peptides thought to be involved in host defense. They are abundant in the granules of neutrophils and also found in the epithelia of mucosal surfaces such as those of the intestine, respiratory tract, urinary tract, and vagina. Members of the defensin family are highly similar in protein sequence and distinguished by a conserved cysteine motif. Several alpha defensin genes appear to be clustered on chromosome 8. The protein encoded by this gene, defensin, alpha 1, is found in the microbicidal granules of neutrophils and likely plays a role in phagocyte-mediated host defense. It differs from defensin, alpha 3 by only one amino acid.
25727	
25728	
25729	
25730	
25731	
25732	
25733	
25734	
25735	
25736	
25737	
25738	
25739	
25740	
25741	
25742	Glutamic-oxaloacetic transaminase is a pyridoxal phosphate-dependent enzyme which exists in cytoplasmic and mitochondrial forms, GOT1 and GOT2, respectively.  GOT plays a role in amino acid metabolism and the urea and tricarboxylic acid cycles. The two enzymes are homodimeric and show close homology.
25743	
25744	
25745	The protein encoded by this gene has been identified as a brain-specific angiogenesis inhibitor (BAI1)-binding protein.  This interaction at the cytoplasmic membrane is crucial to the function of this protein, which may be involved in neuronal growth-cone guidance.  This protein functions as an insulin receptor tyrosine kinase substrate and suggests a role for insulin in the central nervous system.  This protein has also been identified as interacting with the dentatorubral-pallidoluysian atrophy gene, which is associated with an autosomal dominant neurodegenerative disease.  It also associates with a downstream effector of Rho small G proteins, which is associated with the formation of stress fibers and cytokinesis.  Alternative splicing of the 3@#$%&-end of this gene results in three products of undetermined function.
25746	
25747	
25748	
25749	
25750	
25751	
25752	
25753	
25754	
25755	
25756	
25757	
25758	
25759	
25760	
25761	
25762	Transaldolase 1 is a key enzyme of the nonoxidative pentose phosphate pathway providing ribose-5-phosphate for nucleic acid synthesis and NADPH for lipid biosynthesis. This pathway can also maintain glutathione at a reduced state and thus protect sulfhydryl groups and cellular integrity from oxygen radicals. The functional gene of transaldolase 1 is located on chromosome 11 and a pseudogene is identified on chromosome 1 but there are conflicting map locations. The second and third exon of this gene were developed by insertion of a retrotransposable element. This gene is thought to be involved in multiple sclerosis.
25763	
25764	
25765	
25766	
25767	
25768	This gene encodes a cyclin-dependent kinase inhibitor, which shares a limited similarity with CDK inhibitor CDKN1A/p21. The encoded protein binds to and prevents the activation of cyclin E-CDK2 or cyclin D-CDK4 complexes, and thus controls the cell cycle progression at G1. The degradation of this protein, which is triggered by its CDK dependent phosphorylation and subsequent ubiquitination by SCF complexes, is required for the cellular transition from quiescence to the proliferative state.
25769	
25770	
25771	
25772	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
25773	
25774	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. The encoded protein is one of two functional homologs to yeast Upf3p. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein binds to the mRNA and remains bound after nuclear export, acting as a nucleocytoplasmic shuttling protein. It forms with Y14 a complex that binds specifically 20 nt upstream of exon-exon junctions. This gene is located on the long arm of chromosome 13. Two splice variants encoding different isoforms have been found for this gene.
25775	
25776	
25777	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator. A pseudogene has been identified on chromosome 17.
25778	
25779	ARHJ belongs to the Rho family of small GTP-binding proteins. Rho proteins regulate the dynamic assembly of cytoskeletal components for several physiologic processes, such as cell proliferation and motility and the establishment of cell polarity. They are also involved in pathophysiologic process, such as cell transformation and metastasis.[supplied by OMIM]
25780	
25781	
25782	The protein encoded by this gene is a member of the BAG1-related protein family. BAG1 is an anti-apoptotic protein that functions through interactions with a variety of cell apoptosis and growth related proteins including BCL-2, Raf-protein kinase, steroid hormone receptors, growth factor receptors and members of the heat shock protein 70 kDa family. This protein contains a BAG domain near the C-terminus, which could bind and inhibit the chaperone activity of Hsc70/Hsp70. Three transcript variants encoding two different isoforms have been found for this gene.
25783	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. However, the protein encoded by this gene is a member of the membrane-type MMP (MT-MMP) subfamily; each member of this subfamily contains a potential transmembrane domain suggesting that these proteins are expressed at the cell surface rather than secreted. This protein activates MMP2 protein, and this activity may be involved in tumor invasion.
25784	
25785	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit functions as guanine nucleotide exchange factor. It is reported that this subunit interacts with HIV-1 Tat, and thus it represses the translation of host-cell, but not HIV-1, mRNAs. Several alternatively spliced transcript variants have been found for this gene, however, the full length nature of only two variants has been determined.
25786	
25787	
25788	
25789	
25790	Heparan sulfate biosynthetic enzymes are key components in generating a myriad of distinct heparan sulfate fine structures that carry out multiple biologic activities. The enzyme encoded by this gene is a member of the heparan sulfate biosynthetic enzyme family. It is a type II integral membrane protein and responsible for 6-O-sulfation of heparan sulfate. This enzyme does not share significant sequence similarity to other known sulfotransferases.
25791	
25792	
25793	
25794	
25795	
25796	
25797	The beta-adrenergic receptor kinase specifically phosphorylates the agonist-occupied form of the beta-adrenergic and related G protein-coupled receptors. Overall, the ADRBK2 enzyme has 85% amino acid similarity with ADRBK1, with the protein kinase catalytic domain having 95% similarity. The ADRBK2 mRNA is approximately 8 kilobases with a distribution similar to that of ADRBK1. These data suggest the existence of a family of receptor kinases which may serve broadly to regulate receptor function
25798	
25799	
25800	
25801	
25802	
25803	
25804	Tumor necrosis factor alpha (TNF-alpha) is a signaling molecule that interacts with one of two receptors on cells targeted for apoptosis. The apoptotic signal is transduced inside these cells by cytoplasmic adaptor proteins. The protein encoded by this gene is a death domain-containing adaptor protein that interacts with the death domain of TNF-alpha receptor 1 to activate mitogen-activated protein kinase (MAPK) and propagate the apoptotic signal. It is membrane-bound and expressed at a higher level in neoplastic cells than in normal cells. Several transcript variants encoding different isoforms have been described for this gene.
25805	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene contains introns and its mRNA is poyadenylated, unlike most histone genes. The protein encoded is a member of the histone H3 family.
25806	
25807	
25808	Growth arrest-specific 7 is expressed primarily in terminally differentiated brain cells and predominantly in mature cerebellar Purkinje neurons. GAS7 plays a putative role in neuronal development. Several transcript variants encoding proteins which vary in the N-terminus have been described.
25809	
25810	Proteins of the large Rab GTPase family (see RAB1A; MIM 179508) have regulatory roles in the formation, targeting, and fusion of intracellular transport vesicles. RAB11FIP3 is one of many proteins that interact with and regulate Rab GTPases (Hales et al., 2001).[supplied by OMIM]
25811	
25812	
25813	
25814	
25815	
25816	
25817	
25818	
25819	
25820	
25821	
25822	
25823	
25824	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein that is known to complex with integrins and other transmembrane 4 superfamily proteins. It may play a role in T-cell-B-cell interactions.
25825	
25826	
25827	
25828	
25829	
25830	
25831	
25832	
25833	
25834	
25835	
25836	
25837	
25838	
25839	
25840	In vertebrates, the genes encoding the class of transcription factors called homeobox genes are found in clusters named A, B, C, and D on four separate chromosomes. Expression of these proteins is spatially and temporally regulated during embryonic development. This gene is part of the A cluster on chromosome 7 and encodes a DNA-binding transcription factor which may regulate gene expression, morphogenesis, and differentiation. Methylation of this gene may result in the loss of its expression and, since the encoded protein upregulates the tumor suppressor p53, this protein may play an important role in tumorigenesis.
25841	
25842	
25843	
25844	
25845	
25846	
25847	
25848	Pyrimidine 5-prime-nucleotidase (P5N; EC 3.1.3.5), also called uridine 5-prime monophosphate hydrolase (UMPH), catalyzes the dephosphorylation of the pyrimidine 5-prime monophosphates UMP and CMP to the corresponding nucleosides. There are 2 isozymes of pyrimidine 5-prime nucleotidase in red blood cells, referred to as type I (UMPH1) and type II (UMPH2; MIM 191720). The 2 enzymes are not separable by electrophoresis in humans but have distinct kinetic properties, and the proteins show no homology.[supplied by OMIM]
25849	
25850	This gene encodes one of the members of the superfamily of potassium channel proteins containing two pore-forming P domains. The gene product is an outwardly rectifying channel that is sensitive to changes in extracellular pH and is inhibited by extracellular acidification. Also referred to as an acid-sensitive potassium channel, it is activated by the anesthetics halothane and isoflurane. Although three transcripts are detected in northern blots, there is currently no sequence available to confirm transcript variants for this gene.
25851	
25852	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
25853	
25854	
25855	
25856	
25857	
25858	The preB cell receptor is found on the surface of proB and preB cells, where it is involved in transduction of signals for cellular proliferation, differentiation from the proB cell to the preB cell stage, allelic exclusion at the Ig heavy chain gene locus, and promotion of Ig light chain gene rearrangements. The preB cell receptor is composed of a membrane-bound Ig mu heavy chain in association with a heterodimeric surrogate light chain. This gene encodes one of the surrogate light chain subunits and is a member of the immunoglobulin gene superfamily. This gene does not undergo rearrangement. Mutations in this gene can result in B cell deficiency and agammaglobulinemia, an autosomal recessive disease in which few or no gamma globulins or antibodies are made. Two transcript variants encoding different isoforms have been found for this gene.
25859	
25860	
25861	
25862	This gene is expressed ubiquitously with higher levels in fetal than in adult tissues. It encodes a protein sharing 93% sequence identity with the mouse protein. Wolf-Hirschhorn syndrome (WHS) is a malformation syndrome associated with a hemizygous deletion of the distal short arm of chromosome 4. This gene is mapped to the 165 kb WHS critical region, and may play a role in the phenotype of the WHS or Pitt-Rogers-Danks syndrome. The encoded protein is found to be capable of reacting with HLA-A2-restricted and tumor-specific cytotoxic T lymphocytes, suggesting a target for use in specific immunotherapy for a large number of cancer patients. This protein has also been shown to be a member of the NELF (negative elongation factor) protein complex that participates in the regulation of RNA polymerase II transcription elongation.
25863	
25864	
25865	
25866	
25867	
25868	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein and is highly similar in sequence to the transmembrane 4 superfamily member 2. The use of alternate polyadenylation sites has been found for this gene.
25869	This gene is a member of the MAGEB gene family. The members of this family have their entire coding sequences located in the last exon, and the encoded proteins show 50 to 68% sequence identity to each other. The promoters and first exons of the MAGEB genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. The MAGEB genes are clustered on chromosome Xp22-p21. This gene sequence ends in the first intron of MAGEB1, another family member. This gene is expressed in testis.
25870	
25871	
25872	
25873	
25874	
25875	
25876	This gene encodes a protein which contains WD repeats and an FYVE domain. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
25877	
25878	
25879	
25880	
25881	This gene encodes a member of the EDG family of receptors, which are G protein-coupled receptors. This protein has been identified as a functional receptor for sphingosine 1-phosphate and likely contributes to the regulation of angiogenesis and vascular endothelial cell function.
25882	
25883	The protein encoded by this gene binds to calmodulin and is expressed in the centrosome. It is an integral component of the pericentriolar material (PCM). The function of this protein has not been determined.
25884	
25885	
25886	
25887	
25888	
25889	
25890	
25891	
25892	This gene encodes a member of the Ced-4 family of apoptosis proteins. Ced-family members contain a caspase recruitment domain (CARD) and are known to be key mediators of programmed cell death. The encoded protein contains a distinct N-terminal pyrin-like motif, which is possibly involved in protein-protein interactions. This protein interacts strongly with caspase 2 and weakly with caspase 9. Overexpression of this gene was demonstrated to induce apoptosis in cells. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants has not been determined.
25893	
25894	
25895	
25896	
25897	
25898	
25899	
25900	This gene encodes a protein with multiple coiled coil motifs which is located in the nucleus, cytoplasm and mitochondria. The protein is involved in neurite outgrowth and cortical development through its interaction with other proteins. This gene is disrupted in a t(1;11)(q42.1;q14.3) translocation which segregates with schizophrenia and related psychiatric disorders in a large Scottish family. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
25901	
25902	
25903	
25904	
25905	
25906	
25907	
25908	
25909	L-glutamate is the major excitatory neurotransmitter in the central nervous system and activates both ionotropic and metabotropic glutamate receptors.  Glutamatergic neurotransmission is involved in most aspects of normal brain function and can be perturbed in many neuropathologic conditions. The metabotropic glutamate receptors are a family of G protein-coupled receptors, that have been divided into 3 groups on the basis of sequence homology, putative signal transduction mechanisms, and pharmacologic properties.  Group I includes GRM1 and GRM5 and these receptors have been shown to activate phospholipase C.  Group II includes GRM2 and GRM3 while Group III includes GRM4, GRM6, GRM7 and GRM8.  Group II and III receptors are linked to the inhibition of the cyclic AMP cascade but differ in their agonist selectivities.  Alternative splice variants of GRM8 have been described but their full-length nature has not been determined.
25910	
25911	
25912	
25913	
25914	
25915	
25916	
25917	
25918	
25919	
25920	
25921	
25922	
25923	
25924	
25925	
25926	
25927	
25928	
25929	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
25930	
25931	
25932	
25933	This gene encodes a member of the neuropilin family of receptor proteins. The encoded transmembrane protein binds to SEMA3C protein {sema domain, immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin) 3C} and SEMA3F protein {sema domain, immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin) 3F}, and interacts with vascular endothelial growth factor (VEGF). This protein may play a role in cardiovascular development, axon guidance, and tumorigenesis. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
25934	
25935	
25936	
25937	
25938	This gene encodes a protein related to talin 1, a cytoskeletal protein that plays a significant role in the assembly of actin filaments and in spreading and migration of various cell types, including fibroblasts and osteoclasts. This protein has a different pattern of expression compared to talin 1 but, like talin 1, is thought to associate with unique transmembrane receptors to form novel linkages between extracellular matrices and the actin cytoskeleton.
25939	
25940	
25941	
25942	
25943	
25944	
25945	
25946	
25947	
25948	
25949	
25950	
25951	
25952	
25953	
25954	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility.
25955	
25956	
25957	
25958	
25959	VAV2 is the second member of the VAV oncogene family. Unlike VAV1, which is expressed exclusively in hematopoietic cells, VAV2 transcripts were found in most tissues.
25960	The protein encoded by this gene is a calcyclin binding protein. It may be involved in calcium-dependent ubiquitination and subsequent proteosomal degradation of target proteins. It probably serves as a molecular bridge in ubiquitin E3 complexes and participates in the ubiquitin-mediated degradation of beta-catenin. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
25961	
25962	
25963	
25964	The poly-Ig receptor is expressed on several glandular epithelia including those of liver and breast. It mediates transcellular transport of polymeric immunoglobulin molecules. It is a member of the immunoglobulin superfamily (Hood et al., 1985). The receptor has 5 units with homology to the variable (V) units of immunoglobulins and a transmembrane region, which also has some homology to certain immunoglobulin variable regions.[supplied by OMIM]
25965	
25966	
25967	
25968	
25969	HLA-DRB1 belongs to the HLA class II beta chain paralogues. The class II molecule is a heterodimer consisting of an alpha (DRA) and a beta chain (DRB), both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa. It is encoded by 6 exons, exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail. Within the DR molecule the beta chain contains all the polymorphisms specifying the peptide binding specificities. Hundreds of DRB1 alleles have been described and typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. DRB1 is expressed at a level five times higher than its paralogues DRB3, DRB4 and DRB5. DRB1 is present in all individuals. Allelic variants of DRB1 are linked with either none or one of the genes DRB3, DRB4 and DRB5. There are 4 related pseudogenes: DRB2, DRB6, DRB7, DRB8 and DRB9.
25970	Extracellular signals are transduced into cells through mitogen-activated protein kinases. The structural organization of these kinases into specific signaling domains is facilitated by scaffolding proteins involved in closely tethering different kinases so that successive phosphorylation events can occur. The protein encoded by this gene is a scaffolding protein that brings together mitogen-activated protein kinases and their transcription factor targets for the activation of specific signaling pathways. This gene which is abundantly expressed in testicular haploid germ cells encodes a protein that is recognized by sperm-agglutinating antibodies and implicated in infertility. Alternatively spliced variants that encode different protein isoforms have been described but the full-length sequences of only two have been determined.
25971	
25972	
25973	
25974	
25975	
25976	
25977	
25978	
25979	
25980	
25981	
25982	
25983	
25984	
25985	
25986	
25987	
25988	
25989	
25990	
25991	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and a single intracellular catalytic domain, and thus represents a receptor-type PTP. The catalytic domain of this PTP is most closely related to PTPRN/IA-2beta. This PTP and PTPRN are both found to be major autoantigens associated with insulin-dependent diabetes mellitus. Three alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported.
25992	
25993	
25994	
25995	
25996	
25997	The protein encoded by this gene removes 5@#$%& overhanging flaps in DNA repair and processes the 5@#$%& ends of Okazaki fragments in lagging strand DNA synthesis. Direct physical interaction between this protein and AP endonuclease 1 during long-patch base excision repair provides coordinated loading of the proteins onto the substrate, thus passing the substrate from one enzyme to another. The protein is a member of the XPG/RAD2 endonuclease family and is one of ten proteins essential for cell-free DNA replication. DNA secondary structure can inhibit flap processing at certain trinucleotide repeats in a length-dependent manner by concealing the 5@#$%& end of the flap that is necessary for both binding and cleavage by the protein encoded by this gene. Therefore, secondary structure can deter the protective function of this protein, leading to site-specific trinucleotide expansions.
25998	
25999	
26000	
26001	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. Oligomers of the Sec61 complex form a transmembrane channel where proteins are translocated across and integrated into the ER membrane. This complex consists of three membrane proteins- alpha, beta, and gamma. This gene encodes the beta-subunit protein. The Sec61 subunits are also observed in the post-ER compartment, suggesting that these proteins can escape the ER and recycle back. There is evidence for multiple polyadenylated sites for this transcript.
26002	
26003	
26004	The protein encoded by this gene is part of a complex termed negative elongation factor (NELF) which represses RNA polymerase II transcript elongation. This protein bears similarity to nuclear RNA-binding proteins; however, it has not been demonstrated that this protein binds RNA. The protein contains a tract of alternating basic and acidic residues, largely arginine (R) and aspartic acid (D). The gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6.
26005	
26006	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a cytoplasmic ribosomal protein that is a component of the 60S subunit. The protein belongs to the L22E family of ribosomal proteins. Its initiating methionine residue is post-translationally removed. The protein can bind specifically to Epstein-Barr virus-encoded RNAs (EBERs) 1 and 2. The mouse protein has been shown to be capable of binding to heparin. Transcript variants utilizing alternative polyA signals exist. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome. It was previously thought that this gene mapped to 3q26 and that it was fused to the acute myeloid leukemia 1 (AML1) gene located at 21q22 in some therapy-related myelodysplastic syndrome patients with 3;21 translocations; however, these fusions actually involve a ribosomal protein L22 pseudogene located at 3q26, and this gene actually maps to 1p36.3-p36.2.
26007	
26008	
26009	
26010	
26011	
26012	
26013	
26014	The protein encoded by this gene is a member of the IRS1-like multisubstrate docking protein family. The encoded protein is an important mediator of branching tubulogenesis and plays a central role in cellular growth response, transformation and apoptosis. Two transcript variants encoding different isoforms have been found for this gene.
26015	
26016	The protein encoded by this gene is a synaptotagmin-like protein (SLP) that belongs to a C2 domain-containing protein family. The SLP homology domain (SHD) of this protein has been shown to specifically bind the GTP-bound form of Ras-related protein Rab-27A (RAB27A), which suggests a role in vesicle trafficking. Multiple alternatively spiced transcript variants encoding distinct isoforms have been observed.
26017	
26018	
26019	This gene encodes a protein with a proline-rich domain. This gene is located in a region of chromosome 22 reported to contain a tumor suppressor gene that may be involved in breast and colorectal tumorigenesis. Rare read-through transcripts, containing exons from the ARHGAP8 gene which is located immediately downstream, led to the original description of PRR5 and ARHGAP8 as a single gene. Alternative splicing and the use of alternative promoters results in transcripts encoding different isoforms.
26020	
26021	
26022	
26023	
26024	This gene encodes a member of the sirtuin family of proteins, homologs to the yeast Sir2 protein. Members of the sirtuin family are characterized by a sirtuin core domain and grouped into four classes. The functions of human sirtuins have not yet been determined; however, yeast sirtuin proteins are known to regulate epigenetic gene silencing and suppress recombination of rDNA. Studies suggest that the human sirtuins may function as intracellular regulatory proteins with mono-ADP-ribosyltransferase activity. The protein encoded by this gene is included in class I of the sirtuin family.
26025	
26026	
26027	
26028	
26029	
26030	
26031	
26032	
26033	
26034	
26035	
26036	
26037	
26038	
26039	
26040	
26041	
26042	
26043	
26044	
26045	
26046	
26047	
26048	This gene encodes a protein, which has high sequence similarity to rat, xenopus and zebrafish proteins. The protein function is unknown.
26049	Prohibitin is an evolutionarily conserved gene that is ubiquitously expressed.  It is thought to be a negative regulator of cell proliferation and may be a tumor suppressor.  Mutations in PHB have been linked to sporadic breast cancer.  Prohibitin is expressed as two transcripts with varying lengths of 3@#$%& untranslated region.  The longer transcript is present at higher levels in proliferating tissues and cells, suggesting that this longer 3@#$%& untranslated region may function as a trans-acting regulatory RNA.
26050	
26051	
26052	
26053	
26054	
26055	
26056	
26057	
26058	
26059	
26060	
26061	The protein encoded by this gene is a member of serine/threonine protein kinase family. This kinase contains a dual leucine-zipper motif, and has been shown to form dimers/oligomers through its leucine-zipper motif. This kinase can phosphorylate and activate MAPK8/JNK, MAP2K7/MKK7, which suggests a role in the JNK signaling pathway.
26062	This gene encodes fibronectin, a glycoprotein present in a soluble dimeric form in plasma, and in a dimeric or multimeric form at the cell surface and in extracellular matrix. Fibronectin is involved in cell adhesion and migration processes including embryogenesis, wound healing, blood coagulation, host defense, and metastasis. The gene has three regions subject to alternative splicing, with the potential to produce 20 different transcript variants. However, the full-length nature of some variants has not been determined.
26063	
26064	
26065	Coagulation factor II is proteolytically cleaved to form thrombin in the first step of the coagulation cascade which ultimately results in the stemming of blood loss.  F2 also plays a role in maintaining vascular integrity during development and postnatal life.  Mutations in F2 leads to various forms of thrombosis and dysprothrombinemia.
26066	
26067	
26068	
26069	
26070	
26071	
26072	
26073	
26074	
26075	
26076	
26077	
26078	
26079	
26080	
26081	
26082	
26083	
26084	
26085	
26086	
26087	
26088	
26089	
26090	
26091	
26092	The protein encoded by this gene is a member of the bone morphogenetic protein (BMP) family and the TGF-beta superfamily. This group of proteins is characterized by a polybasic proteolytic processing site which is cleaved to produce a mature protein containing seven conserved cysteine residues. The members of this family are regulators of cell growth and differentiation in both embryonic and adult tissues. Mutations in this gene are associated with acromesomelic dysplasia, Hunter-Thompson type; brachydactyly, type C; and chondrodysplasia, Grebe type. These associations confirm that the gene product plays a role in skeletal development.
26093	
26094	
26095	
26096	HLA-A belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-A alleles have been described.
26097	
26098	
26099	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation. This protein contains 2 bromodomains and multiple WD repeats, and the function of this protein is not known. This gene is located within the Down syndrome region-2 on chromosome 21. Alternative splicing of this gene generates 3 transcript variants diverging at the 3@#$%& ends.
26100	
26101	
26102	
26103	
26104	
26105	
26106	This gene is the human homolog of mouse BMP-2-inducible kinase. Bone morphogenic proteins (BMPs) play a key role in skeletal development and patterning. Expression of the mouse gene is increased during BMP-2 induced differentiation and the gene product is a putative serine/threonine protein kinase containing a nuclear localization signal. Therefore, the protein encoded by this human homolog is thought to be a protein kinase with a putative regulatory role in attenuating the program of osteoblast differentiation. Two transcript variants encoding different isoforms have been found for this gene.
26107	
26108	
26109	This gene encodes the D5 subtype of the dopamine receptor. The D5 subtype is a G-protein coupled receptor which stimulates adenylyl cyclase. This receptor is expressed in neurons in the limbic regions of the brain. It has a 10-fold higher affinity for dopamine than the D1 subtype. Pseudogenes related to this gene reside on chromosomes 1 and 2.
26110	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
26111	
26112	
26113	
26114	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3.
26115	
26116	
26117	
26118	
26119	
26120	
26121	
26122	
26123	This gene encodes a member of the eukaryote-specific amino acid/auxin permease (AAAP) 1 transporter family. The encoded protein functions as a proton-dependent, small amino acid transporter. This gene is clustered with related family members on chromosome 5q33.1.
26124	
26125	
26126	
26127	
26128	
26129	
26130	
26131	
26132	
26133	
26134	
26135	
26136	
26137	
26138	
26139	
26140	Tight junctions represent one mode of cell-to-cell adhesion in epithelial or endothelial cell sheets, forming continuous seals around cells and serving as a physical barrier to prevent solutes and water from passing freely through the paracellular space. The protein encoded by this immunoglobulin superfamily gene member is localized in the tight junctions between high endothelial cells. Unlike other proteins in this family, the this protein is unable to adhere to leukocyte cell lines and only forms weak homotypic interactions. The encoded protein is a member of the junctional adhesion molecule protein family and acts as a receptor for another member of this family.
26141	
26142	
26143	
26144	
26145	
26146	
26147	
26148	
26149	
26150	
26151	
26152	
26153	
26154	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order      structures. This gene is intronless and encodes a member of the histone H2A family.
26155	
26156	The RCOR gene encodes a functional corepressor required for regulation of neural-specific gene expression.[supplied by OMIM]
26157	
26158	
26159	This zinc finger gene has been identified at the breakpoints of a recurrent chromosomal translocation reported in endometrial stromal sarcoma. Recombination of these breakpoints results in the fusion of this gene and JAZF1. The protein encoded by this gene contains a zinc finger domain in the C terminus of the coding region. The specific function of this gene has not yet been determined.
26160	
26161	
26162	
26163	
26164	
26165	
26166	
26167	
26168	
26169	
26170	
26171	
26172	
26173	
26174	
26175	
26176	
26177	
26178	
26179	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
26180	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
26181	
26182	
26183	
26184	This gene encodes a member of the GTP-binding translation elongation factor family. This protein is an essential factor for protein synthesis. It promotes the GTP-dependent translocation of the nascent protein chain from the A-site to the P-site of the ribosome. This protein is completely inactivated by EF-2 kinase phosporylation.
26185	
26186	The protein encoded by this gene is a receptor for activated C-kinase (RACK) protein. The encoded protein has been shown to bind in vitro to activated protein kinase C beta I. In addition, this protein is a cutaneous T-cell lymphoma-associated antigen. Finally, the protein contains a bromodomain and two zinc fingers, and is thought to be a transcriptional regulator. Multiple transcript variants encoding several different isoforms have been found for this gene.
26187	
26188	
26189	
26190	
26191	
26192	
26193	
26194	Apolipoprotein A-I is a relatively abundant plasma protein and is the major apoprotein of HDL. It is involved in the formation of most cholesteryl esters in plasma and also promotes efflux of cholesterol from cells. The protein encoded by this gene is a secreted high density lipoprotein which binds to apolipoprotein A-I. This apolipoprotein L family member may play a role in lipid exchange and transport throughout the body, as well as in reverse cholesterol transport from peripheral cells to the liver. At least three transcript variants encoding two different isoforms exist for this gene.
26195	
26196	
26197	
26198	
26199	
26200	
26201	
26202	
26203	
26204	
26205	
26206	
26207	
26208	
26209	
26210	
26211	
26212	
26213	
26214	
26215	
26216	
26217	
26218	
26219	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This phosphoprotein localizes to nuclear bodies where it functions as a transcription factor and tumor suppressor. Its expression is cell-cycle related and it regulates the p53 response to oncogenic signals. The gene is often involved in the translocation with the retinoic acid receptor alpha gene associated with acute promyelocytic leukemia (APL). Extensive alternative splicing of this gene results in several variations of the protein@#$%&s central and C-terminal regions; all variants encode the same N-terminus. Fourteen transcript variants for this gene have been described; however, the full length nature of three of the variants has not been determined.
26220	
26221	
26222	
26223	
26224	
26225	
26226	The protein encoded by this gene contains a RING zinc finger, a motif known to be involved in protein-protein interactions. The specific function of this gene has not yet been determined. Five alternatively spliced transcript variants encoding three distinct isoforms have been reported.
26227	
26228	
26229	
26230	This gene is a member of the paired box (PAX) family of transcription factors.  Members of this gene family typically contain a paired box domain, an octapeptide, and a paired-type homeodomain.  These genes play critical roles during fetal development and cancer growth.  The specific function of the paired box gene 8 is unknown but it may involve kidney cell differentiation, thyroid development, or thyroid dysgenesis.   Alternative splicing in this gene by inclusion or exclusion of exons 7 and/or 8 has produced several known products but the biological significance of the variants is unknown.  Several other splice variants have been proposed but the full nature of these products has not been described.
26231	
26232	
26233	The protein encoded by this gene shares similarity with BRE1 of S. cerevisiae. Yeast BRE1 is a ubiquitin ligase required for the ubiquitination of histone H2B and the methylation of histone H3.
26234	
26235	
26236	
26237	
26238	
26239	The protein encoded by this gene is a member of the chromogranin/secretogranin family of neuroendocrine secretory proteins. Studies in rodents suggest that the full length protein, secretogranin II, is involved in the packaging or sorting of peptide hormones and neuropeptides into secretory vesicles. The full length protein is cleaved to produce the active peptide secretoneurin, which exerts chemotaxic effects on specific cell types, and EM66, whose function is unknown.
26240	
26241	
26242	
26243	The RABL2B protein is a member of the RAB gene family which belongs to the RAS GTPase superfamily. RABL2B is located within a subtelomeric region of 22q13.3. Multiple alternatively spliced transcript variants have been found, but the biological validity of some variants has not been determined.
26244	
26245	
26246	This gene encodes a serine protease. The protein has been localized in the endoplasmic reticulum and interacts with an alternatively spliced form of mitogen-activated protein kinase 14. The protein has also been localized to the mitochondria with release to the cytosol following apoptotic stimulus. The protein is thought to induce apoptosis by binding the apoptosis inhibitory protein baculoviral IAP repeat-containing 4. Nuclear localization of this protein has also been observed. Alternate splicing of this gene results in two transcript variants encoding different isoforms. Additional transcript variants have been described, but their full-length sequences have not been determined.
26247	
26248	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S5P family. Pseudogenes corresponding to this gene are found on chromosomes 4q, 5q, and 18q.
26249	
26250	
26251	Cytochrome c oxidase (COX), the terminal component of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. This component is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in electron transfer, and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes a protein similar to polypeptides 1 and 2 of subunit VIIa in the C-terminal region, and also highly similar to the mouse Sig81 protein sequence. This gene is expressed in all tissues, and upregulated in a breast cancer cell line after estrogen treatment. It is possible that this gene represents a regulatory subunit of COX and mediates the higher level of energy production in target cells by estrogen.
26252	This gene encodes a protein with src homology domain 3 (SH3) patterns. Mutations in this gene cause familial juvenile nephronophthisis.
26253	Members of the RAS superfamily of GTP-binding proteins, which includes MRAS, are membrane-anchored, intracellular signal transducers responsible for a variety of normal cellular functions. They are oncogenically activated in a significant fraction of tumors.[supplied by OMIM]
26254	
26255	
26256	
26257	
26258	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to class II of the histone deacetylase/acuc/apha family. It possesses histone deacetylase activity and represses transcription when tethered to a promoter. This protein does not bind DNA directly, but through transcription factors MEF2C and MEF2D. It seems to interact in a multiprotein complex with RbAp48 and HDAC3.
26259	
26260	
26261	
26262	
26263	The ITGB3 protein product is the integrin beta chain beta 3. Integrins are integral cell-surface proteins composed of an alpha chain and a beta chain. A given chain may combine with multiple partners resulting in different integrins. Integrin beta 3 is found along with the alpha IIb chain in platelets. Integrins are known to participate in cell adhesion as well as cell-surface mediated signalling.
26264	This gene encodes a type 1 transmembrane protein of unknown function. It is one of several genes located in the imprinted gene domain of 11p15.5, an important tumor-suppressor gene region. Alterations in this region have been associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocrotical carcinoma, and lung, ovarian, and breast cancer. This gene may play a role in malignancies and disease that involve this region, as well as early hematopoiesis, by mediating attachment to stromal cells. This gene is oriented in a head-to-tail configuration with the ribonucleotide reductase 1 gene (RRM1), with the 3@#$%& end of this gene situated 1.6 kb from the 5@#$%& end of the RRM1 gene.
26265	
26266	
26267	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
26268	
26269	
26270	Deoxycytidine kinase (DCK) is required for the phosphorylation of several deoxyribonucleosides and their nucleoside analogs.  Deficiency of DCK is associated with resistance to antiviral and anticancer chemotherapeutic agents.  Conversely, increased deoxycytidine kinase activity is associated with increased activation of these compounds to cytotoxic nucleoside triphosphate derivatives.  DCK is clinically important because of its relationship to drug resistance and sensitivity.
26271	
26272	
26273	
26274	
26275	
26276	
26277	
26278	
26279	
26280	
26281	
26282	
26283	
26284	
26285	
26286	
26287	
26288	This gene encodes a loosely associated peripheral membrane protein related to the LanC family of bacterial membrane-associated proteins involved in the biosynthesis of antimicrobial peptides. This protein may play a role as a peptide-modifying enzyme component in eukaryotic cells. Previously considered a member of the G-protein-coupled receptor superfamily, this protein is now in the LanC family.
26289	
26290	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The function of this protein has not been identified. This gene lies within the major histocompatibility complex class I region on chromosome 6. Alternate splicing results in two transcript variants encoding different isoforms.
26291	
26292	The protein encoded by this gene is an apolipoprotein and member of the lipocalin protein family. It is found associated with high density lipoproteins and to a lesser extent with low density lipoproteins and triglyceride-rich lipoproteins. The encoded protein is secreted through the plasma membrane but remains membrane-bound, where it is involved in lipid transport. Two transcript variants encoding two different isoforms have been found for this gene, but only one of them has been fully characterized.
26293	
26294	
26295	
26296	
26297	
26298	
26299	This gene encodes one of the two alpha chains of type XI collagen, a minor fibrillar collagen. Type XI collagen is a heterotrimer but the third alpha chain is a post-translationally modified alpha 1 type II chain. Mutations in this gene are associated with type II Stickler syndrome and with Marshall syndrome. Three transcript variants encoding different isoforms have been identified for this gene.
26300	
26301	
26302	
26303	
26304	
26305	
26306	
26307	
26308	
26309	
26310	
26311	
26312	
26313	
26314	
26315	The protein encoded by this gene is a member of the TNF receptor (TNFR) associated factor (TRAF) protein family. TRAF proteins associate with, and mediate the signal transduction from various receptors of the TNFR superfamily. This protein and TRAF2 form a heterodimeric complex, which is required for TNF-alpha-mediated activation of MAPK8/JNK and NF-kappaB. The protein complex formed by this protein and TRAF2 also interacts with inhibitor-of-apoptosis proteins (IAPs), and thus mediates the anti-apoptotic signals from TNF receptors. The expression of this protein can be induced by Epstein-Barr virus (EBV). EBV infection membrane protein 1 (LMP1) is found to interact with this and other TRAF proteins; this interaction is thought to link LMP1-mediated B lymphocyte transformation to the signal transduction from TNFR family receptors.
26316	
26317	
26318	
26319	
26320	
26321	The protein encoded by this gene is a protein serine/threonine phosphatase of the PP2C family. This protein can interact with integrin-linked kinase (ILK/ILK1), a regulator of integrin mediated signaling, and regulate the kinase activity of ILK. Through the interaction with ILK, this protein may selectively affect the signaling process of ILK-mediated glycogen synthase kinase 3 beta (GSK3beta), and thus participate in Wnt signaling pathway. Alternatively spliced transcript variants encoding distinct isoforms have been described.
26322	This gene is located in the surfeit gene cluster, which is comprised of very tightly linked housekeeping genes that do not share sequence similarity. The encoded protein is a conserved integral membrane protein containing multiple putative transmembrane regions. In eukaryotic cells, protein transport between the endoplasmic reticulum and Golgi compartments is mediated in part by non-clathrin-coated vesicular coat proteins (COPs). The specific function of this protein has not been determined but its yeast homolog is directly required for packaging glycosylated pro-alpha-factor into COPII vesicles. This gene uses multiple polyadenylation sites, resulting in transcript length variation. The existence of alternatively spliced transcript variants has been suggested, but their validity has not been determined.
26323	
26324	
26325	This protein encodes an enzyme that synthesizes selenophosphate from selenide and ATP. Selenophosphate is the selenium donor used to synthesize selenocysteine, which is co-translationally incorporated into selenoproteins at in-frame UGA codons.
26326	
26327	
26328	Adducins are heteromeric proteins composed of different subunits referred to as adducin alpha, beta and gamma. The three subunits are encoded by distinct genes and belong to a family of membrane skeletal proteins involved in the assembly of spectrin-actin network in erythrocytes and at sites of cell-cell contact in epithelial tissues. While adducins alpha and gamma are ubiquitously expressed, the expression of adducin beta is restricted to brain and hematopoietic tissues. Adducin, originally purified from human erythrocytes, was found to be a heterodimer of adducins alpha and beta. Polymorphisms resulting in amino acid substitutions in these two subunits have been associated with the regulation of blood pressure in an animal model of hypertension. Heterodimers consisting of alpha and gamma subunits have also been described. Structurally, each subunit is comprised of two distinct domains. The amino-terminal region is protease resistant and globular in shape, while the carboxy-terminal region is protease sensitive. The latter contains multiple phosphorylation sites for protein kinase C, the binding site for calmodulin, and is required for association with spectrin and actin. Alternatively spliced adducin gamma transcripts encoding different isoforms have been described. The functions of the different isoforms are not known.
26329	
26330	
26331	
26332	
26333	
26334	
26335	
26336	
26337	Alkylglycerone phosphate synthase (AGPS) belongs to a group of peroxisomal enzymes that catalyze the first step in ether phospholipid biosynthesis which involves the acylation of dihydroxyacetonephosphate followed by the introduction of the ether linkage by alkyl-DHAP synthase.  Human AGPS shares a 92% nucleotide and amino acid sequence similarity with the guinea pig sequences, and the human sequence was identified by virtue of this homology.  Both proteins contain indentical peroxisomal targeting signal 2 sequences in their N-terminal presequences. Deficiency of AGPS in peroxisomes has been found in patients with the following peroxisomal disorders: Zellweger syndrome, rhizomelic chondrodysplasia punctate and adrenoleukodystrophy.
26338	
26339	
26340	
26341	
26342	
26343	
26344	
26345	
26346	
26347	
26348	
26349	
26350	
26351	
26352	
26353	The protein encoded by this gene is a member of the DEAD protein family of ATP-dependent RNA helicases. Members of this family are involved in a number of cellular functions including initiation of translation, RNA splicing, and ribosome assembly. A cluster of genes, BAT1-BAT5, has been localized in the vicinity of the genes for TNF alpha and TNF beta. These genes are all within the human major histocompatibility complex class III region. This protein is a negative regulator of inflammation. It is also thought to be a translation initiation factor. This gene is a strong candidate gene for rheumatoid arthritis. There are multiple alternatively spliced transcript variants known for this gene but only two have been fully described. Both of these variants encode the same isoform. This gene has been found to have multiple polyadenylation sites.
26354	
26355	The protein encoded by this gene contains a SH2 domain and a SOCS BOX domain. The protein thus belongs to the suppressor of cytokine signaling (SOCS), also known as STAT-induced STAT inhibitor (SSI), protein family. SOCS family members are known to be cytokine-inducible negative regulators of cytokine signaling. Two alternatively spliced transcript variants encoding the same protein have been found for this gene.
26356	
26357	
26358	
26359	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two RRM domains that bind to RNAs. Two alternatively spliced transcript variants have been described for this gene. The variants encode the same protein but have different 3@#$%& UTRs. This protein is similar to its family member HNRPD.
26360	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit. Proteolytic processing is required to generate a mature subunit. Expression of this gene is induced by gamma interferon, and this gene product replaces catalytic subunit 2 (proteasome beta 7 subunit) in the immunoproteasome.
26361	
26362	
26363	
26364	
26365	
26366	
26367	
26368	
26369	
26370	
26371	
26372	
26373	The Golgi apparatus, which participates in glycosylation and transport of proteins and lipids in the secretory pathway, consists of a series of stacked cisternae (flattened membrane sacs). Interactions between the Golgi and microtubules are thought to be important for the reorganization of the Golgi after it fragments during mitosis. The golgins are a family of proteins, of which the protein encoded by this gene is a member, that are localized to the Golgi. This encoded protein has been postulated to play roles in the stacking of Golgi cisternae and in vesicular transport. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of these variants has not been determined.
26374	This gene encodes a transcription factor that contains both basic helix-loop-helix and leucine zipper structural features. It regulates the differentiation and development of melanocytes retinal pigment epithelium and is also responsible for pigment cell-specific transcription of the melanogenesis enzyme genes. Heterozygous mutations in the this gene cause auditory-pigmentary syndromes, such as Waardenburg syndrome type 2 and Tietz syndrome. Alternatively spliced transcript variants encoding different isoforms have been identified.
26375	
26376	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene contains introns and its mRNA is poyadenylated, unlike most histone genes. The protein encoded is a member of the histone H3 family.
26377	
26378	
26379	A large family of ATPases has been described, whose key feature is that they share a conserved region of about 220 amino acids that contains an ATP-binding site. The proteins that belong to this family either contain one or two AAA (ATPases Associated with diverse cellular Activities) domains. AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes. The protein encoded by this gene contains two AAA domains, as well as a bromodomain.
26380	
26381	
26382	
26383	
26384	The Golgi apparatus, which participates in glycosylation and transport of proteins and lipids in the secretory pathway, consists of a series of stacked, flattened membrane sacs referred to as cisternae. Interactions between the Golgi and microtubules are thought to be important for the reorganization of the Golgi after it fragments during mitosis. The golgins constitute a family of proteins which are localized to the Golgi. This gene encodes a golgin which structurally resembles its family member GOLGA2, suggesting that they may share a similar function. Several alternatively spliced transcript variants of this gene have been described, but the full-length natures of only three have been determined.
26385	
26386	
26387	
26388	
26389	
26390	
26391	
26392	
26393	
26394	
26395	
26396	
26397	
26398	This gene belongs to the polypeptide N-acetylgalactosaminyltransferase (pp-GalNAc-T) gene family. Polypeptide GalNAc transferases initiate the synthesis of mucin-type oligosaccharides by transferring GalNAc from UDP-GalNAc to the hydroxyl group of either a serine or threonine residue on the polypeptide acceptor. Following expression in insect cells, recombinant GalNAc transferase 10 showed significant GalNAcT activity toward mucin-derived peptides, and it utilized both nonglycosylated and glycosylated peptide substrates. Two transcript variants encoding distinct isoforms have been identified for this gene.
26399	
26400	
26401	
26402	
26403	
26404	
26405	
26406	
26407	The ASPM gene is the human ortholog of the Drosophila melanogaster @#$%&abnormal spindle@#$%& gene (asp), which is essential for normal mitotic spindle function in embryonic neuroblasts (Bond et al., 2002). The mouse gene Aspm is expressed specifically in the primary sites of prenatal cerebral cortical neurogenesis.[supplied by OMIM]
26408	
26409	
26410	
26411	
26412	
26413	
26414	
26415	
26416	
26417	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L23P family of ribosomal proteins. It is located in the cytoplasm. The protein may be one of the target molecules involved in mediating growth inhibition by interferon. In yeast, the corresponding protein binds to a specific site on the 26S rRNA. This gene is co-transcribed with the U42A, U42B, U101A, and U101B small nucleolar RNA genes, which are located in its third, first, second, and fourth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
26418	
26419	
26420	
26421	
26422	
26423	
26424	
26425	The WNT gene family consists of structurally related genes which encode secreted signaling proteins. These proteins have been implicated in oncogenesis and in several developmental processes, including regulation of cell fate and patterning during embryogenesis. This gene is a member of the WNT gene family. It encodes a protein which shows 99% and 91% amino acid identity to the mouse and Xenopus Wnt7A proteins, respectively. Among members of the human WNT family, this protein is most similar to WNT7A protein (77.1% total amino acid identity). This gene may play important roles in the development and progression of gastric cancer, esophageal cancer, and pancreatic cancer.
26426	
26427	
26428	
26429	The protein encoded by this gene belongs to the calcium channel beta subunit family. It plays an important role in the calcium channel by modulating G protein inhibition, increasing peak calcium current, controlling the alpha-1 subunit membrane targeting and shifting the voltage dependence of activation and inactivation. Alternative splicing occurs at this locus and three transcript variants encoding three distinct isoforms have been identified.
26430	
26431	This intronless gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins, such as syndecan binding protein (syntenin). The protein may function in the apoptosis pathway leading to cell death as well as to tumorigenesis and may mediate downstream effects of parathyroid hormone (PTH) and PTH-related protein (PTHrP) in bone development. The solution structure has been resolved for the HMG-box of a similar mouse protein.
26432	This gene encodes a cGMP-binding, cGMP-specific phosphodiesterase, a member of the cyclic nucleotide phosphodiesterase family. This phosphodiesterase specifically hydrolyzes cGMP to 5@#$%&-GMP. It is involved in the regulation of intracellular concentrations of cyclic nucleotides and is important for smooth muscle relaxation in the cardiovascular system. Alternative splicing of this gene results in three transcript variants encoding distinct isoforms.
26433	
26434	
26435	
26436	
26437	
26438	
26439	
26440	HLA-DOB belongs to the HLA class II beta chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DOA) and a beta chain (DOB), both anchored in the membrane. It is located in intracellular vesicles. DO suppresses peptide loading of MHC class II molecules by inhibiting HLA-DM. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa and its gene contains 6 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail.
26441	
26442	
26443	
26444	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. This gene encodes a protein which contains 6 extracellular cadherin domains, a transmembrane domain and a cytoplasmic tail differing from those of the classical cadherins. Although its specific function is undetermined, the cadherin-related neuronal receptor is thought to play a role in the establishment and function of specific cell-cell connections in the brain.
26445	
26446	
26447	The muscarinic cholinergic receptors belong to a larger family of G protein-coupled receptors. The functional diversity of these receptors is defined by the binding of acetylcholine to these receptors and includes cellular responses such as adenylate cyclase inhibition, phosphoinositide degeneration, and potassium channel mediation. Muscarinic receptors influence many effects of acetylcholine in the central and peripheral nervous system. The muscarinic cholinergic receptor 2 is involved in mediation of bradycardia and a decrease in cardiac contractility. Multiple alternatively spliced transcript variants have been described for this gene.
26448	
26449	
26450	
26451	
26452	
26453	
26454	
26455	This gene encodes a ubiquitously expressed nuclear protein which belongs to a highly conserved subfamily of WD-repeat proteins. It is present in protein complexes involved in histone acetylation and chromatin assembly. It is part of the Mi-2 complex which has been implicated in chromatin remodeling and transcriptional repression associated with histone deacetylation. This encoded protein is also part of co-repressor complexes, which is an integral component of transcriptional silencing. It is found among several cellular proteins that bind directly to retinoblastoma protein to regulate cell proliferation. This protein also seems to be involved in transcriptional repression of E2F-responsive genes.
26456	
26457	
26458	
26459	
26460	
26461	
26462	The protein encoded by this gene belongs to the plakin family of cytoskeletal linker proteins. This protein family forms bridges between different cytoskeletal elements through specialized modular domains. The encoded protein is one of the largest size proteins identified in human cytoskeletal proteins. It has functional actin and microtubule binding domains, and it appears to stabilize actin at sites where microtubules and microfilaments meet. It may function in microtubule dynamics to facilitate actin-microtubule interactions at the cell periphery and to couple the microtubule network to cellular junctions. Alternatively spliced transcript variants encoding distinct isoforms have been described.
26463	
26464	HLA-A belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-A alleles have been described.
26465	
26466	
26467	
26468	
26469	
26470	
26471	
26472	
26473	
26474	
26475	Plasma prekallikrein is a glycoprotein that participates in the surface-dependent activation of blood coagulation, fibrinolysis, kinin generation and inflammation.  It is synthesized in the liver and secreted into the blood as a single polypeptide chain.  Plasma prekallikrein is converted to plasma kallikrein by factor XIIa by the cleavage of an internal Arg-Ile bond.  Plasma kallikrein therefore is composed of a heavy chain and a light chain held together by a disulphide bond.  The heavy chain originates from the amino-terminal end of the zymogen and contains 4 tandem repeats of 90 or 91 amino acids.  Each repeat harbors a novel structure called the apple domain.  The heavy chain is required for the surface-dependent pro-coagulant activity of plasma kallikrein.  The light chain contains the active site or catalytic domain of the enzyme and is homologous to the trypsin family of serine proteases. Plasma prekallikrein deficiency causes a prolonged activated partial thromboplastin time in patients.
26476	
26477	
26478	
26479	
26480	
26481	
26482	
26483	
26484	
26485	
26486	The protein encoded by this gene catalyzes the fourth step of porphyrin biosynthesis in the heme biosynthetic pathway. Defects in this gene cause congenital erythropoietic porphyria (Gunther@#$%&s disease).
26487	
26488	The product of this gene belongs to the plakin or cytolinker family of genes. The encoded protein is involved in cytoskeleton-membrane attachment in epithelial cells, muscle, and other tissues. It also may regulate changes of lymphocyte cytoarchitecture during polarization and extravasation. This gene has alternative splicing of different 5@#$%& exons to the first constant exon, which generates transcript variants that encode distinct isoforms.
26489	
26490	
26491	
26492	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the ATPase subunits, a member of the triple-A family of ATPases which have a chaperone-like activity. This subunit has been shown to interact with an orphan member of the nuclear hormone receptor superfamily highly expressed in liver, and with gankyrin, a liver oncoprotein. Two transcript variants encoding different isoforms have been identified.
26493	
26494	This gene encodes a member of a family of candidate taste receptors that are members of the G protein-coupled receptor superfamily and that are specifically expressed by taste receptor cells of the tongue and palate epithelia. Each of these apparently intronless taste receptor genes encodes a 7-transmembrane receptor protein, functioning as a bitter taste receptor. This gene is clustered with another 3 candidate taste receptor genes in chromosome 7 and is genetically linked to loci that influence bitter perception.
26495	The protein encoded by this gene is a homeodomain protein that lacks certain conserved residues required for DNA binding. It was reported that choriocarcinoma cell lines and tissues failed to express this gene, which suggested the possible involvement of this gene in malignant conversion of placental trophoblasts. Studies in mice suggested that this protein may interact with serum response factor (SRF) and modulate SRF-dependent cardiac-specific gene expression and cardiac development. Multiple alternatively spliced transcript variants encoding the same protein have been observed, the full-length natures of only some have been determined.
26496	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This enzyme is a hydroxylase that metabolizes arachidonic acid, docosahexaenoic acid, and other long chain fatty acids.
26497	
26498	
26499	
26500	
26501	
26502	The protein encoded by this gene shares strong similarity with Saccharomyces cerevisiae Cdc23, a protein essential for cell cycle progression through the G2/M transition. This protein is a component of anaphase-promoting complex (APC), which is composed of eight protein subunits and highly conserved in eucaryotic cells. APC catalyzes the formation of cyclin B-ubiquitin conjugate that is responsible for the ubiquitin-mediated proteolysis of B-type cyclins. This protein and 3 other members of the APC complex contain the TPR (tetratricopeptide repeat), a protein domain important for protein-protein interaction.
26503	
26504	
26505	
26506	
26507	
26508	
26509	
26510	
26511	
26512	
26513	
26514	
26515	
26516	
26517	The protein encoded by this gene interacts with the transcription factor myocardin, a key regulator of smooth muscle cell differentiation. The encoded protein is predominantly nuclear and may help transduce signals from the cytoskeleton to the nucleus. This gene is involved in a specific translocation event that creates a fusion of this gene and the RNA-binding motif protein-15 gene. This translocation has been associated with acute megakaryocytic leukemia.
26518	
26519	
26520	
26521	
26522	
26523	
26524	
26525	
26526	In cardiac myocytes, Ca(2+) concentrations alternate between high levels during contraction and low levels during relaxation. The increase in Ca(2+) concentration during contraction is primarily due to release of Ca(2+) from intracellular stores. However, some Ca(2+) also enters the cell through the sarcolemma (plasma membrane). During relaxation, Ca(2+) is sequestered within the intracellular stores. To prevent overloading of intracellular stores, the Ca(2+) that entered across the sarcolemma must be extruded from the cell. The Na(+)-Ca(2+) exchanger is the primary mechanism by which the Ca(2+) is extruded from the cell during relaxation. In the heart, the exchanger may play a key role in digitalis action. The exchanger is the dominant mechanism in returning the cardiac myocyte to its resting state following excitation.[supplied by OMIM]
26527	
26528	This gene encodes glucose-6-phosphate dehydrogenase. This protein is a cytosolic enzyme encoded by a housekeeping X-linked gene whose main function is to produce NADPH, a key electron donor in the defense against oxidizing agents and in reductive biosynthetic reactions. G6PD is remarkable for its genetic diversity. Many variants of G6PD, mostly produced from missense mutations, have been described with wide ranging levels of enzyme activity and associated clinical symptoms. G6PD deficiency may cause neonatal jaundice, acute hemolysis, or severe chronic non-spherocytic hemolytic anemia.
26529	
26530	
26531	
26532	
26533	
26534	
26535	
26536	
26537	
26538	The protein encoded by this gene is a membrane glycosylphosphatidylinositol-anchored glycoprotein that tends to aggregate into rod-like structures. The encoded protein contains a highly unstable region of five tandem octapeptide repeats. This gene is found on chromosome 20, approximately 20 kbp upstream of a gene which encodes a biochemically and structurally similar protein to the one encoded by this gene. Mutations in the repeat region as well as elsewhere in this gene have been associated with Creutzfeldt-Jakob disease, fatal familial insomnia, Gerstmann-Straussler disease, Huntington disease-like 1, and kuru. Two transcript variants encoding the same protein have been found for this gene.
26539	
26540	
26541	
26542	
26543	
26544	
26545	
26546	The erythropoietin receptor is a member of the cytokine receptor family. Upon erythropoietin binding, the erythropoietin receptor activates Jak2 tyrosine kinase which activates different intracellular pathways including: Ras/MAP kinase, phosphatidylinositol 3-kinase and STAT transcription factors. The stimulated erythropoietin receptor appears to have a role in erythroid cell survival. Defects in the erythropoietin receptor may produce erythroleukemia and familial erythrocytosis.
26547	
26548	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-transferase that belongs to the mu class. The mu class of enzymes functions in the detoxification of electrophilic compounds, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress, by conjugation with glutathione. The genes encoding the mu class of enzymes are organized in a gene cluster on chromosome 1p13.3 and are known to be highly polymorphic. These genetic variations can change an individual@#$%&s susceptibility to carcinogens and toxins as well as affect the toxicity and efficacy of certain drugs. Mutations of this class mu gene have been linked with a slight increase in a number of cancers, likely due to exposure with environmental toxins.
26549	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role in cells. The protein encoded by this gene is one of the PKC family members. This protein kinase has been reported to be involved in many different cellular functions, such as B cell activation, apoptosis induction, endothelial cell proliferation, and intestinal sugar absorption. Studies in mice also suggest that this kinase may also regulate neuronal functions and correlate fear-induced conflict behavior after stress. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
26550	
26551	
26552	EMK (ELKL Motif Kinase) is a small family of ser/thr protein kinases involved in the control of cell polarity, microtubule stability and cancer. Several cDNA clones have been isolated that encoded two isoforms of the human ser/thr protein kinase EMK1. These isoforms were characterized by the presence of a 162-bp alternative exon that gave rise to two forms, one containing the exon and the other one lacking it. Both forms were found to be coexpressed in a number of selected cell lines and tissue samples. The human EMK1 was shown to be encoded by a single mRNA ubiquitously expressed.
26553	Branched-chain keto acid dehydrogenase is a multienzyme complex associated with the inner membrane of mitochondria, and functions in the catabolism of branched-chain amino acids. The complex consists of multiple copies of 3 components: branched-chain alpha-keto acid decarboxylase (E1), lipoamide acyltransferase (E2) and lipoamide dehydrogenase (E3). This gene encodes the E1 beta subunit, and mutations therein have been associated with maple syrup urine disease (MSUD), type 1B. Alternative splicing at this locus results in transcript variants with different 3@#$%& noncoding regions, but encoding the same isoform.
26554	Cytochrome b (-245) is composed of cytochrome b alpha (CYBA) and beta (CYBB) chain. It has been proposed as a primary component of the microbicidal oxidase system of phagocytes. CYBB deficiency is one of five described biochemical defects associated with chronic granulomatous disease (CGD). In this disorder, there is decreased activity of phagocyte NADPH oxidase; neutrophils are able to phagocytize bacteria but cannot kill them in the phagocytic vacuoles. The cause of the killing defect is an inability to increase the cell@#$%&s respiration and consequent failure to deliver activated oxygen into the phagocytic vacuole.
26555	
26556	
26557	
26558	
26559	
26560	
26561	
26562	Glutamate receptors are the predominant excitatory neurotransmitter receptors in the mammalian brain and are activated in a variety of normal neurophysiologic processes. These receptors are heteromeric protein complexes with multiple subunits, each possessing transmembrane regions, and all arranged to form a ligand-gated ion channel. The classification of glutamate receptors is based on their activation by different pharmacologic agonists. The GRIA1 belongs to a family of alpha-amino-3-hydroxy-5-methyl-4-isoxazole propionate (AMPA) receptors. Each of the members (GRIA1-4) include fip and flop isoforms generated by alternative RNA splicing resulting in a difference of 9 amino acids. The receptor subunits encoded by each isoform vary in their signal transduction properties. The isoform presented here is the flip isoform. AMPA-sensitive channels become impermeable to calcium by editing of mRNA in the coding region of the second transmembrane domain of glutamate receptor subunit. It involves a change of the base A in genomic DNA to the base G in mRNA. The GRIA2 subunit mRNA is completely edited in human brain.
26563	The N-myc oncogene has been implicated in the pathogenesis of a number of human tumors, including childhood neuroblastoma and adult small cell lung cancer. Complementary DNA clones derived from a transcription unit, N-cym, located on the opposite DNA strand to N-myc have been characterized. There is extensive overlap between the 5@#$%& ends of the two transcription units. The N-cym gene, which can encode a 109-amino acid protein, is expressed during fetal development, as well as in tumor cell lines containing amplified N-myc loci, where it is expressed at very high levels. Although other examples of overlapping, opposite-strand eukaryotic genes exist, N-myc and N-cym are unique in that they appear to be coregulated in tumor cell lines under basal growth conditions and in response to the differentiating agent retinoic acid. This coregulation suggests that their protein products may be functionally interrelated during normal development and oncogenesis.
26564	
26565	The product of this gene belongs to the plakin or cytolinker family of genes. The encoded protein is involved in cytoskeleton-membrane attachment in epithelial cells, muscle, and other tissues. It also may regulate changes of lymphocyte cytoarchitecture during polarization and extravasation. This gene has alternative splicing of different 5@#$%& exons to the first constant exon, which generates transcript variants that encode distinct isoforms.
26566	
26567	
26568	
26569	
26570	
26571	
26572	
26573	
26574	
26575	
26576	
26577	
26578	
26579	
26580	
26581	
26582	
26583	
26584	
26585	
26586	Amylases are secreted proteins that hydrolyze 1,4-alpha-glucoside bonds in oligosaccharides and polysaccharides, and thus catalyze the first step in digestion of dietary starch and glycogen. The human genome has a cluster of several amylase genes that are expressed at high levels in either salivary gland or pancreas. This gene encodes an amylase isoenzyme produced by the salivary gland. Alternative splicing results in multiple transcript variants encoding the same protein.
26587	
26588	
26589	This gene encodes a protein best known as a hematopoietic cell granule proteoglycan. Proteoglycans stored in the secretory granules of many hematopoietic cells also contain a protease-resistant peptide core, which may be important for neutralizing hydrolytic enzymes. This encoded protein was found to be associated with the macromolecular complex of granzymes and perforin, which may serve as a mediator of granule-mediated apoptosis.
26590	
26591	
26592	
26593	
26594	
26595	
26596	
26597	
26598	Striated muscle sarcomeres are highly organized structures composed of actin (thin) and myosin (thick) filaments that slide past each other during contraction. The integrity of sarcomeres is controlled by a set of structural proteins, among which are titin (TTN; MIM 188840), a giant molecule that contains several immunoglobulin (Ig)-like domains and associates with thin and thick filaments, and alpha-actinin (ACTN1; MIM 102575), an actin cross-linking protein. Mutations in several sarcomeric and sarcolemmal proteins have been shown to result in muscular dystrophy and cardiomyopathy.[supplied by OMIM]
26599	
26600	
26601	
26602	
26603	
26604	
26605	
26606	
26607	This gene encodes a trypsinogen, which is a member of the trypsin family of serine proteases. This enzyme is secreted by the pancreas and cleaved to its active form in the small intestine. It is active on peptide linkages involving the carboxyl group of lysine or arginine. Mutations in this gene are associated with hereditary pancreatitis. This gene and several other trypsinogen genes are localized to the T cell receptor beta locus on chromosome 7.
26608	
26609	
26610	Pleckstrin homology domains bind polyphosphoinositides and are involved in platelet cytoskeletal changes. This gene contains a (CAG)n trinucleotide repeat which has been associated with several neurological disorders in which the repeats code for polyglutamine in the protein. This gene may play an important role in the anti-apoptotic effects of insulin-like growth factor I.
26611	
26612	
26613	
26614	
26615	In mammals, histamine is metabolized by two major pathways: N(tau)-methylation via histamine N-methyltransferase and oxidative deamination via diamine oxidase. This gene encodes the first enzyme which is found in the cytosol and uses S-adenosyl-L-methionine as the methyl donor. In the mammalian brain, the neurotransmitter activity of histamine is controlled by N(tau)-methylation as diamine oxidase is not found in the central nervous system. A common genetic polymorphism affects the activity levels of this gene product in red blood cells.
26616	
26617	
26618	
26619	This gene is a candidate tumor suppressor gene. It is located within a homozygously deleted region of a metastatic prostate cancer. The gene is expressed in most nonlymphoid human tissues including prostate, lung, liver, and colon. Expression was also detected in many epithelial tumor cell lines. Two transcript variants encoding distinct isoforms have been identified for this gene.
26620	
26621	
26622	
26623	
26624	
26625	
26626	
26627	
26628	The protein encoded by this gene is only expressed in the S and G2 phases of the cell cycle, where it colocalizes with cytoplasmic tubulin and microtubules. In response to DNA damage, the encoded protein accumulates in the nucleus and binds the tumor suppressor protein p53, shuttling it out of the nucleus and repressing its ability to induce apoptosis.
26629	
26630	
26631	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L28E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
26632	
26633	YY1 is a ubiquitously distributed transcription factor belonging to the GLI-Kruppel class of zinc finger proteins. The protein is involved in repressing and activating a diverse number of promoters.  YY1 may direct histone deacetylases and histone acetyltransferases to a promoter in order to activate or repress the promoter, thus implicating histone modification in the function of YY1.
26634	
26635	The product of this gene is a membrane-associated protein that colocalizes with huntingtin. This protein has similarities to cytoskeleton proteins and its interaction with huntingtin is thought to play a functional role in the cell filament network. Loss of normal huntingtin-HIP1 interaction in Huntington disease may contribute to a defect in membrane-cytoskeletal integrity in the brain. This gene could help in the understanding of the normal function of huntingtin and also the pathogenesis of Huntington disease. It also has been implicated in the pathogenesis of hematopoietic malignancies. An alternative splice variant of this gene has been described but its full length sequence has not been determined.
26636	This gene belongs to the forkhead family of transcription factors which is characterized by a distinct DNA-binding forkhead domain.  The specific function of this gene has not yet been determined; however, it has been shown to play a role in the regulation of embryonic and ocular development.  Mutations in this gene cause various glaucoma phenotypes including primary congenital glaucoma, autosomal dominant iridogoniodysgenesis anomaly, and Axenfeld-Rieger anomaly.
26637	
26638	
26639	
26640	
26641	
26642	
26643	
26644	
26645	
26646	
26647	The PAX5 gene is a member of the paired box (PAX) family of transcription factors.  The central feature of this gene family is a novel, highly conserved DNA-binding motif, known as the paired box.  The PAX proteins are important regulators in early development, and alterations in the expression of their genes are thought to contribute to neoplastic transformation.  The PAX5 gene encodes the B-cell lineage specific activator protein (BSAP) that is expressed at early, but not late stages of B-cell differentiation.  Its expression has also been detected in developing CNS and testis, therefore, PAX5 gene product may not only play an important role in B-cell differentiation, but also in neural development and spermatogenesis.  The PAX5 gene is located in chromosome 9p13 region, which is involved in t(9;14)(p13;q32) translocations recurring in small lymphocytic lymphomas of the plasmacytoid subtype, and in derived large-cell lymphomas.  This translocation brings the potent E-mu enhancer of the IgH gene into close proximity of the PAX5 promoters, suggesting that the deregulation of PAX5 gene transcription contributes to the pathogenesis of these lymphomas.  A transcript variant arising as a consequence of alternative promoter usage, and containing a different coding exon 1(B), has been described, however, its full-length nature is not known.
26648	
26649	
26650	
26651	ENTPD6 is similar to E-type nucleotidases (NTPases). NTPases, such as CD39, mediate catabolism of extracellular nucleotides. ENTPD6 contains 4 apyrase-conserved regions which is characteristic of NTPases.
26652	
26653	ROD1 is a functional homolog of nrd1, an S. pombe RNA-binding protein that suppresses the onset of differentiation.[supplied by OMIM]
26654	
26655	
26656	
26657	
26658	
26659	
26660	
26661	
26662	
26663	
26664	
26665	
26666	
26667	Diacylglycerol kinases are thought to be involved mainly in the regeneration of phosphatidylinositol (PI) from diacylglycerol in the PI-cycle during cell signal transduction.  When expressed in mammalian cells, DGK-epsilon shows specificity for arachidonyl-containing diacylglycerol. DGK-epsilon is expressed predominantly in testis.
26668	
26669	Proper cohesion of sister chromatids is a prerequisite for the correct segregation of chromosomes during cell division. The cohesin multiprotein complex is required for sister chromatid cohesion. This complex is composed partly of two structural maintenance of chromosomes (SMC) proteins, SMC3 and either SMC1L2 or the protein encoded by this gene. Most of the cohesin complexes dissociate from the chromosomes before mitosis, although those complexes at the kinetochore remain. Therefore, the encoded protein is thought to be an important part of functional kinetochores. In addition, this protein interacts with BRCA1 and is phosphorylated by ATM, indicating a potential role for this protein in DNA repair. This gene, which belongs to the SMC gene family, is located in an area of the X-chromosome that escapes X inactivation.
26670	
26671	
26672	
26673	
26674	
26675	The ERM protein family members ezrin, radixin, and moesin are cytoskeletal effector proteins linking actin to membrane-bound proteins at the cell surface. Myosin regulatory light chain interacting protein (MIR) is a novel ERM-like protein that interacts with myosin regulatory light chain and inhibits neurite outgrowth.
26676	
26677	
26678	
26679	
26680	
26681	
26682	Childhood- and adolescent-onset forms of familial ALS (see ALS1, MIM 105400) carry the designation @#$%&juvenile ALS.@#$%& Several forms of autosomal recessive juvenile ALS have been identified; see ALS2 (MIM 205100) and ALS5 (MIM 602099).[supplied by OMIM]
26683	
26684	
26685	
26686	
26687	
26688	
26689	
26690	
26691	
26692	
26693	
26694	
26695	
26696	
26697	The protein encoded by this gene is a regulatory subunit of the AMP-activated protein kinase (AMPK). AMPK is a heterotrimer consisting of an alpha catalytic subunit, and non-catalytic beta and gamma subunits. AMPK is an important energy-sensing enzyme that monitors cellular energy status. In response to cellular metabolic stresses, AMPK is activated, and thus phosphorylates and inactivates acetyl-CoA carboxylase (ACC) and beta-hydroxy beta-methylglutaryl-CoA reductase (HMGCR), key enzymes involved in regulating de novo biosynthesis of fatty acid and cholesterol. This subunit may be a positive regulator of AMPK activity. It is highly expressed in skeletal muscle and thus may have tissue-specific roles.
26698	
26699	
26700	
26701	Core binding factor (CBF) is a heterodimeric transcription factor that binds to the core element of many enhancers and promoters. The protein encoded by this gene represents the alpha subunit of CBF and is thought to be involved in the development of normal hematopoiesis. Chromosomal translocations involving this gene are well-documented and have been associated with several types of leukemia. Two transcript variants encoding different isoforms have been found for this gene.
26702	ADM2 belongs to a family of calcitonin (MIM 114130)-related peptide hormones important for regulating diverse physiologic functions and the chemical composition of fluids and tissues.[supplied by OMIM]
26703	
26704	
26705	
26706	
26707	
26708	
26709	
26710	
26711	
26712	
26713	
26714	
26715	
26716	
26717	
26718	The NF-kappa-B (NFKB) complex of proteins is inhibited by I-kappa-B (IKB) proteins, which inactivate NFKB by trapping it in the cytoplasm. Phosphorylation of serine residues on the IKB proteins by IKB kinases marks them for destruction via the ubiquitination pathway, thereby allowing activation and nuclear translocation of the NFKB complex. The protein encoded by this gene is similar to IKB kinases and can mediate NFKB activation in response to certain growth factors. For example, the protein can form a complex with the IKB protein TANK and TRAF2 and release the NFKB inhibition caused by TANK.
26719	
26720	
26721	
26722	
26723	
26724	
26725	
26726	This gene encodes a protein containing a SET domain, 2 LXXLL motifs, 3 nuclear translocation signals (NLSs), 4 plant homeodomain (PHD) finger regions, and a proline-rich region. The encoded protein enhances androgen receptor (AR) transactivation, and this enhancement can be increased further in the presence of other androgen receptor associated coregulators. This protein may act as a nucleus-localized, basic transcriptional factor and also as a bifunctional transcriptional regulator. Mutations of this gene have been associated with Sotos syndrome and Weaver syndrome. One version of childhood acute myeloid leukemia is the result of a cryptic translocation with the breakpoints occurring within nuclear receptor-binding Su-var, enhancer of zeste, and trithorac domain protein 1 on chromosome 5 and nucleoporin, 98-kd on chromosome 11. Two transcript variants encoding distinct isoforms have been identified for this gene.
26727	
26728	
26729	The protein encoded by this gene is a serine/threonine protein kinase and a member of the casein kinase I protein family, whose members have been implicated in the control of cytoplasmic and nuclear processes, including DNA replication and repair. The encoded protein is found in the cytoplasm as a monomer and can phosphorylate a variety of proteins, including itself. This protein has been shown to phosphorylate period, a circadian rhythm protein. Two transcript variants encoding the same protein have been found for this gene.
26730	
26731	
26732	
26733	
26734	
26735	
26736	
26737	
26738	
26739	
26740	This gene encodes a tumor suppressor protein that includes among its many intracellular functions one of nuclear export. Defects in this gene cause familial adenomatous polyposis (FAP), an autosomal dominant pre-malignant disease that usually progresses to malignancy. Disease-associated mutations tend to be clustered in a small region designated the mutation cluster region (MCR) and result in a truncated protein product.
26741	
26742	
26743	The specific function of this gene has yet to be determined; however, it has been speculated to play a role in transcription regulation and/or chromatin and histone metabolism.  It is considered the primary candidate gene in some haploinsufficiency syndromes such as DiGeorge syndrome, and insufficient production of the gene may disrupt normal embryonic development.  This gene is mapped to 22q11.21, centromeric to COMT.
26744	
26745	Ornithine decarboxylase catalyzes the conversion of ornithine to putrescine in the first and apparently rate-limiting step in polyamine biosynthesis. The ornithine decarboxylase antizymes play a role in the regulation of polyamine synthesis by binding to and inhibiting ornithine decarboxylase. Antizyme expression is auto-regulated by polyamine-enhanced translational frameshifting. The antizyme encoded by this gene inhibits ornithine decarboxylase and accelerates its degradation.
26746	
26747	
26748	
26749	
26750	
26751	
26752	
26753	
26754	
26755	
26756	This gene encodes a member of the epidermal growth factor receptor (EGFR) family of receptor tyrosine kinases. This membrane-bound protein has a neuregulin binding domain but not an active kinase domain. It therefore can bind this ligand but not convey the signal into the cell through protein phosphorylation. However, it does form heterodimers with other EGF receptor family members which do have kinase activity. Heterodimerization leads to the activation of pathways which lead to cell proliferation or differentiation. Amplification of this gene and/or overexpression of its protein have been reported in numerous cancers, including prostate, bladder, and breast tumors. Alternate transcriptional splice variants encoding different isoforms have been characterized. One isoform lacks the intermembrane region and is secreted outside the cell. This form acts to modulate the activity of the membrane-bound form. Additional splice variants have also been reported, but they have not been thoroughly characterized.
26757	
26758	
26759	
26760	
26761	
26762	This gene encodes a member of the epidermal growth factor receptor (EGFR) family of receptor tyrosine kinases. This membrane-bound protein has a neuregulin binding domain but not an active kinase domain. It therefore can bind this ligand but not convey the signal into the cell through protein phosphorylation. However, it does form heterodimers with other EGF receptor family members which do have kinase activity. Heterodimerization leads to the activation of pathways which lead to cell proliferation or differentiation. Amplification of this gene and/or overexpression of its protein have been reported in numerous cancers, including prostate, bladder, and breast tumors. Alternate transcriptional splice variants encoding different isoforms have been characterized. One isoform lacks the intermembrane region and is secreted outside the cell. This form acts to modulate the activity of the membrane-bound form. Additional splice variants have also been reported, but they have not been thoroughly characterized.
26763	
26764	Aggrecan 1 is a member of the aggrecan/versican proteoglycan family.  As an integral part of the extracellular matrix in cartilagenous tissue, the function of aggrecan 1 is to withstand compression in cartilage.  Mutations in this gene may be involved in skeletal dysplasia and spinal degeneration.  Alternative splicing of two domains has been observed in the aggrecan 1 gene; however, it is unknown whether other transcripts exist or what effects they have on gene function.
26765	
26766	
26767	
26768	
26769	
26770	
26771	
26772	
26773	This gene encodes a member of the eyes absent (EYA) family of proteins. The encoded protein may act as a transcriptional activator and have a role during development. A similar protein in mice can act as a transcriptional activator. Two transcript variants encoding distinct isoforms have been identified for this gene.
26774	
26775	
26776	This gene encodes the heavy subunit of the ferritin protein. Ferritin is the major intracellular iron storage protein in prokaryotes and eukaryotes. It is composed of 24 subunits of the heavy and light ferritin chains. Variation in ferritin subunit composition may affect the rates of iron uptake and release in different tissues. A major function of ferritin is the storage of iron in a soluble and nontoxic state. Defects in ferritin proteins are associated with several neurodegenerative diseases. This gene has multiple pseudogenes. Several alternatively spliced transcript variants have been observed, but their biological validity has not been determined.
26777	
26778	
26779	
26780	
26781	
26782	
26783	
26784	
26785	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
26786	
26787	Members of the F-box protein family, such as FBXO34, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
26788	
26789	The protein encoded by this gene is a cell-surface glycoprotein involved in cell-cell interactions, cell adhesion and migration. It is a receptor for hyaluronic acid (HA) and can also interact with other ligands, such as osteopontin, collagens, and matrix metalloproteinases (MMPs). This protein participates in a wide variety of cellular functions including lymphocyte activation, recirculation and homing, hematopoiesis, and tumor metastasis. Transcripts for this gene undergo complex alternative splicing that results in many functionally distinct isoforms, however, the full length nature of some of these variants has not been determined. Alternative splicing is the basis for the structural and functional diversity of this protein, and may be related to tumor metastasis.
26790	
26791	
26792	
26793	
26794	The protein encoded by this gene is a cell-surface glycoprotein involved in cell-cell interactions, cell adhesion and migration. It is a receptor for hyaluronic acid (HA) and can also interact with other ligands, such as osteopontin, collagens, and matrix metalloproteinases (MMPs). This protein participates in a wide variety of cellular functions including lymphocyte activation, recirculation and homing, hematopoiesis, and tumor metastasis. Transcripts for this gene undergo complex alternative splicing that results in many functionally distinct isoforms, however, the full length nature of some of these variants has not been determined. Alternative splicing is the basis for the structural and functional diversity of this protein, and may be related to tumor metastasis.
26795	
26796	The protein encoded by this gene belongs to the stathmin/oncoprotein 18 family of microtubule-destabilizing phosphoproteins. It is similar to the SCG10 protein and is involved in signal transduction and regulation of microtubule dynamics.
26797	
26798	
26799	
26800	
26801	
26802	
26803	This gene encodes the cellular form of serine hydroxymethyltransferase, a pyridoxal phosphate-containing enzyme that catalyzes the reversible conversion of serine and tetrahydrofolate to glycine and 5,10-methylene tetrahydrofolate. This reaction provides one carbon units for synthesis of methionine, thymidylate, and purines in the cytoplasm. This gene is located within the Smith-Magenis syndrome region on chromosome 17. Alternative splicing of this gene results in 2 transcript variants encoding 2 different isoforms. Additional transcript variants have been described, but their biological validity has not been determined.
26804	
26805	
26806	
26807	
26808	
26809	Renin catalyzes the first step in the activation pathway of angiotensinogen--a cascade that can result in aldosterone release,vasoconstriction, and increase in blood pressure. Renin, an aspartyl protease, cleaves angiotensinogen to form angiotensin I, which is converted to angiotensin II by angiotensin I converting enzyme, an important regulator of blood pressure and electrolyte balance. Transcript variants that encode different protein isoforms and that arise from alternative splicing and the use of alternative promoters have been described, but their full-length nature has not been determined. Mutations in this gene have been shown to cause familial hyperproreninemia.
26810	
26811	
26812	
26813	
26814	
26815	
26816	The protein encoded by this gene is a member of the peptidyl-prolyl cis-trans isomerase (PPIase) family. PPIases catalyze the cis-trans isomerization of proline imidic peptide bonds in oligopeptides and accelerate the folding of proteins. This protein is a cyclosporin binding-protein. It may play a role in cyclosporin A-mediated immunosuppression. This protein can interact with several HIV proteins including p55 gag, Vpr, and capsid protein. It has been shown to be necessary for the formation of infectious HIV virions. Multiple pseudogenes that map to different chromosomes have been reported. Three alternatively spliced transcript variants encoding two distinct isoforms have been observed.
26817	
26818	
26819	
26820	
26821	
26822	
26823	
26824	The protein encoded by this gene is the human ortholog of bovine and rat dipeptidylpeptidase VI. They are transmembrane proteins and likely members of the dipeptidyl aminopeptidase family. Two transcript variants encoding different isoforms have been identified in all 3 species. In bovine and rat, the longer form has been shown to be expressed almost exclusively in brain, whereas the shorter form has a more widespread expression.
26825	
26826	Translation initiation is mediated by specific recognition of the cap structure by eukaryotic translation initiation factor 4F (eIF4F), which is a cap binding protein complex that consists of three subunits: eIF4A, eIF4E and eIF4G. The protein encoded by this gene shares similarity with the C-terminal region of eIF4G, that contains the binding sites for eIF4A and eIF3; eIF4G in addition, contains a binding site for eIF4E at the N-terminus. Unlike eIF4G which supports cap-dependent and independent translation, this gene product functions as a general repressor of translation by forming translationally inactive complexes. In vitro and in vivo studies indicate that translation of this mRNA initiates exclusively at a non-AUG (GUG) codon.
26827	
26828	
26829	
26830	
26831	
26832	This gene encodes a dual-specificity phosphatase that associates with polyribosomes. The encoded protein may be involved in the regulation of glycogen metabolism. Mutations in this gene have been associated with myoclonic epilepsy of Lafora. Alternative splicing results in multiple transcript variants.
26833	
26834	
26835	
26836	
26837	
26838	
26839	
26840	Cytosolic and membrane-bound forms of glutathione S-transferase are encoded by two distinct supergene families. At present, eight distinct classes of the soluble cytoplasmic mammalian glutathione S-transferases have been identified: alpha, kappa, mu, omega, pi, sigma, theta and zeta. This gene encodes a glutathione S-transferase that belongs to the mu class. The mu class of enzymes functions in the detoxification of electrophilic compounds, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress, by conjugation with glutathione. The genes encoding the mu class of enzymes are organized in a gene cluster on chromosome 1p13.3 and are known to be highly polymorphic. These genetic variations can change an individual@#$%&s susceptibility to carcinogens and toxins as well as affect the toxicity and efficacy of certain drugs. Mutations of this class mu gene have been linked with a slight increase in a number of cancers, likely due to exposure with environmental toxins.
26841	A reciprocal translocation between chromosomes 22 and 9 produces the Philadelphia chromosome, which is often found in patients with chronic myelogenous leukemia. The chromosome 22 breakpoint for this translocation is located within the BCR gene. The translocation produces a fusion protein which is encoded by sequence from both BCR and ABL, the gene at the chromosome 9 breakpoint. Although the BCR-ABL fusion protein has been extensively studied, the function of the normal BCR gene product is not clear. The protein has serine/threonine kinase activity and is a GTPase-activating protein for p21rac.
26842	
26843	The second major step in the catabolism of the branched-chain amino acids, isoleucine, leucine, and valine, is catalyzed by the branched-chain alpha-keto acid dehydrogenase complex (BCKD; EC 1.2.4.4), an inner-mitochondrial enzyme complex that consists of 3 catalytic components: a heterotetrameric (alpha2, beta2) branched-chain alpha-ketoacid decarboxylase (E1-alpha; MIM 608348 and E1-beta; MIM 248611), a homo-24-meric dihydrolipoyl transacylase (E2), and a homodimeric dihydrolipoamide dehydrogenase (E3; MIM 246900). The reaction is irreversible and constitutes the first committed step in BCAA oxidation. The complex also contains 2 regulatory enzymes, a kinase and a phosphorylase. The DBT gene encodes the E2 component.[supplied by OMIM]
26844	
26845	
26846	
26847	
26848	This gene encodes a cisplatin resistance-associated overexpressed protein (CROP). The N-terminal half of the CROP contains cysteine/histidine motifs and leucine zipper-like repeats, and the C-terminal half is rich in arginine and glutamate residues (RE domain) and arginine and serine residues (RS domain). This protein localizes with a speckled pattern in the nucleus, and could be involved in the formation of splicesome via the RE and RS domains. Two alternatively spliced transcript variants encoding the same protein have been found for this gene.
26849	
26850	
26851	
26852	
26853	
26854	
26855	
26856	
26857	
26858	
26859	
26860	
26861	
26862	
26863	
26864	
26865	
26866	
26867	
26868	
26869	In cardiac myocytes, Ca(2+) concentrations alternate between high levels during contraction and low levels during relaxation. The increase in Ca(2+) concentration during contraction is primarily due to release of Ca(2+) from intracellular stores. However, some Ca(2+) also enters the cell through the sarcolemma (plasma membrane). During relaxation, Ca(2+) is sequestered within the intracellular stores. To prevent overloading of intracellular stores, the Ca(2+) that entered across the sarcolemma must be extruded from the cell. The Na(+)-Ca(2+) exchanger is the primary mechanism by which the Ca(2+) is extruded from the cell during relaxation. In the heart, the exchanger may play a key role in digitalis action. The exchanger is the dominant mechanism in returning the cardiac myocyte to its resting state following excitation.[supplied by OMIM]
26870	
26871	The protein encoded by this gene is a member of the LAR protein-tyrosine phosphatase-interacting protein (liprin) family. Liprins interact with members of LAR family of transmembrane protein tyrosine phosphatases, which are known to be important for axon guidance and mammary gland development. This protein binds to the intracellular membrane-distal phosphatase domain of tyrosine phosphatase LAR, and appears to localize LAR to cell focal adhesions. This interaction may regulate the disassembly of focal adhesion and thus help orchestrate cell-matrix interactions. Alternatively spliced transcript variants encoding distinct isoforms have been described.
26872	
26873	
26874	
26875	
26876	
26877	
26878	
26879	
26880	
26881	
26882	
26883	
26884	
26885	
26886	
26887	
26888	
26889	
26890	
26891	
26892	
26893	
26894	This gene product, Aldolase A (fructose-bisphosphate aldolase) is a glycolytic enzyme that catalyzes the reversible conversion of fructose-1,6-bisphosphate to glyceraldehyde 3-phosphate and dihydroxyacetone phosphate. Three aldolase isozymes (A, B, and C), encoded by three different genes, are differentially expressed during development. Aldolase A is found in the developing embryo and is produced in even greater amounts in adult muscle. Aldolase A expression is repressed in adult liver, kidney and intestine and similar to aldolase C levels in brain and other nervous tissue. Aldolase A deficiency has been associated with myopathy and hemolytic anemia. Alternative splicing of this gene results in multiple transcript variants which encode the same protein.
26895	
26896	The protein encoded by this gene is one polypeptide of a small nuclear ribonucleoprotein complex and belongs to the snRNP SMB/SMN family. The protein plays a role in pre-mRNA processing, possibly tissue-specific alternative splicing events. Although individual snRNPs are believed to recognize specific nucleic acid sequences through RNA-RNA base pairing, the specific role of this family member is unknown. The protein arises from a bicistronic transcript that also encodes a protein identified as the SNRPN upstream reading frame (SNURF). Multiple transcription initiation sites have been identified and extensive alternative splicing occurs in the 5@#$%& untranslated region. Additional splice variants have been described but sequences for the complete transcripts have not been determined. The 5@#$%& UTR of this gene has been identified as an imprinting center. Alternative splicing or deletion caused by a translocation event in this paternally-expressed region is responsible for Angelman syndrome or Prader-Willi syndrome due to parental imprint switch failure.
26897	
26898	
26899	
26900	
26901	
26902	
26903	
26904	
26905	
26906	
26907	
26908	
26909	The protein encoded by this gene is involved in the import of precursor proteins into mitochondria. The encoded protein has a chaperone-like activity, binding the mature portion of unfolded proteins and aiding their import into mitochondria. This protein, which is found in the cytoplasm and sometimes associated with the outer mitochondrial membrane, has a weak ATPase activity and contains 6 TPR repeats.
26910	
26911	
26912	
26913	
26914	Hook proteins are cytosolic coiled-coil proteins that contain conserved N-terminal domains, which attach to microtubules, and more divergent C-terminal domains, which mediate binding to organelles. The Drosophila Hook protein is a component of the endocytic compartment.[supplied by OMIM]
26915	
26916	
26917	
26918	
26919	The origin recognition complex (ORC) is a highly conserved six subunits protein complex essential for the initiation of the DNA replication in eukaryotic cells. Studies in yeast demonstrated that ORC binds specifically to origins of replication and serves as a platform for the assembly of additional initiation factors such as Cdc6 and Mcm proteins. The protein encoded by this gene is a subunit of the ORC complex. This protein forms a core complex with ORC3L, -4L, and -5L. It also interacts with CDC45L and MCM10, which are proteins known to be important for the initiation of DNA replication. This protein has been demonstrated to specifically associate with the origin of replication of Epstein-Barr virus in human cells, and is thought to be required for DNA replication from viral origin of replication.
26920	
26921	
26922	
26923	
26924	HSF1 is a heat-shock transcription factor. Transcription of heat-shock genes is rapidly induced after temperature stress.  Hsp90, by itself and/or associated with multichaperone complexes, is a major repressor of HSF1.
26925	
26926	
26927	
26928	This gene encodes a member of the TGF-beta family of proteins. The encoded protein is secreted and plays a role in left-right asymmetry determination of organ systems during development. The protein may also play a role in endometrial bleeding. Mutations in this gene have been associated with left-right axis malformations, particularly in the heart and lungs. Some types of infertility have been associated with dysregulated expression of this gene in the endometrium. Alternative processing of this protein can yield three different products. This gene is closely linked to both a related family member and a related pseudogene.
26929	A reciprocal translocation between chromosomes 22 and 9 produces the Philadelphia chromosome, which is often found in patients with chronic myelogenous leukemia. The chromosome 22 breakpoint for this translocation is located within the BCR gene. The translocation produces a fusion protein which is encoded by sequence from both BCR and ABL, the gene at the chromosome 9 breakpoint. Although the BCR-ABL fusion protein has been extensively studied, the function of the normal BCR gene product is not clear. The protein has serine/threonine kinase activity and is a GTPase-activating protein for p21rac.
26930	
26931	Thyrotroph embryonic factor (TEF), a transcription factor, is a member of the PAR (proline and acidic amino acid-rich) subfamily of basic region/leucine zipper (bZIP) transcription factors.  It is expressed in a broad range of cells and tissues in adult animals, however, during embryonic development, TEF expression appears to be restricted to the developing anterior pituitary gland, coincident with the appearance of thyroid-stimulating hormone, beta (TSHB).  Indeed, TEF can bind to, and transactivate the TSHB promoter.  It shows homology (in the functional domains) with other members of the PAR-bZIP subfamily of transcription factors, which include albumin D box-binding protein (DBP), human hepatic leukemia factor (HLF) and chicken vitellogenin gene-binding protein (VBP); VBP is considered the chicken homologue of TEF.  Different members of the subfamily can readily form heterodimers, and share DNA-binding, and transcriptional regulatory properties.
26932	
26933	
26934	The specific function of this gene has yet to be determined; however, it has been speculated to play a role in transcription regulation and/or chromatin and histone metabolism.  It is considered the primary candidate gene in some haploinsufficiency syndromes such as DiGeorge syndrome, and insufficient production of the gene may disrupt normal embryonic development.  This gene is mapped to 22q11.21, centromeric to COMT.
26935	
26936	
26937	
26938	
26939	
26940	
26941	
26942	
26943	
26944	
26945	
26946	
26947	NUP160 is 1 of up to 60 proteins that make up the 120-MD nuclear pore complex, which mediates nucleoplasmic transport.[supplied by OMIM]
26948	
26949	
26950	
26951	
26952	
26953	
26954	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation.
26955	
26956	
26957	
26958	
26959	
26960	
26961	
26962	
26963	
26964	This gene encodes tissue-type plasminogen activator, a secreted serine protease which converts the proenzyme plasminogen to plasmin, a fibrinolytic enzyme. Tissue-type plasminogen activator is synthesized as a single chain which is cleaved by plasmin to a two chain disulfide linked protein. This enzyme plays a role in cell migration and tissue remodeling. Increased enzymatic activity causes hyperfibrinolysis, which manifests as excessive bleeding; decreased activity leads to hypofibrinolysis which can result in thrombosis or embolism. Alternative splicing of this gene produces three transcripts.
26965	
26966	
26967	
26968	
26969	
26970	
26971	
26972	
26973	
26974	
26975	
26976	
26977	CD4 (MIM 186940)-positive T helper lymphocytes can be divided into types 1 (Th1) and 2 (Th2) on the basis of their cytokine secretion patterns. Th1 cells and their associated cytokines are involved in cell-mediated immunity to intracellular pathogens and delayed-type hypersensitivity reactions, whereas Th2 cells are involved in the control of extracellular helminthic infections and the promotion of atopic and allergic diseases. The 2 types of cells also cross-regulate the functions of the other. TIM3 is a Th1-specific cell surface protein that regulates macrophage activation and enhances the severity of experimental autoimmune encephalomyelitis in mice.[supplied by OMIM]
26978	This gene is a member of the inhibitor of apoptosis (IAP) gene family, which encode negative regulatory proteins that prevent apoptotic cell death. IAP family members usually contain multiple baculovirus IAP repeat (BIR) domains, but this gene encodes proteins with only a single BIR domain. The encoded proteins also lack a C-terminus RING finger domain. Gene expression is high during fetal development and in most tumors yet low in adult tissues. Antisense transcripts are involved in the regulation of this gene@#$%&s expression. At least four transcript variants encoding distinct isoforms have been found for this gene, but the full-length natures of only three of them have been determined.
26979	
26980	
26981	
26982	
26983	
26984	
26985	Chloride channels are a diverse group of proteins that regulate fundamental cellular processes including stabilization of cell membrane potential, transepithelial transport, maintenance of intracellular pH, and regulation of cell volume. Chloride intracellular channel 2 is a member of the p64 family; the protein is detected in fetal liver and adult skeletal muscle tissue.  This gene maps to the candidate region on chromosome X for incontinentia pigmenti.
26986	This gene is a member of the cyclophilin family of peptidylprolyl isomerases. The cyclophilins are a highly conserved ubiquitous family, members of which play an important role in protein folding, immunosuppression by cyclosporin A, and infection of HIV-1 virions. This protein interacts with the proteinase inhibitor eglin c and is localized in the nucleus. This gene encodes three transcript variants encoding two distinct isoforms.
26987	
26988	
26989	
26990	
26991	
26992	
26993	
26994	
26995	
26996	
26997	
26998	
26999	
27000	
27001	
27002	
27003	
27004	This gene encodes a disintegrin and metalloproteinase with thrombospondin (ADAMTS) motifs-13, which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS) motif. Individual members of this family differ in the number of C-terminal TS motifs, and some have unique C-terminal domains. The enzyme encoded by this gene is the von Willebrand Factor (vWF)-cleaving protease, which is responsible for cleaving at the site of Tyr842-Met843 of the vWF molecule. A deficiency of this enzyme is associated with thrombotic thrombocytopenic purpura. Alternative splicing of this gene generates at least 4 transcript variants encoding different isoforms.
27005	
27006	
27007	
27008	
27009	
27010	
27011	
27012	The protein encoded by this gene is a member of the STAT protein family. In response to cytokines and growth factors, STAT family members are phosphorylated by the receptor associated kinases, and then form homo- or heterodimers that translocate to the cell nucleus where they act as transcription activators. This protein is activated through phosphorylation in response to various cytokines and growth factors including IFNs, EGF, IL5, IL6, HGF, LIF and BMP2. This protein mediates the expression of a variety of genes in response to cell stimuli, and thus plays a key role in many cellular processes such as cell growth and apoptosis. The small GTPase Rac1 has been shown to bind and regulate the activity of this protein. PIAS3 protein is a specific inhibitor of this protein. Three alternatively spliced transcript variants encoding distinct isoforms have been described.
27013	
27014	
27015	
27016	
27017	
27018	
27019	
27020	
27021	
27022	
27023	
27024	
27025	
27026	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S6P family. Pseudogenes corresponding to this gene are found on chromosomes 1p and 12q.
27027	Casein kinase II is a serine/threonine protein kinase that phosphorylates acidic proteins such as casein. The kinase exists as a tetramer and is composed of an alpha, an alpha-prime, and two beta subunits. The alpha subunits contain the catalytic activity while the beta subunits undergo autophosphorylation. The protein encoded by this gene represents the alpha subunit. While this gene is found on chromosome 20, a related transcribed pseudogene is found on chromosome 11. Three transcript variants encoding two different proteins have been found for this gene.
27028	
27029	
27030	
27031	
27032	
27033	
27034	
27035	
27036	
27037	Transcriptional regulatory proteins containing tandemly repeated zinc finger domains are thought to be involved in both normal and abnormal cellular proliferation and differentiation. ZNF161 is a C2H2-type zinc finger protein. See MIM 603971 for general information on zinc finger proteins.[supplied by OMIM]
27038	Tyrosine hydroxylase (EC 1.14.16.2) is involved in the conversion of phenylalanine to dopamine. As the rate-limiting enzyme in the synthesis of catecholamines, tyrosine hydroxylase has a key role in the physiology of adrenergic neurons.[supplied by OMIM]
27039	
27040	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene belongs to class II of the histone deacetylase/acuc/apha family. It possesses histone deacetylase activity and represses transcription when tethered to a promoter. This protein does not bind DNA directly, but through transcription factors MEF2C and MEF2D. It seems to interact in a multiprotein complex with RbAp48 and HDAC3.
27041	
27042	
27043	
27044	
27045	
27046	
27047	
27048	This gene is a member of the tropomyosin family of highly conserved, widely distributed actin-binding proteins involved in the contractile system of striated and smooth muscles and the cytoskeleton of non-muscle cells. Tropomyosin is composed of two alpha-helical chains arranged as a coiled-coil. It is polymerized end to end along the two grooves of actin filaments and provides stability to the filaments. The encoded protein is one type of alpha helical chain that forms the predominant tropomyosin of striated muscle, where it also functions in association with the troponin complex to regulate the calcium-dependent interaction of actin and myosin during muscle contraction. In smooth muscle and non-muscle cells, alternatively spliced transcript variants encoding a range of isoforms have been described. Mutations in this gene are associated with type 3 familial hypertrophic cardiomyopathy.
27049	
27050	
27051	
27052	
27053	This gene encodes a protein that has sequence similarity to zeta crystallin, also known as quinone oxidoreductase. This zeta crystallin-like protein also contains an NAD(P)H binding site. The specific function of this protein is not known. Alternatively spliced variants that encode different protein isoforms have been identified but the full-length nature of only some of them is known. This gene also seems to contain a rare AT-AC splice-site.
27054	
27055	
27056	
27057	This gene encodes a protein containing an alpha/beta hydrolase fold, which is a catalytic domain found in a very wide range of enzymes. The function of this protein has not been determined. Alternative splicing of this gene results in two transcript variants encoding the same protein.
27058	This gene encodes a protein containing an alpha/beta hydrolase fold, which is a catalytic domain found in a very wide range of enzymes. The function of this protein has not been determined. Alternative splicing of this gene results in two transcript variants encoding the same protein.
27059	
27060	KRT18 encodes the type I intermediate filament chain keratin 18. Keratin 18, together with its filament partner keratin 8, are perhaps the most commonly found members of the intermediate filament gene family. They are expressed in single layer epithelial tissues of the body. Mutations in this gene have been linked to cryptogenic cirrhosis. Two transcript variants encoding the same protein have been found for this gene.
27061	WW domain-containing proteins are found in all eukaryotes and play an important role in the regulation of a wide variety of cellular functions such as protein degradation, transcription, and RNA splicing. This gene encodes a protein which contains 2 WW domains and a short-chain dehydrogenase/reductase domain (SRD). The highest normal expression of this gene is detected in hormonally regulated tissues such as testis, ovary, and prostate. This expression pattern and the presence of an SRD domain suggest a role for this gene in steroid metabolism. The encoded protein is more than 90% identical to the mouse protein, which is an essential mediator of tumor necrosis factor-alpha-induced apoptosis, suggesting a similar, important role in apoptosis for the human protein. In addition, there is evidence that this gene behaves as a suppressor of tumor growth. Alternative splicing of this gene generates 7 transcript variants, which encode different isoforms.
27062	
27063	
27064	The protein encoded by this gene is a major component of the inner surface of caveolae, small invaginations of the plasma membrane, and is involved in essential cellular functions, including signal transduction, lipid metabolism, cellular growth control and apoptosis. This protein may function as a tumor suppressor. CAV1 and CAV2 are located next to each other on chromosome 7 and express colocalizing proteins that form a stable hetero-oligomeric complex. Two transcript variants encoding distinct isoforms have been identified for this gene. By using alternative initiation codons in the same reading frame, two isoforms (alpha and beta) are encoded by one transcript.
27065	
27066	The protein encoded by this gene is predicted to have a single transmembrane (TM)-spanning domain and multiple copies of an immunoglobulin-like plexin-transcription-factor domain. Alternative splicing results in two transcript variants encoding different isoforms. Other alternatively spliced transcripts have been described, but the full length sequences have not been determined. Several of these transcripts are predicted to encode truncated products which lack the TM and may be secreted. Mutations in this gene cause autosomal recessive polycystic kidney disease, also known as polycystic kidney and hepatic disease-1.
27067	
27068	
27069	
27070	
27071	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. The encoded protein may play a role in chondrogenesis. A pseudogene of this gene is located on chromosome 8. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
27072	
27073	
27074	
27075	
27076	
27077	
27078	
27079	
27080	
27081	The protein encoded by this gene is a small glycolipid transport protein which acts as a substrate specific co-factor for the lysosomal enzyme beta-hexosaminidase A. Beta-hexosaminidase A, together with GM2 ganglioside activator, catalyzes the degradation of the ganglioside GM2, and other molecules containing terminal N-acetyl hexosamines. Mutations in this gene result in GM2-gangliosidosis type AB or the AB variant of Tay-Sachs disease.
27082	
27083	
27084	
27085	
27086	
27087	
27088	
27089	
27090	
27091	
27092	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and splicesosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, which is an ATPase activated by a variety of RNA species but not by dsDNA. This protein and that encoded by DDX5 gene are more closely related to each other than to any other member of the DEAD box family. Alternative splicing of this gene generates 2 transcript variants encoding different isoforms with the longer transcript reported to also initiate translation at a non-AUG (CUG) start site.
27093	The dystrophin gene is the largest gene found in nature, measuring 2.4 Mb. The gene was identified through a positional cloning approach, targeted at the isolation of the gene responsible for Duchenne (DMD) and Becker (BMD) Muscular Dystrophies. DMD is a recessive, fatal, X-linked disorder occurring at a frequency of about 1 in 3,500 new-born males. BMD is a milder allelic form. In general, DMD patients carry mutations which cause premature translation termination (nonsense or frame shift mutations), while in BMD patients dystrophin is reduced either in molecular weight (derived from in-frame deletions) or in expression level. The dystrophin gene is highly complex, containing at least eight independent, tissue-specific promoters and two polyA-addition sites. Furthermore, dystrophin RNA is differentially spliced, producing a range of different transcripts, encoding a large set of protein isoforms. Dystrophin (as encoded by the Dp427 transcripts) is a large, rod-like cytoskeletal protein which is found at the inner surface of muscle fibers. Dystrophin is part of the dystrophin-glycoprotein complex (DGC), which bridges the inner cytoskeleton (F-actin) and the extra-cellular matrix.
27094	
27095	
27096	
27097	
27098	
27099	
27100	
27101	
27102	
27103	
27104	
27105	
27106	
27107	
27108	
27109	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit functions as guanine nucleotide exchange factor. It is reported that this subunit interacts with HIV-1 Tat, and thus it represses the translation of host-cell, but not HIV-1, mRNAs. Several alternatively spliced transcript variants have been found for this gene, however, the full length nature of only two variants has been determined.
27110	
27111	
27112	
27113	
27114	
27115	
27116	
27117	
27118	
27119	
27120	
27121	
27122	
27123	
27124	This gene encodes a member of the Golgi-localized, gamma adaptin ear-containing, ARF-binding (GGA) protein family. Members of this family are ubiquitous coat proteins that regulate the trafficking of proteins between the trans-Golgi network and the lysosome. These proteins share an amino-terminal VHS domain which mediates sorting of the mannose 6-phosphate receptors at the trans-Golgi network. They also contain a carboxy-terminal region with homology to the ear domain of gamma-adaptins. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
27125	The protein encoded by this gene is a receptor for activated C-kinase (RACK) protein. The encoded protein has been shown to bind in vitro to activated protein kinase C beta I. In addition, this protein is a cutaneous T-cell lymphoma-associated antigen. Finally, the protein contains a bromodomain and two zinc fingers, and is thought to be a transcriptional regulator. Multiple transcript variants encoding several different isoforms have been found for this gene.
27126	
27127	
27128	
27129	
27130	
27131	Aminoacyl-tRNA synthetases catalyze the aminoacylation of tRNA by their cognate amino acid. Because of their central role in linking amino acids with nucleotide triplets contained in tRNAs, aminoacyl-tRNA synthetases are thought to be among the first proteins that appeared in evolution. Two forms of tryptophanyl-tRNA synthetase exist, a cytoplasmic form, named WARS, and a mitochondrial form, named WARS2. This gene encodes the mitochondrial tryptophanyl-tRNA synthetase. Two alternative transcripts encoding different isoforms have been described.
27132	
27133	
27134	
27135	
27136	
27137	
27138	
27139	
27140	
27141	
27142	
27143	
27144	
27145	This gene encodes a membrane-anchored protein with a hydrophobic amino terminal domain and a cyclophilin-like PPIase domain. It is present on the surface of natural killer cells and facilitates their binding to targets. Alternate transcriptional splice variants, encoding different isoforms, have been characterized and expression of the splice variants is regulated by IL2 activation of the cells.
27146	
27147	
27148	
27149	
27150	
27151	
27152	
27153	
27154	
27155	
27156	
27157	
27158	
27159	
27160	
27161	
27162	
27163	
27164	
27165	This gene encodes a CCCH-type zinc finger protein that is thought to prevent infection by retroviruses. Studies of the rat homolog indicate that the protein may primarily function to inhibit viral gene expression and induce an innate immunity to viral infection. Alternative splicing occurs at this locus and two variants, each encoding distinct isoforms, are described.
27166	
27167	
27168	
27169	
27170	
27171	
27172	
27173	
27174	
27175	
27176	
27177	
27178	
27179	
27180	In some non-lymphoid tissues, the unrearranged T cell receptor gamma (TRG@) locus is expressed. The resulting transcript contains a subset of the TRG@ gene segments and is shorter than TRG@ transcripts expressed in lymphoid tissues. This RefSeq record represents the unrearranged TRG@ locus transcript; the complete TRG@ locus is represented by the genomic RefSeq NG_001336. The transcript represented by this RefSeq has two open reading frames (ORFs) that encode different proteins. The downstream ORF is in the same frame as TRG@ and its protein product is similar to TRG@ proteins. The upstream ORF uses a different reading frame and encodes a novel protein.
27181	
27182	
27183	
27184	
27185	
27186	This gene encodes a member of the oxysterol-binding protein (OSBP) family, a group of intracellular lipid receptors. Most members contain an N-terminal pleckstrin homology domain and a highly conserved C-terminal OSBP-like sterol-binding domain, although some members contain only the sterol-binding domain. Transcript variants derived from alternative promoter usage and/or alternative splicing exist; they encode different isoforms.
27187	
27188	
27189	
27190	
27191	
27192	
27193	
27194	
27195	
27196	
27197	
27198	
27199	The protein encoded by this gene is an isozyme of the long-chain fatty-acid-coenzyme A ligase family. Although differing in substrate specificity, subcellular localization, and tissue distribution, all isozymes of this family convert free long-chain fatty acids into fatty acyl-CoA esters, and thereby play a key role in lipid biosynthesis and fatty acid degradation.
27200	
27201	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
27202	Isocitrate dehydrogenases catalyze the oxidative decarboxylation of isocitrate to 2-oxoglutarate. These enzymes belong to two distinct subclasses, one of which utilizes NAD(+) as the electron acceptor and the other NADP(+). Five isocitrate dehydrogenases have been reported: three NAD(+)-dependent isocitrate dehydrogenases, which localize to the mitochondrial matrix, and two NADP(+)-dependent isocitrate dehydrogenases, one of which is mitochondrial and the other predominantly cytosolic. NAD(+)-dependent isocitrate dehydrogenases catalyze the allosterically regulated rate-limiting step of the tricarboxylic acid cycle. Each isozyme is a heterotetramer that is composed of two alpha subunits, one beta subunit, and one gamma subunit. The protein encoded by this gene is the alpha subunit of one isozyme of NAD(+)-dependent isocitrate dehydrogenase.
27203	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. Two alternatively spliced transcript variants of this gene have been reported, one of which encodes a receptor-type PTP that possesses a short extracellular domain, a single transmembrane region, and two tandem intracytoplasmic catalytic domains; Another one encodes a PTP that contains a distinct hydrophilic N-terminus, and thus represents a nonreceptor-type isoform of this PTP. Studies of the similar gene in mice suggested the regulatory roles of this PTP in RAS related signal transduction pathways, cytokines induced SATA signaling, as well as the activation of voltage-gated K+ channels.
27204	
27205	
27206	
27207	
27208	
27209	
27210	
27211	This gene may belong to a protease family responsible for intercellular peptide signalling.  Though its role in the cellular processing of insulin has not yet been defined, insulin-degrading enzyme is thought to be involved in the termination of the insulin response.
27212	NRP1 is a membrane-bound coreceptor to a tyrosine kinase receptor for both vascular endothelial growth factor (VEGF; MIM 192240) and semaphorin (see SEMA3A; MIM 603961) family members. NRP1 plays versatile roles in angiogenesis, axon guidance, cell survival, migration, and invasion.[supplied by OMIM]
27213	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes a subunit of TFIID present in a subset of TFIID complexes. Translocations involving chromosome 17 and chromosome 9, where the gene for the nuclear receptor CSMF is located, result in a gene fusion product that is an RNA binding protein associated with a subset of extraskeletal myxoid chondrosarcomas. Two transcripts encoding different isoforms have been identified.
27214	
27215	
27216	
27217	
27218	Protocadherins form a subfamily of calcium-dependent cell-cell adhesion molecules in the cadherin superfamily. Protocadherin-10 belongs to a subclass of protocadherins that share a highly conserved 17-amino acid cytoplasmic motif (Wolverton and Lalande, 2001).[supplied by OMIM]
27219	
27220	
27221	
27222	
27223	
27224	
27225	
27226	
27227	
27228	
27229	
27230	
27231	
27232	
27233	
27234	
27235	
27236	This gene encodes a trafficking membrane protein which transports proteins among the medial- and trans-Golgi compartments. Due to its chromosomal location and trafficking function, this gene may be involved in familial essential hypertension. Three transcript variants encoding three different isoforms have been found for this gene.
27237	
27238	This gene belongs to a group of vacuolar protein sorting (VPS) genes that, when functionally impaired, disrupt the efficient delivery of vacuolar hydrolases. The protein encoded by this gene is a component of a large multimeric complex, termed the retromer complex, which is involved in retrograde transport of proteins from endosomes to the trans-Golgi network. This VPS protein may be involved in the formation of the inner shell of the retromer coat for retrograde vesicles leaving the prevacuolar compartment. Alternative splice variants encoding different isoforms, and usage of multiple polyadenylation sites have been found for this gene.
27239	
27240	
27241	
27242	
27243	
27244	
27245	
27246	Protein kinase B (PKB) is a major downstream target of receptor tyrosine kinases that signal via phosphatidylinositol 3-kinase. Upon cell stimulation, PKB is translocated to the plasma membrane, where it is phosphorylated in the C-terminal regulatory domain. The protein encoded by this gene negatively regulates PKB activity by inhibiting phosphorylation. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of these variants has not been determined.
27247	
27248	
27249	
27250	
27251	
27252	This gene encodes a protein containing zinc-finger-like repeats and a calpain-like protease domain. The encoded protein may function as a transcription factor, RNA-binding protein, or in protein-protein interactions during visual system development.
27253	
27254	
27255	
27256	
27257	
27258	
27259	
27260	
27261	
27262	
27263	
27264	
27265	Copper is an element essential for life, but excessive copper can be toxic or even lethal to the cell. Therefore, cells have developed sophisticated ways to maintain a critical copper balance, with the intake, export, and intracellular compartmentalization or buffering of copper strictly regulated. The 2 related genes ATP7A (MIM 300011) and ATP7B (MIM 606882), responsible for the human diseases Menkes syndrome (MIM 309400) and Wilson disease (MIM 277900), respectively, are involved in copper export. In S. cerevisiae, the copper uptake genes CTR1, CTR2, and CTR3 have been identified, and in human the CTR1 and CTR2 (MIM 603088) genes have been identified.[supplied by OMIM]
27266	
27267	
27268	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This protein contains a C-terminal PTP domain and an N-terminal domain homologous to the band 4.1 superfamily of cytoskeletal-associated proteins. P97, a cell cycle regulator involved in a variety of membrane related functions, has been shown to be a substrate of this PTP. This PTP was also found to interact with, and be regulated by adaptor protein 14-3-3 beta.
27269	The ITGB3 protein product is the integrin beta chain beta 3. Integrins are integral cell-surface proteins composed of an alpha chain and a beta chain. A given chain may combine with multiple partners resulting in different integrins. Integrin beta 3 is found along with the alpha IIb chain in platelets. Integrins are known to participate in cell adhesion as well as cell-surface mediated signalling.
27270	
27271	
27272	
27273	
27274	
27275	This gene encodes the ligand of the tyrosine-kinase receptor encoded by the KIT locus. This ligand is a pleiotropic factor that acts in utero in germ cell and neural cell development, and hematopoiesis, all believed to reflect a role in cell migration. In adults, it functions pleiotropically, while mostly noted for its continued requirement in hematopoiesis. Two transcript variants encoding different isoforms have been found for this gene.
27276	
27277	
27278	
27279	The protein encoded by this gene shares sequence similarity with Hir1p and Hir2p, the two corepressors of histone gene transcription characterized in the yeast, Saccharomyces cerevisiae. The structural features of this protein suggest that it may function as part of a multiprotein complex. Several cDNAs encoding interacting proteins, HIRIPs, have been identified. HIRIP4 was isolated by virtue of its interaction with this protein; however, its exact function is not known. The sequence of HIRIP4 protein is highly homologous to the human DNJ3/CPR3, mouse Dj3 and rat Dj2 gene products.
27280	
27281	The heterotetrameric adaptor protein (AP) complexes sort integral membrane proteins at various stages of the endocytic and secretory pathways. AP4 is composed of 2 large chains, beta-4 (AP4B1; MIM 607245) and epsilon-4 (AP4E1), a medium chain, mu-4 (AP4M1; MIM 602296), and a small chain, sigma-4 (AP4S1; MIM 607243).[supplied by OMIM]
27282	
27283	
27284	
27285	
27286	
27287	
27288	
27289	The protein encoded by this gene is a member of the SWI/SNF family of proteins, whose members have helicase and ATPase activities and are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI, which is required for transcriptional activation of genes normally repressed by chromatin. It possesses at least two conserved domains that could be important for its function. First, it has a DNA-binding domain which can specifically bind an AT-rich DNA sequence known to be recognized by a SNF/SWI complex at the beta-globin locus. Second, the C-terminus of the protein can stimulate glucocorticoid receptor-dependent transcriptional activation. It is thought that the protein encoded by this gene confers specificity to the SNF/SWI complex and may recruit the complex to its targets through either protein-DNA or protein-protein interactions. Three transcript variants encoding different isoforms have been found for this gene.
27290	
27291	
27292	
27293	
27294	The protein encoded by this gene binds to Bruton@#$%&s tyrosine kinase (BTK) and downregulates BTK@#$%&s kinase activity. In addition, the encoded protein disrupts BTK-mediated calcium mobilization and negatively regulates the activation of nuclear factor-kappa-B-driven transcription.
27295	
27296	
27297	
27298	
27299	
27300	
27301	
27302	
27303	
27304	
27305	
27306	
27307	
27308	
27309	
27310	
27311	
27312	
27313	
27314	
27315	
27316	
27317	
27318	
27319	
27320	
27321	
27322	
27323	
27324	
27325	
27326	This gene belongs to the forkhead family of transcription factors which are characterized by a distinct forkhead domain.  The specific function of this gene has not yet been determined; however, it may play a role in myogenic growth and differentiation. Translocation of this gene with PAX3 has been associated with alveolar rhabdomyosarcoma.
27327	
27328	This gene encodes a member of the RRM family of RNA-binding proteins. The RRM domain is between 80-100 amino acids in length and family members contain one to four copies of the domain. The RRM domain consists of two short stretches of conserved sequence called RNP1 and RNP2, as well as a few highly conserved hydrophobic residues. The protein encoded by this gene has a single, putative RRM domain in its N-terminus. Alternative splicing results in multiple transcript variants encoding different isoforms.
27329	
27330	
27331	
27332	
27333	
27334	
27335	
27336	
27337	
27338	
27339	
27340	
27341	
27342	
27343	
27344	
27345	
27346	
27347	
27348	
27349	
27350	
27351	
27352	This gene encodes a protein that is a member of the Shank family of synaptic proteins that may function as molecular scaffolds in the postsynaptic density (PSD). Shank proteins contain multiple domains for protein-protein interaction, including ankyrin repeats, an SH3 domain, a PSD-95/Dlg/ZO-1 domain, a sterile alpha motif domain, and a proline-rich region. This particular family member contains a PDZ domain, a consensus sequence for cortactin SH3 domain-binding peptides and a sterile alpha motif. The alternative splicing demonstrated in Shank genes has been suggested as a mechanism for regulating the molecular structure of Shank and the spectrum of Shank-interacting proteins in the PSDs of adult and developing brain. Two alternative splice variants, encoding distinct isoforms, are reported. Additional splice variants exist but their full-length nature has not been determined.
27353	
27354	
27355	
27356	The protein encoded by this gene is a member of the BAG1-related protein family. BAG1 is an anti-apoptotic protein that functions through interactions with a variety of cell apoptosis and growth related proteins including BCL-2, Raf-protein kinase, steroid hormone receptors, growth factor receptors and members of the heat shock protein 70 kDa family. This protein contains a BAG domain near the C-terminus, which could bind and inhibit the chaperone activity of Hsc70/Hsp70. This protein was found to be associated with the death domain of tumor necrosis factor receptor type 1 (TNF-R1) and death receptor-3 (DR3), and thereby negatively regulates downstream cell death signaling. The regulatory role of this protein in cell death was demonstrated in epithelial cells which undergo apoptosis while integrin mediated matrix contacts are lost.
27357	
27358	
27359	
27360	
27361	
27362	
27363	
27364	
27365	
27366	
27367	The protein encoded by this gene is a member of the acyl coenzyme family. It hydrolyzes the CoA thioester of palmitoyl-CoA and other long-chain fatty acids. Alternative splicing of this gene results in multiple transcript variants.
27368	
27369	
27370	
27371	
27372	
27373	
27374	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the MRP subfamily which is involved in multi-drug resistance. This protein functions as a chloride channel and controls the regulation of other transport pathways. Mutations in this gene have been observed in patients with the autosomal recessive disorders cystic fibrosis (CF) and congenital bilateral aplasia of the vas deferens (CBAVD). Alternative splice variants have been described, many of which result from mutations in the CFTR gene.
27375	
27376	
27377	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members bind heparin and possess broad mitogenic and angiogenic activities. This protein has been implicated in diverse biological processes, such as limb and nervous system development, wound healing, and tumor growth. The mRNA for this gene contains multiple polyadenylation sites, and is alternatively translated from AUG and non-AUG (CUG) initiation codons resulting in five different isoforms with distinct properties. The CUG-initiated isoforms are localized in the nucleus and are responsible for the intracrine effect, whereas, the AUG-initiated form is mostly cytosolic and is responsible for the paracrine and autocrine effects of this FGF.
27378	
27379	
27380	
27381	
27382	
27383	
27384	
27385	
27386	
27387	
27388	The protein encoded by this gene is a member of the membrane-associated guanylate kinase homologue (MAGUK) family. Characterized by two WW domains, a guanylate kinase domain, and five PDZ domains, this protein interacts with the cytoplasmic region of BAI1. Together, these proteins may play a role in cell adhesion and signal transduction.
27389	
27390	
27391	
27392	
27393	
27394	
27395	
27396	
27397	
27398	
27399	The protein encoded by this gene catalyzes the hydrolysis of cAMP and cGMP to their corresponding monophosphates. The encoded protein plays a role in signal transduction by regulating the intracellular concentration of these cyclic nucleotides. Multiple transcript variants encoding several different isoforms have been found for this gene.
27400	
27401	
27402	
27403	
27404	
27405	
27406	
27407	
27408	
27409	
27410	
27411	
27412	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H3 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3.
27413	
27414	
27415	
27416	
27417	The protein encoded by this gene is involved in the sodium-independent transport and excretion of organic anions, some of which are potentially toxic. The encoded protein is an integral membrane protein and is found mainly in the kidney and in the placenta, where it may act to prevent potentially harmful organic anions from reaching the fetus.
27418	
27419	
27420	
27421	
27422	
27423	
27424	
27425	
27426	
27427	
27428	
27429	
27430	
27431	
27432	
27433	
27434	
27435	Naturally occurring antagonist for both ANGPT1 and TIE2; expressed only at the sites of vascular remodeling; similar to angiopoietin-1
27436	
27437	
27438	
27439	
27440	The protein encoded by this gene is a cytokine receptor that belongs to the interleukin 1 receptor family. This protein is a receptor for interleukin alpha (IL1A), interleukin beta (IL1B), and interleukin 1 receptor, type I(IL1R1/IL1RA). It is an important mediator involved in many cytokine induced immune and inflammatory responses. This gene along with interleukin 1 receptor, type II (IL1R2), interleukin 1 receptor-like 2 (IL1RL2), and interleukin 1 receptor-like 1 (IL1RL1) form a cytokine receptor gene cluster in a region mapped to chromosome 2q12.
27441	
27442	This gene encodes a protein that is similar to a serine/threonine kinase in C. elegans which is involved in axonal elongation. The structure of this protein is similar to the C. elegans protein in that both proteins have an N-terminal kinase domain, a central proline/serine rich (PS) domain, and a C-terminal (C) domain. The gene is located within the Smith-Magenis syndrome region on chromosome 17.
27443	
27444	
27445	
27446	
27447	
27448	
27449	
27450	
27451	
27452	
27453	
27454	
27455	
27456	
27457	
27458	
27459	
27460	This gene encodes a nuclear protein that has three zinc fingers at the end of its C-terminal domain, a serine/threonine-rich central region, and an acidic domain lying within the N-terminal region. The zinc fingers of this protein are responsible for the specific DNA binding with the guanine-rich core promoter elements. The central region might be involved in activation or posttranslational regulatory pathways, and the acidic N-terminal domain might play an important role in the process of transcriptional activation. It is capable of activating transcription approximately 4-fold either on homologous or heterologous promoters. The DNA binding and transcriptional activity of this protein, in conjunction with its expression pattern, suggests that this protein may participate in the regulation and/or maintenance of the basal expression of pregnancy-specific glycoprotein genes and possibly other TATA box-less genes. Two transcript variants encoding the same protein have been found for this gene.
27461	Expression of this gene is thought to serve as a marker for differentiated vascular smooth muscle cells which may have a role in regulating growth and differentiation of this cell type. The encoded protein is highly similar to the corresponding rat and mouse proteins. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of only one variant has been defined.
27462	
27463	
27464	
27465	
27466	The protein encoded by this gene is known to interact with cellular and viral survival-promoting proteins, such as BCL2 and the Epstein-Barr virus in order to enhance programed cell death.  Because its activity is suppressed in the presence of survival-promoting proteins, this protein is suggested as a likely target for antiapoptotic proteins.  This protein shares a critical BH3 domain with other death-promoting proteins, BAX and BAK.
27467	
27468	
27469	This gene belongs to the homeobox family of genes. The homeobox genes encode a highly conserved family of transcription factors that play an important role in morphogenesis in all multicellular organisms. Mammals possess four similar homeobox gene clusters, HOXA, HOXB, HOXC and HOXD, located on different chromosomes, consisting of 9 to 11 genes arranged in tandem. This gene is one of several homeobox HOXB genes located in a cluster on chromosome 17. The exact role of this gene has yet to be determined.
27470	
27471	
27472	
27473	
27474	
27475	
27476	
27477	
27478	
27479	
27480	
27481	
27482	
27483	
27484	Members of the nuclear hormone receptor family, such as NR2C2, act as ligand-activated transcription factors. The proteins have an N-terminal transactivation domain, a central DNA-binding domain with 2 zinc fingers, and a ligand-binding domain at the C terminus. The activated receptor/ligand complex is translocated to the nucleus where it binds to hormone response elements of target genes (Yoshikawa et al., 1996).[supplied by OMIM]
27485	
27486	
27487	
27488	
27489	The vitamin A metabolite, all-trans retinoic acid (atRA), plays an important role in neuronal development, including neurite outgrowth. RAINB1 is an atRA-responsive gene.[supplied by OMIM]
27490	
27491	This gene encodes a cytoplasmic protein tyrosine kinase which is found concentrated in the focal adhesions that form between cells growing in the presence of extracellular matrix constituents. The encoded protein is a member of the FAK subfamily of protein tyrosine kinases but lacks significant sequence similarity to kinases from other subfamilies. Activation of this gene may be an important early step in cell growth and intracellular signal transduction pathways triggered in response to certain neural peptides or to cell interactions with the extracellular matrix. At least four transcript variants encoding four different isoforms have been found for this gene, but the full-length natures of only two of them have been determined.
27492	
27493	
27494	This gene encodes a multifunctional protein. It has two enzymatically active domains with catalytic activities - peptidylglycine alpha-hydroxylating monooxygenase (PHM) and peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL). These catalytic domains work sequentially to catalyze neuroendocrine peptides to active alpha-amidated products. Multiple alternatively spliced transcript variants encoding different isoforms have been described for this gene but some of their full length sequences are not yet known.
27495	
27496	
27497	
27498	
27499	
27500	
27501	
27502	
27503	
27504	The protein encoded by this gene is a type I transmembrane glycoprotein which shows homology to the pMEL17 precursor, a melanocyte-specific protein. GPNMB shows expression in the lowly metastatic human melanoma cell lines and xenografts but does not show expression in the highly metastatic cell lines. GPNMB may be involved in growth delay and reduction of metastatic potential. Two transcript variants encoding different isoforms have been found for this gene.
27505	
27506	
27507	
27508	
27509	
27510	
27511	
27512	
27513	
27514	MYH7 encodes the cardiac muscle beta (or slow) isoform of myosin. Changes in the relative abundance of MYH7 and MYH6 (the alpha, or fast, isoform of cardiac myosin heavy chain) correlate with the contractile velocity of cardiac muscle. Mutations in MYH7 are associated with familial hypertrophic cardiomyopathy.
27515	
27516	
27517	
27518	
27519	
27520	
27521	
27522	This gene encodes dystrobrevin beta, a component of the dystrophin-associated protein complex (DPC). The DPC consists of dystrophin and several integral and peripheral membrane proteins, including dystroglycans, sarcoglycans, syntrophins and dystrobrevin alpha and beta. The DPC localizes to the sarcolemma and its disruption is associated with various forms of muscular dystrophy. Dystrobrevin beta is thought to interact with syntrophin and the DP71 short form of dystrophin. Alternatively spliced transcript variants encoding different isoforms have been identified.
27523	
27524	
27525	
27526	
27527	
27528	The product of this gene is a membrane-associated protein that colocalizes with huntingtin. This protein has similarities to cytoskeleton proteins and its interaction with huntingtin is thought to play a functional role in the cell filament network. Loss of normal huntingtin-HIP1 interaction in Huntington disease may contribute to a defect in membrane-cytoskeletal integrity in the brain. This gene could help in the understanding of the normal function of huntingtin and also the pathogenesis of Huntington disease. It also has been implicated in the pathogenesis of hematopoietic malignancies. An alternative splice variant of this gene has been described but its full length sequence has not been determined.
27529	Carbonyl reductase 3 catalyzes the reduction of a large number of biologically and pharmacologically active carbonyl compounds to their corresponding alcohols.  The enzyme is classified as a monomeric NADPH-dependent oxidoreductase.  CBR3 contains three exons spanning 11.2 kilobases and is closely linked to another carbonyl reductase gene - CBR1.
27530	
27531	
27532	
27533	
27534	
27535	
27536	
27537	This gene encodes a member of the SFRP family that contains a cysteine-rich domain homologous to the putative Wnt-binding site of Frizzled proteins. SFRPs act as soluble modulators of Wnt signaling. Methylation of this gene is a potential marker for the presence of colorectal cancer.
27538	
27539	
27540	
27541	
27542	
27543	
27544	
27545	
27546	
27547	
27548	
27549	
27550	
27551	
27552	
27553	
27554	
27555	RAS (MIM 190020) GTPases cycle between an inactive GDP-bound state and an active GTP-bound state. Guanine-nucleotide exchange factors (GEFs), such as RASGRFs, stimulate the conversion of the GDP-bound form into the active form.[supplied by OMIM]
27556	
27557	
27558	
27559	
27560	
27561	v-CRK avian sarcoma virus CT10-homolog-like contains one SH2 domain and two SH3 domains.  CRKL has been shown to activate the RAS and JUN kinase signaling pathways and transform fibroblasts in a RAS-dependent fashion.  It is a substrate of the BCR-ABL tyrosine kinase and plays a role in fibroblast transformation by BCR-ABL.  In addition, CRKL has oncogenic potential.
27562	
27563	
27564	
27565	
27566	
27567	
27568	Adenylate cyclase is a membrane bound enzyme that catalyses the formation of cyclic AMP from ATP. It is regulated by a family of G protein-coupled receptors, protein kinases, and calcium. The type 9 adenylyl cyclase is a widely distributed adenylyl cyclase, and it is stimulated by beta-adrenergic receptor activation but is insensitive to forskolin, calcium, and somatostatin.
27569	
27570	
27571	
27572	
27573	
27574	
27575	
27576	
27577	
27578	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that contains a high level of sequence similarity with ribosomal protein S11P family members. A pseudogene corresponding to this gene is found on chromosome 20. Sequence analysis identified two transcript variants that encode different protein isoforms.
27579	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role in cells. The protein encoded by this gene is one of the PKC family members. This protein kinase has been reported to be involved in many different cellular functions, such as B cell activation, apoptosis induction, endothelial cell proliferation, and intestinal sugar absorption. Studies in mice also suggest that this kinase may also regulate neuronal functions and correlate fear-induced conflict behavior after stress. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
27580	AKT2 is a putative oncogene encoding a protein belonging to a subfamily of serine/threonine kinases containing SH2-like (Src homology 2-like) domains. Furthermore, AKT2 was shown to be amplified and overexpressed in 2 of 8 ovarian carcinoma cell lines and 2 of 15 primary ovarian tumors. Overexpression of AKT2 contributes to the malignant phenotype of a subset of human ductal pancreatic cancers. AKT2 is a general protein kinase capable of phophorylating several known proteins.
27581	
27582	
27583	
27584	
27585	
27586	
27587	
27588	This gene encodes a member of the DEAD box protein family. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The protein encoded by this gene shows ATPase activity in the presence of polynucleotides and associates with nucleoplasmic 65S preribosomal particles. This gene may be involved in ribosome synthesis, most likely during assembly of the large 60S ribosomal subunit.
27589	
27590	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
27591	
27592	
27593	
27594	
27595	In cardiac myocytes, Ca(2+) concentrations alternate between high levels during contraction and low levels during relaxation. The increase in Ca(2+) concentration during contraction is primarily due to release of Ca(2+) from intracellular stores. However, some Ca(2+) also enters the cell through the sarcolemma (plasma membrane). During relaxation, Ca(2+) is sequestered within the intracellular stores. To prevent overloading of intracellular stores, the Ca(2+) that entered across the sarcolemma must be extruded from the cell. The Na(+)-Ca(2+) exchanger is the primary mechanism by which the Ca(2+) is extruded from the cell during relaxation. In the heart, the exchanger may play a key role in digitalis action. The exchanger is the dominant mechanism in returning the cardiac myocyte to its resting state following excitation.[supplied by OMIM]
27596	
27597	
27598	
27599	
27600	
27601	
27602	This gene encodes a subunit of a kainate glutamate receptor. Glutamate receptors mediate the majority of excitatory neurotransmission in the brain. This receptor may have a role in synaptic plasticity and may be important for learning and memory. It also may be involved in the transmission of light information from the retina to the hypothalamus. The structure and function of the encoded protein is changed by RNA editing. Alternatively spliced transcript variants encoding distinct isoforms have been described for this gene.
27603	
27604	
27605	
27606	
27607	
27608	
27609	This gene encodes the cytosolic form of the enzyme branched-chain amino acid transaminase. This enzyme catalyzes the reversible transamination of branched-chain alpha-keto acids to branched-chain L-amino acids essential for cell growth. Two different clinical disorders have been attributed to a defect of branched-chain amino acid transamination: hypervalinemia and hyperleucine-isoleucinemia. As there is also a gene encoding a mitochondrial form of this enzyme, mutations in either gene may contribute to these disorders.
27610	This gene encodes a member of a LIM protein subfamily which is characterized by a LIM motif and a domain of Src homology region 3.  This protein functions as an actin-binding protein and possibly in cytoskeletal organization.
27611	
27612	
27613	
27614	
27615	
27616	
27617	
27618	
27619	The protein encoded by this gene, a member of the peptidase C1 family, is a lysosomal cysteine proteinase that may play an important role in corneal physiology. This gene is expressed in colorectal and breast carcinomas but not in normal colon, mammary gland, or peritumoral tissues, suggesting a possible role for this gene in tumor processes.
27620	
27621	
27622	
27623	HLA-B belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exon 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-B alleles have been described
27624	
27625	
27626	
27627	
27628	
27629	
27630	
27631	
27632	
27633	
27634	
27635	
27636	Isocitrate dehydrogenases catalyze the oxidative decarboxylation of isocitrate to 2-oxoglutarate. These enzymes belong to two distinct subclasses, one of which utilizes NAD(+) as the electron acceptor and the other NADP(+). Five isocitrate dehydrogenases have been reported: three NAD(+)-dependent isocitrate dehydrogenases, which localize to the mitochondrial matrix, and two NADP(+)-dependent isocitrate dehydrogenases, one of which is mitochondrial and the other predominantly cytosolic. NAD(+)-dependent isocitrate dehydrogenases catalyze the allosterically regulated rate-limiting step of the tricarboxylic acid cycle. Each isozyme is a heterotetramer that is composed of two alpha subunits, one beta subunit, and one gamma subunit. The protein encoded by this gene is the beta subunit of one isozyme of NAD(+)-dependent isocitrate dehydrogenase. Three alternatively spliced transcript variants encoding different isoforms have been described for this gene.
27637	
27638	
27639	
27640	
27641	
27642	
27643	
27644	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. The protein localizes to the cytoplasm. The presence of potential DNA-binding and dimerization-transactivation domains suggests that this protein may act as a transcription factor, similar to several other members of the TRIM family. Expression of the gene is upregulated in response to estrogen, and it is thought to mediate estrogen actions in breast cancer as a primary response gene.
27645	
27646	
27647	
27648	
27649	
27650	
27651	
27652	
27653	
27654	
27655	
27656	
27657	
27658	
27659	
27660	
27661	
27662	
27663	
27664	
27665	
27666	
27667	
27668	
27669	
27670	NFKB1 (MIM 164011) or NFKB2 (MIM 164012) is bound to REL (MIM 164910), RELA, or RELB (MIM 604758) to form the NFKB complex. The p50 (NFKB1)/p65 (RELA) heterodimer is the most abundant form of NFKB. The NFKB complex is inhibited by I-kappa-B proteins (NFKBIA, MIM 164008 or NFKBIB, MIM 604495), which inactivate NFKB by trapping it in the cytoplasm. Phosphorylation of serine residues on the I-kappa-B proteins by kinases (IKBKA, MIM 600664, or IKBKB, MIM 603258) marks them for destruction via the ubiquitination pathway, thereby allowing activation of the NFKB complex. Activated NFKB complex translocates into the nucleus and binds DNA at kappa-B-binding motifs such as 5-prime GGGRNNYYCC 3-prime or 5-prime HGGARNYYCC 3-prime (where H is A, C, or T; R is an A or G purine; and Y is a C or T pyrimidine).[supplied by OMIM]
27671	
27672	
27673	
27674	
27675	
27676	This gene encodes a member of the trypsin family of serine proteases. Its specific function is unknown.
27677	
27678	
27679	
27680	
27681	
27682	
27683	
27684	
27685	
27686	
27687	
27688	This gene encodes a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, namely a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region.
27689	
27690	
27691	
27692	
27693	
27694	
27695	
27696	
27697	
27698	
27699	
27700	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role in cells. The protein encoded by this gene is one of the PKC family members. It is a calcium-independent and phospholipids-dependent protein kinase. It is predominantly expressed in epithelial tissues and has been shown to reside specifically in the cell nucleus. This protein kinase can regulate keratinocyte differentiation by activating the MAP kinase MAPK13 (p38delta)-activated protein kinase cascade that targets CCAAT/enhancer-binding protein alpha (CEBPA). It is also found to mediate the transcription activation of the transglutaminase 1 (TGM1) gene.
27701	
27702	This gene includes two alternatively spliced transcript variants, which encode different isoforms. The function of this gene has not been determined.
27703	
27704	Zinc finger proteins (ZNFs) bind DNA and, through this binding, regulate gene transcription. Most ZNFs contain conserved C2H2 motifs and are classified as Kruppel-type zinc fingers. A conserved protein motif, termed the Kruppel-associated box (KRAB) domain, mediates protein-protein interactions. For further description of these proteins, see ZNF91 (MIM 603971).[supplied by OMIM]
27705	
27706	
27707	The giant protein titin, together with its associated proteins, interconnects the major structure of sarcomeres, the M bands and Z discs. The C-terminal end of the titin string extends into the M line, where it binds tightly to M-band constituents of apparent molecular masses of 190 kD (myomesin 1) and 165 kD (myomesin 2). This protein, myomesin 1, like myomesin 2, titin, and other myofibrillar proteins contains structural modules with strong homology to either fibronectin type III (motif I) or immunoglobulin C2 (motif II) domains. Myomesin 1 and myomesin 2 each have a unique N-terminal region followed by 12 modules of motif I or motif II, in the arrangement II-II-I-I-I-I-I-II-II-II-II-II. The two proteins share 50% sequence identity in this repeat-containing region. The head structure formed by these 2 proteins on one end of the titin string extends into the center of the M band. The integrating structure of the sarcomere arises from muscle-specific members of the superfamily of immunoglobulin-like proteins.
27708	
27709	
27710	
27711	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of Na+/K+ -ATPases. Na+/K+ -ATPase is an integral membrane protein responsible for establishing and maintaining the electrochemical gradients of Na and K ions across the plasma membrane. These gradients are essential for osmoregulation, for sodium-coupled transport of a variety of organic and inorganic molecules, and for electrical excitability of nerve and muscle. This enzyme is composed of two subunits, a large catalytic subunit (alpha) and a smaller glycoprotein subunit (beta). The catalytic subunit of Na+/K+ -ATPase is encoded by multiple genes. This gene encodes an alpha 1 subunit. Alternatively spliced transcript variants encoding different isoforms have been identified.
27712	
27713	
27714	
27715	
27716	This protein belongs to the basic helix-loop-helix family of transcription factors. It is a transcriptional repressor of genes that require a bHLH protein for their transcription. The protein has a particular type of basic domain that contains a helix interrupting protein that binds to the N-box rather than the canonical E-box.
27717	Gastrin-releasing peptide (GRP) regulates numerous functions of the gastrointestinal and central nervous systems, including release of gastrointestinal hormones, smooth muscle cell contraction, and epithelial cell proliferation and is a potent mitogen for neoplastic tissues. The effects of GRP are mediated through the gastrin-releasing peptide receptor. This receptor is a glycosylated, 7-transmembrane G-protein coupled receptor that activates the phospholipase C signaling pathway. The receptor is aberrantly expressed in numerous cancers such as those of the lung, colon, and prostate. An individual with autism and multiple exostoses was found to have a balanced translocation between chromosome 8 and a chromosome X breakpoint located within the gastrin-releasing peptide receptor gene.
27718	This gene, a member of the histidine triad gene family, encodes a diadenosine 5@#$%&,5@#$%&@#$%&@#$%&-P1,P3-triphosphate hydrolase involved in purine metabolism. The gene encompasses the common fragile site FRA3B on chromosome 3, where carcinogen-induced damage can lead to translocations and aberrant transcripts of this gene. In fact, aberrant transcripts from this gene have been found in about half of all esophageal, stomach, and colon carcinomas.
27719	
27720	The protein encoded by this gene is a membrane-bound arginine/lysine carboxypeptidase. Its expression is associated with monocyte to macrophage differentiation. This encoded protein contains hydrophobic regions at the amino and carboxy termini and has 6 potential asparagine-linked glycosylation sites. The active site residues of carboxypeptidases A and B are conserved in this protein. Three alternatively spliced transcript variants encoding the same protein have been described for this gene.
27721	
27722	Junctional complexes between the plasma membrane and endoplasmic/sarcoplasmic reticulum are a common feature of all excitable cell types and mediate cross talk between cell surface and intracellular ion channels. The protein encoded by this gene is a component of junctional complexes and is composed of a C-terminal hydrophobic segment spanning the endoplasmic/sarcoplasmic reticulum membrane and a remaining cytoplasmic domain that shows specific affinity for the plasma membrane. CAG/CTG repeat expansions at the Huntington@#$%&s disease (HD)-like 2 locus have been identified in this gene, which is a member of the junctophilin gene family.
27723	
27724	
27725	
27726	
27727	
27728	
27729	This gene belongs to subfamily P of the forkhead box (FOX) transcription factor family. Forkhead box transcription factors play important roles in the regulation of tissue- and cell type-specific gene transcription during both development and adulthood. Many members of the forkhead box gene family, including members of subfamily P, have roles in mammalian oncogenesis. This gene may play a role in the development of tumors of the kidney and larynx. Alternative splicing of this gene produces multiple transcript variants, some encoding different isoforms.
27730	
27731	
27732	
27733	Breakdown products of phosphoinositides are ubiquitous second messengers that function downstream of many G protein-coupled receptors and tyrosine kinases regulating cell growth, calcium metabolism, and protein kinase C activity. This gene encodes an enzyme which regulates the amount of phosphatidylinositol available for signaling by catalyzing the conversion of phosphatidic acid to CDP-diacylglycerol. This enzyme is an integral membrane protein localized to two subcellular domains, the matrix side of the inner mitochondrial membrane where it is thought to be involved in the synthesis of phosphatidylglycerol and cardiolipin and the cytoplasmic side of the endoplasmic reticulum where it functions in phosphatidylinositol biosynthesis. Two genes encoding this enzyme have been identified in humans, one mapping to human chromosome 4q21 and a second to 20p13.
27734	
27735	
27736	
27737	
27738	
27739	This gene encodes a member of the kinesin family of proteins. Members of this family are part of a multisubunit complex that functions as a microtubule motor in intracellular organelle transport. Mutations in this gene cause autosomal dominant spastic paraplegia 10.
27740	X-linked mental retardation associated with marXq28, or fragile X syndrome, is characterized by moderate to severe mental retardation, macroorchidism, large ears, prominent jaw, and high-pitched jocular speech. Expression is variable, with mental retardation being the most common feature. This phenotype is associated with mutations in the FMR1 gene. McCabe et al. (1999) [PubMed 10398250] summarized the proceedings of a workshop on the fragile X syndrome held in December 1998.[supplied by OMIM]
27741	
27742	
27743	
27744	
27745	
27746	
27747	
27748	This gene is deleted in the primary tumor of hepatocellular carcinoma. It maps to 8p22-p21.3, a region frequently deleted in solid tumors. It is suggested that this gene is a candidate tumor suppressor gene for human liver cancer, as well as for prostate, lung, colorectal, and breast cancers. Alternative splicing at this locus results in several transcript variants encoding different isoforms.
27749	The protein encoded by this gene is a subunit of the CCR4-NOT core transcriptional regulation complex. The encoded protein has a 3@#$%&-5@#$%& RNase activity and prefers polyadenylated substates.
27750	
27751	
27752	
27753	
27754	
27755	
27756	
27757	
27758	
27759	
27760	
27761	
27762	
27763	
27764	
27765	
27766	
27767	
27768	
27769	
27770	
27771	
27772	
27773	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3.
27774	
27775	This gene encodes class I alcohol dehydrogenase, gamma subunit, which is a member of the alcohol dehydrogenase family. Members of this enzyme family metabolize a wide variety of substrates, including ethanol, retinol, other aliphatic alcohols, hydroxysteroids, and lipid peroxidation products. Class I alcohol dehydrogenase, consisting of several homo- and heterodimers of alpha, beta, and gamma subunits, exhibits high activity for ethanol oxidation and plays a major role in ethanol catabolism. Three genes encoding alpha, beta and gamma subunits are tandemly organized in a genomic segment as a gene cluster.
27776	
27777	
27778	
27779	
27780	The protein encoded by this gene is a cytoplasmic kinase that is structurally similar to the yeast Ste20p kinase, which acts upstream of the stress-induced mitogen-activated protein kinase cascade. The encoded protein can phosphorylate myelin basic protein and undergoes autophosphorylation. A caspase-cleaved fragment of the encoded protein has been shown to be capable of phosphorylating histone H2B. The particular phosphorylation catalyzed by this protein has been correlated with apoptosis, and it@#$%&s possible that this protein induces the chromatin condensation observed in this process.
27781	
27782	This gene is a member of the protocadherin beta gene cluster, one of three related gene clusters tandemly linked on chromosome five. The gene clusters demonstrate an unusual genomic organization similar to that of B-cell and T-cell receptor gene clusters. The beta cluster contains 16 genes and 3 pseudogenes, each encoding 6 extracellular cadherin domains and a cytoplasmic tail that deviates from others in the cadherin superfamily. The extracellular domains interact in a homophilic manner to specify differential cell-cell connections. Unlike the alpha and gamma clusters, the transcripts from these genes are made up of only one large exon, not sharing common 3@#$%& exons as expected. These neural cadherin-like cell adhesion proteins are integral plasma membrane proteins. Their specific functions are unknown but they most likely play a critical role in the establishment and function of specific cell-cell neural connections.
27783	
27784	
27785	
27786	
27787	
27788	This gene is a classical cadherin from the cadherin superfamily. The encoded protein is a calcium-dependent cell-cell adhesion glycoprotein comprised of five extracellular cadherin repeats, a transmembrane region and a highly conserved cytoplasmic tail. Based on studies in chicken and mouse, this cadherin is thought to play an important role during brain segmentation and neuronal outgrowth. In addition, a role in kidney and muscle development is indicated. Of particular interest are studies showing stable cis-heterodimers of cadherins 2 and 4 in cotransfected cell lines. Previously thought to interact in an exclusively homophilic manner, this is the first evidence of cadherin heterodimerization.
27789	
27790	
27791	
27792	
27793	
27794	
27795	
27796	
27797	The product of this gene belongs to the family of selenoproteins. Selenoproteins contain the rare twenty-first amino acid, selenocysteine (sec). These proteins lack common amino acid sequence motifs, but the 3-prime untranslated regions of selenoprotein genes have a common stem-loop structure, the sec insertion sequence (SECIS), that is necessary for the recognition of UGA as a sec codon rather than as a stop signal. Selenoproteins are thought to be responsible for most biomedical effects of dietary selenium and are essential to mammals. This gene encodes a selenoprotein I.
27798	
27799	
27800	
27801	
27802	
27803	Childhood- and adolescent-onset forms of familial ALS (see ALS1, MIM 105400) carry the designation @#$%&juvenile ALS.@#$%& Several forms of autosomal recessive juvenile ALS have been identified; see ALS2 (MIM 205100) and ALS5 (MIM 602099).[supplied by OMIM]
27804	
27805	
27806	
27807	Dihydropyrimidinase catalyzes the conversion of 5,6-dihydrouracil to 3-ureidopropionate in pyrimidine metabolism.  Dihydropyrimidinase is expressed at a high level in liver and kidney as a major 2.5-kb transcript and a minor 3.8-kb transcript.  Defects in the DPYS gene are linked to dihydropyrimidinuria.
27808	
27809	
27810	
27811	
27812	
27813	This gene is a member of the apolipoprotein L gene family. The encoded protein is found in the cytoplasm, where it may affect the movement of lipids or allow the binding of lipids to organelles.
27814	
27815	
27816	
27817	Carbonic anhydrases (CAs) are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide.  They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid.  They show extensive diversity in tissue distribution and in their subcellular localization.  CA VB is localized in the mitochondria and shows the highest sequence similarity to the other mitochondrial CA, CA VA.  It has a wider tissue distribution than CA VA, which is restricted to the liver.  The differences in tissue distribution suggest that the two mitochondrial carbonic anhydrases evolved to assume different physiologic roles.
27818	
27819	
27820	
27821	KCNE1 together with KCNQ1 encode the two subunits of the K+ channel responsible for the delayed-rectifier K+ current in cardiac myocytes (referred to as IsK).
27822	The protein encoded by this gene stimulates the formation of a non-mitotic multinucleate syncytium from proliferative cytotrophoblasts during trophoblast differentiation. An alternatively spliced transcript variant of this gene may exist.
27823	
27824	
27825	
27826	
27827	
27828	
27829	
27830	
27831	
27832	
27833	
27834	
27835	
27836	
27837	The protein encoded by this gene catalyzes the formation of inositol 1,4,5-trisphosphate and diacylglycerol from phosphatidylinositol 4,5-bisphosphate. This reaction uses calcium as a cofactor and plays an important role in the intracellular transduction of many extracellular signals. This gene is activated by two G-protein alpha subunits, alpha-q and alpha-11. Two transcript variants encoding different isoforms have been found for this gene.
27838	
27839	
27840	
27841	
27842	
27843	
27844	
27845	
27846	
27847	The protein encoded by this gene is highly similar to Saccharomyces cerevisiae Rad50, a protein involved in DNA double-strand break repair. This protein forms a complex with MRE11 and NBS1. The protein complex binds to DNA and displays numerous enzymatic activities that are required for nonhomologous joining of DNA ends. This protein, cooperating with its partners, is important for DNA double-strand break repair, cell cycle checkpoint activation, telomere maintenance, and meiotic recombination. Knockout studies of the mouse homolog suggest this gene is essential for cell growth and viability. Two alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported.
27848	
27849	
27850	
27851	
27852	
27853	
27854	
27855	This gene encodes a member of the AAA (ATPases associated with a variety of cellular activities) protein family. Members of this protein family share an ATPase domain and have roles in diverse cellular processes including membrane trafficking, intracellular motility, organelle biogenesis, protein folding, and proteolysis. The encoded ATPase may be involved in the assembly or function of nuclear protein complexes. Two transcript variants encoding distinct isoforms have been identified for this gene. Other alternative splice variants have been described but their full length sequences have not been determined. Mutations associated with this gene cause the most frequent form of autosomal dominant spastic paraplegia 4.
27856	
27857	
27858	
27859	
27860	
27861	
27862	
27863	
27864	
27865	
27866	
27867	This gene encodes a member of the p34Cdc2 protein kinase family. p34Cdc2 kinase family members are known to be essential for eukaryotic cell cycle control. This gene is in close proximity to CDC2L1, a nearly identical gene in the same chromosomal region. The gene loci including this gene, CDC2L1, as well as metalloprotease MMP21/22, consist of two identical, tandemly linked genomic regions, which are thought to be a part of the larger region that has been duplicated. This gene and CDC2L1 were shown to be deleted or altered frequently in neuroblastoma with amplified MYCN genes. The protein kinase encoded by this gene could be cleaved by caspases and was demonstrated to play roles in cell apoptosis. A large number of the alternatively spliced variants of this gene have been reported.
27868	This gene encodes a protein with an N-terminal filamin-binding domain, a central proline-rich domain, and, multiple C-terminal LIM domains. This protein localizes at cell junctions and may link cell adhesion structures to the actin cytoskeleton. This protein may be involved in the assembly and stabilization of actin-filaments and likely plays a role in modulating cell adhesion, cell morphology and cell motility. This protein also localizes to the nucleus and may affect cardiomyocyte differentiation after binding with the CSX/NKX2-5 transcription factor. Alternative splicing results in multiple transcript variants encoding different isoforms.
27869	
27870	
27871	
27872	
27873	Dihydrofolate reductase converts dihydrofolate into tetrahydrofolate, a methyl group shuttle required for the de novo synthesis of purines, thymidylic acid, and certain amino acids.  While the functional dihydrofolate reductase gene has been mapped to chromosome 5, four intronless processed pseudogenes have been identified on separate chromosomes.  Dihydrofolate reductase deficiency has been linked to megaloblastic anemia.
27874	
27875	Phosphatidylinositol transfer protein is a member of a diverse set of cytosolic phospholipid transfer proteins that are distinguished by their ability to transfer phospholipids between membranes in vitro (Wirtz, 1991).[supplied by OMIM]
27876	
27877	
27878	
27879	Neurexins are a family of proteins that function in the vertebrate nervous system as cell adhesion molecules and receptors. They are encoded by several unlinked genes of which two, NRXN1 and NRXN3, are among the largest known human genes. Three of the genes (NRXN1-3) utilize two alternate promoters and include numerous alternatively spliced exons to generate thousands of distinct mRNA transcripts and protein isoforms. The majority of transcripts are  produced from the upstream promoter and encode alpha-neurexin isoforms; a much smaller number of transcripts are produced from the downstream promoter and encode beta-neurexin isoforms. The alpha-neurexins contain epidermal growth factor-like (EGF-like) sequences and laminin G domains, and have been shown to interact with neurexophilins. The beta-neurexins lack EGF-like sequences and contain fewer laminin G domains than alpha-neurexins.
27880	This gene encodes a protein with multiple PDZ domains. PDZ domains mediate protein-protein interactions, and proteins with multiple PDZ domains often organize multimeric complexes at the plasma membrane. This protein localizes to tight junctions and to the apical membrane of epithelial cells. A similar protein in Drosophila is a scaffolding protein which tethers several members of a multimeric signaling complex in photoreceptors. Alternative splicing results in four transcript variants encoding different isoforms. Additional transcript variants have been described but their full-length sequences have not been determined.
27881	
27882	This gene has similarity to a mouse gene which suggest that this encoded protein might play an important role in the regulation of vascular remodeling.
27883	The PALM2-AKAP2 mRNA is a naturally occurring co-transcribed product of the neighboring PALM2 and AKAP2 genes. The significance of this co-transcribed mRNA and the function of its protein product have not yet been determined. Alternative splicing of this gene results in several transcript variants encoding different isoforms, but the full-length nature of some of these variants has not been defined.
27884	
27885	
27886	
27887	The protein encoded by this gene is a cytoplasmic serine-threonine kinase that contains cysteine in place of the lysine found at the conserved ATP-binding location in subdomain II of protein kinases. Since this protein does have kinase activity, it is possible that another lysine in the kinase subdomain I can substitute for the missing conserved lysine.
27888	Proper cohesion of sister chromatids is a prerequisite for the correct segregation of chromosomes during cell division. The cohesin multiprotein complex is required for sister chromatid cohesion. This complex is composed partly of two structural maintenance of chromosomes (SMC) proteins, SMC3 and either SMC1L2 or the protein encoded by this gene. Most of the cohesin complexes dissociate from the chromosomes before mitosis, although those complexes at the kinetochore remain. Therefore, the encoded protein is thought to be an important part of functional kinetochores. In addition, this protein interacts with BRCA1 and is phosphorylated by ATM, indicating a potential role for this protein in DNA repair. This gene, which belongs to the SMC gene family, is located in an area of the X-chromosome that escapes X inactivation.
27889	
27890	
27891	
27892	
27893	
27894	
27895	
27896	
27897	
27898	
27899	
27900	
27901	
27902	
27903	
27904	
27905	
27906	
27907	
27908	
27909	
27910	
27911	
27912	
27913	
27914	
27915	
27916	
27917	
27918	
27919	
27920	
27921	
27922	
27923	Similarity to a related locus suggests that this gene encodes a subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. This subunit exists as a heterodimer with another polymerase subunit; together they form a core subassembly unit of the polymerase. Two similar genes are located downstream of this locus. Alternative splicing of this gene results in three transcript variants encoding different isoforms.
27924	
27925	
27926	
27927	cAMP is a signaling molecule important for a variety of cellular functions. cAMP exerts its effects by activating the cAMP-dependent protein kinase (AMPK), which transduces the signal through phosphorylation of different target proteins. The inactive holoenzyme of AMPK is a tetramer composed of two regulatory and two catalytic subunits. cAMP causes the dissociation of the inactive holoenzyme into a dimer of regulatory subunits bound to four cAMP and two free monomeric catalytic subunits. Four different regulatory subunits and three catalytic subunits of AMPK have been identified in humans. The protein encoded by this gene is one of the regulatory subunits. This subunit can be phosphorylated by the activated catalytic subunit. It may interact with various A-kinase anchoring proteins and determine the subcellular localization of AMPK. This subunit has been shown to regulate protein transport from endosomes to the Golgi apparatus and further to the endoplasmic reticulum (ER).
27928	
27929	
27930	
27931	
27932	
27933	
27934	
27935	
27936	
27937	Chromosomal DNA of eukaryotic cells is compacted by nuclear proteins to form chromatin, an organized nucleoprotein structure that can inhibit gene expression. Several multisubunit protein complexes exist to remodel the chromatin to allow patterns of cell type-specific gene expression. The protein encoded by this gene is thought to be a core member of one or more of these complexes. The encoded protein, which is a member of the SNF2/RAD54 helicase family, contains two chromodomains, a helicase domain, and an ATPase domain.
27938	Pleckstrin homology domains bind polyphosphoinositides and are involved in platelet cytoskeletal changes. This gene contains a (CAG)n trinucleotide repeat which has been associated with several neurological disorders in which the repeats code for polyglutamine in the protein. This gene may play an important role in the anti-apoptotic effects of insulin-like growth factor I.
27939	
27940	
27941	
27942	
27943	This gene encodes a membrane protein, which is involved in the increase in intracellular calcium concentration that occurs upon cell adhesion to extracellular matrix. The encoded protein is also a receptor for the C-terminal cell binding domain of thrombospondin, and it may play a role in membrane transport and signal transduction. This gene has broad tissue distribution, and is reduced in expression on Rh erythrocytes. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
27944	
27945	
27946	This intronless gene encodes a member of the MAGE superfamily. The encoded protein is 76% identical to the mouse mage-g1 protein.
27947	
27948	
27949	
27950	
27951	
27952	
27953	This gene is a member of the MYC family and encodes a protein with a basic helix-loop-helix (bHLH) domain. This protein is located in the nucleus and must dimerize with another bHLH protein in order to bind DNA. Amplification of this gene is associated with a variety of tumors, most notably neuroblastomas.
27954	
27955	
27956	For background information on chemokines, see CXCL11 (SCYB11; MIM 604852). Stromal cell-derived factors 1-alpha and 1-beta are small cytokines that belong to the intercrine family, members of which activate leukocytes and are often induced by proinflammatory stimuli such as lipopolysaccharide, TNF (see MIM 191160), or IL1 (see MIM 147760). The intercrines are characterized by the presence of 4 conserved cysteines which form 2 disulfide bonds. They can be classified into 2 subfamilies. In the CC subfamily, which includes beta chemokine, the cysteine residues are adjacent to each other. In the CXC subfamily, which includes alpha chemokine, they are separated by an intervening amino acid. The SDF1 proteins belong to the latter group.[supplied by OMIM]
27957	
27958	
27959	Apoptosis, a major form of cell death, is an efficient mechanism for eliminating unwanted cells and is of central importance for development and homeostasis in metazoan animals. In mice, the death inducer-obliterator-1 gene is upregulated by apoptotic signals and encodes a cytoplasmic protein that translocates to the nucleus upon apoptotic signal activation. When overexpressed, the mouse protein induced apoptosis in cell lines growing in vitro. This gene is similar to the mouse gene and therefore is thought to be involved in apoptosis. At least four alternatively spliced transcripts have been found for this gene, but the full-length nature of some variants has not been determined.
27960	
27961	
27962	
27963	
27964	
27965	
27966	
27967	
27968	
27969	
27970	
27971	
27972	The protein encoded by this gene is highly similar to the protein product encoded by gene INSIG1. Both INSIG1 protein and this protein are endoplasmic reticulum proteins that block the processing of sterol regulatory element binding proteins (SREBPs) by binding to SREBP cleavage-activating protein (SCAP), and thus prevent SCAP from escorting SREBPs to the Golgi.
27973	
27974	
27975	
27976	
27977	
27978	
27979	
27980	
27981	
27982	
27983	
27984	
27985	
27986	
27987	
27988	This gene was identified based on homology to Pichia pastoris GSA7 and Saccharomyces cerevisiae APG7. In the yeast, the protein appears to be required for fusion of peroxisomal and vaculuolar membranes. The protein shows homlogy to the ATP-binding and catalytic sites of the E1 ubiquitin activating enzymes.
27989	
27990	
27991	
27992	
27993	
27994	
27995	Proteins of the large Rab GTPase family (see RAB1A; MIM 179508) have regulatory roles in the formation, targeting, and fusion of intracellular transport vesicles. RAB11FIP1 is one of many proteins that interact with and regulate Rab GTPases (Hales et al., 2001).[supplied by OMIM]
27996	
27997	
27998	
27999	Voltage-dependent calcium channels control the rapid entry of Ca(2+) into a variety of cell types and are therefore involved in both electrical and cellular signaling. T-type channels, such as CACNA1I, are activated by small membrane depolarizations and can generate burst firing and pacemaker activity.[supplied by OMIM]
28000	
28001	
28002	
28003	
28004	This gene encodes a protein thought to participate in programmed cell death. Studies using mouse cells have shown that overexpression of this protein can block apoptosis. In addition, the product of this gene binds to the product of the PDCD6 gene, a protein required for apoptosis, in a calcium-dependent manner.  This gene product also binds to endophilins, proteins that regulate membrane shape during endocytosis. Overexpression of this gene product and endophilins results in cytoplasmic vacuolization which may be partly responsible for the protection against cell death.
28005	
28006	
28007	
28008	This gene encodes a protein that acts in the sorting of transmembrane proteins into lysosomes/vacuoles via the multivesicular body (MVB) pathway. This protein, along with other soluble coiled-coil containing proteins, forms part of the ESCRT-III protein complex that binds to the endosomal membrane and recruits additional cofactors for protein sorting into the MVB. This protein may also co-immunoprecipitate with a member of the IFG-binding protein superfamily. Alternative splicing results in multiple transcript variants encoding different isoforms.
28009	
28010	
28011	This gene encodes subunit 3 of the splicing factor 3b protein complex. Splicing factor 3b, together with splicing factor 3a and a 12S RNA unit, forms the U2 small nuclear ribonucleoproteins complex (U2 snRNP). The splicing factor 3b/3a complex binds pre-mRNA upstream of the intron@#$%&s branch site in a sequence independent manner and may anchor the U2 snRNP to the pre-mRNA. Splicing factor 3b is also a component of the minor U12-type spliceosome. Subunit 3 has also been identified as a component of the STAGA (SPT3-TAF(II)31-GCN5L acetylase) transcription coactivator-HAT (histone acetyltransferase) complex, and the TFTC (TATA-binding-protein-free TAF(II)-containing complex). These complexes may function in chromatin modification, transcription, splicing, and DNA repair.
28012	This gene encodes one subunit of the 2-oxoglutarate dehydrogenase complex. This complex catalyzes the overall conversion of 2-oxoglutarate (alpha-ketoglutarate) to succinyl-CoA and CO(2) during the Krebs cycle. The protein is located in the mitocondrial matrix and uses thiamine pyrophosphate as a cofactor. A congential deficiency in 2-oxoglutarate dehydrogenase activity is believed to lead to hypotonia, metabolic acidosis, and hyperlactatemia.
28013	
28014	This gene encodes a tyrosine kinase that binds Cdc42Hs in its GTP-bound form and inhibits both the intrinsic and GTPase-activating protein (GAP)-stimulated GTPase activity of Cdc42Hs. This binding is mediated by a unique sequence of 47 amino acids C-terminal to an SH3 domain. The protein may be involved in a regulatory mechanism that sustains the GTP-bound active form of Cdc42Hs and which is directly linked to a tyrosine phosphorylation signal transduction pathway. Several alternatively spliced transcript variants have been identified from this gene, but the full-length nature of only two transcript variants has been determined.
28015	
28016	
28017	
28018	
28019	
28020	
28021	
28022	
28023	This gene is a candidate tumor suppressor gene. It is located within a homozygously deleted region of a metastatic prostate cancer. The gene is expressed in most nonlymphoid human tissues including prostate, lung, liver, and colon. Expression was also detected in many epithelial tumor cell lines. Two transcript variants encoding distinct isoforms have been identified for this gene.
28024	
28025	Childhood- and adolescent-onset forms of familial ALS (see ALS1, MIM 105400) carry the designation @#$%&juvenile ALS.@#$%& Several forms of autosomal recessive juvenile ALS have been identified; see ALS2 (MIM 205100) and ALS5 (MIM 602099).[supplied by OMIM]
28026	This gene localizes to the immunoglobulin heavy chain locus (IGH@) on chromosome 14. The function of its protein product has not been determined.
28027	Stromal cell derived receptor 1 is a type I transmembrane protein belonging to the Ig superfamily.  The protein is believed to be involved in cell-cell interactions or cell-substrate interactions.  The alpha and beta transcripts show differential localization within the brain.
28028	
28029	
28030	
28031	
28032	Core binding factor (CBF) is a heterodimeric transcription factor that binds to the core element of many enhancers and promoters. The protein encoded by this gene represents the alpha subunit of CBF and is thought to be involved in the development of normal hematopoiesis. Chromosomal translocations involving this gene are well-documented and have been associated with several types of leukemia. Two transcript variants encoding different isoforms have been found for this gene.
28033	
28034	
28035	Glial cell line-derived neurotrophic factor (GDNF) and neurturin (NTN) are two structurally related, potent neurotrophic factors that play key roles in the control of neuron survival and differentiation. The protein encoded by this gene is a member of the GDNF receptor family. It is a glycosylphosphatidylinositol(GPI)-linked cell surface receptor for both GDNF and NTN, and mediates activation of the RET tyrosine kinase receptor. This encoded protein acts preferentially as a receptor for NTN compared to its other family member, GDNF family receptor alpha 1. This gene is a candidate gene for RET-associated diseases.
28036	Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. It is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, F0, which comprises the proton channel. The F1 complex consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled in a ratio of 3 alpha, 3 beta, and a single representative of the other 3. The F0 seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the g subunit of the F0 complex.
28037	
28038	This gene encodes a member of the p34Cdc2 protein kinase family. p34Cdc2 kinase family members are known to be essential for eukaryotic cell cycle control. This gene is in close proximity to CDC2L1, a nearly identical gene in the same chromosomal region. The gene loci including this gene, CDC2L1, as well as metalloprotease MMP21/22, consist of two identical, tandemly linked genomic regions, which are thought to be a part of the larger region that has been duplicated. This gene and CDC2L1 were shown to be deleted or altered frequently in neuroblastoma with amplified MYCN genes. The protein kinase encoded by this gene could be cleaved by caspases and was demonstrated to play roles in cell apoptosis. A large number of the alternatively spliced variants of this gene have been reported.
28039	
28040	This gene encodes a member of the Argonaute family of proteins which play a role in RNA interference. The encoded protein is highly basic, and contains a PAZ domain and a PIWI domain. It may interact with dicer1 and play a role in short-interfering-RNA-mediated gene silencing. This gene is located on chromosome 1 in a cluster of closely related family members including argonaute 3, and argonaute 4.
28041	
28042	This gene encodes a protein that associates with the centromere-kinetochore complex. The protein is a component of the nuclear matrix during the G2 phase of interphase. In late G2 the protein associates with the kinetochore and maintains this association through early anaphase. It localizes to the spindle midzone and the intracellular bridge in late anaphase and telophase, respectively, and is thought to be subsequently degraded. The localization of this protein suggests that it may play a role in chromosome segregation during mitotis. It is thought to form either a homodimer or heterodimer. Autoantibodies against this protein have been found in patients with cancer or graft versus host disease.
28043	
28044	
28045	
28046	
28047	
28048	
28049	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Sequence analysis identified three transcript variants. Pseudogenes corresponding to this gene are found on chromosomes 6p, 10q, and Xp.
28050	
28051	
28052	This gene encodes a member of a conserved protein family that shares a common DNA-binding domain, the T-box. T-box genes encode transcription factors involved in the regulation of developmental processes. A similiar gene disrupted in mice is shown to be essential during trophoblast development and gastrulation.
28053	This gene encodes a chemokine receptor like protein, which is predicted to be a seven transmembrane protein and most closely related to CCR1. Chemokines and their receptors mediated signal transduction are critical for the recruitment of effector immune cells to the site of inflammation. This gene is expressed at high levels in primary neutrophils and primary monocytes, and is further upregulated on neutrophil activation and during monocyte to macrophage differentiation. The function of this gene is unknown. This gene is mapped to the region where the chemokine receptor gene cluster is located.
28054	
28055	This gene belongs to the chemokine-like factor gene superfamily, a novel superfamily that links the chemokine gene family and the transmembrane 4 superfamily. This gene is one of several chemokine-like factor genes located in a cluster on chromosome 3. The protein encoded by this gene is highly expressed in leukocytes, but its exact function is unknown. Alternatively spliced transcript variants encoding different isoforms have been identified.
28056	
28057	
28058	
28059	
28060	
28061	The protein encoded by this gene is a cytoplasmic actin-binding protein thought to play a role in the process of neuronal growth. It is a member of the drebrin family of proteins that are developmentally regulated in the brain. A decrease in the amount of this protein in the brain has been implicated as a possible contributing factor in the pathogenesis of memory disturbance in Alzheimer@#$%&s disease. At least two alternative splice variants encoding different protein isoforms have been described for this gene.
28062	
28063	This gene encodes the beta subunit of casein kinase II, a ubiquitous protein kinase which regulates metabolic pathways, signal transduction, transcription, translation, and replication. The enzyme is composed of three subunits, alpha, alpha prime and beta, which form a tetrameric holoenzyme. The alpha and alpha prime subunits are catalytic, while the beta subunit serves regulatory functions. The enzyme localizes to the endoplasmic reticulum and the Golgi apparatus.
28064	
28065	
28066	The product of this gene is a microtubule-associated protein that is predominantly expressed in cells of epithelial origin. Microtubule-associated proteins are thought to be involved in microtubule dynamics, which is essential for cell polarization and differentiation. This protein has been shown to be able to stabilize microtubules, and may serve to modulate microtubule functions. Studies of the related mouse protein also suggested an essential role in microtubule function required for spermatogenesis.
28067	SLC16A10 is a member of a family of plasma membrane amino acid transporters that mediate the Na(+)-independent transport of aromatic amino acids across the plasma membrane.[supplied by OMIM]
28068	
28069	
28070	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members. This gene encodes a protein whose function has not been determined.
28071	
28072	
28073	
28074	The function of this gene has not yet been determined; however, it is speculated that it may represent a breast cancer candidate gene.  It is located close to FGFR1 on a region of chromosome 8 that is amplified in some breast cancers.
28075	This gene encodes a potential transmembrane protein that may function in vesicle-mediated transport and sorting of proteins within the cell. This protein may play a role in the development and the function of the eye, hematological system, and central nervous system. Mutations in this gene have been associated with Cohen syndrome. Multiple splice variants encoding distinct isoforms have been identified for this gene.
28076	This gene was identified as an estrogen-responsive gene. Regulation of transcription by estrogen is mediated by estrogen receptor which binds to the estrogen-responsive element (ERE) found in the 5@#$%&-flanking region of this gene. The encoded protein is a tumor-associated antigen that is expressed at high frequency in a variety of cancers. Two transcript variants differing in the 5@#$%& UTR, but encoding the same protein, have been identified for this gene.
28077	
28078	
28079	
28080	The protein encoded by this gene shares similarity with BRE1 of S. cerevisiae. Yeast BRE1 is a ubiquitin ligase required for the ubiquitination of histone H2B and the methylation of histone H3.
28081	When Chinese hamster K12 cells are starved of glucose, the synthesis of several proteins, called glucose-regulated proteins (GRPs), is markedly increased. Hendershot et al. (1994) pointed out that one of these, GRP78 (HSPA5), also referred to as @#$%&immunoglobulin heavy chain-binding protein@#$%& (BiP), is a member of the heat-shock protein-70 (HSP70) family and is involved in the folding and assembly of proteins in the endoplasmic reticulum (ER). Because so many ER proteins interact transiently with GRP78, it may play a key role in monitoring protein transport through the cell.[supplied by OMIM]
28082	
28083	This gene encodes a member of the nucleoporin family that shares 87% sequence identity with rat nucleoporin p58. The protein is localized to the nuclear rim and is a component of the nuclear pore complex (NPC). All molecules entering or leaving the nucleus either diffuse through or are actively transported by the NPC. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
28084	
28085	The CTRP5 protein is a member of the C1q (see MIM 120550)/tumor necrosis factor (MIM 191160) superfamily, which shows diverse functions including cell adhesion and basement membrane components (Shapiro and Scherer, 1998). C1QTNF5 is mutant in late-onset retinal degeneration (LORD; MIM 605670).[supplied by OMIM]
28086	
28087	This gene is a member of the GRB2-associated binding protein (GAB) gene family and is similar to the GAB1 gene. These proteins contain pleckstrin homology (PH) domain, and bind SHP2 tyrosine phosphatase and GRB2 adapter protein. They act as adapters for transmitting various signals in response to stimuli through cytokine and growth factor receptors, and T- and B-cell antigen receptors. The protein encoded by this gene is the principal activator of phosphatidylinositol-3 kinase in response to activation of the high affinity IgE receptor. Two alternatively spliced transcripts encoding different isoforms have been described for this gene.
28088	
28089	
28090	
28091	
28092	Nucleocytoplasmic transport, a signal- and energy-dependent process, takes place through nuclear pore complexes embedded in the nuclear envelope. The import of proteins containing a nuclear localization signal (NLS) requires the NLS import receptor, a heterodimer of importin alpha and beta subunits also known as karyopherins. Importin alpha binds the NLS-containing cargo in the cytoplasm and importin beta docks the complex at the cytoplasmic side of the nuclear pore complex. In the presence of nucleoside triphosphates and the small GTP binding protein Ran, the complex moves into the nuclear pore complex and the importin subunits dissociate. Importin alpha enters the nucleoplasm with its passenger protein and importin beta remains at the pore. Interactions between importin beta and the FG repeats of nucleoporins are essential in translocation through the pore complex. The protein encoded by this gene is a member of the importin beta family.
28093	
28094	This gene is a member of the TIS11 family of early response genes. Family members are induced by various agonists such as the phorbol ester TPA and the polypeptide mitogen EGF. The gene is well conserved across species and has a promoter that contains motifs seen in other early-response genes. The encoded protein contains a distinguishing putative zinc finger domain with a repeating cys-his motif. This putative nuclear transcription factor most likely functions in regulating the response to growth factors.
28095	
28096	
28097	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This protein is a potent epithelial cell-specific growth factor, whose mitogenic activity is predominantly exhibited in keratinocytes but not in fibroblasts and endothelial cells. Studies of mouse and rat homologs of this gene implicated roles in morphogenesis of epithelium, reepithelialization of wounds, hair development and early lung organogenesis.
28098	
28099	
28100	This gene encodes a member of the tweety family of proteins. Members of this family function as chloride anion channels. The encoded protein functions as a calcium(2+)-activated large conductance chloride(-) channel, and may play a role in kidney tumorigenesis. Two transcript variants encoding distinct isoforms have been identified for this gene.
28101	This gene encodes a NIMA-related kinase. Members of this serine/threonine protein kinase family are structurally-related to NIMA (never in mitosis, gene A) which controls mitotic signaling in Aspergillus nidulans.
28102	This gene encodes the largest subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. The product of this gene contains a carboxy terminal domain composed of heptapeptide repeats that are essential for polymerase activity. These repeats contain serine and threonine residues that are phosphorylated in actively transcribing RNA polymerase. In addition, this subunit, in combination with several other polymerase subunits, forms the DNA binding domain of the polymerase, a groove in which the DNA template is transcribed into RNA.
28103	
28104	
28105	
28106	
28107	
28108	
28109	
28110	
28111	
28112	
28113	
28114	
28115	
28116	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes (E1s), ubiquitin-conjugating enzymes (E2s) and ubiquitin-protein ligases (E3s). This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme is demonstrated to participate in the ubiquitination of p53, c-Fos, and the NF-kB precursor p105 in vitro. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
28117	
28118	
28119	This gene encodes a member of the neurexin family which functions in the vertebrate nervous system as cell adhesion molecules and receptors. This protein, like other neurexin proteins, contains epidermal growth factor repeats and laminin G domains. In addition, it includes an F5/8 type C domain, discoidin/neuropilin- and fibrinogen-like domains, thrombospondin N-terminal-like domains and a putative PDZ binding site. This protein is localized at the juxtaparanodes of myelinated axons and associated with potassium channels. It may play a role in the local differentiation of the axon into distinct functional subdomains. This gene encompasses almost 1.5% of chromosome 7 and is one of the largest genes in the human genome. It may represent a positional candidate gene for the DFNB13 form of nonsyndromic deafness.
28120	
28121	
28122	
28123	
28124	
28125	
28126	
28127	
28128	
28129	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L24E family of ribosomal proteins. It is located in the cytoplasm. This gene has been referred to as ribosomal protein L30 because the encoded protein shares amino acid identity with the L30 ribosomal proteins from S. cerevisiae; however, its official name is ribosomal protein L24. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
28130	
28131	
28132	
28133	
28134	
28135	
28136	
28137	
28138	
28139	
28140	UHRF1 binds to an inverted CCAAT box in the promoter of topoisomerase II-alpha (TOP2A; MIM 126430). TOP2A introduces transient double-stranded breaks in DNA, which are required during the cell cycle for DNA replication, chromosome condensation, and segregation. UHRF1 is likely involved in TOPO2A expression (Hopfner et al., 2000).[supplied by OMIM]
28141	
28142	
28143	
28144	
28145	
28146	
28147	
28148	
28149	
28150	
28151	
28152	
28153	
28154	
28155	
28156	
28157	
28158	
28159	
28160	
28161	
28162	
28163	This gene belongs to the forkhead family of transcription factors which are characterized by a distinct forkhead domain. This gene likely functions as a trigger for apoptosis through expression of genes necessary for cell death. Translocation of this gene with the MLL gene is associated with secondary acute leukemia. Alternatively spliced transcript variants encoding the same protein have been observed.
28164	
28165	
28166	
28167	
28168	
28169	
28170	
28171	
28172	
28173	
28174	
28175	
28176	
28177	Vasoactive intestinal peptide (VIP; MIM 192320) and pituitary adenylate cyclase activating polypeptide (PACAP; MIM 102980) are homologous peptides that function as neurotransmitters and neuroendocrine hormones. While the receptors for VIP and PACAP share homology, they differ in their substrate specificities and expression patterns. See VIPR1 (MIM 192321) and ADCYAP1R1(MIM 102981).[supplied by OMIM]
28178	
28179	
28180	
28181	
28182	
28183	
28184	
28185	This gene encodes a disintegrin and metalloproteinase with thrombospondin motifs-12 (ADAMTS12), which is a member of the ADAMTS protein family. Members of the family share several distinct protein modules, including a propeptide region, a metalloproteinase domain, a disintegrin-like domain, and a thrombospondin type 1 (TS-1) motif. Individual members of this family differ in the number of C-terminal TS-1 motifs, and some have unique C-terminal domains. The enzyme encoded by this gene contains 8 TS-1 motifs. It may play roles in pulmonary cells during fetal development or in tumor processes through its proteolytic activity or as a molecule potentially involved in regulation of cell adhesion.
28186	
28187	SMYD3 is a histone methyltransferase that plays a role in transcriptional regulation as a member of an RNA polymerase complex.[supplied by OMIM]
28188	
28189	
28190	
28191	
28192	Endocytosis of cell surface proteins requires a dynamic complex of proteins that assemble on the inner surface of the plasma membrane. Stonin-2 is a component of the endocytic machinery that likely regulates vesicle endocytosis.[supplied by OMIM]
28193	
28194	
28195	The protein encoded by this gene is one of the highly conserved mini-chromosome maintenance proteins (MCM) that are involved in the initiation of eukaryotic genome replication. The hexameric protein complex formed by MCM proteins is a key component of the pre-replication complex (pre_RC) and may be involved in the formation of replication forks and in the recruitment of other DNA replication related proteins. This protein forms a complex with MCM4, 6, and 7, and has been shown to regulate the helicase activity of the complex. This protein is phosphorylated, and thus regulated by, protein kinases CDC2 and CDC7.
28196	
28197	
28198	
28199	
28200	This gene encodes a tumor antigen recognized by autologous cytolytic lymphocytes (CTL).
28201	
28202	
28203	
28204	This gene is one of seven beta-1,4-galactosyltransferase (beta4GalT) genes. They encode type II membrane-bound glycoproteins that appear to have exclusive specificity for the donor substrate UDP-galactose; all transfer galactose in a beta1,4 linkage to similar acceptor sugars: GlcNAc, Glc, and Xyl. Each beta4GalT has a distinct function in the biosynthesis of different glycoconjugates and saccharide structures. As type II membrane proteins, they have an N-terminal hydrophobic signal sequence that directs the protein to the Golgi apparatus and which then remains uncleaved to function as a transmembrane anchor. By sequence similarity, the beta4GalTs form four groups: beta4GalT1 and beta4GalT2, beta4GalT3 and beta4GalT4, beta4GalT5 and beta4GalT6, and beta4GalT7. The enzyme encoded by this gene appears to mainly play a role in glycolipid biosynthesis. Two alternatively spliced transcript variants have been found for this gene.
28205	
28206	This gene encodes a protein which contains WD repeats and an FYVE domain. Multiple alternatively spliced transcript variants have been found for this gene, but the full-length nature of some variants has not been defined.
28207	
28208	
28209	
28210	
28211	
28212	
28213	
28214	The p21 activated kinases (PAK) are critical effectors that link Rho GTPases to cytoskeleton reorganization and nuclear signaling. The PAK proteins are a family of serine/threonine kinases that serve as targets for the small GTP binding proteins, CDC42 and RAC1, and have been implicated in a wide range of biological activities. The protein encoded by this gene is activated by proteolytic cleavage during caspase-mediated apoptosis, and may play a role in regulating the apoptotic events in the dying cell.
28215	HLA-A belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-A alleles have been described.
28216	
28217	
28218	
28219	
28220	
28221	
28222	
28223	
28224	
28225	
28226	
28227	
28228	
28229	
28230	
28231	
28232	
28233	
28234	
28235	
28236	
28237	
28238	
28239	
28240	
28241	
28242	
28243	
28244	
28245	
28246	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal phosphoprotein that is a component of the 60S subunit. The protein, which is a functional equivalent of the E. coli L7/L12 ribosomal protein, belongs to the L12P family of ribosomal proteins. It plays an important role in the elongation step of protein synthesis. Unlike most ribosomal proteins, which are basic, the encoded protein is acidic. Its C-terminal end is nearly identical to the C-terminal ends of the ribosomal phosphoproteins P0 and P2. The P1 protein can interact with P0 and P2 to form a pentameric complex consisting of P1 and P2 dimers, and a P0 monomer. The protein is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome. Through sequence analysis, two alternatively spliced transcript variants that encode different proteins were found.
28247	
28248	
28249	
28250	
28251	
28252	
28253	
28254	
28255	
28256	
28257	
28258	
28259	
28260	
28261	
28262	
28263	
28264	
28265	
28266	
28267	
28268	
28269	
28270	
28271	
28272	
28273	
28274	
28275	This gene encodes a member of the Golgi-localized, gamma adaptin ear-containing, ARF-binding (GGA) family. This family includes ubiquitous coat proteins that regulate the trafficking of proteins between the trans-Golgi network and the lysosome. These proteins share an amino-terminal VHS domain which mediates sorting of the mannose 6-phosphate receptors at the trans-Golgi network. They also contain a carboxy-terminal region with homology to the ear domain of gamma-adaptins. This family member may play a significant role in cargo molecules regulation and clathrin-coated vesicle assembly. Alternative splicing of this gene results in two transcript variants.
28276	
28277	
28278	
28279	
28280	
28281	
28282	
28283	
28284	
28285	
28286	
28287	
28288	
28289	
28290	
28291	
28292	
28293	
28294	
28295	
28296	
28297	
28298	
28299	
28300	
28301	The activation peptides of aspartic proteinases plays role as inhibitors of the active site. These peptide segments, or pro-parts, are deemed important for correct folding, targeting, and control of the activation of aspartic proteinase zymogens. The pronapsin A gene is expressed predominantly in lung and kidney. Its translation product is predicted to be a fully functional, glycosylated aspartic proteinase precursor containing an RGD motif and an additional 18 residues at its C-terminus.
28302	
28303	
28304	
28305	
28306	
28307	Guanylate binding protein expression is induced by interferon.  Guanylate binding proteins are characterized by their ability to specifically bind guanine nucleotides (GMP, GDP, and GTP) and are distinguished from the GTP-binding proteins by the presence of 2 binding motifs rather than 3
28308	
28309	
28310	
28311	The protein encoded by this gene belongs to the CDK subfamily of the Ser/Thr protein kinase family. The CDK subfamily members are highly similar to the gene products of S. cerevisiae cdc28, and S. pombe cdc2, and are known to be essential for cell cycle progression. This kinase has been shown to play a role in cellular proliferation. Its function is limited to cell cycle G2-M phase. At least three alternatively spliced transcript variants encoding different isoforms have been reported, two of which contain multiple non-AUG translation initiation sites.
28312	
28313	
28314	
28315	
28316	
28317	
28318	
28319	
28320	
28321	
28322	
28323	The FRAS1 gene encodes a putative extracellular matrix (ECM) protein and is mutated in Fraser syndrome (MIM 219000) (McGregor et al., 2003).[supplied by OMIM]
28324	
28325	
28326	This gene was identified based on homology to Pichia pastoris GSA7 and Saccharomyces cerevisiae APG7. In the yeast, the protein appears to be required for fusion of peroxisomal and vaculuolar membranes. The protein shows homlogy to the ATP-binding and catalytic sites of the E1 ubiquitin activating enzymes.
28327	
28328	
28329	
28330	
28331	
28332	
28333	
28334	
28335	
28336	
28337	
28338	
28339	
28340	
28341	The protein encoded by this gene is a member of the NR1 subfamily of nuclear hormone receptors. It can bind as a monomer or as a homodimer to hormone response elements upstream of several genes to enhance the expression of those genes. The specific functions of this protein are not known, but it has been shown to interact with NM23-2, a nucleoside diphosphate kinase involved in organogenesis and differentiation, as well as with NM23-1, the product of a tumor metastasis suppressor candidate gene. Four transcript variants encoding different isoforms have been described for this gene.
28342	
28343	
28344	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the OABP subfamily. Alternatively referred to as the RNase L inhibitor, this protein functions to block the activity of ribonuclease L. Activation of ribonuclease L leads to inhibition of protein synthesis in the 2-5A/RNase L system, the central pathway for viral interferon action.
28345	
28346	
28347	
28348	
28349	
28350	
28351	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L10E family of ribosomal proteins. It is located in the cytoplasm. In vitro studies have shown that the chicken protein can bind to c-Jun and can repress c-Jun-mediated transcriptional activation, but these activities have not been demonstrated in vivo. This gene was initially identified as a candidate for a Wilms tumor suppressor gene, but later studies determined that this gene is not involved in the suppression of Wilms tumor. This gene has been referred to as @#$%&laminin receptor homolog@#$%& because a chimeric transcript consisting of sequence from this gene and sequence from the laminin receptor gene was isolated; however, it is not believed that this gene encodes a laminin receptor. Transcript variants utilizing alternative polyA signals exist. The variant with the longest 3@#$%& UTR overlaps the deoxyribonuclease I-like 1 gene on the opposite strand. This gene is co-transcribed with the small nucleolar RNA gene U70, which is located in its fifth intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
28352	This gene encodes a member of the adenylate kinase family of enzymes. The encoded protein is localized to the mitochondrial matrix. Adenylate kinases regulate the adenine and guanine nucleotide compositions within a cell by catalyzing the reversible transfer of phosphate group among these nucleotides. Five isozymes of adenylate kinase have been identified in vertebrates. Expression of these isozymes is tissue-specific and developmentally regulated. A pseudogene for this gene has been located on chromosome 17. Three transcript variants encoding the same protein have been identified for this gene. Sequence alignment suggests that the gene defined by NM_013410, NM_203464, and NM_001005353 is located on chromosome 1.
28353	This gene encodes a member of the G protein-coupled receptor family, as well as the EDG family of proteins. This protein functions as a cellular receptor for lysophosphatidic acid and mediates lysophosphatidic acid-evoked calcium mobilization. This receptor couples predominantly to G(q/11) alpha proteins.
28354	
28355	
28356	
28357	
28358	
28359	
28360	
28361	
28362	
28363	
28364	
28365	
28366	The protein encoded by this gene is a member of the type 3 G protein-coupled receptor family. Members of this superfamily are characterized by a signature 7-transmembrane domain motif. The specific function of this protein is unknown; however, this protein may mediate the cellular effects of retinoic acid on the G protein signal transduction cascade. Two transcript variants encoding different isoforms have been found for this gene.
28367	
28368	
28369	
28370	
28371	
28372	
28373	
28374	
28375	This gene encodes a type II classical cadherin from the cadherin superfamily of integral membrane proteins that mediate calcium-dependent cell-cell adhesion. Mature cadherin proteins are composed of a large N-terminal extracellular domain, a single membrane-spanning domain, and a small, highly conserved C-terminal cytoplasmic domain. Type II (atypical) cadherins are defined based on their lack of a HAV cell adhesion recognition sequence specific to type I cadherins. This particular cadherin appears to be expressed specifically in the brain and its temporal pattern of expression would be consistent with a role during a critical period of neuronal development, perhaps specifically during synaptogenesis.
28376	
28377	
28378	
28379	
28380	
28381	Heterogeneous nuclear RNAs (hnRNAs) which include mRNA precursors and mature mRNAs are associated with specific proteins to form heterogenous ribonucleoprotein (hnRNP) complexes. Heterogeneous nuclear ribonucleoprotein L is among the proteins that are stably associated with hnRNP complexes and along with other hnRNP proteins is likely to play a major role in the formation, packaging, processing, and function of mRNA. Heterogeneous nuclear ribonucleoprotein L is present in the nucleoplasm as part of the HNRP complex. HNRP proteins have also been identified outside of the nucleoplasm. Exchange of hnRNP for mRNA-binding proteins accompanies transport of mRNA from the nucleus to the cytoplasm. Since HNRP proteins have been shown to shuttle between the nucleus and the cytoplasm, it is possible that they also have cytoplasmic functions. Two transcript variants encoding different isoforms have been found for this gene.
28382	
28383	
28384	
28385	
28386	The protein encoded by this gene is a member of the BTG/Tob family. This family has structurally related proteins that appear to have antiproliferative properties. This encoded protein is involved in the regulation of the G1/S transition of the cell cycle.
28387	
28388	
28389	This gene is a member of the GAGE family, which is expressed in a variety of tumors and in some fetal and reproductive tissues. The protein encoded by this gene shares a sequence similarity with other GAGE/PAGE proteins. It may also belong to a family of CT (cancer-testis) antigens. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants have not been determined.
28390	This gene encodes a human homolog of Drosophila splicing regulatory protein. This gene autoregulates its expression by control of splicing of its first two introns. In addition, it also regulates the splicing of fibronectin and CD45 genes. Multiple alternatively spliced variants have been identified. Two alternatively spliced variants have been characterized to date.
28391	
28392	
28393	
28394	
28395	
28396	
28397	
28398	
28399	
28400	
28401	
28402	
28403	
28404	
28405	In cardiac myocytes, Ca(2+) concentrations alternate between high levels during contraction and low levels during relaxation. The increase in Ca(2+) concentration during contraction is primarily due to release of Ca(2+) from intracellular stores. However, some Ca(2+) also enters the cell through the sarcolemma (plasma membrane). During relaxation, Ca(2+) is sequestered within the intracellular stores. To prevent overloading of intracellular stores, the Ca(2+) that entered across the sarcolemma must be extruded from the cell. The Na(+)-Ca(2+) exchanger is the primary mechanism by which the Ca(2+) is extruded from the cell during relaxation. In the heart, the exchanger may play a key role in digitalis action. The exchanger is the dominant mechanism in returning the cardiac myocyte to its resting state following excitation.[supplied by OMIM]
28406	
28407	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases that catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein lacks one amino acid of the conserved heme binding site. It also lacks the conserved I-helix motif AGX(D,E)T, suggesting that its substrate may carry its own oxygen. Alternative splicing results in two transcript variants encoding different isoforms.
28408	
28409	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbls class and, in addition to an F-box, contains several tandem leucine-rich repeats.
28410	
28411	
28412	
28413	
28414	
28415	
28416	
28417	
28418	
28419	
28420	
28421	
28422	
28423	
28424	
28425	
28426	
28427	
28428	
28429	
28430	
28431	
28432	
28433	This gene encodes the heavy subunit of the ferritin protein. Ferritin is the major intracellular iron storage protein in prokaryotes and eukaryotes. It is composed of 24 subunits of the heavy and light ferritin chains. Variation in ferritin subunit composition may affect the rates of iron uptake and release in different tissues. A major function of ferritin is the storage of iron in a soluble and nontoxic state. Defects in ferritin proteins are associated with several neurodegenerative diseases. This gene has multiple pseudogenes. Several alternatively spliced transcript variants have been observed, but their biological validity has not been determined.
28434	
28435	TOM7 is a small regulatory component of the translocase of the outer mitochondrial membrane (TOM), a general import pore complex that translocates preproteins into mitochondria.[supplied by OMIM]
28436	
28437	
28438	
28439	
28440	
28441	
28442	
28443	
28444	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
28445	
28446	
28447	
28448	The protein encoded by this gene is a member of the SWI/SNF family of proteins, whose members display helicase and ATPase activities and which are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI and contains a predicted leucine zipper motif typical of many transcription factors.
28449	
28450	
28451	
28452	
28453	
28454	
28455	
28456	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. The function of this gene has not yet been determined. The expression pattern of the mouse homolog implies a role in nervous system development.
28457	
28458	
28459	
28460	
28461	
28462	
28463	
28464	
28465	
28466	
28467	This gene encodes a member of the sorting nexin family. Members of this family contain a phox (PX) domain, which is a phosphoinositide binding domain, and are involved in intracellular trafficking. This protein does not contain a coiled coil region, like some family members. This gene encodes a protein of unknown function.
28468	
28469	
28470	
28471	
28472	The protein encoded by this gene is a microsomal enzyme that catalyzes the esterification of all-trans-retinol into all-trans-retinyl ester, an essential reaction for the retinoid cycle in visual system and vitamin A status in liver. Mutations in this gene have been associated with early-onset severe retinal dystrophy.
28473	
28474	
28475	
28476	
28477	
28478	
28479	
28480	
28481	
28482	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
28483	
28484	
28485	
28486	
28487	
28488	
28489	
28490	
28491	
28492	
28493	
28494	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
28495	
28496	
28497	
28498	
28499	DISCONTINUED: The record for PBM1 has been withdrawn by HGNC.
28500	
28501	
28502	
28503	
28504	
28505	
28506	
28507	
28508	
28509	
28510	
28511	
28512	
28513	
28514	The protein encoded by this gene contains a RING finger, which is a motif known to be involved in protein-protein interactions. This protein is a membrane-bound ubiquitin ligase. It can regulate cell motility by targeting paxillin ubiquitination and altering the distribution and localization of paxillin in cytoplasm and cell focal adhesions.
28515	The protein encoded by this gene is an atypical member of the family of growth factor receptor protein tyrosine kinases, differing from other members at a number of conserved residues in the activation and nucleotide binding domains. This gene product belongs to a subfamily whose members do not appear to be regulated by phosphorylation in the activation segment. It has been suggested that mediation of biological activity by recruitment of a signaling-competent auxiliary protein may occur through an as yet uncharacterized mechanism. Two alternative splice variants have been identified, encoding distinct isoforms.
28516	
28517	
28518	
28519	
28520	
28521	
28522	The protein encoded by this gene is a calcyclin binding protein. It may be involved in calcium-dependent ubiquitination and subsequent proteosomal degradation of target proteins. It probably serves as a molecular bridge in ubiquitin E3 complexes and participates in the ubiquitin-mediated degradation of beta-catenin. Two alternatively spliced transcript variants encoding different isoforms have been found for this gene.
28523	N-methyl-D-aspartate (NMDA) receptors are a class of ionotropic glutamate receptors. NMDA receptor channel has been shown to be involved in long-term potentiation, an activity-dependent increase in the efficiency of synaptic transmission thought to underlie certain kinds of memory and learning.  NMDA receptor channels are heteromers composed of the key receptor subunit NMDAR1 (GRIN1) and 1 or more of the 4 NMDAR2 subunits: NMDAR2A (GRIN2A), NMDAR2B (GRIN2B), NMDAR2C (GRIN2C), and NMDAR2D (GRIN2D).  GRIN2B may be a candidate gene for the neurodegenerative disorder dentato-rubro-pallidoluysian atrophy (DRPLA)
28524	This gene is the cellular homolog of the fox sequence in the Finkel-Biskis-Reilly murine sarcoma virus (FBR-MuSV). It encodes a fusion protein consisting of the ubiquitin-like protein fubi at the N terminus and ribosomal protein S30 at the C terminus. It has been proposed that the fusion protein is post-translationally processed to generate free fubi and free ribosomal protein S30. Fubi is a member of the ubiquitin family, and ribosomal protein S30 belongs to the S30E family of ribosomal proteins. Whereas the function of fubi is currently unknown, ribosomal protein S30 is a component of the 40S subunit of the cytoplasmic ribosome. Pseudogenes derived from this gene are present in the genome. Similar to ribosomal protein S30, ribosomal proteins S27a and L40 are synthesized as fusion proteins with ubiquitin.
28525	
28526	
28527	This gene is the cellular homolog of the fox sequence in the Finkel-Biskis-Reilly murine sarcoma virus (FBR-MuSV). It encodes a fusion protein consisting of the ubiquitin-like protein fubi at the N terminus and ribosomal protein S30 at the C terminus. It has been proposed that the fusion protein is post-translationally processed to generate free fubi and free ribosomal protein S30. Fubi is a member of the ubiquitin family, and ribosomal protein S30 belongs to the S30E family of ribosomal proteins. Whereas the function of fubi is currently unknown, ribosomal protein S30 is a component of the 40S subunit of the cytoplasmic ribosome. Pseudogenes derived from this gene are present in the genome. Similar to ribosomal protein S30, ribosomal proteins S27a and L40 are synthesized as fusion proteins with ubiquitin.
28528	
28529	
28530	
28531	
28532	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes, or E1s, ubiquitin-conjugating enzymes, or E2s, and ubiquitin-protein ligases, or E3s. This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. Studies in mouse suggest that this protein plays a role in DNA postreplication repair.
28533	
28534	The protein encoded by this gene is a small GTPase of the Rho-subfamily, which regulates signaling pathways that control diverse cellular functions including cell morphology, migration, endocytosis and cell cycle progression. This protein is highly similar to Saccharomyces cerevisiae Cdc 42, and is able to complement the yeast cdc42-1 mutant. The product of oncogene Dbl was reported to specifically catalyze the dissociation of GDP from this protein. This protein could regulate actin polymerization through its direct binding to Neural Wiskott-Aldrich syndrome protein (N-WASP), which subsequently activates Arp2/3 complex. Alternative splicing of this gene results in at least two transcript variants.
28535	
28536	
28537	
28538	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein that belongs to the YmL27 ribosomal protein family.
28539	
28540	
28541	
28542	
28543	
28544	
28545	
28546	
28547	
28548	
28549	
28550	
28551	
28552	
28553	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of the ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Alternative splicing of this gene generates at least 3 transcript variants diverging at the 5@#$%& terminus.
28554	
28555	
28556	
28557	
28558	
28559	
28560	
28561	
28562	This gene encodes a subunit of a kainate glutamate receptor. Glutamate receptors mediate the majority of excitatory neurotransmission in the brain. This receptor may have a role in synaptic plasticity and may be important for learning and memory. It also may be involved in the transmission of light information from the retina to the hypothalamus. The structure and function of the encoded protein is changed by RNA editing. Alternatively spliced transcript variants encoding distinct isoforms have been described for this gene.
28563	
28564	
28565	
28566	
28567	
28568	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
28569	
28570	
28571	
28572	
28573	The Tousled-like kinases, first described in Arabidopsis, are nuclear serine/threonine kinases that are potentially involved in the regulation of chromatin assembly.[supplied by OMIM]
28574	
28575	
28576	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role in cells. The protein encoded by this gene is one of the PKC family members. This protein kinase has been reported to be involved in many different cellular functions, such as B cell activation, apoptosis induction, endothelial cell proliferation, and intestinal sugar absorption. Studies in mice also suggest that this kinase may also regulate neuronal functions and correlate fear-induced conflict behavior after stress. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
28577	
28578	
28579	
28580	
28581	
28582	
28583	
28584	
28585	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein, which is the functional equivalent of the E. coli L10 ribosomal protein, belongs to the L10P family of ribosomal proteins. It is a neutral phosphoprotein with a C-terminal end that is nearly identical to the C-terminal ends of the acidic ribosomal phosphoproteins P1 and P2. The P0 protein can interact with P1 and P2 to form a pentameric complex consisting of P1 and P2 dimers, and a P0 monomer. The protein is located in the cytoplasm. Transcript variants derived from alternative splicing exist; they encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
28586	
28587	
28588	
28589	
28590	
28591	
28592	
28593	The protein encoded by this gene contains a leucine-rich repeat and a death domain. This protein has been shown to interact with other death domain proteins, such as Fas (TNFRSF6)-associated via death domain (FADD) and MAP-kinase activating death domain-containing protein (MADD), and thus may function as an adaptor protein in cell death-related signaling processes. The expression of the mouse counterpart of this gene has been found to be positively regulated by the tumor suppressor p53 and to induce cell apoptosis in response to DNA damage, which suggests a role for this gene as an effector of p53-dependent apoptosis. Three alternatively spliced transcript variants encoding distinct isoforms have been reported.
28594	
28595	
28596	
28597	
28598	
28599	
28600	
28601	
28602	
28603	This gene encodes a protein localized in mitochondria and closely related to paraplegin. The paraplegin gene is responsible for an autosomal recessive form of hereditary spastic paraplegia. This gene is a candidate gene for other hereditary spastic paraplegias or neurodegenerative disorders.
28604	
28605	
28606	
28607	
28608	
28609	
28610	WW domain-containing proteins are found in all eukaryotes and play an important role in the regulation of a wide variety of cellular functions such as protein degradation, transcription, and RNA splicing. This gene encodes a protein which contains 2 WW domains and a short-chain dehydrogenase/reductase domain (SRD). The highest normal expression of this gene is detected in hormonally regulated tissues such as testis, ovary, and prostate. This expression pattern and the presence of an SRD domain suggest a role for this gene in steroid metabolism. The encoded protein is more than 90% identical to the mouse protein, which is an essential mediator of tumor necrosis factor-alpha-induced apoptosis, suggesting a similar, important role in apoptosis for the human protein. In addition, there is evidence that this gene behaves as a suppressor of tumor growth. Alternative splicing of this gene generates 7 transcript variants, which encode different isoforms.
28611	
28612	
28613	This gene is a member of the TAFA family which is composed of five highly homologous genes that encode small secreted proteins. These proteins contain conserved cysteine residues at fixed positions, and are distantly related to MIP-1alpha, a member of the CC-chemokine family. The TAFA proteins are predominantly expressed in specific regions of the brain, and are postulated to function as brain-specific chemokines or neurokines that act as regulators of immune and nervous cells.
28614	
28615	The protein encoded by this gene regulates the turnover of diphosphoinositol polyphosphates. The turnover of these high-energy diphosphoinositol polyphosphates represents a molecular switching activity with important regulatory consequences. Molecular switching by diphosphoinositol polyphosphates may contribute to regulating intracellular trafficking. Several alternatively spliced transcript variants have been described, but the full-length nature of some variants has not been determined. Isoforms DIPP2alpha and DIPP2beta are distinguishable from each other solely by DIPP2beta possessing one additional amino acid due to intron boundary skidding in alternate splicing.
28616	
28617	
28618	
28619	
28620	
28621	The protein encoded by this gene is involved in mRNA splicing and may play a role in the determination of alternative splicing. The encoded nuclear protein belongs to the splicing factor SR family and has been shown to bind with and modulate another member of the family, SFRS12.
28622	
28623	
28624	
28625	
28626	
28627	This gene encodes a protein of unknown function. However, similarity to proteins in yeast and other species suggests that this protein may be an N-acetyltransferase.
28628	
28629	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
28630	
28631	
28632	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L23P family of ribosomal proteins. It is located in the cytoplasm. The protein may be one of the target molecules involved in mediating growth inhibition by interferon. In yeast, the corresponding protein binds to a specific site on the 26S rRNA. This gene is co-transcribed with the U42A, U42B, U101A, and U101B small nucleolar RNA genes, which are located in its third, first, second, and fourth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
28633	
28634	
28635	
28636	
28637	
28638	
28639	
28640	
28641	
28642	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem intracytoplasmic catalytic domains, and thus represents a receptor-type PTP. The extracellular region of this PTP contains a carbonic anhydrase-like (CAH) domain, which is also found in the extracellular region of PTPRBETA/ZETA. This gene is located in a chromosomal region that is frequently deleted in renal cell carcinoma and lung carcinoma, thus is thought to be a candidate tumor suppressor gene.
28643	
28644	
28645	
28646	
28647	
28648	The protein encoded by this gene is located in the cytoplasm and is part of the GAP1 family of GTPase-activating proteins. The gene product stimulates the GTPase activity of normal RAS p21 but not its oncogenic counterpart. Acting as a suppressor of RAS function, the protein enhances the weak intrinsic GTPase activity of RAS proteins resulting in the inactive GDP-bound form of RAS, thereby allowing control of cellular proliferation and differentiation. Mutations leading to changes in the binding sites of either protein are associated with basal cell carcinomas. Alternative splicing results in two isoforms where the shorter isoform, lacking the N-terminal hydrophobic region but retaining the same activity, appears to be abundantly expressed in placental but not adult tissues.
28649	
28650	
28651	
28652	
28653	
28654	
28655	
28656	
28657	
28658	
28659	
28660	
28661	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
28662	
28663	
28664	
28665	
28666	
28667	
28668	
28669	
28670	ETFA participates in catalyzing the initial step of the mitochondrial fatty acid beta-oxidation.  It shuttles electrons between primary flavoprotein dehydrogenases and the membrane-bound electron transfer flavoprotein ubiquinone oxidoreductase. Defects in electron-transfer-flavoprotein have been implicated in type II glutaricaciduria in which multiple acyl-CoA dehydrogenase deficiencies result in large excretion of glutaric, lactic, ethylmalonic, butyric, isobutyric, 2-methyl-butyric, and isovaleric acids.
28671	
28672	This gene encodes a member of the plakin protein family of adhesion junction plaque proteins. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the full-length nature of some variants has not been defined. It has been known that some isoforms are expressed in neural and muscle tissue, anchoring neural intermediate filaments to the actin cytoskeleton, and some isoforms are expressed in epithelial tissue, anchoring keratin-containing intermediate filaments to hemidesmosomes. Consistent with the expression, mice defective for this gene show skin blistering and neurodegeneration.
28673	
28674	
28675	
28676	
28677	
28678	
28679	The Axin-related protein, Axin2, presumably plays an important role in the regulation of the stability of beta-catenin in the Wnt signaling pathway, like its rodent homologs, mouse conductin/rat axil. In mouse, conductin organizes a multiprotein complex of APC (adenomatous polyposis of the colon), beta-catenin, glycogen synthase kinase 3-beta, and conductin, which leads to the degradation of beta-catenin. Apparently, the deregulation of beta-catenin is an important event in the genesis of a number of malignancies. The AXIN2 gene has been mapped to 17q23-q24, a region that shows frequent loss of heterozygosity in breast cancer, neuroblastoma, and other tumors. Mutations in this gene have been associated with colorectal cancer with defective mismatch repair.
28680	
28681	
28682	
28683	
28684	
28685	
28686	
28687	
28688	
28689	This gene product is a member of a family of proteins characterized by a specific cysteine-rich C-terminal domain, that is involved in transcriptional regulation of viral genome expression. Translation initiation from two different sites, including a non-AUG (GUG) codon, results in the production of two different isoforms (p32 and p40) which have different subcellular localization; p32 is mainly found in the cytoplasm, whereas p40 is targeted to the nucleolus. Both isoforms have transcriptional regulatory activity that is attributable to the cysteine-rich C-terminal domain.
28690	
28691	
28692	
28693	
28694	
28695	
28696	
28697	
28698	
28699	
28700	
28701	
28702	
28703	
28704	
28705	
28706	The nuclear pore complex is a massive structure that extends across the nuclear envelope, forming a gateway that regulates the flow of macromolecules between the nucleus and the cytoplasm. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells. The protein encoded by this gene is a member of the FG-repeat containing nucleoporins that functions as a soluble cofactor in importin-alpha:beta-mediated nuclear protein import. Pseudogenes of this gene are found on chromosomes 5, 6, and 14. Three transcript variants encoding two different isoforms have been found for this gene.
28707	
28708	
28709	
28710	
28711	
28712	
28713	
28714	
28715	
28716	Transforming acidic coiled-coil proteins are a conserved family of centrosome- and microtubule-interacting proteins that are implicated in cancer. This gene encodes a protein that concentrates at centrosomes throughout the cell cycle. This gene lies within a chromosomal region associated with tumorigenesis. Expression of this gene is thought to affect the progression of breast tumors. Expression of this gene is also induced by erythropoietin.
28717	
28718	
28719	
28720	
28721	
28722	
28723	
28724	
28725	
28726	
28727	
28728	
28729	
28730	
28731	
28732	
28733	
28734	
28735	
28736	
28737	
28738	
28739	
28740	
28741	
28742	
28743	
28744	
28745	
28746	
28747	
28748	
28749	
28750	
28751	
28752	
28753	Carbonyl reductase 3 catalyzes the reduction of a large number of biologically and pharmacologically active carbonyl compounds to their corresponding alcohols.  The enzyme is classified as a monomeric NADPH-dependent oxidoreductase.  CBR3 contains three exons spanning 11.2 kilobases and is closely linked to another carbonyl reductase gene - CBR1.
28754	
28755	
28756	
28757	
28758	
28759	TIRP is a Toll/interleukin-1 receptor (IL1R; MIM 147810) (TIR) domain-containing adaptor protein involved in Toll receptor signaling (see TLR4; MIM 603030).[supplied by OMIM]
28760	
28761	Hyperpolarization-activated cation channels of the HCN gene family, such as HCN1, contribute to spontaneous rhythmic activity in both heart and brain.[supplied by OMIM]
28762	
28763	
28764	
28765	
28766	
28767	
28768	
28769	
28770	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
28771	
28772	
28773	
28774	
28775	
28776	
28777	
28778	
28779	
28780	This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and a single representative of the other 3. The proton channel consists of three main subunits (a, b, c). This gene encodes the epsilon subunit of the catalytic core. Two pseudogenes of this gene are located on chromosomes 4 and 13.
28781	
28782	
28783	
28784	
28785	
28786	This gene encodes a nuclear protein expressed in a variety of tissues. The rat homolog, Rp8, is transiently expressed in immature thymocytes and is thought to be involved in programmed cell death. Expression of the human gene has been shown to be repressed by BCL6, a transcriptional repressor required for lymph node germinal center development, suggesting that BCL6 regulates apoptosis by its effects on PDCD2. This gene is closely linked on chromosome 6 to the gene for TBP, the TATA binding protein. Two transcripts encoding different proteins have been identified.
28787	
28788	
28789	
28790	
28791	
28792	This gene encodes a protein that contains a MIT (Microtubule Interacting and Trafficking molecule) domain. This protein may be involved in endosomal trafficking, microtubule dynamics, or both functions. Mutations associated with this gene cause autosomal recessive spastic paraplegia 20 (Troyer syndrome).
28793	
28794	
28795	
28796	
28797	
28798	
28799	
28800	
28801	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
28802	
28803	
28804	
28805	The reticulons are a group of highly conserved genes with preferential expression in neuroendocrine tissues (see, e.g., RTN1; MIM 600865).[supplied by OMIM]
28806	
28807	
28808	
28809	
28810	
28811	
28812	
28813	RAN (ras-related nuclear protein) is a small GTP binding protein belonging to the RAS superfamily that is essential for the translocation of RNA and proteins through the nuclear pore complex. The RAN protein is also involved in control of DNA synthesis and cell cycle progression. Nuclear localization of RAN requires the presence of regulator of chromosome condensation 1 (RCC1). Mutations in RAN disrupt DNA synthesis. Because of its many functions, it is likely that RAN interacts with several other proteins. RAN regulates formation and organization of the microtubule network independently of its role in the nucleus-cytosol exchange of macromolecules. RAN could be a key signaling molecule regulating microtubule polymerization during mitosis. RCC1 generates a high local concentration of RAN-GTP around chromatin which, in turn, induces the local nucleation of microtubules.  RAN is an androgen receptor (AR) coactivator that binds differentially with different lengths of polyglutamine within the androgen receptor. Polyglutamine repeat expansion in the AR is linked to Kennedy@#$%&s disease (X-linked spinal and bulbar muscular atrophy). RAN coactivation of the AR diminishes with polyglutamine expansion within the AR, and this weak coactivation may lead to partial androgen insensitivity during the development of Kennedy@#$%&s disease.
28814	
28815	
28816	
28817	
28818	
28819	
28820	
28821	
28822	
28823	
28824	
28825	
28826	
28827	
28828	
28829	
28830	
28831	
28832	The protein encoded by this gene belongs to the SMAD, a family of proteins similar to the gene products of the Drosophila gene @#$%&mothers against decapentaplegic@#$%& (Mad) and the C. elegans gene Sma. SMAD proteins are signal transducers and transcriptional modulators that mediate multiple signaling pathways. This protein mediates the signal of the transforming growth factor (TGF)-beta, and thus regulates multiple cellular processes, such as cell proliferation, apoptosis, and differentiation. This protein is recruited to the TGF-beta receptors through its interaction with the SMAD anchor for receptor activation (SARA) protein. In response to TGF-beta signal, this protein is phosphorylated by the TGF-beta receptors. The phosphorylation induces the dissociation of this protein with SARA and the association with the family member SMAD4. The association with SMAD4 is important for the translocation of this protein into the nucleus, where it binds to target promoters and forms a transcription repressor complex with other cofactors. This protein can also be phosphorylated by activin type 1 receptor kinase, and mediates the signal from the activin. Alternatively spliced transcript variants encoding the same protein have been observed.
28833	
28834	
28835	
28836	
28837	
28838	
28839	The protein encoded by this gene is an RNA binding protein and possible splicing factor. The encoded protein is found in the nucleus, where it colocalizes with core spliceosomal proteins. Studies of a mouse protein with high sequence similarity to this protein suggest that this protein may act as a transcriptional coactivator for JUN/AP-1 and estrogen receptors. Multiple transcript variants encoding several different isoforms have been found for this gene.
28840	The protein encoded by this gene is a cytokine that belongs to the tumor necrosis factor (TNF) ligand family. This protein preferentially induces apoptosis in transformed and tumor cells, but does not appear to kill normal cells although it is expressed at a significant level in most normal tissues. This protein binds to several members of TNF receptor superfamily including TNFRSF10A/TRAILR1, TNFRSF10B/TRAILR2, TNFRSF10C/TRAILR3, TNFRSF10D/TRAILR4, and possibly also to TNFRSF11B/OPG. The activity of this protein may be modulated by binding to the decoy receptors TNFRSF10C/TRAILR3, TNFRSF10D/TRAILR4, and TNFRSF11B/OPG that cannot induce apoptosis. The binding of this protein to its receptors has been shown to trigger the activation of MAPK8/JNK, caspase 8, and caspase 3.
28841	
28842	Sialyltransferases such as beta-galactoside alpha-2,6-sialyltransferase II (ST6GalII; EC 2.4.99.1) are type 2 transmembrane proteins that catalyze the transfer of sialic acid from CMP-sialic acid to an acceptor carbohydrate, usually to the terminal ends of carbohydrate chains.[supplied by OMIM]
28843	
28844	
28845	
28846	
28847	
28848	This gene encodes a protein containing four conserved nuclear localization signals. The encoded protein may function during the development of the eyes, teeth, and brain. Mutations in this gene have been shown to cause Nance-Horan syndrome. An alternative splice variant has been described, but its full-length nature has not been determined.
28849	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a non-ATPase subunit of the 19S regulator. Two transcripts encoding different isoforms have been described. A pseudogene has been identified on chromosome 3.
28850	
28851	
28852	
28853	
28854	
28855	
28856	
28857	
28858	
28859	
28860	N-methyl-D-aspartate (NMDA) receptors are a class of ionotropic glutamate receptors.  NMDA channel has been shown to be involved in long-term potentiation, an activity-dependent increase in the efficiency of synaptic transmission thought to underlie certain kinds of memory and learning.  NMDA receptor channels are heteromers composed of the key receptor subunit NMDAR1 (GRIN1) and 1 or more of the 4 NMDAR2 subunits: NMDAR2A (GRIN2A), NMDAR2B (GRIN2B), NMDAR2C (GRIN2C), and NMDAR2D (GRIN2D).
28861	
28862	
28863	
28864	
28865	
28866	
28867	
28868	
28869	
28870	
28871	
28872	
28873	
28874	
28875	
28876	
28877	
28878	
28879	
28880	
28881	
28882	
28883	
28884	
28885	Albumin is a soluble, monomeric protein which comprises about one-half of the blood serum protein. Albumin functions primarily as a carrier protein for steroids, fatty acids, and thyroid hormones and plays a role in stabilizing extracellular fluid volume. Mutations in this gene on chromosome 4 result in various anomalous proteins. Albumin is a globular unglycosylated serum protein of molecular weight 65,000. The human albumin gene is 16,961 nucleotides long from the putative @#$%&cap@#$%& site to the first poly(A) addition site. It is split into 15 exons which are symmetrically placed within the 3 domains that are thought to have arisen by triplication of a single primordial domain. Albumin is synthesized in the liver as preproalbumin which has an N-terminal peptide that is removed before the nascent protein is released from the rough endoplasmic reticulum. The product, proalbumin, is in turn cleaved in the Golgi vesicles to produce the secreted albumin.
28886	
28887	
28888	
28889	
28890	
28891	
28892	
28893	
28894	
28895	
28896	
28897	
28898	
28899	
28900	
28901	
28902	
28903	
28904	
28905	
28906	
28907	The protein encoded by this gene is a member of the tripartite motif (TRIM) family, also known as the @#$%&RING-B box-coiled coil@#$%& (RBCC) subgroup of RING finger proteins. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein forms homodimers which associate with microtubules in the cytoplasm. The protein is likely involved in the formation of multiprotein structures acting as anchor points to microtubules. Mutations in this gene have been associated with the X-linked form of Opitz syndrome, which is characterized by midline abnormalities such as cleft lip, laryngeal cleft, heart defects, hypospadias, and agenesis of the corpus callosum. This gene was also the first example of a gene subject to X inactivation in human while escaping it in mouse. Several different transcript variants are generated by alternate splicing; however, the full length nature of two variants has not been determined.
28908	
28909	
28910	
28911	
28912	
28913	
28914	
28915	
28916	
28917	
28918	
28919	
28920	
28921	
28922	
28923	
28924	The ADF (actin-depolymerizing factor)/cofilin family (see MIM 601442) is composed of stimulus-responsive mediators of actin dynamics. ADF/cofilin proteins are inactivated by kinases such as LIM domain kinase-1 (LIMK1; MIM 601329). The SSH family appears to play a role in actin dynamics by reactivating ADF/cofilin proteins in vivo (Niwa et al., 2002).[supplied by OMIM]
28925	
28926	
28927	
28928	
28929	
28930	
28931	
28932	
28933	
28934	
28935	
28936	
28937	
28938	
28939	
28940	
28941	
28942	
28943	
28944	
28945	
28946	
28947	
28948	
28949	
28950	
28951	
28952	
28953	
28954	
28955	
28956	
28957	
28958	
28959	
28960	
28961	
28962	
28963	
28964	
28965	
28966	
28967	
28968	
28969	
28970	
28971	
28972	
28973	
28974	
28975	
28976	
28977	
28978	
28979	
28980	
28981	
28982	
28983	
28984	
28985	
28986	
28987	
28988	
28989	
28990	
28991	
28992	USP18, a member of the deubiquitinating protease family of enzymes, removes ubiquitin adducts from a broad range of protein substrates.[supplied by OMIM]
28993	
28994	
28995	
28996	
28997	
28998	
28999	
29000	
29001	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. Alternative splicing of this gene results in 2 transcript variants encoding 2 isoforms with different sizes. Both of the isoforms bind to types I and II regulatory subunits of PKA and anchor them to mitochondria. As compared to the longer isoform, the shorter isoform lacks a K-homologous motif, which is an RNA-binding domain typically associated with proteins involved in RNA catalysis, mRNA processing, or translation. The longer isoform is speculated to be involved in the cAMP-dependent signal transduction pathway and in directing RNA to a specific cellular compartment. The function of the shorter isoform has not been determined.
29002	
29003	
29004	This gene encodes a member of the C-type lectin/C-type lectin-like domain (CTL/CTLD) superfamily. Members of this family share a common protein fold and have diverse functions, such as cell adhesion, cell-cell signalling, glycoprotein turnover, and roles in inflammation and immune response. The encoded type 2 transmembrane protein may play a role in inflammatory and immune response. Multiple transcript variants encoding distinct isoforms have been identified for this gene. This gene is closely linked to other CTL/CTLD superfamily members on chromosome 12p13 in the natural killer gene complex region.
29005	
29006	
29007	
29008	
29009	
29010	
29011	
29012	
29013	
29014	
29015	
29016	
29017	
29018	
29019	
29020	Exonucleolytic degradation of the poly(A) tail is often the first step in the decay of eukaryotic mRNAs.  The amino acid sequence of poly(A)-specific ribonuclease shows homology to the RNase D family of 3@#$%&-exonucleases. The protein appears to be localized in both the nucleus and the cytoplasm. It is not stably associated with polysomes or ribosomal subunits.
29021	
29022	
29023	
29024	
29025	
29026	
29027	
29028	
29029	
29030	
29031	
29032	
29033	
29034	
29035	This gene is located within the region of chromosome 9p that harbors tumor suppressor genes critical in carcinogenesis. It is an intronless gene which is downregulated in non-small-cell lung cancer and small-cell lung cancer cell lines, suggesting that it may play a role in lung tumorigenesis.
29036	
29037	
29038	
29039	
29040	
29041	
29042	
29043	
29044	
29045	
29046	
29047	
29048	
29049	
29050	
29051	
29052	
29053	
29054	
29055	
29056	
29057	
29058	
29059	This gene encodes a protein which interacts with clathrin and adaptor-related protein complex 2, alpha 1 subunit. The protein is found in a brain-derived clathrin-coated vesicle fraction and localizes to the peri-Golgi region and the cell periphery. The protein is thought to be involved in clathrin-mediated endocytosis. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
29060	
29061	
29062	
29063	
29064	Cytoplasmic ribosomes, organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L7AE family of ribosomal proteins. It can interact with a subclass of nuclear hormone receptors, including thyroid hormone receptor, and inhibit their ability to transactivate by preventing their binding to their DNA response elements. This gene is included in the surfeit gene cluster, a group of very tightly linked genes that do not share sequence similarity. It is co-transcribed with the U24, U36a, U36b, and U36c small nucleolar RNA genes, which are located in its second, fifth, fourth, and sixth introns, respectively. This gene rearranges with the trk proto-oncogene to form the chimeric oncogene trk-2h, which encodes an oncoprotein consisting of the N terminus of ribosomal protein L7a fused to the receptor tyrosine kinase domain of trk. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
29065	
29066	
29067	
29068	
29069	
29070	
29071	
29072	
29073	
29074	
29075	
29076	
29077	
29078	
29079	
29080	
29081	
29082	
29083	
29084	
29085	
29086	
29087	
29088	
29089	The protein encoded by this gene is localized in the nuclear matrix. It may play a role in transcription or may interact with other nuclear matrix proteins to form the internal fibrogranular network. Two transcript variants encoding the same protein have been identified for this gene.
29090	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S3AE family of ribosomal proteins. It is located in the cytoplasm. Disruption of the gene encoding rat ribosomal protein S3a, also named v-fos transformation effector protein, in v-fos-transformed rat cells results in reversion of the transformed phenotype. Transcript variants utilizing alternative transcription start sites have been described. This gene is co-transcribed with the U73A and U73B small nucleolar RNA genes, which are located in its fourth and third introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
29091	
29092	
29093	
29094	Neurofascin is an L1 family immunoglobulin cell adhesion molecule (see L1CAM; MIM 308840) involved in axon subcellular targeting and synapse formation during neural development (Ango et al., 2004).[supplied by OMIM]
29095	
29096	
29097	
29098	
29099	
29100	
29101	
29102	
29103	
29104	Cytoplasmic ribosomes, organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein, which shares sequence similarity with yeast ribosomal protein L44, belongs to the L44E (L36AE) family of ribosomal proteins. Although this gene has been referred to as ribosomal protein L44 (RPL44), its official name is ribosomal protein L36a (RPL36A). This gene and the human gene officially named ribosomal protein L36a-like (RPL36AL) encode nearly identical proteins; however, they are distinct genes. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
29105	
29106	
29107	
29108	
29109	
29110	
29111	
29112	
29113	
29114	
29115	
29116	
29117	
29118	
29119	
29120	
29121	
29122	
29123	The protein encoded by this gene is a member of the tripartite motif (TRIM) family. The TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 2, and a coiled-coil region. This protein localizes to the nuclear matrix. It interacts with the enhancer of polycomb protein and represses gene transcription. It is also thought to be involved in the differentiation of male germ cells. Rearrangements of this gene and the RET gene result in a gene encoding the ret transforming protein. Two alternatively spliced transcript variants for this gene have been described.
29124	
29125	
29126	This gene encodes a protein belonging to the RAN-GTPase exportin family that mediates export of tRNA from the nucleus to the cytoplasm. Translocation of tRNA to the cytoplasm occurs once exportin has bound both tRNA and GTP-bound RAN.
29127	
29128	The product encoded by this gene belongs to the heat shock protein 70 family which contains both heat-inducible and constitutively expressed members. The latter are called heat-shock cognate proteins. This gene encodes a heat-shock cognate protein. This protein binds to nascent polypeptides to facilitate correct folding. It also functions as an ATPase in the disassembly of clathrin-coated vesicles during transport of membrane components through the cell. Two alternatively spliced variants have been characterized to date.
29129	
29130	
29131	
29132	
29133	
29134	
29135	
29136	
29137	Members of the F-box protein family, such as FBXL17, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
29138	
29139	
29140	
29141	
29142	
29143	
29144	
29145	
29146	
29147	Beta actin is one of six different actin isoforms which have been identified.  ACTB is one of the two nonmuscle cytoskeletal actins.  Actins are highly conserved proteins that are involved in cell motility, structure and integrity.  Alpha actins are a major constituent of the contractile apparatus.
29148	
29149	
29150	
29151	
29152	
29153	
29154	
29155	
29156	
29157	
29158	
29159	This protein is ubiquitous and highly conserved. It carries out the enzymatic reduction of methionine sulfoxide to methionine. Human and animal studies have shown the highest levels of expression in kidney and nervous tissue. Its proposed function is the repair of oxidative damage to proteins to restore biological activity.
29160	
29161	
29162	
29163	
29164	
29165	
29166	
29167	
29168	
29169	
29170	
29171	
29172	
29173	
29174	
29175	
29176	
29177	This gene is a member of the regulatory factor X gene family, which encodes transcription factors that contain a highly-conserved winged helix DNA binding domain. The protein encoded by this gene is structurally related to regulatory factors X1, X2, X3, and X5. It has been shown to interact with itself as well as with regulatory factors X2 and X3, but it does not interact with regulatory factor X1. This protein may be a transcriptional repressor rather than a transcriptional activator. Three transcript variants encoding different isoforms have been described for this gene.
29178	
29179	
29180	
29181	This gene encodes a protein that belongs to the non-histone chromosomal high mobility group (HMG) protein family. HMG proteins function as architectural factors and are essential components of the enhancesome. This protein contains structural DNA-binding domains and may act as a transcriptional regulating factor. Identification of the deletion, amplification, and rearrangement of this gene that are associated with myxoid liposarcoma suggests a role in adipogenesis and mesenchymal differentiation. A gene knock out study of the mouse counterpart demonstrated that this gene is involved in diet-induced obesity. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
29182	
29183	
29184	
29185	
29186	
29187	
29188	
29189	
29190	Cancer-associated chromosomal changes often involve regions containing fragile sites. This gene maps to a commom fragile site on chromosome 7q31.2 designated FRA7G. This gene is similar to mouse Testin, a testosterone-responsive gene encoding a Sertoli cell secretory protein containing three LIM domains. LIM domains are double zinc-finger motifs that mediate protein-protein interactions between transcription factors, cytoskeletal proteins and signaling proteins. Multiple protein isoforms are encoded by transcript variants of this gene.
29191	
29192	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
29193	
29194	
29195	
29196	Synovial sarcomas occur most frequently in the extremities around large joints. More than 90% of cases have a recurrent and specific chromosomal translocation, t(X;18)(p11.2;q11.2), in which the 5-prime end of the SS18 gene (MIM 600192) is fused in-frame to the 3-prime end of the SSX1 (MIM 312820), SSX2 (MIM 300192), or SSX4 (MIM 300326) gene. The SS18L1 gene is homologous to SS18.[supplied by OMIM]
29197	
29198	
29199	
29200	
29201	
29202	The vitamin A metabolite, all-trans retinoic acid (atRA), plays an important role in neuronal development, including neurite outgrowth. RAINB1 is an atRA-responsive gene.[supplied by OMIM]
29203	
29204	This gene encodes the coagulation factor XIII A subunit. Coagulation factor XIII is the last zymogen to become activated in the blood coagulation cascade. Plasma factor XIII is a heterotetramer composed of 2 A subunits and 2 B subunits. The A subunits have catalytic function, and the B subunits do not have enzymatic activity and may serve as a plasma carrier molecules. Platelet factor XIII is comprised only of 2 A subunits, which are identical to those of plasma origin. Upon activation by the cleavage of the activation peptide by thrombin and in the presence of calcium ion, the plasma factor XIII dissociates its B subunits and yields the same active enzyme, factor XIIIa, as platelet factor XIII. This enzyme acts as a transglutaminase to catalyze the formation of gamma-glutamyl-epsilon-lysine crosslinking between fibrin molecules, thus stabilizing the fibrin clot. It also crosslinks alpha-2-plasmin inhibitor, or fibronectin, to the alpha chains of fibrin. Factor XIII deficiency is classified into two categories: type I deficiency, characterized by the lack of both the A and B subunits; and type II deficiency, characterized by the lack of the A subunit alone. These defects can result in a lifelong bleeding tendency, defective wound healing, and habitual abortion.
29205	
29206	
29207	
29208	
29209	
29210	
29211	
29212	
29213	
29214	
29215	The protein encoded by this gene is a signal transducer shared by many cytokines, including interleukin 6 (IL6), ciliary neurotrophic factor (CNTF), leukemia inhibitory factor (LIF), and oncostatin M (OSM). This protein functions as a part of the cytokine receptor complex. The activation of this protein is dependent upon the binding of cytokines to their receptors. vIL6, a protein related to IL6 and encoded by the Kaposi sarcoma-associated herpesvirus, can bypass the interleukin 6 receptor (IL6R) and directly activate this protein. Knockout studies in mice suggested a critical role of the gene encoding this protein in regulating myocyte apoptosis. Alternatively spliced transcript variants encoding distinct isoforms have been described.
29216	This gene is one of the PMS2 gene family members which are found in clusters on chromosome 7. The product of this gene is involved in DNA mismatch repair. The protein forms a heterodimer with MLH1 and this complex interacts with MSH2 bound to mismatched bases. Defects in this gene are associated with hereditary nonpolyposis colorectal cancer, with a Turcot syndrome, and are a cause of supratentorial primitive neuroectodermal tumors. Alternatively spliced transcript variants encoding different isoforms have been identified.
29217	
29218	
29219	
29220	Human T-cell leukemia virus enhancer factor (HTLF) may be a forkhead domain binding protein. It may function in the transcriptional regulation of the human T-cell leukemia virus long terminal repeat.
29221	
29222	
29223	
29224	
29225	
29226	
29227	
29228	The protein encoded by this gene binds to an anti-proliferative protein, B-cell translocation protein 1, which negatively regulates cell proliferation. Binding of the two proteins, which is driven by phosphorylation of the anti-proliferative protein, causes signaling events in cell division that lead to changes in cell proliferation associated with cell-cell contact. The protein has both mouse and yeast orthologs. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
29229	
29230	
29231	
29232	
29233	
29234	
29235	
29236	
29237	
29238	
29239	
29240	
29241	
29242	
29243	
29244	This gene encodes a member of the 1-acylglycerol-3-phosphate O-acyltransferase family. This integral membrane protein converts lysophosphatidic acid to phosphatidic acid, the second step in de novo phospholipid biosynthesis. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
29245	
29246	
29247	This gene encodes a protein that is a member of the SUMO (small ubiquitin-like modifier) protein family. It functions in a manner similar to ubiquitin in that it is bound to target proteins as part of a post-translational modification system. However, unlike ubiquitin which targets proteins for degradation, this protein is involved in a variety of cellular processes, such as nuclear transport, transcriptional regulation, apoptosis, and protein stability. It is not active until the last two amino acids of the carboxy-terminus have been cleaved off. Numerous pseudogenes have been reported for this gene. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
29248	
29249	
29250	
29251	
29252	
29253	
29254	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has one RRM domain that binds to RNAs. Two alternatively spliced transcript variants have been described for this gene.
29255	
29256	
29257	
29258	
29259	
29260	
29261	
29262	
29263	
29264	This gene encodes a proprotein convertase belonging to the proteinase K subfamily of the secretory subtilase family. The encoded protein is synthesized as a soluble zymogen that undergoes autocatalytic intramolecular processing in the endoplasmic reticulum. The protein may function as a proprotein convertase. This protein plays a role in cholesterol homeostasis and may have a role in the differentiation of cortical neurons. Mutations in this gene have been associated with a third form of autosomal dominant familial hypercholesterolemia (HCHOLA3).
29265	
29266	
29267	
29268	
29269	
29270	
29271	The gamma globin genes (HBG1 and HBG2) are normally expressed in the fetal liver, spleen and bone marrow. Two gamma chains together with two alpha chains constitute fetal hemoglobin (HbF) which is normally replaced by adult hemoglobin (HbA) at birth. In some beta-thalassemias and related conditions, gamma chain production continues into adulthood. The two types of gamma chains differ at residue 136 where glycine is found in the G-gamma product (HBG2) and alanine is found in the A-gamma product (HBG1). The former is predominant at birth. The order of the genes in the beta-globin cluster is: 5@#$%&-epsilon -- gamma-G -- gamma-A -- delta -- beta--3@#$%&
29272	
29273	
29274	
29275	
29276	
29277	
29278	This gene encodes a member of a homeobox gene family similiar to the Drosophila distal-less gene. The encoded protein is localized to the nucleus where it may function as a transcriptional regulator of signals from multiple TGF-{beta} superfamily members, and may play a role in neuronal development in the brain.
29279	
29280	
29281	cAMP is a signaling molecule important for a variety of cellular functions. cAMP exerts its effects by activating the cAMP-dependent protein kinase (AMPK), which transduces the signal through phosphorylation of different target proteins. The inactive holoenzyme of AMPK is a tetramer composed of two regulatory and two catalytic subunits. cAMP causes the dissociation of the inactive holoenzyme into a dimer of regulatory subunits bound to four cAMP and two free monomeric catalytic subunits. Four different regulatory subunits and three catalytic subunits of AMPK have been identified in humans. The protein encoded by this gene is one of the regulatory subunits. This subunit can be phosphorylated by the activated catalytic subunit. It may interact with various A-kinase anchoring proteins and determine the subcellular localization of AMPK. This subunit has been shown to regulate protein transport from endosomes to the Golgi apparatus and further to the endoplasmic reticulum (ER).
29282	This intronless gene encodes a member of the MAGE superfamily. The encoded protein is 76% identical to the mouse mage-g1 protein.
29283	This gene encodes a double zinc finger (FYVE domain) protein that interacts directly with SMAD2 and SMAD3, and is involved in Alzheimer@#$%&s disease. SMAD proteins transmit signals from transmembrane serine/threonine kinase receptors to the nucleus. The FYVE domain has been identified in a number of unrelated signaling molecules. This protein functions to recruit SMAD2 to the transforming growth factor-beta receptor. The FYVE domain is required to maintain the normal localization of this protein but is not involved in mediating interaction with SMADs. The C-terminal domain of this protein interacts with the TGFB receptor. This protein is a component of the TGFB pathway that brings the SMAD substrate to the receptor. Three alternatively spliced transcripts encoding distinct isoforms have been found for this gene.
29284	
29285	
29286	
29287	
29288	
29289	
29290	
29291	
29292	
29293	This gene encodes a cysteine protease that has a strict specificity for hydrolysis of asparaginyl bonds. This enzyme may be involved in the processing of bacterial peptides and endogenous proteins for MHC class II presentation in the lysosomal/endosomal systems. Enzyme activation is triggered by acidic pH and appears to be autocatalytic. Protein expression occurs after monocytes differentiate into dendritic cells. A fully mature, active enzyme is produced following lipopolysaccharide expression in mature dendritic cells. Overexpression of this gene may be associated with the majority of solid tumor types. This gene has a pseudogene on chromosome 13. Several alternatively spliced transcript variants have been described, but the biological validity of only two has been determined. These two variants encode the same isoform.
29294	
29295	
29296	
29297	
29298	
29299	
29300	
29301	
29302	
29303	
29304	
29305	
29306	
29307	
29308	
29309	
29310	
29311	
29312	The laminar organization of multiple neuronal types in the cerebral cortex is required for normal cognitive function. In mice, the disabled-1 gene plays a central role in brain development, directing the migration of cortical neurons past previously formed neurons to reach their proper layer. This gene is similar to disabled-1, and the protein encoded by this gene is thought to be a signal transducer that interacts with protein kinase pathways to regulate neuronal positioning in the developing brain. Alternatively spliced transcript variants of this gene have been reported, but their full length nature has not been determined.
29313	
29314	
29315	This gene encodes poly(ADP-ribosyl)transferase-like 1 protein, which contains a catalytic domain and is capable of catalyzing a poly(ADP-ribosyl)ation reaction. This protein has a catalytic domain which is homologous to that of poly (ADP-ribosyl) transferase, but lacks an N-terminal DNA binding domain which activates the C-terminal catalytic domain of poly (ADP-ribosyl) transferase. Since this protein is not capable of binding DNA directly, its transferase activity may be activated by other factors such as protein-protein interaction mediated by the extensive carboxyl terminus.
29316	Apolipoprotein (Apo-) A-II is the second most abundant protein of the high density lipoprotein particles. The apo-A-II gene consists of 4 exons and 3 introns.  The four exons encode the 5@#$%& untranslated region, pre-peptide, a short N-terminal domain and a C-terminal domain composed of a variable number of lipid-binding amphipathic helices.  Familial apo-A-II deficiency may result from a splice-junction alteration which blocks splicing of intron 3 from the primary transcript and result the formation of a non-functional mRNA.
29317	
29318	
29319	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. Members of the PTP family share a highly conserved catalytic motif, which is essential for the catalytic activity. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. Epidermal growth factor receptor and the adaptor protein Shc were reported to be substrates of this PTP, which suggested the roles in growth factor mediated cell signaling. Three alternatively spliced variants of this gene, which encode isoforms differing at their extreme C-termini, have been described. The different C-termini are thought to determine the substrate specificity, as well as the cellular localization of the isoforms. Two highly related but distinctly processed pseudogenes that localize to distinct chromosomes have been reported.
29320	
29321	
29322	
29323	This gene encodes a potential transmembrane protein that may function in vesicle-mediated transport and sorting of proteins within the cell. This protein may play a role in the development and the function of the eye, hematological system, and central nervous system. Mutations in this gene have been associated with Cohen syndrome. Multiple splice variants encoding distinct isoforms have been identified for this gene.
29324	
29325	This gene produces two transcripts encoding two distinct proteins. One protein is a transcriptional repressor, while the other isoform is a major component of specialized synapses known as synaptic ribbons. Both proteins contain a NAD+ binding domain similar to NAD+-dependent 2-hydroxyacid dehydrogenases. A portion of the 3@#$%& untranslated region was used to map this gene to chromosome 21q21.3; however, it was noted that similar loci elsewhere in the genome are likely. Blast analysis shows that this gene is present on chromosome 10.
29326	
29327	The protein encoded by this gene is member of the GAP1 family of GTPase-activating proteins. The gene product stimulates the GTPase activity of normal RAS p21 but not its oncogenic counterpart. Acting as a suppressor of RAS function, the protein enhances the weak intrinsic GTPase activity of RAS proteins resulting in the inactive GDP-bound form of RAS, thereby allowing control of cellular proliferation and differentiation. This particular family member has a perinuclear localization and is an inositol 1,3,4,5-tetrakisphosphate-binding protein; a compound suggested to function as a second messenger.
29328	
29329	
29330	
29331	
29332	
29333	This gene encodes a protein that is a member of the SUMO (small ubiquitin-like modifier) protein family. It functions in a manner similar to ubiquitin in that it is bound to target proteins as part of a post-translational modification system. However, unlike ubiquitin which targets proteins for degradation, this protein is involved in a variety of cellular processes, such as nuclear transport, transcriptional regulation, apoptosis, and protein stability. It is not active until the last two amino acids of the carboxy-terminus have been cleaved off. Numerous pseudogenes have been reported for this gene. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
29334	
29335	
29336	
29337	
29338	
29339	
29340	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L31E family of ribosomal proteins. It is located in the cytoplasm. Higher levels of expression of this gene in familial adenomatous polyps compared to matched normal tissues have been observed. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
29341	
29342	
29343	
29344	
29345	
29346	
29347	
29348	Proteolysis by the ubiquitin-proteasome system controls the concentration of many regulatory proteins. The selectivity of ubiquitylation is determined by ubiquitin E3 ligases, which recognize the substrate@#$%&s destabilization signal, or degron. The E3 ligase UBR2 participates in the N-end rule pathway, which targets proteins bearing an N-terminal degron, or N-degron (Kwon et al., 2003).[supplied by OMIM]
29349	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
29350	
29351	
29352	
29353	
29354	
29355	
29356	
29357	
29358	
29359	
29360	
29361	
29362	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10P family of ribosomal proteins. It is located in the cytoplasm. This gene is co-transcribed with the small nucleolar RNA gene U54, which is located in its second intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
29363	
29364	
29365	
29366	
29367	The protein encoded by this gene is a LIM domain protein. LIM domains are cysteine-rich double zinc fingers composed of 50 to 60 amino acids that are involved in protein-protein interactions. LIM domain-containing proteins are scaffolds for the formation of multiprotein complexes. The proteins are involved in cytoskeleton organization, cell lineage specification, organ development, and oncogenesis. The encoded protein is also a member of the Enigma class of proteins, a family of proteins that possess a 100-amino acid PDZ domain in the N terminus and 1 to 3 LIM domains in the C terminus. Multiple transcript variants encoding different isoforms have been found for this gene, although not all of them have been fully characterized.
29368	
29369	The zona pellucida is an extracellular matrix that surrounds the oocyte and early embryo. It is composed primarily of three or four glycoproteins with various functions during fertilization and preimplantation development. The nascent protein contains a N-terminal signal peptide sequence, a conserved ZP domain, a consensus furin cleavage site, and a C-terminal transmembrane domain. It is hypothesized that furin cleavage results in release of the mature protein from the plasma membrane for subsequent incorporation into the zona pellucida matrix. However, the requirement for furin cleavage in this process remains controversial based on mouse studies. Previously, this gene has been referred to as ZP1 or ZPB and thought to have similar functions as mouse Zp1. However, a human gene with higher similarity and chromosomal synteny to mouse Zp1 has been assigned the symbol ZP1 and this gene has been assigned the symbol ZP4.
29370	
29371	
29372	
29373	
29374	
29375	
29376	
29377	Laminins, a family of extracellular matrix glycoproteins, are the major noncollagenous constituent of basement membranes. They have been implicated in a wide variety of biological processes including cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis. Many of the effects of laminin are mediated through interactions with cell surface receptors. These receptors include members of the integrin family, as well as non-integrin laminin-binding proteins. This gene encodes a high-affinity, non-integrin family, laminin receptor 1. This receptor has been variously called 67 kD laminin receptor, 37 kD laminin receptor precursor (37LRP) and p40 ribosome-associated protein. The amino acid sequence of laminin receptor 1 is highly conserved through evolution, suggesting a key biological function. It has been observed that the level of the laminin receptor transcript is higher in colon carcinoma tissue and lung cancer cell line than their normal counterparts. Also, there is a correlation between the upregulation of this polypeptide in cancer cells and their invasive and metastatic phenotype. Multiple copies of this gene exist, however, most of them are pseudogenes thought to have arisen from retropositional events. Two alternatively spliced transcript variants encoding the same protein have been found for this gene.
29378	
29379	
29380	
29381	
29382	
29383	
29384	
29385	
29386	
29387	
29388	The MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism) family consists of six human proteins, two of which are involved in the production of leukotrienes and prostaglandin E, important mediators of inflammation. Other family members, demonstrating glutathione S-transferase and peroxidase activities, are involved in cellular defense against toxic, carcinogenic, and pharmacologically active electrophilic compounds. This gene encodes a protein that catalyzes the conjugation of glutathione to electrophiles and the reduction of lipid hydroperoxides. This protein is localized to the endoplasmic reticulum and outer mitochondrial membrane where it is thought to protect these membranes from oxidative stress. Four transcript variants of this gene encode one protein isoform.
29389	
29390	
29391	
29392	
29393	
29394	The protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White). This protein is a member of the White subfamily. It is involved in macrophage cholesterol and phospholipids transport, and may regulate cellular lipid homeostasis in other cell types. Several alternative splice variants have been identified.
29395	In cardiac myocytes, Ca(2+) concentrations alternate between high levels during contraction and low levels during relaxation. The increase in Ca(2+) concentration during contraction is primarily due to release of Ca(2+) from intracellular stores. However, some Ca(2+) also enters the cell through the sarcolemma (plasma membrane). During relaxation, Ca(2+) is sequestered within the intracellular stores. To prevent overloading of intracellular stores, the Ca(2+) that entered across the sarcolemma must be extruded from the cell. The Na(+)-Ca(2+) exchanger is the primary mechanism by which the Ca(2+) is extruded from the cell during relaxation. In the heart, the exchanger may play a key role in digitalis action. The exchanger is the dominant mechanism in returning the cardiac myocyte to its resting state following excitation.[supplied by OMIM]
29396	
29397	
29398	
29399	
29400	The protein encoded by this gene is a pyruvate kinase that catalyzes the production of phosphoenolpyruvate from pyruvate and ATP. This protein has been shown to interact with thyroid hormone, and thus may mediate cellular metabolic effects induced by thyroid hormones. This protein has been found to bind Opa protein, a bacterial outer membrane protein involved in gonococcal adherence to and invasion of human cells, suggesting a role of this protein in bacterial pathogenesis. Three alternatively spliced transcript variants encoding two distinct isoforms have been reported.
29401	
29402	
29403	
29404	
29405	
29406	
29407	
29408	
29409	
29410	
29411	
29412	
29413	
29414	
29415	
29416	
29417	
29418	
29419	
29420	
29421	
29422	This gene encodes a member of the tob/btg1 family of anti-proliferative proteins that have the potential to regulate cell growth. When exogenously expressed, this protein supresses cell growth in tissue culture. The protein undergoes phophorylation by a serine/threonine kinase, 90 kDa ribosomal S6 kinase. Interactions of this protein with the v-erb-b2 erythroblastic leukemia viral oncogene homolog 2 gene product p185 interferes with growth suppression. This protein inhibits T cell proliferation and transcription of cytokines and cyclins. The protein interacts with both mothers against decapentaplegic Drosophila homolog 2 and 4 to enhance their DNA binding activity. This interaction inhibits interleukin 2 transcription in T cells.
29423	
29424	
29425	
29426	
29427	
29428	
29429	
29430	This gene encodes a member of the annexin family. Members of this calcium-dependent phospholipid-binding protein family play a role in the regulation of cellular growth and in signal transduction pathways. This protein functions as an autocrine factor which heightens osteoclast formation and bone resorption. This gene has three pseudogenes located on chromosomes 4, 9 and 10, respectively. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
29431	
29432	
29433	
29434	
29435	
29436	The protein encoded by this gene contains multiple N-terminal WD40 domains and a C-terminal high mobility group (HMG) box. WD40 domains are found in a variety of eukaryotic proteins and may function as adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly. HMG boxes are found in many eukaryotic proteins involved in chromatin assembly, transcription and replication. Alternative splicing results in two transcript variants encoding different isoforms.
29437	
29438	This gene encodes a member of the DEAD box protein family. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The protein encoded by this gene shows ATPase activity in the presence of polynucleotides and associates with nucleoplasmic 65S preribosomal particles. This gene may be involved in ribosome synthesis, most likely during assembly of the large 60S ribosomal subunit.
29439	This gene encodes a small nuclear ribonucleoprotein that belongs to the SNRNP core protein family. The protein may act as a charged protein scaffold to promote SNRNP assembly or strengthen SNRNP-SNRNP interactions through nonspecific electrostatic contacts with RNA.
29440	
29441	Cofilin is a widely distributed intracellular actin-modulating protein that binds and depolymerizes filamentous F-actin and inhibits the polymerization of monomeric G-actin in a pH-dependent manner. It is involved in the translocation of actin-cofilin complex from cytoplasm to nucleus.[supplied by OMIM]
29442	
29443	
29444	
29445	
29446	
29447	
29448	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
29449	
29450	
29451	
29452	
29453	
29454	
29455	
29456	
29457	
29458	
29459	
29460	
29461	
29462	The protein encoded by this gene is localized in the nuclear matrix. It may play a role in transcription or may interact with other nuclear matrix proteins to form the internal fibrogranular network. Two transcript variants encoding the same protein have been identified for this gene.
29463	
29464	
29465	
29466	
29467	
29468	
29469	This gene encodes a protein that is a member of the SUMO (small ubiquitin-like modifier) protein family. It functions in a manner similar to ubiquitin in that it is bound to target proteins as part of a post-translational modification system. However, unlike ubiquitin which targets proteins for degradation, this protein is involved in a variety of cellular processes, such as nuclear transport, transcriptional regulation, apoptosis, and protein stability. It is not active until the last two amino acids of the carboxy-terminus have been cleaved off. Numerous pseudogenes have been reported for this gene. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
29470	
29471	
29472	
29473	
29474	
29475	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
29476	
29477	
29478	
29479	
29480	
29481	The protein encoded by this gene is a member of the PP2C family of Ser/Thr protein phosphatases. PP2C family members are known to be negative regulators of cell stress response pathways. The expression of this gene is induced in a p53-dependent manner in response to various environmental stresses. While being induced by tumor suppressor protein TP53/p53, this phosphatase negatively regulates the activity of p38 MAP kinase, MAPK/p38, through which it reduces the phosphorylation of p53, and in turn suppresses p53-mediated transcription and apoptosis. This phosphatase thus mediates a feedback regulation of p38-p53 signaling that contributes to growth inhibition and the suppression of stress induced apoptosis. This gene is located in a chromosomal region known to be amplified in breast cancer. The amplification of this gene has been detected in both breast cancer cell line and primary breast tumors, which suggests a role of this gene in cancer development.
29482	
29483	
29484	
29485	
29486	RICTOR and MTOR (FRAP1; MIM 601231) are components of a protein complex that integrates nutrient- and growth factor-derived signals to regulate cell growth (Sarbassov et al., 2004).[supplied by OMIM]
29487	
29488	
29489	
29490	The glycoprotein encoded by this gene functions as both an antihemophilic factor carrier and a platelet-vessel wall mediator in the blood coagulation system.  It is crucial to the hemostasis process.  Mutations in this gene or deficiencies in this protein result in von Willebrand@#$%&s disease. An unprocessed pseudogene has been found on chromosome 22.
29491	
29492	
29493	The evolutionarily conserved THRAP genes encode a family of proteins that regulate embryonic development. THRAP2 is involved in early development of the heart and brain (Muncke et al., 2003).[supplied by OMIM]
29494	
29495	
29496	
29497	
29498	
29499	
29500	
29501	
29502	
29503	
29504	
29505	Heparan sulfate biosynthetic enzymes are key components in generating a myriad of distinct heparan sulfate fine structures that carry out multiple biologic activities. This gene encodes heparan sulfate 2-O-sulfotransferase, a member of the heparan sulfate biosynthetic enzyme family. This family member transfers sulfate to the 2 position of the iduronic acid residue of heparan sulfate. The disruption of this gene resulted in no kidney formation in knockout embryonic mice, indicating that the absence of this enzyme may interfere with the signaling required for kidney formation.
29506	
29507	
29508	
29509	Moesin (for membrane-organizing extension spike protein) is a member of the ERM family which includes ezrin and radixin. ERM proteins appear to function as cross-linkers between plasma membranes and actin-based cytoskeletons. Moesin is localized to filopodia and other membranous protrusions that are important for cell-cell recognition and signaling and for cell movement.
29510	
29511	
29512	
29513	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation.
29514	
29515	
29516	
29517	
29518	
29519	
29520	
29521	
29522	
29523	
29524	
29525	
29526	KRT18 encodes the type I intermediate filament chain keratin 18. Keratin 18, together with its filament partner keratin 8, are perhaps the most commonly found members of the intermediate filament gene family. They are expressed in single layer epithelial tissues of the body. Mutations in this gene have been linked to cryptogenic cirrhosis. Two transcript variants encoding the same protein have been found for this gene.
29527	
29528	
29529	
29530	The protein encoded by this gene is a member of the SWI/SNF family of proteins and is highly similar to the brahma protein of Drosophila. Members of this family have helicase and ATPase activities and are thought to regulate transcription of certain genes by altering the chromatin structure around those genes. The encoded protein is part of the large ATP-dependent chromatin remodeling complex SNF/SWI, which is required for transcriptional activation of genes normally repressed by chromatin. Two transcript variants encoding different isoforms have been found for this gene, which contains a trinucleotide repeat (CAG) length polymorphism.
29531	
29532	
29533	
29534	
29535	
29536	
29537	
29538	
29539	
29540	
29541	This gene encodes a member of the Kruppel family of C2H2-type zinc-finger transcription factor proteins. The encoded protein acts as a transcriptional activator. Two transcript variants encoding distinct isoforms have been identified for this gene. Other transcript variants have been described, but their full length sequence has not been determined.
29542	
29543	
29544	
29545	
29546	
29547	
29548	
29549	
29550	The protein encoded by this gene is a member of the membrane-associated guanylate kinase homologue (MAGUK) family. Characterized by two WW domains, a guanylate kinase domain, and five PDZ domains, this protein interacts with the cytoplasmic region of BAI1. Together, these proteins may play a role in cell adhesion and signal transduction.
29551	This gene encodes a calcium-binding protein belonging to the penta-EF-hand protein family. Calcium binding is important for homodimerization and for conformational changes required for binding to other protein partners. This gene product participates in T cell receptor-,  Fas-, and glucocorticoid-induced programmed cell death. In mice deficient for this gene product, however, apoptosis was not blocked suggesting this gene product is functionally redundant.
29552	
29553	
29554	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. The encoded protein is expressed in endothelial cells, cultured fibroblasts, and osteosarcoma cells. It associates with protein kinases A and C and phosphatase, and serves as a scaffold protein in signal transduction. This protein and RII PKA colocalize at the cell periphery. This protein is a cell growth-related protein. Antibodies to this protein can be produced by patients with myasthenia gravis. Alternative splicing of this gene results in two transcript variants encoding different isoforms.
29555	
29556	
29557	
29558	
29559	The protein encoded by this gene is a RING-H2 zinc finger protein. It has been shown to be a ubiquitin protein ligase that targets LIM domain binding 1 (LDB1/CLIM), and causes proteasome-dependent degradation of LDB1. This protein and LDB1 are co-repressors of LHX1/LIM-1, a homeodomain transcription factor. Alternatively spliced transcript variants encoding the same protein have been reported.
29560	
29561	
29562	
29563	
29564	
29565	
29566	
29567	This gene encodes an isoform of the alpha subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This isoform (alpha 1) is expressed in brain, placenta, lung, liver, kidney, and pancreas, and the other isoform (alpha 2) is expressed in brain, heart and skeletal muscle. This isoform is identified as an autoantigen in 66% of patients with Felty syndrome. This gene has been found to have multiple copies on many chromosomes, some of which, if not all, represent different pseudogenes.
29568	
29569	
29570	
29571	
29572	The prompt clearance of cells undergoing apoptosis is critical during embryonic development, normal tissue turnover, inflammation, and autoimmunity. CED6 is an evolutionarily conserved adaptor protein required for efficient engulfment of apoptotic cells by phagocytes.[supplied by OMIM]
29573	
29574	
29575	The ABL1 protooncogene encodes a cytoplasmic and nuclear protein tyrosine kinase that has been implicated in processes of cell differentiation, cell division, cell adhesion, and stress response.  Activity of c-Abl protein is negatively regulated by its SH3 domain, and deletion of the SH3 domain turns ABL1 into an oncogene.  The t(9;22) translocation results in the head-to-tail fusion of the BCR (MIM:151410) and ABL1 genes present in many cases of chronic myelogeneous leukemia.  The DNA-binding activity of the ubiquitously expressed ABL1 tyrosine kinase is regulated by CDC2-mediated phosphorylation, suggesting a cell cycle function for ABL1.  The ABL1 gene is expressed as either a 6- or 7-kb mRNA transcript, with alternatively spliced first exons spliced to the common exons 2-11.
29576	
29577	
29578	
29579	
29580	
29581	
29582	Voltage-dependent Ca(2+) channels are multisubunit complexes found in the membrane of many excitable cells that regulate calcium entry (see MIM 601011). N-type calcium channels, which control neurotransmitter release from neurons, are dihydropyridine-insensitive and omega-conotoxin-sensitive. The alpha-1 subunit forms the pore through which calcium enters the cell, and is encoded by a family of at least 5 genes.[supplied by OMIM]
29583	This gene encodes a gluconeogenesis regulatory enzyme which catalyzes the hydrolysis of fructose 1,6-bisphosphate to fructose 6-phosphate and inorganic phosphate.
29584	
29585	
29586	
29587	
29588	
29589	
29590	
29591	
29592	
29593	
29594	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
29595	
29596	
29597	
29598	
29599	
29600	
29601	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 99% identical to the mouse and rat orthologs. This gene is upregulated in patients with amyotrophic lateral sclerosis. It contains in its 5@#$%& UTR a 6 bp tandem repeat sequence which is polymorphic, however, there is no correlation between the repeat number and the disease.
29602	
29603	
29604	SLCO4C1 belongs to the organic anion transporter (OATP) family. OATPs are involved in the membrane transport of bile acids, conjugated steroids, thyroid hormone, eicosanoids, peptides, and numerous drugs in many tissues (Mikkaichi et al., 2004).[supplied by OMIM]
29605	
29606	
29607	
29608	
29609	
29610	
29611	
29612	
29613	
29614	
29615	
29616	
29617	This gene encodes a protein that may be involved in the degradation of G proteins via the ubiquitin-dependent proteasome pathway. The encoded protein binds to members of subfamily A of the regulator of the G-protein signaling (RGS) family through an N-terminal leucine-rich region. This protein also has a central RING finger-like domain and E3 ubiquitin ligase activity. This protein is highly conserved from flies to humans. Defects in this gene may cause the autosomal recessive, infantile malignant form of osteopetrosis.
29618	
29619	
29620	
29621	
29622	
29623	
29624	
29625	
29626	
29627	
29628	
29629	
29630	
29631	
29632	
29633	
29634	
29635	
29636	
29637	Complex II of the respiratory chain, which is specifically involved in the oxidation of succinate, carries electrons from FADH to CoQ.  The complex is composed of four nuclear-encoded subunits and is localized in the mitochondrial inner membrane.  The subunit D protein is one of two integral membrane proteins anchoring the complex to the matrix side of the membrane.  Mutations in SDHD have been linked to hereditary paraganglioma.
29638	
29639	This gene encodes a member of the neutral endopeptidase (NEP)-related family. It is expressed specifically in the nervous system. The gene disruption in mouse embryonic stem cells results in neonatal lethality due to respiratory failure shortly after birth. Based on the specific expression of this gene and the phenotype of the gene deficiency in mouse embryos, it is suggested that the protein encoded by this gene play a critical role in the nervous regulation of the respiratory system.
29640	
29641	
29642	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
29643	
29644	
29645	
29646	
29647	
29648	
29649	
29650	
29651	
29652	
29653	
29654	
29655	
29656	
29657	
29658	
29659	
29660	
29661	
29662	
29663	
29664	
29665	The protein encoded by this gene is an adaptor protein that mediates the association of the heat shock proteins HSP70 and HSP90. This protein has been shown to be involved in the assembly process of glucocorticoid receptor, which requires the assistance of multiple molecular chaperones. The expression of this gene is reported to be downregulated in colorectal carcinoma tissue suggesting that is a candidate tumor suppressor gene.
29666	Actins are highly conserved proteins that are involved in various types of cell motility, and maintenance of the cytoskeleton.  In vertebrates, three main groups of actin isoforms, alpha, beta and gamma have been identified.  The alpha actins are found in muscle tissues and are a major constituent of the contractile apparatus.  The beta and gamma actins co-exist in most cell types as components of the cytoskeleton, and as mediators of internal cell motility.  Actin, gamma 1, encoded by this gene, is a cytoplasmic actin found in nonmuscle cells.
29667	
29668	
29669	
29670	
29671	
29672	
29673	
29674	
29675	
29676	This gene encodes a type II transmembrane glycoprotein belonging to the M28 peptidase family. Three functionally distinct proteins are encoded, including folylpoly-gamma-glutamate carboxypeptidase in the intestine, N-acetylated alpha-linked acidic dipeptidase 1 in the brain, and prostate-specific membrane antigen in the prostate. A mutation in the intestinal form may be associated with impaired intestinal absorption of dietary folates, resulting in low blood folate levels and consequent hyperhomocysteinemia. The form expressed in the brain may be involved in a number of pathological conditions associated with glutamate excitotoxicity. The prostate form is up-regulated in cancerous cells and is used as an effective diagnostic and prognostic indicator of prostate cancer. This gene likely arose from a duplication event of a nearby chromosomal region. Alternative splicing gives rise to multiple transcript variants.
29677	
29678	
29679	
29680	
29681	
29682	
29683	This gene encodes a member of the phosducin-like protein family and is a putative modulator of heterotrimeric G proteins. The protein shares extensive amino acid sequence homology with phosducin. Members of the phosducin-like protein family have been shown to bind to the beta-gamma subunits of G proteins.
29684	This gene encodes a member of the phosducin-like protein family and is a putative modulator of heterotrimeric G proteins. The protein shares extensive amino acid sequence homology with phosducin. Members of the phosducin-like protein family have been shown to bind to the beta-gamma subunits of G proteins.
29685	Vasoactive intestinal peptide (VIP) is a neuroprotective factor that has been shown to have a stimulatory effect on the growth of some tumor cells and an inhibitory effect on others. The protein encoded by this gene is upregulated by VIP and may be involved in the stimulatory effect of VIP on certain tumor cells. The encoded protein contains one homeobox and nine zinc finger domains, suggesting that it functions as a transcription factor. This gene is also upregulated in normal proliferative tissues. Finally, the encoded protein may increase the viability of certain cell types through modulation of p53 activity. Two transcript variants encoding the same protein have been found for this gene.
29686	
29687	
29688	
29689	
29690	
29691	
29692	Desmosomes are intercellular junctions that tightly link adjacent cells. Desmoplakin is an obligate component of functional desmosomes that anchors intermediate filaments to desmosomal plaques. The N-terminus of desmoplakin is required for localization to the desmosome and interacts with the N-terminal region of plakophilin 1 and plakoglobin. The C-terminus of desmoplakin binds with intermediate filaments. In the mid-region of desmoplakin, a coiled-coiled rod domain is responsible for homodimerization. Mutations in this gene are the cause of several cardiomyopathies and keratodermas as well as the autoimmune disease paraneoplastic pemphigus.
29693	Eukaryotic translation initiation factor 2 (EIF-2) functions in the early steps of protein synthesis by forming a ternary complex with GTP and initiator tRNA and binding to a 40S ribosomal subunit. EIF-2 is composed of three subunits, alpha, beta, and gamma, with the protein encoded by this gene representing the beta subunit. The beta subunit catalyzes the exchange of GDP for GTP, which recycles the EIF-2 complex for another round of initiation.
29694	
29695	
29696	
29697	
29698	
29699	
29700	
29701	
29702	
29703	This gene encodes one of the numerous actin-binding proteins which regulate the actin cytoskeleton. This protein binds F-actin, and also interacts with 5-lipoxygenase, which is the first committed enzyme in leukotriene biosynthesis. Although this gene has been reported to map to chromosome 17 in the Smith-Magenis syndrome region, the best alignments for this gene are to chromosome 16. The Smith-Magenis syndrome region is the site of two related pseudogenes.
29704	
29705	This gene encodes the light subunit of the ferritin protein. Ferritin is the major intracellular iron storage protein in prokaryotes and eukaryotes. It is composed of 24 subunits of the heavy and light ferritin chains. Variation in ferritin subunit composition may affect the rates of iron uptake and release in different tissues. A major function of ferritin is the storage of iron in a soluble and nontoxic state. Defects in this light chain ferritin gene are associated with several neurodegenerative diseases and hyperferritinemia-cataract syndrome. This gene has multiple pseudogenes.
29706	
29707	
29708	
29709	
29710	
29711	
29712	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L22P family of ribosomal proteins. It is located in the cytoplasm. This gene has been referred to as rpL23 because the encoded protein shares amino acid identity with ribosomal protein L23 from Halobacterium marismortui; however, its official symbol is RPL17. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
29713	
29714	
29715	
29716	
29717	
29718	
29719	
29720	
29721	
29722	This gene encodes one family member of vacuolar protein sorting 10 (VPS10) domain-containing receptor proteins. The VPS10 domain name comes from the yeast carboxypeptidase Y sorting receptor Vps10 protein. Members of this gene family are large with many exons but the CDS lengths are usually less than 3700 nt. Very large introns typically separate the exons encoding the VPS10 domain; the remaining exons are separated by much smaller-sized introns. These genes are strongly expressed in the central nervous system. Two of the five family members (sortilin and sortilin-related receptor) are synthesized as preproproteins; it is not yet known if this encoded protein is also a preproprotein.
29723	
29724	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit contains an N-terminal glutathione transferase domain, which may be involved in regulating the assembly of multisubunit complexes containing this elongation factor and aminoacyl-tRNA synthetases.
29725	
29726	
29727	
29728	
29729	
29730	
29731	
29732	
29733	This gene encodes a progestin-induced protein, which belongs to the HECT (homology to E6-AP carboxyl terminus) family. The HECT family proteins function as E3 ubiquitin-protein ligases, targeting specific proteins for ubiquitin-mediated proteolysis. This gene is localized to chromosome 8q22 which is disrupted in a variety of cancers. This gene potentially has a role in regulation of cell proliferation or differentiation.
29734	
29735	
29736	
29737	
29738	This gene encodes a member of the degenerin/epithelial sodium channel (DEG/ENaC) superfamily. The members of this family are amiloride-sensitive sodium channels that contain intracellular N and C termini, 2 hydrophobic transmembrane regions, and a large extracellular loop, which has many cysteine residues with conserved spacing. The member encoded by this gene may play a role in neurotransmission. In addition, a heteromeric association between this member and ACCN3 (variant 1) has been observed to co-assemble into proton-gated channels sensitive to gadolinium. Alternative splicing has been observed at this locus and two variants, encoding distinct isoforms, have been identified.
29739	
29740	CBP and p300 are large nuclear proteins that bind to many sequence-specific factors involved in cell growth and/or differentiation, including c-jun and the adenoviral oncoprotein E1A. The protein encoded by this gene associates with p300/CBP. It has in vitro and in vivo binding activity with CBP and p300, and competes with E1A for binding sites in p300/CBP. It has histone acetyl transferase activity with core histones and nucleosome core particles, indicating that this protein plays a direct role in transcriptional regulation.
29741	
29742	
29743	
29744	
29745	
29746	
29747	
29748	
29749	
29750	
29751	
29752	
29753	
29754	
29755	
29756	
29757	
29758	
29759	
29760	
29761	
29762	
29763	
29764	
29765	
29766	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth, and invasion. This growth factor lacks the N-terminal signal sequence present in most of the FGF family members, but it contains clusters of basic residues that have been demonstrated to act as a nuclear localization signal. When transfected into mammalian cells, this protein accumulated in the nucleus, but was not secreted. The specific function of this gene has not yet been determined. Two alternatively spliced transcript variants encoding distinct isoforms have been reported.
29767	
29768	
29769	This gene encodes a member of the IgLON subfamily in the immunoglobulin protein superfamily. The encoded protein is localized in the plasma membrane and may have an accessory role in opioid receptor function. This gene has an ortholog in rat and bovine. The opioid binding-cell adhesion molecule encoded by the rat gene binds opioid alkaloids in the presence of acidic lipids, exhibits selectivity for mu ligands and acts as a GPI-anchored protein. Since the encoded protein is highly conserved in species during evolution, it may have a fundamental role in mammalian systems. Two transcript variants encoding different isoforms have been found for this gene.
29770	
29771	
29772	
29773	
29774	
29775	
29776	
29777	
29778	
29779	
29780	
29781	
29782	
29783	
29784	
29785	
29786	
29787	
29788	
29789	
29790	
29791	
29792	
29793	
29794	
29795	
29796	This gene is a member of the RUNX family of transcription factors and encodes a nuclear protein with an Runt DNA-binding domain. This protein is essential for osteoblastic differentiation and skeletal morphogenesis, acting as a scaffold for nucleic acids and regulatory factors involved in skeletal gene expression. The protein can bind DNA both as a monomer or, with more affinity, as a subunit of a heterodimeric complex. Mutations in this gene have been associated with the bone development disorder cleidocranial dysplasia (CCD). Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
29797	
29798	
29799	
29800	
29801	
29802	
29803	
29804	
29805	
29806	
29807	
29808	Component C2 is part of the classical pathway of complement system.  Activated C1 cleaves C2 into C2a and C2b.  C2a leads to activation of C3.  Deficiency of C2 has been reported to associated with certain autoimmune diseases.
29809	Component C2 is part of the classical pathway of complement system.  Activated C1 cleaves C2 into C2a and C2b.  C2a leads to activation of C3.  Deficiency of C2 has been reported to associated with certain autoimmune diseases.
29810	
29811	
29812	
29813	
29814	
29815	
29816	
29817	
29818	Glial cell line-derived neurotrophic factor (GDNF) and neurturin (NTN) are two structurally related, potent neurotrophic factors that play key roles in the control of neuron survival and differentiation. The protein encoded by this gene is a member of the GDNF receptor family. It is a glycosylphosphatidylinositol(GPI)-linked cell surface receptor for both GDNF and NTN, and mediates activation of the RET tyrosine kinase receptor. This encoded protein acts preferentially as a receptor for NTN compared to its other family member, GDNF family receptor alpha 1. This gene is a candidate gene for RET-associated diseases.
29819	
29820	
29821	
29822	In yeast and rat, Sec15 is part of a multiprotein complex that is required for targeted exocytosis.[supplied by OMIM]
29823	
29824	
29825	
29826	This gene product constitutes one of the subunits of the multisubunit particle called exosome, which mediates mRNA degradation. The composition of human exosome is similar to its yeast counterpart. This protein is homologous to the yeast Mtr3 protein. Its exact function is not known, however, it has been shown using a cell-free RNA decay system that the exosome is required for rapid degradation of unstable mRNAs containing AU-rich elements (AREs), but not for poly(A) shortening. The exosome does not recognize ARE-containing mRNAs on its own, but requires ARE-binding proteins that could interact with the exosome and recruit it to unstable mRNAs, thereby promoting their rapid degradation.
29827	
29828	
29829	
29830	
29831	
29832	
29833	
29834	
29835	
29836	
29837	
29838	The protein encoded by this gene is predicted to be a five-pass transmembrane protein. This gene may be predominately expressed in brain.
29839	
29840	
29841	
29842	
29843	
29844	
29845	The CD8 antigen is a cell surface glycoprotein found on most cytotoxic T lymphocytes that mediates efficient cell-cell interactions within the immune system. The CD8 antigen, acting as a coreceptor, and the T-cell receptor on the T lymphocyte recognize antigen displayed by an antigen presenting cell (APC) in the context of class I MHC molecules. The functional coreceptor is either a homodimer composed of two alpha chains, or a heterodimer composed of one alpha and one beta chain. Both alpha and beta chains share significant homology to immunoglobulin variable light chains. This gene encodes the CD8 alpha chain isoforms. Two alternative transcripts encoding distinct isoforms, one membrane associated and one secreted, have been identified.
29846	
29847	
29848	
29849	
29850	
29851	
29852	
29853	
29854	
29855	
29856	
29857	
29858	
29859	
29860	
29861	
29862	
29863	
29864	
29865	
29866	
29867	
29868	
29869	
29870	
29871	
29872	
29873	
29874	
29875	The protein encoded by this gene is the human ortholog of bovine and rat dipeptidylpeptidase VI. They are transmembrane proteins and likely members of the dipeptidyl aminopeptidase family. Two transcript variants encoding different isoforms have been identified in all 3 species. In bovine and rat, the longer form has been shown to be expressed almost exclusively in brain, whereas the shorter form has a more widespread expression.
29876	
29877	
29878	
29879	
29880	
29881	
29882	
29883	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
29884	
29885	This gene encodes a DNA-binding protein that has high specificity for scaffold or matrix attachment region DNA elements (S/MAR DNA). This protein is thought to be involved in attaching the base of chromatin loops to the nuclear matrix but there is conflicting evidence as to whether this protein is a component of chromatin or a nuclear matrix protein. Scaffold attachment factors are a specific subset of nuclear matrix proteins (NMP) that specifically bind to S/MAR. This encoded protein is thought to serve as a molecular base to assemble a @#$%&transcriptosome complex@#$%& in the vicinity of actively transcribed genes. It is involved in the regulation of the heat shock protein 27 transcription and also can act as an estrogen receptor corepressor. This gene is a candidate gene for breast tumorigenesis.
29886	
29887	
29888	
29889	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Sequence analysis identified three transcript variants. Pseudogenes corresponding to this gene are found on chromosomes 6p, 10q, and Xp.
29890	
29891	
29892	
29893	
29894	
29895	
29896	
29897	
29898	
29899	
29900	
29901	
29902	
29903	This gene belongs to the dimethylarginine dimethylaminohydrolase (DDAH) gene family. The encoded enzyme plays a role in nitric oxide generation by regulating cellular concentrations of methylarginines, which in turn inhibit nitric oxide synthase activity.
29904	SLC7A11 is a member of a heteromeric Na(+)-independent anionic amino acid transport system highly specific for cystine and glutamate. In this system, designated system Xc(-), the anionic form of cystine is transported in exchange for glutamate.[supplied by OMIM]
29905	
29906	
29907	
29908	
29909	This gene is a member of the BCL2/adenovirus E1B 19 kd-interacting protein (BNIP) family.  It interacts with the E1B 19 kDa protein which is responsible for the protection of virally-induced cell death, as well as E1B 19 kDa-like sequences of BCL2, also an apoptotic protector.  The protein encoded by this gene is a functional homolog of BNIP3, a proapoptotic protein.  This protein may function simultaneously with BNIP3 and may play a role in tumor suppression.
29910	
29911	
29912	
29913	
29914	This gene encodes a calcium-binding protein belonging to the penta-EF-hand protein family. Calcium binding is important for homodimerization and for conformational changes required for binding to other protein partners. This gene product participates in T cell receptor-,  Fas-, and glucocorticoid-induced programmed cell death. In mice deficient for this gene product, however, apoptosis was not blocked suggesting this gene product is functionally redundant.
29915	
29916	
29917	
29918	
29919	
29920	
29921	
29922	
29923	
29924	
29925	
29926	
29927	
29928	
29929	
29930	
29931	
29932	
29933	
29934	
29935	
29936	
29937	
29938	
29939	
29940	
29941	
29942	Chromosomal protein HMG14 and its close analog HMG17 (MIM 163910) bind to the inner side of the nucleosomal DNA, potentially altering the interaction between the DNA and the histone octamer. The 2 proteins may be involved in the process that maintains transcribable genes in a unique chromatin conformation. Their ubiquitous distribution and relative abundance, as well as the high evolutionary conservation of the DNA-binding domain of the HMG14 family of proteins, suggest that they may be involved in an important cellular function.[supplied by OMIM]
29943	
29944	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
29945	
29946	Histone acetylation plays a key role in the regulation of eukaryotic gene expression. Histone acetylation and deacetylation are catalyzed by multisubunit complexes. The protein encoded by this gene is a component of the histone deacetylase complex which includes SIN3, SAP30, HDAC1, HDAC2, RbAp46, RbAp48 and other polypeptides. This protein directly interacts with SIN3 and enhances SIN3-mediated transcriptional repression when tethered to the promoter.
29947	
29948	
29949	
29950	
29951	
29952	
29953	
29954	
29955	
29956	
29957	
29958	
29959	
29960	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
29961	Cytochrome c oxidase (COX), the terminal enzyme of the mitochondrial respiratory chain, catalyzes the electron transfer from reduced cytochrome c to oxygen. It is a heteromeric complex consisting of 3 catalytic subunits encoded by mitochondrial genes and multiple structural subunits encoded by nuclear genes. The mitochondrially-encoded subunits function in the electron transfer and the nuclear-encoded subunits may function in the regulation and assembly of the complex. This nuclear gene encodes polypeptide 1 (liver isoform) of subunit VIa, and polypeptide 1 is found in all non-muscle tissues. Polypeptide 2 (heart/muscle isoform) of subunit VIa is encoded by a different gene, and is present only in striated muscles. These two polypeptides share 66% amino acid sequence identity. It has been reported that there may be several pseudogenes on chromosomes 1, 6, 7q21, 7q31-32 and 12. However, only one pseudogene (COX6A1P) on chromosome 1p31.1 has been documented.
29962	
29963	This gene is a member of the immunoglobulin (Ig) superfamily and is located in the major histocompatibility complex (MHC) class III region. The protein encoded by this gene is a glycosylated, plasma membrane-bound cell surface receptor, but soluble isoforms encoded by some transcript variants have been found in the endoplasmic reticulum and Golgi before being secreted. Seven transcript variants encoding different isoforms have been described for this gene.
29964	This gene is a member of the immunoglobulin (Ig) superfamily and is located in the major histocompatibility complex (MHC) class III region. The protein encoded by this gene is a glycosylated, plasma membrane-bound cell surface receptor, but soluble isoforms encoded by some transcript variants have been found in the endoplasmic reticulum and Golgi before being secreted. Seven transcript variants encoding different isoforms have been described for this gene.
29965	
29966	
29967	
29968	
29969	
29970	
29971	
29972	
29973	
29974	
29975	
29976	
29977	
29978	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
29979	
29980	
29981	
29982	The biosynthesis of lipid-linked oligosaccharides is highly conserved among eukaryotes and is catalyzed by 14 glycosyltransferases in an ordered stepwise manner. Mannosyltransferase I (MT I) catalyzes the first mannosylation step in this process.[supplied by OMIM]
29983	
29984	
29985	
29986	Isocitrate dehydrogenases catalyze the oxidative decarboxylation of isocitrate to 2-oxoglutarate. These enzymes belong to two distinct subclasses, one of which utilizes NAD(+) as the electron acceptor and the other NADP(+). Five isocitrate dehydrogenases have been reported: three NAD(+)-dependent isocitrate dehydrogenases, which localize to the mitochondrial matrix, and two NADP(+)-dependent isocitrate dehydrogenases, one of which is mitochondrial and the other predominantly cytosolic. Each NADP(+)-dependent isozyme is a homodimer. The protein encoded by this gene is the NADP(+)-dependent isocitrate dehydrogenase found in the cytoplasm and peroxisomes. It contains the PTS-1 peroxisomal targeting signal sequence. The presence of this enzyme in peroxisomes suggests roles in the regeneration of NADPH for intraperoxisomal reductions, such as the conversion of 2, 4-dienoyl-CoAs to 3-enoyl-CoAs, as well as in peroxisomal reactions that consume 2-oxoglutarate, namely the alpha-hydroxylation of phytanic acid. The cytoplasmic enzyme serves a significant role in cytoplasmic NADPH production.
29987	
29988	
29989	
29990	
29991	
29992	
29993	
29994	
29995	
29996	
29997	This gene encodes a protein that has been directly linked to Bardet-Biedl syndrome. The primary features of this syndrome include retinal dystrophy, obesity, polydactyly, renal abnormalities and learning disabilities. Experimentation in non-human eukaryotes suggests that this gene is expressed in ciliated cells and that it is involved in the formation of cilia. Alternate transcriptional splice variants have been characterized.
29998	
29999	
30000	
30001	
30002	
30003	
30004	
30005	
30006	
30007	
30008	
30009	
30010	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP is a large protein that possesses a PTP domain at C-terminus, and multiple noncatalytic domains, which include a domain with similarity to band 4.1 superfamily of cytoskeletal-associated proteins, a region consisting of five PDZ domains, and a leucine zipper motif. This PTP was found to interact with, and dephosphorylate Fas receptor, as well as IkappaBalpha through the PDZ domains, which suggested its role in Fas mediated programmed cell death. This PTP was also shown to interact with GTPase-activating protein, and thus may function as a regulator of Rho signaling pathway. Four alternatively spliced transcript variants, which encode distinct proteins, have been reported.
30011	
30012	
30013	This gene encodes a protein related to talin 1, a cytoskeletal protein that plays a significant role in the assembly of actin filaments and in spreading and migration of various cell types, including fibroblasts and osteoclasts. This protein has a different pattern of expression compared to talin 1 but, like talin 1, is thought to associate with unique transmembrane receptors to form novel linkages between extracellular matrices and the actin cytoskeleton.
30014	
30015	
30016	
30017	
30018	
30019	
30020	
30021	
30022	
30023	
30024	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
30025	
30026	
30027	
30028	
30029	
30030	The protein encoded by this gene binds to an anti-proliferative protein, B-cell translocation protein 1, which negatively regulates cell proliferation. Binding of the two proteins, which is driven by phosphorylation of the anti-proliferative protein, causes signaling events in cell division that lead to changes in cell proliferation associated with cell-cell contact. The protein has both mouse and yeast orthologs. Alternate splicing of this gene results in two transcript variants encoding different isoforms.
30031	
30032	
30033	Thioredoxins are small proteins that catalyze redox reactions. Using the reducing power of NADPH, the selenocysteine (sec)-containing flavoenzyme thioredoxin reductase (EC 1.6.4.5; see TR1, MIM 601112) maintains thioredoxins in the reduced state. The sec residue, which is essential for enzyme activity, is located in the penultimate C-terminal position and is encoded by a UGA codon. Sec differs from cys by substituting selenium, a better nucleophile, for sulfur. Termination of polypeptide synthesis at the UGA codon is prevented by a stem-loop structure called the selenocysteine insertion sequence (SECIS) located in the 3-prime untranslated region (UTR).[supplied by OMIM]
30034	L-type calcium channels are composed of five subunits. The protein encoded by this gene represents one of these subunits, gamma, and is one of several gamma subunit proteins. It is an integral membrane protein that is thought to stabilize the calcium channel in an inactive (closed) state. This gene is a member of the neuronal calcium channel gamma subunit gene subfamily of the PMP-22/EMP/MP20 family and is located in a cluster with two similar gamma subunit-encoding genes. Two transcript variants encoding different isoforms have been found for this gene.
30035	
30036	This gene is a member of the basic helix-loop-helix (BHLH) family of transcription factors. It activates transcription by binding to the E box (5@#$%&-CANNTG-3@#$%&). Dimerization with other BHLH proteins is required for efficient DNA binding. Involved in the determination of the neuronal precursors in the peripheral nervous system and the central nervous system.
30037	
30038	
30039	
30040	
30041	
30042	
30043	
30044	
30045	
30046	
30047	The protein encoded by this gene is a family member of enzymes known as ubiquitin carrier proteins, or E2s, which are an essential component of the ubiquitin-protein conjugation system. This protein exhibits E2 enzyme activity in its ability to form a thiol ester linkage with ubiquitin in a ubiquitin activating enzyme (E1)-dependent manner. This protein is the first member of this E2 family to be cloned from an epidermal source.
30048	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the proteasome B-type family, also known as the T1B family, that is a 20S core beta subunit. This gene is located in the class II region of the MHC (major histocompatibility complex). Expression of this gene is induced by gamma interferon and this gene product replaces catalytic subunit 1 (proteasome beta 6 subunit) in the immunoproteasome. Proteolytic processing is required to generate a mature subunit. Two alternative transcripts encoding different isoforms have been identified; both isoforms are processed to yield the same mature subunit.
30049	
30050	
30051	
30052	
30053	
30054	The leukemia inhibitory factor is a polyfunctional cytokine that affects the differentiation, survival, and proliferation of a wide variety of cells in the adult and the embryo. LIF action appears to be mediated through a high-affinity receptor complex composed of a low-affinity LIF binding chain (LIF receptor) and a high-affinity converter subunit, gp130. Both LIFR and gp130 are members of a family of cytokine receptors that includes components of the receptors for the majority of hematopoietic cytokines and for cytokines that affect other systems, including the ciliary neurotrophic factor, growth hormone and prolactin.
30055	
30056	
30057	This gene encodes a protein containing a SET domain, 2 LXXLL motifs, 3 nuclear translocation signals (NLSs), 4 plant homeodomain (PHD) finger regions, and a proline-rich region. The encoded protein enhances androgen receptor (AR) transactivation, and this enhancement can be increased further in the presence of other androgen receptor associated coregulators. This protein may act as a nucleus-localized, basic transcriptional factor and also as a bifunctional transcriptional regulator. Mutations of this gene have been associated with Sotos syndrome and Weaver syndrome. One version of childhood acute myeloid leukemia is the result of a cryptic translocation with the breakpoints occurring within nuclear receptor-binding Su-var, enhancer of zeste, and trithorac domain protein 1 on chromosome 5 and nucleoporin, 98-kd on chromosome 11. Two transcript variants encoding distinct isoforms have been identified for this gene.
30058	
30059	
30060	
30061	
30062	
30063	
30064	
30065	
30066	
30067	
30068	CDON and BOC (MIM 608708) are cell surface receptors of the immunoglobulin (Ig)/fibronectin type III (FNIII; see MIM 135600) repeat family involved in myogenic differentiation. CDON and BOC are coexpressed during development, form complexes with each other in a cis fashion, and are related to each other in their ectodomains, but each has a unique long cytoplasmic tail.[supplied by OMIM]
30069	
30070	
30071	
30072	
30073	
30074	Glutamate receptors are the predominant excitatory neurotransmitter receptors in the mammalian brain and are activated in a variety of normal neurophysiologic processes. These receptors are heteromeric protein complexes with multiple subunits, each possessing transmembrane regions, and all arranged to form a ligand-gated ion channel. The classification of glutamate receptors is based on their activation by different pharmacologic agonists. The GRIA1 belongs to a family of alpha-amino-3-hydroxy-5-methyl-4-isoxazole propionate (AMPA) receptors. Each of the members (GRIA1-4) include fip and flop isoforms generated by alternative RNA splicing resulting in a difference of 9 amino acids. The receptor subunits encoded by each isoform vary in their signal transduction properties. The isoform presented here is the flip isoform. AMPA-sensitive channels become impermeable to calcium by editing of mRNA in the coding region of the second transmembrane domain of glutamate receptor subunit. It involves a change of the base A in genomic DNA to the base G in mRNA. The GRIA2 subunit mRNA is completely edited in human brain.
30075	
30076	
30077	
30078	
30079	
30080	
30081	
30082	
30083	
30084	
30085	
30086	
30087	
30088	
30089	
30090	
30091	
30092	
30093	
30094	This gene encodes subunit 3 of the splicing factor 3b protein complex. Splicing factor 3b, together with splicing factor 3a and a 12S RNA unit, forms the U2 small nuclear ribonucleoproteins complex (U2 snRNP). The splicing factor 3b/3a complex binds pre-mRNA upstream of the intron@#$%&s branch site in a sequence independent manner and may anchor the U2 snRNP to the pre-mRNA. Splicing factor 3b is also a component of the minor U12-type spliceosome. Subunit 3 has also been identified as a component of the STAGA (SPT3-TAF(II)31-GCN5L acetylase) transcription coactivator-HAT (histone acetyltransferase) complex, and the TFTC (TATA-binding-protein-free TAF(II)-containing complex). These complexes may function in chromatin modification, transcription, splicing, and DNA repair.
30095	
30096	
30097	
30098	The protein encoded by this gene was identified by its RED repeat, a stretch of repeated arginine, glutamic acid and aspartic acid residues. The protein localizes to discrete dots within the nucleus, excluding the nucleolus. Its function is unknown. This gene maps to chromosome 5; however, a pseudogene may exist on chromosome 2.
30099	
30100	
30101	
30102	
30103	
30104	Selenoproteins, such as SELV, contain the rare twenty-first amino acid, selenocysteine (sec). These proteins lack common amino acid sequence motifs, but the 3-prime untranslated regions of selenoprotein genes have a common stem-loop structure, the sec insertion sequence (SECIS), that is necessary for the recognition of UGA as a sec codon rather than as a stop signal.[supplied by OMIM]
30105	
30106	
30107	
30108	
30109	
30110	
30111	
30112	
30113	
30114	
30115	
30116	
30117	
30118	
30119	
30120	
30121	
30122	
30123	
30124	
30125	
30126	
30127	
30128	
30129	
30130	
30131	
30132	
30133	
30134	
30135	
30136	
30137	
30138	
30139	
30140	
30141	
30142	Actins are highly conserved proteins that are involved in various types of cell motility, and maintenance of the cytoskeleton.  In vertebrates, three main groups of actin isoforms, alpha, beta and gamma have been identified.  The alpha actins are found in muscle tissues and are a major constituent of the contractile apparatus.  The beta and gamma actins co-exist in most cell types as components of the cytoskeleton, and as mediators of internal cell motility.  Actin, gamma 1, encoded by this gene, is a cytoplasmic actin found in nonmuscle cells.
30143	
30144	
30145	
30146	
30147	Components of the extracellular matrix exert myriad effects on tissues throughout the body. In particular, the laminins, a family of heterotrimeric extracellular glycoproteins, affect tissue development and integrity in such diverse organs as the kidney, lung, skin, and nervous system. It is thought that laminins mediate the attachment, migration, and organization of cells into tissues during embryonic development by interacting with other extracellular matrix components. Laminins function as heterotrimeric complexes of alpha, beta, and gamma chains, with each chain type representing a different subfamily of proteins. The protein encoded by this gene belongs to the alpha subfamily of laminin chains and is a major component of basement membranes. Two transcript variants encoding different isoforms have been found for this gene, but the full-length nature of one of them has not been determined.
30148	
30149	
30150	Reticulocalbin 2 is a calcium-binding protein located in the lumen of the ER. The protein contains six conserved regions with similarity to a high affinity Ca(+2)-binding motif, the EF-hand. The RCN2 gene maps to the same region as type 4 Bardet-Biedl syndrome (MIM:600374), suggesting a possible causative role for reticulocalbin 2 in the disorder.
30151	
30152	
30153	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S8P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30154	
30155	
30156	
30157	Hexokinases phosphorylate glucose to produce glucose-6-phosphate, thus committing glucose to the glycolytic pathway. This gene encodes hexokinase 2, the predominant form found in skeletal muscle. It localizes to the outer membrane of mitochondria. Expression of this gene is insulin-responsive, and studies in rat suggest that it is involved in the increased rate of glycolysis seen in rapidly growing cancer cells.
30158	
30159	
30160	
30161	
30162	
30163	
30164	The protein encoded by this gene is a member of the NR1 subfamily of nuclear hormone receptors. It is a DNA-binding protein that can bind as a monomer or as a homodimer to hormone response elements upstream of several genes to enhance the expression of those genes. The specific functions of this protein are not known, but it has been shown to interact with NM23-2, a nucleoside diphosphate kinase involved in organogenesis and differentiation.
30165	
30166	
30167	
30168	
30169	
30170	
30171	
30172	
30173	
30174	
30175	
30176	
30177	
30178	Sialic acids are terminal components of the carbohydrate chains of glycoconjugates involved in ligand-receptor, cell-cell, and cell-pathogen interactions. The two most common forms of sialic acid found in mammalian cells are N-acetylneuraminic acid (Neu5Ac) and its hydroxylated derivative, N-glycolylneuraminic acid (Neu5Gc). Studies of sialic acid distribution show that Neu5Gc is not detectable in normal human tissues although it was an abundant sialic acid in other mammals. Neu5Gc is, in actuality, immunogenic in humans. The absense of Neu5Gc in humans is due to a deletion within the human gene CMAH encoding cytidine monophosphate-N-acetylneuraminic acid hydroxylase, an enzyme responsible for Neu5Gc biosynthesis. Sequences encoding the mouse, pig, and chimpanzee hydroxylase enzymes were obtained by cDNA cloning and found to be highly homologous. However, the homologous human cDNA differs from these cDNAs by a 92-bp deletion in the 5@#$%& region. This deletion, corresponding to exon 6 of the mouse hydroxylase gene, causes a frameshift mutation and premature termination of the polypeptide chain in human. It seems unlikely that the truncated human hydroxylase mRNA encodes for an active enzyme explaining why Neu5Gc is undetectable in normal human tissues. Human genomic DNA also shows evidence of this deletion which does not occur in the genomes of African great apes. Nonetheless, the CMAH gene maps to 6p21.32 in humans and great apes indicating that mutation of the CMAH gene occurred following human divergence from chimpanzees and bonobos.
30179	
30180	
30181	
30182	This gene encodes a member of a highly conserved protein family that exists from archaea to vertebrates and plants. The encoded protein may function in RNA metabolism. Mutations within this gene are associated with Shwachman-Bodian-Diamond syndrome. An alternative transcript has been described, but its biological nature has not been determined. This gene has a closely linked pseudogene that is distally located.
30183	
30184	
30185	
30186	
30187	
30188	
30189	
30190	
30191	
30192	
30193	
30194	
30195	
30196	
30197	
30198	
30199	
30200	
30201	
30202	
30203	
30204	
30205	
30206	
30207	
30208	The protein encoded by this gene is a Src binding partner. It may represent a potential modulator of actin filament integrity in response to cellular signals, and may function as an adaptor protein by linking Src family members and/or other signaling proteins to actin filaments. Two alternative transcripts encoding the same protein have been identified.
30209	
30210	
30211	
30212	
30213	
30214	
30215	This gene encodes a type II transmembrane glycoprotein belonging to the M28 peptidase family. Three functionally distinct proteins are encoded, including folylpoly-gamma-glutamate carboxypeptidase in the intestine, N-acetylated alpha-linked acidic dipeptidase 1 in the brain, and prostate-specific membrane antigen in the prostate. A mutation in the intestinal form may be associated with impaired intestinal absorption of dietary folates, resulting in low blood folate levels and consequent hyperhomocysteinemia. The form expressed in the brain may be involved in a number of pathological conditions associated with glutamate excitotoxicity. The prostate form is up-regulated in cancerous cells and is used as an effective diagnostic and prognostic indicator of prostate cancer. This gene likely arose from a duplication event of a nearby chromosomal region. Alternative splicing gives rise to multiple transcript variants.
30216	
30217	Nuclear pore complexes are extremely elaborate structures that mediate the regulated movement of macromolecules between the nucleus and cytoplasm. These complexes are composed of at least 100 different polypeptide subunits, many of which belong to the nucleoporin family. Nucleoporins are pore complex-specific glycoproteins characterized by cytoplasmically oriented O-linked N-acetylglucosamine residues and numerous repeats of the pentapeptide sequence XFXFG. The protein encoded by this gene has three distinct domains: a N-terminal region within which a pore targeting domain has been identified, a central region containing multiple zinc finger motifs, and a C-terminal region containing multiple XFXFG repeats.
30218	
30219	
30220	
30221	
30222	
30223	
30224	
30225	
30226	
30227	
30228	Soluble guanylate cyclase (sGC), a heterodimeric protein consisting of an alpha and a beta subunit, catalyzes the conversion of GTP to the second messenger cGMP and functions as the main receptor for nitric oxide and nitrovasodilator drugs (Zabel et al., 1998).[supplied by OMIM]
30229	
30230	
30231	
30232	
30233	
30234	The protein encoded by this gene is a member of the RAB family of small GTPases. The GTP-bound form of the encoded protein has been shown to interact with early-endosomal antigen 1, and may be involved in the trafficking of and interaction between endosomal compartments.
30235	
30236	
30237	
30238	
30239	
30240	
30241	
30242	
30243	
30244	
30245	TOB2 belongs to the TOB (see TOB1; MIM 605523)/BTG1 (MIM 109580) family of antiproliferative proteins, which are involved in the regulation of cell cycle progression.[supplied by OMIM]
30246	
30247	
30248	
30249	
30250	
30251	
30252	
30253	
30254	
30255	
30256	
30257	
30258	
30259	
30260	
30261	
30262	
30263	
30264	
30265	
30266	
30267	
30268	
30269	
30270	
30271	
30272	
30273	
30274	
30275	This gene encodes a tyrosine kinase that binds Cdc42Hs in its GTP-bound form and inhibits both the intrinsic and GTPase-activating protein (GAP)-stimulated GTPase activity of Cdc42Hs. This binding is mediated by a unique sequence of 47 amino acids C-terminal to an SH3 domain. The protein may be involved in a regulatory mechanism that sustains the GTP-bound active form of Cdc42Hs and which is directly linked to a tyrosine phosphorylation signal transduction pathway. Several alternatively spliced transcript variants have been identified from this gene, but the full-length nature of only two transcript variants has been determined.
30276	
30277	
30278	
30279	
30280	
30281	
30282	This gene localizes to the major histocompatibility complex (MHC) class III region on chromosome 6. The function of its protein product is unknown, but its ubiquitous expression and conservation in both simple and complex eukaryotes suggests that this may be a housekeeping gene.
30283	
30284	
30285	
30286	
30287	
30288	
30289	
30290	
30291	
30292	
30293	
30294	
30295	
30296	
30297	
30298	
30299	
30300	
30301	IGSF11 is an immunoglobulin (Ig) superfamily member that is preferentially expressed in brain and testis. It shares significant homology with coxsackievirus and adenovirus receptor (CXADR; MIM 602621) and endothelial cell-selective adhesion molecule (ESAM).[supplied by OMIM]
30302	
30303	This gene encodes a member of the POP family of proteins containing three putative transmembrane domains. This gene is expressed in cardiac and skeletal muscle and may play an important role in development of these tissues. The mouse ortholog may be involved in the regeneration of adult skeletal muscle and may act as a cell adhesion molecule in coronary vasculogenesis. Two transcript variants encoding the same protein have been found for this gene.
30304	
30305	
30306	
30307	
30308	
30309	
30310	
30311	
30312	
30313	
30314	
30315	
30316	
30317	
30318	
30319	
30320	
30321	
30322	
30323	
30324	
30325	
30326	
30327	
30328	Transaldolase 1 is a key enzyme of the nonoxidative pentose phosphate pathway providing ribose-5-phosphate for nucleic acid synthesis and NADPH for lipid biosynthesis. This pathway can also maintain glutathione at a reduced state and thus protect sulfhydryl groups and cellular integrity from oxygen radicals. The functional gene of transaldolase 1 is located on chromosome 11 and a pseudogene is identified on chromosome 1 but there are conflicting map locations. The second and third exon of this gene were developed by insertion of a retrotransposable element. This gene is thought to be involved in multiple sclerosis.
30329	
30330	
30331	
30332	The protein encoded by this gene was identified as an interacting corepressor of BCL6, a POZ/zinc finger transcription repressor that is required for germinal center formation and may influence apoptosis. This protein selectively interacts with the POZ domain of BCL6, but not with eight other POZ proteins. Specific class I and II histone deacetylases (HDACs) have been shown to interact with this protein, which suggests a possible link between the two classes of HDACs. At least two alternatively spliced transcript variants, which encode different isoforms, have been reported for this gene.
30333	
30334	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S25E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30335	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit, where it forms part of the domain where translation is initiated. The protein belongs to the S3P family of ribosomal proteins. Studies of the mouse and rat proteins have demonstrated that the protein has an extraribosomal role as an endonuclease involved in the repair of UV-induced DNA damage. The protein appears to be located in both the cytoplasm and nucleus but not in the nucleolus. Higher levels of expression of this gene in colon adenocarcinomas and adenomatous polyps compared to adjacent normal colonic mucosa have been observed. This gene is co-transcribed with the small nucleolar RNA genes U15A and U15B, which are located in its first and fifth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30336	
30337	
30338	
30339	
30340	
30341	
30342	
30343	
30344	
30345	
30346	
30347	
30348	
30349	The protein encoded by this gene is a member of the ankyrin repeat and SOCS box-containing (ASB) family of proteins. They contain ankyrin repeat sequence and SOCS box domain. The SOCS box serves to couple suppressor of cytokine signalling (SOCS) proteins and their binding partners with the elongin B and C complex, possibly targeting them for degradation.
30350	
30351	
30352	
30353	
30354	
30355	
30356	
30357	
30358	
30359	
30360	
30361	
30362	
30363	
30364	
30365	
30366	
30367	
30368	
30369	
30370	
30371	
30372	
30373	
30374	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30375	
30376	
30377	
30378	Nucleocytoplasmic transport, a signal- and energy-dependent process, takes place through nuclear pore complexes embedded in the nuclear envelope. The import of proteins containing a nuclear localization signal (NLS) requires the NLS import receptor, a heterodimer of importin alpha and beta subunits also known as karyopherins. Importin alpha binds the NLS-containing cargo in the cytoplasm and importin beta docks the complex at the cytoplasmic side of the nuclear pore complex. In the presence of nucleoside triphosphates and the small GTP binding protein Ran, the complex moves into the nuclear pore complex and the importin subunits dissociate. Importin alpha enters the nucleoplasm with its passenger protein and importin beta remains at the pore. Interactions between importin beta and the FG repeats of nucleoporins are essential in translocation through the pore complex. The protein encoded by this gene is a member of the importin beta family.
30379	
30380	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein, which is the functional equivalent of the E. coli L10 ribosomal protein, belongs to the L10P family of ribosomal proteins. It is a neutral phosphoprotein with a C-terminal end that is nearly identical to the C-terminal ends of the acidic ribosomal phosphoproteins P1 and P2. The P0 protein can interact with P1 and P2 to form a pentameric complex consisting of P1 and P2 dimers, and a P0 monomer. The protein is located in the cytoplasm. Transcript variants derived from alternative splicing exist; they encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30381	
30382	Heterotrimeric guanine nucleotide-binding proteins (G proteins), which integrate signals between receptors and effector proteins, are composed of an alpha, a beta, and a gamma subunit. These subunits are encoded by families of related genes. This gene encodes a beta subunit. Beta subunits are important regulators of alpha subunits, as well as of certain signal transduction receptors and effectors.
30383	
30384	The protein encoded by this gene is a family member of enzymes known as ubiquitin carrier proteins, or E2s, which are an essential component of the ubiquitin-protein conjugation system. This protein exhibits E2 enzyme activity in its ability to form a thiol ester linkage with ubiquitin in a ubiquitin activating enzyme (E1)-dependent manner. This protein is the first member of this E2 family to be cloned from an epidermal source.
30385	
30386	
30387	
30388	This gene is a member of the tropomyosin family of highly conserved, widely distributed actin-binding proteins involved in the contractile system of striated and smooth muscles and the cytoskeleton of non-muscle cells. Tropomyosin is composed of two alpha-helical chains arranged as a coiled-coil. It is polymerized end to end along the two grooves of actin filaments and provides stability to the filaments. The encoded protein is one type of alpha helical chain that forms the predominant tropomyosin of striated muscle, where it also functions in association with the troponin complex to regulate the calcium-dependent interaction of actin and myosin during muscle contraction. In smooth muscle and non-muscle cells, alternatively spliced transcript variants encoding a range of isoforms have been described. Mutations in this gene are associated with type 3 familial hypertrophic cardiomyopathy.
30389	The protein encoded by this gene contains a RING finger, which is a motif known to be involved in protein-protein interactions. This protein is a membrane-bound ubiquitin ligase. It can regulate cell motility by targeting paxillin ubiquitination and altering the distribution and localization of paxillin in cytoplasm and cell focal adhesions.
30390	The protein encoded by this gene belongs to the family of P-type cation transport ATPases, and to the subfamily of aminophospholipid-transporting ATPases. The aminophospholipid translocases transport phosphatidylserine and phosphatidylethanolamine from one side of a bilayer to another. Alternatively spliced transcript variants encoding different isoforms have been identified.
30391	
30392	The protein encoded by this gene is one polypeptide of a small nuclear ribonucleoprotein complex and belongs to the snRNP SMB/SMN family. The protein plays a role in pre-mRNA processing, possibly tissue-specific alternative splicing events. Although individual snRNPs are believed to recognize specific nucleic acid sequences through RNA-RNA base pairing, the specific role of this family member is unknown. The protein arises from a bicistronic transcript that also encodes a protein identified as the SNRPN upstream reading frame (SNURF). Multiple transcription initiation sites have been identified and extensive alternative splicing occurs in the 5@#$%& untranslated region. Additional splice variants have been described but sequences for the complete transcripts have not been determined. The 5@#$%& UTR of this gene has been identified as an imprinting center. Alternative splicing or deletion caused by a translocation event in this paternally-expressed region is responsible for Angelman syndrome or Prader-Willi syndrome due to parental imprint switch failure.
30393	
30394	This gene encodes a member of the Dual-specificity tyrosine phosphorylation-regulated kinase (DYRK) family. This member contains a nuclear targeting signal sequence, a protein kinase domain, a leucine zipper motif, and a highly conservative 13-consecutive-histidine repeat. It catalyzes its autophosphorylation on serine/threonine and tyrosine residues. It may play a significant role in a signaling pathway regulating cell proliferation and may be involved in brain development. This gene is a homolog of Drosophila mnb (minibrain) gene and rat Dyrk gene. It is localized in the Down syndrome critical region of chromosome 21, and is considered to be a strong candidate gene for learning defects associated with Down syndrome. Alternative splicing of this gene generates several transcript variants differing from each other either in the 5@#$%& region or in the 3@#$%& part of the coding region. These variants encode at least five different isoforms.
30395	
30396	
30397	
30398	
30399	
30400	
30401	
30402	
30403	
30404	
30405	
30406	
30407	
30408	
30409	
30410	The protein encoded by this gene is a member of the cyclin family of proteins. The encoded protein interacts with cyclin-dependent kinase 8 and induces the phophorylation of the carboxy-terminal domain of the large subunit of RNA polymerase II. The level of mRNAs for this gene peaks in the G1 phase of the cell cycle. Two transcript variants encoding different isoforms have been found for this gene.
30411	
30412	
30413	The importin-alpha/beta complex and the GTPase Ran mediate nuclear import of proteins with a classical nuclear localization signal. The protein encoded by this gene is a member of a class of approximately 20 potential Ran targets that share a sequence motif related to the Ran-binding site of importin-beta. Similar to importin-beta, this protein prevents the activation of Ran@#$%&s GTPase by RanGAP1 and inhibits nucleotide exchange on RanGTP, and also binds directly to nuclear pore complexes where it competes for binding sites with importin-beta and transportin. This protein has a Ran-dependent transport cycle and it can cross the nuclear envelope rapidly and in both directions. At least four importin beta-like transport receptors, namely importin beta itself, transportin, RanBP5 and RanBP7, directly bind and import ribosomal proteins.
30414	This gene encodes a member of the diacylglycerol kinase (DGK) enzyme family of proteins, specifically the type II DGK subfamily. Members of this family are involved in regulating the intracellular concentrations of diacylglycerol and phosphatidic acid. Two transcript variants encoding distinct isoforms have been identified for this gene.
30415	
30416	
30417	
30418	Rho GTPases control a variety of cellular processes. There are 3 subtypes of Rho GTPases in the Ras superfamily of small G proteins: RHO (see MIM 165370), RAC (see RAC1; MIM 602048), and CDC42 (MIM 116952). GTPase-activating proteins (GAPs) bind activated forms of Rho GTPases and stimulate GTP hydrolysis. Through this catalytic function, Rho GAPs negatively regulate Rho-mediated signals. GAPs may also serve as effector molecules and play a role in signaling downstream of Rho and other Ras-like GTPases.[supplied by OMIM]
30419	
30420	
30421	
30422	
30423	
30424	
30425	Nuclear receptor interacting protein 1 (NRIP1) is a nuclear protein that specifically interacts with the hormone-dependent activation domain AF2 of nuclear receptors. Also known as RIP140, this protein modulates transcriptional activity of the estrogen receptor.
30426	
30427	
30428	
30429	
30430	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L22P family of ribosomal proteins. It is located in the cytoplasm. This gene has been referred to as rpL23 because the encoded protein shares amino acid identity with ribosomal protein L23 from Halobacterium marismortui; however, its official symbol is RPL17. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30431	
30432	
30433	
30434	
30435	
30436	
30437	
30438	
30439	
30440	
30441	
30442	
30443	
30444	
30445	
30446	
30447	
30448	
30449	
30450	
30451	
30452	This gene encodes a transcription factor that was originally identified as a widely expressed mammalian DNA binding protein that could bind a tax-responsive enhancer element in the LTR of HTLV-1. The encoded protein was also isolated and characterized as the cAMP-response element binding protein 2 (CREB-2). The protein encoded by this gene belongs to a family of DNA-binding proteins that includes the AP-1 family of transcription factors, cAMP-response element binding proteins (CREBs) and CREB-like proteins. These transcription factors share a leucine zipper region that is involved in protein-protein interactions, located C-terminal to a stretch of basic amino acids that functions as a DNA binding domain. Two alternative transcripts encoding the same protein have been described. Two pseudogenes are located on the X chromsome at q28 in a region containing a large inverted duplication.
30453	
30454	
30455	
30456	
30457	Members of the F-box protein family, such as FBXO31, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
30458	
30459	
30460	
30461	
30462	
30463	
30464	
30465	
30466	
30467	
30468	
30469	
30470	
30471	
30472	
30473	
30474	
30475	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface glycoprotein that is known to complex with integrins and other transmembrane 4 superfamily proteins. It may play a role in T-cell-B-cell interactions.
30476	Fibroblast growth factors (FGFs; see MIM 603726) are secreted proteins involved in cellular proliferation, migration, differentiation, and survival. FGF activity is negatively regulated by members of the @#$%&sprouty@#$%& family (e.g., SPRY1, MIM 602465). The SEF protein is a modulator of FGF signaling.[supplied by OMIM]
30477	
30478	
30479	
30480	
30481	
30482	
30483	
30484	
30485	
30486	
30487	
30488	
30489	
30490	
30491	
30492	
30493	
30494	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
30495	
30496	
30497	
30498	
30499	
30500	
30501	
30502	
30503	
30504	
30505	
30506	
30507	
30508	
30509	
30510	
30511	The product of this gene is a microtubule-associated protein that is predominantly expressed in cells of epithelial origin. Microtubule-associated proteins are thought to be involved in microtubule dynamics, which is essential for cell polarization and differentiation. This protein has been shown to be able to stabilize microtubules, and may serve to modulate microtubule functions. Studies of the related mouse protein also suggested an essential role in microtubule function required for spermatogenesis.
30512	
30513	
30514	
30515	This gene encodes the third of three enzymes that posttranslationally modify isoprenylated C-terminal cysteine residues in certain proteins and target those proteins to the cell membrane. This enzyme localizes to the endoplasmic reticulum. Alternative splicing results in two transcript variants encoding different isoforms.
30516	
30517	
30518	
30519	
30520	
30521	
30522	
30523	
30524	
30525	
30526	
30527	
30528	
30529	
30530	
30531	Acrosin is the major proteinase present in the acrosome of mature spermatozoa. It is a typical serine proteinase with trypsin-like specificity. It is stored in the acrosome in its precursor form, proacrosin. The active enzyme functions in the lysis of the zona pellucida, thus facilitating penetration of the sperm through the innermost glycoprotein layers of the ovum. The mRNA for proacrosin is synthesized only in the postmeiotic stages of spermatogenesis. In humans proacrosin first appears in the haploid spermatids.
30532	
30533	
30534	
30535	
30536	
30537	
30538	
30539	
30540	
30541	
30542	
30543	
30544	
30545	
30546	
30547	
30548	
30549	
30550	
30551	
30552	
30553	
30554	
30555	
30556	
30557	
30558	CKS1B protein binds to the catalytic subunit of the cyclin dependent kinases and is essential for their biological function. The CKS1B mRNA is found to be expressed in different patterns through the cell cycle in HeLa cells, which reflects a specialized role for the encoded protein.
30559	
30560	
30561	This gene encodes a member of the transcription elongation factor A (SII)-like (TCEAL) gene family. Members of this family contain TFA domains and may function as nuclear phosphoproteins that modulate transcription in a promoter context-dependent manner. Multiple family members are located on the X chromosome. Alternative splicing results in multiple transcript variants encoding a single isoform.
30562	
30563	
30564	
30565	
30566	MaxiK channels are large conductance, voltage and calcium-sensitive potassium channels which are fundamental to the control of smooth muscle tone and neuronal excitability. MaxiK channels can be formed by 2 subunits: the pore-forming alpha subunit, which is the product of this gene, and the modulatory beta subunit. Intracellular calcium regulates the physical association between the alpha and beta subunits. Alternatively spliced transcript variants encoding different isoforms have been identified.
30567	
30568	This intronless gene is specifically expressed in the testis, and encodes a protein that is highly homologous to the two subunits for phosphoribosylpyrophosphate synthetase encoded by human X-linked genes, PRPS1 and PRPS2. These enzymes convert pyrimidine, purine or pyridine bases to the corresponding ribonucleoside monophosphates. Studies indicate that translation of this mRNA initiates exclusively at a non-AUG (ACG) codon.
30569	
30570	
30571	
30572	
30573	
30574	
30575	This protein belongs to the aldehyde dehydrogenase family of proteins. This enzyme is a mitochondrial matrix NAD-dependent dehydrogenase which catalyzes the second step of the proline degradation pathway, converting pyrroline-5-carboxylate to glutamate. Deficiency of this enzyme is associated with type II hyperprolinemia, an autosomal recessive disorder characterized by accumulation of delta-1-pyrroline-5-carboxylate (P5C) and proline. Two transcript variants encoding the same protein have been identified for this gene.
30576	
30577	
30578	
30579	
30580	
30581	
30582	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L18AE family of ribosomal proteins. It is located in the cytoplasm. This gene is co-transcribed with the U68 snoRNA, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30583	RICTOR and MTOR (FRAP1; MIM 601231) are components of a protein complex that integrates nutrient- and growth factor-derived signals to regulate cell growth (Sarbassov et al., 2004).[supplied by OMIM]
30584	
30585	This gene belongs to the RBMY gene family which includes candidate Y chromosome spermatogenesis genes. This gene, an active X chromosome homolog of the Y chromosome RBMY gene, is widely expressed whereas the RBMY gene evolved a male-specific function in spermatogenesis. Pseudogenes of this gene, found on chromosomes 1, 4, 9, 11, and 6, were likely derived by retrotransposition from the original gene. Alternatively spliced transcript variants encoding different isoforms have been identified but their biological nature has not been determined.
30586	
30587	
30588	
30589	
30590	
30591	
30592	
30593	
30594	
30595	
30596	
30597	
30598	
30599	
30600	
30601	
30602	
30603	
30604	
30605	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
30606	The protein encoded by this gene is a member of the immunoglobulin superfamily. The protein contains two immunoglobulin domains and thirteen fibronectin type III domains. Fibronectin type III domains are present in both extracellular and intracellular proteins and tandem repeats are known to contain binding sites for DNA, heparin and the cell surface. This protein, and a homologous mouse sequence, are very similar to the Drosophila sidekick gene product but the specific function of this superfamily member is not yet known. Evidence for alternative splicing at this gene locus has been observed but the full-length nature of additional variants has not yet been determined.
30607	
30608	
30609	
30610	
30611	
30612	
30613	
30614	
30615	
30616	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30617	The protein encoded by this gene is a homolog of the protein transgelin, which is one of the earliest markers of differentiated smooth muscle. The function of this protein has not yet been determined.
30618	
30619	
30620	
30621	
30622	
30623	
30624	
30625	
30626	
30627	
30628	
30629	
30630	RAS guanyl nucleotide-releasing protein (RASGRP) is a member of a family of genes characterized by the presence of a Ras superfamily guanine nucleotide exchange factor (GEF) domain. The corresponding rat gene rbc7, which lacks a 5-prime exon, represents a 5-prime and 3-prime truncated version of a larger normal rat transcript that encodes a predicted 90-kD protein. This shorter transcript has not been found in humans.
30631	
30632	
30633	
30634	
30635	
30636	
30637	
30638	
30639	
30640	
30641	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This protein is a potent epithelial cell-specific growth factor, whose mitogenic activity is predominantly exhibited in keratinocytes but not in fibroblasts and endothelial cells. Studies of mouse and rat homologs of this gene implicated roles in morphogenesis of epithelium, reepithelialization of wounds, hair development and early lung organogenesis.
30642	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. This protein is a potent epithelial cell-specific growth factor, whose mitogenic activity is predominantly exhibited in keratinocytes but not in fibroblasts and endothelial cells. Studies of mouse and rat homologs of this gene implicated roles in morphogenesis of epithelium, reepithelialization of wounds, hair development and early lung organogenesis.
30643	
30644	
30645	
30646	
30647	
30648	
30649	
30650	The protein encoded by this gene belongs to the ADAM-TS/metallospondin family of proteins, which are structurally homologous to the ADAM proteins, but they also contain at least 1 C-terminal thrombospondin type 1 (TSP1) repeat, and are secreted, instead of being membrane bound. Members of the ADAM-TS family have been implicated in the cleavage of proteoglycans, the control of organ shape during development, and the inhibition of angiogenesis. This gene is localized to chromosome 3p14.3-p14.2, an area known to be lost in hereditary renal tumors. Three transcript variants encoding different isoforms have been identified for this locus.
30651	
30652	
30653	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S3AE family of ribosomal proteins. It is located in the cytoplasm. Disruption of the gene encoding rat ribosomal protein S3a, also named v-fos transformation effector protein, in v-fos-transformed rat cells results in reversion of the transformed phenotype. Transcript variants utilizing alternative transcription start sites have been described. This gene is co-transcribed with the U73A and U73B small nucleolar RNA genes, which are located in its fourth and third introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30654	
30655	
30656	
30657	
30658	
30659	The elongation of primed DNA templates by DNA polymerase delta and DNA polymerase epsilon requires the accessory proteins proliferating cell nuclear antigen (PCNA) and replication factor C (RFC). RFC, also named activator 1, is a protein complex consisting of five distinct subunits of 140, 40, 38, 37, and 36 kD. This gene encodes the 38 kD subunit. This subunit is essential for the interaction between the 140 kD subunit and the core complex that consists of the 36, 37, and 40 kD subunits. Alternatively spliced transcript variants encoding distinct isoforms have been described.
30660	
30661	
30662	
30663	
30664	
30665	
30666	
30667	
30668	
30669	
30670	
30671	
30672	KLF13 belongs to a family of transcription factors that contain 3 classical zinc finger DNA-binding domains consisting of a zinc atom tetrahedrally coordinated by 2 cysteines and 2 histidines (C2H2 motif). These transcription factors bind to GC-rich sequences and related GT and CACCC boxes (Scohy et al., 2000).[supplied by OMIM]
30673	
30674	
30675	
30676	
30677	
30678	
30679	
30680	
30681	
30682	
30683	
30684	
30685	
30686	
30687	
30688	
30689	
30690	
30691	
30692	This gene encodes glutathione-dependent formaldehyde dehydrogenase or class III alcohol dehydrogenase chi subunit, which is a member of the alcohol dehydrogenase family. Members of this family metabolize a wide variety of substrates, including ethanol, retinol, other aliphatic alcohols, hydroxysteroids, and lipid peroxidation products. Class III alcohol dehydrogenase is a homodimer composed of 2 chi subunits. It has virtually no activity for ethanol oxidation, but exhibits high activity for oxidation of long-chain primary alcohols and for oxidation of S-hydroxymethyl-glutathione, a spontaneous adduct between formaldehyde and glutathione. This enzyme is an important component of cellular metabolism for the elimination of formaldehyde, a potent irritant and sensitizing agent that causes lacrymation, rhinitis, pharyngitis, and contact dermatitis.
30693	
30694	
30695	
30696	
30697	
30698	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role in cells. The protein encoded by this gene is one of the PKC family members. This kinase has been shown to be involved in many different cellular functions, such as neuron channel activation, apoptosis, cardioprotection from ischemia, heat shock response, as well as insulin exocytosis. Knockout studies in mice suggest that this kinase is important for lipopolysaccharide (LPS)-mediated signaling in activated macrophages and may also play a role in controlling anxiety-like behavior.
30699	
30700	
30701	
30702	
30703	
30704	
30705	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 99% identical to the mouse, rat and sheep orthologs. The encoded protein interacts with IRS1 protein, suggesting a role in regulating insulin sensitivity. Two transcript variants differing in the 5@#$%& UTR, but encoding the same protein, have been identified for this gene.
30706	
30707	
30708	
30709	
30710	
30711	
30712	
30713	
30714	
30715	
30716	
30717	The E2F transcription factor family (see MIM 189971) regulates the expression of various cellular promoters, particularly those involved in the cell cycle. E2F factors bind to DNA as homodimers or heterodimers in association with dimerization partner DP1. TFDP1 may be the first example of a family of related transcription factors; see TFDP2 (MIM 602160).[supplied by OMIM]
30718	
30719	
30720	
30721	The protein encoded by this gene belongs to the immunoglobulin superfamily. It is a type I membrane protein and may function in selective fasciculation and zone-to-zone projection of the primary olfactory axons.
30722	
30723	Transport of folate compounds into mammalian cells can occur via receptor-mediated (see MIM 136430) or carrier-mediated mechanisms. A functional coordination between these 2 mechanisms has been proposed to be the method of folate uptake in certain cell types. Methotrexate (MTX) is an antifolate chemotherapeutic agent that is actively transported by the carrier-mediated uptake system. RFC1 plays a role in maintaining intracellular concentrations of folate.[supplied by OMIM]
30724	
30725	
30726	
30727	
30728	
30729	
30730	
30731	
30732	
30733	
30734	
30735	
30736	The leiomodin 1 protein has a putative membrane-spanning region and 2 types of tandemly repeated blocks.  The transcript is expressed in all tissues tested, with the highest levels in thyroid, eye muscle, skeletal muscle, and ovary. Increased expression of leiomodin 1 may be linked to Graves@#$%& disease and thyroid-associated ophthalmopathy.
30737	
30738	
30739	
30740	
30741	
30742	
30743	
30744	
30745	
30746	The protein encoded by this gene is a member of the ELAVL protein family. This encoded protein contains 3 RNA-binding domains and binds cis-acting AU-rich elements. It destabilizes mRNAs and thereby regulates gene expression.
30747	
30748	This gene encodes a myotubularin-related protein that is atypical to most other members of the myotubularin-related protein family because it has no dual-specificity phosphatase domain. The encoded protein contains a double-helical motif similar to the SET interaction domain, which is thought to have a role in the control of cell proliferation. In mouse, a protein similar to the encoded protein binds with MTMR7, and together they dephosphorylate phosphatidylinositol 3-phosphate and inositol 1,3-bisphosphate.
30749	
30750	
30751	
30752	
30753	
30754	
30755	
30756	
30757	
30758	
30759	
30760	
30761	
30762	The protein encoded by this gene belongs to the tigger subfamily of the pogo superfamily of DNA-mediated transposons in humans. These proteins are related to DNA transposons found in fungi and nematodes, and more distantly to the Tc1 and mariner transposases. They are also very similar to the major mammalian centromere protein B. The exact function of this gene is not known.
30763	
30764	This gene encodes a lymphoid-specific helicase. Other helicases function in processes involving DNA strand separation, including replication, repair, recombination, and transcription. This protein is thought to be involved with cellular proliferation and may play a role in leukemogenesis. Alternatively spliced transcript variants have been described, but their biological validity has not been determined.
30765	
30766	
30767	
30768	
30769	
30770	
30771	This gene encodes a protein involved in nonsense-mediated mRNA decay (NMD) as part of the mRNA surveillance complex. The protein has kinase activity and is thought to function in NMD by phosphorylating the regulator of nonsense transcripts 1 protein. Alternative spliced transcript variants have been described, but their full-length natures have not been determined.
30772	
30773	
30774	
30775	
30776	
30777	
30778	
30779	
30780	
30781	
30782	
30783	
30784	
30785	
30786	
30787	
30788	
30789	
30790	
30791	Formin homology (FH) domain proteins (e.g., FMN; MIM 136535) play a role in cytoskeletal organization and/or establishment of cell polarity.[supplied by OMIM]
30792	
30793	
30794	
30795	
30796	
30797	
30798	
30799	
30800	
30801	
30802	
30803	
30804	
30805	This gene belongs to a family of Sp1 related genes that encode transcription factors that regulate transcription by binding to consensus GC- and GT-box regulatory elements in target genes. This protein contains a zinc finger DNA-binding domain and several transactivation domains, and has been reported to function as a bifunctional transcription factor that either stimulates or represses the transcription of numerous genes. Transcript variants encoding different isoforms have been described for this gene, and one has been reported to initiate translation from a non-AUG (AUA) start codon. Additional isoforms, resulting from the use of alternate downstream translation initiation sites, have also been noted.
30806	
30807	
30808	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30809	
30810	
30811	
30812	
30813	
30814	
30815	
30816	
30817	
30818	
30819	
30820	
30821	
30822	
30823	
30824	
30825	
30826	
30827	PLCE1 belongs to the phospholipase family that catalyzes the hydrolysis of polyphosphoinositides such as phosphatidylinositol-4,5-bisphosphate (PtdIns(4,5)P2) to generate the second messengers Ins(1,4,5)P3 and diacylglycerol. These products initiate a cascade of intracellular responses that result in cell growth and differentiation and gene expression.[supplied by OMIM]
30828	
30829	
30830	
30831	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30832	
30833	This gene encodes a member of the serine/threonine protein kinase family, that is highly similar to yeast SPS1/STE20 kinase. Yeast SPS1/STE20 functions near the beginning of the MAP kinase signal cascades that is essential for yeast pheromone response. This kinase was shown to activate Jun kinase in mammalian cells, which suggested a role in stress response. Two alternatively spliced transcript variants encoding the same protein have been described for this gene.
30834	
30835	
30836	ROCK1 is a protein serine/threonine kinase that is activated when bound to the GTP-bound form of Rho. The small GTPase Rho regulates formation of focal adhesions and stress fibers of fibroblasts, as well as adhesion and aggregation of platelets and lymphocytes by shuttling between the inactive GDP-bound form and the active GTP-bound form. Rho is also essential in cytokinesis and plays a role in transcriptional activation by serum response factor. ROCK, a downstream effector of Rho, phosphorylates and activates LIM kinase, which in turn, phosphorylates cofilin, inhibiting its actin-depolymerizing activity.
30837	
30838	
30839	
30840	
30841	
30842	
30843	
30844	This gene encodes a protein involved in nonsense-mediated mRNA decay (NMD) as part of the mRNA surveillance complex. The protein has kinase activity and is thought to function in NMD by phosphorylating the regulator of nonsense transcripts 1 protein. Alternative spliced transcript variants have been described, but their full-length natures have not been determined.
30845	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S7E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30846	
30847	
30848	
30849	DAB2 mRNA is expressed in normal ovarian epithelial cells but is down-regulated or absent from ovarian carcinoma cell lines. The 770-amino acid predicted protein has an overall 83% identity with the mouse p96 protein, a putative mitogen-responsive phosphoprotein; homology is strongest in the amino-terminal end of the protein in a region corresponding to the phosphotyrosine interaction domain.  The down-regulation of DAB2 may play an important role in ovarian carcinogenesis. This gene was initially named DOC2 (for Differentially expressed in Ovarian Cancer) and is distinct from the DOC2A and DOC2B genes (for double C2-like domains, alpha and beta).
30850	
30851	
30852	
30853	
30854	
30855	
30856	
30857	
30858	
30859	GABA is the major inhibitory neurotransmitter in the mammalian brain where it acts at GABA-A receptors, which are ligand-gated chloride channels. Chloride conductance of these channels can be modulated by agents such as benzodiazepines that bind to the GABA-A receptor. At least 16 distinct subunits of GABA-A receptors have been identified.
30860	
30861	
30862	
30863	
30864	
30865	
30866	
30867	
30868	This gene encodes the fatty acid binding protein found in epidermal cells, and was first identified as being upregulated in psoriasis tissue. Fatty acid binding proteins are a family of small, highly conserved, cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. It is thought that FABPs roles include fatty acid uptake, transport, and metabolism.
30869	
30870	
30871	The protein encoded by this gene is a member of the L1 gene family of neural cell adhesion molecules. It is a neural recognition molecule that may be involved in signal transduction pathways. The deletion of one copy of this gene may be responsible for mental defects in patients with 3p- syndrome. Several alternatively spliced transcript variants of this gene have been described, but their full length nature is not known.
30872	
30873	
30874	
30875	
30876	
30877	
30878	
30879	
30880	
30881	
30882	
30883	
30884	
30885	
30886	
30887	
30888	
30889	
30890	
30891	
30892	This gene encodes a protein that shares high sequence similarity with ribosomal protein L26. It is not currently known whether the encoded protein is a functional ribosomal protein or whether it has evolved a function that is independent of the ribosome. Transcript variants utilizing alternative polyA signals exist.
30893	This gene encodes a protein that shares high sequence similarity with ribosomal protein L26. It is not currently known whether the encoded protein is a functional ribosomal protein or whether it has evolved a function that is independent of the ribosome. Transcript variants utilizing alternative polyA signals exist.
30894	The protein encoded by this gene catalyzes the formation of inositol 1,4,5-trisphosphate and diacylglycerol from phosphatidylinositol 4,5-bisphosphate. This reaction uses calcium as a cofactor and plays an important role in the intracellular transduction of many extracellular signals. This gene is activated by two G-protein alpha subunits, alpha-q and alpha-11. Two transcript variants encoding different isoforms have been found for this gene.
30895	
30896	
30897	
30898	
30899	
30900	This gene encodes a protein belonging to the vacuolar-protein-sorting-13 gene family. In yeast, vacuolar-protein-sorting-13 proteins are involved in trafficking of membrane proteins between the trans-Golgi network and the prevacuolar compartment. While several transcript variants may exist for this gene, the full-length natures of only two have been described to date. These two represent the major variants of this gene and encode distinct isoforms.
30901	
30902	
30903	
30904	
30905	
30906	
30907	
30908	
30909	
30910	
30911	
30912	
30913	
30914	
30915	
30916	
30917	
30918	
30919	
30920	
30921	This gene encodes a type 1 transmembrane protein of unknown function. It is one of several genes located in the imprinted gene domain of 11p15.5, an important tumor-suppressor gene region. Alterations in this region have been associated with the Beckwith-Wiedemann syndrome, Wilms tumor, rhabdomyosarcoma, adrenocrotical carcinoma, and lung, ovarian, and breast cancer. This gene may play a role in malignancies and disease that involve this region, as well as early hematopoiesis, by mediating attachment to stromal cells. This gene is oriented in a head-to-tail configuration with the ribonucleotide reductase 1 gene (RRM1), with the 3@#$%& end of this gene situated 1.6 kb from the 5@#$%& end of the RRM1 gene.
30922	CKS1B protein binds to the catalytic subunit of the cyclin dependent kinases and is essential for their biological function. The CKS1B mRNA is found to be expressed in different patterns through the cell cycle in HeLa cells, which reflects a specialized role for the encoded protein.
30923	
30924	
30925	
30926	
30927	
30928	
30929	
30930	
30931	
30932	CLK3 encodes a serine/threonine type protein kinase with a non-conserved N-terminal domain. A long and short isoform (phclk3 and pclk3/152) result from alternative splicing and coexist in different tissues. Isoform phclk3/152 lacks the kinase domain.
30933	
30934	
30935	CLK3 encodes a serine/threonine type protein kinase with a non-conserved N-terminal domain. A long and short isoform (phclk3 and pclk3/152) result from alternative splicing and coexist in different tissues. Isoform phclk3/152 lacks the kinase domain.
30936	
30937	
30938	This gene encodes a putative RNA binding protein of unknown function.  Mutations in this gene, specifically a t(11;22)(q24;q12) translocation, are known to cause Ewing sarcoma as well as neuroectodermal and various other tumors.  Alternative splicing of this gene results in two products.
30939	
30940	
30941	
30942	
30943	
30944	
30945	
30946	
30947	Adenylate kinases are involved in regulating the adenine nucleotide composition within a cell by catalyzing the reversible transfer of phosphate groups among adenine nucleotides. Three isozymes of adenylate kinase, namely 1, 2, and 3, have been identified in vertebrates; this gene encodes isozyme 2. Expression of these isozymes is tissue-specific and developmentally regulated. Isozyme 2 is localized in the mitochondrial intermembrane space and may play a role in apoptosis. Transcript variants encoding distinct isoforms have been identified for this gene.
30948	
30949	
30950	
30951	
30952	
30953	
30954	
30955	
30956	
30957	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation.
30958	
30959	
30960	
30961	Mannosyl (alpha-1,6-)-glycoprotein beta-1,6-N-acetyl-glucosaminyltransferase protein is a glycosyltransferase involved in the synthesis of protein-bound and lipid-bound oligosaccharides.  Alterations of the oligosaccharides on cell surface glycoproteins cause significant changes in the adhesive or migratory behavior of a cell.  Increase in GMAT5 enzyme activity may correlate with the progression of invasive malignancies.
30962	
30963	Methylenetetrahydrofolate reductase (EC 1.5.1.20) catalyzes the conversion of 5,10-methylenetetrahydrofolate to 5-methyltetrahydrofolate, a cosubstrate for homocysteine remethylation to methionine.[supplied by OMIM]
30964	
30965	This gene encodes a member of the Ca2+/calmodulin-dependent protein kinase 1 subfamily of serine/threonine kinases. The encoded protein may be involved in the regulation of granulocyte function through the chemokine signal transduction pathway. Alternatively spliced transcript variants encoding different isoforms of this gene have been described.
30966	
30967	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S3AE family of ribosomal proteins. It is located in the cytoplasm. Disruption of the gene encoding rat ribosomal protein S3a, also named v-fos transformation effector protein, in v-fos-transformed rat cells results in reversion of the transformed phenotype. Transcript variants utilizing alternative transcription start sites have been described. This gene is co-transcribed with the U73A and U73B small nucleolar RNA genes, which are located in its fourth and third introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
30968	
30969	
30970	
30971	
30972	
30973	
30974	Heterotrimeric G proteins play vital roles in cellular responses to external signals. The specificity of a G protein-receptor interaction is primarily mediated by the gamma subunit.[supplied by OMIM]
30975	
30976	
30977	
30978	
30979	
30980	
30981	
30982	
30983	
30984	
30985	
30986	
30987	
30988	
30989	
30990	
30991	
30992	
30993	
30994	
30995	Calcyphosine-2 is a calcium-binding protein with 2 EF-hand motifs (Wang et al., 2002).[supplied by OMIM]
30996	CITED4 belongs to a family of transcriptional coactivators that bind several proteins, including CREB-binding protein (MIM 600140) and p300 (MIM 602700).[supplied by OMIM]
30997	
30998	
30999	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L3P family of ribosomal proteins. It is located in the cytoplasm. The protein can bind to the HIV-1 TAR mRNA, and it has been suggested that the protein contributes to tat-mediated transactivation. This gene is co-transcribed with the small nucleolar RNA genes U43, U86, U83a, and U83b, which are located in its first, third, fifth, and seventh introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31000	
31001	
31002	
31003	
31004	
31005	
31006	
31007	
31008	
31009	
31010	
31011	
31012	
31013	MHC2TA encodes a non-DNA binding transactivator that functions both in constitutive and inducible MHC Class II expression. Mutations in MHC2TA result in the complementation group A of Bare Lymphocyte Syndrome.
31014	
31015	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein.
31016	
31017	
31018	
31019	
31020	
31021	
31022	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a cytoplasmic ribosomal protein that is a component of the 60S subunit. The protein belongs to the L22E family of ribosomal proteins. Its initiating methionine residue is post-translationally removed. The protein can bind specifically to Epstein-Barr virus-encoded RNAs (EBERs) 1 and 2. The mouse protein has been shown to be capable of binding to heparin. Transcript variants utilizing alternative polyA signals exist. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome. It was previously thought that this gene mapped to 3q26 and that it was fused to the acute myeloid leukemia 1 (AML1) gene located at 21q22 in some therapy-related myelodysplastic syndrome patients with 3;21 translocations; however, these fusions actually involve a ribosomal protein L22 pseudogene located at 3q26, and this gene actually maps to 1p36.3-p36.2.
31023	
31024	
31025	
31026	
31027	
31028	This gene encodes a member of the F-box protein family which is characterized by an approximately 40 amino acid motif, the F-box. The F-box proteins constitute one of the four subunits of ubiquitin protein ligase complex called SCFs (SKP1-cullin-F-box), which function in phosphorylation-dependent ubiquitination. The F-box proteins are divided into 3 classes: Fbws containing WD-40 domains, Fbls containing leucine-rich repeats, and Fbxs containing either different protein-protein interaction modules or no recognizable motifs. The protein encoded by this gene belongs to the Fbxs class. Three alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
31029	
31030	
31031	
31032	
31033	
31034	
31035	
31036	
31037	
31038	
31039	
31040	
31041	Dihydrofolate reductase converts dihydrofolate into tetrahydrofolate, a methyl group shuttle required for the de novo synthesis of purines, thymidylic acid, and certain amino acids.  While the functional dihydrofolate reductase gene has been mapped to chromosome 5, four intronless processed pseudogenes have been identified on separate chromosomes.  Dihydrofolate reductase deficiency has been linked to megaloblastic anemia.
31042	
31043	
31044	
31045	
31046	
31047	
31048	
31049	
31050	
31051	
31052	
31053	
31054	
31055	
31056	
31057	
31058	
31059	
31060	
31061	
31062	This gene encodes a member of the basic helix-loop-helix leucine zipper family, and can function as a cellular transcription factor. The encoded protein can activate transcription through pyrimidine-rich initiator (Inr) elements and E-box motifs. Two transcript variants encoding distinct isoforms have been identified for this gene.
31063	
31064	
31065	
31066	
31067	
31068	
31069	
31070	
31071	
31072	
31073	
31074	
31075	
31076	
31077	
31078	
31079	
31080	Hook proteins are cytosolic coiled-coil proteins that contain conserved N-terminal domains, which attach to microtubules, and more divergent C-terminal domains, which mediate binding to organelles. The Drosophila Hook protein is a component of the endocytic compartment.[supplied by OMIM]
31081	
31082	
31083	
31084	
31085	
31086	
31087	
31088	
31089	
31090	
31091	This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. However, this protein is considered a member of the cytochrome P450 superfamily on the basis of sequence similarity rather than functional similarity. This endoplasmic reticulum membrane protein catalyzes the conversion of prostglandin H2 to thromboxane A2, a potent vasoconstrictor and inducer of platelet aggregation. The enzyme plays a role in several pathophysiological processes including hemostasis, cardiovascular disease, and stroke. The gene expresses two transcript variants.
31092	
31093	
31094	
31095	
31096	
31097	
31098	
31099	
31100	
31101	
31102	
31103	
31104	
31105	
31106	
31107	
31108	
31109	The protein encoded by this gene is found as a pentamer and is a major substrate for the cAMP-dependent protein kinase in cardiac muscle. The encoded protein is an inhibitor of cardiac muscle sarcoplasmic reticulum Ca(2+)-ATPase in the unphosphorylated state, but inhibition is relieved upon phosphorylation of the protein. The subsequent activation of the Ca(2+) pump leads to enhanced muscle relaxation rates, thereby contributing to the inotropic response elicited in heart by beta-agonists. The encoded protein is a key regulator of cardiac diastolic function. Mutations in this gene are a cause of inherited human dilated cardiomyopathy with refractory congestive heart failure.
31110	
31111	
31112	
31113	
31114	
31115	
31116	
31117	
31118	
31119	
31120	This gene is a member of the glutathione S-transferase (GSTs) super-family which encodes multifunctional enzymes important in the detoxification of electrophilic molecules, including carcinogens, mutagens, and several therapeutic drugs, by conjugation with glutathione. This enzyme also plays a significant role in the catabolism of phenylalanine and tyrosine. Thus defects in this enzyme may lead to severe metabolic disorders including alkaptonuria, phenylketonuria and tyrosinaemia. Several transcript variants of this gene encode multiple protein isoforms.
31121	
31122	
31123	
31124	
31125	
31126	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L22P family of ribosomal proteins. It is located in the cytoplasm. This gene has been referred to as rpL23 because the encoded protein shares amino acid identity with ribosomal protein L23 from Halobacterium marismortui; however, its official symbol is RPL17. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31127	
31128	
31129	
31130	
31131	
31132	
31133	
31134	
31135	
31136	
31137	
31138	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein, which is the functional equivalent of the E. coli L10 ribosomal protein, belongs to the L10P family of ribosomal proteins. It is a neutral phosphoprotein with a C-terminal end that is nearly identical to the C-terminal ends of the acidic ribosomal phosphoproteins P1 and P2. The P0 protein can interact with P1 and P2 to form a pentameric complex consisting of P1 and P2 dimers, and a P0 monomer. The protein is located in the cytoplasm. Transcript variants derived from alternative splicing exist; they encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31139	
31140	
31141	
31142	
31143	
31144	
31145	
31146	
31147	
31148	
31149	
31150	Malate dehydrogenase catalyzes the reversible oxidation of malate to oxaloacetate, utilizing the NAD/NADH cofactor system in the citric acid cycle. The protein encoded by this gene is localized to the mitochondria and may play pivotal roles in the malate-aspartate shuttle that operates in the metabolic coordination between cytosol and mitochondria.
31151	
31152	
31153	
31154	The protein encoded by this gene appears to be multifunctional. It along with PCBP-1 and hnRNPK corresponds to the major cellular poly(rC)-binding proteins. It contains three K-homologous (KH) domains which may be involved in RNA binding. This encoded protein together with PCBP-1 also functions as translational coactivators of poliovirus RNA via a sequence-specific interaction with stem-loop IV of the IRES and promote poliovirus RNA replication by binding to its 5@#$%&-terminal cloverleaf structure. It has also been implicated in translational control of the 15-lipoxygenase mRNA, human Papillomavirus type 16 L2 mRNA, and hepatitis A virus RNA. The encoded protein is also suggested to play a part in formation of a sequence-specific alpha-globin mRNP complex which is associated with alpha-globin mRNA stability. This multiexon structural mRNA is thought to be retrotransposed to generate PCBP-1 intronless gene which has similar functions. This gene and PCBP-1 has paralogues PCBP3 and PCBP4 which is thought to arose as a result of duplication events of entire genes. It also has two processed pseudogenes PCBP2P1 and PCBP2P2. There are presently two alternatively spliced transcript variants described for this gene.
31155	
31156	
31157	
31158	
31159	
31160	
31161	
31162	
31163	
31164	
31165	
31166	This gene encodes a cell surface tyrosine kinase receptor for members of the platelet-derived growth factor family. These growth factors are mitogens for cells of mesenchymal origin. The identity of the growth factor bound to a receptor monomer determines whether the functional receptor is a homodimer or a heterodimer, composed of both platelet-derived growth factor receptor alpha and beta polypeptides. This gene is flanked on chromosome 5 by the genes for granulocyte-macrophage colony-stimulating factor and macrophage-colony stimulating factor receptor; all three genes may be implicated in the 5-q syndrome. A translocation between chromosomes 5 and 12, that fuses this gene to that of the translocation, ETS, leukemia gene, results in chronic myelomonocytic leukemia.
31167	
31168	
31169	
31170	
31171	
31172	
31173	
31174	
31175	
31176	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L6E family of ribosomal proteins. It is located in the cytoplasm. The protein can bind specifically to domain C of the tax-responsive enhancer element of human T-cell leukemia virus type 1, and it has been suggested that the protein may participate in tax-mediated transactivation of transcription. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31177	
31178	
31179	
31180	
31181	
31182	
31183	
31184	
31185	
31186	
31187	
31188	The members of the zinc fingers and homeoboxes gene family are nuclear homodimeric transcriptional repressors that interact with the A subunit of nuclear factor-Y (NF-YA) and contain two C2H2-type zinc fingers and five homeobox DNA-binding domains. This gene encodes member 2 of this gene family. In addition to forming homodimers, this protein heterodimerizes with member 1 of the zinc fingers and homeoboxes family.
31189	
31190	
31191	
31192	
31193	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. The encoded protein is one of two functional homologs to yeast Upf3p. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein binds to the mRNA and remains bound after nuclear export, acting as a nucleocytoplasmic shuttling protein. It forms with Y14 a complex that binds specifically 20 nt upstream of exon-exon junctions. This gene is located on the long arm of chromosome 13. Two splice variants encoding different isoforms have been found for this gene.
31194	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
31195	
31196	
31197	WNT proteins are extracellular signaling molecules involved in the control of embryonic development. This gene encodes a secreted protein, which binds WNT proteins and inhibits their activities. This protein contains a WNT inhibitory factor (WIF) domain and 5 epidermal growth factor (EGF)-like domains. It may be involved in mesoderm segmentation. This protein is found to be present in fish, amphibia and mammals.
31198	The protein encoded by this gene is a cytosolic protein which contains a phosphotyrosine binding (PTD) domain. The PTD domain has been found to interact with the cytoplasmic tail of the LDL receptor. Mutations in this gene lead to LDL receptor malfunction and cause the disorder autosomal recessive hypercholesterolaemia.
31199	
31200	
31201	
31202	
31203	
31204	
31205	
31206	
31207	
31208	
31209	
31210	
31211	Homeobox genes encode DNA-binding proteins, many of which are thought to be involved in early embryonic development. Homeobox genes encode a DNA-binding domain of 60 to 63 amino acids referred to as the homeodomain. This gene is a member of the RAXLX homeobox gene family.
31212	
31213	
31214	
31215	
31216	
31217	
31218	
31219	
31220	
31221	
31222	
31223	
31224	
31225	
31226	
31227	
31228	The protein encoded by this gene is a GTPase which belongs to the RAS superfamily of small GTP-binding proteins. Members of this superfamily appear to regulate a diverse array of cellular events, including the control of cell growth, cytoskeletal reorganization, and the activation of protein kinases. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
31229	The protein encoded by this gene is an interleukin 3 specific subunit of a heterodimeric cytokine receptor. The receptor is comprised of a ligand specific alpha subunit and a signal transducing beta subunit shared by the receptors for interleukin 3 (IL3), colony stimulating factor 2 (CSF2/GM-CSF), and interleukin 5 (IL5). The binding of this protein to IL3 depends on the beta subunit. The beta subunit is activated by the ligand binding, and is required for the biological activities of IL3. This gene and the gene encoding the colony stimulating factor 2 receptor alpha chain (CSF2RA) form a cytokine receptor gene cluster in a X-Y pseudoautosomal region on chromosomes X or Y.
31230	
31231	
31232	The product of this gene belongs to the family of transient receptor potential (TRP) channels. TRP channels are cation-selective channels important for cellular calcium signaling and homeostasis. The protein encoded by this gene mediates calcium entry, and this entry is potentiated by calcium store depletion. Alternatively spliced transcript variants encoding different isoforms have been identified.
31233	
31234	
31235	
31236	
31237	
31238	
31239	
31240	
31241	
31242	
31243	
31244	
31245	
31246	
31247	
31248	
31249	
31250	
31251	
31252	
31253	This gene encodes an alpha chain for one of the low abundance fibrillar collagens. Fibrillar collagen molecules are trimers that can be composed of one or more types of alpha chains. Type V collagen is found in tissues containing type I collagen and appears to regulate the assembly of heterotypic fibers composed of both type I and type V collagen. This gene product is closely related to type XI collagen and it is possible that the collagen chains of types V and XI constitute a single collagen type with tissue-specific chain combinations. Mutations in this gene are associated with Ehlers-Danlos syndrome, types I and II. Two transcripts that differ in the length of the 3@#$%&UTR due to the use of alternative polyadenylation signals have been identified for this gene.
31254	
31255	
31256	
31257	This gene encodes a member of the Ca2+/calmodulin-dependent protein kinase 1 subfamily of serine/threonine kinases. The encoded protein may be involved in the regulation of granulocyte function through the chemokine signal transduction pathway. Alternatively spliced transcript variants encoding different isoforms of this gene have been described.
31258	
31259	
31260	
31261	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31262	
31263	
31264	
31265	
31266	
31267	
31268	
31269	
31270	
31271	
31272	
31273	
31274	
31275	
31276	
31277	This gene encodes the fourth largest subunit of RNA polymerase II, the polymerase responsible for synthesizing messenger RNA in eukaryotes. In yeast, this polymerase subunit is associated with the polymerase under suboptimal growth conditions and may have a stress protective role. A sequence for a ribosomal pseudogene is contained within the 3@#$%& untranslated region of the transcript from this gene.
31278	
31279	Human echinoderm microtubule-associated protein-like is a strong candidate for the Usher syndrome type 1A gene. Usher syndromes (USHs) are a group of genetic disorders consisting of congenital deafness, retinitis pigmentosa, and vestibular dysfunction of variable onset and severity depending on the genetic type. The disease process in USHs involves the entire brain and is not limited to the posterior fossa or auditory and visual systems. The USHs are catagorized as type I (USH1A, USH1B, USH1C, USH1D, USH1E and USH1F), type II (USH2A and USH2B) and type III (USH3). The type I is the most severe form. Gene loci responsible for these three types are all mapped. Two transcript variants encoding different isoforms have been found for this gene.
31280	
31281	
31282	
31283	
31284	
31285	
31286	
31287	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L34E family of ribosomal proteins. It is located in the cytoplasm. This gene originally was thought to be located at 17q21, but it has been mapped to 4q. Transcript variants derived from alternative splicing, alternative transcription initiation sites, and/or alternative polyadenylation exist; these variants encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31288	
31289	
31290	
31291	
31292	
31293	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the S39E family of ribosomal proteins. It is located in the cytoplasm. In rat, the protein is the smallest, and one of the most basic, proteins of the ribosome. This gene is co-transcribed with the U69 small nucleolar RNA gene, which is located in its second intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31294	
31295	
31296	
31297	
31298	
31299	
31300	
31301	
31302	
31303	This gene encodes a telomere specific protein which is a component of the telomere nucleoprotein complex. This protein is present at telomeres throughout the cell cycle and functions as an inhibitor of telomerase, acting in cis to limit the elongation of individual chromosome ends. The protein structure contains a C-terminal Myb motif, a dimerization domain near its N-terminus and an acidic N-terminus. Two transcripts of this gene are alternatively spliced products.
31304	
31305	The product of this gene is a subunit of neuronal microtubule-associated MAP1A and MAP1B proteins, which are involved in microtubule assembly and important for neurogenesis. Studies on the rat homolog implicate a role for this gene in autophagy, a process that involves the bulk degradation of cytoplasmic component.
31306	
31307	
31308	
31309	
31310	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S4P family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, multiple processed pseudogenes derived from this gene are dispersed through the genome.
31311	CD34 is a monomeric cell surface antigen with a molecular mass of approximately 110 kD that is selectively expressed on human hematopoietic progenitor cells.[supplied by OMIM]
31312	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
31313	
31314	
31315	
31316	
31317	
31318	
31319	
31320	
31321	
31322	
31323	
31324	
31325	
31326	
31327	
31328	
31329	
31330	
31331	This gene encodes a UDP-glucuronosyltransferase, an enzyme of the glucuronidation pathway that transforms small lipophilic molecules, such as steroids, bilirubin, hormones, and drugs, into water-soluble, excretable metabolites. This gene is part of a complex locus that encodes several UDP-glucuronosyltransferases. The locus includes thirteen unique alternate first exons followed by four common exons. Four of the alternate first exons are considered pseudogenes. Each of the remaining nine 5@#$%& exons may be spliced to the four common exons, resulting in nine proteins with different N-termini and identical C-termini. Each first exon encodes the substrate binding site, and is regulated by its own promoter. The enzyme encoded by this gene is active on phenols.
31332	
31333	
31334	
31335	
31336	
31337	
31338	
31339	
31340	
31341	EIF2AK4 belongs to a family of kinases that phosphorylate the alpha subunit of eukaryotic translation initiation factor-2 (EIF2S1; MIM 603907) to downregulate protein synthesis in response to varied cellular stresses (Berlanga et al., 1999).[supplied by OMIM]
31342	The leukemia inhibitory factor is a polyfunctional cytokine that affects the differentiation, survival, and proliferation of a wide variety of cells in the adult and the embryo. LIF action appears to be mediated through a high-affinity receptor complex composed of a low-affinity LIF binding chain (LIF receptor) and a high-affinity converter subunit, gp130. Both LIFR and gp130 are members of a family of cytokine receptors that includes components of the receptors for the majority of hematopoietic cytokines and for cytokines that affect other systems, including the ciliary neurotrophic factor, growth hormone and prolactin.
31343	Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a member of the histone H2A family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the small histone gene cluster on chromosome 6p22-p21.3.
31344	
31345	
31346	
31347	
31348	
31349	
31350	
31351	
31352	
31353	
31354	
31355	
31356	
31357	
31358	
31359	
31360	
31361	
31362	
31363	
31364	
31365	
31366	
31367	
31368	
31369	
31370	
31371	Chromosomal protein HMG14 and its close analog HMG17 (MIM 163910) bind to the inner side of the nucleosomal DNA, potentially altering the interaction between the DNA and the histone octamer. The 2 proteins may be involved in the process that maintains transcribable genes in a unique chromatin conformation. Their ubiquitous distribution and relative abundance, as well as the high evolutionary conservation of the DNA-binding domain of the HMG14 family of proteins, suggest that they may be involved in an important cellular function.[supplied by OMIM]
31372	
31373	Chromosomal protein HMG14 and its close analog HMG17 (MIM 163910) bind to the inner side of the nucleosomal DNA, potentially altering the interaction between the DNA and the histone octamer. The 2 proteins may be involved in the process that maintains transcribable genes in a unique chromatin conformation. Their ubiquitous distribution and relative abundance, as well as the high evolutionary conservation of the DNA-binding domain of the HMG14 family of proteins, suggest that they may be involved in an important cellular function.[supplied by OMIM]
31374	
31375	
31376	
31377	
31378	This gene belongs to the RBMY gene family which includes candidate Y chromosome spermatogenesis genes. This gene, an active X chromosome homolog of the Y chromosome RBMY gene, is widely expressed whereas the RBMY gene evolved a male-specific function in spermatogenesis. Pseudogenes of this gene, found on chromosomes 1, 4, 9, 11, and 6, were likely derived by retrotransposition from the original gene. Alternatively spliced transcript variants encoding different isoforms have been identified but their biological nature has not been determined.
31379	
31380	
31381	This gene encodes a member of the serine/threonine protein kinase family. The encoded kinase was identified by its interaction with MAP3K5/ASK, a protein kinase and an activator of c-Jun kinase (MAPK7/JNK) and MAPK14/p38 kinase. This kinase was found to weakly activate MAPK7, but not MAPK1/ERK or MAPK14. An alternatively spliced transcript variant has been found for this gene, but its biological validity has not been determined.
31382	
31383	
31384	
31385	
31386	
31387	
31388	
31389	
31390	
31391	
31392	
31393	
31394	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S4P family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, multiple processed pseudogenes derived from this gene are dispersed through the genome.
31395	
31396	
31397	
31398	
31399	
31400	
31401	
31402	
31403	
31404	
31405	
31406	
31407	
31408	
31409	
31410	
31411	
31412	
31413	
31414	
31415	
31416	
31417	
31418	
31419	
31420	
31421	
31422	
31423	
31424	
31425	The protein encoded by this gene is a member of the F-actin capping protein alpha subunit family. It is the alpha subunit of the barbed-end actin binding protein Cap Z. By capping the barbed end of actin filaments, Cap Z regulates the growth of the actin filaments at the barbed end.
31426	
31427	
31428	
31429	
31430	The mouse homolog of this gene produces fused toes and thymic hyperplasia in heterozygous mutant animals while homozygous mutants die in early development. This gene may play a role in apoptosis as these morphological abnormalities are caused by altered patterns of programmed cell death. The protein encoded by this gene is similar to the ubiquitin ligase domain of other ubiquitin-conjugating enzymes but lacks the conserved cysteine residue that enables those enzymes to conjugate ubiquitin to the target protein. This protein interacts directly with serine/threonine kinase protein kinase B (PKB)/Akt and modulates PKB activity by enhancing the phosphorylation of PKB@#$%&s regulatory sites. Alternative splicing results in two transcript variants encoding the same protein.
31431	
31432	
31433	This gene is a member of the protein-tyrosine kinase oncogene family. It encodes a membrane-associated tyrosine kinase that has been implicated in the control of cell growth. The protein associates with the p85 subunit of phosphatidylinositol 3-kinase and interacts with the fyn-binding protein. Alternatively spliced transcript variants encoding distinct isoforms exist.
31434	
31435	
31436	
31437	
31438	
31439	
31440	
31441	
31442	
31443	
31444	
31445	
31446	
31447	The protein encoded by this gene is highly similar to cytoplasmic polyadenylation element binding protein (CPEB), an mRNA-binding protein that regulates cytoplasmic polyadenylation of mRNA as a trans factor in oogenesis and spermatogenesis. Studies of the similar gene in mice suggested a possible role of this protein in transcriptionally inactive haploid spermatids. Alternatively spliced transcript variants encoding distinct isoforms have been described.
31448	
31449	
31450	
31451	
31452	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L18P family of ribosomal proteins. It is located in the cytoplasm. The protein binds 5S rRNA to form a stable complex called the 5S ribonucleoprotein particle (RNP), which is necessary for the transport of nonribosome-associated cytoplasmic 5S rRNA to the nucleolus for assembly into ribosomes. The protein interacts specifically with the beta subunit of casein kinase II. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. This gene is co-transcribed with the small nucleolar RNA gene U21, which is located in its fifth intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31453	
31454	
31455	
31456	
31457	
31458	
31459	The reciprocal translocation t(1;3)(p36;q21) occurs in a subset of myelodysplastic syndrome (MDS) and acute myeloid leukemia (AML). This gene is located near the 1p36.3 breakpoint and has been shown to be specifically expressed in the t(1:3)(p36,q21)-positive MDS/AML. The protein encoded by this gene is a zinc finger transcription factor and contains an N-terminal PR domain. The translocation results in the overexpression of a truncated version of this protein that lacks the PR domain, which may play an important role in the pathogenesis of MDS and AML. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
31460	SPRED2 is a member of the Sprouty (see SPRY1; MIM 602465)/SPRED family of proteins that regulate growth factor-induced activation of the MAP kinase cascade (see MAPK1; MIM 176948) (Nonami et al., 2004).[supplied by OMIM]
31461	
31462	
31463	
31464	
31465	
31466	Ornithine decarboxylase catalyzes the conversion of ornithine to putrescine in the first and apparently rate-limiting step in polyamine biosynthesis. The ornithine decarboxylase antizymes play a role in the regulation of polyamine synthesis by binding to and inhibiting ornithine decarboxylase. Antizyme expression is auto-regulated by polyamine-enhanced translational frameshifting. In contrast to antizymes 1 and 2, which are widely expressed throughout the body, antizyme 3 transcription is restricted to testis germ cells. Thus antizyme 3, encoded by this gene, is a possible candidate for heritable forms of human male infertility.
31467	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 99% identical to the mouse, rat and sheep orthologs. The encoded protein interacts with IRS1 protein, suggesting a role in regulating insulin sensitivity. Two transcript variants differing in the 5@#$%& UTR, but encoding the same protein, have been identified for this gene.
31468	
31469	
31470	
31471	
31472	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent  process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the peptidase T1A family, that is a 20S core alpha subunit.
31473	
31474	
31475	
31476	
31477	
31478	
31479	The nuclear ALAS2 gene specifies an erythroid-specific mitochondrially located enzyme.  The ALAS2 gene product catalyzes the first step in the heme biosynthetic pathway.  A second delta-aminolevulinate synthase gene (ALAS1)is located on chromosome 3 and is expressed in all tissues. A defective ALAS2 gene causes X-linked pyridoxine-responsive sideroblastic anemia (Hypochromic Anemia).
31480	This gene encodes a protein that is part of a post-splicing multiprotein complex involved in both mRNA nuclear export and mRNA surveillance. The encoded protein is one of two functional homologs to yeast Upf3p. mRNA surveillance detects exported mRNAs with truncated open reading frames and initiates nonsense-mediated mRNA decay (NMD). When translation ends upstream from the last exon-exon junction, this triggers NMD to degrade mRNAs containing premature stop codons. This protein binds to the mRNA and remains bound after nuclear export, acting as a nucleocytoplasmic shuttling protein. It forms with Y14 a complex that binds specifically 20 nt upstream of exon-exon junctions. This gene is located on the long arm of chromosome 13. Two splice variants encoding different isoforms have been found for this gene.
31481	
31482	
31483	
31484	
31485	
31486	
31487	
31488	
31489	
31490	
31491	
31492	
31493	
31494	
31495	
31496	
31497	
31498	
31499	
31500	
31501	This gene encodes a protein containing four ankyrin repeat domains in its C-terminus. The suggested role for this protein is in tumorigenesis of renal cell carcinoma. Two alternatively spliced transcript variants encoding different isoforms have been identified.
31502	
31503	
31504	
31505	
31506	
31507	
31508	
31509	
31510	The members of the zinc fingers and homeoboxes gene family are nuclear homodimeric transcriptional repressors that interact with the A subunit of nuclear factor-Y (NF-YA) and contain two C2H2-type zinc fingers and five homeobox DNA-binding domains. This gene encodes member 2 of this gene family. In addition to forming homodimers, this protein heterodimerizes with member 1 of the zinc fingers and homeoboxes family.
31511	
31512	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem intracytoplasmic catalytic domains, and thus represents a receptor-type PTP. The extracellular region of this PTP contains a carbonic anhydrase-like (CAH) domain, which is also found in the extracellular region of PTPRBETA/ZETA. This gene is located in a chromosomal region that is frequently deleted in renal cell carcinoma and lung carcinoma, thus is thought to be a candidate tumor suppressor gene.
31513	
31514	
31515	
31516	
31517	
31518	
31519	
31520	
31521	Guanine nucleotide-binding proteins are a family of heterotrimeric proteins that couple cell surface, 7-transmembrane domain receptors to intracellular signaling pathways. Receptor activation catalyzes the exchange of GTP for GDP bound to the inactive G protein alpha subunit resulting in a conformational change and dissociation of the complex. The G protein alpha and beta-gamma subunits are capable of regulating various cellular effectors. Activation is terminated by a GTPase intrinsic to the G-alpha subunit. G-alpha-q is the alpha subunit of one of the heterotrimeric GTP-binding proteins that mediates stimulation of phospholipase C-beta (MIM 600230).[supplied by OMIM]
31522	
31523	
31524	
31525	
31526	
31527	
31528	
31529	
31530	The protein encoded by this gene is a nuclear transcription factor which binds DNA as a homodimer. The encoded protein controls the expression of several genes, including hepatocyte nuclear factor 1 alpha, a transcription factor which regulates the expression of several hepatic genes. This gene may play a role in development of the liver, kidney, and intestines. Mutations in this gene have been associated with monogenic autosomal dominant non-insulin-dependent diabetes mellitus type I. At least three different transcript variants encoding three different isoforms have been found for this gene.
31531	
31532	
31533	
31534	
31535	The protein encoded by this gene is highly similar to Saccharomyces cerevisiae Cdc26, a component of cell cycle anaphase-promoting complex (APC). APC is composed of a group of highly conserved proteins and functions as a cell cycle-regulated ubiquitin-protein ligase. APC thus is responsible for the cell cycle regulated proteolysis of various proteins.
31536	
31537	
31538	The protein encoded by this gene is a member of a small family of proteins which bind single stranded DNA/RNA. These proteins are characterized by the presence of two sets of ribonucleoprotein consensus sequence (RNP-CS) that contain conserved motifs, RNP1 and RNP2, originally described in RNA binding proteins, and required for DNA binding. These proteins have been implicated in such diverse functions as DNA replication, gene transcription, cell cycle progression and apoptosis. The encoded protein was isolated by virtue of its binding to an upstream element of the alpha2(I) collagen promoter. The observation that this protein localizes mostly in the cytoplasm suggests that it may be involved in a cytoplasmic function such as controlling RNA metabolism, rather than transcription. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
31539	
31540	
31541	
31542	
31543	
31544	
31545	
31546	
31547	
31548	This gene encodes a protein that is a member of the SUMO (small ubiquitin-like modifier) protein family. It functions in a manner similar to ubiquitin in that it is bound to target proteins as part of a post-translational modification system. However, unlike ubiquitin which targets proteins for degradation, this protein is involved in a variety of cellular processes, such as nuclear transport, transcriptional regulation, apoptosis, and protein stability. It is not active until the last two amino acids of the carboxy-terminus have been cleaved off. Numerous pseudogenes have been reported for this gene. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
31549	
31550	
31551	
31552	
31553	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene contains a RNA binding domain and scaffold-associated region (SAR)-specific bipartite DNA-binding domain. This protein is also thought to be involved in the packaging of hnRNA into large ribonucleoprotein complexes. During apoptosis, this protein is cleaved in a caspase-dependent way. Cleavage occurs at the SALD site, resulting in a loss of DNA-binding activity and a concomitant detachment of this protein from nuclear structural sites. But this cleavage does not affect the function of the encoded protein in RNA metabolism. Two alternatively spliced transcript variants have been described for this gene.
31554	Formin homology (FH) domain proteins (e.g., FMN; MIM 136535) play a role in cytoskeletal organization and/or establishment of cell polarity.[supplied by OMIM]
31555	
31556	This gene is a member of the plant homeodomain (PHD)-like finger (PHF) family. It encodes a protein with two PHD-type zinc finger domains, indicating a potential role in transcriptional regulation, that localizes to the nucleolus. Mutations affecting the coding region of this gene or the splicing of the transcript have been associated with Borjeson-Forssman-Lehmann syndrome (BFLS), a disorder characterized by mental retardation, epilepsy, hypogonadism, hypometabolism, obesity, swelling of subcutaneous tissue of the face, narrow palpebral fissures, and large ears. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
31557	
31558	
31559	This gene encodes one of three enolase isoenzymes found in mammals; it encodes alpha-enolase, a homodimeric soluble enzyme, and also encodes a shorter monomeric structural lens protein, tau-crystallin. The two proteins are made from the same message. The full length protein, the isoenzyme, is found in the cytoplasm. The shorter protein is produced from an alternative translation start, is localized to the nucleus, and has been found to bind to an element in the c-myc promoter. A pseudogene has been identified that is located on the other arm of the same chromosome.
31560	
31561	
31562	
31563	This gene encodes a member of the DNAJ protein family. DNAJ family members are characterized by a highly conserved amino acid stretch called the @#$%&J-domain@#$%& and function as one of the two major classes of molecular chaperones involved in a wide range of cellular events, such as protein folding and oligomeric protein complex assembly. This family member may also play a role in polyglutamine aggregation in specific neurons. Alternative splicing of this gene results in multiple transcript variants; however, not all variants have been fully described.
31564	
31565	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem intracytoplasmic catalytic domains, and thus represents a receptor-type PTP. The extracellular region of this PTP contains a carbonic anhydrase-like (CAH) domain, which is also found in the extracellular region of PTPRBETA/ZETA. This gene is located in a chromosomal region that is frequently deleted in renal cell carcinoma and lung carcinoma, thus is thought to be a candidate tumor suppressor gene.
31566	
31567	
31568	
31569	
31570	
31571	
31572	
31573	
31574	
31575	
31576	
31577	
31578	
31579	
31580	
31581	
31582	
31583	
31584	
31585	
31586	
31587	
31588	
31589	
31590	
31591	
31592	
31593	
31594	
31595	
31596	
31597	
31598	
31599	
31600	Lactate dehydrogenase A catalyzes the conversion of L-lactate and NAD to pyruvate and NADH in the final step of anaerobic glycolysis.  LDHA is found predominantly in muscle tissue and belongs to the lactate dehydrogenase family.  Mutations in LDHA have been linked to exertional myoglobinuria.
31601	
31602	Cyclin-dependent kinases (CDKs), which play an essential role in cell cycle control of eukaryotic cells, are phosphorylated and thus activated by the CDK-activating kinase (CAK). CAK is a multisubunit protein that includes CDK7 (MIM 601955), cyclin H (CCNH; MIM 601953), and MAT1. MAT1 (for @#$%&menage a trois-1@#$%&) is involved in the assembly of the CAK complex.[supplied by OMIM]
31603	The protein encoded by this gene belongs to the Ser/Thr family of protein kinases and HIPK subfamily. It phosphorylates homeodomain transcription factors and may also function as a co-repressor for homeodomain transcription factors. Alternative splicing results in four transcript variants encoding four distinct isoforms.
31604	
31605	
31606	
31607	
31608	
31609	
31610	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. This encoded protein is a cell surface antigen and is highly expressed in different carcinomas.
31611	
31612	
31613	
31614	
31615	
31616	
31617	
31618	
31619	
31620	
31621	
31622	
31623	
31624	
31625	
31626	
31627	
31628	
31629	
31630	
31631	
31632	
31633	
31634	
31635	This gene encodes a mitochondrial transcription termination factor. This protein participates in attenuating transcription from the mitochondrial genome; this attenuation allows higher levels of expression of 16S ribosomal RNA relative to the tRNA gene downstream. The product of this gene has three leucine zipper motifs bracketed by two basic domains that are all required for DNA binding. There is evidence that, for this protein, the zippers participate in intramolecular interactions that establish the three-dimensional structure required for DNA binding.
31636	The protein encoded by this gene belongs to the claudin family of tight junction associated proteins and is a major component of central nervous system myelin that is necessary for normal CNS function. There is growing evidence that the protein determines the permeability between layers of myelin sheaths via focal adhesion and, with its expression highly regulated during development, may play an important role in cellular proliferation and migration. In addition, the protein is a candidate autoantigen in the development of autoimmune demyelinating disease.
31637	
31638	
31639	
31640	
31641	
31642	A reciprocal translocation between chromosomes 22 and 9 produces the Philadelphia chromosome, which is often found in patients with chronic myelogenous leukemia. The chromosome 22 breakpoint for this translocation is located within the BCR gene. The translocation produces a fusion protein which is encoded by sequence from both BCR and ABL, the gene at the chromosome 9 breakpoint. Although the BCR-ABL fusion protein has been extensively studied, the function of the normal BCR gene product is not clear. The protein has serine/threonine kinase activity and is a GTPase-activating protein for p21rac.
31643	
31644	
31645	
31646	
31647	
31648	
31649	
31650	
31651	
31652	
31653	
31654	
31655	
31656	
31657	The protein encoded by this gene is one polypeptide of a small nuclear ribonucleoprotein complex and belongs to the snRNP SMB/SMN family. The protein plays a role in pre-mRNA processing, possibly tissue-specific alternative splicing events. Although individual snRNPs are believed to recognize specific nucleic acid sequences through RNA-RNA base pairing, the specific role of this family member is unknown. The protein arises from a bicistronic transcript that also encodes a protein identified as the SNRPN upstream reading frame (SNURF). Multiple transcription initiation sites have been identified and extensive alternative splicing occurs in the 5@#$%& untranslated region. Additional splice variants have been described but sequences for the complete transcripts have not been determined. The 5@#$%& UTR of this gene has been identified as an imprinting center. Alternative splicing or deletion caused by a translocation event in this paternally-expressed region is responsible for Angelman syndrome or Prader-Willi syndrome due to parental imprint switch failure.
31658	
31659	Mice homozygous for the rd mutation display hereditary retinal degeneration which has been considered a model for human retinitis pigmentosa. In affected animals, the retinal rod photoreceptor cells begin degenerating at about postnatal day 8, and by 4 weeks no photoreceptors are left. Farber and Lolley (1974, 1976) showed that degeneration is preceded by accumulation of cyclic GMP in the retina and is correlated with deficient activity of the rod photoreceptor cGMP-phosphodiesterase. Bennett et al. (1996) tested the possibility of altering the course of retinal degeneration through subretinal injection of recombinant replication defective adenovirus that contained the murine cDNA for wildtype beta-PDE. Subretinal injection of rd mice was carried out 4 days after birth, before the onset of rd retinal degeneration. Following therapy, beta-PDE transcripts and enzyme activity were detected, and histologic studies revealed that photoreceptor cell death was significantly retarded.[supplied by OMIM]
31660	MaxiK channels are large conductance, voltage and calcium-sensitive potassium channels which are fundamental to the control of smooth muscle tone and neuronal excitability. MaxiK channels can be formed by 2 subunits: the pore-forming alpha subunit, which is the product of this gene, and the modulatory beta subunit. Intracellular calcium regulates the physical association between the alpha and beta subunits. Alternatively spliced transcript variants encoding different isoforms have been identified.
31661	
31662	
31663	
31664	
31665	
31666	
31667	
31668	
31669	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 99% identical to the mouse, rat and sheep orthologs. The encoded protein interacts with IRS1 protein, suggesting a role in regulating insulin sensitivity. Two transcript variants differing in the 5@#$%& UTR, but encoding the same protein, have been identified for this gene.
31670	
31671	
31672	
31673	
31674	
31675	
31676	
31677	
31678	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation.
31679	
31680	
31681	
31682	
31683	
31684	
31685	HLA-A belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. Class I molecules play a central role in the immune system by presenting peptides derived from the endoplasmic reticulum lumen. They are expressed in nearly all cells. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail. Polymorphisms within exon 2 and exon 3 are responsible for the peptide binding specificity of each class one molecule. Typing for these polymorphisms is routinely done for bone marrow and kidney transplantation. Hundreds of HLA-A alleles have been described.
31686	The Sec61 complex is the central component of the protein translocation apparatus of the endoplasmic reticulum (ER) membrane. Oligomers of the Sec61 complex form a transmembrane channel where proteins are translocated across and integrated into the ER membrane. This complex consists of three membrane proteins- alpha, beta, and gamma. This gene encodes the beta-subunit protein. The Sec61 subunits are also observed in the post-ER compartment, suggesting that these proteins can escape the ER and recycle back. There is evidence for multiple polyadenylated sites for this transcript.
31687	
31688	
31689	
31690	
31691	The protein encoded by this gene is a ubiquitous actin monomer-binding protein belonging to the profilin family. It is thought to regulate actin polymerization in response to extracellular signals. Deletion of this gene is associated with Miller-Dieker syndrome.
31692	
31693	
31694	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31695	
31696	
31697	
31698	
31699	
31700	
31701	HIV-1, the causative agent of acquired immunodeficiency syndrome (AIDS), contains an RNA genome that produces a chromosomally integrated DNA during the replicative cycle. Activation of HIV-1 gene expression by the transactivator Tat is dependent on an RNA regulatory element (TAR) located downstream of the transcription initiation site. The protein encoded by this gene is a transcriptional repressor that binds to chromosomally integrated TAR DNA and represses HIV-1 transcription. In addition, this protein regulates alternate splicing of the CFTR gene. A similar pseudogene is present on chromosome 20.
31702	
31703	
31704	
31705	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L37E family of ribosomal proteins. It is located in the cytoplasm. The protein contains a C2C2-type zinc finger-like motif. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31706	
31707	
31708	
31709	
31710	
31711	
31712	
31713	
31714	
31715	
31716	
31717	
31718	
31719	
31720	
31721	The membrane-associated protein encoded by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters.  ABC proteins transport various molecules across extra- and intra-cellular membranes.  ABC genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, GCN20, White).  This protein is a member of the MDR/TAP subfamily.  Members of the MDR/TAP subfamily are involved in multidrug resistance.  The function of this mitochondrial protein is unknown.
31722	
31723	
31724	
31725	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
31726	
31727	
31728	
31729	
31730	
31731	
31732	
31733	The protein encoded by this gene belongs to the stathmin/oncoprotein 18 family of microtubule-destabilizing phosphoproteins. It is similar to the SCG10 protein and is involved in signal transduction and regulation of microtubule dynamics.
31734	
31735	
31736	The Fanconi anemia complementation group (FANC) currently includes FANCA, FANCB, FANCC, FANCD1 (also called BRCA2), FANCD2, FANCE, FANCF, FANCG, and FANCL. Fanconi anemia is a genetically heterogeneous recessive disorder characterized by cytogenetic instability, hypersensitivity to DNA crosslinking agents, increased chromosomal breakage, and defective DNA repair. The members of the Fanconi anemia complementation group do not share sequence similarity; they are related by their assembly into a common nuclear protein complex. This gene encodes the protein for complementation group D2. This protein is monoubiquinated in response to DNA damage, resulting in its localization to nuclear foci with other proteins (BRCA1 AND BRCA2) involved in homology-directed DNA repair. Alternative splicing results in two transcript variants encoding different isoforms.
31737	This zinc finger gene has been identified at the breakpoints of a recurrent chromosomal translocation reported in endometrial stromal sarcoma. Recombination of these breakpoints results in the fusion of this gene and JAZF1. The protein encoded by this gene contains a zinc finger domain in the C terminus of the coding region. The specific function of this gene has not yet been determined.
31738	
31739	The protein encoded by this gene has sequence similarity with members of the WD40 repeat-containing protein family. The WD40 group is a large family of proteins, which appear to have a regulatory function. It is believed that the WD40 repeats mediate protein-protein interactions and members of the family are involved in signal transduction, RNA processing, gene regulation, vesicular trafficking, cytoskeletal assembly and may play a role in the control of cytotypic differentiation.
31740	
31741	
31742	
31743	
31744	
31745	
31746	
31747	
31748	Ubiquitin, a highly conserved protein that has a major role in targeting cellular proteins for degradation by the 26S proteosome, is synthesized as a precursor protein consisting of either polyubiquitin chains or a single ubiquitin fused to an unrelated protein. This gene encodes a fusion protein consisting of ubiquitin at the N terminus and ribosomal protein S27a at the C terminus. When expressed in yeast, the protein is post-translationally processed, generating free ubiquitin monomer and ribosomal protein S27a. Ribosomal protein S27a is a component of the 40S subunit of the ribosome and belongs to the S27AE family of ribosomal proteins. It contains C4-type zinc finger domains and is located in the cytoplasm. Pseudogenes derived from this gene are present in the genome. As with ribosomal protein S27a, ribosomal protein L40 is also synthesized as a fusion protein with ubiquitin; similarly, ribosomal protein S30 is synthesized as a fusion protein with the ubiquitin-like protein fubi.
31749	
31750	
31751	
31752	
31753	
31754	
31755	
31756	
31757	
31758	
31759	
31760	
31761	RDH10 generates all-trans retinal from all-trans retinol and may plan an important role in the photic visual cycle. All-trans retinal is isomerized to 11-cis retinal by the retinal G protein-coupled receptor (RGR; MIM 600342) when the retinal pigment epithelium (RPE) is illuminated.[supplied by OMIM]
31762	
31763	
31764	The PRKDC gene encodes the catalytic subunit of a nuclear DNA-dependent serine/threonine protein kinase (DNA-PK). The second component is the autoimmune antigen Ku (MIM 152690), which is encoded by the G22P1 gene on chromosome 22q. On its own, the catalytic subunit of DNA-PK is inactive and relies on the G22P1 component to direct it to the DNA and trigger its kinase activity; PRKDC must be bound to DNA to express its catalytic properties.[supplied by OMIM]
31765	
31766	
31767	This gene encodes a member of the chaperonin family. The encoded mitochondrial protein may function as a signaling molecule in the innate immune system. This protein is essential for the folding and assembly of newly imported proteins in the mitochondria. This gene is adjacent to a related family member and the region between the 2 genes functions as a bidirectional promoter. Two pseudogenes, both located on chromosome 8, have been associated with this gene. Two transcript variants encoding the same protein have been identified for this gene. Mutations associated with this gene cause autosomal recessive spastic paraplegia 13.
31768	
31769	
31770	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L29P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31771	
31772	
31773	
31774	
31775	Gamma-aminobutyric acid (GABA), the major inhibitory neurotransmitter in the brain, mediates neuronal inhibition by binding to GABA receptors. The type A GABA receptors are pentameric chloride channels assembled from among many genetic variants of GABA(A) subunits. This gene encodes the gamma 2 subunit of GABA(A) receptor. Mutations in this gene have been associated with epilepsy and febrile seizures. Alternative splicing of this gene results in transcript variants encoding different isoforms.
31776	
31777	
31778	
31779	
31780	
31781	
31782	Vesicle mediated protein sorting plays an important role in segregation of intracellular molecules into distinct organelles. Genetic studies in yeast have identified more than 40 vacuolar protein sorting (VPS) genes involved in vesicle transport to vacuoles. This gene encodes the human ortholog of yeast Vps41 protein which is also conserved in Drosophila, tomato, and Arabidopsis. Expression studies in yeast and human indicate that this protein may be involved in the formation and fusion of transport vesicles from the Golgi. Several transcript variants encoding different isoforms have been described for this gene, however, the full-length nature of not all is known.
31783	
31784	
31785	
31786	
31787	
31788	
31789	
31790	The protein encoded by this gene is a member of the TNF-receptor superfamily. This receptor has been shown to activate NF-kappaB through its interaction with adaptor proteins TRAF2 and TRAF5. Knockout studies in mice suggested that this receptor promotes the expression of apoptosis inhibitors BCL2 and BCL2lL1/BCL2-XL, and thus suppresses apoptosis. The knockout studies also suggested the roles of this receptor in CD4+ T cell response, as well as in T cell-dependent B cell proliferation and differentiation.
31791	This gene encodes a member of the 1-acylglycerol-3-phosphate O-acyltransferase family. The protein is located within the endoplasmic reticulum membrane and converts lysophosphatidic acid to phosphatidic acid, the second step in de novo phospholipid biosynthesis. Mutations in this gene have been associated with congenital generalized lipodystrophy (CGL), or Berardinelli-Seip syndrome, a disease characterized by a near absence of adipose tissue and severe insulin resistance. Alternate transcriptional splice variants, encoding different isoforms, have been characterized.
31792	
31793	
31794	
31795	
31796	
31797	
31798	
31799	Alpha-1-adrenergic receptors (alpha-1-ARs) are members of the G protein-coupled receptor superfamily. They activate mitogenic responses and regulate growth and proliferation of many cells. There are 3 alpha-1-AR subtypes: alpha-1A, -1B and -1D, all of which signal through the Gq/11 family of G-proteins and different subtypes show different patterns of activation. This gene encodes alpha-1B-adrenergic receptor, which induces neoplastic transformation when transfected into NIH 3T3 fibroblasts and other cell lines. Thus, this normal cellular gene is identified as a protooncogene. This gene comprises 2 exons and a single large intron of at least 20 kb that interrupts the coding region.
31800	
31801	
31802	
31803	
31804	
31805	
31806	
31807	
31808	The protein encoded by this gene is a member of the chromogranin/secretogranin family of neuroendocrine secretory proteins. It is found in secretory vesicles of neurons and endocrine cells. This gene product is a precursor to three biologically active peptides; vasostatin, pancreastatin, and parastatin. These peptides act as autocrine or paracrine negative modulators of the neuroendocrine system. Other peptides, including chromostatin, beta-granin, WE-14 and GE-25, are also derived from the full-length protein. However, biological activities for these molecules have not been shown.
31809	
31810	
31811	This gene belongs to the protocadherin gene family, a subfamily of the cadherin superfamily. The mRNA encodes a cadherin-related neuronal receptor that localizes to synaptic junctions and is putatively involved in specific neuronal connections and signal transduction. Sharing a characteristic with other protocadherin genes, this gene has a notably large exon that encodes 6 cadherin domains and a transmembrane region. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
31812	
31813	
31814	
31815	
31816	
31817	
31818	
31819	
31820	
31821	
31822	
31823	
31824	
31825	This gene encodes a member of the PUF family. The encoded protein contains a sequence-specific RNA binding domain comprised of eight repeats and N- and C-terminal flanking regions, and serves as a translational regulator of specific mRNAs by binding to their 3@#$%& untranslated regions. The evolutionarily conserved function of the encoded protein in invertebrates and lower vertebrates suggests that the human protein may be involved in translational regulation of embryogenesis, and cell development and differentiation. Alternatively spliced transcript variants encoding different isoforms have been described.
31826	
31827	
31828	
31829	
31830	
31831	
31832	
31833	
31834	
31835	
31836	
31837	
31838	
31839	
31840	
31841	
31842	
31843	
31844	This gene encodes a H1 histone binding protein that is involved in transporting histones into the nucleus of dividing cells. Multiple isoforms are encoded by transcript variants of this gene. The somatic form is expressed in all mitotic cells, is localized to the nucleus, and is coupled to the cell cycle. The testicular form is expressed in embryonic tissues, tumor cells, and the testis. In male germ cells, this protein is localized to the cytoplasm of primary spermatocytes, the nucleus of spermatids, and the periacrosomal region of mature spermatozoa.
31845	
31846	
31847	
31848	
31849	
31850	
31851	
31852	
31853	
31854	
31855	
31856	Transaldolase 1 is a key enzyme of the nonoxidative pentose phosphate pathway providing ribose-5-phosphate for nucleic acid synthesis and NADPH for lipid biosynthesis. This pathway can also maintain glutathione at a reduced state and thus protect sulfhydryl groups and cellular integrity from oxygen radicals. The functional gene of transaldolase 1 is located on chromosome 11 and a pseudogene is identified on chromosome 1 but there are conflicting map locations. The second and third exon of this gene were developed by insertion of a retrotransposable element. This gene is thought to be involved in multiple sclerosis.
31857	
31858	
31859	
31860	
31861	
31862	
31863	This gene encodes class V alcohol dehydrogenase, which is a member of the alcohol dehydrogenase family. Members of this family metabolize a wide variety of substrates, including ethanol, retinol, other aliphatic alcohols, hydroxysteroids, and lipid peroxidation products. This gene is expressed in the stomach as well as in the liver, and it contains a glucocorticoid response element upstream of its 5@#$%& UTR, which is a steroid hormone receptor binding site. The deduced amino acid sequence of the open reading frame of this gene shows about 60% positional identity with other known alcohol dehydrogenases. This gene may have a distinct physiologic function.
31864	
31865	
31866	
31867	
31868	
31869	
31870	
31871	
31872	
31873	
31874	
31875	The protein encoded by this gene is a type I membrane receptor for group B coxsackieviruses and subgroup C adenoviruses. Pseudogenes of this gene are found on chromosomes 15, 18, and 21.
31876	
31877	The protein encoded by this gene was identified by its RED repeat, a stretch of repeated arginine, glutamic acid and aspartic acid residues. The protein localizes to discrete dots within the nucleus, excluding the nucleolus. Its function is unknown. This gene maps to chromosome 5; however, a pseudogene may exist on chromosome 2.
31878	
31879	The protein encoded by this gene is a death domain (CARD/DD)-containing protein and has been shown to induce cell apoptosis. Through its CARD domain, this protein interacts with, and thus recruits, caspase 2/ICH1 to the cell death signal transduction complex that includes tumor necrosis factor receptor 1 (TNFR1A), RIPK1/RIP kinase, and numbers of other CARD domain-containing proteins.
31880	
31881	
31882	This gene is a member of the MAGEA gene family. The members of this family encode proteins with 50 to 80% sequence identity to each other. The promoters and first exons of the MAGEA genes show considerable variability, suggesting that the existence of this gene family enables the same function to be expressed under different transcriptional controls. The MAGEA genes are clustered at chromosomal location Xq28. They have been implicated in some hereditary disorders, such as dyskeratosis congenita.
31883	
31884	
31885	
31886	
31887	
31888	
31889	
31890	
31891	
31892	
31893	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. The function of this gene has not yet been determined. The expression pattern of the mouse homolog implies a role in nervous system development.
31894	
31895	
31896	
31897	
31898	
31899	
31900	
31901	
31902	
31903	
31904	
31905	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L14P family of ribosomal proteins. It is located in the cytoplasm. This gene has been referred to as rpL17 because the encoded protein shares amino acid identity with ribosomal protein L17 from Saccharomyces cerevisiae; however, its official symbol is RPL23. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31906	
31907	
31908	
31909	
31910	
31911	
31912	
31913	
31914	
31915	
31916	
31917	
31918	
31919	
31920	
31921	Polyspecific organic cation transporters in the liver, kidney, intestine, and other organs are critical for elimination of many endogenous small organic cations as well as a wide array of drugs and environmental toxins. This gene is one of three similar cation transporter genes located in a cluster on chromosome 6. The encoded protein contains twelve putative transmembrane domains and is a plasma integral membrane protein.
31922	
31923	
31924	
31925	
31926	
31927	Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. It is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, F0, which comprises the proton channel. The F1 complex consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled in a ratio of 3 alpha, 3 beta, and a single representative of the other 3. The F0 seems to have nine subunits (a, b, c, d, e, f, g, F6 and 8). This gene encodes the F6 subunit of the F0 complex, required for F1 and F0 interactions. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
31928	
31929	
31930	
31931	
31932	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L30P family of ribosomal proteins. It contains an N-terminal basic region-leucine zipper (BZIP)-like domain and the RNP consensus submotif RNP2. In vitro the BZIP-like domain mediates homodimerization and stable binding to DNA and RNA, with a preference for 28S rRNA and mRNA. The protein can inhibit cell-free translation of mRNAs, suggesting that it plays a regulatory role in the translation apparatus. It is located in the cytoplasm. The protein has been shown to be an autoantigen in patients with systemic autoimmune diseases, such as systemic lupus erythematosus. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
31933	
31934	
31935	
31936	
31937	
31938	
31939	
31940	
31941	The protein encoded by this gene is a receptor for activated C-kinase (RACK) protein. The encoded protein has been shown to bind in vitro to activated protein kinase C beta I. In addition, this protein is a cutaneous T-cell lymphoma-associated antigen. Finally, the protein contains a bromodomain and two zinc fingers, and is thought to be a transcriptional regulator. Multiple transcript variants encoding several different isoforms have been found for this gene.
31942	
31943	
31944	
31945	
31946	This gene encodes subunit 1 of the splicing factor 3b protein complex. Splicing factor 3b, together with splicing factor 3a and a 12S RNA unit, forms the U2 small nuclear ribonucleoproteins complex (U2 snRNP). The splicing factor 3b/3a complex binds pre-mRNA upstream of the intron@#$%&s branch site in a sequence independent manner and may anchor the U2 snRNP to the pre-mRNA. Splicing factor 3b is also a component of the minor U12-type spliceosome. The carboxy-terminal two-thirds of subunit 1 have 22 non-identical, tandem HEAT repeats that form rod-like, helical structures. Alternative splicing results in multiple transcript variants encoding different isoforms.
31947	
31948	
31949	
31950	The protein encoded by this gene contains a catalytic domain of flavin adenine dinucleotide synthetase, which catalyzes the adenylation of flavin mononucleotide (FMN) to form flavin adenine dinucleotide (FAD) coenzyme. Two alternatively spliced transcript variants encoding distinct isoforms have been observed.
31951	
31952	
31953	
31954	
31955	
31956	
31957	
31958	
31959	
31960	
31961	
31962	
31963	
31964	
31965	
31966	
31967	The collagens are a superfamily of proteins that play a role in maintaining the integrity of various tissues. Collagens are extracellular matrix proteins and have a triple-helical domain as their common structural element. Collagen VI is a major structural component of microfibrils. The basic structural unit of collagen VI is a heterotrimer of the alpha1(VI), alpha2(VI), and alpha3(VI) chains. The alpha2(VI) and alpha3(VI) chains are encoded by the COL6A2 and COL6A3 genes, respectively. The protein encoded by this gene is the alpha 1 subunit of type VI collagen (alpha1(VI) chain). Mutations in the genes that code for the collagen VI subunits result in the autosomal dominant disorder, Bethlem myopathy.
31968	
31969	
31970	Olfactory receptors interact with odorant molecules in the nose, to initiate a neuronal response that triggers the perception of a smell. The olfactory receptor proteins are members of a large family of G-protein-coupled receptors (GPCR) arising from single coding-exon genes. Olfactory receptors share a 7-transmembrane domain structure with many neurotransmitter and hormone receptors and are responsible for the recognition and G protein-mediated transduction of odorant signals. The olfactory receptor gene family is the largest in the genome. The nomenclature assigned to the olfactory receptor genes and proteins for this organism is independent of other organisms.
31971	
31972	
31973	
31974	
31975	
31976	
31977	
31978	
31979	
31980	
31981	
31982	
31983	
31984	
31985	
31986	
31987	
31988	
31989	
31990	This gene is a member of the protocadherin gamma gene cluster, one of three related clusters tandemly linked on chromosome five. These gene clusters have an immunoglobulin-like organization, suggesting that a novel mechanism may be involved in their regulation and expression. The gamma gene cluster includes 22 genes divided into 3 subfamilies. Subfamily A contains 12 genes, subfamily B contains 7 genes and 2 pseudogenes, and the more distantly related subfamily C contains 3 genes. The tandem array of 22 large, variable region exons are followed by a constant region, containing 3 exons shared by all genes in the cluster. Each variable region exon encodes the extracellular region, which includes 6 cadherin ectodomains and a transmembrane region. The constant region exons encode the common cytoplasmic region. These neural cadherin-like cell adhesion proteins most likely play a critical role in the establishment and function of specific cell-cell connections in the brain. Alternative splicing has been described for the gamma cluster genes.
31991	
31992	
31993	
31994	
31995	
31996	
31997	This gene belongs to the VCX/Y gene family, which has multiple members on both X and Y chromosomes, and all are expressed exclusively in male germ cells. The X-linked members are clustered on chromosome Xp22 and Y-linked members are two identical copies of the gene within a palindromic region on Yq11. The family members share a high degree of sequence identity, with the exception that a 30-bp unit is tandemly repeated in X-linked members but occurs only once in Y-linked members. The VCX gene cluster is polymorphic in terms of copy number; different individuals may have a different number of VCX genes. VCX/Y genes encode small and highly charged proteins of unknown function. The presence of a putative bipartite nuclear localization signal suggests that VCX/Y members are nuclear proteins. This gene contains 2 repeats of the 30-bp unit.
31998	
31999	
32000	
32001	
32002	
32003	
32004	
32005	
32006	
32007	
32008	
32009	
32010	
32011	Ankyrins are a family of proteins that are believed to link the integral membrane proteins to the underlying spectrin-actin cytoskeleton and play key roles in activities such as cell motility, activation, proliferation, contact, and the maintenance of specialized membrane domains. Multiple isoforms of ankyrin with different affinities for various target proteins are expressed in a tissue-specific, developmentally regulated manner. Most ankyrins are typically composed of three structural domains: an amino-terminal domain containing multiple ankyrin repeats; a central region with a highly conserved spectrin binding domain; and a carboxy-terminal regulatory domain which is the least conserved and subject to variation. Ankyrin 3 is an immunologically distinct gene product from ankyrins 1 and 2, and was originally found at the axonal initial segment and nodes of Ranvier of neurons in the central and peripheral nervous systems. Alternatively spliced variants may be expressed in other tissues. Although multiple transcript variants encoding several different isoforms have been found for this gene, the full-length nature of only two have been characterized.
32012	
32013	
32014	
32015	
32016	The evolutionarily conserved THRAP genes encode a family of proteins that regulate embryonic development. THRAP2 is involved in early development of the heart and brain (Muncke et al., 2003).[supplied by OMIM]
32017	
32018	
32019	
32020	
32021	
32022	
32023	
32024	
32025	
32026	
32027	
32028	
32029	
32030	
32031	
32032	
32033	
32034	This gene encodes a WD repeat-containing protein. It is abundantly expressed in retina and testis. Alternatively spliced transcript variants have been found for this gene, and they encode distinct isoforms.
32035	
32036	
32037	
32038	
32039	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has three repeats of quasi-RRM domains that bind to RNAs. It is very similar to the family member HNRPF. This gene is thought to be potentially involved in hereditary lymphedema type I phenotype.
32040	
32041	
32042	This gene encodes the heavy subunit of the ferritin protein. Ferritin is the major intracellular iron storage protein in prokaryotes and eukaryotes. It is composed of 24 subunits of the heavy and light ferritin chains. Variation in ferritin subunit composition may affect the rates of iron uptake and release in different tissues. A major function of ferritin is the storage of iron in a soluble and nontoxic state. Defects in ferritin proteins are associated with several neurodegenerative diseases. This gene has multiple pseudogenes. Several alternatively spliced transcript variants have been observed, but their biological validity has not been determined.
32043	
32044	
32045	
32046	
32047	
32048	
32049	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
32050	
32051	In eukaryotic cells, DNA polymerase beta (POLB) performs base excision repair (BER) required for DNA maintenance, replication, recombination, and drug resistance. Also see POLA (MIM 312040).[supplied by OMIM]
32052	
32053	
32054	
32055	
32056	
32057	
32058	
32059	
32060	
32061	
32062	
32063	Sodium bicarbonate cotransporters (NBCs) mediate the coupled movement of sodium and bicarbonate ions across the plasma membrane of many cells. This is an electrogenic process with an apparent stoichiometry of 3 bicarbonate ions per sodium ion. Sodium bicarbonate cotransport is involved in bicarbonate secretion/absorption and intracellular pH regulation. Romero and Boron (1999) reviewed NBCs. Soleimani and Burnham (2000) reviewed NBCs and their regulation in physiologic and pathophysiologic states.[supplied by OMIM]
32064	This intronless gene belongs to the forkhead family of transcription factors which is characterized by a distinct forkhead domain.  The specific function of this gene has not yet been determined; however, it may play a role in tumor formation.
32065	The protein encoded by this gene belongs to the membrane-associated guanylate kinase (MAGUK) family, a class of proteins that functions as molecular scaffolds for the assembly of multiprotein complexes at specialized regions of the plasma membrane. This protein is also a member of the CARD protein family, which is defined by carrying a characteristic caspase-associated recruitment domain (CARD). This protein shares a similar domain structure with CARD11 protein. The CARD domains of both proteins have been shown to specifically interact with BCL10, a protein known to function as a positive regulator of cell apoptosis and NF-kappaB activation. When expressed in cells, this protein activated NF-kappaB and induced the phosphorylation of BCL10. Two alternatively spliced variants of this gene encoding distinct isoforms have been reported.
32066	
32067	
32068	
32069	
32070	
32071	
32072	
32073	
32074	
32075	
32076	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
32077	HLA-DMB belongs to the HLA class II beta chain paralogues. This class II molecule is a heterodimer consisting of an alpha (DMA) and a beta (DMB) chain, both anchored in the membrane. It is located in intracellular vesicles. DM plays a central role in the peptide loading of MHC class II molecules by helping to release the CLIP (class II-associated invariant chain peptide) molecule from the peptide binding site. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The beta chain is approximately 26-28 kDa and its gene contains 6 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, exon 4 encodes the transmembrane domain and exon 5 encodes the cytoplasmic tail.
32078	
32079	
32080	
32081	
32082	
32083	
32084	This gene is a member of the TAFA family which is composed of five highly homologous genes that encode small secreted proteins. These proteins contain conserved cysteine residues at fixed positions, and are distantly related to MIP-1alpha, a member of the CC-chemokine family. The TAFA proteins are predominantly expressed in specific regions of the brain, and are postulated to function as brain-specific chemokines or neurokines, that act as regulators of immune and nervous cells.
32085	
32086	
32087	This gene encodes an abundant nuclear protein that binds with high affinity to nascent poly(A) tails. The protein is required for progressive and efficient polymerization of poly(A) tails on the 3@#$%& ends of eukaryotic genes and controls the size of the poly(A) tail to about 250 nt. At steady-state, this protein is localized in the nucleus whereas a different poly(A) binding protein is localized in the cytoplasm. An expansion of the  trinucleotide (GCG) repeat from normal 6 to 8-13 at the 5@#$%& end of the coding region of this gene leads to autosomal dominant oculopharyngeal muscular dystrophy (OPMD) disease. Multiple splice variants have been described but their full-length nature is not known. One splice variant includes introns 1 and 6 but no protein is formed.
32088	
32089	
32090	
32091	
32092	
32093	
32094	
32095	
32096	
32097	
32098	
32099	
32100	This gene belongs to a group of vacuolar protein sorting (VPS) genes. The encoded protein is a component of a large multimeric complex, termed the retromer complex, involved in retrograde transport of proteins from endosomes to the trans-Golgi network. The close structural similarity between the yeast and human proteins that make up this complex suggests a similarity in function. Expression studies in yeast and mammalian cells indicate that this protein interacts directly with VPS35, which serves as the core of the retromer complex.
32101	
32102	
32103	
32104	
32105	HLA-DOA belongs to the HLA class II alpha chain paralogues. HLA-DOA forms a heterodimer with HLA-DOB. The heterodimer, HLA-DO, is found in lysosomes in B cells and regulates HLA-DM-mediated peptide loading on MHC class II molecules. In comparison with classical HLA class II molecules, this gene exhibits very little sequence variation, especially at the protein level.
32106	
32107	
32108	
32109	
32110	The protein encoded by this gene is one of the several proteins that become sequentially incorporated into the cornified cell envelope during the terminal differentiation of keratinocyte at the outer layers of epidermis. This protein interacts with periplakin, which is known as a precursor of the cornified cell envelope. The cellular localization pattern and insolubility of this protein suggest that it may play a role in epithelial differentiation and contribute to epidermal integrity and barrier formation. Multiple alternatively spliced transcript variants encoding distinct isoforms have been observed.
32111	
32112	
32113	
32114	
32115	
32116	
32117	
32118	
32119	
32120	Sodium bicarbonate cotransporters (NBCs) mediate the coupled movement of sodium and bicarbonate ions across the plasma membrane of many cells. This is an electrogenic process with an apparent stoichiometry of 3 bicarbonate ions per sodium ion. Sodium bicarbonate cotransport is involved in bicarbonate secretion/absorption and intracellular pH regulation. Romero and Boron (1999) reviewed NBCs. Soleimani and Burnham (2000) reviewed NBCs and their regulation in physiologic and pathophysiologic states.[supplied by OMIM]
32121	
32122	
32123	
32124	
32125	Myosin is a hexameric ATPase cellular motor protein. It is composed of two heavy chains, two nonphosphorylatable alkali light chains, and two phosphorylatable regulatory light chains. This gene encodes a myosin alkali light chain that is expressed in smooth muscle and non-muscle tissues. Genomic sequences representing several pseudogenes have been described and four transcript variants have been identified for this gene.
32126	
32127	
32128	
32129	
32130	
32131	The protein encoded by this gene belongs to the ARL (ADP-ribosylation factor-like) family of proteins, which are structurally related to ADP-ribosylation factors (ARFs). ARFs, described as activators of cholera toxin (CT) ADP-ribosyltransferase activity, regulate intracellular vesicular membrane trafficking, and stimulate a phospholipase D (PLD) isoform. Although, ARL proteins were initially thought not to activate CT or PLD, later work showed that they are weak stimulators of PLD and CT in a phospholipid dependent manner.
32132	
32133	
32134	
32135	
32136	
32137	
32138	
32139	
32140	
32141	
32142	
32143	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem intracellular catalytic domains, and thus represents a receptor-type PTP. The extracellular region contains a meprin-A5 antigen-PTP (MAM) domain, Ig-like and fibronectin type III-like repeats. The protein domain structure and the expression pattern of the mouse counterpart of this PTP suggest its roles in both signal transduction and cellular adhesion in the central nervous system. Two alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported.
32144	
32145	
32146	
32147	
32148	
32149	The protein encoded by this gene is a member of the S100 family of proteins containing 2 EF-hand calcium-binding motifs. S100 proteins are localized in the cytoplasm and/or nucleus of a wide range of cells, and involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation. S100 genes include at least 13 members which are located as a cluster on chromosome 1q21. This protein may function in exocytosis and endocytosis.
32150	
32151	
32152	
32153	
32154	
32155	
32156	
32157	
32158	
32159	
32160	
32161	
32162	This gene encodes a protein with possible ATPase function. The protein contains a P-loop motif and a five-domain structure that is conserved in fly, yeast, and bacteria. Two conserved estrogen receptor binding sites are located within 2.5 kb of this gene.
32163	
32164	
32165	
32166	
32167	
32168	
32169	
32170	
32171	
32172	
32173	
32174	
32175	
32176	
32177	
32178	
32179	
32180	
32181	
32182	
32183	
32184	
32185	
32186	
32187	
32188	
32189	
32190	
32191	
32192	
32193	
32194	
32195	The protein encoded by this gene is a member of the interleukin 1 cytokine family. This protein inhibits the activities of interleukin 1, alpha (IL1A) and interleukin 1, beta (IL1B), and modulates a variety of interleukin 1 related immune and inflammatory responses. This gene and five other closely related cytokine genes form a gene cluster spanning approximately 400 kb on chromosome 2. A polymorphism of this gene is reported to be associated with increased risk of osteoporotic fractures and gastric cancer. Four alternatively spliced transcript variants encoding distinct isoforms have been reported.
32196	The poly(A)-binding protein (PABP), which is found complexed to the 3-prime poly(A) tail of eukaryotic mRNA, is required for poly(A) shortening and translation initiation. In humans, the PABPs comprise a small nuclear isoform and a conserved gene family that displays at least 3 functional proteins: PABP1 (PABPC1), inducible PABP (iPABP, or PABPC4; MIM 603407), and PABP3 (PABPC3; MIM 604680). In addition, there are at least 4 pseudogenes, PABPCP1 to PABPCP4.[supplied by OMIM]
32197	
32198	
32199	
32200	
32201	
32202	
32203	
32204	
32205	
32206	
32207	
32208	
32209	C7 is a component of the complement system.  It participates in the formation of Membrane Attack Complex (MAC).  People with C7 deficiency are prone to bacterial infection.
32210	
32211	
32212	
32213	
32214	
32215	
32216	
32217	
32218	
32219	
32220	
32221	
32222	
32223	
32224	
32225	
32226	
32227	
32228	
32229	
32230	
32231	
32232	
32233	
32234	
32235	
32236	
32237	
32238	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 100% identical to the mouse ortholog. It interacts with CDC25 phosphatases, RAF1 and IRS1 proteins, suggesting its role in diverse biochemical activities related to signal transduction, such as cell division and regulation of insulin sensitivity. It has also been implicated in the pathogenesis of small cell lung cancer.
32239	
32240	
32241	
32242	
32243	
32244	
32245	
32246	
32247	
32248	
32249	
32250	
32251	
32252	The function of this gene is not known. This gene is located in a region commonly deleted in esophageal squamous cell carcinomas.  Gene expression is reduced or absent in these carcinomas and thus this is a candidate tumor suppressor gene for esophageal squamous cell carcinomas.
32253	
32254	
32255	
32256	
32257	
32258	
32259	
32260	
32261	
32262	
32263	
32264	
32265	This intronless gene encodes a protein containing a chromodomain and a histone acetyltransferase catalytic domain. Chromodomain proteins are components of heterochromatin-like complexes and can act as gene repressors. This protein is localized to the nucleus of late spermatids where histone hyperacetylation takes place. Histone hyperacetylation is thought to facilitate the transition in which protamines replace histones as the major DNA-packaging protein. Two nearly identical copies of this gene are found in a palindromic region on chromosome Y; this record represents the telomeric copy. Chromosome Y also contains a pair of closely related genes in another more telomeric palindrome as well as several related pseudogenes.
32266	
32267	
32268	
32269	The protein encoded by this gene is a receptor protein tyrosine kinase whose cellular role has not been determined. It is a type I membrane protein and belongs to the ROR subfamily of cell surface receptors. Studies of a similar protein in mouse suggest that this protein may interact with another receptor protein tyrosine kinase and may be involved in skeletal and cardiac development.
32270	
32271	
32272	Pre-mRNA splicing occurs in 2 sequential transesterification steps. The protein encoded by this gene is a component of both U2- and U12-dependent spliceosomes, and found to be essential for the catalytic step II in pre-mRNA splicing process. It contains several WD repeats, which function in protein-protein interactions. This protein has a sequence similarity to yeast Prp8 protein. This gene is a candidate gene for autosomal dominant retinitis pigmentosa.
32273	This gene is a member of the septin gene family of nucleotide binding proteins, originally described in yeast as cell division cycle regulatory proteins. Septins are highly conserved in yeast, Drosophila, and mouse and appear to regulate cytoskeletal organization. The protein encoded by this gene is thought to be part of a complex involved in cytokinesis. Alternate splicing of this gene results in four transcript variants which encode different isoforms.
32274	The protein encoded by this gene is a member of the NR1 subfamily of nuclear hormone receptors. It can bind as a monomer or as a homodimer to hormone response elements upstream of several genes to enhance the expression of those genes. The specific functions of this protein are not known, but it has been shown to interact with NM23-2, a nucleoside diphosphate kinase involved in organogenesis and differentiation, as well as with NM23-1, the product of a tumor metastasis suppressor candidate gene. Four transcript variants encoding different isoforms have been described for this gene.
32275	
32276	
32277	
32278	
32279	
32280	
32281	
32282	
32283	
32284	
32285	
32286	
32287	
32288	
32289	
32290	
32291	
32292	
32293	Potassium channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. Four sequence-related potassium channel genes - shaker, shaw, shab, and shal - have been identified in Drosophila, and each has been shown to have human homolog(s). This gene encodes a member of the potassium channel, voltage-gated, shaker-related subfamily. This member contains six membrane-spanning domains with a shaker-type repeat in the fourth segment. It belongs to the delayed rectifier class, members of which allow nerve cells to efficiently repolarize following an action potential. The coding region of this gene is intronless, and the gene is clustered with genes KCNA3 and KCNA10 on chromosome 1.
32294	This gene encodes a member of the family of adenylate cyclases, which are membrane-associated enzymes that catalyze the formation of the secondary messenger cyclic adenosine monophosphate (cAMP). This enzyme is insensitive to Ca(2+)/calmodulin, and is stimulated by the G protein beta and gamma subunit complex. It is expressed in brain.
32295	
32296	
32297	
32298	
32299	
32300	
32301	
32302	
32303	
32304	
32305	
32306	
32307	
32308	In Drosophila, the Additional sex combs (Asx) gene encodes a chromatin-binding protein required for normal determination of segment identity in the developing embryo. The protein is a member of the Polycomb group of proteins, which are necessary for the maintenance of stable repression of homeotic and other loci. The protein is thought to disrupt chromatin in localized areas, enhancing transcription of certain genes while repressing the transcription of other genes. Although the function of the protein encoded by this gene is not known, it does show some sequence similarity to the protein encoded by the Drosophila Asx gene.
32309	
32310	
32311	
32312	
32313	
32314	
32315	Prefoldin 1 is a heterohexameric chaperone protein which assists in the correct folding of other proteins. It binds specifically to cytosolic chaperonin and transfers target proteins. Prefoldin may function by selectively targeting nascent actin and tubulin chains pending their transfer to cytosolic chaperonin for final folding and/or assembly. It promotes folding in an environment in which there are many competing pathways for nonnative proteins.
32316	
32317	
32318	
32319	
32320	
32321	
32322	
32323	
32324	
32325	
32326	
32327	
32328	TEX1 is part of the TREX (transcription/export) complex, which includes THO2 (MIM 300395), HPR1 (MIM 606930), ALY (MIM 604171), and UAP56 (MIM 606390).[supplied by OMIM]
32329	This gene encodes a member of the Wiskott-Aldrich syndrome protein family. The gene product is a protein that forms a multiprotein complex that links receptor kinases and actin. Binding to actin occurs through a C-terminal verprolin homology domain in all family members. The multiprotein complex serves to tranduce signals that involve changes in cell shape, motility or function. The published map location (PMID:10381382) has been changed based on recent genomic sequence comparisons, which indicate that the expressed gene is located on chromosome 1, and a pseudogene may be located on chromosome X.
32330	
32331	
32332	
32333	
32334	
32335	
32336	
32337	
32338	
32339	
32340	
32341	
32342	
32343	
32344	
32345	
32346	
32347	
32348	
32349	
32350	
32351	
32352	
32353	
32354	
32355	
32356	
32357	This gene encodes a member of the Golgi-localized, gamma adaptin ear-containing, ARF-binding (GGA) family. This family includes ubiquitous coat proteins that regulate the trafficking of proteins between the trans-Golgi network and the lysosome. These proteins share an amino-terminal VHS domain which mediates sorting of the mannose 6-phosphate receptors at the trans-Golgi network. They also contain a carboxy-terminal region with homology to the ear domain of gamma-adaptins. This family member may play a significant role in cargo molecules regulation and clathrin-coated vesicle assembly. Alternative splicing of this gene results in two transcript variants.
32358	
32359	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
32360	
32361	
32362	
32363	
32364	
32365	
32366	
32367	
32368	
32369	
32370	
32371	
32372	
32373	
32374	
32375	
32376	
32377	
32378	
32379	
32380	
32381	
32382	
32383	SLC7A11 is a member of a heteromeric Na(+)-independent anionic amino acid transport system highly specific for cystine and glutamate. In this system, designated system Xc(-), the anionic form of cystine is transported in exchange for glutamate.[supplied by OMIM]
32384	
32385	
32386	
32387	
32388	
32389	
32390	
32391	
32392	The activity of nonmuscle myosin II (see MYH9; MIM 160775) is regulated by phosphorylation of a regulatory light chain, such as MRLC2. This phosphorylation results in higher MgATPase activity and the assembly of myosin II filaments (Iwasaki et al., 2001).[supplied by OMIM]
32393	
32394	
32395	
32396	
32397	
32398	
32399	
32400	
32401	This gene encodes a member of the DEAD box protein family. DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. The protein encoded by this gene interacts in a hormone-dependent manner with nuclear receptors and represses their transcriptional activity.
32402	FSHD region gene 1 maps 100 kb centromeric of the repeat units on chromosome 4q35 which are deleted in facioscapulohumeral muscular dystrophy. The FRG1 gene is evolutionarily conserved and has related sequences on multiple human chromosomes.  DNA sequence analysis did not reveal any homology to known genes.
32403	
32404	This gene encodes a protein that belongs to the inositol phosphokinase (IPK) family. This protein is likely responsible for the conversion of inositol hexakisphosphate (InsP6) to diphosphoinositol pentakisphosphate (InsP7/PP-InsP5). It may also convert 1,3,4,5,6-pentakisphosphate (InsP5) to PP-InsP4. Alternative splicing occurs for this gene; however, the full-length nature of all transcript variants has not yet been described.
32405	
32406	The proteasome is a multicatalytic proteinase complex with a highly ordered ring-shaped 20S core structure. The core structure is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway. An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes a member of the peptidase T1A family, that is a 20S core alpha subunit. A pseudogene has been identified on the Y chromosome.
32407	
32408	
32409	
32410	
32411	
32412	
32413	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the non-ATPase subunits of the 19S regulator lid. Two alternate transcripts encoding two different isoforms have been described. Pseudogenes have been identified on chromosomes 10 and 21.
32414	
32415	
32416	
32417	
32418	
32419	
32420	
32421	
32422	
32423	
32424	
32425	The BTG1 gene locus has been shown to be involved in a t(8;12)(q24;q22) chromosomal translocation in a case of B-cell chronic lymphocytic leukemia. It is a member of a family of antiproliferative genes. BTG1 expression is maximal in the G0/G1 phases of the cell cycle and downregulated when cells progressed through G1. It negatively regulates cell proliferation.
32426	
32427	
32428	This gene encodes a member of the Kruppel family of C2H2-type zinc-finger transcription factor proteins. The encoded protein acts as a transcriptional activator. Two transcript variants encoding distinct isoforms have been identified for this gene. Other transcript variants have been described, but their full length sequence has not been determined.
32429	
32430	
32431	
32432	
32433	
32434	
32435	
32436	
32437	
32438	
32439	
32440	
32441	
32442	
32443	
32444	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
32445	
32446	
32447	
32448	This gene encodes an isoform of the alpha subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This isoform (alpha 1) is expressed in brain, placenta, lung, liver, kidney, and pancreas, and the other isoform (alpha 2) is expressed in brain, heart and skeletal muscle. This isoform is identified as an autoantigen in 66% of patients with Felty syndrome. This gene has been found to have multiple copies on many chromosomes, some of which, if not all, represent different pseudogenes.
32449	
32450	
32451	AIM2 is a member of the IFI20X /IFI16 family.  It plays a putative role in tumorigenic reversion and may control cell proliferation.  Interferon-gamma induces expression of AIM2.
32452	
32453	INPP1 encodes the enzyme inositol polyphosphate-1-phosphatase, one of the enzymes involved in phosphatidylinositol signaling pathways. This enzyme removes the the phosphate group at position 1 of the inositol ring from the polyphosphates inositol 1,4-bisphosphate and inositol 1,3,4-trisphophosphate.
32454	
32455	
32456	Lactate dehydrogenase A catalyzes the conversion of L-lactate and NAD to pyruvate and NADH in the final step of anaerobic glycolysis.  LDHA is found predominantly in muscle tissue and belongs to the lactate dehydrogenase family.  Mutations in LDHA have been linked to exertional myoglobinuria.
32457	
32458	
32459	
32460	
32461	
32462	This gene encodes a large extracellular member of the immunoglobulin superfamily. A similar protein in C. elegans forms long, fine tracks at specific extracellular sites which are involved in many processes such as stabilization of the germline syncytium, anchorage of mechanosensory neurons to the epidermis, and organization of hemidesmosomes in the epidermis.
32463	
32464	
32465	
32466	This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family. EPH and EPH-related receptors have been implicated in mediating developmental events, particularly in the nervous system. Receptors in the EPH subfamily typically have a single kinase domain and an extracellular region containing a Cys-rich domain and 2 fibronectin type III repeats. The ephrin receptors are divided into 2 groups based on the similarity of their extracellular domain sequences and their affinities for binding ephrin-A and ephrin-B ligands.
32467	
32468	
32469	
32470	
32471	
32472	
32473	
32474	
32475	Isocitrate dehydrogenases catalyze the oxidative decarboxylation of isocitrate to 2-oxoglutarate. These enzymes belong to two distinct subclasses, one of which utilizes NAD(+) as the electron acceptor and the other NADP(+). Five isocitrate dehydrogenases have been reported: three NAD(+)-dependent isocitrate dehydrogenases, which localize to the mitochondrial matrix, and two NADP(+)-dependent isocitrate dehydrogenases, one of which is mitochondrial and the other predominantly cytosolic. Each NADP(+)-dependent isozyme is a homodimer. The protein encoded by this gene is the NADP(+)-dependent isocitrate dehydrogenase found in the cytoplasm and peroxisomes. It contains the PTS-1 peroxisomal targeting signal sequence. The presence of this enzyme in peroxisomes suggests roles in the regeneration of NADPH for intraperoxisomal reductions, such as the conversion of 2, 4-dienoyl-CoAs to 3-enoyl-CoAs, as well as in peroxisomal reactions that consume 2-oxoglutarate, namely the alpha-hydroxylation of phytanic acid. The cytoplasmic enzyme serves a significant role in cytoplasmic NADPH production.
32476	
32477	
32478	
32479	
32480	
32481	The protein encoded by this gene is a signal transducer shared by many cytokines, including interleukin 6 (IL6), ciliary neurotrophic factor (CNTF), leukemia inhibitory factor (LIF), and oncostatin M (OSM). This protein functions as a part of the cytokine receptor complex. The activation of this protein is dependent upon the binding of cytokines to their receptors. vIL6, a protein related to IL6 and encoded by the Kaposi sarcoma-associated herpesvirus, can bypass the interleukin 6 receptor (IL6R) and directly activate this protein. Knockout studies in mice suggested a critical role of the gene encoding this protein in regulating myocyte apoptosis. Alternatively spliced transcript variants encoding distinct isoforms have been described.
32482	
32483	
32484	
32485	The galectins are a family of beta-galactoside-binding proteins implicated in modulating cell-cell and cell-matrix interactions. The protein encoded by this gene is an S-type lectin. This galectin is strongly overexpressed in Hodgkin@#$%&s disease tissue and it might participate in the interaction between the H&RS cells with their surrounding cells and might thus play a role in the pathogenesis of this disease and/or its consistently associated immunodeficiency. The protein has N- and C- terminal carbohydrate-binding domains connected by a link peptide. Two isoforms (long and short) exist.
32486	
32487	
32488	
32489	
32490	
32491	
32492	
32493	This gene encodes an E3 ubiquitin-protein ligase, part of the ubiquitin protein degradation system. This imprinted gene is maternally expressed in brain and biallelically expressed in other tissues. Maternally inherited deletion of this gene causes Angelman Syndrome, characterized by severe motor and intellectual retardation, ataxia, hypotonia, epilepsy, absence of speech, and characteristic facies. The protein also interacts with the E6 protein of human papillomavirus types 16 and 18, resulting in ubiquitination and proteolysis of tumor protein p53. Alternative splicing of this gene results in three transcript variants encoding three isoforms with different N-termini. Additional transcript variants have been described, but their full length nature has not been determined.
32494	
32495	The four human glycoprotein hormones chorionic gonadotropin (CG), luteinizing hormone (LH), follicle stimulating hormone (FSH), and thyroid stimulating hormone (TSH) are dimers consisting of alpha and beta subunits that are associated noncovalently. The alpha subunits of these hormones are identical, however, their beta chains are unique and confer biological specificity. The protein encoded by this gene is the alpha subunit and belongs to the glycoprotein hormones alpha chain family.
32496	
32497	This gene product is a member of the serine-arginine (SR) family of proteins, which is involved in constitutive and regulated RNA splicing. Members of this family are characterized by N-terminal RNP1 and RNP2 motifs, which are required for binding to RNA, and multiple C-terminal SR/RS repeats, which are important in mediating association with other cellular proteins. This protein can influence splice site selection of adenovirus E1A pre-mRNA. It interacts with the oncoprotein TLS, and abrogates the influence of TLS on E1A pre-mRNA splicing. Alternative splicing of this gene results in at least two transcript variants encoding different isoforms. In addition, transcript variants utilizing alternative polyA sites exist.
32498	
32499	
32500	This gene encodes a molecular chaperone that is member of the chaperonin containing TCP1 complex (CCT), also known as the TCP1 ring complex (TRiC). This complex consists of two identical stacked rings, each containing eight different proteins. Unfolded polypeptides enter the central cavity of the complex and are folded in an ATP-dependent manner. The complex folds various proteins, including actin and tubulin. Alternate transcriptional splice variants of this gene, encoding different isoforms, have been characterized.
32501	
32502	
32503	
32504	
32505	
32506	This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. The encoded protein may play a role in chondrogenesis. A pseudogene of this gene is located on chromosome 8. Multiple transcript variants encoding distinct isoforms have been identified for this gene.
32507	
32508	
32509	
32510	NCF1 encodes neutrophil cytosolic facter 1, the 47-kilodalton cytosolic subunit of the multi-protein complex known as NADPH oxidase found in neutrophils. This oxidase produces a burst of superoxide which is delivered to the lumen of the neutrophil phagosome. Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic granulomatous disease.
32511	
32512	
32513	
32514	
32515	
32516	
32517	
32518	HLA-E belongs to the HLA class I heavy chain paralogues. This class I molecule is a heterodimer consisting of a heavy chain and a light chain (beta-2 microglobulin). The heavy chain is anchored in the membrane. HLA-E binds a restricted subset of peptides derived from the leader peptides of other class I molecules. The heavy chain is approximately 45 kDa and its gene contains 8 exons. Exon one encodes the leader peptide, exons 2 and 3 encode the alpha1 and alpha2 domains, which both bind the peptide, exon 4 encodes the alpha3 domain, exon 5 encodes the transmembrane region, and exons 6 and 7 encode the cytoplasmic tail.
32519	
32520	
32521	
32522	
32523	
32524	
32525	
32526	Desmosomes are intercellular junctions that tightly link adjacent cells. Desmoplakin is an obligate component of functional desmosomes that anchors intermediate filaments to desmosomal plaques. The N-terminus of desmoplakin is required for localization to the desmosome and interacts with the N-terminal region of plakophilin 1 and plakoglobin. The C-terminus of desmoplakin binds with intermediate filaments. In the mid-region of desmoplakin, a coiled-coiled rod domain is responsible for homodimerization. Mutations in this gene are the cause of several cardiomyopathies and keratodermas as well as the autoimmune disease paraneoplastic pemphigus.
32527	
32528	
32529	
32530	
32531	
32532	The protein encoded by this gene belongs to the src family kinases. It is a cytoplasmic protein which is preferentially expressed in T-lymphocytes where it interacts with the protein-tyrosine kinase p59fyn. The presence of a PH domain and a SH3 domain suggests that this encoded protein is capable of interacting with several other intracellular proteins.
32533	
32534	
32535	This gene encodes a microsomal beta-glucosidase that catalyzes the hydrolysis of bile acid 3-O-glucosides as endogenous compounds. Studies to determine subcellular localization of this protein in the liver indicated that the enzyme was mainly enriched in the microsomal fraction where it appeared to be confined to the endoplasmic reticulum. This putative transmembrane protein is thought to play a role in carbohydrate transport and metabolism.
32536	
32537	
32538	
32539	
32540	
32541	
32542	The protein encoded by this gene is a member of the rhomboid family of integral membrane proteins. This family contains proteins that are related to Drosophila rhomboid protein. Members of this family are found in both prokaryotes and eukaryotes and are thought to function as intramembrane serine proteases. The encoded protein is thought to release soluble growth factors by proteolytic cleavage of certain membrane-bound substrates, including ephrin B2 and ephrin B3.
32543	
32544	
32545	
32546	
32547	
32548	
32549	
32550	
32551	
32552	
32553	
32554	
32555	The protein encoded by this gene is a cytokine that shares the sequence similarity with IL17. The treatment of endothelial cells with this cytokine has been shown to stimulate the production of other cytokines including IL6, IL8 and CSF2/ GM-CSF. The increased expression of IL8 induced by this cytokine was found to be NF-kappa B-dependent.
32556	
32557	
32558	
32559	
32560	
32561	
32562	
32563	
32564	
32565	
32566	
32567	
32568	The delta (HBD) and beta (HBB) genes are normally expressed in the adult: two alpha chains plus two beta chains constitute HbA, which in normal adult life comprises about 97% of the total hemoglobin. Two alpha chains plus two delta chains constitute HbA-2, which with HbF comprises the remaining 3% of adult hemoglobin. Five beta-like globin genes are found within a 45 kb cluster on chromosome 11 in the following order:      5@#$%&-epsilon--Ggamma--Agamma--delta--beta-3@#$%&. Mutations in the delta-globin gene are associated with beta-thalassemia.
32569	
32570	
32571	
32572	
32573	
32574	This gene encodes a member of the IgLON subfamily in the immunoglobulin protein superfamily. The encoded protein is localized in the plasma membrane and may have an accessory role in opioid receptor function. This gene has an ortholog in rat and bovine. The opioid binding-cell adhesion molecule encoded by the rat gene binds opioid alkaloids in the presence of acidic lipids, exhibits selectivity for mu ligands and acts as a GPI-anchored protein. Since the encoded protein is highly conserved in species during evolution, it may have a fundamental role in mammalian systems. Two transcript variants encoding different isoforms have been found for this gene.
32575	
32576	
32577	
32578	
32579	
32580	
32581	
32582	Tight junctions represent one mode of cell-to-cell adhesion in epithelial or endothelial cell sheets, forming continuous seals around cells and serving as a physical barrier to prevent solutes and water from passing freely through the paracellular space. These junctions are comprised of sets of continuous networking strands in the outwardly facing cytoplasmic leaflet, with complementary grooves in the inwardly facing extracytoplasmic leaflet. The protein encoded by this gene, a member of the claudin family, is an integral membrane protein and a component of tight junction strands. It is found primarily in the kidneys, specifically in the thick ascending limb of Henle, where it acts as either an intercellular pore or ion concentration sensor to regulate the paracellular resorption of magnesium ions. Defects in this gene are a cause of primary hypomagnesemia, which is characterized by massive renal magnesium wasting with hypomagnesemia and hypercalciuria, resulting in nephrocalcinosis and renal failure.
32583	
32584	
32585	
32586	
32587	
32588	Voltage-gated potassium (Kv) channels represent the most complex class of voltage-gated ion channels from both functional and structural standpoints. Their diverse functions include regulating neurotransmitter release, heart rate, insulin secretion, neuronal excitability, epithelial electrolyte transport, smooth muscle contraction, and cell volume. This gene encodes a member of the potassium channel, voltage-gated, subfamily H. This member is a pore-forming (alpha) subunit. There are at least two alternatively spliced transcript variants derived from this gene and encoding distinct isoforms.
32589	
32590	
32591	
32592	
32593	
32594	
32595	
32596	
32597	
32598	
32599	The product of this gene belongs to the serine/threonine protein kinase family and to the Ca(2+)/calmodulin-dependent protein kinase subfamily. Calcium signaling is crucial for several aspects of plasticity at glutamatergic synapses. In mammalian cells, the enzyme is composed of four different chains: alpha, beta, gamma, and delta. The product of this gene is a delta chain. Four alternatively spliced transcript variants that encode three different isoforms have been characterized to date. Distinct isoforms of this chain have different expression patterns.
32600	
32601	
32602	
32603	
32604	
32605	
32606	
32607	
32608	
32609	
32610	
32611	
32612	
32613	
32614	
32615	
32616	
32617	
32618	
32619	
32620	
32621	
32622	
32623	
32624	
32625	
32626	
32627	
32628	The protein encoded by this gene is a member of the keratin gene family. The type II cytokeratins consist of basic or neutral proteins which are arranged in pairs of heterotypic keratin chains coexpressed during differentiation of simple and stratified epithelial tissues. This type II cytokeratin is specifically expressed in the basal layer of the epidermis with family member KRT14. Mutations in these genes have been associated with a complex of diseases termed epidermolysis bullosa simplex. The type II cytokeratins are clustered in a region of chromosome 12q12-q13.
32629	
32630	
32631	
32632	
32633	
32634	
32635	
32636	
32637	
32638	
32639	
32640	
32641	
32642	
32643	Prenyltransferases attach either a farnesyl group or a geranylgeranyl group in thioether linkage to the cysteine residue of protein@#$%&s with a C-terminal CAAX box. CAAX geranylgeranyltransferase and CAAX farnesyltransferase are heterodimers that share the same alpha subunit but have different beta subunits. This gene encodes the alpha subunit of these transferases. Alternative splicing results in multiple transcript variants encoding different isoforms.
32644	
32645	
32646	D-dopachrome tautomerase converts D-dopachrome into 5,6-dihydroxyindole. The DDT gene is related to the migration inhibitory factor (MIF) in terms of sequence, enzyme activity, and gene structure. DDT and MIF are closely linked on chromosome 22.
32647	
32648	
32649	
32650	
32651	
32652	
32653	
32654	
32655	Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling,as well as in disease processes, such as arthritis and metastasis. Most MMP@#$%&s are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. This gene encodes an enzyme which degrades type IV collagen, the major structural component of basement membranes. The enzyme plays a role in endometrial menstrual breakdown, regulation of vascularization and the inflammatory response.
32656	
32657	
32658	This gene encodes a translation elongation factor. The protein is a guanine nucleotide exchange factor involved in the transfer of aminoacylated tRNAs to the ribosome. Alternative splicing results in two transcript variants which differ only in the 5@#$%& UTR.
32659	
32660	
32661	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
32662	
32663	
32664	
32665	
32666	
32667	
32668	
32669	
32670	
32671	
32672	
32673	TC10 is a member of the RAS superfamily of small GTP-binding proteins (see HRAS, MIM 190020) involved in insulin-stimulated glucose uptake.[supplied by OMIM]
32674	TC10 is a member of the RAS superfamily of small GTP-binding proteins (see HRAS, MIM 190020) involved in insulin-stimulated glucose uptake.[supplied by OMIM]
32675	
32676	Apelin is a neuropeptide expressed in the supraoptic and paraventricular nuclei that acts on specific receptors located on vasopressinergic neurons (De Mota et al., 2004).[supplied by OMIM]
32677	
32678	
32679	
32680	
32681	
32682	
32683	
32684	
32685	
32686	
32687	
32688	This gene encodes a molecular chaperone that is member of the chaperonin containing TCP1 complex (CCT), also known as the TCP1 ring complex (TRiC). This complex consists of two identical stacked rings, each containing eight different proteins. Unfolded polypeptides enter the central cavity of the complex and are folded in an ATP-dependent manner. The complex folds various proteins, including actin and tubulin. Alternate transcriptional splice variants of this gene, encoding different isoforms, have been characterized.
32689	
32690	
32691	
32692	
32693	
32694	
32695	
32696	
32697	
32698	
32699	The protein encoded by this gene is an adaptor protein that mediates the association of the heat shock proteins HSP70 and HSP90. This protein has been shown to be involved in the assembly process of glucocorticoid receptor, which requires the assistance of multiple molecular chaperones. The expression of this gene is reported to be downregulated in colorectal carcinoma tissue suggesting that is a candidate tumor suppressor gene.
32700	The protein encoded by this gene belongs to the spermidine/spermine synthases family. This gene encodes an ubiquitous enzyme of polyamine metabolism.
32701	
32702	This gene is a member of the DAZ gene family and is a candidate for the human Y-chromosomal azoospermia factor (AZF). Its expression is restricted to premeiotic germ cells, particularly in spermatogonia. It encodes an RNA-binding protein that is important for spermatogenesis. Four copies of this gene are found on chromosome Y within palindromic duplications; one pair of genes is part of the P2 palindrome and the second pair is part of the P1 palindrome. Each gene contains a 2.4 kb repeat including a 72-bp exon, called the DAZ repeat; the number of DAZ repeats is variable and there are several variations in the sequence of the DAZ repeat. Each copy of the gene also contains a 10.8 kb region that may be amplified; this region includes five exons that encode an RNA recognition motif (RRM) domain. This gene contains two copies of the 10.8 kb repeat. Alternative splicing results in multiple transcript variants encoding different isoforms.
32703	
32704	
32705	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds the immunosuppressants FK506 and rapamycin. It interacts with several intracellular signal transduction proteins including type I TGF-beta receptor. It also interacts with multiple intracellular calcium release channels including the tetrameric skeletal muscle ryanodine receptor. In mouse, deletion of this homologous gene causes congenital heart disorder known as noncompaction of left ventricular myocardium. There is evidence of multiple alternatively spliced transcript variants for this gene, but the full length nature of some variants has not been determined.
32706	
32707	
32708	
32709	
32710	
32711	RAN (ras-related nuclear protein) is a small GTP binding protein belonging to the RAS superfamily that is essential for the translocation of RNA and proteins through the nuclear pore complex. The RAN protein is also involved in control of DNA synthesis and cell cycle progression. Nuclear localization of RAN requires the presence of regulator of chromosome condensation 1 (RCC1). Mutations in RAN disrupt DNA synthesis. Because of its many functions, it is likely that RAN interacts with several other proteins. RAN regulates formation and organization of the microtubule network independently of its role in the nucleus-cytosol exchange of macromolecules. RAN could be a key signaling molecule regulating microtubule polymerization during mitosis. RCC1 generates a high local concentration of RAN-GTP around chromatin which, in turn, induces the local nucleation of microtubules.  RAN is an androgen receptor (AR) coactivator that binds differentially with different lengths of polyglutamine within the androgen receptor. Polyglutamine repeat expansion in the AR is linked to Kennedy@#$%&s disease (X-linked spinal and bulbar muscular atrophy). RAN coactivation of the AR diminishes with polyglutamine expansion within the AR, and this weak coactivation may lead to partial androgen insensitivity during the development of Kennedy@#$%&s disease.
32712	The muscarinic cholinergic receptors belong to a larger family of G protein-coupled receptors. The functional diversity of these receptors is defined by the binding of acetylcholine to these receptors and includes cellular responses such as adenylate cyclase inhibition, phosphoinositide degeneration, and potassium channel mediation. Muscarinic receptors influence many effects of acetylcholine in the central and peripheral nervous system. The muscarinic cholinergic receptor 2 is involved in mediation of bradycardia and a decrease in cardiac contractility. Multiple alternatively spliced transcript variants have been described for this gene.
32713	
32714	
32715	
32716	This gene may play a role in the development and normal function of the ovaries. Mutations of this gene have been linked to premature ovarian failure. Alternative splicing results in two protein isoforms. Furthermore, each splice variant undergoes additional splicing in the 3@#$%& UTR. Overall 4 splice variants have been described.
32717	
32718	
32719	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
32720	
32721	
32722	
32723	
32724	
32725	
32726	
32727	
32728	This gene encodes a lysosomal membrane protein that cleaves the beta-glucosidic linkage of glycosylceramide, an intermediate in glycolipid metabolism. Mutations in this gene cause Gaucher disease, a lysosomal storage disease characterized by an accumulation of glucocerebrosides. A related pseudogene is approximately 12 kb downstream of this gene on chromosome 1. Alternative splicing results in multiple transcript variants encoding the same protein.
32729	
32730	
32731	
32732	
32733	
32734	
32735	
32736	
32737	
32738	
32739	
32740	
32741	
32742	
32743	
32744	
32745	
32746	
32747	
32748	
32749	This gene encodes a member of the RSK (ribosomal S6 kinase) family of serine/threonine kinases. This kinase contains 2 non-identical kinase catalytic domains and phosphorylates various substrates, including members of the mitogen-activated kinase (MAPK) signalling pathway. The activity of this protein has been implicated in controlling cell growth and differentiation. Mutations in this gene have been associated with Coffin-Lowry syndrome (CLS).
32750	
32751	
32752	
32753	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene contains a RNA binding domain and scaffold-associated region (SAR)-specific bipartite DNA-binding domain. This protein is also thought to be involved in the packaging of hnRNA into large ribonucleoprotein complexes. During apoptosis, this protein is cleaved in a caspase-dependent way. Cleavage occurs at the SALD site, resulting in a loss of DNA-binding activity and a concomitant detachment of this protein from nuclear structural sites. But this cleavage does not affect the function of the encoded protein in RNA metabolism. Two alternatively spliced transcript variants have been described for this gene.
32754	
32755	
32756	
32757	
32758	
32759	
32760	
32761	
32762	
32763	
32764	
32765	Vasodilator-stimulated phosphoprotein (VASP) is a member of the Ena-VASP protein family. Ena-VASP family members contain an EHV1 N-terminal domain that binds proteins containing E/DFPPPPXD/E motifs and targets Ena-VASP proteins to focal adhesions. In the mid-region of the protein, family members have a proline-rich domain that binds SH3 and WW domain-containing proteins. Their C-terminal EVH2 domain mediates tetramerization and binds both G and F actin. VASP is associated with filamentous actin formation and likely plays a widespread role in cell adhesion and motility. VASP may also be involved in the intracellular signaling pathways that regulate integrin-extracellular matrix interactions. VASP is regulated by the cyclic nucleotide-dependent kinases PKA and PKG. While several transcript variants may exist for this gene, the full-length natures of only two have been described to date. These two represent the major variants of this gene and encode distinct isoforms.
32766	UNC5B belongs to a family of netrin-1 (MIM 601614) receptors thought to mediate the chemorepulsive effect of netrin-1 on specific axons. For more information on UNC5 proteins, see UNC5C (MIM 603610).[supplied by OMIM]
32767	Molybdenum cofactor biosynthesis is a conserved pathway leading to the biological activation of molybdenum. The protein encoded by this gene is involved in molybdenum cofactor biosynthesis. This gene produces a bicistronic mRNA that has the potential to produce two proteins (MOCS1A and MOCS1B) from adjacent open reading frames. However, only the first open reading frame (MOCS1A) has been found to encode a protein from the bicistronic mRNA. Three splice variants have been found for this gene, two of which are monocistronic and express proteins (MOCS1A-MOCS1B) that result from a fusion between the two open reading frames. This gene is defective in patients with molybdenum cofactor deficiency, type A.
32768	
32769	
32770	
32771	
32772	
32773	The 17-beta-hydroxysteroid dehydrogenase enzyme (EC 1.1.1.62) oxidizes or reduces estrogens and androgens in mammals and regulates the biologic potency of these steroids.[supplied by OMIM]
32774	
32775	
32776	
32777	This gene shares both structural and functional similarities with the dystrophin gene.  It contains an actin-binding N-terminus, a triple coiled-coil repeat central region, and a C-terminus that consists of protein-protein interaction motifs which interact with dystroglycan protein components.  The protein encoded by this gene is located at the neuromuscular synapse and myotendinous junctions, where it participates in post-synaptic membrane maintenance and acetylcholine receptor clustering.  Mouse studies suggest that this gene may serve as a functional substitute for the dystrophin gene and therefore, may serve as a potential therapeutic alternative to muscular dystrophy which caused by mutations in the dystrophin gene.  Alternative splicing of the utrophin gene has been described; however, the full-length nature of these variants has not yet been determined.
32778	
32779	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. The function of this gene has not yet been determined. The expression pattern of the mouse homolog implies a role in nervous system development.
32780	
32781	
32782	
32783	
32784	
32785	
32786	This gene represents a candidate gene for human lipodystrophy, characterized by loss of body fat, fatty liver, hypertriglyceridemia, and insulin resistance. Mouse studies suggest that this gene functions during normal adipose tissue development and may also play a role in human triglyceride metabolism.
32787	
32788	
32789	
32790	
32791	
32792	
32793	
32794	
32795	
32796	
32797	
32798	
32799	
32800	This gene belongs to the sulfotransferase gene family. Sulfotransferases generate sulfated glycosaminoglycan (GAG) moities during chondroitin sulfate biosynthesis. They create considerable structural diversity among chondroitin sulfates by transferring sulfate with remarkable specificity for the underlying oligosaccharide substrate. This gene product mainly transfers sulfate to N-acetylgalactosamine. The regulated expression of each member of this gene family may be an important determinant of sulfated GAGs expression and the associated function of chondroitin sulfates as regulators of many biologic processes. This gene is part of a gene cluster on chromosome Xp11.23.
32801	
32802	
32803	
32804	
32805	Aminoacyl-tRNA synthetases catalyze the aminoacylation of tRNA by their cognate amino acid. Because of their central role in linking amino acids with nucleotide triplets contained in tRNAs, aminoacyl-tRNA synthetases are thought to be among the first proteins that appeared in evolution. In metazoans, 9 aminoacyl-tRNA synthetases specific for glutamine (gln), glutamic acid (glu), and 7 other amino acids are associated within a multienzyme complex. Although present in eukaryotes, glutaminyl-tRNA synthetase (QARS) is absent from many prokaryotes, mitochondria, and chloroplasts, in which Gln-tRNA(Gln) is formed by transamidation of the misacylated Glu-tRNA(Gln). Glutaminyl-tRNA synthetase belongs to the class-I aminoacyl-tRNA synthetase family.
32806	
32807	
32808	
32809	
32810	
32811	
32812	
32813	
32814	tRNA splicing is a fundamental process required for cell growth and division. SEN2 is a subunit of the tRNA splicing endonuclease, which catalyzes the removal of introns, the first step in tRNA splicing (Paushkin et al., 2004).[supplied by OMIM]
32815	
32816	
32817	
32818	
32819	
32820	
32821	
32822	
32823	
32824	
32825	
32826	
32827	
32828	
32829	
32830	
32831	
32832	
32833	
32834	
32835	
32836	
32837	
32838	
32839	
32840	This gene encodes the D5 subtype of the dopamine receptor. The D5 subtype is a G-protein coupled receptor which stimulates adenylyl cyclase. This receptor is expressed in neurons in the limbic regions of the brain. It has a 10-fold higher affinity for dopamine than the D1 subtype. Pseudogenes related to this gene reside on chromosomes 1 and 2.
32841	
32842	
32843	
32844	
32845	
32846	
32847	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
32848	
32849	
32850	
32851	This gene encodes a molecular chaperone that is member of the chaperonin containing TCP1 complex (CCT), also known as the TCP1 ring complex (TRiC). This complex consists of two identical stacked rings, each containing eight different proteins. Unfolded polypeptides enter the central cavity of the complex and are folded in an ATP-dependent manner. The complex folds various proteins, including actin and tubulin. Alternate transcriptional splice variants of this gene, encoding different isoforms, have been characterized.
32852	
32853	
32854	
32855	
32856	
32857	
32858	The product of this gene belongs to the family of highly homologous synovial sarcoma X (SSX) breakpoint proteins. These proteins may function as transcriptional repressors. They are also capable of eliciting spontaneously humoral and cellular immune responses in cancer patients, and are potentially useful targets in cancer vaccine-based immunotherapy. SSX1, SSX2 and SSX4 genes have been involved in the t(X;18) translocation characteristically found in all synovial sarcomas. This translocation results in the fusion of the synovial sarcoma translocation gene on chromosome 18 to one of the SSX genes on chromosome X. The encoded hybrid proteins are probably responsible for transforming activity.
32859	
32860	
32861	
32862	
32863	
32864	
32865	
32866	
32867	
32868	
32869	
32870	
32871	
32872	
32873	
32874	
32875	
32876	
32877	
32878	
32879	
32880	
32881	
32882	
32883	
32884	
32885	
32886	
32887	
32888	
32889	
32890	
32891	
32892	
32893	
32894	
32895	
32896	
32897	
32898	
32899	
32900	
32901	
32902	
32903	
32904	
32905	
32906	
32907	
32908	
32909	
32910	Ubiquitin (MIM 191339), a highly conserved protein involved in the regulation of intracellular protein breakdown, cell cycle regulation, and stress response, is released from degraded proteins by disassembly of the polyubiquitin chains. The disassembly process is mediated by ubiquitin-specific proteases (USPs). Also see USP1 (MIM 603478).[supplied by OMIM]
32911	
32912	
32913	
32914	
32915	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L21E family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
32916	
32917	
32918	This gene encodes the folylpolyglutamate synthetase enzyme. This enzyme has a central role in establishing and maintaining both cytosolic and mitochondrial folylpolyglutamate concentrations and, therefore, is essential for folate homeostasis and the survival of proliferating cells. This enzyme catalyzes the ATP-dependent addition of glutamate moieties to folate and folate derivatives. While several transcript variants may exist for this gene, the full-length natures of only two have been biologically validated to date. These two variants encode distinct isoforms.
32919	
32920	
32921	
32922	
32923	
32924	
32925	
32926	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the ATPase subunits, a member of the triple-A family of ATPases which have a chaperone-like activity. This subunit and a 20S core alpha subunit interact specifically with the hepatitis B virus X protein, a protein critical to viral replication. This subunit also interacts with the adenovirus E1A protein and this interaction alters the activity of the proteasome. Finally, this subunit interacts with ataxin-7, suggesting a role for the proteasome in the development of spinocerebellar ataxia type 7, a progressive neurodegenerative disorder.
32927	
32928	
32929	
32930	
32931	
32932	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S3AE family of ribosomal proteins. It is located in the cytoplasm. Disruption of the gene encoding rat ribosomal protein S3a, also named v-fos transformation effector protein, in v-fos-transformed rat cells results in reversion of the transformed phenotype. Transcript variants utilizing alternative transcription start sites have been described. This gene is co-transcribed with the U73A and U73B small nucleolar RNA genes, which are located in its fourth and third introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
32933	
32934	
32935	
32936	
32937	The product of this gene belongs to the SEC15 family. It is highly similar to the protein encoded by Saccharomyces cerevisiae SEC15 gene. This protein is essential for vesicular traffic from the Golgi apparatus to the cell surface in yeast. It is one of the components of a multiprotein complex required for exocytosis. Alternatively spliced transcript variants encoding different isoforms have been identified.
32938	This gene encodes a member of the Ser/Thr protein kinase family. This kinase is regulated through direct phosphorylation by p38 MAP kinase. In conjunction with p38 MAP kinase, this kinase is known to be involved in many cellular processes including stress and inflammatory responses, nuclear export, gene expression regulation and cell proliferation. Heat shock protein HSP27 was shown to be one of the substrates of this kinase in vivo. Two transcript variants encoding two different isoforms have been found for this gene.
32939	
32940	
32941	
32942	
32943	
32944	
32945	
32946	
32947	
32948	
32949	
32950	
32951	
32952	
32953	
32954	
32955	
32956	NME1 was identified because of its reduced mRNA transcript levels in highly metastatic cells. NME1 encodes the @#$%&A@#$%& isoform of nucleoside diphosphate kinase (NDK). NDK exists as a hexamer composed of the @#$%&A@#$%& (NME1) and @#$%&B@#$%& (encoded by NME2) isoforms. Mutations in NME1 have been identified in aggressive neuroblastomas. Two transcript variants encoding different isoforms have been found for this gene.
32957	
32958	
32959	
32960	
32961	
32962	
32963	
32964	
32965	
32966	
32967	
32968	
32969	
32970	Complex II of the respiratory chain, which is specifically involved in the oxidation of succinate, carries electrons from FADH to CoQ.  The complex is composed of four nuclear-encoded subunits and is localized in the mitochondrial inner membrane.  The subunit D protein is one of two integral membrane proteins anchoring the complex to the matrix side of the membrane.  Mutations in SDHD have been linked to hereditary paraganglioma.
32971	
32972	Glial cell line-derived neurotrophic factor (GDNF) and neurturin (NTN) are two structurally related, potent neurotrophic factors that play key roles in the control of neuron survival and differentiation. The protein encoded by this gene is a member of the GDNF receptor family. It is a glycosylphosphatidylinositol(GPI)-linked cell surface receptor for both GDNF and NTN, and mediates activation of the RET tyrosine kinase receptor. This gene is a candidate gene for Hirschsprung disease. Two alternatively spliced transcipt variants encoding different isoforms have been described for this gene.
32973	
32974	
32975	
32976	
32977	
32978	UDP-glucose (UDPG) acts as the sugar donor in numerous glycosylation reactions, including those involved in the production of glycogen. NUDT14 is a UDPG pyrophosphatase (EC 3.6.1.45) that hydrolyzes UDPG to produce glucose 1-phosphate and UMP (Yagi et al., 2003).[supplied by OMIM]
32979	
32980	
32981	
32982	
32983	
32984	
32985	Coagulation factor II (thrombin) receptor-like 2 (F2RL2) is a member of the large family of 7-transmembrane-region receptors that couple to guanosine-nucleotide-binding proteins. F2RL2 is also a member of the protease-activated receptor family and activated by thrombin. F2RL2 is activated by proteolytic cleavage of its extracellular amino terminus. The new amino terminus functions as a tethered ligand and activates the receptor. F2RL2 is a cofactor for F2RL3 activation by thrombin. It mediates thrombin-triggered phosphoinositide hydrolysis and is expressed in a variety of tissues.
32986	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L23P family of ribosomal proteins. It is located in the cytoplasm. The protein may be one of the target molecules involved in mediating growth inhibition by interferon. In yeast, the corresponding protein binds to a specific site on the 26S rRNA. This gene is co-transcribed with the U42A, U42B, U101A, and U101B small nucleolar RNA genes, which are located in its third, first, second, and fourth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
32987	
32988	
32989	Complex II of the respiratory chain, which is specifically involved in the oxidation of succinate, carries electrons from FADH to CoQ.  The complex is composed of four nuclear-encoded subunits and is localized in the mitochondrial inner membrane.  The subunit D protein is one of two integral membrane proteins anchoring the complex to the matrix side of the membrane.  Mutations in SDHD have been linked to hereditary paraganglioma.
32990	The protein encoded by this gene is similar to ING1, a tumor suppressor protein that can interact with TP53, inhibit cell growth, and induce apoptosis. This protein contains a PHD-finger, which is a common motif in proteins involved in chromatin remodeling. This protein can bind TP53 and EP300/p300, a component of the histone acetyl transferase complex, suggesting its involvement in TP53-dependent regulatory pathway.
32991	
32992	
32993	
32994	
32995	
32996	
32997	
32998	
32999	
33000	
33001	
33002	
33003	
33004	
33005	The protein encoded by this gene is a member of serine/threonine protein kinase family. This kinase contains a dual leucine-zipper motif, and has been shown to form dimers/oligomers through its leucine-zipper motif. This kinase can phosphorylate and activate MAPK8/JNK, MAP2K7/MKK7, which suggests a role in the JNK signaling pathway.
33006	
33007	
33008	
33009	
33010	
33011	
33012	
33013	
33014	
33015	
33016	
33017	
33018	
33019	The protein encoded by this gene is a type III transmembrane adaptor protein that binds to the tyrosine kinase csk protein. It is thought to be involved in the regulation of T cell activation.
33020	
33021	
33022	This gene encodes a voltage-activated potassium channel belonging to the eag family. It shares sequence similarity with the Drosophila ether-a-go-go (eag) gene. Mutations in this gene can cause long QT syndrome type 2 (LQT2). Transcript variants encoding distinct isoforms have been identified.
33023	
33024	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP contains two tandem Src homology-2 domains, which function as phospho-tyrosine binding domains and mediate the interaction of this PTP with its substrates. This PTP is widely expressed in most tissues and plays a regulatory role in various cell signaling events that are important for a diversity of cell functions, such as mitogenic activation, metabolic control, transcription regulation, and cell migration. Mutations in this gene are a cause of Noonan syndrome as well as acute myeloid leukemia.
33025	
33026	
33027	
33028	
33029	
33030	
33031	
33032	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L1P family of ribosomal proteins. It is located in the cytoplasm. The expression of this gene is downregulated in the thymus by cyclosporin-A (CsA), an immunosuppressive drug. Studies in mice have shown that the expression of the ribosomal protein L10a gene is downregulated in neural precursor cells during development. This gene previously was referred to as NEDD6 (neural precursor cell expressed, developmentally downregulated 6), but it has been renamed RPL10A (ribosomal protein 10a). As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33033	
33034	
33035	
33036	
33037	This gene encodes a component of prolyl 4-hydroxylase, a key enzyme in collagen synthesis composed of two identical alpha subunits and two beta subunits. The encoded protein is one of several different types of alpha subunits and provides the major part of the catalytic site of the active enzyme. In collagen and related proteins, prolyl 4-hydroxylase catalyzes the formation of 4-hydroxyproline that is essential to the proper three-dimensional folding of newly synthesized procollagen chains. Alternatively spliced transcript variants encoding different isoforms have been described.
33038	
33039	
33040	
33041	
33042	
33043	
33044	
33045	The protein encoded by this gene is a type II membrane protein that may be present in the Golgi apparatus. The encoded protein, which is a member of glycosyltransferase family 29, may be involved in the synthesis of gangliosides GD1c, GT1a, GQ1b, and GT3 from GD1a, GT1b, GM1b, and GD3, respectively.
33046	
33047	
33048	
33049	
33050	
33051	
33052	
33053	
33054	
33055	
33056	
33057	
33058	
33059	The protein encoded by this gene is a serine/threonine kinase related to the dual specific mitogen-activated protein kinase kinase (MAPKK) family. Evidence suggests that mitotic phosphorylation is required for its catalytic activity. This mitotic kinase may be involved in the activation of lymphoid cells and support testicular functions, with a suggested role in the process of spermatogenesis.
33060	
33061	
33062	
33063	
33064	
33065	
33066	
33067	
33068	This gene encodes a dual specificity protein kinase that belongs to the Ser/Thr protein kinase family. This kinase is a direct activator of MAP kinases in response to various environmental stresses or mitogenic stimuli. It has been shown to activate MAPK8/JNK1, MAPK9/JNK2, and MAPK14/p38, but not MAPK1/ERK2 or MAPK3/ERK3. This kinase is phosphorylated, and thus activated by MAP3K1/MEKK. The knockout studies in mice suggested the roles of this kinase in mediating survival signal in T cell development, as well as in the organogenesis of liver.
33069	
33070	
33071	
33072	
33073	The mitochondrial matrix enzyme 3-oxoacid CoA transferase is homodimeric. It is a key enzyme in the extrahepatic utilization of ketone bodies, catalyzing the reversible transfer of coenzyme A from succinyl-CoA to acetoacetate, a necessary step in ketolytic energy production.  Deficiencies can result in intermittent ketoacidosis.
33074	
33075	
33076	
33077	
33078	
33079	
33080	
33081	
33082	
33083	
33084	
33085	
33086	
33087	
33088	This gene is an ortholog of the mouse jumonji gene, which encodes a nuclear protein essential for mouse embryogenesis, including neural tube formation. Overexpression of mouse jumonji negatively regulates cell proliferation. The jumonji proteins contain a DNA-binding domain, called an AT-rich interaction domain (ARID), and share regions of similarity with human retinoblastoma-binding protein-2 and the human SMCX protein.
33089	
33090	
33091	
33092	
33093	This gene encodes a protein containing four ankyrin repeat domains in its C-terminus. The suggested role for this protein is in tumorigenesis of renal cell carcinoma. Two alternatively spliced transcript variants encoding different isoforms have been identified.
33094	
33095	
33096	
33097	The protein encoded by this gene belongs to the annexin family of calcium-dependent phospholipid binding proteins some of which have been implicated in membrane-related events along exocytotic and endocytotic pathways. Annexin 5 is a phospholipase A2 and protein kinase C inhibitory protein with calcium channel activity and a potential role in cellular signal transduction, inflammation, growth and differentiation. Annexin 5 has also been described as placental anticoagulant protein I, vascular anticoagulant-alpha, endonexin II, lipocortin V, placental protein 4 and anchorin CII. The gene spans 29 kb containing 13 exons, and encodes a single transcript of approximately 1.6 kb and a protein product with a molecular weight of about 35 kDa.
33098	Staufen homolog 2 is a member of the family of double-stranded RNA (dsRNA)-binding proteins involved in the transport and/or localization of mRNAs to different subcellular compartments and/or organelles.  These proteins are characterized by the presence of multiple dsRNA-binding domains which are required to bind RNAs having double-stranded secondary structures.  Staufen homolog 2 shares 48.5% and 59.9% similarity with drosophila and human staufen, respectively.  The exact function of Staufen homolog 2 is not known, but since it contains 3 copies of conserved dsRNA binding domain, it could be involved in double-stranded RNA binding events.
33099	
33100	
33101	This gene encodes a member of the WD repeat protein family. WD repeats are minimally conserved regions of approximately 40 amino acids typically bracketed by gly-his and trp-asp (GH-WD), which may facilitate formation of heterotrimeric or multiprotein complexes. Members of this family are involved in a variety of cellular processes, including cell cycle progression, signal transduction, apoptosis, and gene regulation.
33102	
33103	
33104	
33105	
33106	
33107	
33108	
33109	
33110	
33111	
33112	
33113	
33114	
33115	
33116	
33117	
33118	
33119	
33120	
33121	
33122	
33123	
33124	
33125	
33126	
33127	
33128	
33129	
33130	This gene encodes a subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This subunit functions as guanine nucleotide exchange factor. It is reported that this subunit interacts with HIV-1 Tat, and thus it represses the translation of host-cell, but not HIV-1, mRNAs. Several alternatively spliced transcript variants have been found for this gene, however, the full length nature of only two variants has been determined.
33131	
33132	
33133	
33134	
33135	
33136	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and the kininogens. The type 2 cystatin proteins are a class of cysteine proteinase inhibitors found in a variety of human fluids and secretions, where they appear to provide protective functions. The cystatin locus on chromosome 20 contains the majority of the type 2 cystatin genes and pseudogenes. This gene is located in the cystatin locus and encodes the most abundant extracellular inhibitor of cysteine proteases, which is found in high concentrations in biological fluids and is expressed in virtually all organs of the body. A mutation in this gene has been associated with amyloid angiopathy. Expression of this protein in vascular wall smooth muscle cells is severely reduced in both atherosclerotic and aneurysmal aortic lesions, establishing its role in vascular disease.
33137	
33138	This gene encodes a protein that has four predicted transmembrane domains. The function of this gene has not yet been determined; however, studies in the mouse homolog suggest a role in the transport of small molecules across endosomal and lysosomal membranes.
33139	
33140	
33141	
33142	
33143	
33144	
33145	
33146	
33147	
33148	
33149	
33150	
33151	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and a single intracellular catalytic domain, and thus represents a receptor-type PTP. The catalytic domain of this PTP is most closely related to PTPRN/IA-2beta. This PTP and PTPRN are both found to be major autoantigens associated with insulin-dependent diabetes mellitus. Three alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported.
33152	
33153	
33154	
33155	
33156	
33157	The protein encoded by this gene belongs to the subtilisin-like proprotein convertase family. The members of this family are proprotein convertases that process latent precursor proteins into their biologically active products. This encoded protein is a calcium-dependent serine endoprotease. It is structurally related to its family members, PACE and PACE4. This protein is concentrated in the trans-Golgi network, associated with the membranes, and is not secreted. It can process proalbumin and is thought to be responsible for the activation of HIV envelope glycoproteins gp160 and gp140. This gene has been implicated in the transcriptional regulation of housekeeping genes. Multiple alternatively spliced transcripts are described for this gene but their full length nature is not yet known. Downstream of this gene@#$%&s map location at 11q23-q24, nucleotides that match part of this gene@#$%&s 3@#$%& end are duplicated and inverted. A translocation breakpoint associated with lymphoma occurs between this gene and its inverted counterpart.
33158	
33159	
33160	
33161	
33162	
33163	Complex II of the respiratory chain, which is specifically involved in the oxidation of succinate, carries electrons from FADH to CoQ.  The complex is composed of four nuclear-encoded subunits and is localized in the mitochondrial inner membrane.  The 70 kDa flavoprotein subunit contains a characteristic cysteine triplet.  Mutations in the flavoprotein subunit gene have been linked to a form of mitochondrial respiratory chain deficiency known as Leigh syndrome.
33164	
33165	
33166	
33167	
33168	This protein is a member of the keratin-associated protein (KAP) family. The KAP proteins form a matrix of keratin intermediate filaments which contribute to the structure of hair fibers. KAP family members appear to have unique, family-specific amino- and carboxyl-terminal regions and are subdivided into three multi-gene families according to amino acid composition: the high sulfur, the ultrahigh sulfur, and the high tyrosine/glycine KAPs. This protein is a member of the ultrahigh sulfur KAP family and the gene is localized to a cluster of KAPs at 17q12-q21.
33169	
33170	
33171	
33172	
33173	
33174	
33175	
33176	
33177	
33178	
33179	
33180	
33181	
33182	This gene is found on chromosome 20, approximately 20 kbp downstream of the gene encoding cellular prion protein, to which it is biochemically and structurally similar. The protein encoded by this gene is a membrane glycosylphosphatidylinositol-anchored glycoprotein that is found predominantly in testis. Mutations in this gene may lead to neurological disorders.
33183	
33184	
33185	
33186	
33187	
33188	
33189	
33190	
33191	
33192	
33193	
33194	
33195	
33196	
33197	
33198	
33199	
33200	
33201	
33202	The protein encoded by this gene belongs to the ARL (ADP-ribosylation factor-like) family of proteins, which are structurally related to ADP-ribosylation factors (ARFs). ARFs, described as activators of cholera toxin (CT) ADP-ribosyltransferase activity, regulate intracellular vesicular membrane trafficking, and stimulate a phospholipase D (PLD) isoform. Although, ARL proteins were initially thought not to activate CT or PLD, later work showed that they are weak stimulators of PLD and CT in a phospholipid dependent manner.
33203	
33204	
33205	
33206	
33207	
33208	
33209	
33210	
33211	The protein encoded by this gene contains a catalytic domain of flavin adenine dinucleotide synthetase, which catalyzes the adenylation of flavin mononucleotide (FMN) to form flavin adenine dinucleotide (FAD) coenzyme. Two alternatively spliced transcript variants encoding distinct isoforms have been observed.
33212	
33213	
33214	
33215	
33216	
33217	
33218	
33219	
33220	
33221	This gene encodes a zinc-dependent aminopeptidase that cleaves vasopressin, oxytocin, lys-bradykinin, met-enkephalin, dynorphin A and other peptide hormones. The protein can be secreted in maternal serum, reside in intracellular vesicles with the insulin-responsive glucose transporter GLUT4, or form a type II integral membrane glycoprotein. The protein catalyzes the final step in the conversion of angiotensinogen to angiotensin IV (AT4) and is also a receptor for AT4. Alternative splicing results in multiple transcript variants encoding different isoforms.
33222	
33223	
33224	
33225	
33226	
33227	
33228	
33229	
33230	
33231	
33232	
33233	
33234	
33235	
33236	
33237	
33238	
33239	Alternative splicing has been observed at this gene locus and two transcript variants have been identified. Both variants encode the same protein, which has an undetermined function.
33240	The protein encoded by this gene is a member of the transmembrane 4 superfamily, also known as the tetraspanin family. Most of these members are cell-surface proteins that are characterized by the presence of four hydrophobic domains. The proteins mediate signal transduction events that play a role in the regulation of cell development, activation, growth and motility. The use of alternate polyadenylation sites has been found for this gene.
33241	
33242	
33243	
33244	
33245	
33246	
33247	TC10 is a member of the RAS superfamily of small GTP-binding proteins (see HRAS, MIM 190020) involved in insulin-stimulated glucose uptake.[supplied by OMIM]
33248	
33249	This gene belongs to the TIMP gene family. The proteins encoded by this gene family are natural inhibitors of the matrix metalloproteinases, a group of peptidases involved in degradation of the extracellular matrix.  The protein encoded by this gene may function in a tissue-specific fashion in extracellular matrix homeostasis.
33250	
33251	
33252	
33253	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L13P family of ribosomal proteins. It is located in the cytoplasm. Transcript variants utilizing alternative polyA signals have been observed. This gene is co-transcribed with the small nucleolar RNA genes U32, U33, U34, and U35, which are located in its second, fourth, fifth, and sixth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33254	The protein encoded by this gene contains a RINF finger, a motif known to be involved in protein-protein and protein-DNA interactions. This protein interacts with DNAJA3/hTid-1, which is a DnaJ protein reported to function as a modulator of apoptosis. Overexpression of this gene in Hela cells was shown to confer the resistance to TNF-alpha induced apoptosis, suggesting an anti-apoptotic function of this protein. This protein can be cleaved by caspase-3 during the induction of apoptosis. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
33255	
33256	Retention of resident soluble proteins in the lumen of the endoplasmic reticulum (ER) is achieved in both yeast and animal cells by their continual retrieval from the cis-Golgi, or a pre-Golgi compartment.  Sorting of these proteins is dependent on a C-terminal tetrapeptide signal, usually lys-asp-glu-leu (KDEL) in animal cells, and his-asp-glu-leu (HDEL) in S. cerevisiae.  This process is mediated by a receptor that recognizes, and binds the tetrapeptide-containing protein, and returns it to the ER.  In yeast, the sorting receptor encoded by a single gene, ERD2, is a seven-transmembrane protein.  Unlike yeast, several human homologs of the ERD2 gene, constituting the KDEL receptor gene family, have been described. KDELR2 was the second member of the family to be identified, and it encodes a protein which is 83% identical to the KDELR1 gene product.
33257	
33258	
33259	
33260	
33261	
33262	
33263	
33264	
33265	This gene is a member of the paired box (PAX) family of transcription factors.  Members of this gene family typically contain a paired box domain, an octapeptide, and a paired-type homeodomain.  These genes play critical roles during fetal development and cancer growth.  The specific function of the paired box gene 9 is unknown but it may involve development of stratified squamous epithelia as well as various organs and skeletal elements.
33266	
33267	
33268	This gene is a member of the GAGE family, which is expressed in a variety of tumors and in some fetal and reproductive tissues. The protein encoded by this gene shares a sequence similarity with other GAGE/PAGE proteins. It may also belong to a family of CT (cancer-testis) antigens. Multiple alternatively spliced transcript variants encoding distinct isoforms have been found for this gene, but the biological validity of some variants have not been determined.
33269	
33270	
33271	
33272	
33273	
33274	
33275	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the ATPase subunits, a member of the triple-A family of ATPases which have a chaperone-like activity. Pseudogenes have been identified on chromosomes 8 and 12.
33276	
33277	
33278	
33279	
33280	
33281	
33282	
33283	
33284	
33285	
33286	
33287	This gene is a member of the TAFA family which is composed of five highly homologous genes that encode small secreted proteins. These proteins contain conserved cysteine residues at fixed positions, and are distantly related to MIP-1alpha, a member of the CC-chemokine family. The TAFA proteins are predominantly expressed in specific regions of the brain, and are postulated to function as brain-specific chemokines or neurokines, that act as regulators of immune and nervous cells. Transcript variants with different 5@#$%& UTRs, but encoding the same protein, have been found for this gene.
33288	
33289	
33290	
33291	
33292	This gene encodes an endoplasmic reticulum associated protein that is involved in glycosylphosphatidylinositol (GPI)-anchor biosynthesis. The GPI anchor is a glycolipid found on many blood cells and which serves to anchor proteins to the cell surface. The protein encoded by this gene is a subunit of the GPI N-acetylglucosaminyl (GlcNAc) transferase that transfers GlcNAc to phosphatidylinositol (PI) on the cytoplasmic side of the endoplasmic reticulum.
33293	
33294	
33295	
33296	
33297	The protein encoded by this gene is a member of the immunophilin protein family, which play a role in immunoregulation and basic cellular processes involving protein folding and trafficking. This encoded protein is a cis-trans prolyl isomerase that binds to the immunosuppressants FK506 and rapamycin. It has high structural and functional similarity to FK506-binding protein 1A (FKBP1A), but unlike FKBP1A, this protein does not have immunosuppressant activity when complexed with FK506. It interacts with interferon regulatory factor-4 and plays an important role in immunoregulatory gene expression in B and T lymphocytes. This encoded protein is known to associate with phytanoyl-CoA alpha-hydroxylase. It can also associate with two heat shock proteins (hsp90 and hsp70) and thus may play a role in the intracellular trafficking of hetero-oligomeric forms of the steroid hormone receptors. This protein correlates strongly with adeno-associated virus type 2 vectors (AAV) resulting in a significant increase in AAV-mediated transgene expression in human cell lines. Thus this encoded protein is thought to have important implications for the optimal use of AAV vectors in human gene therapy. This gene has been found to have multiple polyadenylation sites.
33298	
33299	
33300	
33301	
33302	
33303	
33304	UCHL1 is a member of a gene family whose products hydrolyze small C-terminal adducts of ubiquitin to generate the ubiquitin monomer. Expression of UCHL1 is highly specific to neurons and to cells of the diffuse neuroendocrine system and their tumors. It is present in all neurons (Doran et al., 1983).[supplied by OMIM]
33305	This gene encodes a dual serine/threonine and tyrosine protein kinase which is expressed in multiple tissues. Multiple alternatively spliced transcript variants have been found, but the biological validity of some variants has not been determined.
33306	
33307	
33308	
33309	
33310	
33311	
33312	
33313	
33314	
33315	
33316	
33317	The protein encoded by this gene is a family member of enzymes known as ubiquitin carrier proteins, or E2s, which are an essential component of the ubiquitin-protein conjugation system. This protein exhibits E2 enzyme activity in its ability to form a thiol ester linkage with ubiquitin in a ubiquitin activating enzyme (E1)-dependent manner. This protein is the first member of this E2 family to be cloned from an epidermal source.
33318	
33319	
33320	
33321	
33322	
33323	
33324	
33325	
33326	
33327	
33328	
33329	
33330	
33331	
33332	
33333	
33334	Cyclin B2 is a member of the cyclin family, specifically the B-type cyclins. The B-type cyclins, B1 and B2, associate with p34cdc2 and are essential components of the cell cycle regulatory machinery. B1 and B2 differ in their subcellular localization. Cyclin B1 co-localizes with microtubules, whereas cyclin B2 is primarily associated with the Golgi region. Cyclin B2 also binds to transforming growth factor beta RII and thus cyclin B2/cdc2 may play a key role in transforming growth factor beta-mediated cell cycle control.
33335	
33336	
33337	
33338	
33339	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L29P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33340	
33341	The enzyme encoded by this gene catalyzes the activation of Neu5Ac to Cytidine 5-prime-monophosphate N-acetylneuraminic acid (CMP-Neu5Ac), which provides the substrate required for the addition of sialic acid. Sialic acids of cell surface glycoproteins and glycolipids play a pivotal role in the structure and function of animal tissues. The pattern of cell surface sialylation is highly regulated during embryonic development, and changes with stages of differentiation. Studies of a similar murine protein suggest that this protein localizes to the nucleus.
33342	
33343	
33344	
33345	
33346	
33347	
33348	
33349	
33350	
33351	
33352	
33353	
33354	
33355	
33356	
33357	
33358	
33359	This gene encodes a nuclear protein which interacts with the tumor suppressor retinoblastoma 1. The two interacting proteins are thought to act as a transcriptional repressor for promoters which are activated by the E2F1 transcription factor. This protein contains a Kruppel-associated box (KRAB), which is a transcriptional repressor motif.
33360	
33361	
33362	
33363	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 28S subunit protein that belongs to the ribosomal protein S6P family. Pseudogenes corresponding to this gene are found on chromosomes 1p and 12q.
33364	
33365	
33366	
33367	
33368	
33369	
33370	
33371	
33372	
33373	
33374	
33375	
33376	This gene encodes a protein that in mice may function as a maternal factor during the preimplantation stage of development. In mice, this gene may play a role in transcriptional repression, cell division, and maintenance of cell pluripotentiality. In humans, related intronless loci are located on chromosomes 14 and X.
33377	
33378	
33379	The protein encoded by this gene is a member of the signal-regulatory-protein (SIRP) family, and also belongs to the immunoglobulin superfamily. SIRP family members are receptor-type transmembrane glycoproteins known to be involved in the negative regulation of receptor tyrosine kinase-coupled signaling processes. This protein was found to interact with TYROBP/DAP12, a protein bearing immunoreceptor tyrosine-based activation motifs. This protein was also reported to participate in the recruitment of tyrosine kinase SYK.
33380	
33381	
33382	
33383	
33384	
33385	
33386	
33387	
33388	
33389	
33390	
33391	
33392	
33393	
33394	
33395	
33396	
33397	
33398	
33399	
33400	
33401	
33402	
33403	The protein encoded by this gene belongs to the immunoglobulin superfamily. It is a type I membrane protein. The protein may play a role in the adhesive interactions of activated T and NK cells during the late phase of the immune response. It may also function in antigen presentation. Alternative splicing occurs at this locus and two transcript variants encoding distinct isoforms have been identified.
33404	
33405	The protein encoded by this gene has significant homology to NADH oxidoreductases and the apoptosis-inducing factor PDCD8/AIF. Overexpression of this gene has been shown to induce apoptosis. The expression of this gene is found to be induced by tumor suppressor protein p53 in colon caner cells.
33406	
33407	
33408	
33409	
33410	
33411	
33412	
33413	
33414	
33415	
33416	
33417	
33418	
33419	
33420	
33421	
33422	
33423	
33424	
33425	
33426	
33427	
33428	
33429	
33430	
33431	
33432	
33433	
33434	
33435	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S5P family of ribosomal proteins. It is located in the cytoplasm. This gene shares sequence similarity with mouse LLRep3. It is co-transcribed with the small nucleolar RNA gene U64, which is located in its third intron. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33436	
33437	
33438	
33439	
33440	
33441	
33442	This gene encodes a disintegrin and metalloprotease (ADAM) domain 12, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving  cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. This gene has 2 alternative splicing transcript forms: a shorter secreted form and a longer membrane-bound form, that diverge at their 3@#$%& ends. The shorter form is found to stimulate myogenesis.
33443	This gene encodes a member of the chaperonin family. The encoded mitochondrial protein may function as a signaling molecule in the innate immune system. This protein is essential for the folding and assembly of newly imported proteins in the mitochondria. This gene is adjacent to a related family member and the region between the 2 genes functions as a bidirectional promoter. Two pseudogenes, both located on chromosome 8, have been associated with this gene. Two transcript variants encoding the same protein have been identified for this gene. Mutations associated with this gene cause autosomal recessive spastic paraplegia 13.
33444	
33445	
33446	
33447	
33448	
33449	
33450	
33451	
33452	
33453	
33454	
33455	
33456	
33457	
33458	
33459	
33460	
33461	
33462	
33463	
33464	
33465	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L23P family of ribosomal proteins. It is located in the cytoplasm. The protein may be one of the target molecules involved in mediating growth inhibition by interferon. In yeast, the corresponding protein binds to a specific site on the 26S rRNA. This gene is co-transcribed with the U42A, U42B, U101A, and U101B small nucleolar RNA genes, which are located in its third, first, second, and fourth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33466	
33467	
33468	
33469	
33470	
33471	
33472	
33473	
33474	
33475	
33476	
33477	
33478	
33479	
33480	
33481	
33482	This locus shares sequence similarity with heterogeneous nuclear ribonuclear proteins and represents an apparent transcribed pseudogene of the HNRPA3 gene, which maps to chromosome 2.
33483	
33484	
33485	
33486	
33487	
33488	
33489	
33490	
33491	
33492	
33493	
33494	
33495	
33496	
33497	This gene encodes a predicted transmembrane protein containing two extracellular CUB domains followed by a low-density lipoprotein class A (LDLa) domain. It also has an intracellular FXNPXY-like motif, which has been shown in other proteins to be essential for the internalization of clathrin coated pits during endocytosis. Alternatively spliced transcript variants have been observed, but they have not been fully characterized.
33498	
33499	
33500	
33501	
33502	
33503	
33504	
33505	
33506	
33507	
33508	
33509	Chromosomal protein HMG14 and its close analog HMG17 (MIM 163910) bind to the inner side of the nucleosomal DNA, potentially altering the interaction between the DNA and the histone octamer. The 2 proteins may be involved in the process that maintains transcribable genes in a unique chromatin conformation. Their ubiquitous distribution and relative abundance, as well as the high evolutionary conservation of the DNA-binding domain of the HMG14 family of proteins, suggest that they may be involved in an important cellular function.[supplied by OMIM]
33510	
33511	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L37AE family of ribosomal proteins. It is located in the cytoplasm. The protein contains a C4-type zinc finger-like domain. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33512	
33513	
33514	
33515	
33516	
33517	
33518	
33519	
33520	
33521	
33522	The GLP2 receptor (GLP2R) is a G protein-coupled receptor superfamily member closely related to the glucagon receptor ans GLP1 receptor. Glucagon-like peptide-2 (GLP2) is a 33-amino acid proglucagon-derived peptide produced by intestinal enteroendocrine cells. Like glucagon-like peptide-1 (GLP1) and glucagon itself, it is derived from the proglucagon peptide encoded by the GCG gene. GLP2 stimulates intestinal growth and upregulates villus height in the small intestine, concomitant with increased crypt cell proliferation and decreased enterocyte apoptosis. Moreover, GLP2 prevents intestinal hypoplasia resulting from total parenteral nutrition. GLP2R, a G protein-coupled receptor superfamily member is expressed in the gut and closely related to the glucagon receptor (GCGR) and the receptor for GLP1 (GLP1R).
33523	
33524	
33525	
33526	
33527	
33528	
33529	
33530	
33531	
33532	The A-kinase anchor proteins (AKAPs) are a group of structurally diverse proteins, which have the common function of binding to the regulatory subunit of protein kinase A (PKA) and confining the holoenzyme to discrete locations within the cell. This gene encodes a member of the AKAP family. Alternative splicing of this gene results in at least 3 transcript variants encoding different isoforms containing a dbl oncogene homology (DH) domain and a pleckstrin homology (PH) domain. The DH domain is associated with guanine nucleotide exchange activation for the Rho/Rac family of small GTP binding proteins, resulting in the conversion of the inactive GTPase to the active form capable of transducing signals. The PH domain has multiple functions. Therefore, these isoforms function as scaffolding proteins to coordinate a Rho signaling pathway and, in addition, function as protein kinase A-anchoring proteins.
33533	
33534	
33535	
33536	
33537	
33538	
33539	
33540	
33541	
33542	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and a single intracellular catalytic domain, and thus represents a receptor-type PTP. The catalytic domain of this PTP is most closely related to PTPRN/IA-2beta. This PTP and PTPRN are both found to be major autoantigens associated with insulin-dependent diabetes mellitus. Three alternatively spliced transcript variants of this gene, which encode distinct proteins, have been reported.
33543	The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities, and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth and invasion. A mutation in this gene is associated with autosomal dominant cerebral ataxia. Alternatively spliced transcript variants have been found for this gene.
33544	
33545	This gene is the human homolog of mouse jerky gene. The encoded protein has similarity to several nuclear regulatory proteins, including centromere protein B, suggesting that it might function as a DNA-binding protein. Insertional inactivation of this gene in transgenic mice resulted in epileptic seizures. Childhood Absence Epilepsy (CAE) has been mapped to the same chromosomal location (8q24.3) as this gene, making this gene a strong candidate for CAE.
33546	
33547	
33548	
33549	
33550	This gene encodes a protein that is highly similar to the CDC10 protein of Saccharomyces cerevisiae. The protein also shares similarity with Diff 6 of Drosophila and with H5 of mouse. Each of these similar proteins, including the yeast CDC10, contains a GTP-binding motif. The yeast CDC10 protein is a structural component of the 10 nm filament which lies inside the cytoplasmic membrane and is essential for cytokinesis. Although the exact function of this gene has not yet been determined, its high similarity to yeast CDC10 and the high conservative nature of eukaryotic cell cycle machinery suggest a similar role to that of its yeast counterpart. Alternative splicing results in two transcript variants encoding different isoforms.
33551	
33552	The protein encoded by this gene is a member of the CLK family of dual specificity protein kinases. CLK family members have shown to interact with, and phosphorylate, serine- and arginine-rich (SR) proteins of the spliceosomal complex, which is a part of the regulatory mechanism that enables the SR proteins to control RNA splicing. This protein kinase is found to be involved in the regulation of several cellular processes and may serve as a link between cell cycle progression, apoptosis, and telomere length regulation. Alternatively spliced transcript variants encoding distinct isoforms have been reported.
33553	
33554	
33555	
33556	
33557	
33558	
33559	
33560	This gene encodes a member of the natural killer cell receptor C-type lectin family. The encoded protein inhibits osteoclast formation and contains a transmembrane domain near the N-terminus as well as the C-type lectin-like extracellular domain. Several alternatively spliced transcript variants have been identified, but the full-length nature of every transcript has not been defined.
33561	FSHD region gene 1 maps 100 kb centromeric of the repeat units on chromosome 4q35 which are deleted in facioscapulohumeral muscular dystrophy. The FRG1 gene is evolutionarily conserved and has related sequences on multiple human chromosomes.  DNA sequence analysis did not reveal any homology to known genes.
33562	This gene belongs to the A/B subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has two repeats of quasi-RRM domains that bind to RNAs. It is one of the most abundant core proteins of hnRNP complexes and it is localized to the nucleoplasm. This protein, along with other hnRNP proteins, is exported from the nucleus, probably bound to mRNA, and is immediately re-imported. Its M9 domain acts as both a nuclear localization and nuclear export signal. The encoded protein is involved in the packaging of pre-mRNA into hnRNP particles, transport of poly A+ mRNA from the nucleus to the cytoplasm, and may modulate splice site selection. It is also thought have a primary role in the formation of specific myometrial protein species in parturition. Multiple alternatively spliced transcript variants have been found for this gene but only two transcripts are fully described. These variant have multiple alternative transcription initiation sites and multiple polyA sites.
33563	
33564	
33565	
33566	
33567	
33568	
33569	
33570	
33571	The protein encoded by this gene is a member of the NR1 subfamily of nuclear hormone receptors. It is a DNA-binding protein that can bind as a monomer or as a homodimer to hormone response elements upstream of several genes to enhance the expression of those genes. The specific functions of this protein are not known, but it has been shown to interact with NM23-2, a nucleoside diphosphate kinase involved in organogenesis and differentiation.
33572	
33573	
33574	
33575	
33576	
33577	
33578	
33579	This gene encodes disintegrin and metalloprotease (ADAM) domain 9, which is a member of the ADAM protein family. Members of this family are membrane-anchored proteins structurally related to snake venom disintegrins, and have been implicated in a variety of biologic processes involving cell-cell and cell-matrix interactions, including fertilization, muscle development, and neurogenesis. The member encoded by this gene interacts with SH3 domain-containing proteins, binds mitotic arrest deficient 2 beta protein, and is also involved in TPA-induced ectodomain shedding of membrane-anchored heparin-binding EGF-like growth factor. Two alternative splice variants have been identified, encoding distinct isoforms.
33580	ADP-ribosylation factor-like 4 is a member of the ADP-ribosylation factor family of GTP-binding proteins. ARL4 is similar to ARL7 and ARL6 and each has a nuclear localization signal and an unusually high guaninine nucleotide exchange rate. ARL4 is located in both the nuclear and extranuclear cell compartments. Two transcript variants encoding the same protein have been found for this gene.
33581	
33582	
33583	
33584	
33585	
33586	
33587	
33588	
33589	
33590	
33591	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L14E family of ribosomal proteins. It contains a basic region-leucine zipper (bZIP)-like domain. The protein is located in the cytoplasm. This gene contains a trinucleotide (GCT) repeat tract whose length is highly polymorphic; these triplet repeats result in a stretch of alanine residues in the encoded protein. Transcript variants utilizing alternative polyA signals exist. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33592	
33593	
33594	
33595	
33596	
33597	
33598	
33599	
33600	
33601	Protein kinase C (PKC) is a family of serine- and threonine-specific protein kinases that can be activated by calcium and the second messenger diacylglycerol. PKC family members phosphorylate a wide variety of protein targets and are known to be involved in diverse cellular signaling pathways. PKC family members also serve as major receptors for phorbol esters, a class of tumor promoters. Each member of the PKC family has a specific expression profile and is believed to play a distinct role. The protein encoded by this gene is one of the PKC family members. It is a calcium-independent and phospholipid-dependent protein kinase. This kinase is important for T-cell activation. It is required for the activation of the transcription factors NF-kappaB and AP-1, and may link the T cell receptor (TCR) signaling complex to the activation of the transcription factors.
33602	
33603	
33604	
33605	
33606	
33607	
33608	
33609	The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and the kininogens. The type 2 cystatin proteins are a class of cysteine proteinase inhibitors found in a variety of human fluids and secretions, where they appear to provide protective functions. The cystatin locus on chromosome 20 contains the majority of the type 2 cystatin genes and pseudogenes. This gene is located in the cystatin locus and encodes a secreted protein that may play a role in hematopoietic differentiation or inflammation.
33610	
33611	
33612	This gene encodes the heavy subunit of the ferritin protein. Ferritin is the major intracellular iron storage protein in prokaryotes and eukaryotes. It is composed of 24 subunits of the heavy and light ferritin chains. Variation in ferritin subunit composition may affect the rates of iron uptake and release in different tissues. A major function of ferritin is the storage of iron in a soluble and nontoxic state. Defects in ferritin proteins are associated with several neurodegenerative diseases. This gene has multiple pseudogenes. Several alternatively spliced transcript variants have been observed, but their biological validity has not been determined.
33613	
33614	
33615	
33616	
33617	L-glutamate is the major excitatory neurotransmitter in the central nervous system and activates both ionotropic and metabotropic glutamate receptors.  Glutamatergic neurotransmission is involved in most aspects of normal brain function and can be perturbed in many neuropathologic conditions. The metabotropic glutamate receptors are a family of G protein-coupled receptors, that have been divided into 3 groups on the basis of sequence homology, putative signal transduction mechanisms, and pharmacologic properties.  Group I includes GRM1 and GRM5 and these receptors have been shown to activate phospholipase C.  Group II includes GRM2 and GRM3 while Group III includes GRM4, GRM6, GRM7 and GRM8.  Group II and III receptors are linked to the inhibition of the cyclic AMP cascade but differ in their agonist selectivities.  Alternative splice variants of GRM8 have been described but their full-length nature has not been determined.
33618	
33619	
33620	
33621	
33622	
33623	
33624	
33625	
33626	This gene encodes a member of the nucleosome assembly protein (NAP) family. This protein participates in DNA replication and may play a role in modulating chromatin formation and contribute to the regulation of cell proliferation. Alternative splicing of this gene results in several transcript variants; however, not all have been fully described.
33627	
33628	
33629	
33630	
33631	
33632	
33633	
33634	
33635	
33636	
33637	
33638	
33639	
33640	
33641	
33642	DNA (cytosine-5-)-methyltransferase 1 has a role in the establishment and regulation of tissue-specific patterns of methylated cytosine residues. Aberrant methylation patterns are associated with certain human tumors and developmental abnormalities.
33643	
33644	
33645	
33646	
33647	
33648	
33649	
33650	
33651	
33652	
33653	
33654	
33655	
33656	
33657	
33658	
33659	
33660	
33661	
33662	
33663	
33664	
33665	
33666	
33667	
33668	
33669	
33670	
33671	
33672	
33673	Members of the F-box protein family, such as FBXO34, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
33674	
33675	
33676	
33677	
33678	The protein encoded by this gene contains ARF-GAP, RHO-GAP, ankyrin repeat, RAS-associating, and pleckstrin homology domains. This protein lacks the predicted catalytic arginine in the RHO-GAP domain and is therefore unlikely to have RHO-GAP activity. While the encoded protein does contain a sterile alpha motif (SAM) commonly found in some signaling molecules, the function of the protein has not been determined. Two transcript variants encoding different isoforms have been found for this gene.
33679	
33680	This gene encodes a member of the [alpha]/[beta] hydrolase fold family and has isoform specific imprinting. The loss of imprinting of this gene has been linked to certain types of cancer and may be due to promotor switching. The encoded protein may play a role in development. Three transcript variants encoding two distinct isoforms have been identified for this gene. A pseudogene for this locus is located on chromosome 6.
33681	
33682	
33683	
33684	
33685	TC10 is a member of the RAS superfamily of small GTP-binding proteins (see HRAS, MIM 190020) involved in insulin-stimulated glucose uptake.[supplied by OMIM]
33686	
33687	
33688	
33689	This gene belongs to the chemokine-like factor gene superfamily, a novel superfamily that links the chemokine gene family and the transmembrane 4 superfamily. This gene is one of several chemokine-like factor genes located in a cluster on chromosome 16. The protein encoded by this gene is widely expressed in many tissues, but its exact function is unknown. Alternatively spliced transcript variants encoding different isoforms have been identified.
33690	
33691	Poly(ADP-ribose) glycohydrolase (PARG) is the major enzyme responsible for the catabolism of poly(ADP-ribose), a reversible covalent-modifier of chromosomal proteins.  The protein is found in many tissues and may be subject to proteolysis generating smaller, active products.
33692	
33693	
33694	
33695	Histones play a critical role in transcriptional regulation, cell cycle progression, and developmental events. Histone acetylation/deacetylation alters chromosome structure and affects transcription factor access to DNA. The protein encoded by this gene has sequence homology to members of the histone deacetylase family. This gene is orthologous to the Xenopus and mouse MITR genes. The MITR protein lacks the histone deacetylase catalytic domain. It represses MEF2 activity through recruitment of multicomponent corepressor complexes that include CtBP and HDACs. This encoded protein may play a role in hematopoiesis. Multiple alternatively spliced transcripts have been described for this gene but the full-length nature of some of them has not been determined.
33696	
33697	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33698	
33699	
33700	
33701	
33702	
33703	
33704	
33705	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S12P family of ribosomal proteins. It is located in the cytoplasm. The protein shares significant amino acid similarity with S. cerevisiae ribosomal protein S28. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33706	
33707	
33708	This gene encodes a member of the fibronectin leucine rich transmembrane protein (FLRT) family. FLRT family members may function in cell adhesion and/or receptor signalling. Their protein structures resemble small leucine-rich proteoglycans found in the extracellular matrix.
33709	
33710	
33711	
33712	
33713	
33714	
33715	The protein encoded by this gene has similarity to a yeast protein. It seems to be part of a peripheral membrane protein complex localized on cis/medial Golgi cisternae where it may participate in tethering intra-Golgi transport vesicles.
33716	
33717	
33718	
33719	This gene encodes a member of the Dual-specificity tyrosine phosphorylation-regulated kinase (DYRK) family. This member contains a nuclear targeting signal sequence, a protein kinase domain, a leucine zipper motif, and a highly conservative 13-consecutive-histidine repeat. It catalyzes its autophosphorylation on serine/threonine and tyrosine residues. It may play a significant role in a signaling pathway regulating cell proliferation and may be involved in brain development. This gene is a homolog of Drosophila mnb (minibrain) gene and rat Dyrk gene. It is localized in the Down syndrome critical region of chromosome 21, and is considered to be a strong candidate gene for learning defects associated with Down syndrome. Alternative splicing of this gene generates several transcript variants differing from each other either in the 5@#$%& region or in the 3@#$%& part of the coding region. These variants encode at least five different isoforms.
33720	
33721	
33722	
33723	
33724	
33725	This gene encodes a member of the annexin family. Members of this calcium-dependent phospholipid-binding protein family play a role in the regulation of cellular growth and in signal transduction pathways. This protein functions as an autocrine factor which heightens osteoclast formation and bone resorption. This gene has three pseudogenes located on chromosomes 4, 9 and 10, respectively. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
33726	
33727	
33728	
33729	
33730	
33731	MICA encodes the higly polymorphic MHC (HLA) class I chain-related gene A. The protein product is expressed on the cell surface, although unlike canonical class I molecules does not seem to associate with beta-2-microglobulin. It is thought that MICA functions as a stress-induced antigen that is broadly recognized by intestinal epithelial gamma delta T cells.
33732	
33733	
33734	
33735	
33736	
33737	
33738	
33739	
33740	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal phosphoprotein that is a component of the 60S subunit. The protein, which is a functional equivalent of the E. coli L7/L12 ribosomal protein, belongs to the L12P family of ribosomal proteins. It plays an important role in the elongation step of protein synthesis. Unlike most ribosomal proteins, which are basic, the encoded protein is acidic. Its C-terminal end is nearly identical to the C-terminal ends of the ribosomal phosphoproteins P0 and P2. The P1 protein can interact with P0 and P2 to form a pentameric complex consisting of P1 and P2 dimers, and a P0 monomer. The protein is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome. Through sequence analysis, two alternatively spliced transcript variants that encode different proteins were found.
33741	
33742	
33743	
33744	
33745	The encoded protein, which directly binds to pituitary tumor-transforming gene 1 protein (PTTG1), facilitates the nuclear translocation of PTTG1 and potentiates the transcriptional activation of basic fibroblast growth factor by PTTG1. The gene product localizes to both the cytoplasm and nucleus. Its NLS is required for its own nuclear localization, the nuclear localization of PTTG1, and its interaction with PTTG1.
33746	
33747	
33748	
33749	
33750	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S15P family of ribosomal proteins. It is located in the cytoplasm. The protein has been shown to bind to the 5.8S rRNA in rat. The gene product of the E. coli ortholog (ribosomal protein S15) functions at early steps in ribosome assembly. This gene is co-transcribed with two U14 small nucleolar RNA genes, which are located in its third and fifth introns. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33751	
33752	
33753	
33754	
33755	Alpha-chymotrypsin (EC 3.4.21.1) is one of a family of serine proteases secreted into the gastrointestinal tract as the inactive precursor chymotrypsinogen. The zymogen is activated by proteolytic cleavage by trypsin (PRSS1; MIM 276000).[supplied by OMIM]
33756	
33757	
33758	
33759	
33760	
33761	
33762	
33763	
33764	
33765	
33766	
33767	
33768	
33769	
33770	
33771	The protein encoded by this gene is a member of the mammalian members of the EGF family of growth factors. It is synthesized primarily as a transmembrane precursor, which is then processed to mature molecule by proteolytic events. This encoded protein is a ligand for the EGF receptor. Several alternatively spliced transcript variants of this gene have been described, but the full-length nature of some of these variants has not been determined.
33772	
33773	
33774	
33775	
33776	
33777	The enzyme encoded by this gene is responsible for the formation of the blood group I antigen. The i and I antigens are determined by linear and branched poly-N-acetyllactosaminoglycans, respectively. During embryonic development in human erythrocytes, the fetal i antigen is replaced by the adult I antigen as the result of the appearance of a beta-1,6-N-acetylglucosaminyltransferase, the I-branching enzyme. This gene encodes the I-branching enzyme that converts the linear form into the branched form. Defects in this gene have been associated with adult i blood group phenotype. Several transcript variants encoding different isoforms have been described.
33778	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 40S subunit. The protein belongs to the S10E family of ribosomal proteins. It is located in the cytoplasm. Variable expression of this gene in colorectal cancers compared to adjacent normal tissues has been observed, although no correlation between the level of expression and the severity of the disease has been found. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33779	
33780	
33781	
33782	
33783	
33784	
33785	The protein encoded by this gene is a member of the CXC chemokine family. This chemokine is one of the major mediators of the inflammatory response. This chemokine is secreted by several cell types. It functions as a chemoattractant, and is also a potent angiogenic factor. This gene is believed to play a role in the pathogenesis of bronchiolitis, a common respiratory tract disease caused by viral infection. This gene and other ten members of the CXC chemokine gene family form a chemokine gene cluster in a region mapped to chromosome 4q.
33786	
33787	
33788	
33789	
33790	This gene belongs to a highly conserved gene family encoding EPS15 homology (EH) domain-containing proteins. The protein-binding EH domain was first noted in EPS15, a substrate for the epidermal growth factor receptor. The EH domain has been shown to be an important motif in proteins involved in protein-protein interactions and in intracellular sorting. The protein encoded by this gene is thought to play a role in the endocytosis of IGF1 receptors.
33791	
33792	
33793	
33794	
33795	
33796	
33797	
33798	This gene encodes a trypsinogen, which is a member of the trypsin family of serine proteases. This enzyme is secreted by the pancreas and cleaved to its active form in the small intestine. It is active on peptide linkages involving the carboxyl group of lysine or arginine. Mutations in this gene are associated with hereditary pancreatitis. This gene and several other trypsinogen genes are localized to the T cell receptor beta locus on chromosome 7.
33799	
33800	
33801	
33802	
33803	This gene encodes a member of the mammalian PIAS [protein inhibitor of activated STAT-1 (signal transducer and activator of transcription-1)] family. This member contains a putative zinc-binding motif and a highly acidic region. It inhibits STAT1-mediated gene activation and the DNA binding activity, binds to Gu protein/RNA helicase II/DEAD box polypeptide 21, and interacts with androgen receptor (AR). It functions in testis as a nuclear receptor transcriptional coregulator and may have a role in AR initiation and maintenance of spermatogenesis.
33804	
33805	HLA-DRA is one of the HLA class II alpha chain paralogues. This class II molecule is a heterodimer consisting of an alpha and a beta chain, both anchored in the membrane. It plays a central role in the immune system by presenting peptides derived from extracellular proteins. Class II molecules are expressed in antigen presenting cells (APC: B lymphocytes, dendritic cells, macrophages). The alpha chain is approximately 33-35 kDa and its gene contains 5 exons. Exon 1 encodes the leader peptide, exons 2 and 3 encode the two extracellular domains, and exon 4 encodes the transmembrane domain and the cytoplasmic tail. DRA does not have polymorphisms in the peptide binding part and acts as the sole alpha chain for DRB1, DRB3, DRB4 and DRB5.
33806	
33807	
33808	
33809	Members of the F-box protein family, such as FBXO33, are characterized by an approximately 40-amino acid F-box motif. SCF complexes, formed by SKP1 (MIM 601434), cullin (see CUL1; MIM 603134), and F-box proteins, act as protein-ubiquitin ligases. F-box proteins interact with SKP1 through the F box, and they interact with ubiquitination targets through other protein interaction domains (Jin et al., 2004).[supplied by OMIM]
33810	
33811	
33812	
33813	
33814	This gene encodes a potential zinc-binding protein and is located in the region associated with DiGeorge syndrome on chromosome 22. The encoded protein may play a role in the regulation of cellular morphology and may function during craniofacial development.
33815	
33816	
33817	
33818	
33819	
33820	
33821	
33822	
33823	
33824	
33825	
33826	
33827	
33828	
33829	
33830	
33831	
33832	
33833	
33834	
33835	
33836	
33837	
33838	
33839	
33840	
33841	
33842	
33843	
33844	
33845	Synuclein alpha interacting protein contains several protein-protein interaction domains and interacts with alpha synuclein in neurons. Mutations of SNCAIP have been linked to Parkinson disease.
33846	
33847	Cytoplasmic ribosomes, organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L7AE family of ribosomal proteins. It can interact with a subclass of nuclear hormone receptors, including thyroid hormone receptor, and inhibit their ability to transactivate by preventing their binding to their DNA response elements. This gene is included in the surfeit gene cluster, a group of very tightly linked genes that do not share sequence similarity. It is co-transcribed with the U24, U36a, U36b, and U36c small nucleolar RNA genes, which are located in its second, fifth, fourth, and sixth introns, respectively. This gene rearranges with the trk proto-oncogene to form the chimeric oncogene trk-2h, which encodes an oncoprotein consisting of the N terminus of ribosomal protein L7a fused to the receptor tyrosine kinase domain of trk. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33848	
33849	
33850	This gene encodes a protein with multiple PDZ domains. PDZ domains mediate protein-protein interactions, and proteins with multiple PDZ domains often organize multimeric complexes at the plasma membrane. This protein localizes to tight junctions and to the apical membrane of epithelial cells. A similar protein in Drosophila is a scaffolding protein which tethers several members of a multimeric signaling complex in photoreceptors. Alternative splicing results in four transcript variants encoding different isoforms. Additional transcript variants have been described but their full-length sequences have not been determined.
33851	
33852	
33853	
33854	
33855	
33856	
33857	
33858	
33859	
33860	
33861	
33862	
33863	
33864	The protein encoded by this gene is one polypeptide of a small nuclear ribonucleoprotein complex and belongs to the snRNP SMB/SMN family. The protein plays a role in pre-mRNA processing, possibly tissue-specific alternative splicing events. Although individual snRNPs are believed to recognize specific nucleic acid sequences through RNA-RNA base pairing, the specific role of this family member is unknown. The protein arises from a bicistronic transcript that also encodes a protein identified as the SNRPN upstream reading frame (SNURF). Multiple transcription initiation sites have been identified and extensive alternative splicing occurs in the 5@#$%& untranslated region. Additional splice variants have been described but sequences for the complete transcripts have not been determined. The 5@#$%& UTR of this gene has been identified as an imprinting center. Alternative splicing or deletion caused by a translocation event in this paternally-expressed region is responsible for Angelman syndrome or Prader-Willi syndrome due to parental imprint switch failure.
33865	
33866	
33867	
33868	
33869	
33870	
33871	
33872	
33873	
33874	This gene is a member of the tropomyosin family of highly conserved, widely distributed actin-binding proteins involved in the contractile system of striated and smooth muscles and the cytoskeleton of non-muscle cells. Tropomyosin is composed of two alpha-helical chains arranged as a coiled-coil. It is polymerized end to end along the two grooves of actin filaments and provides stability to the filaments. The encoded protein is one type of alpha helical chain that forms the predominant tropomyosin of striated muscle, where it also functions in association with the troponin complex to regulate the calcium-dependent interaction of actin and myosin during muscle contraction. In smooth muscle and non-muscle cells, alternatively spliced transcript variants encoding a range of isoforms have been described. Mutations in this gene are associated with type 3 familial hypertrophic cardiomyopathy.
33875	
33876	The signal sequence receptor (SSR) is a glycosylated endoplasmic reticulum (ER) membrane receptor associated with protein translocation across the ER membrane. The SSR consists of 2 subunits, a 34-kD glycoprotein encoded by this gene and a 22-kD glycoprotein. This gene generates several mRNA species as a result of complex alternative polyadenylation. This gene is unusual in that it utilizes arrays of polyA signal sequences that are exclusively non-canonical.
33877	
33878	The protein encoded by this gene is a member of the TNF receptor associated factor (TRAF) protein family. TRAF proteins associate with, and mediate the signal transduction from, members of the TNF receptor (TNFR) superfamily. This protein participates in the signal transduction of CD40, a TNFR family member important for the activation of the immune response. This protein is found to be a critical component of the lymphotoxin-beta receptor (LTbetaR) signaling complex, which induces NF-kappaB activation and cell death initiated by LTbeta ligation. Epstein-Barr virus encoded latent infection membrane protein-1 (LMP1) can interact with this and several other members of the TRAF family, which may be essential for the oncogenic effects of LMP1. Three alternatively spliced transcript variants encoding two distinct isoforms have been reported.
33879	
33880	The protein encoded by this gene belongs to the ligand-gated ionic channel family. It is an integral membrane protein and plays an important role in inhibiting neurotransmission by binding to the benzodiazepine receptor and opening an integral chloride channel. This gene is clustered with three other family members on chromosome 4.
33881	
33882	
33883	
33884	
33885	
33886	
33887	
33888	
33889	Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes one of the smaller subunits of TFIID that binds to the basal transcription factor GTF2B as well as to several transcriptional activators such as p53 and VP16. A similar but distinct gene (TAF9L) has been found on the X chromosome and a pseudogene has been identified on chromosome 19. Alternative splicing results in multiple transcript variants encoding different isoforms.
33890	
33891	This gene encodes a member of von Willebrand factor A domain containing protein family. This family of proteins are thought to be involved in the formation of filamentous networks in the extracellular matrices of various tissues. Mutations of this gene have been associated with variety of inherited chondrodysplasias.
33892	
33893	
33894	
33895	
33896	
33897	The signal recognition particle (SRP) is a ribonucleoprotein complex that transports secreted and membrane proteins to the endoplasmic reticulum for processing. The complex includes a 7S RNA and six protein subunits. This gene encodes the 68kDa component of the SRP. Alternatively spliced transcript variants have been identified, but their biological validity has not been determined. Three related pseudogenes are located within the Smith-Magenis syndrome region on chromosome 17.
33898	
33899	
33900	
33901	
33902	
33903	
33904	
33905	
33906	
33907	
33908	
33909	
33910	
33911	
33912	
33913	
33914	The modification of proteins with ubiquitin is an important cellular mechanism for targeting abnormal or short-lived proteins for degradation. Ubiquitination involves at least three classes of enzymes: ubiquitin-activating enzymes (E1s), ubiquitin-conjugating enzymes (E2s) and ubiquitin-protein ligases (E3s). This gene encodes a member of the E2 ubiquitin-conjugating enzyme family. This enzyme is demonstrated to participate in the ubiquitination of p53, c-Fos, and the NF-kB precursor p105 in vitro. Two alternatively spliced transcript variants encoding distinct isoforms have been found for this gene.
33915	
33916	
33917	
33918	
33919	This gene encodes a protein that contains three ankyrin domains, a class I PDZ-binding motif and a sterile alpha motif. The encoded protein interacts with harmonin, which is associated with Usher syndrome type 1C. This protein plays a role in the development and maintenance of the auditory and visual systems and functions in the cohesion of hair bundles formed by inner ear sensory cells. Mutations in this gene are associated with Usher syndrome type 1G (USH1G).
33920	
33921	
33922	
33923	
33924	
33925	L-glutamate dehydrogenase (EC 1.4.1.3) has a central role in nitrogen metabolism in plants and animals. Glutamate dehydrogenase is found in all organisms and catalyzes the oxidative deamination of 1-glutamate to 2-oxoglutarate (Smith et al., 2001). Glutamate, the main substrate of GLUD, is present in brain in concentrations higher than in other organs. In nervous tissue, GLUD appears to function in both the synthesis and the catabolism of glutamate and perhaps in ammonia detoxification (Mavrothalassitis et al., 1988).[supplied by OMIM]
33926	
33927	
33928	
33929	
33930	
33931	
33932	
33933	
33934	
33935	
33936	Endocytosis of cell surface proteins requires a dynamic complex of proteins that assemble on the inner surface of the plasma membrane. Stonin-2 is a component of the endocytic machinery that likely regulates vesicle endocytosis.[supplied by OMIM]
33937	
33938	
33939	
33940	
33941	
33942	The protein encoded by this gene is a member of the immunoglobulin superfamily. It is a glycosylphosphatidylinositol (GPI)-anchored neuronal membrane protein that functions as a cell adhesion molecule. It may play a role in the formation of axon connections in the developing nervous system. Two alternatively spliced transcript variants encoding different isoforms have been described for this gene.
33943	
33944	
33945	
33946	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L13P family of ribosomal proteins. It is located in the cytoplasm. Transcript variants utilizing alternative polyA signals have been observed. This gene is co-transcribed with the small nucleolar RNA genes U32, U33, U34, and U35, which are located in its second, fourth, fifth, and sixth introns, respectively. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33947	
33948	
33949	Multiprotein complexes are key determinants of Golgi apparatus structure and its capacity for intracellular transport and glycoprotein modification. Several complexes have been identified, including the Golgi transport complex (GTC), the LDLC complex, which is involved in glycosylation reactions, and the SEC34 complex, which is involved in vesicular transport. These 3 complexes are identical and have been termed the conserved oligomeric Golgi (COG) complex, which includes COG8 (Ungar et al., 2002).[supplied by OMIM]
33950	
33951	
33952	This gene encodes a member of the polycystin protein family. The encoded protein contains 11 transmembrane domains, a latrophilin/CL-1-like GPCR proteolytic site (GPS) domain, and a polycystin-1, lipoxygenase, alpha-toxin (PLAT) domain. This protein may function as a component of cation channel pores.
33953	This gene encodes a member of the Dual-specificity tyrosine phosphorylation-regulated kinase (DYRK) family. This member contains a nuclear targeting signal sequence, a protein kinase domain, a leucine zipper motif, and a highly conservative 13-consecutive-histidine repeat. It catalyzes its autophosphorylation on serine/threonine and tyrosine residues. It may play a significant role in a signaling pathway regulating cell proliferation and may be involved in brain development. This gene is a homolog of Drosophila mnb (minibrain) gene and rat Dyrk gene. It is localized in the Down syndrome critical region of chromosome 21, and is considered to be a strong candidate gene for learning defects associated with Down syndrome. Alternative splicing of this gene generates several transcript variants differing from each other either in the 5@#$%& region or in the 3@#$%& part of the coding region. These variants encode at least five different isoforms.
33954	
33955	
33956	
33957	
33958	
33959	
33960	
33961	
33962	
33963	
33964	
33965	
33966	
33967	
33968	
33969	
33970	
33971	
33972	DEAD box proteins, characterized by the conserved motif Asp-Glu-Ala-Asp (DEAD), are putative RNA helicases. They are implicated in a number of cellular processes involving alteration of RNA secondary structure such as translation initiation, nuclear and mitochondrial splicing, and ribosome and spliceosome assembly. Based on their distribution patterns, some members of this family are believed to be involved in embryogenesis, spermatogenesis, and cellular growth and division. This gene encodes a DEAD box protein, and it may be involved in ribosome assembly. Fusion of this gene and the nucleoporin gene, NUP98, by inversion 11 (p15q22) chromosome translocation is found in the patients with de novo or therapy-related myeloid malignancies.
33973	
33974	EPN1 is an endocytic accessory protein that interacts with EPS15 (MIM 600051), the alpha subunit of the clathrin adaptor AP2 (AP2A1; MIM 601026), and clathrin (see MIM 118960), as well as with other accessory proteins for the endocytosis of clathrin-coated vesicles.[supplied by OMIM]
33975	
33976	
33977	
33978	
33979	
33980	
33981	The protein encoded by this gene binds to the beta-amyloid precursor protein. Beta-amyloid precursor protein is a cell surface protein with signal-transducing properties, and it is thought to play a role in the pathogenesis of Alzheimer@#$%&s disease. In addition, the encoded protein can form a heterodimer with UBE1C and bind and activate NEDD8, a ubiquitin-like protein. This protein is required for cell cycle progression through the S/M checkpoint. Three transcript variants encoding different isoforms have been found for this gene.
33982	
33983	This gene encodes an isoform of the alpha subunit of the elongation factor-1 complex, which is responsible for the enzymatic delivery of aminoacyl tRNAs to the ribosome. This isoform (alpha 1) is expressed in brain, placenta, lung, liver, kidney, and pancreas, and the other isoform (alpha 2) is expressed in brain, heart and skeletal muscle. This isoform is identified as an autoantigen in 66% of patients with Felty syndrome. This gene has been found to have multiple copies on many chromosomes, some of which, if not all, represent different pseudogenes.
33984	
33985	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem intracytoplasmic catalytic domains, and thus represents a receptor-type PTP. The extracellular region of this PTP contains a carbonic anhydrase-like (CAH) domain, which is also found in the extracellular region of PTPRBETA/ZETA. This gene is located in a chromosomal region that is frequently deleted in renal cell carcinoma and lung carcinoma, thus is thought to be a candidate tumor suppressor gene.
33986	
33987	
33988	
33989	
33990	
33991	
33992	
33993	
33994	
33995	
33996	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L24P family of ribosomal proteins. It is located in the cytoplasm. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
33997	
33998	
33999	
34000	
34001	
34002	Ribosomes, the organelles that catalyze protein synthesis, consist of a small 40S subunit and a large 60S subunit. Together these subunits are composed of 4 RNA species and approximately 80 structurally distinct proteins. This gene encodes a ribosomal protein that is a component of the 60S subunit. The protein belongs to the L13E family of ribosomal proteins. It is located in the cytoplasm. This gene is expressed at significantly higher levels in benign breast lesions than in breast carcinomas. Transcript variants derived from alternative splicing and/or alternative polyadenylation exist; these variants encode the same protein. As is typical for genes encoding ribosomal proteins, there are multiple processed pseudogenes of this gene dispersed through the genome.
34003	
34004	
34005	
34006	The protein encoded by this gene catalyzes the deamination of dCMP to dUMP, the nucleotide substrate for thymidylate synthase. The encoded protein is allosterically activated by dCTP and inhibited by dTTP, and is found as a homohexamer. This protein uses zinc as a cofactor for its activity. Two transcript variants encoding different isoforms have been found for this gene.
34007	The protein encoded by this gene is localized exclusively in the cell nucleus. It plays a role in signal transduction, and may function in the maintenance of sarcomeres and in the assembly of myofibrils into sarcomeres. It also plays an important role in stress fiber formation. The gene is involved in therapy-related leukemia by a chromosomal translocation t(3;11)(p21;q23) that involves this gene and the myeloid/lymphoid leukemia gene. Alternative splicing occurs in this locus and two transcript variants encoding distinct isoforms have been identified.
34008	
34009	
34010	
34011	
34012	
34013	
34014	
34015	
34016	
34017	
34018	This gene encodes a trypsinogen, which is a member of the trypsin family of serine proteases. This enzyme is secreted by the pancreas and cleaved to its active form in the small intestine. It is active on peptide linkages involving the carboxyl group of lysine or arginine. Mutations in this gene are associated with hereditary pancreatitis. This gene and several other trypsinogen genes are localized to the T cell receptor beta locus on chromosome 7.
34019	
34020	
34021	
34022	
34023	
34024	
34025	
34026	
34027	This gene encodes a member of the annexin family. Members of this calcium-dependent phospholipid-binding protein family play a role in the regulation of cellular growth and in signal transduction pathways. This protein functions as an autocrine factor which heightens osteoclast formation and bone resorption. This gene has three pseudogenes located on chromosomes 4, 9 and 10, respectively. Multiple alternatively spliced transcript variants encoding different isoforms have been found for this gene.
34028	
34029	
34030	
34031	
34032	
34033	
34034	
34035	
34036	
34037	This gene encodes a member of the WAP-type four-disulfide core (WFDC) domain family. The WFDC domain, or WAP signature motif, contains eight cysteines forming four disulfide bonds at the core of the protein, and functions as a protease inhibitor. Most WFDC gene members are localized to chromosome 20q12-q13 in two clusters: centromeric and telomeric. This gene belongs to the telomeric cluster. Two alternatively spliced transcript variants have been found for this gene, and they encode distinct isoforms.
34038	
34039	
34040	This gene includes 11 exons spanning 25 kb and maps to a region of chromosome 16 that is sometimes deleted in breast and prostrate cancer. The second intron contains an apparently intronless gene, C16orf3, that is transcribed in the opposite orientation. This gene is a putative tumor suppressor gene.
34041	
34042	
34043	
34044	This gene is a member of the cytidine deaminase gene family. It is one of seven related genes or pseudogenes found in a cluster, thought to result from gene duplication, on chromosome 22. Members of the cluster encode proteins that are structurally and functionally related to the C to U RNA-editing cytidine deaminase APOBEC1. It is thought that the proteins may be RNA editing enzymes and have roles in growth or cell cycle control. This gene encodes a protein that lacks the zinc binding activity and may be an expressed pseudogene.
34045	
34046	
34047	
34048	
34049	
34050	
34051	
34052	
34053	Mammalian mitochondrial ribosomal proteins are encoded by nuclear genes and help in protein synthesis within the mitochondrion. Mitochondrial ribosomes (mitoribosomes) consist of a small 28S subunit and a large 39S subunit. They have an estimated 75% protein to rRNA composition compared to prokaryotic ribosomes, where this ratio is reversed. Another difference between mammalian mitoribosomes and prokaryotic ribosomes is that the latter contain a 5S rRNA. Among different species, the proteins comprising the mitoribosome differ greatly in sequence, and sometimes in biochemical properties, which prevents easy recognition by sequence homology. This gene encodes a 39S subunit protein. Pseudogenes corresponding to this gene are found on chromosomes 2p and 17q.
34054	
34055	
34056	
34057	
34058	
34059	
34060	
34061	
34062	
34063	
34064	
34065	
34066	
34067	
34068	
34069	
34070	
34071	
34072	
34073	
34074	
34075	The human NDUFA5 gene codes for the B13 subunit of complex I of the respiratory chain, which transfers electrons from NADH to ubiquinone. The high degree of conservation of NDUFA5 extending to plants and fungi indicates its functional significance in the enzyme complex. The protein localizes to the inner mitochondrial membrane as part of the 7 component-containing, water soluble @#$%!iron-sulfur protein@#$%! (IP) fraction of complex I, although its specific role is unknown. It is assumed to undergo post-translational removal of the initiator methionine and N-acetylation of the next amino acid. The predicted secondary structure is primarily alpha helix, but the carboxy-terminal half of the protein has high potential to adopt a coiled-coil form. The amino-terminal part contains a putative beta sheet rich in hydrophobic amino acids that may serve as mitochondrial import signal. Related pseudogenes have also been identified on four other chromosomes.
34076	
34077	
34078	
34079	
34080	
34081	
34082	
34083	
34084	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This protein contains a C-terminal PTP domain and an N-terminal domain homologous to the band 4.1 superfamily of cytoskeletal-associated proteins. This PTP has been shown to interact with glutamate receptor delta 2 and epsilon subunits, and is thought to play a role in signalling downstream of the glutamate receptors through tyrosine dephosphorylation.
34085	This gene is a member of the cyclophilin family of peptidylprolyl isomerases. The cyclophilins are a highly conserved family, members of which play an important role in protein folding, immunosuppression by cyclosporin A, and infection of HIV-1 virions.
34086	
34087	
34088	
34089	The 26S proteasome is a multicatalytic proteinase complex with a highly ordered structure composed of 2 complexes, a 20S core and a 19S regulator. The 20S core is composed of 4 rings of 28 non-identical subunits; 2 rings are composed of 7 alpha subunits and 2 rings are composed of 7 beta subunits. The 19S regulator is composed of a base, which contains 6 ATPase subunits and 2 non-ATPase subunits, and a lid, which contains up to 10 non-ATPase subunits. Proteasomes are distributed throughout eukaryotic cells at a high concentration and cleave peptides in an ATP/ubiquitin-dependent process in a non-lysosomal pathway.  An essential function of a modified proteasome, the immunoproteasome, is the processing of class I MHC peptides. This gene encodes one of the ATPase subunits, a member of the triple-A family of ATPases which have a chaperone-like activity. This subunit and a 20S core alpha subunit interact specifically with the hepatitis B virus X protein, a protein critical to viral replication. This subunit also interacts with the adenovirus E1A protein and this interaction alters the activity of the proteasome. Finally, this subunit interacts with ataxin-7, suggesting a role for the proteasome in the development of spinocerebellar ataxia type 7, a progressive neurodegenerative disorder.
34090	
34091	
34092	The glypicans comprise a family of glycosylphosphatidylinositol-anchored heparan sulfate proteoglycans. The glypicans have been implicated in the control of cell growth and division. Glypican 6 is a putative cell surface coreceptor for growth factors, extracellular matrix proteins, proteases and anti-proteases.
34093	
34094	
34095	
34096	
34097	
34098	
34099	
34100	
34101	This gene product belongs to the 14-3-3 family of proteins which mediate signal transduction by binding to phosphoserine-containing proteins. This highly conserved protein family is found in both plants and mammals, and this protein is 99% identical to the mouse, rat and sheep orthologs. The encoded protein interacts with IRS1 protein, suggesting a role in regulating insulin sensitivity. Two transcript variants differing in the 5@#$%& UTR, but encoding the same protein, have been identified for this gene.
34102	
34103	
34104	
34105	
34106	
34107	
34108	The protein encoded by this gene is membrane-associated and contains five ankyrin repeats, a protein phosphatase-1-interacting domain, and a carboxy-terminal CAAX box domain. Synthesis of the encoded protein is inhibited by transforming growth factor beta-1. The protein may bind to the membrane through its CAAX box domain and may act as a signaling molecule through interaction with protein phosphatase-1.
34109	
34110	
34111	
34112	
34113	
34114	
34115	
34116	
34117	
34118	
34119	This gene encodes a member of the non-histone chromosomal high mobility group protein family. The proteins of this family are chromatin-associated and ubiquitously distributed in the nucleus of higher eukaryotic cells. In vitro studies have demonstrated that this protein is able to efficiently bend DNA and form DNA circles.  These studies suggest a role in facilitating cooperative interactions between cis-acting proteins by promoting DNA flexibility. This protein was also reported to be involved in the final ligation step in DNA end-joining processes of DNA double-strand breaks repair and V(D)J recombination.
34120	
34121	
34122	The protein encoded by this gene is a member of the dual specificity protein phosphatase subfamily. These phosphatases inactivate their target kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which is associated with cellular proliferation and differentiation. Different members of the family of dual specificity phosphatases show distinct substrate specificities for various MAP kinases, different tissue distribution and subcellular localization, and different modes of inducibility of their expression by extracellular stimuli. This gene product inactivates SAPK/JNK and p38, is expressed predominantly in the adult brain, heart, and skeletal muscle, is localized in the cytoplasm, and is induced by nerve growth factor and insulin. An intronless pseudogene for DUSP8 is present on chromosome 10q11.2.
34123	The protein encoded by this gene belongs to the complex I 9kDa subunit family. Mammalian complex I of mitochondrial respiratory chain is composed of 45 different subunits. This protein has NADH dehydrogenase activity and oxidoreductase activity. It transfers electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone.
34124	
34125	
34126	
34127	
34128	Cytochrome c oxidase (COX) is the terminal enzyme of the mitochondrial respiratory chain. It is a multi-subunit enzyme complex that couples the transfer of electrons from cytochrome c to molecular oxygen and contributes to a proton electrochemical gradient across the inner mitochondrial membrane. The complex consists of 13 mitochondrial- and nuclear-encoded subunits. The mitochondrially-encoded subunits perform the electron transfer of proton pumping activities. The functions of the nuclear-encoded subunits are unknown but they may play a role in the regulation and assembly of the complex. This gene encodes the nuclear-encoded subunit Va of the human mitochondrial respiratory chain enzyme. A pseudogene COX5AP1 has been found in chromosome 14q22.
34129	
34130	
34131	
34132	
34133	
34134	
34135	
34136	This gene encodes a mitochondrial DNA-directed RNA polymerase. The gene product is responsible for mitochondrial gene expression as well as for providing RNA primers for initiation of replication of the mitochondrial genome. Although this polypeptide has the same function as the three nuclear DNA-directed RNA polymerases, it is more closely related to RNA polymerases of phage and mitochondrial polymerases of lower eukaryotes.
34137	
34138	
34139	
34140	
34141	
34142	
34143	
34144	
34145	This gene encodes the beta subunit of the karyopherin receptor complex which interacts with nuclear localization signals to target nuclear proteins to the nucleus. The karyopherin receptor complex is a heterodimer of an alpha subunit which recognizes the nuclear localization signal and a beta subunit which docks the complex at nucleoporins. Alternate splicing of this gene results in two transcript variants encoding the same protein.
34146	The protein encoded by this gene is a member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a variety of cellular processes including cell growth, differentiation, mitotic cycle, and oncogenic transformation. This PTP possesses an extracellular region, a single transmembrane region, and two tandem catalytic domains, and thus represents a receptor-type PTP. The extracellular region contains a meprin-A5 antigen-PTP mu (MAM) domain, an Ig-like domain and four fibronectin type III-like repeats. This PTP was shown to mediate homophilic intercellular interaction, possibly through the interaction with beta- and gamma-catenin at adherens junctions. Expression of this gene was found to be stimulated by TGF-beta 1, which may be important for the inhibition of keratinocyte proliferation.
34147	
34148	
34149	
34150	
34151	
34152	
34153	
34154	This gene product belongs to the zinc finger protein superfamily, members of which are regulatory proteins characterized by nucleic acid-binding zinc finger domains. The encoded protein is a member of the Kruppel family of zinc finger proteins, and contains Kruppel-associated box (KRAB) A and B domains and 15 tandemly arrayed C2H2-type zinc fingers. It is an ortholog of the mouse Zfp93 protein. This gene is located in a cluster of zinc finger genes on 19q13.2.
34155	
34156	
34157	WNT proteins are extracellular signaling molecules involved in the control of embryonic development. This gene encodes a secreted protein, which binds WNT proteins and inhibits their activities. This protein contains a WNT inhibitory factor (WIF) domain and 5 epidermal growth factor (EGF)-like domains. It may be involved in mesoderm segmentation. This protein is found to be present in fish, amphibia and mammals.
34158	
34159	
34160	
34161	
34162	
34163	This gene belongs to the subfamily of ubiquitously expressed heterogeneous nuclear ribonucleoproteins (hnRNPs). The hnRNPs are RNA binding proteins and they complex with heterogeneous nuclear RNA (hnRNA). These proteins are associated with pre-mRNAs in the nucleus and appear to influence pre-mRNA processing and other aspects of mRNA metabolism and transport. While all of the hnRNPs are present in the nucleus, some seem to shuttle between the nucleus and the cytoplasm. The hnRNP proteins have distinct nucleic acid binding properties. The protein encoded by this gene has three repeats of quasi-RRM domains that bind to RNAs. This protein also constitutes a monomer of the N-acetylglucosamine-specific receptor which is postulated to trigger selective recycling of immature GlcNAc-bearing thyroglobulin molecules. Multiple alternatively spliced transcript variants are known for this gene but only two transcripts has been isolated.
34164	HSF2, as well as the related gene HSF1, encodes a protein that binds specifically to the heat-shock element and has homology to HSFs of other species. Heat shock transcription factors activate heat-shock response genes under conditions of heat or other stresses. Although the names HSF1 and HSF2 were chosen for historical reasons, these peptides should be referred to as heat-shock transcription factors.
34165	
34166	
34167	
34168	
34169	
34170	
34171	
34172	
34173	
34174	
34175	
34176	
