C16orf82 is a protein that, in humans, is encoded by the C16orf82 gene.[3]C16orf82 encodes a 2285 nucleotide mRNA transcript which is translated into a 154 amino acid protein using a non-AUG (CUG) start codon. The gene has been shown to be largely expressed in the testis, tibial nerve, and the pituitary gland, although expression has been seen throughout a majority of tissue types.[4][5][6] The function of C16orf82 is not fully understood by the scientific community.[7]
The gene encodes for a 2285 nucleotide mRNA transcript that is intronless. Human intronless genes represent a unique subset of the genome that are often involved in signaling, sperm formation, immune responses, or development.[8]C16orf82 being such a gene indicates it may play a role in one of these processes. TranslationofC16orf82 initiates at a non-AUG (CUG) start codon. The presence of the non-canonical start codon suggests possible increased regulation of C16orf82 translation and/or possibly could allow for the translation of protein products that start with leucine instead of methionine as seen in proteins coded for by some genes present in the major histocompatibility complex.[9][10]
The C16orf82 promoter region has been predicted to contain a number of transcription factor binding sites including binding sites for transcription factors within the SOX family.[11] The presence of the SOX family transcription binding sites suggests that C16orf82 may play a role in sex determination.[12] Actual transcription factor functional studies show binding of the C16ORF82 promoter by ARNT, ELF5, SMAD4, and STAT3.[13]
C16orf82 expression in humans has been observed in major organ systems including the heart, liver, brain, and kidney at a constant level.[14] The tissue in which C16orf82 has been seen to be most highly expressed has been the testis, both by microarray experiments as well as RNA-seq.[4][5] C16orf82 expression is also highly variable between individuals, with some expressing the gene in large amounts while others barely express the gene within the same tissue type.[6][15]Micro RNA (miR-483) over expression has been shown to knock down C16orf82 expression.[16]
The C16orf82 protein is 154 amino acids in length with an approximate molecular weight of 16.46 kDa with a predicted isoelectric point of 6.06.[17] There are no known variants or isoforms of C16orf82.
C16orf82 contains one domain, DUF4694, which currently has a function that is uncharacterized. The domain spans from amino acid 8 to amino acid 153.[18] DUF4694 contains a SSGY (serine-serine-glycine-tyrosine) sequence motif that is found in a majority of the protein's orthologs.[19][20] There is no presence of a transmembrane domain thus the protein is not a transmembrane protein.[21]
Cellular localization
[edit]A conceptual diagram of C16orf82's structure. The flags represent sites of predicted phosphorylation and O-linked glycosylation. Grey flags represent phosphorylation sites and red flags represent sites of overlap between phosphorylation and O-linked glycosylation.[3][22]
The localization of C16orf82 within a cell has been predicted to be nuclear.[21] A bipartite nuclear localization signal can be found starting at Arg107.
One of the predicted 3D models of the human C16orf82 protein.[23][24][25]
The human C16orf82 protein has been predicted to be phosphorylated at a number of serine residues.[26] O-linked glycosylation has also been predicted to happen at a number of sites, including some that overlap with the aforementioned phosphorylation sites.[27] The sites of overlap between the two types of post-translational modifications could play important regulatory roles in the activity and lifespan of the human C16orf82 protein.[28]
C16orf82 has over 100 predicted orthologs, which all reside in the class mammalia and more precisely the subclass eutheria.[33][20] All of the orthologs contained the domain DUF4964.[33] The most distant ortholog detected was within the nine-banded armadillo (Dasypus novemcinctus) within the order Cingluata. Below is a table of 20 orthologs from various orders within the subclass eutheria with the sequence identity and time since divergence in relation to humans.
[edit]Graph of the adjusted protein sequence divergence of C16orf82, cytochrome C, and fibrinogen using species which contain C16orf82 orthologs.[20][36]
C16orf82's rate of evolution was determined to be relatively fast even in comparison to fibrinogen, a gene that has been shown to evolve quickly.[36]
C16orf82 has been associated with Schizophrenia through a genome-wide association study and autism based on copy number variation analysis.[37][38] Currently, research has not shown if C16orf82 plays any direct role in either of these disorders.
^Grzybowska EA (July 2012). "Human intronless genes: functional groups, associated diseases, evolution, and mRNA processing in absence of splicing". Biochemical and Biophysical Research Communications. 424 (1): 1–6. doi:10.1016/j.bbrc.2012.06.092. PMID22732409.
^Barrionuevo F, Scherer G (March 2010). "SOX E genes: SOX9 and SOX8 in mammalian testis development". The International Journal of Biochemistry & Cell Biology. 42 (3): 433–6. doi:10.1016/j.biocel.2009.07.015. PMID19647095.
^ abNakai K, Horton P (January 1999). "PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization". Trends in Biochemical Sciences. 24 (1): 34–6. doi:10.1016/S0968-0004(98)01336-X. PMID10087920.
^Blom N, Gammeltoft S, Brunak S (December 1999). "Sequence and structure-based prediction of eukaryotic protein phosphorylation sites". Journal of Molecular Biology. 294 (5): 1351–62. doi:10.1006/jmbi.1999.3310. PMID10600390.
^Zhou Y, Kloczkowski A, Faraggi E, Yang Y (2016-10-28). Prediction of protein secondary structure. New York, NY. ISBN9781493964048. OCLC961911230.{{cite book}}: CS1 maint: location missing publisher (link)