C6orf58

Protein-coding gene in the species Homo sapiens From Wikipedia, the free encyclopedia

C6orf58

C6orf58 is a human gene located at locus 6q22.33 of chromosome 6 and encodes for UPF0762, a protein which is subsequently secreted after cleavage of a signal peptide.[5] DUF781, which is the singular identifiable domain in UPF0762, is tied to liver development in an orthologous protein in zebrafish.[6] The function of the human UPF0762 is not yet well characterized.[7]

Quick Facts Identifiers, Aliases ...
C6orf58
Identifiers
AliasesC6orf58, LEG1, chromosome 6 open reading frame 58
External IDsMGI: 1914969; HomoloGene: 134042; GeneCards: C6orf58; OMA:C6orf58 - orthologs
Orthologs
SpeciesHumanMouse
Entrez
Ensembl
UniProt
RefSeq (mRNA)

NM_001010905

NM_026336

RefSeq (protein)

NP_001010905

NP_080612

Location (UCSC)Chr 6: 127.52 – 127.59 MbChr 10: 28.85 – 28.86 Mb
PubMed search[3][4]
Wikidata
View/Edit HumanView/Edit Mouse
Close

Gene and mRNA

Summarize
Perspective
More information Genomic DNA Length (base pairs), Exons ...
Genomic DNA Length (base pairs)ExonsMature mRNA Length (base pairs)Splice variantsSignal peptide CDS (base pair)Mature Peptide CDS (base pair)5'-UTR (base pair)3'-UTR (base pair)
14644[5]6[5]1200[5]3[7]13-72[5]73-1002[5]1-12[5]1003-1200[5]
Close

Expression

While there are 3 splice variants of C6orf58, only one encodes a good protein.[7] In humans, C6orf58 expressed sequence tags were primarily detected in the larynx and trachea.[8] Transcripts were only detected during the adult stage of development.[8] Experimental microarray data, however, reveals additional regions of C6orf58 expression, namely in the salivary gland, thyroid, and small intestine.[9] Arsenic may also regulate expression as it increases methylation of the C6orf58 promoter.[10]

Tissue expression profile of C6orf58
A microarray experiment of various tissues shows C6orf58 expression to be limited.

Gene Neighborhood

Genes within 500 Kilobases of C6orf58 include RSPO3, C6orf174, KIAA0408, RPL17P23, ECHDC1, RPL5P18, YWHAZP4, LOC100420743, LOC100421513, MRPS17P5, and THEMIS.

Homology

A selected set of homologous sequences are listed below, with sequence identity being calculated in comparison to the human reference sequence.

More information Species, Common Name ...
SpeciesCommon NameAccession NumberSequence Length (base pairs)Sequence Identity
Nomascus leucogenysNorthern white-cheeked gibbonXM_003255689.11190.97
Macaca mulattaRhesus monkeyNM_001194318.11190.95
Oryctolagus cuniculusEuropean rabbitXM_002714721.11014.79
Loxodonta africanaAfrican bush elephantXM_003404026.11020.78
Cavia porcellusGuinea pigXM_003468475.11017.76
Equus caballusHorseXM_001917090.1990.77
Close

Protein

Summarize
Perspective

Properties

More information Amino acid length (amino acids), Signal Peptide Length (amino acids) ...
Amino acid length (amino acids)Signal Peptide Length (amino acids)Molecular Weight of Precursor ProteinMolecular Weight of Signal Peptide (Predicted)Molecular Weight of Mature Peptide(predicted)Molecular Weight(observed)Isoelectric Point (Predicted)N-linked glycosylation Site
330[5]20[5]37.9 kDa[11]2.1 kDa[11]35.8 kDa[11]32 kDa[12]5.78[11]Amino acid 69
Close

Mass spectrometry has shown that the observed molecular weight of UPF0762 is 32kDa.[12] It remains unclear why the observed molecular weight is less than predicted, even after accounting for cleavage of the signal peptide. Attachment of a sugar at the site of N-linked glycosylation would also increase the molecular weight.

Homology

UPF0762 shows high homology in primates and orthologous proteins can be traced back as far as trichoplax adhaerens. The list of proteins below is not a comprehensive listing of UPF0762 orthologs. Sequence identity and similarity were determined using BLAST[13] with the reference human sequence as the query.

More information Species, Common Name ...
SpeciesCommon NameAccession NumberSequence Length (amino acids)Sequence Identity (%)Sequence Similarity (%)
Pan troglodytesChimpanzeeXP_518733.233011
Pongo abeliiSumatran orangutanXP_002817388.1330.98.99
Callithrix jacchusMarmosetXP_002746989.1330.87.93
Canis lupusGray wolfXP_851589.1310.7.82
Taeniopygia guttataZebra finchXP_002190886.1364.43.63
Gallus gallusRed junglefowlXP_419749.3371.42.6
Xenopus tropicalisWestern clawed frogXP_002940437.11780.290.51
Trichoplax adhaerensN/AXP_002111384381.34.49
Close

Conserved domains

DUF781 is the singular domain of the protein and spans 318 of the protein's 330 amino acids. DUF781 has been linked to liver development in zebrafish.[6]

Post-translational modifications

Observed post-translational modifications include N-linked glycosylation at amino acid 69.[14] A signal peptide, which is predicted to direct the protein to the endoplasmic reticulum for secretion,[15] is cleaved from the first 20 amino acids of the peptide sequence.[5] The missense mutation S18F detected in hepatocellular carcinoma[16] significantly reduces the predicted cleavage score of the signal peptide.[17]

Thumb
A graphical representation of UPF0762 showing various post-translational modifications.

Interactions

Human C6orf58 has been reported to interact with the enzyme ribonucleotide reductase as encoded by the vaccinia virus through a yeast two-hybrid screen.[18]

Pathology

Statistical analysis has shown C6orf58 to be associated with pancreatic cancer survival time.[19] In addition, a missense mutation at amino acid 18 has been observed in liver cancer cells where serine becomes phenylalanine.[16] Analysis of the mutated protein sequence for a signal peptide shows cleavability at the regular amino acid 20 is lost.[17] DUF781's association with liver development and the missense mutation's association with liver cancer is a correlation that remains to be investigated.

Thumb
A SignalP analysis of the reference sequence and a sequence with the mutation S18F resulted in a significant drop in cleavage of the signal peptide.

References

Loading related searches...

Wikiwand - on

Seamless Wikipedia browsing. On steroids.