image source:

The MissingProteinPedia is a protein data and information sharing web system that aims to collate any relevant data ‘Missing proteins’ as defined by neXtProt. At its core is a schema-less database-driven web system allowing captures of all PE2-4 protein PubMed data, based on gene and protein including synonyms. The database also allows unpublished, preliminary or proprietary data (e.g., antibody, MS, cell biological and genetic studies) to be shared with collaborators via a protected interface.

MissingProteinPedia facilitates the Human Proteome Project (HPP) cross-disciplinary collaboration by providing a complimentary, unfiltered, lower stringency perspective to both the HPP metrics and guidelines approaches, enabling community evaluation and scrutiny. MissingProteinPedia incorporates text mining technology to fetch and search accumulated UniProt, GeneCards, GeneRifs, PubMed PE2-4 data. Besides, MissingProteinPedia summarizes publicly available MS data from PRIDE, GPMDB, ProteomicsDB and MaxQB for relevant PE2-4 proteins. It allows community administrators to curate information before web publication.

We encourage all visitors to browse and contribute to 'finding' these proteins by sharing any relevant information on them that we may have missed!


Showing 201-220 of 1,482 items.
Protein IDGene IDProtein NameChromosome IDGene NameTag(s) 
Q9HC47CTAGE1Cutaneous T-cell lymphoma-associated antigen 1Chromosome-18Cutaneous T-cell lymphoma-associated antigen 1
Q9H2H0CXXC4CXXC-type zinc finger protein 4Chromosome-4CXXC-type zinc finger protein 4
Q8NA66CNBD1Cyclic nucleotide-binding domain-containing protein 1Chromosome-8Cyclic nucleotide-binding domain-containing protein 1
Q16280CNGA2Cyclic nucleotide-gated olfactory channelChromosome-XCyclic nucleotide-gated olfactory channel
Q8N815CNTD1Cyclin N-terminal domain-containing protein 1Chromosome-17Cyclin N-terminal domain-containing protein 1
Q9H8S5CNTD2Cyclin N-terminal domain-containing protein 2Chromosome-19Cyclin N-terminal domain-containing protein 2
Q5MAI5CDKL4Cyclin-dependent kinase-like 4Chromosome-2Cyclin-dependent kinase-like 4
Q9H114CSTL1Cystatin-like 1Chromosome-20Cystatin-like 1
Q96J86CYYR1Cysteine and tyrosine-rich protein 1Chromosome-21Cysteine and tyrosine-rich protein 1
Q6Q6R5CRIP3Cysteine-rich protein 3Chromosome-6Cysteine-rich protein 3
Q8TF08COX7B2Cytochrome c oxidase subunit 7B2, mitochondrialChromosome-4Cytochrome c oxidase subunit 7B2, mitochondrial
Q7Z4L0COX8CCytochrome c oxidase subunit 8C, mitochondrialChromosome-14Cytochrome c oxidase subunit 8C, mitochondrial
O43174CYP26A1Cytochrome P450 26A1Chromosome-10Cytochrome P450 26A1
Q6V0L0CYP26C1Cytochrome P450 26C1Chromosome-10Cytochrome P450 26C1
P20853CYP2A7Cytochrome P450 2A7Chromosome-19Cytochrome P450 2A7
Q5VU57AGBL4Cytosolic carboxypeptidase 6Chromosome-1Cytosolic carboxypeptidase 6

Related Publications

Publications to be cited for using MPP data and services

Islam, M.T. et al. Protannotator: A Semiautomated Pipeline for Chromosome-Wise Functional Annotation of the “Missing” Human Proteome. Journal of Proteome Research 13 (1), 76-83, doi: 10.1021/pr400794x (2014)
Islam, M.T. et al. A systematic bioinformatics approach to identify high quality MS data and functionally annotate proteins and proteomes. Methods Mol. Biol.1549, 163–176, doi: 10.1007/978-1-4939-6740-7_13 (2016)
Baker, M. S. et al. Accelerating the search for the missing proteins in the human proteome. Nat. Commun. 8, 14271 doi: 10.1038/ncomms14271 (2017).
Islam, M.T. et al. Missing ProteinPedia - under preparation.


Professor Mark S. Baker

  •  Department of Biomedical Sciences,

           Faculty of Medicine and Health Sciences,

           Level 1, 75 Talavera Rd, Macquarie University, NSW 2109, Australia

Bioinformatics, Database and Web Administration

Professor Shoba Ranganathan

  •  Department of Biomedical Sciences,

           Department of Chemistry and Biomolecular Science,

           Building F7B Room 121, Macquarie University, NSW 2109, Australia

Write to us