image source:

The MissingProteinPedia is a protein data and information sharing web system that aims to collate any relevant data ‘Missing proteins’ as defined by neXtProt. At its core is a schema-less database-driven web system allowing captures of all PE2-4 protein PubMed data, based on gene and protein including synonyms. The database also allows unpublished, preliminary or proprietary data (e.g., antibody, MS, cell biological and genetic studies) to be shared with collaborators via a protected interface.

MissingProteinPedia facilitates the Human Proteome Project (HPP) cross-disciplinary collaboration by providing a complimentary, unfiltered, lower stringency perspective to both the HPP metrics and guidelines approaches, enabling community evaluation and scrutiny. MissingProteinPedia incorporates text mining technology to fetch and search accumulated UniProt, GeneCards, GeneRifs, PubMed PE2-4 data. Besides, MissingProteinPedia summarizes publicly available MS data from PRIDE, GPMDB, ProteomicsDB and MaxQB for relevant PE2-4 proteins. It allows community administrators to curate information before web publication.

We encourage all visitors to browse and contribute to 'finding' these proteins by sharing any relevant information on them that we may have missed!


Showing 61-80 of 1,482 items.
Protein IDGene IDProtein NameChromosome IDGene NameTag(s) 
Q3KP44ANKRD55Ankyrin repeat domain-containing protein 55Chromosome-5Ankyrin repeat domain-containing protein 55
Q9BZ19ANKRD60Ankyrin repeat domain-containing protein 60Chromosome-20Ankyrin repeat domain-containing protein 60
A6NF34ANTXRLAnthrax toxin receptor-likeChromosome-10Anthrax toxin receptor-like
Q8IVJ8APRG1AP20 region protein 1Chromosome-3AP20 region protein 1
Q96LR9APOLD1Apolipoprotein L domain-containing protein 1Chromosome-12Apolipoprotein L domain-containing protein 1
Q9BPW4APOL4Apolipoprotein L4Chromosome-22Apolipoprotein L4
Q9BWW9APOL5Apolipoprotein L5Chromosome-22Apolipoprotein L5
Q8TF27AGAP11Arf-GAP with GTPase, ANK repeat and PH domain-containing protein 11Chromosome-10Arf-GAP with GTPase, ANK repeat and PH domain-containing protein 11
Q96P64AGAP4Arf-GAP with GTPase, ANK repeat and PH domain-containing protein 4Chromosome-10Arf-GAP with GTPase, ANK repeat and PH domain-containing protein 4
A6NJG6ARGFXArginine-fifty homeoboxChromosome-3Arginine-fifty homeobox
Q8NEN0ARMC2Armadillo repeat-containing protein 2Chromosome-6Armadillo repeat-containing protein 2
Q6P093AADACL2Arylacetamide deacetylase-like 2Chromosome-3Arylacetamide deacetylase-like 2

Related Publications

Publications to be cited for using MPP data and services

Islam, M.T. et al. Protannotator: A Semiautomated Pipeline for Chromosome-Wise Functional Annotation of the “Missing” Human Proteome. Journal of Proteome Research 13 (1), 76-83, doi: 10.1021/pr400794x (2014)
Islam, M.T. et al. A systematic bioinformatics approach to identify high quality MS data and functionally annotate proteins and proteomes. Methods Mol. Biol.1549, 163–176, doi: 10.1007/978-1-4939-6740-7_13 (2016)
Baker, M. S. et al. Accelerating the search for the missing proteins in the human proteome. Nat. Commun. 8, 14271 doi: 10.1038/ncomms14271 (2017).
Islam, M.T. et al. Missing ProteinPedia - under preparation.


Professor Mark S. Baker

  •  Department of Biomedical Sciences,

           Faculty of Medicine and Health Sciences,

           Level 1, 75 Talavera Rd, Macquarie University, NSW 2109, Australia

Bioinformatics, Database and Web Administration

Professor Shoba Ranganathan

  •  Department of Biomedical Sciences,

           Department of Chemistry and Biomolecular Science,

           Building F7B Room 121, Macquarie University, NSW 2109, Australia

Write to us