UniProt: The Universal Protein Knowledgebase

R. Apweiler, A. Bairoch, Cathy H. Wu, W. C. Barker, B. Boeckmann, Serenella Ferro, E. Gasteiger, Hongzhan Huang, R. Lopez, M. Magrane, M. Martin, D. Natale, C. O’Donovan, N. Redaschi, L. Yeh
Published 2004 · Biology, Computer Science, Medicine

To provide the scientific community with a single, centralized, authoritative resource for protein sequences and functional information, the Swiss-Prot, TrEMBL and PIR protein database activities have united to form the Universal Protein Knowledgebase (UniProt) consortium. Our mission is to provide a comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and query interfaces. The central database will have two sections, corresponding to the familiar Swiss-Prot (fully manually curated entries) and TrEMBL (enriched with automated classification, annotation and extensive cross-references). For convenient sequence searches, UniProt also provides several non-redundant sequence databases. The UniProt NREF (UniRef) databases provide representative subsets of the knowledgebase suitable for efficient searching. The comprehensive UniProt Archive (UniParc) is updated daily from many public source databases. The UniProt databases can be accessed online (http://www.uniprot.org) or downloaded in several formats (ftp://ftp.uniprot.org/pub). The scientific community is encouraged to submit data for inclusion in UniProt.
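As a rough illustration of what "non-redundant" means in the abstract, the sketch below collapses records that share exactly the same sequence and keeps one representative per group. It is a minimal example under stated assumptions, not a description of how UniProt builds UniRef or UniParc: the function name `collapse_identical`, the use of a SHA-256 digest as the grouping key, and the toy identifiers and sequences are all hypothetical.

```python
import hashlib
from collections import defaultdict


def collapse_identical(records):
    """Group (identifier, sequence) pairs by exact sequence identity.

    Returns a dict mapping a sequence digest to the identifiers that
    share that sequence. The first identifier in each list acts as the
    representative. Illustration only, not UniProt's actual pipeline.
    """
    clusters = defaultdict(list)
    for identifier, sequence in records:
        # Normalise case and whitespace so formatting differences do not
        # split otherwise identical sequences.
        normalised = "".join(sequence.split()).upper()
        digest = hashlib.sha256(normalised.encode("ascii")).hexdigest()
        clusters[digest].append(identifier)
    return dict(clusters)


# Hypothetical toy records; identifiers and sequences are invented.
records = [
    ("src1|A", "MKTAYIAKQR"),
    ("src2|B", "mktayiakqr"),   # same sequence as src1|A, different case
    ("src3|C", "MKTAYIAKQRQ"),  # one residue longer, so a separate group
]
clusters = collapse_identical(records)
representatives = sorted(ids[0] for ids in clusters.values())
```

Grouping on exact identity is the simplest possible criterion. Representative subsets built at lower identity thresholds would need a sequence-clustering method instead, which this sketch does not attempt.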