NCBI Reference Sequence Project: Update And Current Status

K. Pruitt, T. Tatusova, D. Maglott
Published 2003 · Computer Science, Biology, Medicine

The goal of the NCBI Reference Sequence (RefSeq) project is to provide the single best non-redundant and comprehensive collection of naturally occurring biological molecules, representing the central dogma. Nucleotide and protein sequences are explicitly linked on a residue-by-residue basis in this collection. Ideally all molecule types will be available for each well-studied organism, but the initial database collection pragmatically includes only those molecules and organisms that are most readily identified. Thus different amounts of information are available for different organisms at any given time. Furthermore, for some organisms additional intermediate records are provided when the genome sequence is not yet finished. The collection is supplied by NCBI through three distinct pipelines in addition to collaborations with community groups. The collection is curated on an ongoing basis. Additional information about the NCBI RefSeq project is available at http://www.ncbi.nih.gov/RefSeq/.

This paper is referenced by
10.1038/nature03154
Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution
L. Hillier (2004)
10.1186/1471-2105-6-39
The Use of Edge-Betweenness Clustering to Investigate Biological Function in Protein Interaction Networks
Ruth Dunn (2004)
10.5772/22016
Mining Host-Pathogen Interactions
D. Korkin (2011)
10.1111/j.1550-7408.2004.tb00552.x
ANNOUNCEMENT
(2004)
10.1093/nar/gki039
Integr8 and Genome Reviews: integrated views of complete genomes and proteomes
P. Kersey (2005)
10.1186/1471-2164-6-91
Evaluation of the similarity of gene expression data estimated with SAGE and Affymetrix GeneChips
F. van Ruissen (2004)
10.1016/J.GENE.2005.07.021
Proteomic resources: integrating biomedical information in humans.
Shubha Suresh (2005)
10.1186/1472-6807-8-18
A multi-template combination algorithm for protein comparative modeling
J. Cheng (2008)
10.1093/nar/gkl1011
GenomeTrafac: a whole genome resource for the detection of transcription factor binding site clusters associated with conventional and microRNA encoding genes conserved between mouse and human gene orthologs
A. Jegga (2007)
10.1007/11732990_15
CONTRAlign: Discriminative Training for Protein Sequence Alignment
Chuong B. Do (2006)
10.1109/MEMB.2007.384099
Name della Proteina: a protein identification resolution database [Engineering in Genomics]
K. Ng (2007)
10.1186/1471-2105-7-47
Protein kinases associated with the yeast phosphoproteome
R. Brinkworth (2005)
10.1186/1471-2164-15-265
VCGDB: a dynamic genome database of the Chinese population
Yunchao Ling (2013)
10.1016/J.CELLSIG.2003.08.013
Systematic identification of immunoreceptor tyrosine-based inhibitory motifs in the human proteome.
E. Staub (2004)
10.1007/978-1-60327-148-6_17
Web-based resources for clinical bioinformatics.
A. Joshua (2008)
10.1038/nmeth726
Mapping DNA-protein interactions in large genomes by sequence tag analysis of genomic enrichment
Jonghwan Kim (2005)
Genes in Chromosomal Ridge Domains Have Increased mRNA Folding Stability and Half-Life, Further Contributing to Their High Expression
Hinco J. Gierman (2010)
10.1371/journal.pcbi.1002341
A Dynamic View of Domain-Motif Interactions
Eyal Akiva (2012)
10.1093/nar/gkh122
The Eukaryotic Promoter Database EPD: the impact of in silico primer extension
Christoph D. Schmid (2004)
10.1142/9781860946882_0011
DATABASES ON GENE REGULATION
O. Kel-Margoulis (2005)
10.1016/B978-012369454-6/50018-2
The Mouse Genome
M. Adams (2007)
10.1093/bioinformatics/btn370
A knowledge-based approach to predict intragenic deletions or duplications
K. Kalari (2008)
Comparative mammalian genomics: models of evolution and detection of functional elements
A. Siepel (2005)
10.1016/J.DNAREP.2004.05.004
Alkylation damage in DNA and RNA--repair mechanisms and medical significance.
F. Drabløs (2004)
10.1128/MCB.00816-12
Involvement of Histone Demethylase LSD1 in Short-Time-Scale Gene Expression Changes during Cell Cycle Progression in Embryonic Stem Cells
V. Nair (2012)
10.1093/BFGP/2.2.121
The Drosophila phenotype gap - and how to close it.
J. A. Dow (2003)
10.1186/1471-2156-6-45
G2D: a tool for mining genes associated with disease
C. Perez-Iratxeta (2005)
10.1016/J.SBI.2006.04.004
Multiple sequence alignment.
R. Edgar (2006)
10.1093/dnares/dsm007
Merging Mouse Transcriptome Analyses with Parkinson's Disease Linkage Studies
D. Gherbassi (2007)
10.1093/BIB/4.2.179
Tutorial Section: Entrez: Making Use of Its Power
Renata C. Geer (2003)
10.1002/047001153X.G203112
The Human Genome Project
T. Hubbard (2005)
10.1093/HMG/DDH207
Gene-Ontology analysis reveals association of tissue-specific 5' CpG-island genes with development and embryogenesis.
P. Robinson (2004)