Online citations, reference lists, and bibliographies.
← Back to Search

SMART, A Simple Modular Architecture Research Tool: Identification Of Signaling Domains.

J. Schultz, F. Milpetz, P. Bork, C. Ponting
Published 1998 · Biology, Computer Science, Medicine

Cite This
Download PDF
Analyze on Scholarcy
Share
Accurate multiple alignments of 86 domains that occur in signaling proteins have been constructed and used to provide a Web-based tool (SMART: simple modular architecture research tool) that allows rapid identification and annotation of signaling domain sequences. The majority of signaling proteins are multidomain in character with a considerable variety of domain combinations known. Comparison with established databases showed that 25% of our domain set could not be deduced from SwissProt and 41% could not be annotated by Pfam. SMART is able to determine the modular architectures of single sequences or genomes; application to the entire yeast genome revealed that at least 6.7% of its genes contain one or more signaling domains, approximately 350 greater than previously annotated. The process of constructing SMART predicted (i) novel domain homologues in unexpected locations such as band 4.1-homologous domains in focal adhesion kinases; (ii) previously unknown domain families, including a citron-homology domain; (iii) putative functions of domain families after identification of additional family members, for example, a ubiquitin-binding role for ubiquitin-associated domains (UBA); (iv) cellular roles for proteins, such predicted DEATH domains in netrin receptors further implicating these molecules in axonal guidance; (v) signaling domains in known disease genes such as SPRY domains in both marenostrin/pyrin and Midline 1; (vi) domains in unexpected phylogenetic contexts such as diacylglycerol kinase homologues in yeast and bacteria; and (vii) likely protein misclassifications exemplified by a predicted pleckstrin homology domain in a Candida albicans protein, previously described as an integrin.
This paper references
10.1105/tpc.10.11.1771
Genomics
R. Quatrano (1998)
10.1016/0092-8674(95)90347-X
A phosphotyrosine interaction domain
P. Bork (1995)
10.1038/366643A0
Proteins regulating Ras and its relatives
M. Boguski (1993)
10.1016/0014-5793(95)00931-X
The death domain motif found in Fas (Apo‐1) and TNF receptor is present in proteins involved in apoptosis and axonal guidance
K. Hofmann (1995)
10.1002/PROT.340090304
A workbench for multiple alignment construction and analysis
G. Schuler (1991)
Proc. Natl. Acad. Sci. USA 95
Paper Colloquium (1998)
EMBO J
K Ozaki (1996)
Protein Eng
R B Russell (1994)
10.1128/MCB.16.9.4888
The N-terminal pleckstrin, coiled-coil, and IQ domains of the exchange factor Ras-GRF act cooperatively to facilitate activation by calcium.
R. Buchsbaum (1996)
10.1146/ANNUREV.BI.64.070195.001443
The multiplicity of domains in proteins.
R. Doolittle (1995)
Proc. Natl. Acad. Sci. USA 93
C Gale (1996)
10.1073/PNAS.91.25.12091
Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks.
R. Tatusov (1994)
J. Biol. Chem
I Knezevic (1996)
The International FMF Consortium
(1997)
Nucleic Acids Res
T K Attwood (1997)
Protein Sci
C P Ponting (1996)
10.1089/cmb.1995.2.9
Maximum Discrimination Hidden Markov Models of Sequence Consensus
S. Eddy (1995)
10.1016/S0968-0004(00)89070-2
The death domain: a module shared by proteins with diverse cellular functions.
E. Feinstein (1995)
10.1038/ng0997-25
A candidate gene for familial Mediterranean fever
The French Fmf Consortium (1997)
10.1016/S0968-0004(96)30024-8
A duplicated catalytic motif in a new superfamily of phosphohydrolases and phospholipid synthases that includes poxvirus envelope proteins.
E. Koonin (1996)
10.1038/386833A0
Vertebrate homologues of C. elegans UNC-5 are candidate netrin receptors
E. Leonardo (1997)
10.1016/S0968-0004(97)01049-9
SPRY domains in ryanodine receptors (Ca(2+)-release channels).
C. Ponting (1997)
10.1038/ng1197-285
Opitz G/BBB syndrome, a defect of midline development, is due to mutations in a new RING finger gene on Xp22
N. Quaderi (1997)
Trends Biochem. Sci
P Bork (1997)
J. Biol. Chem
C R Loomis (1985)
10.1016/S0092-8674(00)80539-5
Ancient missense mutations in a new member of the RoRet gene family are likely to cause familial Mediterranean fever
I. Aksentijevich (1997)
J. Cell. Biol
M D Schaller (1995)
10.1002/PRO.5560050120
Extending the C2 domain family: C2s in PKCs δ, ϵ,η,θ, phospholipases, GAPs, and perforin
C.P. Pointing (1996)
10.1093/NAR/22.22.4673
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.
J. Thompson (1994)
Trends Biochem. Sci
G D Fasman (1990)
Schultz et al
B. Rost (1998)
10.1016/0968-0004(94)90108-2
PH domain: the first anniversary.
T. Gibson (1994)
10.1017/S0033583500005783
Structure and distribution of modules in extracellular proteins.
P. Bork (1996)
10.1074/jbc.271.27.16416
Direct Binding of the Platelet Integrin αIIbβ3 (GPIIb-IIIa) to Talin
I. Knezevic (1996)
Mol. Cell. Biol
R Buchsbaum (1996)
10.1038/NG0397-285
Chromosomal deletion complexes in mice by radiation of embryonic stem cells
Y. You (1997)
10.1073/PNAS.93.1.357
Cloning and expression of a gene encoding an integrin-like protein in Candida albicans.
C. Gale (1996)
Syst. Zool
W M Fitch (1970)
10.1016/0092-8674(93)90586-F
Cell
AC Tose (1993)
Comput. Appl. Biosci
B Rost (1994)
J. Cell. Biol
A J Otsuka (1995)
10.1016/S0076-6879(96)66013-3
Applying motif and profile searches.
P. Bork (1996)
10.1016/S0076-6879(96)66035-2
Analysis of compositionally biased regions in sequence databases.
J. Wootton (1996)
Trends Biochem. Sci
C P Ponting (1997)
J. Cell Sci
D Ilic (1997)
10.1093/nar/25.1.222
Recent enhancements to the Blocks Database servers
J. Henikoff (1997)
10.2307/2412448
Distinguishing homologous from analogous proteins.
W. Fitch (1970)
Trends Biochem. Sci
T J Gibson (1994)
Nucleic Acids Res
J G Henikoff (1997)
Trends Biochem. Sci
K Hofmann (1996)
Trends Biochem. Sci
E V Koonin (1996)
10.1016/0014-5793(95)01351-2
A novel partner for the GTP‐bound forms of rho and rac
P. Madaule (1995)
Nucleic Acids Res
A Bairoch (1997)
Protein Sci
C P Ponting (1996)
10.1016/0968-0004(90)90187-G
The prediction of transmembrane protein sequences and their conformation: an evaluation.
G. Fasman (1990)
10.1093/nar/25.1.212
Novel developments with the PRINTS protein fingerprint database
T. Attwood (1997)
10.1002/PRO.5560050513
A novel family of phospholipase D homologues that includes phospholipid synthases and putative endonucleases: Identification of duplicated repeats and potential active site residues
C. Ponting (1996)
Annu. Rev. Biochem
R F Doolittle (1995)
Trends Biochem. Sci
C W V Hogue (1996)
Nat. Genet
N A Quaderi (1997)
10.1016/S0014-5793(00)01734-8
FEBS Lett
M. Saraste (2000)
10.1093/NAR/25.17.3389
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
S. Altschul (1997)
10.1083/JCB.129.4.1081
An ankyrin-related gene (unc-44) is necessary for proper axonal guidance in Caenorhabditis elegans
A. J. Otsuka (1995)
Q. Rev. Biophys
P Bork (1996)
Nucleic Acids Res
E Birney (1996)
10.1016/S0968-0004(96)30027-3
Pleckstrin's repeat performance: a novel domain in G-protein signaling?
C. Ponting (1996)
10.1016/S0968-0004(96)80021-1
A dynamic look at structures: WWW-Entrez and the Molecular Modeling Database.
C. W. Hogue (1996)
10.1002/(SICI)1097-0134(199707)28:3<405::AID-PROT10>3.0.CO;2-L
Pfam: A comprehensive database of protein domain families based on seed alignments
E. L. Sonnhammer (1997)
Proc. Natl. Acad. Sci. USA 91
R L Tatusov (1994)
10.1074/jbc.271.34.20235
p62, a Phosphotyrosine-independent Ligand of the SH2 Domain of p56lck, Belongs to a New Class of Ubiquitin-binding Proteins*
R. Vadlamudi (1996)
10.1016/S0968-0004(97)01084-0
Cytoplasmic signalling domains: the next generation.
P. Bork (1997)
sn-1,2-Diacylglycerol kinase of Escherichia coli. Purification, reconstitution, and partial amino- and carboxyl-terminal analysis.
Carson R. Loomis (1985)
10.1093/NAR/24.14.2730
PairWise and SearchWise: finding the optimal alignment in a simultaneous comparison of a protein profile against all DNA translation frames.
E. Birney (1996)
10.1083/JCB.130.5.1181
Focal adhesion kinase and paxillin bind to peptides mimicking beta integrin cytoplasmic domains
M. Schaller (1995)
10.1093/nar/25.1.217
The PROSITE database, its status in 1997
A. Bairoch (1997)
10.1038/380538A0
A mechanism for regulation of the adhesion-associated protein tyrosine kinase pp125FAK
A. Richardson (1996)
J. Biol. Chem
R K Vadlamudi (1996)
Trends Biochem. Sci
E Feinstein (1995)
Proc. Natl. Acad. Sci. USA 94
A R Mushegian (1997)
10.1016/0888-7543(91)90071-L
Searching protein sequence libraries: comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms.
W. Pearson (1991)
Comput. Biol
S R Eddy (1995)
Nucleic Acids Res
S F Altschul (1997)
Nat. Genet
The French (1997)
10.1126/science.252.5009.1162
Predicting coiled coils from protein sequences
A. Lupas (1991)
10.1093/bioinformatics/10.1.53
PHD - an automatic mail server for protein secondary structure prediction
B. Rost (1994)
10.1002/j.1460-2075.1996.tb00573.x
Rom1p and Rom2p are GDP/GTP exchange proteins (GEPs) for the Rho1p small GTP binding protein in Saccharomyces cerevisiae.
K. Ozaki (1996)
10.1016/S0968-0004(96)30015-7
The UBA domain: a sequence motif present in multiple enzyme classes of the ubiquitination pathway.
K. Hofmann (1996)
Nucleic Acids Res
J D Thompson (1994)
Protein Sci
D T Haynie (1996)
Trends Biochem. Sci
C P Ponting (1996)
Focal adhesion kinase: at the crossroads of signal transduction.
D. Ilic (1997)
10.1038/386838A0
The mouse rostral cerebellar malformation gene encodes an UNC-5-like protein
S. Ackerman (1997)
10.1073/PNAS.94.11.5831
Positionally cloned human disease genes: patterns of evolutionary conservation and functional motifs.
A. Mushegian (1997)
10.1002/PRO.5560051227
The N‐terminal domains of tensin and auxilin are phosphatase homologues
D. T. Haynie (1996)
10.1126/SCIENCE.1708917
Presence of an SH2 domain in the actin-binding protein tensin.
S. Davis (1991)



This paper is referenced by
10.1073/pnas.0606877103
Posttranslational hydroxylation of ankyrin repeats in IκB proteins by the hypoxia-inducible factor (HIF) asparaginyl hydroxylase, factor inhibiting HIF (FIH)
M. Cockman (2006)
10.1074/jbc.275.4.2966
Three Isoforms of Synaptic Scaffolding Molecule and Their Characterization
K. Hirao (2000)
10.1038/75192
Structure and lipid transport mechanism of a StAR-related domain
Y. Tsujishita (2000)
10.1016/S0014-5793(00)01777-4
Domain repertoires as a tool to derive protein recognition rules
A. Zucconi (2000)
10.1016/S0378-1119(02)00829-6
Cloning and characterization of human and mouse SNRK sucrose non-fermenting protein (SNF-1)-related kinases.
N. Kertesz (2002)
10.1074/JBC.M203061200
Interaction of HIV-1 Integrase with DNA Repair Protein hRad18*
L. Mulder (2002)
10.1016/S0378-1119(02)01058-2
The MADF-BESS domain factor Dip3 potentiates synergistic activation by Dorsal and Twist.
V. Bhaskar (2002)
10.1091/MBC.E02-05-0285
Identification of a novel type of cGMP phosphodiesterase that is defective in the chemotactic stmF mutants.
M. Meima (2002)
10.1271/bbb.67.77
Identification and Characterization of Scp15, a Protein from Streptomyces coelicolor A3(2) Inducing Neurites in PC12 Cells
S. Nakashima (2003)
10.1038/nsb946
Mechanism of multiple lysine methylation by the SET domain enzyme Rubisco LSMT
R. Trievel (2003)
10.1002/cfg.303
Protein–DNA Interactions: The Story so Far and a New Method for Prediction
S. Jones (2003)
10.1128/JB.183.16.4823-4838.2001
Genome sequence and comparative analysis of the solvent-producing bacterium Clostridium acetobutylicum.
J. Nölling (2001)
10.1016/J.YDBIO.2003.09.037
Characterization of C. elegans RING finger protein 1, a binding partner of ubiquitin-conjugating enzyme 1.
Emily A. Crowe (2004)
10.1093/MOLBEV/MSH055
Evolution of the multifunctional protein tyrosine phosphatase family.
Birgit Pils (2004)
Biochemische und molekularbiologische Charakterisierung von RAIP, einem neuen ER-lokalisierten proapoptotischen Protein
Erik Braziulis (2004)
10.1186/1471-2105-5-113
MUSCLE: a multiple sequence alignment method with reduced time and space complexity
R. Edgar (2004)
10.1074/JBC.M313097200
Analysis of the Role of Ubiquitin-interacting Motifs in Ubiquitin Binding and Ubiquitylation*[boxs]
Stephanie L H Miller (2004)
10.1128/JB.186.20.6782-6791.2004
PAS domain of the Aer redox sensor requires C-terminal residues for native-fold formation and flavin adenine dinucleotide binding.
S. Herrmann (2004)
Automated Genome-Wide Protein Domain Exploration
Bhanu Rekepalli (2007)
10.1186/1471-2148-7-12
Conservation of the TRAPPII-specific subunits of a Ypt/Rab exchanger complex
R. Cox (2006)
10.1111/J.1365-313X.2007.03028.X
A leucine-rich repeat protein is required for growth promotion and enhanced seed production mediated by the endophytic fungus Piriformospora indica in Arabidopsis thaliana.
Bationa Shahollari (2007)
10.1016/J.GENE.2006.11.002
Comparative and phylogenetic analysis of α-l-fucosidase genes
J. Intra (2007)
10.1016/J.IBMB.2007.04.002
Genes encoding putative Takeout/juvenile hormone binding proteins in the honeybee (Apis mellifera) and modulation by age and juvenile hormone of the takeout-like gene GB19811.
Tzachi Hagai (2007)
10.1128/JB.00287-07
Flavodoxin:quinone reductase (FqrB): a redox partner of pyruvate:ferredoxin oxidoreductase that reversibly couples pyruvate oxidation to NADPH production in Helicobacter pylori and Campylobacter jejuni.
M. St Maurice (2007)
10.1590/S1415-47572008000400022
Expression, purification and DNA-binding activities of two putative ModE proteins of Herbaspirillum seropedicae (Burkholderiales, Oxalobacteraceae)
André L. F. Souza (2008)
10.1016/J.BBAPAP.2007.08.001
The role of cell signaling in poxvirus tropism: the case of the M-T5 host range protein of myxoma virus.
Steven J. Werden (2008)
10.1038/nsmb0308-221
Ankyrin for methylated lysines
M. Brent (2008)
10.1016/j.jsb.2008.04.003
The leptospiral antigen Lp49 is a two-domain protein with putative protein binding function.
P. O. Giuseppe (2008)
10.1007/s00427-008-0245-9
Comprehensive survey and classification of homeobox genes in the genome of amphioxus, Branchiostoma floridae
N. Takatori (2008)
10.1074/jbc.M803618200
Caenorhabditis elegans Gelsolin-like Protein 1 Is a Novel Actin Filament-severing Protein with Four Gelsolin-like Repeats*
T. Klaavuniemi (2008)
10.1016/J.GDE.2005.04.009
Evolution of primary microcephaly genes and the enlargement of primate brains.
C. Ponting (2005)
10.1111/J.1399-3054.2005.00505.X
MATH domain proteins represent a novel protein family in Arabidopsis thaliana, and at least one member is modified in roots during the course of a plant–microbe interaction
R. Oelmüller (2005)
See more
Semantic Scholar Logo Some data provided by SemanticScholar