Online citations, reference lists, and bibliographies.
← Back to Search

Structure Comparison And Structure Patterns

I. Eidhammer, I. Jonassen, W. Taylor
Published 2000 · Mathematics, Medicine, Computer Science

Cite This
Download PDF
Analyze on Scholarcy
Share
This article investigates aspects of pairwise and multiple structure comparison, and the problem of automatically discover common patterns in a set of structures. Descriptions and representation of structures and patterns are described, as well as scoring and algorithms for comparison and discovery. A framework and nomenclature is developed for classifying different methods, and many of these are reviewed and placed into this framework.
This paper references
10.1002/PRO.5560061104
Tess: A geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases. Application to enzyme active sites
A. C. Wallace (1997)
10.1006/JMBI.1998.2248
Three-dimensional profiles: a new tool to identify protein surface similarities.
M. DE RINALDIS (1998)
10.1017/S0016672300219320
Time Warps, String Edits, and Macromolecules – The Theory and Practice of Sequence Comparison . David Sankoff and Joseph Kruskal. ISBN 1-57586-217-4. Price £13.95 (US$22·95).
S. S. Sturrock (2000)
The Protein Data Bank: a computer-based archival file for macromolecular structures.
F. Bernstein (1977)
Using the radial distribution of physical features to compare amino acid environments and align amino acid sequences
L Wei (1997)
10.1002/PRO.5560070226
Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteins
M. Gerstein (1998)
10.1016/0022-4731(80)90019-9
Principles of Protein Structure
G. Schulz (1979)
A 3 D sequence - independent representation of the protein data bank
D. Fischer (1995)
10.1080/03052159508941353
MIXED-DISCRETE FUZZY PROGRAMMING FOR NONLINEAR ENGINEERING OPTIMIZATION
C. Shih (1995)
10.1007/BF02458274
A differential geometric treatment of protein structure comparison
Ding Da-fu (1994)
10.1073/PNAS.88.23.10495
Efficient detection of three-dimensional structural motifs in biological macromolecules by computer vision techniques.
R. Nussinov (1991)
A geometric algorithm to Ž nd small but highly similar 3 D substructures in proteins
M. Petitjean (1998)
Improved genetic algorithm-based protein structure comparisons
A.C.W (1995)
Structural Comparison Allowing Hinge Bending, Swiveling Motions
George Verbitsky (1999)
SCOP: a structural classi cation of proteins database
T. Hubbard (1995)
10.1093/PROTEIN/8.9.873
Improved genetic algorithm-based protein structure comparisons: pairwise and multiple superpositions.
A. C. May (1995)
10.1073/PNAS.89.22.10915
Amino acid substitution matrices from protein blocks.
S. Henikoff (1992)
10.1093/PROTEIN/7.1.31
An algorithm for automatically generating protein topology cartoons.
T. Flores (1994)
10.1016/0022-2836(91)90871-3
Motif recognition and alignment for many sequences by comparison of dot-matrices.
M. Vingron (1991)
10.1002/PRO.5560031025
Multiple protein structure alignment
W. Taylor (1994)
10.1002/PRO.5560030506
Three‐dimensional, sequence order‐independent structural comparison of a serine protease against the crystallographic database reveals active site similarities: Potential implications to evolution and to protein folding
D. Fischer (1994)
10.1016/S1359-0278(96)00021-1
Optimum superimposition of protein structures: ambiguities and implications.
Z. Feng (1996)
Ef Ž cient detection of three - dimensional structural motifs in biological macromolecules by computer vision techniques
C. Orengo (1991)
standard, the scop classiŽ cation of proteins
T. Madej (1996)
10.1016/0022-2836(90)90134-8
Definition of general topological equivalence in protein structures. A procedure involving comparison of properties and relationships through simulated annealing and dynamic programming.
A. S̆ali (1990)
10.1093/PROTEIN/1.5.377
Knowledge based modelling of homologous proteins, Part I: Three-dimensional frameworks derived from the simultaneous superposition of multiple structures.
M. J. Sutcliffe (1987)
10.1126/SCIENCE.2983426
Rapid and sensitive protein similarity searches.
D. Lipman (1985)
10.1006/JMBI.1998.1993
Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to glutaredoxins/thioredoxins and T1 ribonucleases.
J. Fetrow (1998)
De Ž nition of general topological equivalence in protein structures : A procedure involving comparison of properties and relationships through simulated annealing and dynamic programming
D. Sankoff (1990)
10.1093/bioinformatics/10.6.587
Flexible algorithm for direct multiple alignment of protein structures and sequences
A. Godzik (1994)
templates for searching structural databases: Applications to enzyme active sites
R. Laskowsi (1996)
SigniŽ cance of root-mean-square deviation in comparing
V. 356–369. Maiorov (1994)
10.1016/0022-2836(92)91021-G
Common spatial arrangements of backbone fragments in homologous and non-homologous proteins.
N. Alexandrov (1992)
10.1002/(SICI)1097-0134(199607)25:3<354::AID-PROT7>3.0.CO;2-F
Analysis of topological and nontopological structural similarities in the PDB: New examples with old structures
N. Alexandrov (1996)
Similarity Search in 3D Protein Databases
T. Seidl (1998)
10.1016/0097-8485(95)00062-3
A Protein Structure Comparison Methodology
N. P. Brown (1996)
De nition of general topological equivalence in protein structures: A procedure involving comparison of properties and relationships through simulated annealing and dynamic programming
A Sali (1990)
10.1093/PROTEIN/7.9.1059
The protein threading problem with sequence amino acid interaction preferences is NP-complete.
R. Lathrop (1994)
10.1073/PNAS.84.13.4355
Profile analysis: detection of distantly related proteins.
M. Gribskov (1987)
10.1093/nar/28.1.1
The Molecular Biology Database Collection: an online compilation of relevant database resources
A. Baxevanis (2000)
IdentiŽ cation of tertiary structure resemblance in proteins
H. Grindley (1993)
10.1089/cmb.1998.5.279
Approaches to the Automatic Discovery of Patterns in Biosequences
A. Brazma (1998)
Atlas of protein sequence and structure
M. A. Chang (1965)
10.1002/PRO.5560011006
On the multiple simultaneous superposition of molecular structures by rigid body transformations
R. Diamond (1992)
3-d lookup: Fast protein database structure searches at 90% reliability
L. Holm (1995)
Algorithmic Learning for Knowledge-Based Systems: Gosler Final Report
K. Jantke (1995)
10.1016/0003-9861(78)90204-7
The Protein Data Bank: a computer-based archival file for macromolecular structures.
F. Bernstein (1977)
10.1093/PROTEIN/8.10.981
A 3D sequence-independent representation of the protein data bank.
D. Fischer (1995)
10.1089/cmb.1996.3.289
An Algorithm for Finding Maximal Common Subtopologies in a Set of Protein Structures
I. Koch (1996)
10.1006/JMBI.1994.1017
Significance of root-mean-square deviation in comparing three-dimensional structures of globular proteins.
V. Maiorov (1994)
10.1006/JMBI.1993.1489
Protein structure comparison by alignment of distance matrices.
L. Holm (1993)
10.1007/PL00000069
Random structural models for double dynamic programming score evaluation
W. Taylor (2009)
10.1107/S0567739476001873
A solution for the best rotation to relate two sets of vectors
W. Kabsch (1976)
J. Mol. Biol
Y Satow (1986)
10.1016/0022-2836(86)90245-7
Phosphocholine binding immunoglobulin Fab McPC603. An X-ray diffraction study at 2.7 A.
Y. Satow (1986)
10.1126/science.273.5275.595
Mapping the Protein Universe
L. Holm (1996)
10.1093/PROTEIN/11.11.981
Novel method to detect a motif of local structures in different protein conformations.
H. Wako (1998)
10.1107/S010876739100867X
A method for multiple superposition of structures.
A. Shapiro (1992)
10.1002/PROT.340230208
Structural superposition of proteins with unknown alignment and detection of topological similarity using a six‐dimensional search algorithm
K. Diederichs (1995)
10.1016/0022-2836(86)90308-6
Identification of protein sequence homology by consensus template alignment.
W. Taylor (1986)
10.1093/PROTEIN/9.12.1093
Pairwise iterative superposition of distantly related proteins and assessment of the significance of 3-D structural similarity.
A. C. May (1996)
DeŽ nition of general topological equivalence in protein structures: A procedure
T. Blundell (1990)
10.1006/JMBI.1999.2581
Three-dimensional structure analysis of PROSITE patterns.
A. Kasuya (1999)
10.1142/9789814447300_0019
Application of Knowledge Discovery to Molecular Biology: Identifying Structural Regularities in Proteins
Shaobing Su (1999)
10.1002/PROT.340230309
Threading a database of protein cores
T. Madej (1995)
10.1002/(SICI)1097-0134(19990201)34:2<232::AID-PROT9>3.0.CO;2-9
Flexible structural comparison allowing hinge‐bending, swiveling motions
G. Verbitsky (1999)
10.1016/S0022-2836(05)80360-2
Basic local alignment search tool.
S. Altschul (1990)
10.1109/64.539018
Scalable Discovery of Informative Structural Concepts Using Domain Knowledge
D. Cook (1996)
10.1006/JMBI.2000.4211
Searching the protein structure databank with weak sequence patterns and structural constraints.
I. Jonassen (2000)
10.1093/nar/25.1.231
Dali/FSSP classification of three-dimensional protein folds
L. Holm (1997)
10.1093/bioinformatics/11.6.633
Using a measure of structural variation to define a core for the globins
M. Gerstein (1995)
10.1517/14622416.3.1.131
Recent progress in multiple sequence alignment: a survey.
Cédric Notredame (2002)
Multiple Structural Alignment and Core Detection by Geometric Hashing
N. Leibowitz (1999)
10.1016/S0959-440X(96)80058-3
Surprising similarities in structure comparison.
J. Gibrat (1996)
10.1093/bioinformatics/13.5.509
Efficient discovery of conserved patterns using a pattern graph
I. Jonassen (1997)
10.1093/bioinformatics/14.6.516
A geometric algorithm to find small but highly similar 3D substructures in proteins
X. Pennec (1998)
10.1093/PROTEIN/8.4.353
Comparison of spatial arrangements of secondary structural elements in proteins.
K. Mizuguchi (1995)
10.1002/PRO.5560050603
Derivation of 3D coordinate templates for searching structural databases: Application to ser‐His‐Asp catalytic triads in the serine proteinases and lipases
A. C. Wallace (1996)
10.1016/0022-2836(70)90057-4
A general method applicable to the search for similarities in the amino acid sequence of two proteins.
S. B. Needleman (1970)
10.1073/PNAS.95.11.5913
A unified statistical framework for sequence comparison and structure comparison.
M. Levitt (1998)
Protein: Struct., Funct., Gen
C. Orengo (1992)
larity using a six-dimensional search algorithm. Proteins: Struct., Func., Gen
D. Ding (1994)
Protein: Struct., Funct
C Orengo (1992)
Ef cient detection of three-dimensional structural motifs in biological macromolecules by computer vision techniques
R Nussinov (1991)
E-mail: Ingvar.Edihammer@ii.uib
10.1016/S0022-5193(05)80263-2
A rapid method of protein structure alignment.
C. Orengo (1990)
10.1002/(SICI)1097-0134(19990201)34:2<206::AID-PROT6>3.0.CO;2-N
Discovery of local packing motifs in protein structures
I. Jonassen (1999)
10.1093/PROTEIN/9.12.1103
Detection of non-topological motifs in protein structures.
V. Alesker (1996)
10.1016/0025-5564(94)90041-8
Modeling protein cores with Markov random fields.
J. White (1994)
10.1107/S0021889897006729
Align : A program to superimpose protein coordinates, accounting for insertions and deletions
G. Cohen (1997)
10.1007/BF00124346
Structure-based identification and clustering of protein families and superfamilies
S. Rufino (1994)
databases: Applications to ser-his-asp catalytic triads in the serine proteinases and lipases
R. B. Altman (1997)
10.1006/JMBI.2001.4817
Defining linear segments in protein structure.
W. Taylor (2001)
10.1006/JMBI.1996.0294
A surface of minimum area metric for the structural comparison of proteins.
A. Falicov (1996)
10.1002/PROT.340110107
Detection of common three‐dimensional substructures in proteins
G. Vriend (1991)
10.1002/PRO.5560030601
Biological meaning, statistical significance, and classification of local spatial similarities in nonhomologous proteins
N. Alexandrov (1994)
10.1006/JMBI.1993.1074
Identification of tertiary structure resemblance in proteins using a maximal common subgraph isomorphism algorithm.
H. M. Grindley (1993)
10.1073/PNAS.95.11.6073
Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships.
S. Brenner (1998)
A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins
CHRISTUS
10.1016/S0076-6879(97)77029-0
Detecting folding motifs and similarities in protein structures.
G. Kleywegt (1997)
10.1007/BF00993382
Machine discovery of protein motifs
D. Conklin (2004)
A comparison of the heme binding pocket in globins and cytochrome b5.
M. Rossmann (1975)
identiŽ ed distant evolutionary relationships
C. Brown (1973)
10.1006/JMBI.1998.1844
Detection of protein three-dimensional side-chain patterns: new examples of convergent evolution.
R. Russell (1998)
Biological meaning, statistical signiŽ cance, and classiŽ cation of local spatial simi
N. Alexandrov (1994)
10.1016/S0888-7543(05)80221-6
The HRAS1 gene cluster: two upstream regions recognizing transcripts and a third encoding a gene with a leucine zipper domain.
J. Weitzel (1992)
10.1110/PS.8.3.654
Protein structure comparison using iterated double dynamic programming
W. Taylor (1999)
10.1016/S0022-2836(77)80200-3
The Protein Data Bank. A computer-based archival file for macromolecular structures.
F. Bernstein (1977)
10.1002/PRO.5560040817
Finding flexible patterns in unaligned protein sequences
I. Jonassen (1995)
10.1016/S0097-8485(98)00017-5
Interactive Maximal Common 3D Substructure Searching with the Combined SDM/RMS Algorithm
M. Petitjean (1998)
Using the radial distributions of physical features to compare amino acid environments and align amino acid sequences.
L. Wei (1997)
10.1515/9783111576855-015
J
Seguin Hen (1824)
10.1007/3-540-60217-8_13
Pattern Inference
T. Shinohara (1995)
10.1145/362342.362367
Algorithm 457: finding all cliques of an undirected graph
C. Bron (1973)
10.1006/JMBI.1994.1657
A graph-theoretic approach to the identification of three-dimensional patterns of amino acid side-chains in protein structures.
P. Artymiuk (1994)
10.1016/0022-2836(73)90388-4
Comparison of super-secondary structures in proteins.
S. T. Rao (1973)
10.1145/299432.299464
Fast detection of common geometric substructure in proteins
L. Chew (1999)
10.1089/cmb.1998.5.41
Pairwide and Multiple Identification of Three-Dimensional Common Substructures in Proteins
V. Escalier (1998)
10.1093/bioinformatics/15.4.317
Motif-based searching in TOPS protein topology databases
D. Gilbert (1999)
Three-dimensional proŽ les: A new tool
M. O. Dayhoff (1978)
Act. Cryst. A43
P Gerber (1987)
Macromolecules: the theory and practice of sequence comparison
D. Sankoff (1983)
10.1006/JMBI.1996.0053
Global optimum protein threading with gapped alignment and empirical pair score functions.
R. Lathrop (1996)
10.1002/PRO.5560040404
Characterizing the microenvironment surrounding protein sites
S. Bagley (1995)
10.1002/JCC.540111011
An algorithm for the simultaneous superposition of a structural series
S. Kearsley (1990)
10.1002/PROT.340140216
Multiple protein sequence alignment from tertiary structure comparison: Assignment of global and residue confidence levels
R. Russell (1992)
An approach to detection of protein structural motifs using an encoding scheme of backbone conformations.
H. Matsuda (1997)
EfŽ cient discovery of conserved patterns using a pattern graph. CABIOS
I. Jonassen (1997)
A graph-theoretic approach to the identiŽ cation
P. Artymiuk (1994)
plications to evolution and to protein folding
D. Moss (1994)
Tops cartoons: formalisation, searching and comparison
D. Gilbert (1999)
Applications of knowledge discovery to molecular biology: Identifying structural
D. Cook (1999)
10.1016/S0022-2836(05)80134-2
SCOP: a structural classification of proteins database for the investigation of sequences and structures.
A. Murzin (1995)
10.1016/0022-2836(76)90195-9
Exploring structural homology of proteins.
M. Rossmann (1976)
10.1016/0022-2836(89)90084-3
Protein structure alignment.
W. Taylor (1989)
10.1002/PRO.5560050711
The structural alignment between two proteins: Is there a unique answer?
A. Godzik (1996)



This paper is referenced by
10.1007/978-3-642-11164-8_28
Automated Identification of Protein Structural Features
Chandrasekhar Mamidipally (2009)
K 2 : Protein Structure Comparisons and Their Statistical Significance
J. Szustakowski (2001)
10.1016/J.JMB.2004.04.058
3DCoffee: combining protein sequences and structures within multiple sequence alignments.
O. O'Sullivan (2004)
10.1385/1-59259-335-6:491
Pattern Discovery: Methods and Software
Brona Brejová (2007)
10.1142/S021972000400082X
Pairwise Protein Structure Alignment Based on an Orientation-independent Backbone Representation
Jieping Ye (2004)
Pattern Discovery in Protein Structures and Interaction Networks
H. Ahmed (2014)
10.1002/pro.2392
Structure‐based barcoding of proteins
R. Metri (2014)
10.1023/B:GENP.0000023687.41210.d7
Self Generating Metaheuristics in Bioinformatics: The Proteins Structure Comparison Case
N. Krasnogor (2004)
10.1007/978-3-540-30219-3_4
A polynomial-time algorithm for the matching of crossing contact-map patterns
J. Gramm (2004)
10.1089/106652704773416876
1001 Optimal PDB Structure Alignments: Integer Programming Methods for Finding the Maximum Contact Map Overlap
A. Caprara (2004)
10.1109/ICBBE.2007.42
Geometrical Issues for Representing Protein Structure
J. Liu (2007)
10.1142/S0219720008003461
ClEPaps: Fast Pair Alignment of protein Structures Based on conformational Letters
S. Wang (2008)
Benchmarking and Extension of Protein Structure Alignment tool ( Protein 3 DFit ) Project Report
N. Mahmood (2008)
A new algorithm for sequential and non-sequential protein multiple structure alignment
Lin Wang (2008)
A hybrid approach for indexing and searching protein structures
Tarek F. Gharib (2009)
10.1016/J.CPLETT.2006.11.071
A new geometric-topological method to measure protein fold similarity
Zikai Wu (2007)
COMPUTATIONAL ANALYSIS OF 3D PROTEIN STRUCTURES
Z. Aung (2006)
10.1109/TAI.2003.1250163
Pairwise protein structure alignment based on an orientation-independent representation of the backbone geometry
Jieping Ye (2003)
10.1007/978-1-59745-574-9_5
Algorithms for multiple protein structure alignment and structure-derived multiple sequence alignment.
M. Shatsky (2008)
10.1145/956750.956784
CloseGraph: mining closed frequent graph patterns
X. Yan (2003)
10.1109/CSB.2003.1227314
Towards index-based similarity search for protein structure databases
Orhan Çamoglu (2003)
10.1007/978-3-540-75530-2_27
Prefix-Shuffled Geometric Suffix Tree
T. Shibuya (2007)
PSIST: A Scalable Approach to Indexing
F. Gao (2007)
10.1109/CIT.2008.WORKSHOPS.59
Design of a Novel Protein Feature and Enzyme Function Classification
B. Lee (2008)
10.1007/978-3-540-69848-7_87
Classification of Ligase Function Based on Multi-parametric Feature Extracted from Protein Sequence
B. Lee (2008)
Structural Information and Hidden Markov Models for Biological Sequence Analysis
J. Tångrot (2008)
10.1142/S0218213005002417
Towards Scaleable Protein Structure Comparison and Database Search
Chern-Hooi Chionh (2005)
10.1002/prot.10553
Evaluation of protein fold comparison servers
M. Novotný (2004)
10.1089/cmb.2009.0217
Searching Protein Three-Dimensional Structures in Faster Than Linear Time
T. Shibuya (2010)
10.1016/j.jpdc.2007.07.008
PSIST: A scalable approach to indexing protein structures using suffix trees
F. Gao (2008)
Using structural information in modeling and multiple alignments for phylogenetics.
X. Pan (2008)
10.1142/S0219720004000491
Index-based Similarity Search for Protein Structure Databases
Orhan Çamoglu (2004)
See more
Semantic Scholar Logo Some data provided by SemanticScholar