Measuring Agreement In Method Comparison Studies
J. Bland, D. Altman
Published 1999 · Medicine, Mathematics
Download PDFAnalyze on Scholarcy
Agreement between two methods of clinical measurement can be quantified using the differences between observations made using the two methods on the same subjects. The 95% limits of agreement, estimated by mean difference 1.96 standard deviation of the differences, provide an interval within which 95% of differences between measurements by the two methods are expected to lie. We describe how graphical methods can be used to investigate the assumptions of the method and we also give confidence intervals. We extend the basic approach to data where there is a relationship between difference and magnitude, both with a simple logarithmic transformation approach and a new, more general, regression approach. We discuss the importance of the repeatability of each method separately and compare an estimate of this to the limits of agreement. We extend the limits of agreement approach to data with repeated measurements, proposing new estimates for equal numbers of replicates by each method on each subject, for unequal numbers of replicates, and for replicated data collected in pairs, where the underlying value of the quantity being measured is changing. Finally, we describe a nonparametric approach to comparing methods.
This paper references
SPECIAL ARTICLE COMPARING METHODS OF MEASUREMENT
J. Ludbrook (1997)
Clinical assessment of gestational age in the newborn infant.
L. Dubowitz (1970)
Use and interpretation of common statistical tests in method-comparison studies.
J. O. Westgard (1973)
Statistics in medical research: principles versus practices.
H. Schoolman (1968)
Calculating age-related reference centiles using absolute residuals
DG Altman (1993)
Statistical methods for assessing agreement between two methods of clinical measurement
J. Bland (1986)
Confidence intervals on variance components
Randall W. Potter (1992)
Comparing methods of measurement: why plotting difference against standard method is misleading
J. Bland (1995)
A note on the use of the intraclass correlation coefficient in the evaluation of agreement between two methods of measurement.
J. Bland (1990)
Clinical biostatistics; XXXVII. Demeaned errors, confidence games, nonplussed minuses, inefficient coefficients, and other statistical disruptions of scientific communication
A. Feinstein (1976)
Determination of serum immunoreactive erythropoietin in the investigation of erythrocytosis.
P. Cotes (1986)
Blood pressure measurement
G. Beevers (2001)
An Evaluation of the Vita‐Stat Automatic Blood Pressure Measuring Device
B. Polk (1980)
HPLC with enzymatic detection as a candidate reference method for serum creatinine.
K. Linnet (1991)
The analysis of blood pressure data Blood pressure measurement
Dg Altman (1991)
Construction of age-related reference centiles using absolute residuals.
D. Altman (1993)
Measurement in Medicine: The Analysis of Method Comparison Studies
D. Altman (1983)
An outline of the revised British Hypertension Society protocol for the evaluation of blood pressure measuring devices.
E. O'brien (1993)
Evaluation of method-comparison data.
S. Eksborg (1981)
Lack of agreement between measurement of ejection fraction by impedance cardiography versus radionuclide ventriculography
L. Bowling (1993)
An automated enzymic micromethod for the measurement of fat in human milk.
A. Lucas (1987)
Clinical assessment of gestational age in the newborn infant. Comparison of two methods.
G. Latis (1981)
Evaluating agreement between clinical assessment methods
G. Marshall (1995)
This paper is referenced by
Lack of correlation between three commercial platforms for the evaluation of human immunodeficiency virus type 1 (HIV-1) viral load at the clinically critical lower limit of quantification.
C. Yan (2010)
Image Similarity Metrics Suitable for Infrared Video Stabilization during Active Wildfire Monitoring: A Comparative Analysis
M. M. Valero (2020)
The Importance of the Level of the Lip Line and Resting Lip Pressure in Class II, Division 2 Malocclusion
B. Lapatki (2002)
Bioelectrical impedance analysis models for prediction of total body water and fat-free mass in healthy and HIV-infected children and adolescents.
M. Horlick (2002)
Bio-intact parathyroid hormone and intact parathyroid hormone in hemodialysis patients with secondary hyperparathyroidism receiving intravenous calcitriol therapy.
A. Fujimori (2004)
Preliminary evaluation of a new chemiluminescence assay (Liaison Cyclosporine; DiaSorin Laboratories) allowing both C0 and C2 cyclosporine levels determination: comparison with RIA method.
Y. Olejnik (2005)
Comparing two clinical measurements: a linear mixed model approach
D. Lai (2005)
GFR prediction using the MDRD and Cockcroft and Gault equations in patients with end-stage renal disease.
Ying Kuan (2005)
POB8 VALIDITY OF DATA COLLECTED FROM AN INTERNET-BASED COHORT STUDY
F. Coste (2005)
Statistical methods for the validation of questionnaires--discrepancy between theory and practice.
M. Schmidt (2006)
Use of proxies and reference quintiles obtained from minimal model analysis for determination of insulin sensitivity and pancreatic beta-cell responsiveness in horses.
K. Treiber (2005)
Development and validation of the Human Activity Profile into Chinese language: lessons in determining equivalence.
A. Bonner (2006)
Predictability and limitations of non-invasive murine tonometry: comparison of two devices.
T. Filippopoulos (2006)
Customizing fetal biometric charts.
M. W. Pang (2003)
Développement d‘une méthode de dosage de la plombémie par chronopotentiométrie
H. Mathieu (2003)
Avaliação da concordância dos métodos de pesagem direta de alimentos em creches - São Paulo - Brasil
Ana Teresa Rodrigues Cruz (2003)
Reproducibility and Reversibility of Tidal Forced Expirations
S. Lum (2003)
Urethral retro-resistance pressure: a new clinical measure of urethral function.
M. Slack (2004)
Maximal expiratory flow at FRC (V'maxFRC): Methods of selection and differences in reported values.
A. Koumbourlis (2004)
Concordance de deux variables : l’approche graphique: Méthode de Bland et Altman
D. Journois (2004)
The glitter of the correlation coefficient.
P. Mcdonough (2004)
Effect of Doppler angle in diagnosis of internal carotid artery stenosis.
M. Tola (2006)
Quantitative assessment of lung cancer perfusion using MDCT: does measurement reproducibility improve with greater tumor volume coverage?
Q. Ng (2006)
Are disposable prisms an adequate alternative to standard Goldmann tonometry prisms in glaucoma patients?
A. Maino (2006)
Reports of birthweight by adolescents and their mothers: comparing accuracy and identifying correlates.
V. Lucia (2006)
Avaliação da dosagem sérica de cistatina C para detecção precoce de alterações na função do enxerto após o transplante renal
E. D. Neto (2007)
EXPLICA LA ESQUIZOTIPIA LA DISCORDANCIA ENTRE INFORMANTES DE ALTERACIONES CONDUCTUALES ADOLESCENTES
C. Medina (2007)
Evaluation of NucliSens EasyQ HIV-1 assay for quantification of HIV-1 subtypes prevalent in South-east Asia.
H. Lam (2007)
Critical flicker fusion test of potential vision
H. Shankar (2007)
Association between antioxidant status and MnSOD Ala-9Val polymorphism in trained male athletes (rugby players) and sedentary male students controlled for antioxidant intake
Maria Seele (2007)
A Comparison of Forensic Age Prediction Models Using Data From Four DNA Methylation Technologies
A. Freire-Aradas (2020)
Muscle strength testing with one repetition maximum in the arm/shoulder for people aged 75 + - test-retest reliability
E. Rydwik (2007)See more