Online citations, reference lists, and bibliographies.

Measuring Agreement In Method Comparison Studies

J. Bland, D. Altman
Published 1999 · Medicine, Mathematics

Cite This
Download PDF
Analyze on Scholarcy
Share
Agreement between two methods of clinical measurement can be quantified using the differences between observations made using the two methods on the same subjects. The 95% limits of agreement, estimated by mean difference 1.96 standard deviation of the differences, provide an interval within which 95% of differences between measurements by the two methods are expected to lie. We describe how graphical methods can be used to investigate the assumptions of the method and we also give confidence intervals. We extend the basic approach to data where there is a relationship between difference and magnitude, both with a simple logarithmic transformation approach and a new, more general, regression approach. We discuss the importance of the repeatability of each method separately and compare an estimate of this to the limits of agreement. We extend the limits of agreement approach to data with repeated measurements, proposing new estimates for equal numbers of replicates by each method on each subject, for unequal numbers of replicates, and for replicated data collected in pairs, where the underlying value of the quantity being measured is changing. Finally, we describe a nonparametric approach to comparing methods.
This paper references
10.1111/j.1440-1681.1997.tb01807.x
SPECIAL ARTICLE COMPARING METHODS OF MEASUREMENT
J. Ludbrook (1997)
10.1016/S0022-3476(70)80038-5
Clinical assessment of gestational age in the newborn infant.
L. Dubowitz (1970)
10.1093/clinchem/19.1.49
Use and interpretation of common statistical tests in method-comparison studies.
J. O. Westgard (1973)
Statistics in medical research: principles versus practices.
H. Schoolman (1968)
Calculating age-related reference centiles using absolute residuals
DG Altman (1993)
10.1016/J.IJNURSTU.2009.10.001
Statistical methods for assessing agreement between two methods of clinical measurement
J. Bland (1986)
10.2307/1269531
Confidence intervals on variance components
Randall W. Potter (1992)
10.1016/S0140-6736(95)91748-9
Comparing methods of measurement: why plotting difference against standard method is misleading
J. Bland (1995)
10.1016/0010-4825(90)90013-F
A note on the use of the intraclass correlation coefficient in the evaluation of agreement between two methods of measurement.
J. Bland (1990)
10.1002/cpt1976205617
Clinical biostatistics; XXXVII. Demeaned errors, confidence games, nonplussed minuses, inefficient coefficients, and other statistical disruptions of scientific communication
A. Feinstein (1976)
10.1056/NEJM198607313150503
Determination of serum immunoreactive erythropoietin in the investigation of erythrocytosis.
P. Cotes (1986)
10.1136/bmj.322.7292.981
Blood pressure measurement
G. Beevers (2001)
10.1161/01.HYP.2.2.221
An Evaluation of the Vita‐Stat Automatic Blood Pressure Measuring Device
B. Polk (1980)
10.1093/clinchem/37.10.1669
HPLC with enzymatic detection as a candidate reference method for serum creatinine.
K. Linnet (1991)
The analysis of blood pressure data Blood pressure measurement
Dg Altman (1991)
10.1002/sim.4780121003
Construction of age-related reference centiles using absolute residuals.
D. Altman (1993)
10.2307/2987937
Measurement in Medicine: The Analysis of Method Comparison Studies
D. Altman (1983)
10.1097/00004872-199306000-00013
An outline of the revised British Hypertension Society protocol for the evaluation of blood pressure measuring devices.
E. O'brien (1993)
10.1093/clinchem/27.7.1311
Evaluation of method-comparison data.
S. Eksborg (1981)
10.1097/00003246-199310000-00021
Lack of agreement between measurement of ejection fraction by impedance cardiography versus radionuclide ventriculography
L. Bowling (1993)
10.1017/S0022029900025693
An automated enzymic micromethod for the measurement of fat in human milk.
A. Lucas (1987)
10.1016/0378-3782(81)90068-2
Clinical assessment of gestational age in the newborn infant. Comparison of two methods.
G. Latis (1981)
Evaluating agreement between clinical assessment methods
G. Marshall (1995)



This paper is referenced by
10.1016/j.jcv.2010.08.016
Lack of correlation between three commercial platforms for the evaluation of human immunodeficiency virus type 1 (HIV-1) viral load at the clinically critical lower limit of quantification.
C. Yan (2010)
10.3390/rs12030540
Image Similarity Metrics Suitable for Infrared Video Stabilization during Active Wildfire Monitoring: A Comparative Analysis
M. M. Valero (2020)
10.1177/154405910208100507
The Importance of the Level of the Lip Line and Resting Lip Pressure in Class II, Division 2 Malocclusion
B. Lapatki (2002)
10.1093/AJCN/76.5.991
Bioelectrical impedance analysis models for prediction of total body water and fat-free mass in healthy and HIV-infected children and adolescents.
M. Horlick (2002)
10.1111/J.1774-9987.2004.00189.X
Bio-intact parathyroid hormone and intact parathyroid hormone in hemodialysis patients with secondary hyperparathyroidism receiving intravenous calcitriol therapy.
A. Fujimori (2004)
10.1016/J.TRANSPROCEED.2004.12.193
Preliminary evaluation of a new chemiluminescence assay (Liaison Cyclosporine; DiaSorin Laboratories) allowing both C0 and C2 cyclosporine levels determination: comparison with RIA method.
Y. Olejnik (2005)
10.1080/02664760500080157
Comparing two clinical measurements: a linear mixed model approach
D. Lai (2005)
10.1093/NDT/GFI076
GFR prediction using the MDRD and Cockcroft and Gault equations in patients with end-stage renal disease.
Ying Kuan (2005)
10.1016/S1098-3015(10)67376-7
POB8 VALIDITY OF DATA COLLECTED FROM AN INTERNET-BASED COHORT STUDY
F. Coste (2005)
10.1055/S-0038-1634096
Statistical methods for the validation of questionnaires--discrepancy between theory and practice.
M. Schmidt (2006)
10.2460/AJVR.2005.66.2114
Use of proxies and reference quintiles obtained from minimal model analysis for determination of insulin sensitivity and pancreatic beta-cell responsiveness in horses.
K. Treiber (2005)
10.1111/J.1442-2018.2006.00265.X
Development and validation of the Human Activity Profile into Chinese language: lessons in determining equivalence.
A. Bonner (2006)
10.1016/J.EXER.2005.12.005
Predictability and limitations of non-invasive murine tonometry: comparison of two devices.
T. Filippopoulos (2006)
10.1002/UOG.196
Customizing fetal biometric charts.
M. W. Pang (2003)
Développement d‘une méthode de dosage de la plombémie par chronopotentiométrie
H. Mathieu (2003)
10.1590/S1415-790X2003000300005
Avaliação da concordância dos métodos de pesagem direta de alimentos em creches - São Paulo - Brasil
Ana Teresa Rodrigues Cruz (2003)
10.1159/000074219
Reproducibility and Reversibility of Tidal Forced Expirations
S. Lum (2003)
10.1002/NAU.20042
Urethral retro-resistance pressure: a new clinical measure of urethral function.
M. Slack (2004)
10.1002/PPUL.10452
Maximal expiratory flow at FRC (V'maxFRC): Methods of selection and differences in reported values.
A. Koumbourlis (2004)
10.1016/S0761-8425(04)71243-7
Concordance de deux variables : l’approche graphique: Méthode de Bland et Altman
D. Journois (2004)
10.1016/J.FERTNSTERT.2004.03.006
The glitter of the correlation coefficient.
P. Mcdonough (2004)
10.7863/JUM.2006.25.9.1187
Effect of Doppler angle in diagnosis of internal carotid artery stenosis.
M. Tola (2006)
10.2214/AJR.05.0889
Quantitative assessment of lung cancer perfusion using MDCT: does measurement reproducibility improve with greater tumor volume coverage?
Q. Ng (2006)
10.1016/J.OPHTHA.2006.05.031
Are disposable prisms an adequate alternative to standard Goldmann tonometry prisms in glaucoma patients?
A. Maino (2006)
10.1111/J.1365-3016.2006.00757.X
Reports of birthweight by adolescents and their mothers: comparing accuracy and identifying correlates.
V. Lucia (2006)
Avaliação da dosagem sérica de cistatina C para detecção precoce de alterações na função do enxerto após o transplante renal
E. D. Neto (2007)
EXPLICA LA ESQUIZOTIPIA LA DISCORDANCIA ENTRE INFORMANTES DE ALTERACIONES CONDUCTUALES ADOLESCENTES
C. Medina (2007)
10.1016/J.JCV.2006.10.002
Evaluation of NucliSens EasyQ HIV-1 assay for quantification of HIV-1 subtypes prevalent in South-east Asia.
H. Lam (2007)
10.1016/j.jcrs.2006.10.042
Critical flicker fusion test of potential vision
H. Shankar (2007)
Association between antioxidant status and MnSOD Ala-9Val polymorphism in trained male athletes (rugby players) and sedentary male students controlled for antioxidant intake
Maria Seele (2007)
10.3389/fgene.2020.00932
A Comparison of Forensic Age Prediction Models Using Data From Four DNA Methylation Technologies
A. Freire-Aradas (2020)
10.1177/0269215506072088
Muscle strength testing with one repetition maximum in the arm/shoulder for people aged 75 + - test-retest reliability
E. Rydwik (2007)
See more
Semantic Scholar Logo Some data provided by SemanticScholar