Online citations, reference lists, and bibliographies.
← Back to Search

Assessing Intrarater, Interrater And Test-retest Reliability Of Continuous Measurements.

V. Rousson, T. Gasser, B. Seifert
Published 2002 · Mathematics, Medicine

Cite This
Download PDF
Analyze on Scholarcy
Share
In this paper we review the problem of defining and estimating intrarater, interrater and test-retest reliability of continuous measurements. We argue that the usual notion of product-moment correlation is well adapted in a test-retest situation, whereas the concept of intraclass correlation should be used for intrarater and interrater reliability. The key difference between these two approaches is the treatment of systematic error, which is often due to a learning effect for test-retest data. We also consider the reliability of a sum and a difference of variables and illustrate the effects on components. Further, we compare these approaches of reliability with the concept of limits of agreement proposed by Bland and Altman (for evaluating the agreement between two methods of clinical measurements) and show how product-moment correlation is related to it. We then propose new kinds of limits of agreement which are related to intraclass correlation. A test battery to study the development of neuro-motor functions in children and adolescents illustrates our purpose throughout the paper.
This paper references
10.1016/S0140-6736(86)90837-8
STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT
J. Bland (1986)
10.1016/0010-4825(89)90036-X
Statistical evaluation of agreement between two methods for measuring a quantitative variable.
J. Lee (1989)
10.1016/s0140-6736(65)91037-8
AKUFO AND IBARAPA.
A. H. Beckett (1965)
10.1007/BF02293867
Approximate interval estimation for a certain intraclass correlation coefficient
J. Fleiss (1978)
10.1007/BF02289730
Reliability formulas for independent decision data when reliability data are matched
N. Rajaratnam (1960)
10.1002/(SICI)1097-0258(19990815)18:15<2051::AID-SIM162>3.0.CO;2-P
Higher-moment approaches to approximate interval estimation for a certain intraclass correlation coefficient.
K. Zou (1999)
10.1214/AOMS/1177706875
On the Comparative Anatomy of Transformations
John W. Tukey (1957)
10.1017/S0012162201000810
Neuromotor development from 5 to 18 years. Part 1: timed performance.
R. Largo (2001)
10.1037/0033-2909.86.2.420
Intraclass correlations: uses in assessing rater reliability.
P. Shrout (1979)
10.2307/3002019
An approximate distribution of estimates of variance components.
Satterthwaite Fe (1946)
10.2466/pr0.1966.19.1.3
The Intraclass Correlation Coefficient as a Measure of Reliability
J. Bartko (1966)
10.1098/RSTA.1899.0006
Mathematical Contributions to the Theory of Evolution. VI. Genetic (Reproductive) Selection: Inheritance of Fertility in Man, and of Fecundity in Thoroughbred Racehorses
K. Pearson (1899)
10.5694/j.1326-5377.1989.tb115999.x
Emergency medicine — a house that Jack is building?
T. K. Taylor (1989)
10.1016/0010-4825(90)90013-F
A note on the use of the intraclass correlation coefficient in the evaluation of agreement between two methods of measurement.
J. Bland (1990)



This paper is referenced by
10.1016/j.apmr.2007.12.031
The Moss Attention Rating Scale for traumatic brain injury: further explorations of reliability and sensitivity to change.
J. Whyte (2008)
Limited Diagnostic Utility in Nonradiographic Axial Spondyloarthritis Fat Infiltration on Magnetic Resonance Imaging of the Sacroiliac Joints Has
Robert G W Lambert ()
10.1097/SPV.0b013e31827bfd93
Web Versus Paper-Based Completion of the Epidemiology of Prolapse and Incontinence Questionnaire
M. Egger (2013)
10.1136/bjophthalmol-2017-310396
Reliability of Cyclotorsion measurements using Scanning Laser Ophthalmoscopy imaging in healthy subjects: the CySLO study
Fabian Lengwiler (2017)
10.1109/TNSRE.2020.3013705
Test-Retest Reliability of Kinematic Assessments for Upper Limb Robotic Rehabilitation
T. Koeppel (2020)
10.1016/j.spinee.2017.08.257
Reliability of physical functioning tests in patients with low back pain: a systematic review.
Lenie Denteneer (2018)
10.1186/1471-2474-8-57
Test-retest reliability of knee kinesthesia in healthy adults
E. Ageberg (2007)
10.1186/1475-2891-5-4
Test-retest reproducibility of a food frequency questionnaire (FFQ) and estimated effects on disease risk in the Norwegian Women and Cancer Study (NOWAC)
C. Parr (2006)
Some Case Studies of Simple Component Analysis
V. Rousson (2003)
10.1177/1087054715627488
Test–Retest Reliability and Measurement Invariance of Executive Function Tasks in Young Children With and Without ADHD
S. Karalunas (2016)
10.1080/23279095.2016.1263795
Test–retest reliability of five frequently used executive tasks in healthy adults
A. Soveri (2018)
10.1111/J.1365-3059.2012.02651.X
Reliability and accuracy of visual methods to quantify severity of foliar bacterial spot symptoms on peach and nectarine.
S. Bardsley (2013)
10.1002/PRI.358
Clinical relevance using timed walk tests and 'timed up and go' testing in persons with multiple sclerosis.
Y. Nilsagård (2007)
Specifying the Heterogeneity in Children with ADHD : Symptom Domains, Neuropsychological Processes, and Comorbidity
Cecilia Wåhlstedt (2009)
The Construction and Validation of a Test of Wrestling Skill
Khodadad Kashi Sholeh (2015)
10.1007/s12144-020-00918-7
Cross-cultural adaptation and validation of the Persian version of Children’s Behavior Questionnaire in Iranian children
Golnoosh Golmohammadi (2020)
10.1016/j.neuroimage.2019.06.038
Detecting resting-state brain activity using OEF-weighted imaging
Y. Yang (2019)
10.1016/J.IJGE.2015.05.007
Factors Associated with Attitude and Knowledge Toward Hospice Palliative Care Among Medical Caregivers
Shih-Yi Lee (2015)
10.1007/978-1-4471-2277-7_12
The Advanced Appreciation of Upper Limb Rehabilitation in Cervical Spinal Cord Injury
Ninja P. Oess (2012)
10.1016/J.FERTNSTERT.2004.07.932
Grading a developmental continuum--elegy on the rise and fall of the endometrial biopsy.
P. Mcdonough (2004)
10.5014/AJOT.2010.AJOT00000414
Objectivity and stability of the Preschool Imitation and Praxis Scale.
M. Vanvuchelen (2011)
10.1016/j.jns.2016.01.039
Test–retest reliability of single and paired pulse transcranial magnetic stimulation parameters in healthy subjects
A. Hermsen (2016)
10.1016/J.NEUCLI.2007.08.002
Original article/Article originalMultichannel recording of median nerve somatosensory evoked potentialsEnregistrement multicanal des PES du nerf médian
W. Wassenberg (2008)
10.1016/j.neuroimage.2009.03.082
Regional activation of the human medial temporal lobe during intentional encoding of objects and positions
Thomas Z. Ramsøy (2009)
10.1016/j.neuroimage.2011.08.044
Test–retest reliability of resting-state connectivity network characteristics using fMRI and graph theoretical measures
U. Braun (2012)
Clinical Translation of Diffusion Cardiac Magnetic Resonance Imaging: Motion Robust In Vivo Characterization of
Myocardial Tissue Microstructure (2015)
10.1186/s12968-015-0214-1
Contrast-free detection of myocardial fibrosis in hypertrophic cardiomyopathy patients with diffusion-weighted cardiovascular magnetic resonance
C. Nguyen (2015)
10.1530/EC-18-0296
GH deficiency in patients with spinal cord injury: efficacy/safety of GH replacement, a pilot study
G. Cuatrecasas (2018)
Impaired nasal patency and sleep disturbances - prevalence, quality of life, and treatment
Maria Värendh (2018)
10.1002/jmri.21282
Comparison between 2D and 3D high‐resolution black‐blood techniques for carotid artery wall imaging in clinically significant atherosclerosis
N. Balu (2008)
Measuring morbidity following major surgery
M. P. Grocott (2010)
10.1038/sj.eye.6702097
Statistical strategies to assess reliability in ophthalmology
N. Patton (2006)
See more
Semantic Scholar Logo Some data provided by SemanticScholar