← Back to Search
Assessing Intrarater, Interrater And Test-retest Reliability Of Continuous Measurements.
V. Rousson, T. Gasser, B. Seifert
Published 2002 · Mathematics, Medicine
Download PDFAnalyze on Scholarcy
In this paper we review the problem of defining and estimating intrarater, interrater and test-retest reliability of continuous measurements. We argue that the usual notion of product-moment correlation is well adapted in a test-retest situation, whereas the concept of intraclass correlation should be used for intrarater and interrater reliability. The key difference between these two approaches is the treatment of systematic error, which is often due to a learning effect for test-retest data. We also consider the reliability of a sum and a difference of variables and illustrate the effects on components. Further, we compare these approaches of reliability with the concept of limits of agreement proposed by Bland and Altman (for evaluating the agreement between two methods of clinical measurements) and show how product-moment correlation is related to it. We then propose new kinds of limits of agreement which are related to intraclass correlation. A test battery to study the development of neuro-motor functions in children and adolescents illustrates our purpose throughout the paper.
This paper references
STATISTICAL METHODS FOR ASSESSING AGREEMENT BETWEEN TWO METHODS OF CLINICAL MEASUREMENT
J. Bland (1986)
Statistical evaluation of agreement between two methods for measuring a quantitative variable.
J. Lee (1989)
AKUFO AND IBARAPA.
A. H. Beckett (1965)
Approximate interval estimation for a certain intraclass correlation coefficient
J. Fleiss (1978)
Reliability formulas for independent decision data when reliability data are matched
N. Rajaratnam (1960)
Higher-moment approaches to approximate interval estimation for a certain intraclass correlation coefficient.
K. Zou (1999)
On the Comparative Anatomy of Transformations
John W. Tukey (1957)
Neuromotor development from 5 to 18 years. Part 1: timed performance.
R. Largo (2001)
Intraclass correlations: uses in assessing rater reliability.
P. Shrout (1979)
An approximate distribution of estimates of variance components.
Satterthwaite Fe (1946)
The Intraclass Correlation Coefficient as a Measure of Reliability
J. Bartko (1966)
Mathematical Contributions to the Theory of Evolution. VI. Genetic (Reproductive) Selection: Inheritance of Fertility in Man, and of Fecundity in Thoroughbred Racehorses
K. Pearson (1899)
Emergency medicine — a house that Jack is building?
T. K. Taylor (1989)
A note on the use of the intraclass correlation coefficient in the evaluation of agreement between two methods of measurement.
J. Bland (1990)
This paper is referenced by
The Moss Attention Rating Scale for traumatic brain injury: further explorations of reliability and sensitivity to change.
J. Whyte (2008)
Limited Diagnostic Utility in Nonradiographic Axial Spondyloarthritis Fat Infiltration on Magnetic Resonance Imaging of the Sacroiliac Joints Has
Robert G W Lambert ()
Web Versus Paper-Based Completion of the Epidemiology of Prolapse and Incontinence Questionnaire
M. Egger (2013)
Reliability of Cyclotorsion measurements using Scanning Laser Ophthalmoscopy imaging in healthy subjects: the CySLO study
Fabian Lengwiler (2017)
Test-Retest Reliability of Kinematic Assessments for Upper Limb Robotic Rehabilitation
T. Koeppel (2020)
Reliability of physical functioning tests in patients with low back pain: a systematic review.
Lenie Denteneer (2018)
Test-retest reliability of knee kinesthesia in healthy adults
E. Ageberg (2007)
Test-retest reproducibility of a food frequency questionnaire (FFQ) and estimated effects on disease risk in the Norwegian Women and Cancer Study (NOWAC)
C. Parr (2006)
Some Case Studies of Simple Component Analysis
V. Rousson (2003)
Test–Retest Reliability and Measurement Invariance of Executive Function Tasks in Young Children With and Without ADHD
S. Karalunas (2016)
Test–retest reliability of five frequently used executive tasks in healthy adults
A. Soveri (2018)
Reliability and accuracy of visual methods to quantify severity of foliar bacterial spot symptoms on peach and nectarine.
S. Bardsley (2013)
Clinical relevance using timed walk tests and 'timed up and go' testing in persons with multiple sclerosis.
Y. Nilsagård (2007)
Specifying the Heterogeneity in Children with ADHD : Symptom Domains, Neuropsychological Processes, and Comorbidity
Cecilia Wåhlstedt (2009)
The Construction and Validation of a Test of Wrestling Skill
Khodadad Kashi Sholeh (2015)
Cross-cultural adaptation and validation of the Persian version of Children’s Behavior Questionnaire in Iranian children
Golnoosh Golmohammadi (2020)
Detecting resting-state brain activity using OEF-weighted imaging
Y. Yang (2019)
Factors Associated with Attitude and Knowledge Toward Hospice Palliative Care Among Medical Caregivers
Shih-Yi Lee (2015)
The Advanced Appreciation of Upper Limb Rehabilitation in Cervical Spinal Cord Injury
Ninja P. Oess (2012)
Grading a developmental continuum--elegy on the rise and fall of the endometrial biopsy.
P. Mcdonough (2004)
Objectivity and stability of the Preschool Imitation and Praxis Scale.
M. Vanvuchelen (2011)
Test–retest reliability of single and paired pulse transcranial magnetic stimulation parameters in healthy subjects
A. Hermsen (2016)
Original article/Article originalMultichannel recording of median nerve somatosensory evoked potentialsEnregistrement multicanal des PES du nerf médian
W. Wassenberg (2008)
Regional activation of the human medial temporal lobe during intentional encoding of objects and positions
Thomas Z. Ramsøy (2009)
Test–retest reliability of resting-state connectivity network characteristics using fMRI and graph theoretical measures
U. Braun (2012)
Clinical Translation of Diffusion Cardiac Magnetic Resonance Imaging: Motion Robust In Vivo Characterization of
Myocardial Tissue Microstructure (2015)
Contrast-free detection of myocardial fibrosis in hypertrophic cardiomyopathy patients with diffusion-weighted cardiovascular magnetic resonance
C. Nguyen (2015)
GH deficiency in patients with spinal cord injury: efficacy/safety of GH replacement, a pilot study
G. Cuatrecasas (2018)
Impaired nasal patency and sleep disturbances - prevalence, quality of life, and treatment
Maria Värendh (2018)
Comparison between 2D and 3D high‐resolution black‐blood techniques for carotid artery wall imaging in clinically significant atherosclerosis
N. Balu (2008)
Measuring morbidity following major surgery
M. P. Grocott (2010)
Statistical strategies to assess reliability in ophthalmology
N. Patton (2006)See more