
The Case Against Statistical Significance Testing

R. P. Carver
Published 1978 · Psychology

In recent years the use of traditional statistical methods in educational research has increasingly come under attack. In this article, Ronald P. Carver exposes the fantasies often entertained by researchers about the meaning of statistical significance. The author recommends abandoning all statistical significance testing and suggests other ways of evaluating research results. Carver concludes that we should return to the scientific method of examining data and replicating results rather than relying on statistical significance testing to provide equivalent information.
This paper references
10.2307/2529115
Statistical Power Analysis for the Behavioral Sciences
J. Cohen (1969)
An analysis of the use of statistical testing in communication research
J. Katzer (1973)
10.1037/006641
Subjectivist Statistics for the Current Crisis.
K. Wilson (1961)
10.1037/H0031324
Issues in psychophysical measurement.
S. Stevens (1971)
Issues and dilemmas in developing training programs for educational researchers
S. Elam (1965)
10.2466/pr0.1962.11.3.639
The Difference between Statistical Hypotheses and Scientific Hypotheses
R. Bolles (1962)
10.2307/1169648
Hypothesis Testing in Relation to Statistical Methodology
Cherry Ann Clark (1963)
A case against the normal use of inferential statistical models in educational research
W. Coats
10.1037/H0042040
The fallacy of the null-hypothesis significance test.
W. W. Rozeboom (1960)
10.1177/001316446002000401
The Place of Statistics in Psychology
J. D. Nunnally (1960)
A Textbook of Psychology
W. Kessen (1958)
10.3102/00346543040003371
Reconstruction of Educational Research
L. S. Shulman (1966)
10.1037/H0020412
The test of significance in psychological research.
D. Bakan (1966)
10.1097/00006324-197308000-00018
Textbook Of Psychology
D. Hebb (1958)
10.1037/H0076829
Beyond the Two Disciplines of Scientific Psychology.
L. Cronbach (1975)
10.1214/AOMS/1177704711
The Future of Data Analysis
J. W. Tukey (1962)
10.2307/2089381
Some Statistical Problems in Research Design
L. Kish (1959)
Psychological testing, 4th ed.
A. Anastasi (1976)
10.4324/9781315134918-26
Significance tests reconsidered.
D. E. Morrison (1969)
Significance tests in behavioral research : Skeptical conclusions and beyond
D. F. Morrison
10.1007/BF03394413
Statistical Power: Derivation, Development, and Data-Analytic Implications
L. J. Chase (1976)
10.1037/H0029774
Replication studies: A neglected aspect of psychological research.
Nathaniel C. Smith (1970)
10.3102/00346543033005455
Chapter I: Hypothesis Testing in Relation to Statistical Methodology
Cherry Ann Clark (1963)
10.1111/J.1460-2466.1973.TB00947.X
An Analysis of the Use of Statistical Testing in Communication Research
J. Katzer (1973)
Experimental and Quasi-Experimental Designs for Research
D. Campbell (1963)
10.1037/H0041052
The statistical concepts of confidence and significance.
R. E. Chandler (1957)
10.1037/H0036782
Two Dimensions of Tests: Psychometric and Edumetric.
R. P. Carver (1974)
10.2307/1419766
Tactics of scientific research : evaluating experimental data in psychology
H. N. Peters (1961)
10.2307/748778
Aptitudes and instructional methods: A handbook for research on interactions
L. Cronbach (1977)
10.1037/H0026141
Statistical significance in psychological research.
D. Lykken (1968)
10.2466/pr0.1964.14.1.219
Generalizing to a Language Population
E. B. Coleman (1964)
10.1126/SCIENCE.161.3844.849
Measurement, Statistics, and the Schemapiric View
S. S. Stevens (1968)



This paper is referenced by
10.1177/0959354307086923
Why P Values Are Not a Useful Measure of Evidence in Statistical Significance Testing
R. Hubbard (2008)
10.1017/S0140525X00054662
Anthropology and psi
K. L. Feder (1987)
10.1017/S0140525X00054741
Parapsychology: The science of ostensible anomalies
R. Hyman (1987)
Leven in onzekerheid en eenvoud
H. Kiers (2001)
10.1016/S0031-9406(10)60617-7
Some Misconceptions About Single Subject Designs in Physiotherapy
V. J. Robertson (1994)
10.1017/S0140525X00054595
Why parapsychology cannot become a science
M. Bunge (1987)
10.2307/3803159
THE ROLE OF HYPOTHESIS TESTING IN WILDLIFE SCIENCE
D. Johnson (2002)
Perplexities in Discrimination of Attention Deficit Hyperactivity Disorder (ADHD): Specific Behaviors that may hold some Answers
J. Harrison (2009)
10.1080/00220973.1993.10806591
The Case Against Statistical Significance Testing, Revisited
R. P. Carver (1993)
10.1007/BF03217251
Response to Menon
G. Rowley (1993)
10.1177/109442819924004
Ν2: A Proportional Variance-Accounted for Index for Some Cross-Level and Person-Situation Research Designs
C. Lance (1999)
The association between therapists' attachment security and mentalizing capacity
K. N. Wyatt-Brooks (2013)
Investigating the hypothesized factor structure of the Noel-Levitz Student Satisfaction Inventory: A study of the student satisfaction construct.
Leslie R. Odom (2008)
10.1198/tast.2009.08205
Could It Be Better to Discard 90% of the Data? A Statistical Paradox
T. Stanley (2009)
10.1016/j.neuroimage.2004.07.022
Optimizing the fMRI data-processing pipeline using prediction and reproducibility performance metrics: I. A preliminary group analysis
S. Strother (2004)
10.1007/s11136-013-0437-2
Significance, truth and proof of p values: reminders about common misconceptions regarding null hypothesis significance testing
M. G. Verdam (2013)
Common Methodology Mistakes in Dissertations: Improving Dissertation Quality.
B. Thompson (1988)
Choices of Methodology for Cooperative Education Researchers.
R. Coll (2000)
10.1177/0959354398083001
Statistical Significance and Replicability
David Sohn (1998)
10.1086/506236
Replicating Empirical Research In Behavioral Ecology: How And Why It Should Be Done But Rarely Ever Is
C. Kelly (2006)
10.1038/nrn3475-c2
Experimental power comes from powerful theories — the real problem in null hypothesis testing
J. Ashton (2013)
10.1007/978-3-319-28770-6_10
Randomized Block Designs: Ordinal Data
K. Berry (2016)
10.1007/978-3-319-28770-6_8
Randomized Block Data
K. Berry (2016)
10.1016/J.SOCEC.2004.09.024
Size Matters: The Standard Error of Regressions in the American Economic Review
S. Ziliak (2004)
10.15760/ETD.384
Prestructuring multilayer perceptrons based on information-theoretic modeling of a partido-alto-based grammar for afro-brazilian music: enhanced generalization and principles of parsimony, including an investigation of statistical paradigms
G. Lendaris (2011)
10.1007/978-1-4020-6230-8_31
Quantitative Research Synthesis: The Use of Meta-Analysis in Career Guidance and Vocational Psychology
Paul A. Gore (2008)
Bayesian Analysis of Parental Drinking Motives and Children's Adjustment
Aaron A. Duke (2013)
10.1080/00933104.1980.10506078
Populations, Samples, Randomness, and Replication in Two Social Studies Journals
J. P. Shaver (1980)
10.1891/088970107805059805
Conceptual and Practical Implications for Rehabilitation Research: Effect Size Estimates, Confidence Intervals, and Power
J. M. Ferrin (2007)
10.22237/JMASM/1067645940
Deconstructing arguments from the case against hypothesis testing
S. Sawilowsky (2003)
10.1080/10862969209547771
The Three Factors in Reading Ability: Reanalysis of a Study by Cunningham, Stanovich, and Wilson
R. P. Carver (1992)
Recent Literature on Whether Statistical Significance Tests
James Deegear (2012)