Perspectives
J Sim, PhD, is Professor, Primary Care Sciences Research Centre, Keele University, Keele, Staffordshire ST5 5BG, United Kingdom (j.sim@keele.ac.uk).
CC Wright, BSc, is Principal Lecturer, School of Health and Social Sciences, Coventry University, Coventry, United Kingdom.
Address all correspondence to Dr Sim.
Purpose. This article examines and illustrates the use and interpretation of the kappa statistic in musculoskeletal research.

Summary of Key Points. The reliability of clinicians' ratings is an important consideration in areas such as diagnosis and the interpretation of examination findings. Often, these ratings lie on a nominal or an ordinal scale. For such data, the kappa coefficient is an appropriate measure of reliability. Kappa is defined, in both weighted and unweighted forms, and its use is illustrated with examples from musculoskeletal research. Factors that can influence the magnitude of kappa (prevalence, bias, and nonindependent ratings) are discussed, and ways of evaluating the magnitude of an obtained kappa are considered. The issue of statistical testing of kappa is considered, including the use of confidence intervals, and appropriate sample sizes for reliability studies using kappa are tabulated.

Conclusions. The article concludes with recommendations for the use and interpretation of kappa.
Key Words: Kappa, Measurement, Reliability, Sample size.
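
The abstract refers to kappa in unweighted and weighted forms without giving the formulas. As a minimal sketch, assuming the standard Cohen definition kappa = (Po - Pe) / (1 - Pe), where Po is the observed (weighted) agreement and Pe is the agreement expected by chance from the two raters' marginal totals, the calculation could look as follows. The function name `cohen_kappa`, the linear and quadratic weighting schemes, and the ratings are illustrative assumptions, not taken from the article.

```python
def cohen_kappa(ratings_a, ratings_b, categories, weights=None):
    """Cohen's kappa for two raters rating the same subjects.

    weights=None        -> unweighted kappa (nominal categories)
    weights="linear"    -> weighted kappa, linear agreement weights
    weights="quadratic" -> weighted kappa, quadratic agreement weights
    For weighted kappa, `categories` must be listed in ordinal order
    and contain at least two categories (assumption of this sketch).
    """
    if len(ratings_a) != len(ratings_b):
        raise ValueError("both raters must rate the same subjects")
    n = len(ratings_a)
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}

    # Cross-tabulate the paired ratings: counts[i][j] is the number of
    # subjects placed in category i by rater A and category j by rater B.
    counts = [[0] * k for _ in range(k)]
    for a, b in zip(ratings_a, ratings_b):
        counts[index[a]][index[b]] += 1

    row = [sum(counts[i]) for i in range(k)]                       # rater A marginals
    col = [sum(counts[i][j] for i in range(k)) for j in range(k)]  # rater B marginals

    def agreement_weight(i, j):
        # 1 for exact agreement; partial credit for near misses when weighted.
        if weights is None:
            return 1.0 if i == j else 0.0
        d = abs(i - j) / (k - 1)
        return 1.0 - d if weights == "linear" else 1.0 - d * d

    cells = [(i, j) for i in range(k) for j in range(k)]
    p_obs = sum(agreement_weight(i, j) * counts[i][j] for i, j in cells) / n
    p_exp = sum(agreement_weight(i, j) * row[i] * col[j] for i, j in cells) / (n * n)
    # Undefined (division by zero) when chance agreement is already perfect.
    return (p_obs - p_exp) / (1.0 - p_exp)


# Hypothetical data: two clinicians judge 10 patients as positive (1) or negative (0).
rater_a = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
rater_b = [1, 1, 1, 1, 0, 0, 0, 0, 0, 1]
kappa = cohen_kappa(rater_a, rater_b, categories=[0, 1])
print(round(kappa, 3))  # observed agreement 0.8, chance agreement 0.5, so kappa is about 0.6
```

With only two categories the weighted and unweighted forms coincide, because there are no "near miss" cells to receive partial credit. The weighted form matters only for ordinal scales with three or more categories.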