The kappa statistic in reliability studies: use, interpretation, and sample size requirements. Physical Therapy 85:257-268

Author(s): Sim J, Wright CC


Purpose: This article examines and illustrates the use and interpretation of the kappa statistic in musculoskeletal research.

Summary of key points: The reliability of clinicians' ratings is an important consideration in areas such as diagnosis and the interpretation of examination findings. Often, these ratings lie on a nominal or an ordinal scale. For such data, the kappa coefficient is an appropriate measure of reliability. Kappa is defined, in both weighted and unweighted forms, and its use is illustrated with examples from musculoskeletal research. Factors that can influence the magnitude of kappa (prevalence, bias, and non-independent ratings) are discussed, and ways of evaluating the magnitude of an obtained kappa are considered. The issue of statistical testing of kappa is considered, including the use of confidence intervals, and appropriate sample sizes for reliability studies using kappa are tabulated.
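As a rough illustration of the unweighted and weighted forms mentioned above, the following minimal sketch computes Cohen's kappa from a square agreement table for two raters. It is not taken from the article: the function name `cohen_kappa`, the choice of linear weights, and the example counts are all assumptions made for illustration.

```python
from typing import Sequence


def cohen_kappa(table: Sequence[Sequence[float]], weighted: bool = False) -> float:
    """Cohen's kappa for a square table of counts (rater A rows, rater B columns).

    With weighted=True, linear agreement weights w_ij = 1 - |i - j| / (k - 1)
    are used, which assumes the k categories are ordered (an illustrative
    choice; other weighting schemes exist).
    """
    k = len(table)
    if k < 2 or any(len(row) != k for row in table):
        raise ValueError("table must be square with at least two categories")
    n = sum(sum(row) for row in table)
    if n <= 0:
        raise ValueError("table must contain at least one rating")

    # Marginal proportions for each rater.
    row_prop = [sum(row) / n for row in table]
    col_prop = [sum(table[i][j] for i in range(k)) / n for j in range(k)]

    def weight(i: int, j: int) -> float:
        if weighted:
            return 1.0 - abs(i - j) / (k - 1)
        return 1.0 if i == j else 0.0

    # Observed and chance-expected (weighted) agreement.
    p_obs = sum(weight(i, j) * table[i][j] / n for i in range(k) for j in range(k))
    p_exp = sum(weight(i, j) * row_prop[i] * col_prop[j]
                for i in range(k) for j in range(k))
    if p_exp == 1.0:
        raise ValueError("kappa is undefined when expected agreement equals 1")
    return (p_obs - p_exp) / (1.0 - p_exp)


# Hypothetical counts: two clinicians rating 50 patients as positive/negative.
example = [[20, 5],
           [10, 15]]
kappa = cohen_kappa(example)
```

For these hypothetical counts, observed agreement is 0.70 and chance-expected agreement is 0.50, giving a kappa of about 0.40. With only two categories, the linear-weighted form reduces to the unweighted one.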

Conclusions: The article concludes with recommendations for the use and interpretation of kappa.
