The Journal of Bone and Joint Surgery 81:1209-16 (1999)
© 1999 The Journal of Bone and Joint Surgery, Inc.
Stulberg Classification System for Evaluation of Legg-Calvé-Perthes Disease: Intra-Rater and Inter-Rater Reliability*
JEROEN G. NEYT, M.D. ,
STUART L. WEINSTEIN, M.D. ,
KEVIN F. SPRATT, PH.D. ,
LORI DOLAN, R.N., M.A. ,
JOSÉ MORCUENDE, M.D., PH.D. ,
FREDERICK R. DIETZ, M.D. ,
GREG GUYTON, M.D. ,
ROBERT HART, M.D. ,
MICHELLE STEVENS KRAUT, M.D. ,
GREGORY LERVICK, M.D. ,
PETER PARDUBSKY, M.D. and
ANDREA SATERBAK, M.D. , IOWA CITY, IOWA
Investigation performed at the University of Iowa Hospitals and Clinics, Iowa City
Background: Researchers and clinicians commonly use the classification system of Stulberg et al. as a basis for treatment decisions during the active phase of Legg-Calvé-Perthes disease because of its putative utility as a predictor of long-term outcome. It is generally assumed that this system has an acceptable degree of reliability. This assumption, however, is not convincingly supported by the literature.
Methods: The purpose of the present study was to assess the inter-rater and intra-rater reliability of the classification system of Stulberg et al. with use of a pre-test, post-test design. During the pre-test phase, nine raters independently used the system to evaluate the radiographs of skeletally mature patients who had been managed for Legg-Calvé-Perthes disease. The intervention between the pre-test and post-test phases consisted of a consensus-building session during which all raters jointly arrived at standardized definitions of the various joint structures that are assessed with use of the classification system. The effect of these definitions on reliability then was assessed by reevaluating the radiographs during the post-test phase.
Results: The pre-test intra-rater reliability coefficients ranged from 0.709 to 0.915, and the post-test coefficients ranged from 0.568 to 0.874. The pre-test inter-rater reliability coefficients ranged from 0.603 to 0.732, and the post-test coefficients ranged from 0.648 to 0.744. Contributing to the variance was a lack of agreement concerning the assessment of joint structures and the way in which the raters translated these evaluations into a classification according to the system of Stulberg et al.
Conclusions: Although intra-rater reliability was marginally acceptable, the degree of variability between the classifications assigned by different raterseven after the interventioncalls into question the reliability of the system of Stulberg et al.; consequently, the validity of any treatment decisions, outcome evaluations, or epidemiological studies based on this system is also in question.

CiteULike Connotea Del.icio.us Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
M. Ramachandran, K. Ward, R. R. Brown, C. F. Munns, C. T. Cowell, and D. G. Little
Intravenous Bisphosphonate Therapy for Traumatic Osteonecrosis of the Femoral Head in Adolescents
J. Bone Joint Surg. Am.,
August 1, 2007;
89(8):
1727 - 1734.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Barker, J. Anderson, R. Chesnut, G. Nesbit, T. Tjauw, and R. Hart
Reliability and Reproducibility of Dens Fracture Classification with Use of Plain Radiography and Reformatted Computer-Aided Tomography
J. Bone Joint Surg. Am.,
January 1, 2006;
88(1):
106 - 112.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. A. Herring, H. T. Kim, and R. Browne
Legg-Calve-Perthes Disease. Part I: Classification of Radiographs with Use of the Modified Lateral Pillar and Stulberg Classifications
J. Bone Joint Surg. Am.,
October 1, 2004;
86(10):
2103 - 2120.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. S. Bae, P. M. Waters, and D. Zurakowski
Reliability of Three Classification Systems Measuring Active Motion in Brachial Plexus Birth Palsy
J. Bone Joint Surg. Am.,
September 1, 2003;
85(9):
1733 - 1738.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. J. Berst, L. Dolan, M. M. Bogdanowicz, M. A. Stevens, S. Chow, and E. A. Brandser
Effect of Knowledge of Chronologic Age on the Variability of Pediatric Bone Age Determined Using the Greulich and Pyle Standards
Am. J. Roentgenol.,
February 1, 2001;
176(2):
507 - 510.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|