This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF) Free
Right arrow Letters to the Editor: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Letters to the Editor are posted
Right arrow Alert me if a correction is posted
Services
Right arrow E-mail this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My File Cabinet
Right arrow Download to citation manager
Right arrow Rights and Permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by KELLER, R. B.
Right arrow Articles by DEYO, R. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by KELLER, R. B.
Right arrow Articles by DEYO, R. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Facebook   Add to Technorati   Add to Twitter  
What's this?
The Journal of Bone and Joint Surgery 81:752-62 (1999)
© 1999 The Journal of Bone and Joint Surgery, Inc.

Relationship Between Rates and Outcomes of Operative Treatment for Lumbar Disc Herniation and Spinal Stenosis

ROBERT B. KELLER, M.D.{dagger}, STEVEN J. ATLAS, M.D., M.P.H.{ddagger}, DAVID N. SOULE, B.A.§, DANIEL E. SINGER, M.D.{dagger} and RICHARD A. DEYO, M.D., M.P.H.#, MANCHESTER, MAINE

Investigation performed at Maine Medical Assessment Foundation, Manchester


    Abstract
 Top
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Background: Population-based variations in rates of operations for the treatment of lumbar disc herniation and spinal stenosis are well known. This variability may occur in part because of differences in the threshold at which physicians recommend an operation, reflecting uncertainty about the optimum use of an operative procedure. To the best of our knowledge, no previous reports have indicated whether differences in population-based rates of operative treatment are associated with patient outcomes. Methods: The Maine Lumbar Spine Study is an ongoing prospective study of 655 patients who had a herniated lumbar disc or spinal stenosis. The patients were enrolled by their physicians, who provided baseline demographic and treatment-related data. The patients completed baseline and follow-up questionnaires that focused on symptoms, function, satisfaction, and quality of life. Small-area variation analysis was used to develop three distinct so-called spine service areas in Maine. The outcomes (usually at four years; minimum, two years) were compared among these areas, in which a total of 250 patients had been managed operatively and had answered questionnaires. Results: Population-based rates of operative treatment derived from statewide data that had been collected over five years in the state of Maine ranged from 38 percent below to 72 percent above the average rate in the state (a greater than fourfold difference). The outcomes for the patients who had been managed by surgeons in the lowest-rate area were superior to those for the patients in the two higher-rate areas. Seventy-nine percent (fifty-seven) of seventy-two patients in the lowest-rate area had marked or complete relief of pain in the lower extremity compared with 60 percent (eighteen) of thirty patients in the highest-rate area. The improvements in the Roland disability score (p <= 0.01), quality of life (p <= 0.01), and satisfaction (p <= 0.05) were significantly greater among the patients in the lowest-rate area. The patients in the higher-rate areas generally had less severe symptoms and findings at baseline than those in the lowest-rate area did. Conclusions: Higher population-based rates of elective spinal operations may be associated with inferior outcomes. This variability is possibly related to differences in physicians' preferences with regard to recommending an operation and in their criteria for the selection of patients. Physicians cannot assume that their outcomes will be the same as those of others, and therefore they need to evaluate their own results.


    Introduction
 Top
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
Since publication of the original paper describing small-area variations by Wennberg and Gittelsohn15, many authors have confirmed their seminal findings that population-based rates of hospitalization and procedures vary widely by geographic region and cannot be fully explained by differences in demographic features6,13. Despite the large number of reports on this topic in the literature, variations in practice patterns remain a major feature of medical care in the United States16.

Simple counts of hospital admissions or procedures, or both, do not provide information about the rates of services among discrete populations. Small-area analysis provides a method with which to calculate population-based utilization rates. Variations in utilization of services may be quite appropriate. For example, if there are differences in the incidence of the disease or in environmental or genetic factors, requirements for medical services may differ from one geographic area to another. However, among similar populations, large-scale differences in the rates of services do not appear logical. Research has indicated that the practice styles and treatment preferences of individual physicians are major factors in these variations6.

Although there appears to be broad agreement that variations in utilization among similar populations are inappropriate, there is no empirical evidence to indicate which of the many different rates is the so-called right rate. Analysis of variations demonstrates the inconsistency of many aspects of medical practice but provides little evidence to assist physicians in determining the right rate.

A solid and consistent body of scientific literature could guide doctors and patients in the medical decision-making process. However, the clinical literature is replete with publications advocating a wide range of treatments and procedures, leaving clinicians free to choose the ones that they prefer. In addition, scientific analyses of the literature (meta-analyses) indicate weak scientific evidence for many conclusions and recommendations5,12.

Driven in part by variations in practice patterns, patient-oriented clinical outcomes research has gained importance in recent years. The goal is to conduct high-quality, methodologically rigorous, prospective clinical studies that focus on outcomes of care that matter to patients7. It is reasoned that, if valid information can be provided to physicians and their patients, more consistent treatment choices will be made, variations will diminish, and outcomes will improve.

To our knowledge, analyses of the relationship of the rates of operative procedures with the outcomes of those procedures have not been previously published. The goal of this study was to integrate the results of small-area utilization analyses in Maine with data from a prospective effectiveness study of the outcomes of treatment for herniated lumbar intervertebral discs and spinal stenosis. A number of studies have indicated a positive association between the volume of procedures performed by surgeons or within hospitals and various outcomes. Such an association between volume and outcome has been demonstrated in studies of cardiac operations, percutaneous coronary angioplasty, and total knee arthroplasty4,9,10. These analyses focused on the concept of practice makes perfect—that is, when complex procedures are performed more often by surgeons, the outcomes may be better. Our analysis focused on the rate of lumbar discectomy—that is, the number of procedures performed on the population of a defined geographic region in a given time-period. Within a given region, the volume of procedures performed by individual physicians or hospitals is independent of the population-based rate.


    Methods
 Top
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 

Small-Area Analysis
The methodology of small-area analysis has been previously described15. The present analysis focused on population-based utilization rates for lumbar discectomy from 1991 to 1995. For most medical services, the state of Maine is divided into thirty-two hospital service areas. These areas are based on geographically defined regions (zip codes of residence) that surround hospitals to which most of the residents are admitted. These patterns are based on the total number of admissions. In the case of certain more complex procedures, patients are not managed in their local hospital but are instead referred to institutions outside their hospital service area. In Maine, almost all operations on the spine are performed by neurosurgeons and spine-fellowship-trained orthopaedic surgeons. These physicians are located in five large communities in the state. Therefore, we created so-called spine service areas on the basis of procedures performed within these five communities. Maine's five spine service areas were defined in the same manner as typical hospital service areas are defined—that is, they were created by establishing the pattern of admissions as residents were admitted to hospitals in each of the areas.

This analysis resulted in spine service areas incorporating one to seventeen normal hospital service areas. One spine service area is made up of only its own hospital service area, whereas another area consists of sixteen hospital service areas in addition to the original hospital service area (Fig. 1). It is important to note that all procedures were allocated to the spine service area in which the patient resided, regardless of where the operation was performed. This process is necessary in order to create population-based rates of operations. In this study, all of the operations performed on residents of the spine service area were performed in institutions within that area.



View larger version (56K):
[in this window]
[in a new window]
 
Fig. 1. Map of the state of Maine, indicating five spine service areas. Each area is made up of one or more (range, one to seventeen) hospital service areas. Most patients residing in the hospital service areas receive an operation on the spine at one or more hospitals contained within the designated spine service area. The number of hospital service areas in each spine service area is indicated in parentheses.

 
Rates of lumbar discectomy and decompression laminectomy were calculated for each of the spine service areas. These were adjusted for age and gender with an indirect method. Three of the five spine service areas had a sufficient number of patients who were participants in the Maine Lumbar Spine Study to permit an outcome analysis (Fig. 2).



View larger version (23K):
[in this window]
[in a new window]
 
Fig. 2 The operative rates for three spine service areas in Maine are expressed as observed-to-expected ratios. The observed rate is the age and gender-adjusted rate for each spine service area. The expected rate is the statewide rate and equals 1.0. The observed-to-expected ratio in area A ranged from 16 to 38 percent lower than the state rate, that in area B ranged from 21 to 26 percent higher than the state rate, and that in area C ranged from 29 to 72 percent higher than the state rate. The rates in each area were relatively consistent over time, and most of the differences among the rates for the three areas in each year were significant (p < 0.001).

 

Outcomes Study
The details of the Maine Lumbar Spine Study have been described elsewhere2,3,8. Six hundred and fifty-five patients with sciatica due to a lumbar disc herniation or lumbar spinal stenosis that was initially managed operatively or nonoperatively were enrolled in this prospective, observational study. In order to restrict the study to patients for whom an operation would be elective and associated with an acceptable risk, patients were excluded if they had cauda equina syndrome, developmental spinal deformity, vertebral fracture, an infection or a tumor in the spine, inflammatory spondylopathy, a previous operation on the lumbar spine, or a severe comorbid condition or if they were pregnant. Of the 354 operatively managed patients, 279 (79 percent) could be assigned to one of the three spine service areas.

Patients were recruited as part of normal practice, and they were followed prospectively with standardized questionnaires. Baseline information that was supplied by physicians included demographic data, details of physical examination and imaging findings, and clinical impressions. Details of the operative procedure and any complications have been reported separately3,8. When possible, imaging studies were obtained and subjected to blinded interpretation by a neuroradiologist8. Initial interviews with the patients were conducted at the patients' homes by trained interviewers. Because the patients who were enrolled in the study represented only a portion of the patients who were managed by the participating physicians, a random sample of patients who were not enrolled were sent a follow-up questionnaire in order to compare their baseline characteristics and outcomes with those of the enrolled patients8.

Follow-up questionnaires were mailed to the patients at three, six, and twelve months and annually thereafter. Of the 279 operatively managed patients who were assigned to one of the three spine service areas, 250 (90 percent) completed at least one follow-up questionnaire between two and four years, and the data from those questionnaires are included in this analysis. When more than one follow-up questionnaire had been completed, the data from the most recent survey were used.

The baseline characteristics and features of the patients included demographic information, work and disability status, previous episodes of back pain, other comorbid conditions, symptoms, and functional status2. The frequency of pain in the low back and the lower extremity (with a score of 0 points indicating "not at all" and a score of 6 points indicating "always") and the extent to which these symptoms were bothersome (with a score of 0 points indicating "not bothersome" and a score of 6 points indicating "extremely bothersome") were assessed at baseline. The pain in the low back and the lower extremity at baseline was categorized as severe if the patient reported that it was "usually" to "always" present and was "extremely bothersome" (a score of 5 or 6 points). A symptom frequency index ranging from 0 to 24 points was created by summing the responses to four questions regarding the lower extremity: pain in the lower extremity; numbness or tingling in the lower extremity, foot, or groin; weakness in the lower extremity or foot; and pain in the back or lower extremity while sitting (or while walking, for patients who had stenosis)11. For all scales, higher scores indicated more severe symptoms.

Baseline measures of functional status included the back-specific modified Roland disability scale11 and the generic Medical Outcomes Study (MOS) 36-Item Short-Form Health Survey (SF-36)14. The modified Roland scores range from 0 to 23 points, reflecting a simple sum of items checked as present2. Four of eight domains of health included in the SF-36 are reported: physical function, bodily pain, and mental health and general health perceptions14. Each domain is scored from 0 (poor health) to 100 points (optimum health). Higher SF-36 scores reflect better function, but higher Roland scores reflect worse function.

Outcome measures emphasized a spectrum of patient-reported health-related quality of life, including symptoms (pain), functional status (physical, general health, employment, and disability), and satisfaction2,11. These outcome measures have been previously validated11, and many have been incorporated into the current MODEMS (Musculoskeletal Outcomes Data Evaluation and Management Systems) questionnaire1 of the American Academy of Orthopaedic Surgeons, the North American Spine Society, and the Council of Musculoskeletal Specialties. Pain in the low back and the lower extremity was considered to be decreased if the patient reported that the symptom was "better" to "completely gone" compared with the baseline status. Changes in the symptom frequency index and the modified Roland scores were calculated by subtracting the baseline score from the score at the time of follow-up. General health status was assessed with use of the SF-36. Work and disability status was ascertained at each follow-up evaluation.

In this analysis, patients who had operative management for a lumbar disc herniation or spinal stenosis were allocated to the appropriate spine service area. Our hypothesis was that outcomes would vary according to the population-based rate of operative treatment in the spine service area and that higher rates of operative treatment would result in less optimum outcomes. To assess for differences in baseline and outcome variables among the spine service areas—which were ranked as low-rate (area A), middle-rate (area B), and high-rate (area C)—the Mantel-Haenszel chi-square test of trend was used for dichotomous variables, with linear regression models for continuous variables. Additional analyses with the Cochran-Mantel-Haenszel test of trend were performed to examine whether outcomes across the spine service areas were different among specific subgroups. The subgroups were based on diagnosis (sciatica or stenosis), baseline disability compensation status (receiving or not receiving compensation), and duration of follow-up (four years compared with two or three years). The patients who were receiving disability compensation at baseline had worse outcomes regardless of the spine service area (data not shown). However, the relationships among outcome variables and spine service area were not different for any subgroup that was examined and, thus, only the overall results for each spine service area are presented.

Independent predictors of the patients' satisfaction with their current state at the time of follow-up were examined with use of multiple logistic regression analysis. The patients who had a score of 1 point ("delighted"), 2 points ("pleased"), or 3 points ("mostly satisfied") on a 7-point scale, with "delighted" and "terrible" as the two end points, were classified as satisfied. The models included spine service area and other baseline characteristics that were potentially associated with the satisfaction of the patient (at p values of 0.20)—that is, education, disability compensation, smoking status, comorbid illnesses, duration of the current episode, severity of pain in the low back and lower extremity, and back-specific and generic measures of functional status. To account for associations among patients within each physician cluster, we used the generalized estimating equations methodology to estimate the covariance structure of the models17. All statistical analyses were performed with use of a commercial software package (Statistical Analysis System, SAS Institute, Cary, North Carolina) on a UNIX workstation (Sun Microsystems, Mountain View, California).


    Results
 Top
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 

Outcomes of the Maine Lumbar Spine Study
Analyses of the one-year outcomes of nonoperative and operative treatment of sciatica due to lumbar disc herniation and spinal stenosis have been published elsewhere2,3. They indicate that, for the entire group of patients in the Maine Lumbar Spine Society (including those in the present analysis), the outcomes of operative management were superior to those of nonoperative management. For example, 81 percent (170) of the 209 patients who had had a disc excision reported that pain in the lower extremity was much better or gone compared with 56 percent (ninety-two) of the 165 patients who had been managed nonoperatively for a herniated lumbar disc. Patients who had been operated on had an 11-point improvement in the Roland score compared with a 5-point improvement for the nonoperative group. The results were similar for the patients who had been managed for spinal stenosis. Seventy-eight percent (fifty-one) of the sixty-five operatively managed patients compared with 45 percent (twenty-two) of the forty-nine nonoperatively managed patients had a marked reduction in pain in the lower extremity. Sixty-nine percent (forty-five) of the sixty-five operatively managed patients compared with 36 percent (fourteen) of thirty-nine nonoperatively managed patients reported that their overall result was very good or excellent.

Small-Area Analysis
The three spine service areas had significantly different rates of operative treatment (p < 0.001) (Fig. 2). The rate in area A was 16 to 38 percent (mean, 28 percent) less than the mean in the state and remained consistently so. Area B had a stable rate of discectomy of 21 to 26 percent (mean, 24 percent) above the state average, whereas the rate in area C was more variable but was always the highest in the state—ranging from 29 to 72 percent (mean, 53 percent) above the state mean over the five years that were studied.

Relationship Between Patient Outcomes and Operative Rates
Baseline and follow-up data were available for seventy-nine operatively managed patients from area A, for 136 from area B, and for thirty-five from area C (Table I). Although area C had a high operative rate, its population was considerably smaller than those of the other two areas. Seventy-eight percent (196) of the 250 patients had the operation for treatment of herniation of a lumbar disc, and the percentages were similar among the three spine service areas. Most of the patients completed the most recent follow-up evaluation at forty-eight months, and the percentages that did so (91 percent [seventy-two patients], 90 percent [123 patients], and 86 percent [thirty patients]; weighted average, 90 percent) were similar among the three spine service areas. Eleven of the thirteen participating physicians were neurosurgeons, and the physicians enrolled one to fifty-four patients (median, nineteen patients) in the study. All of the spine surgeons in areas A and C and seven of the eight surgeons in area B participated in the study.


View this table:
[in this window]
[in a new window]
 
TABLE I CHARACTERISTICS OF THE PATIENTS AND PARTICIPATING PHYSICIANS

 
The patients enrolled in the study represented only a portion of all of the patients who were managed by the participating physicians. We previously reported that 22 percent (655) of 2977 eligible patients who had an operation were referred to the study8. The rates of referral in the three spine service areas were comparable with the overall referral rate. Referred and nonreferred patients had similar demographic and clinical characteristics.

The physicians were generally optimistic about the expected outcome of the procedure, although the physicians in area A were less likely than the physicians in the other areas to believe that the patients' quality of life would be greatly improved (p <= 0.017). The nature of the operative procedure and complications was similar across the spine service areas (data not shown)2,3. Most patients with a disc herniation had a standard discectomy, whereas patients with stenosis had a decompression laminectomy without spinal arthrodesis; no patient in the series had a concomitant arthrodesis.

Because different population-based rates of operative treatment imply that some patients who were managed operatively in one area may have been managed nonoperatively in another, we examined whether there were differences in baseline characteristics or outcomes among those managed nonoperatively in the different spine service areas. One hundred and twenty-four patients (thirty-six in area A, seventy in area B, and eighteen in area C) were managed nonoperatively. Regardless of the spine service area, the nonoperatively managed patients had less severe baseline symptoms and functional impairment than did the operatively managed patients, and the outcomes of the nonoperative treatment did not differ among the spine service areas (data not shown).

Patient Characteristics and Clinical Features at Baseline Evaluation
Baseline patient characteristics, including age, gender, level of education, tobacco use, and employment status, were similar across all areas (Table II). The patients in area C were somewhat more likely to have been receiving disability compensation and to have retained an attorney. The percentages of patients who had previous episodes of back pain and other comorbid illnesses were similar across the spine service areas.


View this table:
[in this window]
[in a new window]
 
TABLE II CHARACTERISTICS OF THE PATIENTS AND HISTORY AT THE BASELINE EVALUATION

 
There were a number of differences among the spine service areas with regard to the characteristics of the lumbar conditions at baseline (Table III). Among the patients who had sciatica, those in area C were somewhat less likely to have a positive straight-leg-raising test (p = 0.06). However, among all patients, those in area C were somewhat more likely to have other abnormal physical findings. The physicians in area C reported more abnormal motor, sensory, and reflex findings, with area B having the least and area A having an intermediate amount of such reported findings. In contrast to the findings on physical examination, the severity of the abnormalities as assessed by the independent examination of the advanced imaging studies was the least for the patients in area C and the greatest for the patients in area A (p <= 0.01).


View this table:
[in this window]
[in a new window]
 
TABLE III FEATURES OF THE BACK DISORDERS AT THE BASELINE EVALUATION

 
There were some differences in patient-reported symptoms across the three service areas. The percentage of patients who reported current symptoms that had lasted for less than six months decreased from 72 percent (fifty-seven of seventy-nine) in area A to 61 percent (eighty-two of 134) in area B to 49 percent (seventeen of thirty-five) in area C (p <= 0.05). The percentage of patients who had severe low-back pain was highest in area C and lowest in area A (p <= 0.1). In contrast, the percentages of those who had severe pain in the lower extremity and those who had unilateral symptoms in the lower extremity were highest in area A and lowest in area C (p <= 0.1). The percentages of patients who reported the symptoms in area B were generally intermediate between the percentages in the other two areas. With the numbers available, we were unable to detect significant differences in the symptom frequency index or in other measures of functional status, including the back-specific modified Roland disability scale and the generic SF-36 domains of physical function, bodily pain, and mental and general health perceptions (p > 0.1 for all except mental health perception, for which p = 0.08).

Patient-Reported Outcomes at the Time of Follow-up
Significant differences (p <= 0.05 to p <= 0.01) in patient-reported outcomes at the time of follow-up (usually at four years; minimum, two years) were observed among the three spine service areas (Table IV). In general, the patients who had had an operation in the area with the highest utilization rate (area C) had outcomes that were inferior to those for the patients in the area with the lowest utilization rate (area A). The patients in the middle-rate area (area B) had results that were intermediate between those of the other two areas for most of the outcomes that were assessed.


View this table:
[in this window]
[in a new window]
 
TABLE IV PATIENT-REPORTED OUTCOMES AT THE TIME OF FOLLOW-UP

 
There was a significantly greater decrease in the frequency of symptoms in the lower extremity (as measured by the symptom frequency index) between the baseline and follow-up evaluations in area A, the area with the lowest utilization rate (p <= 0.05). Although relief of pain in the low back and the lower extremity and relief of the predominant symptom (pain in either the low back or the lower extremity) were not significantly different among the spine service areas, the trends for these variables also favored the area with the lowest utilization rate. For example, pain in the lower extremity was rated as much better by 79 percent (fifty-seven) of the seventy-two patients in area A, 74 percent (ninety-five) of the 128 in area B, and 60 percent (eighteen) of the thirty in area C (p = 0.06). In addition, the back-specific functional status as assessed with the Roland scale improved significantly more in area A compared with area C (p <= 0.01). The general health status as assessed with the SF-36 was similar across the three spine service areas.

Patient-reported satisfaction with outcome was assessed with three questions regarding improvement in quality of life, the patients' satisfaction with their current state, and whether they would choose the operation again. Regardless of the specific question, the patients in the area with the lowest utilization rate (area A) were most satisfied with their care, the patients in the highest-rate area (area C) were least satisfied, and the patients in area B had a rate of satisfaction that was between the rates in the two other areas. Overall, 72 percent (fifty-seven) of the seventy-nine patients in area A were satisfied with their current state compared with 63 percent (eighty-five) of the 136 patients in area B and 49 percent (seventeen) of the thirty-five patients in area C (p <= 0.01). Finally, employment and disability outcomes were similar across the three spine service areas, although the patients in area A were slightly less likely to report that they were receiving any disability compensation (p = 0.15).

As noted previously, the patients who were referred to the study represented only a portion of the potentially eligible patients and a survey of nonreferred patients was performed. Overall, with the numbers available, no significant differences in outcomes could be detected between the referred and nonreferred patients or among the nonreferred patients in the three spine service areas (data not shown)2.

Independent Baseline Predictors of Patient Satisfaction at the Time of Follow-up
Multivariate analyses in which the spine service area and other baseline variables were examined revealed that the likelihood of patients being satisfied with their current state at the time of follow-up (usually at four years) was independently related to whether they had graduated from college, were receiving disability compensation, and had higher levels of physical function (Table V). Having graduated from college increased the odds of the patients being satisfied with their current state about threefold, whereas receiving any disability compensation decreased the odds about threefold. For each 5-point increase in the SF-36 physical function score at baseline, the odds of the patients being satisfied increased 10 percent. After adjustment for these variables, an operation in spine service area A or B increased the odds of the patients being satisfied at four years 2.7-fold (95 percent confidence interval, 1.6 to 4.7) or 1.6-fold (95 percent confidence interval, 1.0 to 2.6), respectively, compared with an operation in area C. These results were similar to the unadjusted relative odds of the patients being satisfied if the operation was in spine service area A (2.7; 95 percent confidence interval, 1.2 to 6.3) or area B (1.8; 95 percent confidence interval, 0.8 to 3.8) compared with area C (derived from data provided in Table IV).


View this table:
[in this window]
[in a new window]
 
TABLE V FEATURES AT BASELINE EVALUATION THAT WERE INDEPENDENT PREDICTORS OF PATIENT SATISFACTION AT THE TIME OF FOLLOW-UP*

 


    Discussion
 Top
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 
To our knowledge, the present analysis represents the first attempt to associate differences in population-based rates of operative treatment with patient-reported outcomes of that treatment. We observed that better operative outcomes were associated with lower population-based rates of operative treatment.

It is noteworthy that, when the Maine Lumbar Spine Study is considered in its entirety, the outcomes of operations for lumbar disc herniation and lumbar spinal stenosis are very favorable. The patients who had an operation had had more severe symptoms and findings at the time of enrollment than those who were managed nonoperatively, and at the time of follow-up they had significantly better outcomes2,3. For example, compared with baseline values, pain in the lower extremity at one year had decreased in 81 percent (170) of 209 patients who had had a disc excision compared with 56 percent (ninety-two) of 165 patients who had been managed nonoperatively for lumbar disc herniation. The predominant symptom was much better or gone in 71 percent (148) of the 209 patients who had had a disc excision compared with 43 percent (seventy-two) of the 167 nonoperatively managed patients. These findings remained essentially the same at four years (unpublished data). The results similarly favored operative management for patients who had spinal stenosis. Thus, with the results viewed globally, an operation could be seen as the preferred method of management for patients who have severe sciatica due to herniation of a lumbar disc or those who have spinal stenosis.

However, not all of the patients who were managed operatively had the same outcome. Indeed, the outcomes of operative treatment were associated with the population-based rates of operative treatment. The patients from the area with the lowest rate of operative treatment (area A) had outcomes that were even better than those for the group as a whole, whereas patients from the highest-rate area (area C) had outcomes that were inferior to those of the entire group. The results in area B fell between those for the other two areas.

These findings may be partly explained by differences in the baseline findings among the three groups. Patients in the lowest-rate area reported symptoms that were of a shorter duration, were more likely to be unilateral, and were generally more severe in the lower extremity and less severe in the back; they also had more severe findings on independently evaluated imaging studies. In the highest-rate area, surgeons recorded that the straight-leg-raising test was positive in 59 percent (seventeen) of twenty-nine patients compared with 79 percent (131) of 166 patients in the other two areas.

Thus, although all patients met the same criteria for study eligibility, differences in baseline symptoms among the patients managed in the three spine service areas suggest that the superior outcomes in the lowest-rate area may be related to more stringent criteria for selection of patients for a procedure on the lumbar spine. We have previously shown that an operation has the least benefit in patients who have the lowest baseline severity of disease2. We have also demonstrated that patients who are receiving Workers' Compensation at baseline have inferior outcomes and are less likely to return to work2. We believe that the differences in the baseline characteristics of the patients in the three areas relate more to the decision-making patterns of the physicians in each of the areas than to differences in the socioeconomic status or employment patterns of the local populations.

There is no evidence that surgeons in any of the areas were operating on patients who did not have evidence of disc herniation or spinal stenosis. Rather, surgeons in the two higher-rate areas may have had a lower threshold for recommending an operation to their patients than did surgeons in the lowest-rate area. The stability of the area-based operative rates over time (Fig. 2) indicates that these local practice styles are entrenched and that physicians remain consistent with regard to how they make recommendations and manage patients. Surgeons who perform a high rate of operations do not believe that they are practicing inappropriately or differently from others, and they have no way of knowing that their practice patterns are quite different from those of colleagues in the same state.

Our results highlight several generic deficiencies in the manner in which physicians evaluate the outcomes of the patients whom they manage. First, physicians and surgeons are generally unaware of the rate of utilization in the area in which they practice because small-area variation analyses are not routinely available in most states and regions. Statewide hospital-discharge databases are required for this purpose. These are currently available in more than thirty states, but small-area analyses are infrequently performed. This problem has recently become exacerbated with the shift of many lumbar-disc procedures from the inpatient to the ambulatory setting, where utilization information is even less frequently collected and analyzed.

Second, physicians do not routinely receive information about the patient-oriented outcomes of the care that they provide. The analysis that we have presented indicates that the outcomes of operations on the lumbar spine vary significantly among patients residing in different spine service areas in Maine. Before presentation of the results of the current study, surgeons in each of the three areas believed that they operated on patients with similar indications and that they produced similar outcomes. All surgeons tend to function under these beliefs. They are similarly trained, are usually board-certified, and have strong beliefs in their own judgment and skills. In the absence of other information, it is not unreasonable that they would be confident that their diagnostic and operative skills result in a level of utilization and quality of care that are equal to those of their peers.

Our research indicates that individual physicians cannot assume that their treatment results will be the same as those of their peers or the same as the findings or recommendations published in the literature. Even the findings of a study in which they participated cannot be applied equally to all of the surgeons who enrolled patients.

Patients are not likely to be aware of or to appreciate the implication of small-area analysis and variation in population-based rates of operative treatment. The probability of having an operation is highly dependent on the practice style of the surgeons in the area in which the patient lives. This is because patients tend to follow the advice of their physician—a phenomenon described as delegated decision-making. In areas where surgeons are predisposed to recommend an operation more frequently, more patients will have an operation.

Outcomes research has the potential to provide information that will enable each patient to better understand the outcomes, risks, and benefits of an operation and other treatment. An operation for the treatment of lumbar disc herniation or spinal stenosis is almost always elective, as it is performed to decrease symptoms and improve function. Patients need to understand the risks and benefits of such an operation so that they can make better decisions about what works best for them. Many of them will elect to proceed with an operation and will do well, whereas others may choose watchful waiting and may have an equally good outcome.

The response of Maine's spine surgeons to the analyses that we have presented is illuminating. Rather than dispute the findings and implications of these data, the surgeons asked what they could do to rectify the problem. The Maine Medical Assessment Foundation has responded by continuing to conduct feedback of the information from small-area variation analyses. Additionally, most of the practices from all five spine service areas are participating in a voluntary outcomes data management program, directed by the Maine Medical Assessment Foundation. Patients who are evaluated because of back problems (lumbar and cervical) are enrolled prospectively. The MODEMS spine questionnaires are used in this project1. Individual doctors will be able to compare their outcomes with those of their colleagues and peers. As a result of these efforts, participating physicians have involved themselves in a continuous quality-improvement effort in which valid and useful data are collected, analyzed, and fed back to the involved doctors on a continuous basis.

Limitations
Because the Maine Lumbar Spine Study did not have narrowly restricted entry criteria, there are differences among enrolled patients with regard to the severity, type, and duration of underlying symptoms and the severity of physical and imaging findings. We have identified important differences in the baseline characteristics of the patients in the three defined spine service areas. Still, despite these baseline differences, the analysis indicates clearly that the outcomes in the area that had the lowest operative rate were preferable to those in the other two areas. The number of patients enrolled in the area with the highest operative rate (area C) was relatively small (thirty-five). Nonetheless, significant differences were observed for many outcome variables. Although not all patients completed the questionnaire at the forty-eight-month interval, 86 to 91 percent did respond at that time and there were no significant differences in the proportions across the three service areas or in the outcomes between the patients who responded at the shorter and longer intervals.

Our research did not lead us to conclude that high rates of lumbar disc excision for sciatica or laminectomy for spinal stenosis always produce outcomes that are inferior to those associated with lower rates. Although this was the case in the present study, additional research is needed to determine whether there is a generally optimum operative rate and whether this rate differs in various regions.

In conclusion, this community-based effectiveness outcome study demonstrated that higher population-based rates of lumbar discectomy and laminectomy were associated with inferior patient-oriented outcomes. Based on these findings, we make two recommendations. First, it is important that physicians be provided with accurate and timely comparative data on the population-based rates of the operative and medical treatments that they are using. Although small-area analysis does not establish the so-called right rate of medical care, it does provide useful information about variations in practice patterns. These studies can provide information about potential overutilization and underutilization and can provide a stimulus for further evaluation and analysis.

Second, surgeons and other physicians need to begin to collect information about the patient-oriented outcomes of the care that they provide. Only this level of rigorous self-analysis can provide data that will enable continuous improvement in quality. With high-quality outcomes data, doctors and their patients will be able to make better decisions about the best options for care. Outcomes information will allow surgeons to understand how their results compare with those of their colleagues and will give them the tools with which to improve when that is necessary.


    Footnotes
 
*No benefits in any form have been received or will be received from a commercial party related directly or indirectly to the subject of this article. Funds were received in total or partial support of the research or clinical study presented in this article. The funding sources were Grants HS 06344 and HS 08194 from the Back Pain Outcome Assessment Team of the Agency for Health Care Policy and Research.

{dagger}Maine Medical Assessment Foundation, P.O. Box 249, Manchester, Maine 04351-0249. The e-mail address for Dr. Keller is rbk@mmaf.org.

{ddagger}Medical Practices Evaluation Center and General Medicine Division, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts 02114.

§Deceased.

#Department of Medicine and Health Services, University of Washington, Seattle, Washington 98195.


    References
 Top
 Abstract
 Introduction
 Methods
 Results
 Discussion
 References
 

  1. American Academy of Orthopaedic Surgeons: Outcomes Data Collection Instruments, version 2.0. Rosemont, Illinois, American Academy of Orthopaedic Surgeons, 1997.
  2. Atlas, S. J.; Deyo, R. A.; Keller, R. B.; Chapin, A. M.; Patrick, D. L.; Long, J. M.; and Singer, D. E.: The Maine Lumbar Spine Study, Part II. 1-year outcomes of surgical and nonsurgical management of sciatica. Spine, 21: 1777-1786, 1996.[Medline]
  3. Atlas, S. J.; Deyo, R. A.; Keller, R. B.; Chapin, A. M.; Patrick, D. L.; Long, J. M.; and Singer, D. E.: The Maine Lumbar Spine Study, Part III. 1-year outcomes of surgical and nonsurgical management of lumbar spinal stenosis. Spine, 21: 1787-1795, 1996.[Medline]
  4. Hannan, E. L.; O'Donnell, J. F.; Kilburn, H., Jr.; Bernard, H. R.; and Yazici, A.: Investigation of the relationship between volume and mortality for surgical procedures performed in New York State hospitals. J. Am. Med. Assn., 262: 503-510, 1989.[Abstract/Free Full Text]
  5. Hoffman, R. M.; Wheeler, K. J.; and Deyo, R. A.: Surgery for herniated lumbar discs. A literature synthesis. J. Gen. Intern. Med., 8: 487-496, 1993.[Medline]
  6. Keller, R. B.; Soule, D. N.; Wennberg, J. E.; and Hanley, D. F.: Dealing with geographic variations in the use of hospitals. The experience of the Maine Medical Assessment Foundation Orthopaedic Study Group. J. Bone and Joint Surg., 72-A: 1286-1293, Oct. 1990.[Abstract/Free Full Text]
  7. Keller, R. B.: Outcomes research in orthopaedics. J. Am. Acad. Orthop. Surgeons, 1: 122-129, 1993.[Abstract]
  8. Keller, R. B.; Atlas, S. J.; Singer, D. E.; Chapin, A. M.; Mooney, N. A.; Patrick, D. L.; and Deyo, R. A.: The Maine Lumbar Spine Study, Part I. Background and concepts. Spine, 21: 1769-1776, 1996.[Medline]
  9. McGrath, P. D.; Wennberg, D. E.; Malenka, D. J.; Kellett, M. A., Jr.; Ryan, T. J., Jr.; O'Meara, J. R.; Bradley, W. A.; Hearne, M. J.; Hettleman, B.; Robb, J. F.; Shubrooks, S.; VerLee, P.; Watkins, M. W.; Lucas, F. L.; and O'Connor, G. T.: Operator volume and outcomes in 12,988 percutaneous coronary interventions. Northern New England Cardiovascular Study Group. J. Am. Coll. Cardiol., 31: 570-576, 1998.[Abstract/Free Full Text]
  10. Norton, E. C.; Garfinkel, S. A.; McQuay, L. J.; Heck, D. A.; Wright, J. G.; Dittus, R.; and Lubitz, R. M.: The effect of hospital volume on the in-hospital complication rate in knee replacement patients. Health Serv. Res., 33: 1191-1210, 1998.[Medline]
  11. Patrick, D. L.; Deyo, R. A.; Atlas, S. J.; Singer, D. E.; Chapin, A.; and Keller, R. B.: Assessing health-related quality of life in patients with sciatica. Spine, 20: 1899-1909, 1995.[Medline]
  12. Turner, J. A.; Ersek, M.; Herron, L.; and Deyo, R.: Surgery for lumbar spinal stenosis. Attempted meta-analysis of the literature. Spine, 17: 1-8, 1992.[Medline]
  13. Volinn, E.; Mayer, J.; Diehr, P.; Van Koevering, D.; Connell, F. A.; and Loeser, J. D.: Small area analysis of surgery for low-back pain. Spine, 17: 575-581, 1992.[Medline]
  14. Ware, J. E., Jr., and Sherbourne, C. D.: The MOS 36-Item Short-Form Health Survey (SF-36). I. Conceptual framework and item selection. Med. Care, 30: 473-483, 1992.[Medline]
  15. Wennberg, J. E., and Gittelsohn, A.: Variations in medical care among small areas. Sci. Am., 246: 120-134, 1982.[Medline]
  16. Wennberg, J. E., and Cooper, M. [editors]: The Dartmouth Atlas of Health Care. Chicago, American Hospital Publishing, 1998.
  17. Zeger, S. L., and Liang K.-Y.: Longitudinal data analysis for discrete and continuous outcomes. Biometrics, 42: 121-130, 1986.[Medline]

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Facebook Facebook   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
J Am Board Fam MedHome page
R. A. Deyo, S. K. Mirza, J. A. Turner, and B. I. Martin
Overtreating Chronic Back Pain: Time to Back Off?
J Am Board Fam Med, January 1, 2009; 22(1): 62 - 68.
[Abstract] [Full Text] [PDF]


Home page
ptjournalHome page
S. K Carter and J. A Rizzo
Use of Outpatient Physical Therapy Services by People With Musculoskeletal Conditions
Physical Therapy, May 1, 2007; 87(5): 497 - 512.
[Abstract] [Full Text] [PDF]


Home page
JBJSHome page
A. H. Daniels, M. Arthur, and R. A. Hart
Variability in Rates of Arthrodesis Procedures for Patients with Cervical Spine Injuries with and without Associated Spinal Cord Injury
J. Bone Joint Surg. Am., February 1, 2007; 89(2): 317 - 323.
[Abstract] [Full Text] [PDF]


Home page
J Bone Joint Surg BrHome page
R. Lemaire
Informed consent - a contemporary myth?
J Bone Joint Surg Br, January 1, 2006; 88-B(1): 2 - 7.
[Full Text] [PDF]


Home page
NEJMHome page
R. A. Deyo and J. N. Weinstein
Low Back Pain
N. Engl. J. Med., February 1, 2001; 344(5): 363 - 370.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF) Free
Right arrow Letters to the Editor: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Letters to the Editor are posted
Right arrow Alert me if a correction is posted
Services
Right arrow E-mail this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My File Cabinet
Right arrow Download to citation manager
Right arrow Rights and Permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by KELLER, R. B.
Right arrow Articles by DEYO, R. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by KELLER, R. B.
Right arrow Articles by DEYO, R. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Facebook   Add to Technorati   Add to Twitter  
What's this?