|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Reports |
A Reid, PT, MSc, was a physiotherapist at the Fowler Kennedy Sport Medicine Clinic, 3M Centre, University of Western Ontario, London, Ontario, Canada, during the completion of this project. She is currently a physiotherapist at the Allan McGavin Sports Medicine Centre, John Owen Pavilion, University of British Columbia, Vancouver, Canada
TB Birmingham, PT, PhD, is Associate Professor and Tier 2 Canada Research Chair in Musculoskeletal Rehabilitation, School of Physical Therapy, Elborn College, University of Western Ontario, London, Ontario, Canada N6G 1H1, and Co-Director, Wolf Orthopaedic Biomechanics Laboratory, Fowler Kennedy Sport Medicine Clinic
PW Stratford, PT, MSc, is Professor, School of Rehabilitation Science, and Associate Member, Department of Clinical Epidemiology and Biostatistics, McMaster University, Hamilton, Ontario, Canada, and a Scientific Affiliate in the Department of Surgery, Sunnybrook Health Sciences Centre, Toronto, Ontario, Canada
GK Alcock, PT, MSc, is Physiotherapist, Fowler Kennedy Sport Medicine Clinic
JR Giffin, MD, FRCS(C), is Assistant Professor, Department of Surgery, University of Western Ontario, and Co-Director, Wolf Orthopaedic Biomechanics Laboratory, Fowler Kennedy Sport Medicine Clinic
Address all correspondence to Dr Birmingham at: tbirming{at}uwo.ca
Submitted May 21, 2006;
Accepted November 6, 2006
| Abstract |
|---|
Subjects: Forty-two patients, 15 to 45 years of age, who had undergone ACL reconstruction participated in the study.
Methods and Measures: The study design was prospective and observational with repeated measures. The subjects performed a series of 4 hop tests on 3 separate occasions within the 16th week following surgery and on a fourth occasion 6 weeks later. The tests were a single hop for distance, a 6-m timed hop, a triple hop for distance, and crossover hops for distance. Performance on the ACL-reconstructed limb was expressed as a percentage of the performance on the nonoperative limb, termed the "limb symmetry index." Subjects also completed the Lower Extremity Functional Scale and a global rating of change questionnaire.
Results: Intraclass correlation coefficients for limb symmetry index values ranged from .82 to .93. Standard errors of measurement were 3.04% to 5.59%. Minimal detectable changes, at the 90% confidence level, were 7.05% to 12.96%. Changes in hop test scores on the operative limb were statistically greater than changes on the nonoperative limb. Pearson correlations (r) between change in hop performances and self-reported measures ranged from .26 to .58.
Discussion and Conclusion: The results show that the described series of hop tests provide a reliable and valid performance-based outcome measure for patients undergoing rehabilitation following ACL reconstruction. These findings support the use and facilitate the interpretation of hop tests for research and clinical practice.
| Introduction |
|---|
|
|
|---|
Standardized outcome measures can be described as measures with acceptable measurement properties that have been published with specific procedures for administration, scoring, and interpretation. Dissemination of this type of information has indeed occurred for a variety of self-report measures (questionnaires) and continues to progress. However, research reports focused on similar information for performance-based measures of physical function have not paralleled that for self-report measures. Specifically, although information about the measurement error and ability to detect change has been reported in a clinically interpretable way for many self-report measures, this often is not the case for performance-based measures.
Some authors810 have suggested that self-report and performance-based measures quantify different aspects of function and that using one type of measure alone does not sufficiently capture the breadth of health concepts associated with the measurement of function. Researchers8,9,11,12 investigating the relationship between self-report and performance-based measures have reported Pearson correlations (r) ranging from .02 to .59. Other authors13 have emphasized that there are situations in which performance-based measures may be preferable and have suggested that these measures also be included in research and clinical practice. Owing to the increased emphasis on incorporating functional and sport-specific exercises into current ACL postoperative rehabilitation protocols, and the goal to have patients return to dynamic and potentially injurious activities, the inclusion of outcome measures that are performance-based may be especially important when evaluating these patients.
Hop testing has frequently been proposed as a practical, performance-based outcome measure that reflects the integrated effect of neuromuscular control, strength (force-generating capacity), and confidence in the limb and requires minimal equipment and time to administer.1417 Based on a review of the potential use of hop tests as measures of dynamic knee stability, Fitzgerald et al8 suggested that hopping may be appropriate for use as a predictive tool for identifying patients who may have future problems as a result of knee injury or pathology and as an evaluative tool to reflect change in patient status in response to treatment.
A combination of 4 different hop tests originally described by Noyes et al18 may be particularly suitable as a performance-based outcome measure for patients who are undergoing rehabilitation after ACL reconstruction. The tests incorporate a variety of movement principles (ie, direction change, speed, acceleration-deceleration, rebound) that mimic the demands of dynamic knee stability during sporting activities and are suggested to prepare the patient for return to such activities.7,1922 This series of hop tests involves a single hop for distance, a 6-m timed hop, a triple hop for distance, and crossover hops for distance. Measurements are obtained on both extremities so that test performance on the operative limb can be expressed as a percentage of test performance on the opposite limb, termed the "limb symmetry index."
Based on performance on these 4 hop tests, the limb symmetry index has been used to help differentiate individuals with and without dynamic knee stability18,2327 and to compare different rehabilitation strategies following ACL reconstruction.19 Some authors7,20,21 also have advocated the use of these hop tests when monitoring progress in individual patients who are undergoing rehabilitation following ACL reconstruction. Various clinical practice guidelines include specific scores on the limb symmetry index that must be met in order for a patient to progress through phases of rehabilitation, to return to sports, or to be discharged from physical therapy.7,20,21
Bolgla and Keskula28 evaluated the relative reliability of scores on the limb symmetry index based on the described series hop tests in subjects who were healthy and suggested that it is a reliable measure of lower-extremity performance (intraclass correlation coefficient [ICC]=.95.96). Intraclass correlation coefficients also have been reported for individual hop tests in patients following ACL reconstruction (ICC=.76.97 for the single hop for distance test,11,29,30 ICC=.88.97 for the 6-m timed hop test,11,31 and ICC=.94.98 for the crossover hops for distance test31). However, we are unaware of any previous reports providing estimates of the measurement error and minimal detectable change for the series of hop tests in patients following ACL reconstruction, or the ability of this performance-based measure to detect change during postoperative rehabilitation.
In order to facilitate the use of the described series of hop tests as a standardized performance-based outcome measure for patients who are undergoing rehabilitation following ACL reconstruction, further information regarding its measurement properties should be provided. Specifically, further information regarding the reliability and longitudinal construct validity of data obtained from these hop tests is necessary to more accurately plan future clinical trials and to more confidently make clinical decisions about individual patients. Therefore, the objective of the present study was to investigate the reliability and longitudinal validity of data from these hop tests during rehabilitation after ACL reconstruction.
| Method |
|---|
|
|
|---|
|
Participants
Forty-two patients between the ages of 15 and 45 years participated in this study (Tab. 1). All patients had undergone primary unilateral ACL reconstruction at the Fowler Kennedy Sport Medicine Clinic using a semitendinosus and gracilis tendon autograft and were following the postoperative rehabilitation protocol used at that center. All patients had a stable contralateral knee (no injury or surgical interventions in the past 2 years), had full range of motion in the operative limb when compared with the nonoperative limb (flexion within 5°), and had only trace or no effusion. Patients with concomitant meniscal injury that required repair were included in the study, provided that they were permitted to undergo typical rehabilitation after ACL reconstruction involving immediate full weight-bearing gait and unrestricted nonweight-bearing range of motion.
|
Sample size was based on parameter estimation of the reliability coefficient for overall limb symmetry index, with a lower confidence interval (CI) width of 0.1, an expected ICC of at least .85, and a one-tailed CI set to 1
(
=.05).33 Using these parameters, the estimated sample size required was 36 subjects. Given that the study design involved 4 repeated test occasions over a 6-week period, we conservatively recruited 50 subjects to account for a dropout rate of up to 25%.
One hundred seventeen patients were approached as potential participants. Those who did not enter the study were injured on their nonoperative side (n=5), had undergone revision surgery (n=4), had experienced a superficial wound infection (n=2), had an associated fracture (n=2), had nontypical ACL reconstruction (n=3), were away from home either traveling or attending university (n=20), were outside of a reasonable driving distance (n=23), were unwilling to participate (n=6), or failed to attend the scheduled appointment (n=4). Forty-eight patients were entered into the study.
During the course of the study, 6 patients withdrew from the study for the following reasons: 1 patient moved out of the area, 1 patient was diagnosed with pneumonia, 2 patients had scheduling difficulties, and 2 patients had complaints of thigh pain after 2 consecutive days of testing. Of the remaining 42 patients, 8 patients could attend only 2 of the 3 sessions completed within 1 week. Three patients did not complete the final test day (1 patient had a back injury rendering her unable to hop, 1 patient had hernia surgery, and 1 patient developed a knee effusion after playing ice hockey the previous day). As a result, the final sample consisted of 42 patients who attended either 3 or 4 test occasions and contributed data for summary statistics. Thirty-five patients contributed data to the analysis of reliability, and 39 patients contributed data to the test of longitudinal validity.
Hop Testing Procedures
The series of 4 hop tests was administered in accordance with the protocols outlined by Noyes et al,18 Barber et al,34 and Daniel et al.35 The tests were a single hop for distance, a 6-m timed hop, a triple hop for distance, and a crossover hop for distance (Fig. 2). In keeping with the original description,18 the tests were administered in that order on each test occasion, followed by the administration of the self-report measures. The hop testing course was constructed on low-pile, rubber-backed carpet glued over concrete floor. The course consisted of a 6-m-long x 15-cm-wide marking placed on the floor.
|
For the hops for distance (single, triple, and crossover) to be deemed successful, the landing must have been maintained for 2 seconds. An unsuccessful hop was classified by any of the following: touching down of the contralateral lower extremity, touching down of either upper extremity, loss of balance, or an additional hop on landing. If the hop was unsuccessful, the subject was reminded of the requirement to maintain the landing, and the hop was repeated. No further instructions were provided to the subjects. Typically, 1 or 2 extra trials were required.
The single hop for distance was performed as outlined by Daniel et al.35 The subjects stood on the leg to be tested, hopped, and landed on the same limb. The distance hopped, measured at the level of the great toe, was measured and recorded to the nearest centimeter from a standard tape measure that was permanently affixed to the floor. The timed 6-m hop was performed as outlined by Barber et al.34 Subjects were instructed to perform large one-legged hops in series over the total distance. A standard stopwatch was used to record time. The stopwatch was started when a subjects heel lifted from the starting position and was stopped the moment that the tested foot passed the finish line. Measurements were recorded to the nearest 10th of a second.
The triple hop for distance was performed as outlined by Noyes et al.18 Subjects were instructed to stand on one leg and perform 3 consecutive hops as far as possible, landing on the same leg. The total distance for 3 consecutive hops was recorded. Finally, the crossover hop for distance18 was performed over a 15-cm strip on the floor. The subjects hopped forward 3 times while alternately crossing over a marking. The total distance hopped forward was recorded. Subjects were instructed to position themselves such that the first of the 3 hops was lateral with respect to the direction of crossover. The series of hop tests took approximately 10 minutes to administer.
Self-Report Measures
The Lower Extremity Functional Scale (LEFS) is a region-specific, self-report functional status measure.36 Individuals scores on this 20-item questionnaire range from 0 to 80, with higher scores indicating better functional status. Previous research37 has determined the measurement properties of the LEFS, including its standard error of measurement (SEM) (3.43.9 LEFS points), 90% CI for a given score (±6 LEFS points), minimal detectable change at the 90% confidence level (9 LEFS points), and minimal clinically important difference (9 LEFS points).
On the final test occasion, subjects also completed a global rating of change questionnaire that asked them how much they had changed over the last 6 weeks (ie, since first performing the hop tests).38 This tool was used to provide an indication of the subjects perception of the size of the change experienced. The questionnaire asks patients to indicate whether they are better, worse, or the same, and, if appropriate, how much they have changed on a 15-point scale (7 to 7) that includes descriptors ranging from "a tiny bit, almost same" to "a very great deal."39
Data Analysis
On each test occasion, all hop test scores were recorded as absolute distance (in centimeters) or time (in seconds) and were calculated as the mean of the 2 recorded trials. Also using the mean of 2 trials, the limb symmetry index was calculated such that the score on the ACL-reconstructed limb was expressed as a percentage of the score on the nonoperative limb. Limb symmetry index scores were calculated for each of the 4 hop tests and for the overall combination of hop tests. Although the limb symmetry index scores were the outcome measures of most interest, absolute scores on each limb also were presented to better understand the behavior of the calculated index scores upon repeated assessments.
Hop test scores on each of the 4 test occasions were compared using repeated-measures analyses of variance (ANOVAs). Separate ANOVAs were completed for the operative and nonoperative limbs using data from all subjects. Following a significant main effect, Scheffé post hoc tests were used to compare scores for each test occasion.
Reliability.
Test-retest reliability was assessed using the hop values obtained from test occasions 2 and 3. Reliability first was estimated using ICC(2,1).40 The ICC is a ratio of the variance between patients to the total variance, it provides an indication of how well a measure can distinguish among patients, and it therefore can be considered a measure of relative reliability. Reliability then was estimated using the SEM.41 The SEM provided an expression of an individual subjects hop test measurement error in the original test units (eg centimeters, seconds, percentage), and therefore can be considered an absolute measure of reliability. An upper one-sided 95% CI for the point estimate of the SEM was constructed using the method described by Stratford and Goldsmith.5 The point estimate of the SEM then was used to estimate the error in an individual subjects score at a given point in time, at the 90% confidence level, by multiplying the SEM by the z value for 90% confidence (1.64).
The point estimate of the SEM also was used to calculate an estimate of the minimal detectable change at the 90% confidence level by multiplying the SEM by the square root of 2 (this accounts for measurement error at 2 testing occasions) and the z value for 90% confidence (1.64).42 We used a different level of confidence when creating CIs for point estimates (95%) than when describing the interpretation of an individuals score (90%), partly to emphasize that these concepts are indeed different and because we believed that clinical interpretations based on a single subjects score should be interpreted more liberally than estimates based on our studys sample of subjects (n=35). We felt that the 90% level represented that sentiment while still being quite conservative.
Longitudinal validity.
Change scores were calculated as the difference between scores obtained on test occasion 4 and the mean of test occasions 2 and 3 (n=35). For the subjects without occasion 3 data, the values for test occasion 2 were used (n=4). For known-groups validity, we compared change scores on the absolute hop scores between limbs on each of the 4 hop tests using paired t tests. For convergent validity, we evaluated the correlation between change in limb symmetry index scores and: (1) change in the LEFS and (2) the global rating of change. We calculated Pearson correlation coefficients (r) and lower one-sided 95% CI. Given that previously reported correlations between performance-based and self-report measures have typically ranged from approximately 0 to 0.6,8,9,11,12 we decided on the following criteria for strength of evidence for longitudinal validity: good, r>.5; moderate, r=.36.5; low, r=.2.35; and no evidence, r<.2.
| Results |
|---|
|
|
|---|
|
|
|
When scores were expressed as a percentage of the nonoperative limb (ie, limb symmetry index scores, Tab. 2), the ANOVAs also indicated a significant main effect for time (P<.001) for each of the hop tests and for the combination of tests (overall limb symmetry index). For all tests, post hoc comparisons indicated that the limb symmetry index on the final test occasion was significantly different from those on all other test occasions (P<.005), but there were no significant differences among the first, second, and third test occasions (P>.40).
In general, comparison of hop scores over the 4 test occasions indicated that substantial motor learning took place on both the operative and nonoperative limbs between the first and second test occasions, which then leveled off by the third test occasion. The significant increases in hop scores on the fourth test occasion on the operative limb, but not on the nonoperative limb, suggested that hop performance improved over the 6-week period.
Reliability
Reliability statistics for the hop test limb symmetry index scores are presented in Table 5. The ICCs ranged from .82 to .93 and can be described as indicating excellent relative reliability.43 The single hop test and overall limb symmetry index scores demonstrated the highest relative reliability. The SEM was lowest for the single hop test and overall limb symmetry index scores, suggesting that these measures also demonstrated the highest absolute reliability. The error in an individuals limb symmetry index scores at one point in time and the minimal detectable changes upon reassessment, both at the 90% confidence level, also are presented in Table 5. An example of their interpretation is provided in the "Discussion" section.
|
|
| Discussion |
|---|
|
|
|---|
Limb symmetry index values provide important measures of performance on the operative limb in relation to the nonoperative limb. The fact that limb symmetry index values were relatively stable over the first 3 test occasions (ie, the limb symmetry index accounted for learning that occurred in both limbs) and were similar for male and female subjects also supports their use. However, examining absolute scores also is important. For example, although limb symmetry index values were similar for test occasions 1 and 2, the absolute scores were very different. Examining limb symmetry index in isolation would mask this change in performance.
The ICCs observed in the present study for limb symmetry index scores suggest excellent relative reliability43 and indicate that these tests are appropriate for distinguishing among patients, such as is done when comparing groups of patients participating in randomized clinical trials of different postoperative protocols. The relative reliability of the single hop for distance test in patients 1 to 2 years following ACL reconstruction has previously been reported.11,29,31 Intraclass correlation coefficients for the single hop for distance test reported in Table 5 were similar to those previously reported by Kramer et al29 (ICC=.76.96). The ICCs for limb symmetry scores on the 6-m timed hop and crossover hop tests (Tab. 5) were slightly lower than those reported by Hopper et al31 (6-m timed hop test, ICC=.93.96; crossover hop test, ICC=.94.98). To our knowledge, the ICC for the triple hop for distance test has not been previously reported in patients following ACL reconstruction.
We are unaware of previous reports of the SEM for hop test scores in patients following ACL reconstruction. The present findings facilitate the clinical use of hop tests by providing estimates of measurement error and minimal detectable change (Tab. 5) that enable clinicians to determine how much confidence they can place in their assessment of an individuals hop test limb symmetry index. For example, based on an individuals performance on the overall combination of hops assessed at one point in time (Tab. 5), the limb symmetry index could vary ±4.99% simply due to measurement error (ie, ±SEM x z value for 90% confidence=±3.04x1.64=±4.99%). Additionally, based on the observed minimal detectable change for the overall limb symmetry index (Tab. 5), 90% of stable patients would change by less than 7.05% on repeated measures (ie, ±SEM x z value for 90% confidencex
2=±3.04x1.64x
2=±7.05%).
The following description provides an example of how a physical therapist might use these values in clinical practice. Following adequate practice with hop testing, a patient 16 weeks after ACL reconstruction scores a limb symmetry index of 80% for the overall combination of hops, and the score improves to 90% following 6 weeks of treatment. Upon initial assessment, the clinician can be 90% confident that the true limb symmetry index value could vary from 75% to 85% simply due to measurement error (ie, 80% ± approximately 5%). When tested 6 weeks later, the clinician can be confident that this patient has truly improved because the observed change of 10% (ie, an increase from 80% to 90%) exceeds the minimal detectable change of approximately 7%. Also note that the minimal detectable change could represent deterioration in performance. For example, if the patients score dropped to 70% upon reassessment, the clinician can be confident that this patient has truly deteriorated because the observed change of 10% (ie, a decrease from 80% to 70%) also exceeds the minimal detectable change of approximately 7%.
The present findings are consistent with our constructs for change and provide evidence of longitudinal validity. When investigating known-groups validity, each of the hop tests demonstrated significantly greater changes on the operative limb than on the nonoperative limb over the 6-week period. When investigating convergent validity, the observed correlations between the change in limb symmetry index and change in both self-report measures, the single hop test, the crossover hop test, and the overall combination of hops met our criteria for at least moderate evidence of convergent validity. Interestingly, only the correlation between the change in the limb symmetry index for the overall combination of tests and the global rating of change exceeded .5 (Tab. 6). We speculate that this is because the change in combination of tests provided a more global representation of change in motor performance than any one test alone.
We decided to keep the order of the individual hop tests that make up the full test consistent with its original description.18 In our experience, the 4 hop tests progress logically from less difficult to more difficult, and the initial tests may help to improve performance on the later, more difficult tests. Although reliability would not likely differ from the present findings if a clinician decided to administer just the single hop for distance test (indeed, the present ICC is similar to those reported by Kramer et al29 on just the single hop test), reliability is more likely to change if a clinician decided to administer just one of the more difficult hop tests without adequate practice. Similarly, our experience with these tests suggests that considerable motor learning is likely when first performing them. It is advisable, therefore, to incorporate considerable practice before stable values can be recorded (eg, we used a "practice day" in the present study to ensure that our subjects performances were stable). The limitation in the generalizability of the present findings to the described order of testing and the use of a practice session should be recognized.
Although no subject reported pain during a test session, it is important to note that 2 subjects experienced thigh pain after 2 consecutive days of testing and subsequently withdrew from this study. The 2 subjects were the only subjects to report pain following testing. They were reviewed by the operating surgeon 6 months postoperatively and had fully recovered with no adverse effects. Although guidelines for the postoperative rehabilitation protocol used in the care of our subjects suggested that hopping activities should be incorporated by the 16th week following surgery, this was not the case for the 2 subjects who experienced thigh pain. Considering the repeated eccentric muscle contractions required for the landing portions of hop tests, we believe these 2 subjects experienced delayed onset muscle soreness. Clinicians should be aware of this possibility, clearly question patients about activities that they are accustomed to performing before deciding to use the hop tests, and clearly state the risk to patients undergoing testing.
| Conclusion |
|---|
|
|
|---|
| Footnotes |
|---|
The authors acknowledge the assistance of Michael Hunt and Jennifer Symmes in the completion of this project.
This study was approved by the University of Western Ontario Research Ethics Board for Healthy Sciences Research Involving Human Subjects, which is organized and operates according to the Tri-Council Policy Statement and the Health Canada/ICH Good Clinical Practice Practices: Consolidated Guidelines.
This research was undertaken, in part, thanks to funding from the Canadian Orthopaedic Foundation and the Canada Research Chairs Program.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
T. B. Birmingham, D. M. Bryant, J. R. Giffin, R. B. Litchfield, J. F. Kramer, A. Donner, and P. J. Fowler A Randomized Controlled Trial Comparing the Effectiveness of Functional Knee Brace and Neoprene Sleeve Use After Anterior Cruciate Ligament Reconstruction Am. J. Sports Med., April 1, 2008; 36(4): 648 - 655. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. J. Hurd, M. J. Axe, and L. Snyder-Mackler A 10-Year Prospective Trial of a Patient Management Algorithm and Screening Examination for Highly Active Individuals With Anterior Cruciate Ligament Injury: Part 2, Determinants of Dynamic Knee Stability Am. J. Sports Med., January 1, 2008; 36(1): 48 - 56. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |