|
|
||||||||
Research Reports |
CG Maher, PT, PhD, is Associate Professor, School of Physiotherapy, Faculty of Health Sciences, The University of Sydney, PO Box 170, Lidcombe, New South Wales 1825, Australia (C.Maher{at}fhs.usyd.edu.au). Address all correspondence to Dr Maher
C Sherrington, PT, PhD, is Research Officer, Prince of Wales Medical Research Institute, University of New South Wales, Sydney, New South Wales, Australia
RD Herbert, PT, PhD, is Senior Lecturer, School of Physiotherapy, The University of Sydney
AM Moseley, PT, PhD, is Lecturer, Rehabilitation Studies Unit, Department of Medicine, The University of Sydney
M Elkins, PT, M-HSc, is Research Physiotherapist, Department of Respiratory Medicine, Royal Prince Alfred Hospital, Camperdown, New South Wales, Australia
Submitted January 24, 2003;
Accepted March 25, 2003
| Abstract |
|---|
Key Words: Evidence-based medicine Meta-analysis Physical therapy Randomized controlled trials
| Introduction |
|---|
|
|
|---|
Systematic reviews may exclude low-quality studies from the analysis (eg, systematic review by Herbert and Gabriel7), or they may weight the findings of low-quality studies less heavily in the analysis (eg, systematic reviews by van Tulder et al,8 van Poppel et al,9 and Berghmans et al10). Consequently, the method of quality assessment can affect the conclusions of reviews that use quantitative methods (eg, meta-analysis)11 or qualitative methods (eg, the levels of evidence approach)12 to summarize the results. As an illustration, Colle and colleagues12 have shown in a re-analysis of the RCTs included in the Cochrane review of exercise for low back pain13 that the conclusions of the review changed substantially when different scales were used to rate the RCTs. The original conclusion was that there was "conflicting evidence" on the effectiveness of exercise therapy versus inactive treatment, but the conclusion changed to "strong evidence" that exercise is more effective than inactive treatment when the Beckerman scale, rather than the original scale, was used to rate RCT quality.12
The methodological quality of an RCT also may be of interest outside the context of a systematic review. Researchers planning or reporting an RCT, journal reviewers considering a manuscript reporting an RCT, or clinicians judging whether RCTs have relevance to their practice all may need to consider the issue of methodological quality. Although there are numerous definitions of RCT methodological quality, we prefer the following definition: "the likelihood of the trial design to generate unbiased results that are sufficiently precise and allow replication in clinical practice."3(p651)
Verhagen and colleagues3 described 2 approaches to quality assessment of RCTs. The first approach focuses on the presence or absence of key methodological components, such as randomization or blinding, whereas the second uses a criteria list to generate a quality score that provides an overall estimate of RCT quality.3 Both approaches judge the quality of the RCT as can best be discerned from the trial report, not the quality of the RCT. The published report of an RCT could provide a biased view of the quality of the RCT as conducted. For example, the quality of the trial could be underestimated if the report fails to mention that the trial incorporated desirable methodological features such as intention-to-treat analysis or concealed allocation. To address this problem, the CONSORT statement was developed by the CONSORT group in order to improve the quality of reports of RCTs.14
One issue that has received little consideration is the reliability of the assessments of RCT quality. In a 1998 review, 21 scales of trial quality were described, and only 12 scales had any evidence about reliability. In a review published in 2001,3 the authors studied about 60 scales and reported that the reliability of the resultant assessments for most of the scales was unknown. The reliability of the assessments obtained with what we believe is the most widely used scale (the Jadad scale used, for example, by the Philadelphia Panel15) is in dispute.16 Two groups16,17 reported low reliability of assessments obtained with the Jadad scale, whereas another 2 groups18,19 reported high reliability. The study by Jadad et al19 probably provides an overly optimistic view of reliability as the authors excluded one assessor's results because "he recorded the scores incorrectly and it was impossible to determine to which report each score referred."19(p5)
For researchers considering which scale they should use in a systematic review, there is an additional problem. In most systematic reviews, it is not the rating of an individual that is used but rather a consensus rating from 2 or more assessors. Therefore, the reliability estimate that is most important for people conducting systematic reviews is the reliability of consensus ratings, not the reliability of an individual rating. We are unaware of any study that has evaluated the reliability of consensus ratings.
In this article, we report on 2 studies that evaluated the reliability of data obtained with the PEDro scale. The scale is called the PEDro scale because it was initially developed to rate quality of RCTs on PEDro, the Physiotherapy Evidence Database (www.pedro.fhs.usyd.edu.au). The PEDro scale is an 11-item scale designed for rating methodological quality of RCTs (the scale is presented in the Figure, and operational definitions for each scale item are given in the Appendix). Each satisfied item (except for item 1, which, unlike other scale items, pertains to external validity) contributes one point to the total PEDro score (range=010 points). The scale has been used to rate the quality of over 3,000 RCTs in the PEDro database20 and in several systematic reviews.7,21,22 The scale is based on the list developed by Verhagen et al23 using the Delphi consensus technique. There is evidence for discriminative validity for 3 of the scale items: randomization,24 concealed allocation,4,6,24 and blinding.4 The other items are reported to have face validity23 but are yet to be validated by other means.
|
Because we regularly use the PEDro scale to rate RCTs in the database and to rate RCTs for the systematic reviews we conduct, we were interested in the reliability of assessments obtained with the PEDro scale. Additionally, the reliability of assessments obtained with the PEDro scale is likely to be of interest to the large number of people who have used the PEDro database. In this article, we report on 2 studies that investigated the interrater reliability of ratings of each of the 11 items on the PEDro scale and the total (summed) PEDro score. Interrater reliability was evaluated for individual ratings and consensus ratings.
| Method |
|---|
|
|
|---|
In both studies, raters were volunteer physical therapists who had been trained in the use of the scale. For training, raters rated a series of 5 practice RCTs and were given feedback on their performance using criterion ratings that we generated, as well as justification of the rating for each item. Instructions for obtaining the training package are on the PEDro Web site (www.pedro.fhs.usyd.edu.au). Additional feedback was obtained via e-mail correspondence with the third author (RDH). Raters then had to pass a rating accuracy test using a separate set of RCTs. Each rater independently rated 5 test RCTs using the 11-item PEDro scale (ie, a total of 55 ratings), and these ratings were submitted via e-mail and compared with criterion ratings that we generated. Those who scored 51/55 or more correct ratings (ie, <10% errors) were considered able to rate RCTs, whereas those who scored from 46/55 to 50/55 received further feedback before they were considered able to rate trials (raters scoring 45/55 or less were excluded from further rating unless they passed a subsequent accuracy test using another set of 5 RCTs). The cutoff of <10% errors as evidence of ability to rate RCTs is the consensus opinion of the PEDro developers and was not empirically derived.
In study 2, we examined the reliability of ratings on a larger sample of RCTs, and we examined the reliability of ratings made by a panel of 2 or 3 raters (ie, consensus ratings). For this study, 120 English-language RCTs with consensus ratings were randomly selected from the PEDro database (using the random number function in Microsoft Excel). One of the retrieved RCTs was published in the 1960s, 5 were published in the 1970s, 29 were published in the 1980s, 73 were published in the 1990s, and 12 were published in the 2000s. Forty-eight RCTs were coded as relevant to the musculoskeletal subdiscipline, 22 as relevant to cardiothoracics, 15 as relevant to gerontology, 9 as relevant to orthopedics, 8 as relevant to neurology, 6 as relevant to continence and women's health, 4 as relevant to pediatrics, 4 as not being relevant to a specific subdiscipline, 3 as relevant to ergonomics, and 1 as relevant to sports. Each RCT had previously been independently rated by 2 raters, and where the ratings for any scale item in any RCT disagreed, a third (consensus) rater arbitrated. These existing ratings, together with a new set of independent ratings created for the study, were used in the reliability analysis.
Consensus ratings were performed by 4 of the authors (CGM, CS, RDH, and AMM) and 2 research assistants who developed the PEDro scale and maintain the PEDro database. Raters were asked to specify where in the report of an RCT each criterion was described as being fulfilled, and these sheets were made available to the consensus raters. Many of the disagreements in the use of the scale seem to arise when one rater misses the description in the text of inclusion of the methodological feature. This can arise if the report of an RCT is poorly organized (eg, describing use of an intention-to-treat analysis in the discussion section) or the article is old and attends to a methodological feature (eg, concealed allocation) but does not use the specific term because it was not yet in common use. As an illustration, Doull and colleagues' 1931 RCT26 described a process that would achieve concealed allocation, but the term "concealed allocation" would not be coined for many decades.
The final rating (that agreed on by the first 2 raters or assigned by the third rater) will be referred to as the "consensus rating." The 120 RCTs were assessed by 25 raters who each rated from 1 to 56 RCTs (
=13.8). A third rater was required for at least one scale item in all except 24 RCTs. All of these ratings were conducted as part of the normal process of maintaining the PEDro database. The raters were not aware that the reliability of ratings would be evaluated.
Subsequently, the 120 RCTs in study 2 were re-rated by a different set of raters. Again, each RCT was rated twice, and where necessary a third rater arbitrated. Seven raters each rated from 8 to 60 trials (
=45). A third assessment was required for at least one scale item in all except 27 RCTs.
The reliability of dichotomous judgments for each item was evaluated with a generalized kappa statistic using the multirater kappa utility.
In addition, the base rate for a positive response and the percent of agreement were calculated. The reliability of individual ratings was evaluated in study 1 by comparing the ratings from all 5 raters and in study 2 by comparing all 4 individual ratings (the first 2 ratings from each of the 2 sets of raters). The reliability of consensus ratings was evaluated in study 2 by comparing the first and second sets of consensus ratings.
The reliability of the total PEDro score (obtained by summing "yes" responses to items 211) was evaluated using type 1,1 intraclass correlation coefficients (ICCs) with the ICCSF1A.SPS macro in SPSS10.0 (SPSS for Windows, Release 10.0.5
). In study 1, each of the 11 raters rated each RCT, so the 2,1 form of the ICC statistic would normally be used. In study 2, pairs of raters were drawn from a larger panel, so not all raters rated each RCT. Thus, in study 2, the 1,1 form of the ICC statistic was used. We believed it was more appropriate to use the same ICC model for each study (because this facilitates comparison across studies); therefore, we used the 1,1 model in both studies. The consequence of such a choice is that we potentially slightly underestimated the reliability of ratings in study 1. In addition, we determined the percentage of close agreement of ratings within 2 points on the total PEDro scale for all ratings and the standard error of the measurement for the consensus ratings only.
Reliability estimates lie along a continuum. Although kappa and ICC values are continuous data, we believe that physical therapists collapse these continuous data into discrete categories when they recall the results of reliability studies. Data to support this contention, however, are lacking. Because we believe therapists categorize measurements of reliability, we have chosen to describe the level of reliability for the kappa values using categories suggested by Landis and Koch27 (
.81="almost perfect," .61.80="substantial," .41.60="moderate," .21.40="fair," .00.20="slight," and <.00="poor") and for ICC values using those suggested by Fleiss28 (>.75="excellent" reliability, .40.75="fair" to "good" reliability, and <.40="poor" reliability). The categories provide a description of the level of reliability that some readers may find useful. They should not be used to make a judgment as to whether the level of reliability is acceptable or not. Such a decision would require a consideration of how the data will be used.
| Results |
|---|
|
|
|---|
|
|
The ICCs for interrater reliability of the total PEDro scores for individual raters were .55 (95% confidence interval [CI]=.41, .72) for study 1 and .56 (95% CI=.47, .65) for study 2. The ICC for consensus ratings was slightly higher at .68 (95% CI=.57, .76). These findings suggest that the total PEDro score can be assessed with "fair" to "good" reliability. In study 1, assessments by individual raters of the total PEDro score agreed exactly on 35% of occasions, differed by 1 point or less on 78% of occasions, and differed by 2 points or less 93% of the time. In study 2, exact agreement occurred on 35% of occasions, and individual raters differed by 1 point or less on 78% of occasions and by 2 points or less 94% of the time. Consensus scores were in exact agreement 46% of the time, differed by 1 point or less 85% of the time, and differed by 2 points or less 99% of the time. The standard error of the measurement for the consensus ratings was 0.70 unit.
| Discussion |
|---|
|
|
|---|
The reliability for some PEDro scale items was only "fair" or "moderate." The item "groups similar at baseline" was the only one that had "fair" reliability, and the reliability for this item improved to "moderate" when consensus ratings were used. Rating this item requires a decision as to whether groups of subjects in a RCT were similar on key prognostic indicators prior to the intervention. It is likely that this decision is influenced by the rater's knowledge of the condition being treated and how strictly the term "similar" is interpreted. The other 2 items with comparable reliability in consensus ratings were "point measures and variability data" and "intention-to-treat analysis." The appearance here of "point measures and variability data" is a little surprising because the presence or otherwise of such measures should be relatively easy to establish. The majority of RCTs in the PEDro database do not include an intention-to-treat analysis.25 Where an intention-to-treat analysis has been undertaken, it is often not explicitly stated, so accurate rating of this item required careful reading of the text to establish whether this had occurred. Our impression is that intention-to-treat analysis is better reported in more recent articles.
The reliability we observed for the total PEDro score for panels of raters (ICC=.68, 95% CI=.57, .76) is similar to that reported by Berard et al29 for the Chalmers scale (ICC=.66, 95% CI=.55, .79), by Jadad et al19 for the Jadad scale (ICC=.59, 95% CI=.46, .74), and by Verhagen et al30 for the Maastricht list (ICC=.77, 95% CI=.64, .89) but not as high as the reliability reported by Oremus et al18 for the Jadad scale (ICC=.90). Our reliability for individual items is difficult to benchmark because only Clark et al17 provided reliability estimates for each item using the Jadad scale, and the items in that scale are not sufficiently similar to items in the PEDro scale to allow meaningful comparison.
For a number of the scale items, the base rate was either very high or very low. When interpreting the kappa values for these items, readers need to be aware of the behavior of kappa values. When the prevalence (or base rate) is either very high or very low, it is possible to have high agreement but a low kappa value, and this characteristic of the kappa statistic is sometimes called the "base rate problem."31 This characteristic is not unique to the kappa statistic but also occurs, for example, with the ICC statistic when rating a homogeneous sample. Spitznagel and Helzer31 viewed this influence of the base rate on the kappa value as undesirable, whereas Shrout et al32 contended that this behavior is entirely appropriate and "represents the real problem of making distinctions in increasingly homogeneous populations."32(p175) They stated that "a major strength of K is precisely that it does weigh disagreements more when the base rate approaches 0% or 100%."32(p174)
Our opinion on this issue is closer to Shrout and colleagues' position,32 and so we would defend the use of the kappa statistic in our study. We believe that the important issue is not a low base rate but the scenario where a data set has an artificially low base rate that is not representative of the population. In such a situation, both sides of the base rate problem debate would agree that the estimates of reliability provided by the kappa statistic are misleading. In both studies, we randomly selected a sample of trials from the population of trials on the PEDro database. Not surprisingly, the base rates in the 2 samples were very similar to the base rate for the population (see Moseley et al25). Accordingly, we believe that the use of the kappa statistic was justified in our studies and did not produce misleading inferences about reliability of ratings for items on the PEDro scale.
An understanding of the error associated with the PEDro scale can be used to guide the conduct of a systematic review that uses a minimum PEDro score as an inclusion criterion. In our studies, we noted that repeated PEDro consensus scores were within one point on 85% of occasions and within 2 points on 99% of occasions. We believe it is sensible to conduct a sensitivity analysis to see how the conclusions of a systematic review are affected by varying the PEDro cutoff. For example, in Maher's review of workplace interventions to prevent low back pain,22 reducing the PEDro cutoff from the original strict PEDro cutoff of 6 to a less strict cutoff of 5 (or even 4) did not change the conclusion that there was strong evidence that braces are ineffective in preventing low back pain. Readers should have more confidence in the conclusion of a review that is unaffected by changing the quality cutoff.
The precision of the PEDro scale also should be considered by users of the PEDro database. None of the scale items had perfect reliability for the consensus ratings (consensus ratings are displayed on the PEDro database); thus, users need to understand that the PEDro scores contain some error. Readers who use the total score to distinguish between low- and high-quality RCTs need to recall that the standard error of the measurement for total scores is 0.70 unit and consider this when comparing 2 studies. Based on this standard error of the measurement, a difference of 1 unit in the PEDro scores of 2 studies provides 68% confidence that the 2 studies truly had different PEDro scores, a difference of 2 units provides 96% confidence that the 2 studies truly had different PEDro scores, and a difference of 3 units provides 99% confidence that the 2 studies truly had different PEDro scores.
| Conclusion |
|---|
|
|
|---|
| Appendix |
|---|
|
|
|---|
|
| Footnotes |
|---|
All authors provided concept/idea/research design, writing, data collection and analysis, project management, fund procurement, subjects, facilities/equipment, institutional liaisons, and consultation (including review of manuscript before submission).
The study was partially funded by the Centre for Evidence-Based Physiotherapy's financial supporters: Motor Accidents Authority of New South Wales, Australia; Physiotherapists Registration Board of New South Wales, Australia; NRMA Insurance, Australia; New South Wales Department of Health, Australia.
* Microsoft Corp, One Microsoft Way, Redmond, WA 98052-6399. ![]()
Christopher N Chapman, University of Tulsa. ![]()
SPSS Inc, 233 Wacker Dr, Chicago, IL 60606. ![]()
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
L. Wevers, I. van de Port, M. Vermue, G. Mead, and G. Kwakkel Effects of Task-Oriented Circuit Class Training on Walking Competency After Stroke: A Systematic Review Stroke, July 1, 2009; 40(7): 2450 - 2459. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Sakzewski, J. Ziviani, and R. Boyd Systematic Review and Meta-analysis of Therapeutic Management of Upper-Limb Dysfunction in Children With Congenital Hemiplegia Pediatrics, June 1, 2009; 123(6): e1111 - e1122. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Langer, E. Hendriks, C. Burtin, V. Probst, C. van der Schans, W. Paterson, M. Verhoef-de Wijk, R. Straver, M. Klaassen, T. Troosters, et al. A clinical practice guideline for physiotherapists treating patients with chronic obstructive pulmonary disease based on a systematic review of available evidence Clinical Rehabilitation, May 1, 2009; 23(5): 445 - 462. [Abstract] [PDF] |
||||
![]() |
L. A. C. Machado, S. J. Kamper, R. D. Herbert, C. G. Maher, and J. H. McAuley Analgesic effects of treatments for non-specific low back pain: a meta-analysis of placebo-controlled randomized trials Rheumatology, May 1, 2009; 48(5): 520 - 527. [Abstract] [Full Text] [PDF] |
||||
![]() |
M Asano, D. Dawes, A Arafah, C Moriello, and N. Mayo What does a structured review of the effectiveness of exercise interventions for persons with multiple sclerosis tell us about the challenges of designing trials? Multiple Sclerosis, April 1, 2009; 15(4): 412 - 421. [Abstract] [PDF] |
||||
![]() |
B. J. Kollen, S. Lennon, B. Lyons, L. Wheatley-Smith, M. Scheper, J. H. Buurke, J. Halfens, A. C.H. Geurts, and G. Kwakkel The Effectiveness of the Bobath Concept in Stroke Rehabilitation: What is the Evidence? Stroke, April 1, 2009; 40(4): e89 - e97. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Padua and L. J. DiStefano Sagittal Plane Knee Biomechanics and Vertical Ground Reaction Forces Are Modified Following ACL Injury Prevention Programs: A Systematic Review Sports Health: A Multidisciplinary Approach, March 1, 2009; 1(2): 165 - 173. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Waller, J. Lambeck, and D. Daly Therapeutic aquatic exercise in the treatment of low back pain: a systematic review Clinical Rehabilitation, January 1, 2009; 23(1): 3 - 14. [Abstract] [PDF] |
||||
![]() |
L. G Macedo, C. G Maher, J. Latimer, and J. H McAuley Motor Control Exercise for Persistent, Nonspecific Low Back Pain: A Systematic Review Physical Therapy, January 1, 2009; 89(1): 9 - 25. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. G Maher, A. M Moseley, C. Sherrington, M. R Elkins, and R. D Herbert A Description of the Trials, Reviews, and Practice Guidelines Indexed in the PEDro Database Physical Therapy, September 1, 2008; 88(9): 1068 - 1077. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Meilink, B. Hemmen, H. Seelen, and G. Kwakkel Impact of EMG-triggered neuromuscular stimulation of the wrist and finger extensors of the paretic hand after stroke: a systematic review of the literature Clinical Rehabilitation, April 1, 2008; 22(4): 291 - 305. [Abstract] [PDF] |
||||
![]() |
R. L. Zemek, S. K. Bhogal, and F. M. Ducharme Systematic Review of Randomized Controlled Trials Examining Written Action Plans in Children: What Is the Plan? Arch Pediatr Adolesc Med, February 1, 2008; 162(2): 157 - 163. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Olivo, L. G. Macedo, I. C. Gadotti, J. Fuentes, T. Stanton, and D. J Magee Scales to Assess the Quality of Randomized Controlled Trials: A Systematic Review Physical Therapy, February 1, 2008; 88(2): 156 - 175. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. D. Carnaby-Mann and M. A. Crary Examining the Evidence on Neuromuscular Electrical Stimulation for Swallowing: A Meta-analysis Arch Otolaryngol Head Neck Surg, June 1, 2007; 133(6): 564 - 571. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Markovic and R. U Newton Does plyometric training improve vertical jump height? A meta-analytical review * Commentary Br. J. Sports Med., June 1, 2007; 41(6): 349 - 355. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. L Woodley, R. J Newsham-West, G D. Baxter, M Kjaer, and M S Koehle Chronic tendinopathy: effectiveness of eccentric exercise * COMMENTARY 1 * COMMENTARY 2 Br. J. Sports Med., April 1, 2007; 41(4): 188 - 198. [Full Text] [PDF] |
||||
![]() |
N. A de Morton, J. L Keating, and K. Jeffs The effect of exercise on outcomes for older acute medical inpatients compared with control or alternative treatments: a systematic review of randomized controlled trials Clinical Rehabilitation, January 1, 2007; 21(1): 3 - 16. [Abstract] [PDF] |
||||
![]() |
J A Radford, J Burns, R Buchbinder, K B Landorf, C Cook, and K Rome Does stretching increase ankle dorsiflexion range of motion? A systematic review * COMMENTARY Br. J. Sports Med., October 1, 2006; 40(10): 870 - 875. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. M. Levack, K. Taylor, R. J Siegert, S. G Dean, K. M McPherson, and M. Weatherall Is goal planning in rehabilitation effective? A systematic review Clinical Rehabilitation, September 1, 2006; 20(9): 739 - 755. [Abstract] [PDF] |
||||
![]() |
N. C Foley, S. K Bhogal, R. W Teasell, Y. Bureau, and M. R Speechley Estimates of Quality and Reliability With the Physiotherapy Evidence-Based Database Scale to Assess the Methodology of Randomized Controlled Trials of Pharmacological and Nonpharmacological Interventions Physical Therapy, June 1, 2006; 86(6): 817 - 824. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Y. Pang, J. J Eng, A. S Dawson, and S. Gylfadottir The use of aerobic exercise training in improving aerobic capacity in individuals with stroke: a meta-analysis Clinical Rehabilitation, February 1, 2006; 20(2): 97 - 111. [Abstract] [PDF] |
||||
![]() |
L Bisset, A Paungmali, B Vicenzino, E Beller, and R D Herbert A systematic review and meta-analysis of clinical trials on physical interventions for lateral epicondylalgia * Commentary Br. J. Sports Med., July 1, 2005; 39(7): 411 - 422. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. G Maher, C. Sherrington, M. Elkins, R. D Herbert, and A. M Moseley Challenges for Evidence-Based Physical Therapy: Accessing and Interpreting High-Quality Evidence on Therapy Physical Therapy, July 1, 2004; 84(7): 644 - 654. [Abstract] [Full Text] [PDF] |
||||
![]() |
N A Lannin and R D Herbert Is hand splinting effective for adults following stroke? A systematic review and methodological critique of published research Clinical Rehabilitation, August 1, 2003; 17(8): 807 - 816. [Abstract] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |