Statistical Fragility and the Role of P Values in the Sports Medicine Literature
Robert L. Parisien, MD, David P. Trofa, MD, Jesse Dashe, MD, Patrick K. Cronin, MD, Emily J. Curry, BA, Freddie H. Fu, MD, Xinning Li, MD
Introduction: Comparative trials evaluating categorical outcomes have important implications on surgical decision making. The purpose of this study was to examine the statistical stability of sports medicine research.
Methods: Comparative clinical sports medicine research studies involving anterior cruciate ligament, meniscus, and knee instability were reviewed in two journals between 2006 and 2016. The statistical stability for each study outcome was determined by the number of event reversals required to changetheP value to either greater or less than 0.05. The number of patients lost to follow-up was also determined.
Results: Of the 1,505 studies screened, 102 studies were included for analysis, 40 of which were randomized controlled trials. There were 339 total outcome events, with 98 significant and 241 not significant.
The Fragility Index, or the median number of events required to change the statistical significance of the overall study, was five (interquartile range, 3 to 8) or 5.4% of the total study population. In addition, the average number of patients lost to follow-up was 7.9, which is greater than the number needed to change the significance of each study arm and the entire study population.
Conclusion: Results in the comparative sports medicine literature may not be as stable as previously thought,with only a small percentage of outcome events needed to change study significance. Outcomes research based on a single discreet P value cutoff may be misleading.