References

Abt, Grant, Colin Boreham, Gareth Davison, Robin Jackson, Simon Jobson, Eric Wallace, and Mark Williams. 2025. “Sample Size Estimation Revisited.” Journal of Sports Sciences, 1–6.
Altman, Douglas G, and J Martin Bland. 1995. “Statistics Notes: Absence of Evidence Is Not Evidence of Absence.” BMJ 311 (7003): 485.
Altman, Naomi, and Martin Krzywinski. 2015a. “Points of Significance: Multiple Linear Regression.” Nature Methods 12 (12): 1103–4.
———. 2015b. “Points of Significance: Simple Linear Regression.” Nature Methods 12 (11).
———. 2015c. “Points of Significance: Split Plot Design.” Nature Methods 12 (3): 165.
———. 2016a. “Points of Significance: Analyzing Outliers: Influential or Nuisance.” Nature Methods 13 (4): 281–82.
———. 2016b. “Points of Significance: Regression Diagnostics.” Nature Methods 13 (5): 385–86.
Atkinson, G., and A. Nevill. 2000. “Typical Error Versus Limits of Agreement.” Sports Med 30 (5): 375–81. https://doi.org/10.2165/00007256-200030050-00005.
Atkinson, G., and A. M. Nevill. 1998. “Statistical Methods for Assessing Measurement Error (Reliability) in Variables Relevant to Sports Medicine.” Sports Med 26 (4): 217–38. https://doi.org/10.2165/00007256-199826040-00002.
Bartoš, František, and Maximilian Maier. 2022. “Power or Alpha? The Better Way of Decreasing the False Discovery Rate.” Meta-Psychology 6.
Baumgartner, Ted A. 1968. “The Applicability of the Spearman-Brown Prophecy Formula When Applied to Physical Performance Tests.” Research Quarterly. American Association for Health, Physical Education and Recreation 39 (4): 847–56. https://doi.org/10.1080/10671188.1968.10613429.
———. 1989. “Norm-Referenced Measurement: Reliability.” In Measurement Concepts in Physical Education and Exercise Science, edited by Margaret J Safrit and Terry M Wood, 45–72. Champaign: Human Kinetics Books.
Benjamini, Yoav, and Yosef Hochberg. 1995. “Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing.” Journal of the Royal Statistical Society: Series B (Methodological) 57 (1): 289–300. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x.
Berchtold, André. 2016. “Test–Retest: Agreement or Reliability?” Methodological Innovations 9: 2059799116672875. https://doi.org/10.1177/2059799116672875.
Berger, James O. 2003. “Could Fisher, Jeffreys and Neyman Have Agreed on Testing?” Statistical Science 18 (1): 1–32.
Bishop, Christopher M, and Hugh Bishop. 2023. Deep Learning: Foundations and Concepts. Springer Nature.
Bland, J. M., and D. G. Altman. 1986. “Statistical Methods for Assessing Agreement Between Two Methods of Clinical Measurement.” Lancet 1 (8476): 307–10.
———. 1996. “Measurement Error Proportional to the Mean.” BMJ 313 (7049): 106. https://doi.org/10.1136/bmj.313.7049.106.
———. 1999. “Measuring Agreement in Method Comparison Studies.” Stat Methods Med Res 8 (2): 135–60. https://doi.org/10.1177/096228029900800204.
Borg, David N, Adrian G Barnett, Aaron R Caldwell, Nicole M White, and Ian B Stewart. 2023. “The Bias for Statistical Significance in Sport and Exercise Medicine.” Journal of Science and Medicine in Sport 26 (3): 164–68.
Bradley, P. S., W. Sheldon, B. Wooster, P. Olsen, P. Boanas, and P. Krustrup. 2009. “High-Intensity Running in English FA Premier League Soccer Matches.” Journal of Sports Sciences 27 (2): 159–68. https://doi.org/10.1080/02640410802512775.
Bradley, Ralph A., and Sushil S. Srivastava. 1979. “Correlation in Polynomial Regression.” The American Statistician 33 (1): 11–14. https://doi.org/10.1080/00031305.1979.10482644.
Breiman, Leo. 2001. “Statistical Modeling: The Two Cultures.” Statistical Science 16 (3): 199–231.
Brown, Emily, and Peter O’Donoghue. 2007. “Relating Reliability to Analytical Goals in Performance Analysis.” International Journal of Performance Analysis in Sport 7 (1): 28–34. https://doi.org/10.1080/24748668.2007.11868385.
Bühner, Markus. 2011. Einführung in die Test- und Fragebogenkonstruktion. Vol. 4033. Pearson Deutschland GmbH.
Button, K. S., J. P. Ioannidis, C. Mokrysz, B. A. Nosek, J. Flint, E. S. Robinson, and M. R. Munafo. 2013. “Power Failure: Why Small Sample Size Undermines the Reliability of Neuroscience.” Nat Rev Neurosci 14 (5): 365–76. https://doi.org/10.1038/nrn3475.
Carlin, John B, and Margarita Moreno-Betancur. 2023. “On the Uses and Abuses of Regression Models: A Call for Reform of Statistical Practice and Teaching.” arXiv Preprint arXiv:2309.06668.
Casella, George. 2009. Statistical Design. 1st ed. Springer Texts in Statistics. Springer, New York, NY.
Chambers, John M. 2008. Software for Data Analysis: Programming with R. Vol. 2. 1. Springer.
Chang, Winston. 2018. R Graphics Cookbook: Practical Recipes for Visualizing Data. O’Reilly Media.
Christensen, Ronald. 2018. Analysis of Variance, Design, and Regression: Linear Modeling for Unbalanced Data. CRC Press.
Cohen, Jacob. 1988. Statistical Power Analysis for the Behavioral Sciences. 2nd ed. Routledge.
Coleman, Max, Ryan Burke, Cristina Benavente, Alec Piñero, Francesca Augustin, Jaime Maldonado, James P. Fisher, Douglas Oberlin, Andrew D. Vigotsky, and Brad J. Schoenfeld. 2023. “Supervision During Resistance Training Positively Influences Muscular Adaptations in Resistance-Trained Individuals.” Journal of Sports Sciences 41 (12): 1207–17. https://doi.org/10.1080/02640414.2023.2261090.
Cumming, Geoff. 2013. Understanding the New Statistics: Effect Sizes, Confidence Intervals, and Meta-Analysis. Routledge.
Dalgaard, Peter. 2008. Introductory Statistics with R. Springer.
De Vet, Henrica CW, Caroline B Terwee, Lidwine B Mokkink, and Dirk L Knol. 2011. Measurement in Medicine: A Practical Guide. Cambridge University Press.
Dean, Angela, Daniel Voss, Danel Draguljić, et al. 1999. Design and Analysis of Experiments. Vol. 1. Springer.
Debanne, Thierry, and Guillaume Laffaye. 2011. “Predicting the Throwing Velocity of the Ball in Handball with Anthropometric Variables and Isotonic Tests.” Journal of Sports Sciences 29 (7): 705–13.
Denegar, Craig R, and Donald W Ball. 1993. “Assessing Reliability and Precision of Measurement: An Introduction to Intraclass Correlation and Standard Error of Measurement.” Journal of Sport Rehabilitation 2 (1): 35–42.
Djulbegovic, B., and I. Hozo. 2007. “When Should Potentially False Research Findings Be Considered Acceptable?” PLoS Med 4 (2): e26. https://doi.org/10.1371/journal.pmed.0040026.
Dudek, Frank J. 1979. “The Continuing Misinterpretation of the Standard Error of Measurement.” Psychological Bulletin 86 (2): 335–37. https://doi.org/10.1037/0033-2909.86.2.335.
Duncan, Michael J, Elizabeth Bryant, and David Stodden. 2017. “Low Fundamental Movement Skill Proficiency Is Associated with High BMI and Body Fatness in Girls but Not Boys Aged 6–11 Years Old.” Journal of Sports Sciences 35 (21): 2135–41.
Feise, R. J. 2002. “Do Multiple Outcome Measures Require p-Value Adjustment?” BMC Med Res Methodol 2: 8. https://doi.org/10.1186/1471-2288-2-8.
Ferreira, M. L., R. D. Herbert, P. H. Ferreira, J. Latimer, R. W. Ostelo, D. P. Nascimento, and R. J. Smeets. 2012. “A Critical Review of Methods Used to Determine the Smallest Worthwhile Effect of Interventions for Low Back Pain.” J Clin Epidemiol 65 (3): 253–61. https://doi.org/10.1016/j.jclinepi.2011.06.018.
Fien, Samantha, Tim Henwood, Mike Climstein, Evelyne Rathbone, and Justin WL Keogh. 2019. “Exploring the Feasibility, Sustainability and the Benefits of the GrACE+ GAIT Exercise Programme in the Residential Aged Care Setting.” PeerJ 7: e6973.
Fisher, R. A. 1935. The Design of Experiments. Edinburgh: Oliver & Boyd.
Fisher, Ronald A. 1935. “The Logic of Inductive Inference.” Journal of the Royal Statistical Society 98 (1): 39–82.
Fox, John. 2011. An R Companion to Applied Regression. 2nd ed. SAGE Publications Inc., Thousand Oaks.
Fraser, C. G., P. Hyltoft Peterson, and M. L. Larsen. 1990. “Setting Analytical Goals for Random Analytical Error in Specific Clinical Monitoring Situations.” Clinical Chemistry 36 (9): 1625–28. https://doi.org/10.1093/clinchem/36.9.1625.
Gauss, Carl Friedrich. 1887. Abhandlungen zur Methode der kleinsten Quadrate. P. Stankiewicz.
Gelman, Andrew. 2024. “This Well-Known Paradox of R-Squared Is Still Buggin Me. Can You Help Me Out?” 2024. https://statmodeling.stat.columbia.edu/2024/06/17/this-well-known-paradox-of-r-squared-is-still-buggin-me-can-you-help-me-out/.
Gelman, Andrew, and Hal Stern. 2006. “The Difference Between ‘Significant’ and ‘Not Significant’ Is Not Itself Statistically Significant.” The American Statistician 60 (4): 328–31.
Gigerenzer, Gerd. 2004. “Mindless Statistics.” The Journal of Socio-Economics 33 (5): 587–606.
Goos, Peter, and Bradley Jones. 2011. Optimal Design of Experiments: A Case Study Approach. John Wiley & Sons.
Grant, S., H. Hynes, A. Whittaker, and T. Aitchison. 1996. “Anthropometric, Strength, Endurance and Flexibility Characteristics of Elite and Recreational Climbers.” Journal of Sports Sciences 14 (4): 301–9. https://doi.org/10.1080/02640419608727715.
Greenland, Sander. 2023. “Divergence Versus Decision p-Values: A Distinction Worth Making in Theory and Keeping in Practice: Or, How Divergence p-Values Measure Evidence Even When Decision p-Values Do Not.” Scandinavian Journal of Statistics 50 (1): 54–88. https://doi.org/10.1111/sjos.12625.
Greenland, Sander, Stephen J Senn, Kenneth J Rothman, John B Carlin, Charles Poole, Steven N Goodman, and Douglas G Altman. 2016. “Statistical Tests, p Values, Confidence Intervals, and Power: A Guide to Misinterpretations.” European Journal of Epidemiology 31: 337–50.
Gross, Benedict, Joe Harris, and Emily Riehl. 2019. Fat Chance: Probability from 0 to 1. Cambridge University Press.
Haeffel, Gerald J. 2022. “Psychology Needs to Get Tired of Winning.” Royal Society Open Science 9 (6): 220099.
Hager, Willi. 2013. “The Statistical Theories of Fisher and of Neyman and Pearson: A Methodological Perspective.” Theory & Psychology 23 (2): 251–70.
Harvill, Leo M. 1991. “An NCME Instructional Module on Standard Error of Measurement.” Educational Measurement: Issues and Practice 10 (2): 33–41.
Healy, Kieran. 2018. Data Visualization: A Practical Introduction. Princeton University Press.
Hoekstra, Rink, Richard D Morey, Jeffrey N Rouder, and Eric-Jan Wagenmakers. 2014. “Robust Misinterpretation of Confidence Intervals.” Psychonomic Bulletin & Review 21: 1157–64.
Holm, Sture. 1979. “A Simple Sequentially Rejective Multiple Test Procedure.” Scandinavian Journal of Statistics 6 (2): 65–70.
Hopkins, W. G. 2000. “Measures of Reliability in Sports Medicine and Science.” Sports Med 30 (1): 1–15. https://doi.org/10.2165/00007256-200030010-00001.
Hubbard, Raymond. 2004. “Alphabet Soup: Blurring the Distinctions Between p’s and α’s in Psychological Research.” Theory & Psychology 14 (3): 295–327.
Hubbard, Raymond, and María Jesús Bayarri. 2003. “Confusion over Measures of Evidence (p’s) Versus Errors (α’s) in Classical Statistical Testing.” The American Statistician 57 (3): 171–78.
Hurlbert, S. H. 1984. “Pseudoreplication and the Design of Ecological Field Experiments.” Ecological Monographs 54 (2): 187–211. https://doi.org/10.2307/1942661.
Hursh, J. B. 1939. “Conduction Velocity and Diameter of Nerve Fibers.” American Journal of Physiology 127 (1): 131–39. https://doi.org/10.1152/ajplegacy.1939.127.1.131.
Jadczak, Lukasz, Monika Grygorowicz, Witold Dzudzinski, and Robert Sliwowski. 2019. “Comparison of Static and Dynamic Balance at Different Levels of Sport Competition in Professional and Junior Elite Soccer Players.” The Journal of Strength & Conditioning Research 33 (12): 3384–91.
James, Gareth, Daniela Witten, Trevor Hastie, Robert Tibshirani, and Jonathan Taylor. 2023. An Introduction to Statistical Learning: With Applications in Python. Springer.
Jiroumaru, Takumi, Yutaro Hyodo, Michio Wachi, Nobuko Shichiri, Junko Ochi, and Takamitsu Fujikawa. 2023. “Relationship Between Walking Speed, Respiratory Muscle Strength, and Dynamic Balance in Community-Dwelling Older People Who Required Long-Term Care or Support and Used a Daycare Center.” PeerJ 11: e16630.
Kadlec, Daniel, Kristin L Sainani, and Sophia Nimphius. 2022. “With Great Power Comes Great Responsibility: Common Errors in Meta-Analyses and Meta-Regressions in Strength & Conditioning Research.” Sports Medicine, 1–13.
Kelley, Ken, and Scott E Maxwell. 2003. “Sample Size for Multiple Regression: Obtaining Regression Coefficients That Are Accurate, Not Simply Significant.” Psychological Methods 8 (3): 305.
Kernighan, Brian W, and Rob Pike. 1984. The Unix Programming Environment. Englewood Cliffs, NJ: Prentice Hall.
Koo, Terry K, and Mae Y Li. 2016. “A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research.” Journal of Chiropractic Medicine 15 (2): 155–63.
Kottner, J., and D. L. Streiner. 2011. “The Difference Between Reliability and Agreement.” J Clin Epidemiol 64 (6): 701–2; author reply 702. https://doi.org/10.1016/j.jclinepi.2010.12.001.
Kowalski, Scott M, Peter A Parker, and G Geoffrey Vining. 2007. “Tutorial: Industrial Split-Plot Experiments.” Quality Engineering 19 (1): 1–15.
Kroes, A. D. A., and J. R. Finley. 2023. “Demystifying Omega Squared: Practical Guidance for Effect Size in Common Analysis of Variance Designs.” Psychol Methods. https://doi.org/10.1037/met0000581.
Krzywinski, Martin, and Naomi Altman. 2014a. “Points of Significance: Analysis of Variance and Blocking.” Nature Methods 11 (7): 699–700.
———. 2014b. “Points of View: Designing Comparative Experiments.” Nature Methods 11 (6): 597–98.
Kutner, Michael H, Christopher J Nachtsheim, John Neter, and William Li. 2005. Applied Linear Statistical Models. 5th ed. McGraw-Hill Irwin New York.
Lazic, Stanley E, Charlie J Clarke-Williams, and Marcus R Munafò. 2018. “What Exactly Is ‘n’ in Cell Culture and Animal Experiments?” PLoS Biology 16 (4): e2005282. https://doi.org/10.1371/journal.pbio.2005282.
Lehmann, Erich L. 1993. “The Fisher, Neyman-Pearson Theories of Testing Hypotheses: One Theory or Two?” Journal of the American Statistical Association 88 (424): 1242–49.
Levine, Timothy R., and Craig R. Hullett. 2002. “Eta Squared, Partial Eta Squared, and Misreporting of Effect Size in Communication Research.” Human Communication Research 28 (4): 612–25. https://doi.org/10.1111/j.1468-2958.2002.tb00828.x.
Liljequist, D., B. Elfving, and K. Skavberg Roaldsen. 2019. “Intraclass Correlation - a Discussion and Demonstration of Basic Features.” PLoS One 14 (7): e0219854. https://doi.org/10.1371/journal.pone.0219854.
Louçã, Francisco. 2008. “The Widest Cleft in Statistics: How and Why Fisher Opposed Neyman and Pearson.”
Luchette, M., and A. Akhondi-Asl. 2024. “Measurement Error.” Pediatr Crit Care Med 25 (3): e140–48. https://doi.org/10.1097/PCC.0000000000003420.
Lydick, E., and R. S. Epstein. 1993. “Interpretation of Quality of Life Changes.” Qual Life Res 2 (3): 221–26. https://doi.org/10.1007/BF00435226.
Mansournia, Mohammad Ali, Rachel Waters, Maryam Nazemipour, Martin Bland, and Douglas G. Altman. 2021. “Bland-Altman Methods for Comparing Methods of Measurement and Response to Criticisms.” Global Epidemiology 3: 100045. https://doi.org/10.1016/j.gloepi.2020.100045.
Maxwell, Scott E, Harold D Delaney, and Ken Kelley. 2004. Designing Experiments and Analyzing Data: A Model Comparison Perspective. 2nd ed. Routledge.
Mayo, Deborah G. 2018. Statistical Inference as Severe Testing: How to Get Beyond the Statistics Wars. Cambridge University Press.
McElreath, Richard. 2016. Statistical Rethinking, a Bayesian Course with Examples in R and Stan. 1st ed. Boca Raton: CRC Press.
McGraw, Kenneth O., and S. P. Wong. 1996. “Forming Inferences about Some Intraclass Correlation Coefficients.” Psychological Methods 1 (1): 30–46. https://doi.org/10.1037/1082-989X.1.1.30.
McManus, I. C. 2012. “The Misinterpretation of the Standard Error of Measurement in Medical Education: A Primer on the Problems, Pitfalls and Peculiarities of the Three Different Standard Errors of Measurement.” Med Teach 34 (7): 569–76. https://doi.org/10.3109/0142159X.2012.670318.
Milanese, Chiara, Francesco Piscitelli, Chiara Lampis, and Carlo Zancanaro. 2011. “Anthropometry and Body Composition of Female Handball Players According to Competitive Level or the Playing Position.” Journal of Sports Sciences 29 (12): 1301–9. https://doi.org/10.1080/02640414.2011.591419.
Moosbrugger, Helfried, and Augustin Kelava. 2020. Testtheorie und Fragebogenkonstruktion. 3rd ed. Springer.
Müller, Reinhold, and Petra Büttner. 1994. “A Critical Discussion of Intraclass Correlation Coefficients.” Statistics in Medicine 13 (23-24): 2465–76. https://doi.org/10.1002/sim.4780132310.
Myung, In Jae. 2003. “Tutorial on Maximum Likelihood Estimation.” Journal of Mathematical Psychology 47 (1): 90–100.
Neyman, Jerzy. 1935. “On the Problem of Confidence Intervals.” Biometrika 26: 404–13.
———. 1956. “Note on an Article by Sir Ronald Fisher.” Journal of the Royal Statistical Society Series B: Statistical Methodology 18 (2): 288–94.
Neyman, Jerzy, and Egon Sharpe Pearson. 1933. “On the Problem of the Most Efficient Tests of Statistical Hypotheses.” Philosophical Transactions of the Royal Society of London. Series A 231 (694-706): 289–337.
Nieuwenhuis, S., B. U. Forstmann, and E. J. Wagenmakers. 2011. “Erroneous Analyses of Interactions in Neuroscience: A Problem of Significance.” Nat Neurosci 14 (9): 1105–7. https://doi.org/10.1038/nn.2886.
Norman, G. R., F. G. Sridhar, G. H. Guyatt, and S. D. Walter. 2001. “Relation of Distribution- and Anchor-Based Approaches in Interpretation of Changes in Health-Related Quality of Life.” Med Care 39 (10): 1039–47. https://doi.org/10.1097/00005650-200110000-00002.
Nuzzo, Regina. 2014. “Scientific Method: Statistical Errors.” Nature 506 (7487).
Olejnik, S., and J. Algina. 2003. “Generalized Eta and Omega Squared Statistics: Measures of Effect Size for Some Common Research Designs.” Psychol Methods 8 (4): 434–47. https://doi.org/10.1037/1082-989X.8.4.434.
Patterson, C. H. 1955. “The Interpretation of the Standard Error of Measurement.” The Journal of Experimental Education 23 (3): 247–52. https://doi.org/10.1080/00220973.1955.11010510.
Payne, Robert W. 1989. “Reliability Theory and Clinical Psychology.” Journal of Clinical Psychology 45 (2): 351–53. https://doi.org/10.1002/1097-4679(198903)45:2<351::AID-JCLP2270450228>3.0.CO;2-W.
Peixoto, Julio L. 1987. “Hierarchical Variable Selection in Polynomial Regression Models.” The American Statistician 41 (4): 311–13.
———. 1990. “A Property of Well-Formulated Polynomial Regression Models.” The American Statistician 44 (1): 26–30.
Peng, Roger D. 2016. R Programming for Data Science. Leanpub Victoria, BC, Canada.
Pickett, Craig W, Chris Abbiss, James Zois, and Anthony J Blazevich. 2021. “Pacing and Stroke Kinematics in 200-m Kayak Racing.” Journal of Sports Sciences 39 (10): 1096–1104.
Ponce-García, Tomás, Jerónimo García-Romero, Laura Carrasco-Fernández, Alejandro Castillo-Domínguez, and Javier Benítez-Porres. 2025. “Sex Differences in Anaerobic Performance in CrossFit Athletes: A Comparison of Three Different All-Out Tests.” PeerJ 13: e18930.
Qin, S., L. Nelson, L. McLeod, S. Eremenco, and S. J. Coons. 2019. “Assessing Test-Retest Reliability of Patient-Reported Outcome Measures Using Intraclass Correlation Coefficients: Recommendations for Selecting and Documenting the Analytical Formula.” Qual Life Res 28 (4): 1029–33. https://doi.org/10.1007/s11136-018-2076-0.
Riechman, Steven E, Robert F Zoeller, G Balasekaran, Fredric L Goss, and Robert J Robertson. 2002. “Prediction of 2000 m Indoor Rowing Performance Using a 30 s Sprint and Maximal Oxygen Uptake.” Journal of Sports Sciences 20 (9): 681–87.
Roback, Paul, and Julie Legler. 2021. Beyond Multiple Linear Regression: Applied Generalized Linear Models and Multilevel Models in R. Chapman & Hall/CRC.
Rohde, Charles A. 2014. Introductory Statistical Inference with the Likelihood Function. Springer.
Rothman, K. J. 1990. “No Adjustments Are Needed for Multiple Comparisons.” Epidemiology 1 (1): 43–46.
Sandercock, Gavin. 2024. “The Standard Error/Standard Deviation Mix-up: Potential Impacts on Meta-Analyses in Sports Medicine.” Sports Medicine, 1–10.
Schemper, Michael. 2003. “Predictive Accuracy and Explained Variation.” Statistics in Medicine 22 (14): 2299–2308.
Searle, S. R., F. M. Speed, and G. A. Milliken. 1980. “Population Marginal Means in the Linear Model: An Alternative to Least Squares Means.” The American Statistician 34 (4): 216–21. https://doi.org/10.2307/2684063.
Searle, Shayle R, George Casella, and Charles E McCulloch. 1992. Variance Components. John Wiley & Sons.
Shacham, Mordechai, and Neima Brauner. 1997. “Minimizing the Effects of Collinearity in Polynomial Regression.” Industrial & Engineering Chemistry Research 36 (10): 4405–12. https://doi.org/10.1021/ie970236k.
Shalizi, Cosma. 2015. “Modern Regression - Lecture Notes.” 2015. https://www.stat.cmu.edu/~cshalizi/mreg/15/.
Silverman, S. 2004. “Analyzing Data from Field Research: The Unit of Analysis Issue.” Res Q Exerc Sport 75 (2): iii–iv. https://doi.org/10.1080/02701367.2004.10609141.
Spiegelhalter, David. 2019. The Art of Statistics: Learning from Data. Penguin UK.
Stratford, Paul W, and Charlie H Goldsmith. 1997. “Use of the Standard Error as a Reliability Index of Interest: An Applied Example Using Elbow Flexor Strength Data.” Physical Therapy 77 (7): 745–50.
Terwee, C. B., J. D. Peipert, R. Chapman, J. S. Lai, B. Terluin, D. Cella, P. Griffiths, and L. B. Mokkink. 2021. “Minimal Important Change (MIC): A Conceptual Clarification and Systematic Review of MIC Estimates of PROMIS Measures.” Qual Life Res 30 (10): 2729–54. https://doi.org/10.1007/s11136-021-02925-y.
Terwee, C. B., L. D. Roorda, D. L. Knol, M. R. De Boer, and H. C. De Vet. 2009. “Linking Measurement Error to Minimal Important Change of Patient-Reported Outcomes.” J Clin Epidemiol 62 (10): 1062–67. https://doi.org/10.1016/j.jclinepi.2008.10.011.
Tighe, J., I. C. McManus, N. G. Dewhurst, L. Chis, and J. Mucklow. 2010. “The Standard Error of Measurement Is a More Appropriate Measure of Quality for Postgraduate Medical Assessments Than Is Reliability: An Analysis of MRCP(UK) Examinations.” BMC Med Educ 10: 40. https://doi.org/10.1186/1472-6920-10-40.
Turner, D., H. J. Schunemann, L. E. Griffith, D. E. Beaton, A. M. Griffiths, J. N. Critch, and G. H. Guyatt. 2010. “The Minimal Detectable Change Cannot Reliably Replace the Minimal Important Difference.” J Clin Epidemiol 63 (1): 28–36. https://doi.org/10.1016/j.jclinepi.2009.01.024.
Tyrrell, AR, T Reilly, and JDG Troup. 1985. “Circadian Variation in Stature and the Effects of Spinal Loading.” Spine 10 (2): 161–64.
Uehata, Aki, and Robert Rein. 2025. “Statistical Power Analysis in Exercise Science.” Journal of Sports Sciences 0 (0): 1–17. https://doi.org/10.1080/02640414.2025.2571841.
Venables, Bill. 2023. “Coding Matrices, Contrast Matrices and Linear Models.”
Vet, H. C. de, C. B. Terwee, D. L. Knol, and L. M. Bouter. 2006. “When to Use Agreement Versus Reliability Measures.” J Clin Epidemiol 59 (10): 1033–39. https://doi.org/10.1016/j.jclinepi.2005.10.015.
Vet, Henrica CW de, Lidwine B Mokkink, David G Mosmuller, and Caroline B Terwee. 2017. “Spearman-Brown Prophecy Formula and Cronbach’s Alpha: Different Faces of Reliability and Opportunities for New Applications.” J Clin Epidemiol 85: 45–49.
Wan, Fei. 2020. “Analyzing Pre-Post Designs Using the Analysis of Covariance Models with and Without the Interaction Term in a Heterogeneous Study Population.” Statistical Methods in Medical Research 29 (1): 189–204. https://doi.org/10.1177/0962280219827971.
———. 2021. “Statistical Analysis of Two Arm Randomized Pre-Post Designs with One Post-Treatment Measurement.” BMC Medical Research Methodology 21 (1): 1–16.
Warneke, Konstantin, Thomas Gronwald, Sebastian Wallot, Alessia Magno, Martin Hillebrecht, and Klaus Wirth. 2025. “Discussion on the Validity of Commonly Used Reliability Indices in Sports Medicine and Exercise Science: A Critical Review with Data Simulations.” European Journal of Applied Physiology, 1–16.
Warrens, Matthijs J. 2017. “Transforming Intraclass Correlation Coefficients with the Spearman-Brown Formula.” J Clin Epidemiol 85: 14–16. https://doi.org/10.1016/j.jclinepi.2017.03.005.
Wasserstein, Ronald L, and Nicole A Lazar. 2016. “The ASA Statement on p-Values: Context, Process, and Purpose.” Taylor & Francis.
Westreich, Daniel, and Sander Greenland. 2013. “The Table 2 Fallacy: Presenting and Interpreting Confounder and Modifier Coefficients.” American Journal of Epidemiology 177 (4): 292–98. https://doi.org/10.1093/aje/kws412.
Wickham, Hadley. 2009. Ggplot2: Elegant Graphics for Data Analysis. Springer, New York, NY.
Wickham, Hadley, Mine Çetinkaya-Rundel, and Garrett Grolemund. 2023. R for Data Science. O’Reilly Media, Inc.
Wild, Christopher J, and Georg AF Seber. 2000. Chance Encounters: A First Course in Data Analysis and Inference. Wiley Press.
Wilkinson, G. N., and C. E. Rogers. 1973. “Symbolic Description of Factorial Models for Analysis of Variance.” Applied Statistics 22 (3): 392–99.
Wyrwich, K. W., N. A. Nienaber, W. M. Tierney, and F. D. Wolinsky. 1999. “Linking Clinical Relevance and Statistical Significance in Evaluating Intra-Individual Changes in Health-Related Quality of Life.” Med Care 37 (5): 469–78. https://doi.org/10.1097/00005650-199905000-00006.
Wyrwich, K. W., W. M. Tierney, and F. D. Wolinsky. 1999. “Further Evidence Supporting an SEM-Based Criterion for Identifying Meaningful Intra-Individual Changes in Health-Related Quality of Life.” J Clin Epidemiol 52 (9): 861–73. https://doi.org/10.1016/s0895-4356(99)00071-2.
Young, Alwyn. 2019. “Channeling Fisher: Randomization Tests and the Statistical Insignificance of Seemingly Significant Experimental Results.” The Quarterly Journal of Economics 134 (2): 557–98.