References

Authors
Affiliations

Bowling Green State University

Smith College

Max Marchi

Cleveland Guardians

Adler, Joseph. 2006. Baseball Hacks: Tips & Tools for Analyzing and Winning with Statistics. Sebastopol, CA: O’Reilly Media.
Albert, Jim. 2002. “Smoothing Career Trajectories of Baseball Hitters.” Bowling Green State University.
———. 2008. “Streaky Hitting in Baseball.” Journal of Quantitative Analysis in Sports 4 (1). https://doi.org/10.2202/1559-0410.1085.
———. 2009. “Is Roger Clemens’ WHIP Trajectory Unusual?” Chance 22 (2): 8–20. https://doi.org/10.1080/09332480.2009.10722954.
———. 2017. Teaching Statistics Using Baseball. Washington, DC: Mathematical Association of America.
———. 2018. LearnBayes: Functions for Learning Bayesian Inference. https://CRAN.R-project.org/package=LearnBayes.
Albert, Jim, Jay Bartroff, Roger Blandford, Dan Brooks, Josh Derenski, Larry Goldstein, Anette Hosoi, Gary Lorden, Alan Nathan, and Lloyd Smith. 2018. “Report of the Committee Studying Home Run Rates in MLB.” http://baseball.physics.illinois.edu/HRReport2018.pdf.
Albert, Jim, and Jay Bennett. 2003. Curve Ball: Baseball, Statistics, and the Role of Chance in the Game. New York: Copernicus Books.
Albert, Jim, Anette Hosoi, Alan Nathan, and Lloyd Smith. 2019. “Preliminary Report of the Committee Studying Home Run Rates in MLB.” http://baseball.physics.illinois.edu/HRReport2019.pdf.
Albert, Jim, and Alan Nathan. 2022. “Home Runs and Drag: An Early Look at the 2022 Season.” Fangraphs. https://blogs.fangraphs.com/home-runs-and-drag-an-early-look-at-the-2022-season/.
Albert, Jim, and Maria Rizzo. 2012. R by Example. New York: Springer Science & Business Media.
Allaire, J. J., Yihui Xie, Christophe Dervieux, Jonathan McPherson, Javier Luraschi, Kevin Ushey, Aron Atkins, et al. 2024. rmarkdown: Dynamic Documents for R. https://github.com/rstudio/rmarkdown.
Allen, Dave. 2009a. “Deconstructing the Non-Fastball Run Maps.” Baseball Analysts. http://baseballanalysts.com/archives/2009/03/deconstructing_1.php.
———. 2009b. “Platoon Splits for Three Types of Fastballs.” Baseball Analysts. http://baseballanalysts.com/archives/2009/05/platoon_splits.php.
Appelman, David. 2008. “Get to Know: RE24.” Fangraphs. http://www.fangraphs.com/blogs/get-to-know-re24/.
Bates, Douglas, Martin Mächler, Ben Bolker, and Steve Walker. 2015. “Fitting Linear Mixed-Effects Models Using lme4.” Journal of Statistical Software 67 (1): 1–48. https://doi.org/10.18637/jss.v067.i01.
Baumer, Benjamin S., and Jim Albert. 2024. abdwr3edata: Companion to “Analyzing Baseball Data with R,” 3rd Edition. https://github.com/beanumber/abdwr3edata.
Baumer, Benjamin S., Shane T. Jensen, and Gregory J. Matthews. 2015. “openWAR: An Open Source System for Evaluating Overall Player Performance in Major League Baseball.” Journal of Quantitative Analysis in Sports 11 (2): 69–84. https://doi.org/10.1515/jqas-2014-0098.
Baumer, Benjamin S., Daniel T. Kaplan, and Nicholas J. Horton. 2021a. Modern Data Science with R. 2nd ed. Boca Raton, FL: Chapman & Hall/CRC Press. https://www.routledge.com/Modern-Data-Science-with-R/Baumer-Kaplan-Horton/p/book/9780367191498.
———. 2021b. Modern Data Science with R. 2nd ed. Boca Raton, FL: Chapman & Hall/CRC Press. https://www.routledge.com/Modern-Data-Science-with-R/Baumer-Kaplan-Horton/p/book/9780367191498.
Baumer, Benjamin S., Gregory J. Matthews, and Quang Nguyen. 2023. “Big Ideas in Sports Analytics and Statistical Tools for Their Investigation.” Wiley Interdisciplinary Reviews: Computational Statistics, e1612. https://doi.org/10.1002/wics.1612.
Berry, Scott M. 1991. “The Summer of ’41: A Probabilistic Analysis of DiMaggio’s ‘Streak’ and Williams’s Average of .406.” Chance 4 (4): 8–11. https://doi.org/10.1080/09332480.1991.10542337.
Berry, Scott M., C. Shane Reese, and Patrick D. Larkey. 1999. “Bridging Different Eras in Sports.” Journal of the American Statistical Association 94 (447): 661–76. https://doi.org/10.1080/01621459.1999.10474163.
Bouzarth, Elizabeth, Benjamin Grannan, John Harris, Andrew Hartley, Kevin Hutson, and Ella Morton. 2021. “Swing Shift: A Mathematical Approach to Defensive Positioning in Baseball.” Journal of Quantitative Analysis in Sports 17 (1): 47–55. https://doi.org/10.1515/jqas-2020-0027.
Bradley, Ralph Allan, and Milton E. Terry. 1952. “Rank Analysis of Incomplete Block Designs: I. The Method of Paired Comparisons.” Biometrika 39 (3/4): 324–45. https://doi.org/10.2307/2334029.
Brill, Ryan S., Sameer K. Deshpande, and Abraham J. Wyner. 2023. “A Bayesian Analysis of the Time Through the Order Penalty in Baseball.” Journal of Quantitative Analysis in Sports, no. 0. https://doi.org/10.1515/jqas-2022-0116.
Brooks, Dan, Harry Pavlidis, and Jonathan Judge. 2015. “Moving Beyond WOWY: A Mixed Approach to Measuring Catcher Framing.” Baseball Prospectus. https://www.baseballprospectus.com/news/article/25514/moving-beyond-wowy-a-mixed-approach-to-measuring-catcher-framing/.
Brooks, Dan, and Harry Pavlidis. 2014. “Framing and Blocking Pitches: A Regressed, Probabilistic Model: A New Method for Measuring Catcher Defense.” Baseball Prospectus. https://www.baseballprospectus.com/news/article/22934/framing-and-blocking-pitches-a-regressed-probabilistic-model-a-new-method-for-measuring-catcher-defense/.
Bukiet, Bruce, Elliotte Rusty Harold, and José Luis Palacios. 1997. “A Markov Chain Approach to Baseball.” Operations Research 45 (1): 14–23. https://doi.org/10.1287/opre.45.1.14.
Campitelli, Elio. 2021. metR: Tools for Easier Analysis of Meteorological Fields. https://doi.org/10.5281/zenodo.2593516.
Caola, Ralph. 2003. “Using Calculus to Relate Runs to Wins: Part I.” By the Numbers 13: 9–16.
Carl, Sebastian, and Camden Kay. 2023. mlbplotR: Create ggplot2 and gt Visuals with Major League Baseball Logos. https://CRAN.R-project.org/package=mlbplotR.
Casals, Martí, José Fernández, Victor Martínez, Michael Lopez, Klaus Langohr, and Jordi Cortés. 2023. “A Systematic Review of Sport-Related Packages Within the R CRAN Repository.” International Journal of Sports Science & Coaching 18 (2): 621–29. https://doi.org/10.1177/17479541221136238.
Chang, Winston, Joe Cheng, J. J. Allaire, Carson Sievert, Barret Schloerke, Yihui Xie, Jeff Allen, Jonathan McPherson, Alan Dipert, and Barbara Borges. 2024. shiny: Web Application Framework for R. https://CRAN.R-project.org/package=shiny.
Cleveland, William S. 1979. “Robust Locally Weighted Regression and Smoothing Scatterplots.” Journal of the American Statistical Association 74 (368): 829–36. https://doi.org/10.1080/01621459.1979.10481038.
———. 1985. The Elements of Graphing Data. Vol. 2. Monterey, CA: Wadsworth Advanced Books & Software.
Csárdi, Gábor, Jim Hester, Hadley Wickham, Winston Chang, Martin Morgan, and Dan Tenenbaum. 2024. remotes: R Package Installation from Remote Repositories, Including GitHub. https://CRAN.R-project.org/package=remotes.
Dahl, David B., David Scott, Charles Roosen, Arni Magnusson, and Jonathan Swinton. 2019. xtable: Export Tables to LaTeX or HTML. https://CRAN.R-project.org/package=xtable.
Davenport, Clay, and Keith Woolner. 1999. “Revisiting the Pythagorean Theorem.” Baseball Prospectus. www.baseballprospectus.com/article.php?articleid=342.
Deshpande, Sameer K., and Abraham Wyner. 2017. “A Hierarchical Bayesian Model of Pitch Framing.” Journal of Quantitative Analysis in Sports 13 (3): 95–112. https://doi.org/10.1515/jqas-2017-0027.
Donoho, David. 2017. “50 Years of Data Science.” Journal of Computational and Graphical Statistics 26 (4): 745–66. https://doi.org/10.1080/10618600.2017.1384734.
Douglas, Colin, and Richard Scriven. 2024. retrosheet: Import Professional Baseball Data from Retrosheet. https://CRAN.R-project.org/package=retrosheet.
Fair, Ray C. 2008. “Estimated Age Effects in Baseball.” Journal of Quantitative Analysis in Sports 4 (1). https://doi.org/10.2202/1559-0410.1074.
Fast, Mike. 2010. “What the Heck Is PITCHf/x.” The Hardball Times Annual 2010: 153–58.
———. 2011. “Spinning Yarn: Removing the Mask Encore Presentation.” Baseball Prospectus. https://www.baseballprospectus.com/news/article/15093/spinning-yarn-removing-the-mask-encore-presentation/.
Rodriguez-Sanchez, Francisco, and Connor P. Jackson. 2023. grateful: Facilitate Citation of R Packages. https://pakillo.github.io/grateful/.
Friendly, Michael, Chris Dalzell, Martin Monkman, and Dennis Murphy. 2023. Lahman: Sean Lahman Baseball Database. https://CRAN.R-project.org/package=Lahman.
Gerber, Eric A. E., and Bruce A. Craig. 2021. “A Mixed Effects Multinomial Logistic-Normal Model for Forecasting Baseball Performance.” Journal of Quantitative Analysis in Sports 17 (3): 221–39. https://doi.org/10.1515/jqas-2020-0007.
Gould, Stephen Jay. 1989. “The Streak of Streaks.” Chance 2 (2): 10–16. https://doi.org/10.1080/09332480.1989.10554932.
Grolemund, Garrett, and Hadley Wickham. 2011. “Dates and Times Made Easy with lubridate.” Journal of Statistical Software 40 (3): 1–25. https://www.jstatsoft.org/v40/i03/.
Harrell Jr, Frank E. 2024. Hmisc: Harrell Miscellaneous. https://CRAN.R-project.org/package=Hmisc.
Healey, Glenn. 2019. “A Bayesian Method for Computing Intrinsic Pitch Values Using Kernel Density and Nonparametric Regression Estimates.” Journal of Quantitative Analysis in Sports 15 (1): 59–74. https://doi.org/10.1515/jqas-2017-0058.
Heipp, B. 2003. “W% Estimators.” Buckeyes and Sabermetrics. http://gosu02.tripod.com/id69.html.
Hester, Jim, and Davis Vaughan. 2023. bench: High Precision Timing of R Expressions. https://CRAN.R-project.org/package=bench.
Hester, Jim, Hadley Wickham, and Gábor Csárdi. 2023. fs: Cross-Platform File System Operations Based on libuv. https://CRAN.R-project.org/package=fs.
Hirotsu, Nobuyoshi, and J. Eric Bickel. 2019. “Using a Markov Decision Process to Model the Value of the Sacrifice Bunt.” Journal of Quantitative Analysis in Sports 15 (4): 327–44. https://doi.org/10.1515/jqas-2017-0092.
Ismay, Chester, and Albert Y. Kim. 2019. Modern Dive: Statistical Inference via Data Science. Boca Raton, FL: CRC Press. https://moderndive.com/.
James, Bill. 1980. Baseball Abstract. Lawrence, KS: self-published.
———. 1982. Baseball Abstract. New York: Ballantine Books.
———. 1994. The Politics of Glory: How Baseball’s Hall of Fame Really Works. London: Macmillan.
Judge, Jonathan. 2018. “Bayesian Bagging to Generate Uncertainty Intervals: A Catcher Framing Story.” Baseball Prospectus. https://www.baseballprospectus.com/news/article/38289/bayesian-bagging-generate-uncertainty-intervals-catcher-framing-story/.
Kabacoff, Robert I. 2010. R in Action. New York: Manning Publications.
Kemeny, John G., and James Laurie Snell. 1960. Finite Markov Chains. Vol. 210. New York: Springer-Verlag.
Kepner, Tyler. 2011. “A Night of Twists and Collapses.” The New York Times. https://www.nytimes.com/2011/09/30/sports/baseball/5-hour-joy-ride-like-no-other.html.
Keri, Jonah, and Baseball Prospectus. 2007. Baseball Between the Numbers: Why Everything You Know about the Game Is Wrong. New York: Basic Books.
Lahman, Sean. 2018. “Lahman’s Baseball Database, 1871–2017.” seanlahman.com. http://seanlahman.com/.
Lindbergh, Ben. 2013. “The Art of Pitch Framing.” Grantland. http://grantland.com/features/studying-art-pitch-framing-catchers-such-francisco-cervelli-chris-stewart-jose-molina-others/.
Lindsey, George R. 1963. “An Investigation of Strategies in Baseball.” Operations Research 11 (4): 477–501. https://doi.org/10.1287/opre.11.4.477.
Lopez, Michael J., Gregory J. Matthews, and Benjamin S. Baumer. 2018. “How Often Does the Best Team Win? A Unified Approach to Understanding Randomness in North American Sport.” The Annals of Applied Statistics 12 (4): 2483–2516. https://doi.org/10.1214/18-AOAS1165.
Marchi, Max. 2010. “Platoon Splits 2.0.” Hardball Times. http://www.hardballtimes.com/main/article/platoon-splits-2.0.
McCotter, Trent. 2010. “Hitting Streaks Don’t Obey Your Rules: Evidence That Hitting Streaks Are Not Just Byproducts of Random Variation.” Chance 23 (4): 52–57. https://doi.org/10.1080/09332480.2010.10739837.
Meschiari, Stefano. 2022. latex2exp: Use LaTeX Expressions in Plots. https://CRAN.R-project.org/package=latex2exp.
Mühleisen, Hannes, and Mark Raasveldt. 2024. duckdb: DBI Package for the DuckDB Database Management System. https://CRAN.R-project.org/package=duckdb.
Müller, Kirill. 2020. here: A Simpler Way to Find Your Files. https://CRAN.R-project.org/package=here.
Müller, Kirill, Jeroen Ooms, David James, Saikat DebRoy, Hadley Wickham, and Jeffrey Horner. 2023. RMariaDB: Database Interface and MariaDB Driver. https://CRAN.R-project.org/package=RMariaDB.
Müller, Kirill, Hadley Wickham, David A. James, and Seth Falcon. 2024. RSQLite: SQLite Interface for R. https://CRAN.R-project.org/package=RSQLite.
Murrell, Paul. 2006. R Graphics. Boca Raton, FL: Chapman & Hall, CRC Press.
Nathan, Alan M. 2011. “Baseball ProGUESTus: Home Runs and Humidors: Is There a Connection?” Baseball Prospectus. https://www.baseballprospectus.com/news/article/13057/baseball-proguestus-home-runs-and-humidors-is-there-a-connection/.
Official Playing Rules Committee. 2018. 2018 Official Rules of Major League Baseball. Chicago, IL: Triumph Books. http://www.triumphbooks.com/2018-official-rules-of-major-league-baseball-products-9781629375434.php.
Palmer, Pete. 1983. “Balls and Strikes.” Baseball Analyst. http://sabr.org/research/baseball-analyst-archives.
Pankin, Mark D. 1987. “Baseball as a Markov Chain.” In The Great American Baseball Stat Book, 1st ed., 520–24. New York: Ballantine Books.
Pedersen, Thomas Lin. 2024. patchwork: The Composer of Plots. https://CRAN.R-project.org/package=patchwork.
Petti, Bill, and Saiem Gilani. 2024. baseballr: Acquiring and Analyzing Baseball Data. https://CRAN.R-project.org/package=baseballr.
R Core Team. 2024. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/.
Richardson, Neal, Ian Cook, Nic Crane, Dewey Dunnington, Romain François, Jonathan Keane, Dragoș Moldovan-Grünfeld, Jeroen Ooms, Jacob Wujciak-Jens, and Apache Arrow. 2024. arrow: Integration to Apache Arrow. https://CRAN.R-project.org/package=arrow.
Robinson, David, Alex Hayes, and Simon Couch. 2023. broom: Convert Statistical Objects into Tidy Tibbles. https://CRAN.R-project.org/package=broom.
RStudio Team. 2018. RStudio: Integrated Development Environment for R. Boston, MA: RStudio, Inc. http://www.rstudio.com/.
Schwarz, Alan. 2004. The Numbers Game: Baseball’s Lifelong Fascination with Statistics. New York: St. Martin’s Press.
Seidel, Michael. 2002. Streak: Joe DiMaggio and the Summer of ’41. Lincoln, NE: University of Nebraska Press.
Slowikowski, Kamil. 2024. ggrepel: Automatically Position Non-Overlapping Text Labels with ggplot2. https://CRAN.R-project.org/package=ggrepel.
Tango, Tom M., Mitchel G. Lichtman, and Andrew E. Dolphin. 2007. The Book: Playing the Percentages in Baseball. Dulles, VA: Potomac Books, Inc.
Turkenkopf, Dan. 2008. “Framing the Debate.” Beyond the Box Score. https://www.beyondtheboxscore.com/2008/4/5/389840/framing-the-debate.
Venables, W. N., D. M. Smith, and the R Development Core Team. 2011. “An Introduction to R: Notes on R, a Programming Environment for Data Analysis and Graphics, v. 2.13.0.”
Walsh, John. 2008. “Searching for the Game’s Best Pitch.” The Hardball Times. http://www.hardballtimes.com/main/article/searching-for-the-games-best-pitch/.
———. 2010. “The Compassionate Umpire.” The Hardball Times. http://www.hardballtimes.com/main/article/the-compassionate-umpire/.
Waring, Elin, Michael Quinn, Amelia McNamara, Eduardo Arino de la Rubia, Hao Zhu, and Shannon Ellis. 2022. skimr: Compact and Flexible Summaries of Data. https://CRAN.R-project.org/package=skimr.
Wickham, Hadley. 2014. “Tidy Data.” Journal of Statistical Software 59 (10): 1–23. https://doi.org/10.18637/jss.v059.i10.
———. 2016a. ggplot2: Elegant Graphics for Data Analysis. New York: Springer-Verlag. https://ggplot2.tidyverse.org.
———. 2016b. ggplot2: Elegant Graphics for Data Analysis. New York: Springer.
———. 2022. lobstr: Visualize R Data Structures with Trees. https://CRAN.R-project.org/package=lobstr.
———. 2023a. downlit: Syntax Highlighting and Automatic Linking. https://CRAN.R-project.org/package=downlit.
———. 2023b. modelr: Modelling Functions That Work with the Pipe. https://CRAN.R-project.org/package=modelr.
———. 2023c. stringr: Simple, Consistent Wrappers for Common String Operations. https://CRAN.R-project.org/package=stringr.
———. 2024. rvest: Easily Harvest (Scrape) Web Pages. https://CRAN.R-project.org/package=rvest.
Wickham, Hadley, Mara Averick, Jennifer Bryan, Winston Chang, Lucy D’Agostino McGowan, Romain François, Garrett Grolemund, et al. 2019. “Welcome to the tidyverse.” Journal of Open Source Software 4 (43): 1686. https://doi.org/10.21105/joss.01686.
Wickham, Hadley, Mine Çetinkaya-Rundel, and Garrett Grolemund. 2023. R for Data Science. 2nd ed. Sebastopol, CA: O’Reilly Media, Inc. https://r4ds.hadley.nz/.
Wickham, Hadley, Romain François, Lionel Henry, Kirill Müller, and Davis Vaughan. 2023. dplyr: A Grammar of Data Manipulation. https://CRAN.R-project.org/package=dplyr.
Wickham, Hadley, Maximilian Girlich, and Edgar Ruiz. 2024. dbplyr: A dplyr Back End for Databases. https://CRAN.R-project.org/package=dbplyr.
Wickham, Hadley, Jim Hester, and Jennifer Bryan. 2024. readr: Read Rectangular Text Data. https://CRAN.R-project.org/package=readr.
Wickham, Hadley, Jim Hester, and Jeroen Ooms. 2023. xml2: Parse XML. https://CRAN.R-project.org/package=xml2.
Wilkinson, Leland. 2006. The Grammar of Graphics. New York: Springer Science & Business Media.
Wood, S. N. 2003. “Thin-Plate Regression Splines.” Journal of the Royal Statistical Society (B) 65 (1): 95–114. https://doi.org/10.1111/1467-9868.00374.
———. 2004. “Stable and Efficient Multiple Smoothing Parameter Estimation for Generalized Additive Models.” Journal of the American Statistical Association 99 (467): 673–86. https://doi.org/10.1198/016214504000000980.
———. 2011. “Fast Stable Restricted Maximum Likelihood and Marginal Likelihood Estimation of Semiparametric Generalized Linear Models.” Journal of the Royal Statistical Society (B) 73 (1): 3–36. https://doi.org/10.1111/j.1467-9868.2010.00749.x.
———. 2017. Generalized Additive Models: An Introduction with R. 2nd ed. Boca Raton, FL: Chapman & Hall/CRC.
Wood, S. N., Natalya Pya, and B. Säfken. 2016. “Smoothing Parameter and Model Selection for General Smooth Models (with Discussion).” Journal of the American Statistical Association 111: 1548–75. https://doi.org/10.1080/01621459.2016.1180986.
Xie, Yihui. 2014. “knitr: A Comprehensive Tool for Reproducible Research in R.” In Implementing Reproducible Computational Research, edited by Victoria Stodden, Friedrich Leisch, and Roger D. Peng. Boca Raton, FL: Chapman & Hall/CRC.
———. 2015. Dynamic Documents with R and knitr. 2nd ed. Boca Raton, FL: Chapman & Hall/CRC. https://yihui.org/knitr/.
———. 2023. knitr: A General-Purpose Package for Dynamic Report Generation in R. https://yihui.org/knitr/.
Xie, Yihui, J. J. Allaire, and Garrett Grolemund. 2018. R Markdown: The Definitive Guide. Boca Raton, FL: Chapman & Hall/CRC. https://bookdown.org/yihui/rmarkdown.
Xie, Yihui, Christophe Dervieux, and Emily Riederer. 2020. R Markdown Cookbook. Boca Raton, FL: Chapman & Hall/CRC. https://bookdown.org/yihui/rmarkdown-cookbook.
Zeileis, Achim, and Gabor Grothendieck. 2005. “zoo: S3 Infrastructure for Regular and Irregular Time Series.” Journal of Statistical Software 14 (6): 1–27. https://doi.org/10.18637/jss.v014.i06.
Zhu, Hao. 2024. kableExtra: Construct Complex Table with kable and Pipe Syntax. https://CRAN.R-project.org/package=kableExtra.