References
Adler, Joseph. 2006. Baseball Hacks: Tips & Tools for Analyzing
and Winning with Statistics. Sebastopol, CA: O’Reilly Media.
Albert, Jim. 2002. “Smoothing Career Trajectories of Baseball
Hitters.” Bowling Green State University.
———. 2008. “Streaky Hitting in Baseball.” Journal of
Quantitative Analysis in Sports 4 (1). https://doi.org/10.2202/1559-0410.1085.
———. 2009. “Is Roger Clemens’ WHIP
Trajectory Unusual?” Chance 22 (2): 8–20. https://doi.org/10.1080/09332480.2009.10722954.
———. 2017. Teaching Statistics Using Baseball. Washington, DC:
Mathematical Association of America.
———. 2018. LearnBayes: Functions for Learning Bayesian
Inference. https://CRAN.R-project.org/package=LearnBayes.
Albert, Jim, Jay Bartroff, Roger Blandford, Dan Brooks, Josh Derenski,
Larry Goldstein, Hosoi Anette, Gary Lorden, Alan Nathan, and Lloyd
Smith. 2018. “Report of the Committee Studying Home Run Rates in
MLB.” http://baseball.physics.illinois.edu/HRReport2018.pdf.
Albert, Jim, and Jay Bennett. 2003. Curve Ball:
Baseball, Statistics, and the Role of Chance in the Game.
New York: Copernicus Books.
Albert, Jim, Anette Hosoi, Alan Nathan, and Lloyd Smith. 2019.
“Preliminary Report of the Committee Studying Home Run Rates in
MLB.” http://baseball.physics.illinois.edu/HRReport2019.pdf.
Albert, Jim, and Alan Nathan. 2022. “Home Runs and Drag: An Early
Look at the 2022 Season.” Fangraphs. https://blogs.fangraphs.com/home-runs-and-drag-an-early-look-at-the-2022-season/.
Albert, Jim, and Maria Rizzo. 2012. R by Example. New York:
Springer Science & Business Media.
Allaire, J. J., Yihui Xie, Christophe Dervieux, Jonathan McPherson,
Javier Luraschi, Kevin Ushey, Aron Atkins, et al. 2024. rmarkdown: Dynamic Documents for r. https://github.com/rstudio/rmarkdown.
Allen, Dave. 2009a. “Deconstructing the Non-Fastball Run
Maps.” Baseball Analysts. http://baseballanalysts.com/archives/2009/03/deconstructing_1.php.
———. 2009b. “Platoon Splits for Three Types of Fastballs.”
Baseball Analysts. http://baseballanalysts.com/archives/2009/05/platoon_splits.php.
Appelman, David. 2008. “Get to Know: RE24.”
Fangraphs. http://www.fangraphs.com/blogs/get-to-know-re24/.
Bates, Douglas, Martin Mächler, Ben Bolker, and Steve Walker. 2015.
“Fitting Linear Mixed-Effects Models Using lme4.” Journal of Statistical
Software 67 (1): 1–48. https://doi.org/10.18637/jss.v067.i01.
Baumer, Benjamin S., and Jim Albert. 2024. Abdwr3edata: Companion to
“Analyzing Baseball Data with
R,” 3rd Edition. https://github.com/beanumber/abdwr3edata.
Baumer, Benjamin S., Shane T. Jensen, and Gregory J. Matthews. 2015.
“openWAR: An Open Source System for
Evaluating Overall Player Performance in Major League Baseball.”
Journal of Quantitative Analysis in Sports 11 (2): 69–84. https://doi.org/10.1515/jqas-2014-0098.
Baumer, Benjamin S., Daniel T. Kaplan, and Nicholas J. Horton. 2021a.
Modern Data Science with R. 2nd ed. Boca Raton:
Chapman; Hall/CRC Press. https://www.routledge.com/Modern-Data-Science-with-R/Baumer-Kaplan-Horton/p/book/9780367191498.
———. 2021b. Modern Data Science with
R. 2nd ed. Boca Raton, FL: Chapman; Hall/CRC Press. https://www.routledge.com/Modern-Data-Science-with-R/Baumer-Kaplan-Horton/p/book/9780367191498.
Baumer, Benjamin S., Gregory J. Matthews, and Quang Nguyen. 2023.
“Big Ideas in Sports Analytics and Statistical Tools for Their
Investigation.” Wiley Interdisciplinary Reviews:
Computational Statistics, e1612. https://doi.org/10.1002/wics.1612.
Berry, Scott M. 1991. “The Summer of ’41: A Probabilistic Analysis
of DiMaggio’s “Streak" and Williams’s Average
of .406.” Chance 4 (4): 8–11. https://doi.org/10.1080/09332480.1991.10542337.
Berry, Scott M., C. Shane Reese, and Patrick D. Larkey. 1999.
“Bridging Different Eras in Sports.” Journal of the
American Statistical Association 94 (447): 661–76. https://doi.org/10.1080/01621459.1999.10474163.
Bouzarth, Elizabeth, Benjamin Grannan, John Harris, Andrew Hartley,
Kevin Hutson, and Ella Morton. 2021. “Swing Shift: A Mathematical
Approach to Defensive Positioning in Baseball.” Journal of
Quantitative Analysis in Sports 17 (1): 47–55. https://doi.org/10.1515/jqas-2020-0027.
Bradley, Ralph Allan, and Milton E. Terry. 1952. “Rank Analysis of
Incomplete Block Designs: I. The Method of Paired Comparisons.”
Biometrika 39 (3/4): 324–45. https://doi.org/10.2307/2334029.
Brill, Ryan S., Sameer K. Deshpande, and Abraham J. Wyner. 2023.
“A Bayesian Analysis of the Time Through the Order Penalty in
Baseball.” Journal of Quantitative Analysis in Sports,
no. 0. https://doi.org/10.1515/jqas-2022-0116.
Brooks, Dan, Harry Pavilidis, and Jonathan Judge. 2015. “Moving
Beyond WOWY: A Mixed Approach to Measuring Catcher
Framing.” Baseball Prospectus. https://www.baseballprospectus.com/news/article/25514/moving-beyond-wowy-a-mixed-approach-to-measuring-catcher-framing/.
Brooks, Dan, and Harry Pavlidis. 2014. “Framing and Blocking
Pitches: A Regressed, Probabilistic Model: A New Method for Measuring
Catcher Defense.” Baseball Prospectus. https://www.baseballprospectus.com/news/article/22934/framing-and-blocking-pitches-a-regressed-probabilistic-model-a-new-method-for-measuring-catcher-defense/.
Bukiet, Bruce, Elliotte Rusty Harold, and José Luis Palacios. 1997.
“A Markov Chain Approach to Baseball.”
Operations Research 45 (1): 14–23. https://doi.org/10.1287/opre.45.1.14.
Campitelli, Elio. 2021. metR: Tools for
Easier Analysis of Meteorological Fields. https://doi.org/10.5281/zenodo.2593516.
Caola, Ralph. 2003. “Using Calculus to Relate Runs to Wins: Part
i.” By the Numbers 13: 9–16.
Carl, Sebastian, and Camden Kay. 2023. mlbplotR: Create “ggplot2” and “gt” Visuals with Major League Baseball
Logos. https://CRAN.R-project.org/package=mlbplotR.
Casals, Martı́, José Fernández, Victor Martı́nez, Michael Lopez, Klaus
Langohr, and Jordi Cortés. 2023. “A Systematic Review of
Sport-Related Packages Within the r CRAN Repository.”
International Journal of Sports Science & Coaching 18 (2):
621–29. https://doi.org/10.1177/17479541221136238.
Chang, Winston, Joe Cheng, J. J. Allaire, Carson Sievert, Barret
Schloerke, Yihui Xie, Jeff Allen, Jonathan McPherson, Alan Dipert, and
Barbara Borges. 2024. shiny: Web
Application Framework for r. https://CRAN.R-project.org/package=shiny.
Cleveland, William S. 1979. “Robust Locally Weighted Regression
and Smoothing Scatterplots.” Journal of the American
Statistical Association 74 (368): 829–36. https://doi.org/10.1080/01621459.1979.10481038.
———. 1985. The Elements of Graphing Data. Vol. 2. Monterey, CA:
Wadsworth Advanced Books; Software.
Csárdi, Gábor, Jim Hester, Hadley Wickham, Winston Chang, Martin Morgan,
and Dan Tenenbaum. 2024. remotes: R
Package Installation from Remote Repositories, Including
“GitHub”. https://CRAN.R-project.org/package=remotes.
Dahl, David B., David Scott, Charles Roosen, Arni Magnusson, and
Jonathan Swinton. 2019. xtable: Export
Tables to LaTeX or HTML. https://CRAN.R-project.org/package=xtable.
Davenport, Clay, and Keith Woolner. 1999. “Revisiting the
Pythagorean Theorem.” Baseball Prospectus. www.baseballprospectus.com/article.php?articleid=342.
Deshpande, Sameer K., and Abraham Wyner. 2017. “A Hierarchical
Bayesian Model of Pitch Framing.” Journal of
Quantitative Analysis in Sports 13 (3): 95–112. https://doi.org/10.1515/jqas-2017-0027.
Donoho, David. 2017. “50 Years of Data Science.”
Journal of Computational and Graphical Statistics 26 (4):
745–66. https://doi.org/10.1080/10618600.2017.1384734.
Douglas, Colin, and Richard Scriven. 2024. retrosheet: Import Professional Baseball Data from
“Retrosheet”. https://CRAN.R-project.org/package=retrosheet.
Fair, Ray C. 2008. “Estimated Age Effects in Baseball.”
Journal of Quantitative Analysis in Sports 4 (1). https://doi.org/10.2202/1559-0410.1074.
Fast, Mike. 2010. “What the Heck Is PITCHf/x.” The Hardball Times
Annual 2010: 153–58.
———. 2011. “Spinning Yarn: Removing the Mask Encore
Presentation.” Baseball Prospectus. https://www.baseballprospectus.com/news/article/15093/spinning-yarn-removing-the-mask-encore-presentation/.
Francisco Rodriguez-Sanchez, and Connor P. Jackson. 2023. grateful: Facilitate Citation of r Packages.
https://pakillo.github.io/grateful/.
Friendly, Michael, Chris Dalzell, Martin Monkman, and Dennis Murphy.
2023. Lahman: Sean “Lahman”
Baseball Database. https://CRAN.R-project.org/package=Lahman.
Gerber, Eric A. E., and Bruce A. Craig. 2021. “A Mixed Effects
Multinomial Logistic-Normal Model for Forecasting Baseball
Performance.” Journal of Quantitative Analysis in Sports
17 (3): 221–39. https://doi.org/10.1515/jqas-2020-0007.
Gould, Stephen Jay. 1989. “The Streak of Streaks.”
Chance 2 (2): 10–16. https://doi.org/10.1080/09332480.1989.10554932.
Grolemund, Garrett, and Hadley Wickham. 2011. “Dates and Times
Made Easy with lubridate.”
Journal of Statistical Software 40 (3): 1–25. https://www.jstatsoft.org/v40/i03/.
Harrell Jr, Frank E. 2024. Hmisc: Harrell
Miscellaneous. https://CRAN.R-project.org/package=Hmisc.
Healey, Glenn. 2019. “A Bayesian Method for Computing Intrinsic
Pitch Values Using Kernel Density and Nonparametric Regression
Estimates.” Journal of Quantitative Analysis in Sports
15 (1): 59–74. https://doi.org/10.1515/jqas-2017-0058.
Heipp, B. 2003. “W% Estimators.” Buckeyes and Sabermetrics.
http://gosu02.tripod.com/id69.html.
Hester, Jim, and Davis Vaughan. 2023. bench: High Precision Timing of r
Expressions. https://CRAN.R-project.org/package=bench.
Hester, Jim, Hadley Wickham, and Gábor Csárdi. 2023. fs: Cross-Platform File System Operations Based on
“libuv”. https://CRAN.R-project.org/package=fs.
Hirotsu, Nobuyoshi, and J. Eric Bickel. 2019. “Using a Markov
Decision Process to Model the Value of the Sacrifice Bunt.”
Journal of Quantitative Analysis in Sports 15 (4): 327–44. https://doi.org/10.1515/jqas-2017-0092.
Ismay, Chester, and Albert Y. Kim. 2019. Modern Dive: Statistical
Inference via Data Science. Boca Raton, FL: CRC Press. https://moderndive.com/.
James, Bill. 1980. Baseball Abstract. Lawrence, KS:
self-published.
———. 1982. Baseball Abstract. New York: Ballantine Books.
———. 1994. The Politics of Glory: How Baseball’s Hall of Fame Really Works. London: Macmillan.
Judge, Jonathan. 2018. “Bayesian Bagging to Generate Uncertainty
Intervals: A Catcher Framing Story.” Baseball Prospectus. https://www.baseballprospectus.com/news/article/38289/bayesian-bagging-generate-uncertainty-intervals-catcher-framing-story/.
Kabacoff, Robert I. 2010. R in Action. New York:
Manning Publications.
Kemeny, John G., and James Laurie Snell. 1960. Finite
Markov Chains. Vol. 210. New York:
Springer-Verlag.
Kepner, Tyler. 2011. “A Night of Twists and Collapses.” The
New York Times. https://www.nytimes.com/2011/09/30/sports/baseball/5-hour-joy-ride-like-no-other.html.
Keri, Jonah, and Baseball Prospectus. 2007. Baseball Between the
Numbers: Why Everything You Know about the Game Is Wrong. New York:
Basic Books.
Lahman, Sean. 2018. “Lahman’s Baseball Database,
1871–2017.” seanlahman.com. http://seanlahman.com/.
Lindbergh, Ben. 2013. “The Art of Pitch Framing.”
Grantland. http://grantland.com/features/studying-art-pitch-framing-catchers-such-francisco-cervelli-chris-stewart-jose-molina-others/.
Lindsey, George R. 1963. “An Investigation of Strategies in
Baseball.” Operations Research 11 (4): 477–501. https://doi.org/10.1287/opre.11.4.477.
Lopez, Michael J., Gregory J. Matthews, and Benjamin S. Baumer. 2018.
“How Often Does the Best Team Win? A Unified Approach
to Understanding Randomness in North American
Sport.” The Annals of Applied Statistics 12 (4):
2483–2516. https://doi.org/10.1214/18-AOAS1165.
Marchi, Max. 2010. “Platoon Splits 2.0.” Hardball Times. http://www.hardballtimes.com/main/article/platoon-splits-2.0.
McCotter, Trent. 2010. “Hitting Streaks Don’t Obey Your Rules:
Evidence That Hitting Streaks Are Not Just Byproducts of Random
Variation.” Chance 23 (4): 52–57. https://doi.org/10.1080/09332480.2010.10739837.
Meschiari, Stefano. 2022. Latex2exp: Use LaTeX Expressions in
Plots. https://CRAN.R-project.org/package=latex2exp.
Mühleisen, Hannes, and Mark Raasveldt. 2024. duckdb: DBI Package for the DuckDB Database
Management System. https://CRAN.R-project.org/package=duckdb.
Müller, Kirill. 2020. here: A Simpler
Way to Find Your Files. https://CRAN.R-project.org/package=here.
Müller, Kirill, Jeroen Ooms, David James, Saikat DebRoy, Hadley Wickham,
and Jeffrey Horner. 2023. RMariaDB: Database Interface
and MariaDB Driver. https://CRAN.R-project.org/package=RMariaDB.
Müller, Kirill, Hadley Wickham, David A. James, and Seth Falcon. 2024.
RSQLite: SQLite Interface for r. https://CRAN.R-project.org/package=RSQLite.
Murrell, Paul. 2006. R Graphics. Boca Raton, FL: Chapman &
Hall, CRC Press.
Nathan, Alan M. 2011. “Baseball ProGUESTus: Home Runs
and Humidors: Is There a Connection?” Baseball Prospectus. https://www.baseballprospectus.com/news/article/13057/baseball-proguestus-home-runs-and-humidors-is-there-a-connection/.
Official Playing Rules Committee. 2018. 2018 Official Rules of Major
League Baseball. Chicago, IL: Triumph Books. http://www.triumphbooks.com/2018-official-rules-of-major-league-baseball-products-9781629375434.php.
Palmer, Pete. 1983. “Balls and Strikes.” Baseball Analyst.
http://sabr.org/research/baseball-analyst-archives.
Pankin, Mark D. 1987. “Baseball as a Markov
Chain.” In The Great American Baseball Stat Book, 1st
ed., 520–24. New York: Ballantine Books.
Pedersen, Thomas Lin. 2024. patchwork:
The Composer of Plots. https://CRAN.R-project.org/package=patchwork.
Petti, Bill, and Saiem Gilani. 2024. baseballr: Acquiring and Analyzing Baseball
Data. https://CRAN.R-project.org/package=baseballr.
R Core Team. 2024. R: A Language and Environment for
Statistical Computing. Vienna, Austria: R Foundation for
Statistical Computing. https://www.R-project.org/.
Richardson, Neal, Ian Cook, Nic Crane, Dewey Dunnington, Romain
François, Jonathan Keane, Dragoș Moldovan-Grünfeld, Jeroen Ooms, Jacob
Wujciak-Jens, and Apache Arrow. 2024. arrow: Integration to
“Apache”
“Arrow”. https://CRAN.R-project.org/package=arrow.
Robinson, David, Alex Hayes, and Simon Couch. 2023. broom: Convert Statistical Objects into Tidy
Tibbles. https://CRAN.R-project.org/package=broom.
RStudio Team. 2018. RStudio: Integrated Development Environment for
R. Boston, MA: RStudio, Inc. http://www.rstudio.com/.
Schwarz, Alan. 2004. The Numbers Game: Baseball’s Lifelong
Fascination with Statistics. New York: St. Martin’s Press.
Seidel, Michael. 2002. Streak: Joe DiMaggio and the Summer of
’41. Lincoln, NE: University of Nebraska Press.
Slowikowski, Kamil. 2024. ggrepel:
Automatically Position Non-Overlapping Text Labels with “ggplot2”. https://CRAN.R-project.org/package=ggrepel.
Tango, Tom M., Mitchel G. Lichtman, and Andrew E. Dolphin. 2007. The
Book: Playing the Percentages in Baseball. Dulles, VA: Potomac
Books, Inc.
Turkenkopf, Dan. 2008. “Framing the Debate.” Beyond the Box
Score. https://www.beyondtheboxscore.com/2008/4/5/389840/framing-the-debate.
Venables, W. N., D. M. Smith, and the R Development Core Team. 2011.
“An Introduction to R: Notes on R, a
Programming Environment for Data Analysis and Graphics, v.
2.13.0.”
Walsh, John. 2008. “Searching for the Game’s Best Pitch.”
The Hardball Times. http://www.hardballtimes.com/main/article/searching-for-the-games-best-pitch/.
———. 2010. “The Compassionate Umpire.” The Hardball Times.
http://www.hardballtimes.com/main/article/the-compassionate-umpire/.
Waring, Elin, Michael Quinn, Amelia McNamara, Eduardo Arino de la Rubia,
Hao Zhu, and Shannon Ellis. 2022. skimr:
Compact and Flexible Summaries of Data. https://CRAN.R-project.org/package=skimr.
Wickham, Hadley. 2014. “Tidy Data.” Journal of
Statistical Software 59 (10): 1–23. https://doi.org/10.18637/jss.v059.i10.
———. 2016b. ggplot2: Elegant Graphics
for Data Analysis. New York: Springer.
———. 2016a. Ggplot2: Elegant Graphics for Data Analysis. New
York: Springer-Verlag. https://ggplot2.tidyverse.org.
———. 2022. lobstr: Visualize r Data
Structures with Trees. https://CRAN.R-project.org/package=lobstr.
———. 2023a. downlit: Syntax Highlighting
and Automatic Linking. https://CRAN.R-project.org/package=downlit.
———. 2023b. modelr: Modelling Functions
That Work with the Pipe. https://CRAN.R-project.org/package=modelr.
———. 2023c. stringr: Simple, Consistent
Wrappers for Common String Operations. https://CRAN.R-project.org/package=stringr.
———. 2024. rvest: Easily Harvest
(Scrape) Web Pages. https://CRAN.R-project.org/package=rvest.
Wickham, Hadley, Mara Averick, Jennifer Bryan, Winston Chang, Lucy
D’Agostino McGowan, Romain François, Garrett Grolemund, et al. 2019.
“Welcome to the tidyverse.”
Journal of Open Source Software 4 (43): 1686. https://doi.org/10.21105/joss.01686.
Wickham, Hadley, Mine Çetinkaya-Rundel, and Garrett Grolemund. 2023.
R for Data Science. 2nd ed. Sebastapol, CA: O’Reilly Media,
Inc. https://r4ds.hadley.nz/.
Wickham, Hadley, Romain François, Lionel Henry, Kirill Müller, and Davis
Vaughan. 2023. dplyr: A Grammar of Data
Manipulation. https://CRAN.R-project.org/package=dplyr.
Wickham, Hadley, Maximilian Girlich, and Edgar Ruiz. 2024. dbplyr: A “dplyr” Back End for Databases. https://CRAN.R-project.org/package=dbplyr.
Wickham, Hadley, Jim Hester, and Jennifer Bryan. 2024. readr: Read Rectangular Text Data. https://CRAN.R-project.org/package=readr.
Wickham, Hadley, Jim Hester, and Jeroen Ooms. 2023. Xml2: Parse
XML. https://CRAN.R-project.org/package=xml2.
Wilkinson, Leland. 2006. The Grammar of Graphics. New York:
Springer Science & Business Media.
Wood, S. N. 2003. “Thin-Plate Regression Splines.”
Journal of the Royal Statistical Society (B) 65 (1): 95–114. https://doi.org/10.1111/1467-9868.00374.
———. 2004. “Stable and Efficient Multiple Smoothing Parameter
Estimation for Generalized Additive Models.” Journal of the
American Statistical Association 99 (467): 673–86. https://doi.org/10.1198/016214504000000980.
———. 2011. “Fast Stable Restricted Maximum Likelihood and Marginal
Likelihood Estimation of Semiparametric Generalized Linear
Models.” Journal of the Royal Statistical Society (B) 73
(1): 3–36. https://doi.org/10.1111/j.1467-9868.2010.00749.x.
———. 2017. Generalized Additive Models: An Introduction with
R. 2nd ed. Boca Raton: Chapman; Hall/CRC.
Wood, S. N., Natalya Pya, and B. Säfken. 2016. “Smoothing
Parameter and Model Selection for General Smooth Models (with
Discussion).” Journal of the American Statistical
Association 111: 1548–75. https://doi.org/10.1080/01621459.2016.1180986.
Xie, Yihui. 2014. “knitr: A
Comprehensive Tool for Reproducible Research in R.”
In Implementing Reproducible Computational Research, edited by
Victoria Stodden, Friedrich Leisch, and Roger D. Peng. Boca Raton:
Chapman; Hall/CRC.
———. 2015. Dynamic Documents with R and Knitr. 2nd
ed. Boca Raton, Florida: Boca Raton: Chapman; Hall/CRC. https://yihui.org/knitr/.
———. 2023. knitr: A General-Purpose
Package for Dynamic Report Generation in r. https://yihui.org/knitr/.
Xie, Yihui, J. J. Allaire, and Garrett Grolemund. 2018. R Markdown:
The Definitive Guide. Boca Raton, Florida: Boca Raton: Chapman;
Hall/CRC. https://bookdown.org/yihui/rmarkdown.
Xie, Yihui, Christophe Dervieux, and Emily Riederer. 2020. R
Markdown Cookbook. Boca Raton, Florida: Boca Raton: Chapman;
Hall/CRC. https://bookdown.org/yihui/rmarkdown-cookbook.
Zeileis, Achim, and Gabor Grothendieck. 2005. “zoo: S3 Infrastructure for Regular and Irregular
Time Series.” Journal of Statistical Software 14 (6):
1–27. https://doi.org/10.18637/jss.v014.i06.
Zhu, Hao. 2024. kableExtra: Construct
Complex Table with “kable” and
Pipe Syntax. https://CRAN.R-project.org/package=kableExtra.