Preprint / Version 1

Predicting ATP Player Tennis Performance Using Machine Learning

##article.authors##

  • Arjun Kamra Jesuit High School

DOI:

https://doi.org/10.58445/rars.3027

Keywords:

Tennis, Machine Learning, Prediction Model, Logistic Regression, Linear Regression

Abstract

Predicting performance in professional tennis has applications for analysts, coaches, and betting strategies. This study investigates whether machine learning models can accurately predict Association of Tennis Professionals (ATP) match outcomes based on historical player data. Statistics such as serve percentage, break point conversion, and win-loss records were compiled from Ultimate Tennis Statistics and ATP Tour databases. Three models were tested: linear regression, logistic regression, and random forest classifiers. Model performance was compared using accuracy, R², and F1 score. Random forest achieved the highest accuracy (93.36%), followed by logistic regression (91.15%), while linear regression produced a moderate correlation (R² = 0.544). Serve consistency and break point conversion emerged as key indicators of success. Results suggest that ensemble-based models are most effective for capturing the non-linear relationships in tennis performance. While the study is limited by reliance on career-level statistics, these findings highlight the potential for machine learning to enhance tennis analytics, strategic planning, and predictive applications.

References

“ATP Player Statistics.” ATP Tour, www.atptour.com. Accessed 23 Aug. 2025.

Boulier, Bryan L., and Howard O. Stekler. “Predicting the Outcomes of Tennis Matches.” International Journal of Forecasting, vol. 19, no. 2, 2003, pp. 155–170.

Cai, Li, et al. “Applying Machine Learning to Predict Grand Slam Tennis Matches.” Data Mining in Sports Journal, vol. 8, no. 1, 2021, pp. 45–61.

De Seranno, Andrea. “Predicting ATP Singles Match Winners Using Machine Learning.” Journal of Sports Analytics, vol. 5, no. 3, 2019, pp. 211–229.

Gao, Jun, and Adam Kowalczyk. “Random Forest Models for Predicting Tennis Match Outcomes.” Journal of Quantitative Analysis in Sports, vol. 12, no. 4, 2020, pp. 301–315.

McHale, Ian G., and Amanda Morton. “A Bradley–Terry-Type Model for Forecasting Tennis Match Results.” International Journal of Forecasting, vol. 27, no. 2, 2011, pp. 619–630.

Knottenbelt, William J., et al. “A Common-Opponent Stochastic Model for Predicting Professional Tennis Matches.” International Journal of Computer Science in Sport, vol. 9, no. 2, 2010, pp. 67–79.

“Ultimate Tennis Statistics Database.” Ultimate Tennis Statistics, www.ultimatetennisstatistics.com. Accessed 23 Aug. 2025.

Downloads

Posted

2025-09-06