Skip to content

DSCI 310 Group 16 project on Predicting Win Rate of Tennis Players done in Jupyter Notebook using R

License

Notifications You must be signed in to change notification settings

DSCI-310/tennis-players-win-rate

Repository files navigation

Predicting Win Rate of Tennis Players

DSCI 310 Group 16 project on Predicting Win Rate of Tennis Players done using R.

Table of Contents

=================

=================

Authors/Contributors:

  • Ammar Bagharib
  • Miles Brodie
  • Sammy Gulipalli

Summary of Project

The goal of this project is to predict the win rate of tennis players using machine learning techniques. The motivation behind this project is to explore the relationship between various player statistics and their win rate, and to develop a model that can accurately predict a player's win rate based on these statistics. The analysis has been carried out in R using various packages from the tidyverse and tidymodels ecosystems.

Important Project Rules and Regulations

Detailed Analysis

The detailed analysis files can be found here

Dependencies required

The following are dependencies used within this project, and that which is necessary to reproduce an identical analysis.

Package Name Version
R 4.1.3
data.table 1.12.8
GGally 2.1.0
here 1.0.1
kknn 1.3.1
tidymodels 0.1.1
tidyverse 1.3.0
knitr 1.4.1

see Dockerfile

How to reproduce this project's Analysis.

1. Set Up Your Environment

  • Sign up/ Log in a Docker account.

  • Install Docker in your computer.

Fork this project's repository on GitHub and then clone the fork to your local machine. For more details on forking see the GitHub Documentation. Type in the following command in your Terminal.

git clone https://github.com/mjbrodie/dsci-310-group-16

To keep your fork up to date with changes in this repo, you can use the fetch upstream button on GitHub. More details regarding fork syncing, e.g., syncing from your terminal instead of directly on Github can be found here.

Then, fire up your Terminal on your local machine, and using Docker, follow these steps to reproduce this analysis:

2. Pull down docker image

Type in the following command in Terminal.

docker pull mjbrodie/dsci-310-group-project:latest

3. Run docker image

Run the Docker appliation on your local machine. Then, run the following in your terminal.

docker run --platform linux/amd64 -p 8787:8787 -e DISABLE_AUTH=true mjbrodie/dsci-310-group-project:latest

4. Reproduce Analysis

Run your preferred browser, (we used Chrome) and visit http://localhost:8787

Once done, within the terminal in the Rstudio, run:

make all

It should take max 10 minutes to run, be patient! Once done, you should see the following message:

Output created: Predicting_Win_Rate_of_Tennis_Players.html

There'll be many new files created, you can explore them within the data and Analysis folder on your Rstudio environment.

If you'd like to restart your workflow, within the terminal in the Rstudio, run:

make clean

All the best!!

Licenses

  • MIT license for project analysis (completed in Rmarkdown)

About

DSCI 310 Group 16 project on Predicting Win Rate of Tennis Players done in Jupyter Notebook using R

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

No packages published

Contributors 3

  •  
  •  
  •