Skip to content

wdonofrio/MGT6203-GenderPayGapAnalysis

Repository files navigation

Team-26

Team 26's group project GitHub repository for MGT 6203 (Canvas) Fall of 2023 semester.

Getting Started

Proposal, Progress, and Final Report

Begin reviewing this analysis by reading the reports curated, which walk through the team's hypothesis, process, and conclusions regarding this analysis.
The reports should provide a detailed description of the entire analysis.

Primary Resource (Software)

The supporting software that enabled this analysis can be found under Code/, which contains all the software related to this analysis.

The program can be executed using the latest version of R (>= 4.3.1), and does require various packages to be installed. These packages are listed in the requirements.txt file of the project, however, the program will install the packages upon execution as well.

The program relies on the data being pre-downloaded into Data/. If executing the program, please ensure the following csv files have been downloaded:

A rendered/knitted html file is also provided here if the program cannot be executed directly.

Supporting Resources (Software)

Additional software can be found under Other Resources. This section includes exploratory software that was not used in the final report, but was crucial for the initial exploration and review of this analysis.

Visualizations (PDFs)

Visualizations used in this analysis can be found under Visualizations/.

Team Schedule

October 11th - Data Cleaning and Preparation:

  • Exploratory Data Analysis (EDA)
  • Handling missing data, outliers, preprocessing
  • Documenting preprocessing techniques

October 15th - Model Creation and Evaluation:

  • Linear Regression Models, including evaluation
  • Logistic Regression Models, including evaluation
  • Assess additional models ( ideally choose 2 more)

November 4th - Progress Report:

  • Hypothesis Testing, significance, variables impact

November 11th - Additional Models and Data Visualization:

  • Additional model work (if needed) by Jamie/Emmett (Reviewer - William & Jamie)
  • Data visualization to illustrate disparities by William/Sanjay(Reviewer - Marissa)
  • Begin writing the final report by Marissa

November 14th - Receive TA Feedback

  • Receive TA Feedback

Due November 15th

  • Sanjay - Hypothesis Testing
  • Jamie - Logistic Regression Model Validation ( Possibly Tuning )
  • William - Gender Breakdown by Role and Department
  • Emmett - Validate Hypothesis Testing and add that portion to the final paper
  • Marissa - Validate Jamie & Williams work and add to final paper

November 18th - Final Report:

  • Overview of data by Jamie
  • Describing the cleaning process, key variables, dataset by Sanjay

November 25th - Final Report:

  • Overview of modeling by Marissa
  • Describing the models used and selection process by Jamie/Emmett
  • In-depth discussion of model performance by William/Sanjay

December 3rd - Final Report Due

December 6th - Teammate Evaluation Due

About

Kaggle Gender Pay Gap Analysis

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages