Practicality of Using Transformations in Multiple Linear Regression

In our previous research, Gender Wage Inequality in STEM, my colleagues and I used multiple linear regression (MLR) to explore the relationship between the gender demographics and median salary of STEM major categories. Our final model applied an inverse transformation to the response variable to improve the model fit. Transforming the response and/or explanatory variables is common practice among statisticians and can produce a better-fitting model, but the resulting models are harder to explain to a general audience.

Research Goals

In this project, I compared the MLR model from my previous project, which uses the inverse-transformed response variable $Median^{-1}$, to a comparable model fit to the untransformed response, $Median$. My goal was to measure how much predictive power is lost by fitting an MLR model without a transformed response variable, and to judge whether the gain from transforming is worth the loss of an easily explained model.
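As a rough illustration of this kind of comparison, the sketch below fits one model to the raw response and one to its reciprocal, then back-transforms the second model's predictions so both errors are measured in the same units. It is a minimal sketch, not the analysis in this repository: the project itself is written in R, the data values are made up for illustration (they are not taken from women-stem.csv), only a single predictor is used instead of a full MLR, and names such as `share_women`, `fit_ols`, and `rmse` are hypothetical.

```python
import math


def fit_ols(x, y):
    """Return (intercept, slope) of the least-squares line for one predictor."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def rmse(actual, predicted):
    """Root mean squared error, in the units of `actual`."""
    return math.sqrt(
        sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    )


# Made-up illustrative values, NOT from women-stem.csv.
share_women = [0.10, 0.20, 0.30, 0.40, 0.50, 0.60, 0.70, 0.80]
median = [62000, 55000, 52000, 46000, 42000, 40000, 36000, 35000]

# Model A: untransformed response, Median ~ share_women.
a0, a1 = fit_ols(share_women, median)
pred_raw = [a0 + a1 * s for s in share_women]

# Model B: inverse-transformed response, 1/Median ~ share_women.
b0, b1 = fit_ols(share_women, [1 / m for m in median])
# Back-transform by taking the reciprocal so predictions are in dollars again.
pred_inverse = [1 / (b0 + b1 * s) for s in share_women]

# Both errors are now on the original salary scale and directly comparable.
rmse_raw = rmse(median, pred_raw)
rmse_inverse = rmse(median, pred_inverse)
print(f"RMSE, untransformed model: {rmse_raw:.0f}")
print(f"RMSE, inverse model (back-transformed): {rmse_inverse:.0f}")
```

Comparing errors on the original scale matters because fit statistics computed on $Median^{-1}$ and on $Median$ are in different units. The sketch also shows where interpretability is lost: in the inverse model the slope describes a change in $1/Median$, not a change in dollars.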

Dataset Used

To address this problem, I used a subset of the College Majors dataset from FiveThirtyEight, found here: https://github.com/fivethirtyeight/data/blob/master/college-majors/women-stem.csv

Tools Used

Packages: tidyverse, ggpubr, easystats, lindia, ggstatsplot
Statistical Tests & Analyses: Box-Cox, Step-wise selection, Model Diagnostics

lgibson7/Women-in-STEM

Folders and files

Latest commit

History

Repository files navigation

Practicality of Using Transformations in Multiple Linear Regression

Research Goals

Dataset Used

Tools Used

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages