We will use a Kaggle dataset to analyze:
- Whether COVID has a greater effect on older people
- Whether COVID has a greater effect on a specific gender
In the R console, run:
install.packages("Hmisc")
- Collect data
- Clean data
- Analyze age hypothesis
- Analyze gender hypothesis
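
As a rough illustration of the collect and clean steps, the sketch below builds a tiny synthetic data frame in place of the Kaggle file and prepares it for the two hypotheses. The column names (`age`, `gender`, `death`) and the rule that any `death` value other than `"0"` marks a death are assumptions made for this example, not details taken from the dataset.

```r
# Toy stand-in for the Kaggle data (synthetic rows, hypothetical column names).
# In practice this would come from something like read.csv() on the downloaded file.
raw <- data.frame(
  age    = c(34, 61, NA, 78, 45, 52),
  gender = c("male", "female", "male", NA, "female", "male"),
  death  = c("0", "2/14/2020", "0", "1", "0", "0"),
  stringsAsFactors = FALSE
)

# Clean: turn the death column into a 0/1 flag.
# Assumption: any value other than "0" (a 1 or a date) means the patient died.
clean <- raw
clean$death_flag <- as.integer(clean$death != "0")

# Drop rows that are missing the variables the hypotheses need.
clean <- clean[!is.na(clean$age) & !is.na(clean$gender), ]

# Split into the two groups that the age hypothesis compares.
dead  <- subset(clean, death_flag == 1)
alive <- subset(clean, death_flag == 0)
```

The same `death_flag` column can then be compared across `gender` for the second hypothesis.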
Ideally, if the p-value < 0.05, we reject the null hypothesis. Here, the p-value ≈ 0, so we reject the null hypothesis and conclude that the result is statistically significant.
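
The text does not say which test produced this p-value. One common choice for comparing mean age between two groups is Welch's two-sample t-test, sketched below with base R's `t.test`. The ages are randomly generated for illustration only, and the variable names are hypothetical.

```r
set.seed(1)

# Synthetic ages for illustration only (not the Kaggle data).
age_dead  <- rnorm(60,  mean = 68, sd = 12)
age_alive <- rnorm(600, mean = 48, sd = 18)

# H0: the mean age is the same in both groups.
# H1: people who died are older on average.
res <- t.test(age_dead, age_alive,
              alternative = "greater", conf.level = 0.95)

res$p.value   # compare against the 0.05 threshold
res$conf.int  # one-sided confidence interval for the difference in means

# Decision rule described in the text: reject H0 when p-value < 0.05.
reject_null <- res$p.value < 0.05
```

With real data, `age_dead` and `age_alive` would be the `age` columns of the two cleaned subsets.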
Here we found that men have a 0.8% to 8.8% higher chance of dying. The p-value = 0.002 < 0.05, so this difference is statistically significant.
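
An interval like this, together with a p-value, can come from a two-sample comparison of death rates. The sketch below uses base R's `prop.test` as one possible approach. The counts are made up for illustration and will not reproduce the numbers above, and the source does not confirm which test was actually used.

```r
# Hypothetical counts for illustration only (not the Kaggle data).
deaths <- c(men = 40,  women = 15)   # number of deaths per group
totals <- c(men = 500, women = 400)  # number of patients per group

# H0: the death rate is the same for men and women.
res <- prop.test(x = deaths, n = totals, conf.level = 0.95)

res$estimate  # observed death rate in each group
res$conf.int  # confidence interval for (rate for men - rate for women)
res$p.value   # compare against the 0.05 threshold

# The difference is significant at the 5% level when p-value < 0.05;
# equivalently, the two-sided 95% interval then excludes zero.
significant <- res$p.value < 0.05
```

If the lower bound of `res$conf.int` is above zero, the data suggest a higher death rate for men, which is how the 0.8% to 8.8% range would be read.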