Skip to content

Using Framingham Heart Disease data I will use Logisitic Regression to determine Probability of Heart Disease and its factors.

License

Notifications You must be signed in to change notification settings

mccurcio/Logistic_with_R

Repository files navigation

Logistic_with_R

To view report: An Investigation Of The Framingham Heart Study

  1. Purpose: This R markdown document displays my understanding of logistic regression and R. This report is part one of two articles describing Logit.

    A. Logisitc Regression Report B. Under construction

  2. Data: Framingham Heart Disease Study, FHS_data

  3. Conclusion: We find seven (7) factors and their related odds leading to cardiovascular disease.

No. Factors Approximate Odds Over Mean
1 Prevalence Of Stroke In Family History.       240%
2 Male Vs Female       150%
3 Prevalence Of Hypertension In Family History       130%
4 Age < 2,800%
5 Cigarettes Per Day <    210%
6 Systolic Blood Pressure <    780%
7 Glucose Levels <    250%

NOTES: I will use Logistic Regression to determine the Probability of Heart Disease and its factors.

Assumptions of Logistic Regression

  1. Logistic Regression does not require a linear relationship between the independent and dependent variables.

  2. The residuals from the model do not need to follow the normal distribution.

  3. Logistic Regression does not require the assumption of homoscedasticity. Homoscedasticity means all the variables in the model have the same variance. So, the variables may have different variances in the Logistic Regression model.

  4. The dependent variable in Logistic Regression is not measured on an interval or ratio scale.

  5. Logistic Regression algorithm requires little or no multicollinearity among the independent variables. It means that the independent variables should not be too highly correlated with each other.

  6. Logistic Regression model assumes linearity of independent variables and log odds.

  7. The success of Logistic Regression model depends on the sample sizes. Typically, it requires a large sample size to achieve the high accuracy.

About

Using Framingham Heart Disease data I will use Logisitic Regression to determine Probability of Heart Disease and its factors.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages