Skip to content

Latest commit

 

History

History
40 lines (34 loc) · 1.14 KB

File metadata and controls

40 lines (34 loc) · 1.14 KB

California Housing Price Prediction:

This California Census Data is taken from the US Census Bureau, which has 20,640 records and 10 different variables:  Longitude  Latitude  Housing median age  Total rooms  Total bedrooms  Population  Households  Median income  Ocean proximity  Median house value

This project has been completed in three different parts as given below:

(1) Data Wrangling: • Checking missing values • Filling missing values • Creating new features • Converting categorical variables into numerical • Standardizing data • Checking multicollenearity

(2) EDA (Exploratory Data Analysis): • Distribution of households with different visualizations • Heat Map and other visualizations to see the correlations between variables. • Checking Outliers with boxplots and scatter plots • Detecting and Removing Outliers • Checking Skewness of features

(3) Machine Learning Algorithms to train data:

• Linear Regression • Ridge Regression • Support Vector Machine Regression • Decision Tree Regression • Random Forest Regression • Gradient Boosting Regression • XGBoost Regression