A repository containing all the projects that were carried out during my Applied Data Science course learning at WorldQuant University.
- Predicting House Prices in Mexico
- Predicting Apartment Prices in Bueno Aires
- Predicting Air Quality in Nairobi
- Predicting Earthquake Damage in Nepal
- Predicting Bankruptcy in Poland
In this assignment, you'll work with a dataset of homes for sale in Brazil. Your goal is to determine if there are regional differences in the real estate market. Also, you will look at southern Brazil to see if there is a relationship between home size and price, similar to what you saw with housing in some states in Mexico.
This was similar to the first project but we had to create a wrangle function that could do the following:
- Subset the data in the CSV file and return only apartments in Mexico City ("Distrito Federal") that cost less than $100,000.
- Remove outliers by trimming the bottom and top 10% of properties in terms of "surface_covered_in_m2".
- Create separate "lat" and "lon" columns.
- Mexico City is divided into 16 boroughs. Create a "borough" feature from the "place_with_parent_names" column.
- Drop columns that are more than 50% null values.
- Drop columns containing low- or high-cardinality categorical values.
- Drop any columns that would constitute leakage for the target "price_aprox_usd".
- Drop any columns that would create issues of multicollinearity.
In this project we used a database server(MongoDB) and also predicted air quality in Nairobi Kenya for particular times.
In this assignment, we built a classification model to predict building damage for the district of Kavrepalanchok.
- Linear Regression
- Logistic Regression
- Random Forest
- Decision Trees