Ames House Price Prediction
The Housing prices competition is with over 45.000 participating teams and individuals one of the most popular Machine Learning competition on Kaggle. The goal of this competition is to predict the sale price of residential homes in Ames (Iowa) using 79 explanatory variables.
For this project I performed a variety of actions including:
- Imputing missing data using Simple imputer, KNN imputer and Mice imputer
- Creating variety of new binary features, features representing the number of years since the house was last remoddeled and features indicating the proximity to the train station
- Exploring the effect of features on the sale price using scatter and bar plots
- Testing the difference of using log-transformed vs untransformed sale price on RMSE
- Optimizing Ridge regression using GridSearch CV to obtain a model with a top 9% score on the public Kaggle leaderboard
Click here to view this project on GitHub.