Skip to content

v-ade-r/Top0.02-Kaggle-House-Prices

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Top2% Kaggle House Prices

Code for the "House Prices - Advanced Regression Techniques" competition on Kaggle.

It is not the the code from A to Z, ready to copy/paste. It shows the most important things, but omits some parts of manual testing and changing the hypertuning search spaces. It also doesn't contain EDA (Exploratory data analysis) except one general idea. I did a lot of feature visualization, comparisons, engineering on the fly, however adding this would have lengthen the code and reduced its clarity, so I opted against it. Finally I could have added more functions to reduce unnecessary code, but practicing data manipulation syntax was one of my goals, so again I decided not to.

You can learn from this code how to:

  1. Prepare data for Machine Learning models (different methods of semi-manual filling NaNs, removing skewness, one-hot-encoding with pandas).
  2. Engineer new features and map existing ones to the better spaces for ML models.
  3. Tune hyperparameters of the most efficient ML models and Neural Nets for regression (with Optuna).
  4. Stack models together, creating ensembles and get even better results.

About

ML in regression problem. Code for the competition on Kaggle.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages