This repository was archived by the owner on Jun 22, 2022. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 21
LightGBM with row aggregations
Kamil A. Kaczmarek edited this page Jul 10, 2018
·
2 revisions
Recipe for row aggregations is simple: take single row from the dataset and calculate several summary statistics of that row, for example: mean, max, min, std, count_non_zero, fraction_non_zero. These aggregations are implemented in the feature_extraction.py:L111.
In the future solution we will add much more aggregations.
LightGBM with our steppy-style wrapper.
- lightGBM on row aggregations data:
1.36 CV
and1.48 LB
- lightGBM with both raw features and row aggregations:
1.35 CV
and1.41 LB
🏆
Combined raw features with row aggregations led us to the great increase in the both CV and LB results.
check our GitHub organization https://github.com/neptune-ml for more cool stuff 😃
Kamil & Kuba, core contributors
- honey bee 🐝 LightGBM and 5fold CV
- beetle 🪲 LightGBM on binarized dataset
- dromedary camel 🐪 LightGBM with row aggregations
- whale 🐳 LightGBM on dimension reduced dataset
- water buffalo 🐃 Exploring various dimension reduction techniques
- blowfish 🐡 bucketing row aggregations