LightGBM with row aggregations

dromedary camel 🐪

Features

Recipe for row aggregations is simple: take single row from the dataset and calculate several summary statistics of that row, for example: mean, max, min, std, count_non_zero, fraction_non_zero. These aggregations are implemented in the feature_extraction.py:L111.

In the future solution we will add much more aggregations.

Model

LightGBM with our steppy-style wrapper.

Results

lightGBM on row aggregations data: 1.36 CV and 1.48 LB
lightGBM with both raw features and row aggregations: 1.35 CV and 1.41 LB 🏆

Combined raw features with row aggregations led us to the great increase in the both CV and LB results.

Pipeline diagram

pipeline-solution-3

check our GitHub organization https://github.com/neptune-ml for more cool stuff 😃

Kamil & Kuba, core contributors

Open solutions

honey bee 🐝 LightGBM and 5fold CV
beetle 🪲 LightGBM on binarized dataset
dromedary camel 🐪 LightGBM with row aggregations
whale 🐳 LightGBM on dimension reduced dataset
water buffalo 🐃 Exploring various dimension reduction techniques
blowfish 🐡 bucketing row aggregations

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

LightGBM with row aggregations

dromedary camel 🐪

Features

Model

Results

Pipeline diagram

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Open solutions

Clone this wiki locally