This dataset is a subset of the [KDDCup 2012 track 2](https://www.kaggle.com/competitions/kddcup2012-track2/) data, created by Manu Joseph and Harsh Raj for the paper:
> Joseph, M., & Raj, H. (2022).
> GATE: Gated Additive Tree Ensemble for Tabular Classification and Regression.
> arXiv preprint arXiv:2207.08548v4.

We retrieved the data from [Dropbox](https://www.dropbox.com/s/ry6zsr6qtuz8l5z/click_set_1_.pickle?dl=1).
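
For readers who want to repeat the retrieval, the following is a minimal sketch, not the procedure actually used for this upload. It assumes the Dropbox link still serves a standard Python pickle; the helper names `download` and `load_pickle` and the local file name are hypothetical. Unpickling can execute arbitrary code, so only load files from sources you trust. If the file holds a pandas object (an assumption), pandas must be installed for the load to succeed.

```python
import pickle
import urllib.request
from pathlib import Path

# URL copied from the description above.
DATA_URL = "https://www.dropbox.com/s/ry6zsr6qtuz8l5z/click_set_1_.pickle?dl=1"


def download(url, dest):
    """Download `url` to `dest` unless the file already exists (hypothetical helper)."""
    dest = Path(dest)
    if not dest.exists():
        urllib.request.urlretrieve(url, str(dest))
    return dest


def load_pickle(path):
    """Load a pickled object from a local file (hypothetical helper).

    Warning: pickle can run arbitrary code on load; use trusted files only.
    """
    with open(path, "rb") as f:
        return pickle.load(f)


# Example usage (performs a network request, so it is left commented out):
# data = load_pickle(download(DATA_URL, "click_set_1_.pickle"))
```

The download and load steps are kept separate so that the file can be inspected, or its checksum verified, before it is unpickled.
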

Note: please read the Kaggle dataset description carefully before using this dataset. It consists mostly of IDs that are meant to be looked up in other files; those files are not on OpenML and were not used in the benchmark in the paper cited above.