Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark,
transformed in the same way. This dataset belongs to the "regression on categorical and
numerical features" benchmark. Original description:
Author:
Source: Unknown - Date unknown
Please cite:
analcatdata A collection of data sets used in the book "Analyzing Categorical Data,"
by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission
consists of a zip file containing two versions of each of 84 data sets,
plus this README file. Each data set is given in comma-delimited ASCII
(.csv) form, and Microsoft Excel (.xls) form.
NOTICE: These data sets may be used freely for scientific, educational and/or
noncommercial purposes, provided suitable acknowledgment is given (by citing
the above-named reference).
Further details concerning the book, including information on statistical software
(including sample S-PLUS/R and SAS code), are available at the web site
http://www.stern.nyu.edu/~jsimonof/AnalCatData
Information about the dataset
CLASSTYPE: numeric
CLASSINDEX: none specific
Note: Quotes, Single-Quotes and Backslashes were removed, Blanks replaced
with Underscores