Data
OpenML
Help
Sign in
×
Sign in
No account? Join OpenML
Forgot password
×
JavaScript is required to properly view the contents of this page!
OpenML
Explore
Data
Task
Flow
Run
Study
Task type
Measure
People
Help
Blog
Contact
Please cite us
mlr_xgboost_rng
ARFF
CSV
JSON
XML
RDF
mlr_xgboost_rng
active
ARFF
Attribution-ShareAlike (CC BY-SA)
Visibility: public
Uploaded 23-05-2020 by
Pieter Gijsbers
0 likes
downloaded by 0 people , 0 total downloads
0 issues
0 downvotes
Machine Learning
Meteorology
Add tag
Issue
#Downvotes for this reason
By
Loading wiki
Help us complete this description
Edit
Experiment data obtained by running random configurations of xgboost through mlr on 118 different classification tasks from openml. Parameter descriptions: https://xgboost.readthedocs.io/en/latest/parameter.html
21 features
perf.mmce
(target)
numeric
717489 unique values
0 missing
perf.logloss
(target)
numeric
2477560 unique values
0 missing
dataset
string
119 unique values
0 missing
learner
string
3 unique values
0 missing
traintime
numeric
855953 unique values
0 missing
predicttime
numeric
371639 unique values
0 missing
gamma
numeric
43779 unique values
358433 missing
num.impute.selected.cpo
string
3 unique values
0 missing
nrounds
numeric
2623 unique values
0 missing
eta
numeric
43526 unique values
358433 missing
lambda
numeric
48477 unique values
0 missing
alpha
numeric
48449 unique values
0 missing
subsample
numeric
47500 unique values
0 missing
max_depth
numeric
31 unique values
358433 missing
min_child_weight
numeric
43364 unique values
358433 missing
colsample_bytree
numeric
43166 unique values
358433 missing
colsample_bylevel
numeric
43126 unique values
358433 missing
booster
string
3 unique values
0 missing
rate_drop
numeric
16473 unique values
2450204 missing
skip_drop
numeric
16471 unique values
2450204 missing
task_id
numeric
119 unique values
0 missing
Show all 21 features
19 properties
NumberOfInstances
2955210
Number of instances (rows) of the dataset.
NumberOfFeatures
21
Number of attributes (columns) of the dataset.
NumberOfClasses
Number of distinct values of the target attribute (if it is nominal).
NumberOfMissingValues
7051006
Number of missing values in the dataset.
NumberOfInstancesWithMissingValues
2450204
Number of instances with at least one value missing.
NumberOfNumericFeatures
17
Number of numeric attributes.
NumberOfSymbolicFeatures
0
Number of nominal attributes.
Dimensionality
0
Number of attributes divided by the number of instances.
PercentageOfNumericFeatures
80.95
Percentage of numeric attributes.
MajorityClassPercentage
Percentage of instances belonging to the most frequent class.
PercentageOfSymbolicFeatures
0
Percentage of nominal attributes.
MajorityClassSize
Number of instances belonging to the most frequent class.
MinorityClassPercentage
Percentage of instances belonging to the least frequent class.
MinorityClassSize
Number of instances belonging to the least frequent class.
NumberOfBinaryFeatures
0
Number of binary attributes.
PercentageOfBinaryFeatures
0
Percentage of binary attributes.
PercentageOfInstancesWithMissingValues
82.91
Percentage of instances having missing values.
AutoCorrelation
Average class difference between consecutive instances.
PercentageOfMissingValues
11.36
Percentage of missing values.
Show all 19 properties
7 tasks
Clustering on mlr_xgboost_rng
0 runs
- estimation_procedure: 50 times Clustering
Clustering on mlr_xgboost_rng
0 runs
- estimation_procedure: 50 times Clustering
Clustering on mlr_xgboost_rng
0 runs
- estimation_procedure: 50 times Clustering
Clustering on mlr_xgboost_rng
0 runs
- estimation_procedure: 50 times Clustering
Clustering on mlr_xgboost_rng
0 runs
- estimation_procedure: 50 times Clustering
Clustering on mlr_xgboost_rng
0 runs
- estimation_procedure: 50 times Clustering
Clustering on mlr_xgboost_rng
0 runs
- estimation_procedure: 50 times Clustering
Define a new task