Click_prediction_small
Status: active
Format: ARFF
Visibility: public
Uploaded 07-01-2019 by Florian Pargent
This is the same data as version 5 (OpenML ID = 1220) with '_id' features coded as nominal factor variables.
12 features
click (target): nominal, 2 unique values, 0 missing
impression: numeric, 99 unique values, 0 missing
url_hash: numeric, 6941 unique values, 0 missing
ad_id: nominal, 19228 unique values, 0 missing
advertiser_id: nominal, 6064 unique values, 0 missing
depth: numeric, 3 unique values, 0 missing
position: numeric, 3 unique values, 0 missing
query_id: numeric, 30748 unique values, 0 missing
keyword_id: nominal, 19803 unique values, 0 missing
title_id: nominal, 25321 unique values, 0 missing
description_id: nominal, 22381 unique values, 0 missing
user_id: nominal, 30114 unique values, 0 missing
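The feature listing above can be captured as a small data structure and cross-checked against the reported type counts. The following is a minimal sketch using only the values shown on this page; the names `FEATURES` and `HIGH_CARDINALITY` and the threshold of 1000 levels are illustrative assumptions, not part of OpenML.

```python
from collections import Counter

# Feature listing transcribed from this page: (name, type, unique values).
FEATURES = [
    ("click", "nominal", 2),
    ("impression", "numeric", 99),
    ("url_hash", "numeric", 6941),
    ("ad_id", "nominal", 19228),
    ("advertiser_id", "nominal", 6064),
    ("depth", "numeric", 3),
    ("position", "numeric", 3),
    ("query_id", "numeric", 30748),
    ("keyword_id", "nominal", 19803),
    ("title_id", "nominal", 25321),
    ("description_id", "nominal", 22381),
    ("user_id", "nominal", 30114),
]

# Count features per declared type.
type_counts = Counter(ftype for _, ftype, _ in FEATURES)

# Features whose name ends in '_id' but that the listing reports as non-nominal.
numeric_ids = [name for name, ftype, _ in FEATURES
               if name.endswith("_id") and ftype != "nominal"]

# Nominal features with many levels (the threshold is an arbitrary assumption).
HIGH_CARDINALITY = 1000
high_card = [name for name, ftype, n in FEATURES
             if ftype == "nominal" and n > HIGH_CARDINALITY]

print(type_counts)
print(numeric_ids)
print(high_card)
```

Run against the listing, this shows that `query_id` is the one '_id' feature reported as numeric, and that six of the seven nominal features have thousands of levels, which matters when choosing an encoding for them.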
19 properties
NumberOfInstances: 39948. Number of instances (rows) of the dataset.
NumberOfFeatures: 12. Number of attributes (columns) of the dataset.
NumberOfClasses: 2. Number of distinct values of the target attribute (if it is nominal).
NumberOfMissingValues: 0. Number of missing values in the dataset.
NumberOfInstancesWithMissingValues: 0. Number of instances with at least one value missing.
NumberOfNumericFeatures: 5. Number of numeric attributes.
NumberOfSymbolicFeatures: 7. Number of nominal attributes.
MajorityClassSize: 33220. Number of instances belonging to the most frequent class.
MinorityClassPercentage: 16.84. Percentage of instances belonging to the least frequent class.
MinorityClassSize: 6728. Number of instances belonging to the least frequent class.
NumberOfBinaryFeatures: 1. Number of binary attributes.
PercentageOfBinaryFeatures: 8.33. Percentage of binary attributes.
PercentageOfInstancesWithMissingValues: 0. Percentage of instances having missing values.
PercentageOfMissingValues: 0. Percentage of missing values.
AutoCorrelation: 0.72. Average class difference between consecutive instances.
PercentageOfNumericFeatures: 41.67. Percentage of numeric attributes.
Dimensionality: 0 (12 / 39948 ≈ 0.0003, displayed rounded). Number of attributes divided by the number of instances.
PercentageOfSymbolicFeatures: 58.33. Percentage of nominal attributes.
MajorityClassPercentage: 83.16. Percentage of instances belonging to the most frequent class.
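Several of the properties above are simple ratios of the base counts, so they can be recomputed as a consistency check. The sketch below uses only the counts reported on this page; the helper `pct` and the dictionary `derived` are illustrative names, and rounding to two decimals is assumed from the displayed values.

```python
# Base counts as reported on this page.
n_instances = 39948
n_features = 12
majority = 33220
minority = 6728
n_binary = 1
n_numeric = 5
n_symbolic = 7


def pct(part, whole):
    """Percentage of `part` in `whole`, rounded to two decimals."""
    return round(100 * part / whole, 2)


derived = {
    "MajorityClassPercentage": pct(majority, n_instances),
    "MinorityClassPercentage": pct(minority, n_instances),
    "PercentageOfBinaryFeatures": pct(n_binary, n_features),
    "PercentageOfNumericFeatures": pct(n_numeric, n_features),
    "PercentageOfSymbolicFeatures": pct(n_symbolic, n_features),
    # Unrounded; it is far below 0.01, which is why the page shows 0.
    "Dimensionality": n_features / n_instances,
}

for name, value in derived.items():
    print(name, value)
```

The recomputed values agree with the listing. The class split also gives a reference point for the classification task: a model that always predicts the most frequent class would reach 83.16% accuracy on this data.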
10 tasks
Supervised Classification on Click_prediction_small (0 runs)
- estimation_procedure: 10-fold Crossvalidation
- target_feature: click
Clustering on Click_prediction_small (9 tasks listed, 0 runs each)
- estimation_procedure: 50 times Clustering
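To illustrate what 10-fold cross-validation looks like with this class imbalance, here is a minimal sketch of a stratified split in plain Python. It is an illustration only: the function `stratified_folds`, the seed, the 0/1 label coding, and the use of stratification are assumptions, and the labels are synthetic stand-ins with the class sizes reported on this page. For comparable results, the splits defined by the OpenML task itself should be used instead.

```python
import random


def stratified_folds(labels, k=10, seed=0):
    """Return k lists of row indices, spreading each class evenly over folds."""
    rng = random.Random(seed)

    # Group row indices by class label.
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)

    # Shuffle within each class, then deal indices round-robin to the folds.
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        for pos, idx in enumerate(indices):
            folds[pos % k].append(idx)
    return folds


# Synthetic labels with the reported class sizes (not the real rows);
# 0 stands for the majority class and 1 for the minority class.
labels = [0] * 33220 + [1] * 6728
folds = stratified_folds(labels)

# Number of minority-class rows that land in each fold.
minority_per_fold = [sum(labels[i] for i in fold) for fold in folds]
print([len(fold) for fold in folds])
print(minority_per_fold)
```

Each fold serves once as the test set while the other nine form the training set. With stratification, every fold keeps roughly the same 16.84% share of the minority class as the full dataset.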