Click_prediction_small
Status: active
Format: ARFF
Visibility: public
Uploaded 19-11-2020 by Marcos de Paula Bueno
Tags: Machine Learning, study_270, study_271
This is the same data as version 5 (OpenML ID = 1220) with '_id' features coded as nominal factor variables.
12 features

| Feature | Type | Unique values | Missing |
|---|---|---|---|
| click (target) | nominal | 2 | 0 |
| impression | numeric | 99 | 0 |
| url_hash | numeric | 6941 | 0 |
| ad_id | nominal | 19228 | 0 |
| advertiser_id | nominal | 6064 | 0 |
| depth | numeric | 3 | 0 |
| position | numeric | 3 | 0 |
| query_id | numeric | 30748 | 0 |
| keyword_id | nominal | 19803 | 0 |
| title_id | nominal | 25321 | 0 |
| description_id | nominal | 22381 | 0 |
| user_id | nominal | 30114 | 0 |
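
As a quick consistency check, the feature listing above can be tallied by type and compared with the dataset properties (7 nominal, 5 numeric, 1 binary). The sketch below is only illustrative: the `FEATURES` list is transcribed by hand from the listing, and treating "binary" as "exactly 2 unique values" is an assumption about how that property is counted. It also surfaces one detail worth knowing: `query_id` is still listed as numeric, even though the other `_id` features are nominal.

```python
from collections import Counter

# Hand-transcribed from the feature listing: (name, type, unique values).
FEATURES = [
    ("click", "nominal", 2),
    ("impression", "numeric", 99),
    ("url_hash", "numeric", 6941),
    ("ad_id", "nominal", 19228),
    ("advertiser_id", "nominal", 6064),
    ("depth", "numeric", 3),
    ("position", "numeric", 3),
    ("query_id", "numeric", 30748),
    ("keyword_id", "nominal", 19803),
    ("title_id", "nominal", 25321),
    ("description_id", "nominal", 22381),
    ("user_id", "nominal", 30114),
]

# Count features per declared type.
type_counts = Counter(ftype for _, ftype, _ in FEATURES)

# Assumption: a feature is "binary" when it has exactly 2 unique values.
binary = [name for name, _, n_unique in FEATURES if n_unique == 2]

# '_id' features that are not coded as nominal in this listing.
numeric_ids = [
    name for name, ftype, _ in FEATURES
    if name.endswith("_id") and ftype == "numeric"
]

print("per type:", dict(type_counts))
print("binary features:", binary)
print("'_id' features still numeric:", numeric_ids)
```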
19 properties

| Property | Value | Description |
|---|---|---|
| NumberOfInstances | 39948 | Number of instances (rows) of the dataset. |
| NumberOfFeatures | 12 | Number of attributes (columns) of the dataset. |
| NumberOfClasses | 2 | Number of distinct values of the target attribute (if it is nominal). |
| NumberOfMissingValues | 0 | Number of missing values in the dataset. |
| NumberOfInstancesWithMissingValues | 0 | Number of instances with at least one value missing. |
| NumberOfNumericFeatures | 5 | Number of numeric attributes. |
| NumberOfSymbolicFeatures | 7 | Number of nominal attributes. |
| PercentageOfBinaryFeatures | 8.33 | Percentage of binary attributes. |
| PercentageOfInstancesWithMissingValues | 0 | Percentage of instances having missing values. |
| AutoCorrelation | 0.72 | Average class difference between consecutive instances. |
| PercentageOfMissingValues | 0 | Percentage of missing values. |
| PercentageOfNumericFeatures | 41.67 | Percentage of numeric attributes. |
| Dimensionality | 0 | Number of attributes divided by the number of instances. |
| PercentageOfSymbolicFeatures | 58.33 | Percentage of nominal attributes. |
| MajorityClassPercentage | 83.16 | Percentage of instances belonging to the most frequent class. |
| MajorityClassSize | 33220 | Number of instances belonging to the most frequent class. |
| MinorityClassPercentage | 16.84 | Percentage of instances belonging to the least frequent class. |
| MinorityClassSize | 6728 | Number of instances belonging to the least frequent class. |
| NumberOfBinaryFeatures | 1 | Number of binary attributes. |
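
The percentage properties follow directly from the counts listed above, so they can be re-derived as a sanity check. The snippet below is a minimal sketch under that reading; the helper name `pct` is hypothetical, and rounding to two decimals is an assumption about how the page displays values. It also suggests why Dimensionality is shown as 0: the ratio 12 / 39948 is about 0.0003, which rounds to zero at two decimals.

```python
# Counts copied from the property listing.
n_instances = 39948
n_features = 12
n_numeric = 5
n_symbolic = 7
n_binary = 1
majority_size = 33220
minority_size = 6728


def pct(part: int, whole: int) -> float:
    """Percentage of `part` in `whole`, rounded to two decimals."""
    return round(100 * part / whole, 2)


majority_pct = pct(majority_size, n_instances)
minority_pct = pct(minority_size, n_instances)
numeric_pct = pct(n_numeric, n_features)
symbolic_pct = pct(n_symbolic, n_features)
binary_pct = pct(n_binary, n_features)

# Attributes per instance; tiny for a dataset this long and narrow.
dimensionality = n_features / n_instances

print("majority %:", majority_pct)
print("minority %:", minority_pct)
print("numeric %:", numeric_pct)
print("symbolic %:", symbolic_pct)
print("binary %:", binary_pct)
print("dimensionality:", dimensionality)
```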
2 tasks

Supervised Classification on Click_prediction_small (0 runs)
- estimation_procedure: 4-fold Crossvalidation
- target_feature: click

Supervised Classification on Click_prediction_small (0 runs)
- estimation_procedure: 10-fold Crossvalidation
- evaluation_measure: predictive_accuracy
- target_feature: click
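
To make the two estimation procedures concrete, the sketch below computes how the 39948 instances would divide into 4 and 10 folds, and the accuracy a trivial majority-class predictor would reach. It is an illustration only: the helper `fold_sizes` is hypothetical, giving the extra instances to the first folds is an assumed convention, and it does not reproduce the actual splits OpenML serves for these tasks. The baseline matters because, with 83.16% of instances in one class, predictive_accuracy is meaningful only when it exceeds that figure.

```python
def fold_sizes(n_instances: int, k: int) -> list[int]:
    """Sizes of k near-equal folds; the first `n_instances % k` folds get one extra."""
    base, extra = divmod(n_instances, k)
    return [base + 1 if i < extra else base for i in range(k)]


n_instances = 39948
majority_size = 33220

sizes_4 = fold_sizes(n_instances, 4)
sizes_10 = fold_sizes(n_instances, 10)

# Accuracy of always predicting the most frequent class.
baseline_accuracy = majority_size / n_instances

print("4-fold test sizes:", sizes_4)
print("10-fold test sizes:", sizes_10)
print("majority-class baseline accuracy:", round(baseline_accuracy, 4))
```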