OpenML

pol

active | ARFF | Publicly available | Visibility: public | Uploaded 04-10-2014 by Joaquin Vanschoren

Author:
Source: Unknown - Date unknown
Please cite:

Binarized version of the original data set (see version 1). The numeric target feature is converted to a two-class nominal target feature: the mean of the target is computed, and all instances with a target value below the mean are labeled positive ('P') while all others are labeled negative ('N').
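The binarization rule described above can be sketched as follows. This is a minimal illustration, not the script that produced this dataset; the function name `binarize_by_mean` and the sample values are hypothetical.

```python
from statistics import mean


def binarize_by_mean(targets):
    """Label values strictly below the mean as 'P' and all others as 'N'.

    Hypothetical helper illustrating the rule in the description;
    it is not the code used to build the dataset.
    """
    threshold = mean(targets)
    return ["P" if t < threshold else "N" for t in targets]


# Made-up target values; their mean is 40.0.
print(binarize_by_mean([0.0, 10.0, 50.0, 100.0]))  # ['P', 'P', 'N', 'N']
```

Values equal to the mean fall into 'N' here, following the wording "all others" in the description.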

49 features

binaryClass (target)	nominal	2 unique values 0 missing
f1	numeric	1 unique values 0 missing
f2	numeric	1 unique values 0 missing
f3	numeric	1 unique values 0 missing
f4	numeric	1 unique values 0 missing
f5	numeric	184 unique values 0 missing
f6	numeric	118 unique values 0 missing
f7	numeric	114 unique values 0 missing
f8	numeric	106 unique values 0 missing
f9	numeric	80 unique values 0 missing
f10	numeric	1 unique values 0 missing
f11	numeric	1 unique values 0 missing
f12	numeric	1 unique values 0 missing
f13	numeric	97 unique values 0 missing
f14	numeric	117 unique values 0 missing
f15	numeric	121 unique values 0 missing
f16	numeric	120 unique values 0 missing
f17	numeric	120 unique values 0 missing
f18	numeric	123 unique values 0 missing
f19	numeric	102 unique values 0 missing
f20	numeric	86 unique values 0 missing
f21	numeric	85 unique values 0 missing
f22	numeric	88 unique values 0 missing
f23	numeric	79 unique values 0 missing
f24	numeric	63 unique values 0 missing
f25	numeric	68 unique values 0 missing
f26	numeric	68 unique values 0 missing
f27	numeric	65 unique values 0 missing
f28	numeric	64 unique values 0 missing
f29	numeric	62 unique values 0 missing
f30	numeric	44 unique values 0 missing
f31	numeric	43 unique values 0 missing
f32	numeric	42 unique values 0 missing
f33	numeric	38 unique values 0 missing
f34	numeric	1 unique values 0 missing
f35	numeric	1 unique values 0 missing
f36	numeric	1 unique values 0 missing
f37	numeric	1 unique values 0 missing
f38	numeric	1 unique values 0 missing
f39	numeric	1 unique values 0 missing
f40	numeric	1 unique values 0 missing
f41	numeric	1 unique values 0 missing
f42	numeric	1 unique values 0 missing
f43	numeric	1 unique values 0 missing
f44	numeric	1 unique values 0 missing
f45	numeric	1 unique values 0 missing
f46	numeric	1 unique values 0 missing
f47	numeric	1 unique values 0 missing
f48	numeric	1 unique values 0 missing

107 properties

NumberOfInstances

15000

Number of instances (rows) of the dataset.

NumberOfFeatures

49

Number of attributes (columns) of the dataset.

NumberOfClasses

2

Number of distinct values of the target attribute (if it is nominal).

NumberOfMissingValues

0

Number of missing values in the dataset.

NumberOfInstancesWithMissingValues

0

Number of instances with at least one value missing.

NumberOfNumericFeatures

48

Number of numeric attributes.

NumberOfSymbolicFeatures

1

Number of nominal attributes.

MaxSkewnessOfNumericAtts

11.62

Maximum skewness among attributes of the numeric type.

MinStdDevOfNumericAtts

Minimum standard deviation of attributes of the numeric type.

PercentageOfMissingValues

0

Percentage of missing values.

Quartile3KurtosisOfNumericAtts

69.84

Third quartile of kurtosis among attributes of the numeric type.

AutoCorrelation

0.56

Average class difference between consecutive instances.

RandomTreeDepth1Kappa

0.88

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1
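Several landmarker properties on this page report a Kappa coefficient. As a rough sketch, and assuming the usual definition of Cohen's kappa (observed agreement corrected for the agreement expected by chance), it could be computed as below. The function name and the toy labels are hypothetical, and this is not Weka's implementation.

```python
def cohen_kappa(actual, predicted):
    """Cohen's kappa for two equally long label sequences (illustrative sketch)."""
    n = len(actual)
    labels = set(actual) | set(predicted)
    # Fraction of instances where prediction and truth agree.
    observed = sum(a == p for a, p in zip(actual, predicted)) / n
    # Agreement expected if predictions were independent of the truth.
    expected = sum(
        (actual.count(label) / n) * (predicted.count(label) / n)
        for label in labels
    )
    if expected == 1.0:
        return float("nan")  # kappa is undefined when chance agreement is total
    return (observed - expected) / (1.0 - expected)


# A predictor that always answers 'P' gets no credit beyond chance.
print(cohen_kappa(["P", "P", "N"], ["P", "P", "P"]))  # 0.0
```

This is why a classifier can show a moderate error rate and still a low kappa on an imbalanced dataset: agreement that the class distribution alone would produce is subtracted out.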

J48.00001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .00001

MaxStdDevOfNumericAtts

35.25

Maximum standard deviation of attributes of the numeric type.

MinorityClassPercentage

33.61

Percentage of instances belonging to the least frequent class.

PercentageOfNumericFeatures

97.96

Percentage of numeric attributes.

Quartile3MeansOfNumericAtts

12.01

Third quartile of means among attributes of the numeric type.

CfsSubsetEval_DecisionStumpAUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2AUC

0.94

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.00001.ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .00001

MeanAttributeEntropy

Average entropy of the attributes.

MinorityClassSize

5041

Number of instances belonging to the least frequent class.

PercentageOfSymbolicFeatures

2.04

Percentage of nominal attributes.

Quartile3MutualInformation

Third quartile of mutual information between the nominal attributes and the target attribute.

CfsSubsetEval_DecisionStumpErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2ErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.00001.Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.0001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanKurtosisOfNumericAtts

34.89

Mean kurtosis among attributes of the numeric type.

NaiveBayesAUC

0.88

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1AttributeEntropy

First quartile of entropy among attributes.

Quartile3SkewnessOfNumericAtts

7.8

Third quartile of skewness among attributes of the numeric type.

CfsSubsetEval_DecisionStumpKappa

0.89

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2Kappa

0.88

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.0001.ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanMeansOfNumericAtts

19.43

Mean of means among attributes of the numeric type.

NaiveBayesErrRate

0.34

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1KurtosisOfNumericAtts

3.35

First quartile of kurtosis among attributes of the numeric type.

Quartile3StdDevOfNumericAtts

11.51

Third quartile of standard deviation of attributes of the numeric type.

CfsSubsetEval_NaiveBayesAUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth3AUC

0.94

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

J48.0001.Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanMutualInformation

Average mutual information between the nominal attributes and the target attribute.

NaiveBayesKappa

0.36

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1MeansOfNumericAtts

First quartile of means among attributes of the numeric type.

REPTreeDepth1AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_NaiveBayesErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth3ErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

RandomTreeDepth3Kappa

0.88

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

J48.001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanNoiseToSignalRatio

An estimate of the amount of irrelevant information in the attributes regarding the class. Equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation.
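The formula in this definition can be written out directly. The sketch below uses a hypothetical function name and made-up inputs, since this page lists no values for MeanAttributeEntropy or MeanMutualInformation.

```python
def mean_noise_to_signal_ratio(mean_attribute_entropy, mean_mutual_information):
    """(MeanAttributeEntropy - MeanMutualInformation) / MeanMutualInformation."""
    if mean_mutual_information == 0:
        raise ValueError("ratio is undefined when MeanMutualInformation is 0")
    return (mean_attribute_entropy - mean_mutual_information) / mean_mutual_information


# Hypothetical inputs, not taken from this dataset.
print(mean_noise_to_signal_ratio(2.0, 0.5))  # 3.0
```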

NumberOfBinaryFeatures

1

Number of binary attributes.

Quartile1MutualInformation

First quartile of mutual information between the nominal attributes and the target attribute.

REPTreeDepth1ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_NaiveBayesKappa

0.89

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

StdvNominalAttDistinctValues

Standard deviation of the number of distinct values among attributes of the nominal type.

J48.001.ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanNominalAttDistinctValues

Average number of distinct values among the attributes of the nominal type.

Quartile1SkewnessOfNumericAtts

1.86

First quartile of skewness among attributes of the numeric type.

REPTreeDepth1Kappa

0.93

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_kNN1NAUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

kNN1NAUC

0.96

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk

J48.001.Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanSkewnessOfNumericAtts

4.78

Mean skewness among attributes of the numeric type.

Quartile1StdDevOfNumericAtts

First quartile of standard deviation of attributes of the numeric type.

REPTreeDepth2AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 2

CfsSubsetEval_kNN1NErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

kNN1NErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.lazy.IBk

MajorityClassPercentage

66.39

Percentage of instances belonging to the most frequent class.

MeanStdDevOfNumericAtts

6.52

Mean standard deviation of attributes of the numeric type.

Quartile2AttributeEntropy

Second quartile (Median) of entropy among attributes.

REPTreeDepth2ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 2

CfsSubsetEval_kNN1NKappa

0.89

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

kNN1NKappa

0.92

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk

MajorityClassSize

9959

Number of instances belonging to the most frequent class.
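The class percentages on this page follow from the class sizes and the 15000 instances. A small sketch that reproduces them, with a hypothetical function name and generic dictionary keys (the page does not say which of 'P' and 'N' is the majority class):

```python
def class_percentages(class_sizes):
    """Return each class's share of all instances, in percent, rounded to 2 places."""
    total = sum(class_sizes.values())
    return {label: round(100 * size / total, 2) for label, size in class_sizes.items()}


# MajorityClassSize and MinorityClassSize as listed on this page.
print(class_percentages({"majority": 9959, "minority": 5041}))
# {'majority': 66.39, 'minority': 33.61}
```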

MinAttributeEntropy

Minimal entropy among attributes.

Quartile2KurtosisOfNumericAtts

16.99

Second quartile (Median) of kurtosis among attributes of the numeric type.

REPTreeDepth2Kappa

0.93

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 2

ClassEntropy

0.92

Entropy of the target attribute values.
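The reported value can be checked against the class sizes listed on this page (9959 and 5041). The sketch below assumes Shannon entropy with a base-2 logarithm, which is consistent with the value 0.92; the function name is hypothetical.

```python
from math import log2


def class_entropy(class_sizes):
    """Shannon entropy (in bits) of a class distribution given as counts."""
    total = sum(class_sizes)
    return -sum((n / total) * log2(n / total) for n in class_sizes if n > 0)


print(round(class_entropy([9959, 5041]), 2))  # 0.92
```

A perfectly balanced two-class target would give 1.0, so 0.92 reflects the moderate imbalance between the two classes.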

MaxAttributeEntropy

Maximum entropy among attributes.

MinKurtosisOfNumericAtts

-0.07

Minimum kurtosis among attributes of the numeric type.

Quartile2MeansOfNumericAtts

0.93

Second quartile (Median) of means among attributes of the numeric type.

REPTreeDepth3AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpAUC

0.69

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxKurtosisOfNumericAtts

147.26

Maximum kurtosis among attributes of the numeric type.

MinMeansOfNumericAtts

Minimum of means among attributes of the numeric type.

Quartile2MutualInformation

Second quartile (Median) of mutual information between the nominal attributes and the target attribute.

REPTreeDepth3ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpErrRate

0.34

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxMeansOfNumericAtts

110

Maximum of means among attributes of the numeric type.

MinMutualInformation

Minimal mutual information between the nominal attributes and the target attribute.

Quartile2SkewnessOfNumericAtts

3.92

Second quartile (Median) of skewness among attributes of the numeric type.

REPTreeDepth3Kappa

0.93

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpKappa

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxMutualInformation

Maximum mutual information between the nominal attributes and the target attribute.

MinNominalAttDistinctValues

The minimal number of distinct values among attributes of the nominal type.

PercentageOfBinaryFeatures

2.04

Percentage of binary attributes.

Quartile2StdDevOfNumericAtts

3.13

Second quartile (Median) of standard deviation of attributes of the numeric type.

RandomTreeDepth1AUC

0.94

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

Dimensionality

Number of attributes divided by the number of instances.
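As a quick worked example, and assuming the attribute count includes the target (the page lists 49 features including binaryClass), the ratio for this dataset is small. The function name is hypothetical.

```python
def dimensionality(n_attributes, n_instances):
    """Number of attributes divided by the number of instances."""
    return n_attributes / n_instances


# 49 features and 15000 instances, as listed on this page.
print(round(dimensionality(49, 15000), 4))  # 0.0033
```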

MaxNominalAttDistinctValues

The maximum number of distinct values among attributes of the nominal type.

MinSkewnessOfNumericAtts

0.31

Minimum skewness among attributes of the numeric type.

PercentageOfInstancesWithMissingValues

0

Percentage of instances having missing values.

Quartile3AttributeEntropy

Third quartile of entropy among attributes.

RandomTreeDepth1ErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

EquivalentNumberOfAtts

Number of attributes needed to optimally describe the class (under the assumption of independence among attributes). Equals ClassEntropy divided by MeanMutualInformation.
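The ratio in this definition can be sketched as below. The ClassEntropy of 0.92 comes from this page; the mutual information of 0.1 is a made-up value for illustration only, since the page lists no MeanMutualInformation. The function name is hypothetical.

```python
def equivalent_number_of_atts(class_entropy, mean_mutual_information):
    """ClassEntropy divided by MeanMutualInformation."""
    if mean_mutual_information == 0:
        raise ValueError("undefined when MeanMutualInformation is 0")
    return class_entropy / mean_mutual_information


# 0.92 is the ClassEntropy listed on this page; 0.1 is a hypothetical input.
print(equivalent_number_of_atts(0.92, 0.1))  # roughly 9.2
```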

16 tasks

Supervised Classification on pol

420 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: binaryClass

Supervised Classification on pol

204 runs - estimation_procedure: 10 times 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: binaryClass
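The two cross-validation procedures named above can be sketched with plain index shuffling. This is an illustrative, unstratified sketch with hypothetical function names; it does not reproduce the splits actually attached to these tasks.

```python
import random


def k_fold_indices(n_instances, k=10, seed=0):
    """Yield (train, test) index lists for one round of k-fold cross-validation."""
    indices = list(range(n_instances))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k near-equal folds
    for i in range(k):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, folds[i]


def repeated_k_fold_indices(n_instances, k=10, repeats=10, seed=0):
    """Repeat k-fold cross-validation with a different shuffle each time."""
    for r in range(repeats):
        yield from k_fold_indices(n_instances, k, seed + r)


splits = list(k_fold_indices(15000))
print(len(splits), len(splits[0][0]), len(splits[0][1]))  # 10 13500 1500
```

With 15000 instances, each of the 10 folds holds 1500 test instances, and "10 times 10-fold" yields 100 train/test pairs in total.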

Supervised Classification on pol

0 runs - estimation_procedure: 33% Holdout set - evaluation_measure: predictive_accuracy - target_feature: binaryClass
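A holdout split can be sketched as below, assuming that "33% Holdout set" means 33% of the instances are held out for testing (the page does not spell this out). The function name and the seed are hypothetical.

```python
import random


def holdout_split(n_instances, test_fraction=0.33, seed=0):
    """Return (train, test) index lists with about test_fraction held out."""
    indices = list(range(n_instances))
    random.Random(seed).shuffle(indices)
    n_test = round(n_instances * test_fraction)
    return indices[n_test:], indices[:n_test]


train, test = holdout_split(15000)
print(len(train), len(test))  # 10050 4950
```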

Supervised Classification on pol

0 runs - estimation_procedure: 4-fold Crossvalidation - target_feature: binaryClass

Supervised Data Stream Classification on pol

0 runs - estimation_procedure: Interleaved Test then Train - target_feature: binaryClass
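In interleaved test-then-train evaluation, each instance of the stream is first used to test the current model and only afterwards to update it. The sketch below illustrates that loop with a deliberately trivial majority-class learner; the function name, the learner, and the toy stream are all assumptions for illustration.

```python
from collections import Counter


def prequential_accuracy(stream):
    """Test on each (features, label) pair first, then train on it."""
    counts = Counter()  # the "model": label frequencies seen so far
    correct = 0
    total = 0
    for _features, label in stream:
        # Test: predict the most frequent label so far (None before any training).
        prediction = counts.most_common(1)[0][0] if counts else None
        correct += prediction == label
        total += 1
        # Train: update the model with the instance just tested.
        counts[label] += 1
    return correct / total if total else float("nan")


toy_stream = [((0.1,), "N"), ((0.4,), "N"), ((0.9,), "P"), ((0.2,), "N")]
print(prequential_accuracy(toy_stream))  # 0.5
```

No instance is ever used for training before it has been used for testing, so no separate holdout set is needed.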

Clustering on pol