OpenML

JavaScript is required to properly view the contents of this page!

Explore
- Data
- Task
- Flow
- Run
- Study
- Task type
- Measure
- People
Help
Blog
Contact
Please cite us

anneal

deactivated ARFF Publicly available Visibility: public Uploaded 06-04-2014 by Jan van Rijn
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes

Issue	#Downvotes for this reason	By

Loading wiki

Help us complete this description Edit

Author: donated by David Sterling and Wray Buntine Source: [original (UCI)](http://www.openml.org/d/2) - Please cite: This is a preprocessed version of the anneal dataset (version 1). All missing values are treated as a nominal value with label '?'. (Quotes for clarity). Because this is not good practice, this dataset is deactivated. Use version 1 instead 1. Title of Database: Annealing Data 2. Source Information: donated by David Sterling and Wray Buntine. 3. Past Usage: unknown 4. Relevant Information: -- Explanation: I suspect this was left by Ross Quinlan in 1987 at the 4th Machine Learning Workshop. I'd have to check with Jeff Schlimmer to double check this. 5. Number of Instances: 898 6. Number of Attributes: 38 -- 6 continuously-valued -- 3 integer-valued -- 29 nominal-valued 7. Attribute Information: 1. family: --,GB,GK,GS,TN,ZA,ZF,ZH,ZM,ZS 2. product-type: C, H, G 3. steel: -,R,A,U,K,M,S,W,V 4. carbon: continuous 5. hardness: continuous 6. temper_rolling: -,T 7. condition: -,S,A,X 8. formability: -,1,2,3,4,5 9. strength: continuous 10. non-ageing: -,N 11. surface-finish: P,M,- 12. surface-quality: -,D,E,F,G 13. enamelability: -,1,2,3,4,5 14. bc: Y,- 15. bf: Y,- 16. bt: Y,- 17. bw/me: B,M,- 18. bl: Y,- 19. m: Y,- 20. chrom: C,- 21. phos: P,- 22. cbond: Y,- 23. marvi: Y,- 24. exptl: Y,- 25. ferro: Y,- 26. corr: Y,- 27. blue/bright/varn/clean: B,R,V,C,- 28. lustre: Y,- 29. jurofm: Y,- 30. s: Y,- 31. p: Y,- 32. shape: COIL, SHEET 33. thick: continuous 34. width: continuous 35. len: continuous 36. oil: -,Y,N 37. bore: 0000,0500,0600,0760 38. packing: -,1,2,3 classes: 1,2,3,4,5,U -- The '-' values are actually 'not_applicable' values rather than 'missing_values' (and so can be treated as legal discrete values rather than as showing the absence of a discrete value). 8. Missing Attribute Values: Signified with "?" Attribute: Number of instances missing its value: 1 0 2 0 3 70 4 0 5 0 6 675 7 271 8 283 9 0 10 703 11 790 12 217 13 785 14 797 15 680 16 736 17 609 18 662 19 798 20 775 21 791 22 730 23 798 24 796 25 772 26 798 27 793 28 753 29 798 30 798 31 798 32 0 33 0 34 0 35 0 36 740 37 0 38 789 39 0 9. Distribution of Classes Class Name: Number of Instances: 1 8 2 88 3 608 4 0 5 60 U 34 --- 798

39 features

class (target)	nominal	5 unique values 0 missing
family	nominal	3 unique values 0 missing
product-type	nominal	1 unique values 0 missing
steel	nominal	8 unique values 0 missing
carbon	numeric	10 unique values 0 missing
hardness	numeric	7 unique values 0 missing
temper_rolling	nominal	2 unique values 0 missing
condition	nominal	3 unique values 0 missing
formability	nominal	5 unique values 0 missing
strength	numeric	8 unique values 0 missing
non-ageing	nominal	2 unique values 0 missing
surface-finish	nominal	2 unique values 0 missing
surface-quality	nominal	5 unique values 0 missing
enamelability	nominal	3 unique values 0 missing
bc	nominal	2 unique values 0 missing
bf	nominal	2 unique values 0 missing
bt	nominal	2 unique values 0 missing
bw%2Fme	nominal	3 unique values 0 missing
bl	nominal	2 unique values 0 missing
m	nominal	1 unique values 0 missing
chrom	nominal	2 unique values 0 missing
phos	nominal	2 unique values 0 missing
cbond	nominal	2 unique values 0 missing
marvi	nominal	1 unique values 0 missing
exptl	nominal	2 unique values 0 missing
ferro	nominal	2 unique values 0 missing
corr	nominal	1 unique values 0 missing
blue%2Fbright%2Fvarn%2Fclean	nominal	4 unique values 0 missing
lustre	nominal	2 unique values 0 missing
jurofm	nominal	1 unique values 0 missing
s	nominal	1 unique values 0 missing
p	nominal	1 unique values 0 missing
shape	nominal	2 unique values 0 missing
thick	numeric	50 unique values 0 missing
width	numeric	68 unique values 0 missing
len	numeric	24 unique values 0 missing
oil	nominal	3 unique values 0 missing
bore	nominal	3 unique values 0 missing
packing	nominal	3 unique values 0 missing

Show all 39 features

107 properties

NumberOfInstances

898

Number of instances (rows) of the dataset.

NumberOfFeatures

Number of attributes (columns) of the dataset.

NumberOfClasses

Number of distinct values of the target attribute (if it is nominal).

NumberOfMissingValues

Number of missing values in the dataset.

NumberOfInstancesWithMissingValues

Number of instances with at least one value missing.

NumberOfNumericFeatures

Number of numeric attributes.

NumberOfSymbolicFeatures

Number of nominal attributes.

MaxSkewnessOfNumericAtts

3.76

Maximum skewness among attributes of the numeric type.

MinStdDevOfNumericAtts

0.87

Minimum standard deviation of attributes of the numeric type.

PercentageOfMissingValues

Percentage of missing values.

Quartile3KurtosisOfNumericAtts

12.74

Third quartile of kurtosis among attributes of the numeric type.

AutoCorrelation

0.61

Average class difference between consecutive instances.

RandomTreeDepth1Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

J48.00001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .00001

MaxStdDevOfNumericAtts

1871.4

Maximum standard deviation of attributes of the numeric type.

MinorityClassPercentage

0.89

Percentage of instances belonging to the least frequent class.

PercentageOfNumericFeatures

15.38

Percentage of numeric attributes.

Quartile3MeansOfNumericAtts

901.26

Third quartile of means among attributes of the numeric type.

CfsSubsetEval_DecisionStumpAUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2AUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.00001.ErrRate

0.02

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .00001

MeanAttributeEntropy

0.46

Average entropy of the attributes.

MinorityClassSize

Number of instances belonging to the least frequent class.

PercentageOfSymbolicFeatures

84.62

Percentage of nominal attributes.

Quartile3MutualInformation

0.13

Third quartile of mutual information between the nominal attributes and the target attribute.

CfsSubsetEval_DecisionStumpErrRate

0.02

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2ErrRate

0.02

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.00001.Kappa

0.96

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .00001

MeanKurtosisOfNumericAtts

4.65

Mean kurtosis among attributes of the numeric type.

NaiveBayesAUC

0.96

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1AttributeEntropy

0.02

First quartile of entropy among attributes.

Quartile3SkewnessOfNumericAtts

3.75

Third quartile of skewness among attributes of the numeric type.

CfsSubsetEval_DecisionStumpKappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.0001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanMeansOfNumericAtts

348.5

Mean of means among attributes of the numeric type.

NaiveBayesErrRate

0.14

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1KurtosisOfNumericAtts

-0.4

First quartile of kurtosis among attributes of the numeric type.

Quartile3StdDevOfNumericAtts

771.86

Third quartile of standard deviation of attributes of the numeric type.

CfsSubsetEval_NaiveBayesAUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth3AUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

J48.0001.ErrRate

0.02

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanMutualInformation

0.08

Average mutual information between the nominal attributes and the target attribute.

NaiveBayesKappa

0.72

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1MeansOfNumericAtts

3.03

First quartile of means among attributes of the numeric type.

REPTreeDepth1AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_NaiveBayesErrRate

0.02

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth3ErrRate

0.02

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

J48.0001.Kappa

0.96

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanNoiseToSignalRatio

4.66

An estimate of the amount of irrelevant information in the attributes regarding the class. Equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation.

NumberOfBinaryFeatures

Number of binary attributes.

Quartile1MutualInformation

First quartile of mutual information between the nominal attributes and the target attribute.

REPTreeDepth1ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_NaiveBayesKappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth3Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

J48.001.AUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanNominalAttDistinctValues

2.52

Average number of distinct values among the attributes of the nominal type.

Quartile1SkewnessOfNumericAtts

0.97

First quartile of skewness among attributes of the numeric type.

REPTreeDepth1Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_kNN1NAUC

0.98

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

StdvNominalAttDistinctValues

1.5

Standard deviation of the number of distinct values among attributes of the nominal type.

J48.001.ErrRate

0.02

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanSkewnessOfNumericAtts

2.03

Mean skewness among attributes of the numeric type.

Quartile1StdDevOfNumericAtts

10.51

First quartile of standard deviation of attributes of the numeric type.

REPTreeDepth2AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 2

CfsSubsetEval_kNN1NErrRate

0.02

Error rate achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

kNN1NAUC

0.95

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk

J48.001.Kappa

0.96

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanStdDevOfNumericAtts

405.17

Mean standard deviation of attributes of the numeric type.

Quartile2AttributeEntropy

0.26

Second quartile (Median) of entropy among attributes.

REPTreeDepth2ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 2

CfsSubsetEval_kNN1NKappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

kNN1NErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.lazy.IBk

MajorityClassPercentage

76.17

Percentage of instances belonging to the most frequent class.

MinAttributeEntropy

Minimal entropy among attributes.

Quartile2KurtosisOfNumericAtts

1.64

Second quartile (Median) of kurtosis among attributes of the numeric type.

REPTreeDepth2Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 2

ClassEntropy

1.19

Entropy of the target attribute values.

kNN1NKappa

0.92

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk

MajorityClassSize

684

Number of instances belonging to the most frequent class.

MinKurtosisOfNumericAtts

-0.97

Minimum kurtosis among attributes of the numeric type.

Quartile2MeansOfNumericAtts

21.22

Second quartile (Median) of means among attributes of the numeric type.

REPTreeDepth3AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpAUC

0.82

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxAttributeEntropy

2.05

Maximum entropy among attributes.

MinMeansOfNumericAtts

1.2

Minimum of means among attributes of the numeric type.

Quartile2MutualInformation

0.03

Second quartile (Median) of mutual information between the nominal attributes and the target attribute.

REPTreeDepth3ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpErrRate

0.23

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxKurtosisOfNumericAtts

13.22

Maximum kurtosis among attributes of the numeric type.

MaxMeansOfNumericAtts

1263.09

Maximum of means among attributes of the numeric type.

MinMutualInformation

Minimal mutual information between the nominal attributes and the target attribute.

Quartile2SkewnessOfNumericAtts

1.65

Second quartile (Median) of skewness among attributes of the numeric type.

REPTreeDepth3Kappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpKappa

0.45

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxMutualInformation

0.44

Maximum mutual information between the nominal attributes and the target attribute.

MinNominalAttDistinctValues

The minimal number of distinct values among attributes of the nominal type.

PercentageOfBinaryFeatures

48.72

Percentage of binary attributes.

Quartile2StdDevOfNumericAtts

69.85

Second quartile (Median) of standard deviation of attributes of the numeric type.

RandomTreeDepth1AUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

Dimensionality

0.04

Number of attributes divided by the number of instances.

MaxNominalAttDistinctValues

The maximum number of distinct values among attributes of the nominal type.

MinSkewnessOfNumericAtts

0.07

Minimum skewness among attributes of the numeric type.

PercentageOfInstancesWithMissingValues

Percentage of instances having missing values.

Quartile3AttributeEntropy

0.64

Third quartile of entropy among attributes.

RandomTreeDepth1ErrRate

0.02

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

EquivalentNumberOfAtts

14.54

Number of attributes needed to optimally describe the class (under the assumption of independence among attributes). Equals ClassEntropy divided by MeanMutualInformation.

Show all 107 properties

29 tasks

Supervised Classification on anneal

950 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: class

Supervised Classification on anneal

300 runs - estimation_procedure: 33% Holdout set - evaluation_measure: predictive_accuracy - target_feature: class

Supervised Classification on anneal

298 runs - estimation_procedure: 5 times 2-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: class

Supervised Classification on anneal

185 runs - estimation_procedure: 10 times 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: class

Supervised Classification on anneal

24 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: precision - target_feature: class

Supervised Classification on anneal

0 runs - estimation_procedure: Leave one out - evaluation_measure: predictive_accuracy - target_feature: class

Supervised Classification on anneal

0 runs - estimation_procedure: 20% Holdout (Ordered) - evaluation_measure: predictive_accuracy - target_feature: class

Learning Curve on anneal

181 runs - estimation_procedure: 10 times 10-fold Learning Curve - evaluation_measure: predictive_accuracy - target_feature: class

Learning Curve on anneal

79 runs - estimation_procedure: 10-fold Learning Curve - evaluation_measure: predictive_accuracy - target_feature: class

Learning Curve on anneal

0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class

Learning Curve on anneal

0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class

Learning Curve on anneal

0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class

Learning Curve on anneal

0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class

Learning Curve on anneal

0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class

Learning Curve on anneal

0 runs - estimation_procedure: 10-fold Learning Curve - target_feature: class

Supervised Data Stream Classification on anneal

25 runs - estimation_procedure: Interleaved Test then Train - target_feature: class

Clustering on anneal