OpenML

JavaScript is required to properly view the contents of this page!

Explore
- Data
- Task
- Flow
- Run
- Study
- Task type
- Measure
- People
Help
Blog
Contact
Please cite us

artificial-characters

active ARFF Publicly available Visibility: public Uploaded 21-05-2015 by Rafael Gomes Mantovani
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes

Issue	#Downvotes for this reason	By

Loading wiki

Help us complete this description Edit

Author: H. Altay Guvenir, Burak Acar, Haldun Muderrisoglu Source: [UCI](https://archive.ics.uci.edu/ml/datasets/Artificial+Characters) - 1992 Please cite: [UCI](https://archive.ics.uci.edu/ml/citation_policy.html) This database has been artificially generated. It describes the structure of the capital letters A, C, D, E, F, G, H, L, P, R, indicated by a number 1-10, in that order (A=1,C=2,...). Each letter's structure is described by a set of segments (lines) which resemble the way an automatic program would segment an image. The dataset consists of 600 such descriptions per letter. Originally, each 'instance' (letter) was stored in a separate file, each consisting of between 1 and 7 segments, numbered 0,1,2,3,... Here they are merged. That means that the first 5 instances describe the first 5 segments of the first segmentation of the first letter (A). Also, the training set (100 examples) and test set (the rest) are merged. The next 7 instances describe another segmentation (also of the letter A) and so on. ### Attribute Information * V1: object number, the number of the segment (0,1,2,..,7) * V2-V5: the initial and final coordinates of a segment in a cartesian plane (XX1,YY1,XX2,YY2). * V6: size, this is the length of a segment computed by using the geometric distance between two points A(X1,Y1) and B(X2,Y2). * V7: diagonal, this is the length of the diagonal of the smallest rectangle which includes the picture of the character. The value of this attribute is the same in each object. ### Relevant Papers M. Botta, A. Giordana, L. Saitta: "Learning Fuzzy Concept Definitions", IEEE-Fuzzy Conference, 1993. M. Botta, A. Giordana: "Learning Quantitative Feature in a Symbolic Environment", LNAI 542, 1991, pp. 296-305.

8 features

Class (target)	nominal	10 unique values 0 missing
V1	numeric	8 unique values 0 missing
V2	numeric	45 unique values 0 missing
V3	numeric	63 unique values 0 missing
V4	numeric	48 unique values 0 missing
V5	numeric	66 unique values 0 missing
V6	numeric	333 unique values 0 missing
V7	numeric	511 unique values 0 missing

Show all 8 features

107 properties

NumberOfInstances

10218

Number of instances (rows) of the dataset.

NumberOfFeatures

Number of attributes (columns) of the dataset.

NumberOfClasses

Number of distinct values of the target attribute (if it is nominal).

NumberOfMissingValues

Number of missing values in the dataset.

NumberOfInstancesWithMissingValues

Number of instances with at least one value missing.

NumberOfNumericFeatures

Number of numeric attributes.

NumberOfSymbolicFeatures

Number of nominal attributes.

Quartile2SkewnessOfNumericAtts

0.71

Second quartile (Median) of skewness among attributes of the numeric type.

REPTreeDepth3Kappa

0.6

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpKappa

0.08

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxMeansOfNumericAtts

40.46

Maximum of means among attributes of the numeric type.

MinMutualInformation

Minimal mutual information between the nominal attributes and the target attribute.

Quartile2StdDevOfNumericAtts

9.73

Second quartile (Median) of standard deviation of attributes of the numeric type.

RandomTreeDepth1AUC

0.88

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

Dimensionality

Number of attributes divided by the number of instances.

MaxMutualInformation

Maximum mutual information between the nominal attributes and the target attribute.

MinNominalAttDistinctValues

The minimal number of distinct values among attributes of the nominal type.

PercentageOfBinaryFeatures

Percentage of binary attributes.

Quartile3AttributeEntropy

Third quartile of entropy among attributes.

RandomTreeDepth1ErrRate

0.21

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

EquivalentNumberOfAtts

Number of attributes needed to optimally describe the class (under the assumption of independence among attributes). Equals ClassEntropy divided by MeanMutualInformation.

MaxNominalAttDistinctValues

The maximum number of distinct values among attributes of the nominal type.

MinSkewnessOfNumericAtts

0.2

Minimum skewness among attributes of the numeric type.

PercentageOfInstancesWithMissingValues

Percentage of instances having missing values.

Quartile3KurtosisOfNumericAtts

0.26

Third quartile of kurtosis among attributes of the numeric type.

AutoCorrelation

Average class difference between consecutive instances.

RandomTreeDepth1Kappa

0.76

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

J48.00001.AUC

0.93

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .00001

MaxSkewnessOfNumericAtts

1.42

Maximum skewness among attributes of the numeric type.

MinStdDevOfNumericAtts

1.72

Minimum standard deviation of attributes of the numeric type.

PercentageOfMissingValues

Percentage of missing values.

Quartile3MeansOfNumericAtts

21.03

Third quartile of means among attributes of the numeric type.

CfsSubsetEval_DecisionStumpAUC

0.91

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2AUC

0.88

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.00001.ErrRate

0.3

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .00001

MaxStdDevOfNumericAtts

14.18

Maximum standard deviation of attributes of the numeric type.

MinorityClassPercentage

5.87

Percentage of instances belonging to the least frequent class.

PercentageOfNumericFeatures

87.5

Percentage of numeric attributes.

Quartile3MutualInformation

Third quartile of mutual information between the nominal attributes and the target attribute.

CfsSubsetEval_DecisionStumpErrRate

0.42

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2ErrRate

0.21

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.00001.Kappa

0.66

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .00001

MeanAttributeEntropy

Average entropy of the attributes.

MinorityClassSize

600

Number of instances belonging to the least frequent class.

PercentageOfSymbolicFeatures

12.5

Percentage of nominal attributes.

Quartile3SkewnessOfNumericAtts

0.79

Third quartile of skewness among attributes of the numeric type.

CfsSubsetEval_DecisionStumpKappa

0.53

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2Kappa

0.76

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.0001.AUC

0.93

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanKurtosisOfNumericAtts

-0.04

Mean kurtosis among attributes of the numeric type.

NaiveBayesAUC

0.77

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1AttributeEntropy

First quartile of entropy among attributes.

Quartile1KurtosisOfNumericAtts

-0.53

First quartile of kurtosis among attributes of the numeric type.

Quartile3StdDevOfNumericAtts

13.1

Third quartile of standard deviation of attributes of the numeric type.

CfsSubsetEval_NaiveBayesAUC

0.91

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth3AUC

0.88

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

J48.0001.ErrRate

0.3

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanMeansOfNumericAtts

15.68

Mean of means among attributes of the numeric type.

NaiveBayesErrRate

0.7

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1MeansOfNumericAtts

6.06

First quartile of means among attributes of the numeric type.

REPTreeDepth1AUC

0.94

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_NaiveBayesErrRate

0.42

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth3ErrRate

0.21

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

J48.0001.Kappa

0.66

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanMutualInformation

Average mutual information between the nominal attributes and the target attribute.

NaiveBayesKappa

0.22

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1MutualInformation

First quartile of mutual information between the nominal attributes and the target attribute.

REPTreeDepth1ErrRate

0.35

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_NaiveBayesKappa

0.53

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth3Kappa

0.76

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

J48.001.AUC

0.93

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanNoiseToSignalRatio

An estimate of the amount of irrelevant information in the attributes regarding the class. Equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation.

NumberOfBinaryFeatures

Number of binary attributes.

Quartile1SkewnessOfNumericAtts

0.35

First quartile of skewness among attributes of the numeric type.

REPTreeDepth1Kappa

0.6

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_kNN1NAUC

0.91

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

StdvNominalAttDistinctValues

Standard deviation of the number of distinct values among attributes of the nominal type.

J48.001.ErrRate

0.3

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanNominalAttDistinctValues

Average number of distinct values among the attributes of the nominal type.

Quartile1StdDevOfNumericAtts

7.8

First quartile of standard deviation of attributes of the numeric type.

REPTreeDepth2AUC

0.94

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 2

CfsSubsetEval_kNN1NErrRate

0.42

Error rate achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

kNN1NAUC

0.87

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk

J48.001.Kappa

0.66

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanSkewnessOfNumericAtts

0.67

Mean skewness among attributes of the numeric type.

Quartile2AttributeEntropy

Second quartile (Median) of entropy among attributes.

REPTreeDepth2ErrRate

0.35

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 2

CfsSubsetEval_kNN1NKappa

0.53

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

kNN1NErrRate

0.21

Error rate achieved by the landmarker weka.classifiers.lazy.IBk

MajorityClassPercentage

13.86

Percentage of instances belonging to the most frequent class.

MeanStdDevOfNumericAtts

9.5

Mean standard deviation of attributes of the numeric type.

Quartile2KurtosisOfNumericAtts

-0.23

Second quartile (Median) of kurtosis among attributes of the numeric type.

REPTreeDepth2Kappa

0.6

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 2

ClassEntropy

3.28

Entropy of the target attribute values.

kNN1NKappa

0.76

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk

MajorityClassSize

1416

Number of instances belonging to the most frequent class.

MinAttributeEntropy

Minimal entropy among attributes.

Quartile2MeansOfNumericAtts

15.25

Second quartile (Median) of means among attributes of the numeric type.

REPTreeDepth3AUC

0.94

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpAUC

0.6

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxAttributeEntropy

Maximum entropy among attributes.

MinKurtosisOfNumericAtts

-0.55

Minimum kurtosis among attributes of the numeric type.

Quartile2MutualInformation

Second quartile (Median) of mutual information between the nominal attributes and the target attribute.

REPTreeDepth3ErrRate

0.35

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpErrRate

0.83

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxKurtosisOfNumericAtts

1.25

Maximum kurtosis among attributes of the numeric type.

MinMeansOfNumericAtts

2.22

Minimum of means among attributes of the numeric type.

Show all 107 properties

58 tasks

Supervised Classification on artificial-characters

11744 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: Class

Supervised Classification on artificial-characters

38 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: Class

Supervised Classification on artificial-characters

1 runs - estimation_procedure: 5 times 2-fold Crossvalidation - target_feature: Class

Supervised Classification on artificial-characters

0 runs - estimation_procedure: 33% Holdout set - evaluation_measure: predictive_accuracy - target_feature: Class

Supervised Classification on artificial-characters

0 runs - estimation_procedure: 33% Holdout set - target_feature: Class

Learning Curve on artificial-characters

0 runs - estimation_procedure: 10-fold Learning Curve - target_feature:

Clustering on artificial-characters

0 runs - estimation_procedure: 50 times Clustering

Clustering on artificial-characters

0 runs - estimation_procedure: 50 times Clustering

Clustering on artificial-characters

0 runs - estimation_procedure: 50 times Clustering

Clustering on artificial-characters

0 runs - estimation_procedure: 50 times Clustering

Clustering on artificial-characters

0 runs - estimation_procedure: 50 times Clustering

Clustering on artificial-characters

0 runs

Clustering on artificial-characters

0 runs - target_feature: Class

Clustering on artificial-characters

0 runs - estimation_procedure: 50 times Clustering

Clustering on artificial-characters

0 runs - estimation_procedure: 50 times Clustering

Clustering on artificial-characters

0 runs - estimation_procedure: 50 times Clustering

Clustering on artificial-characters

0 runs - estimation_procedure: 50 times Clustering

Clustering on artificial-characters

0 runs - estimation_procedure: 50 times Clustering

Subgroup Discovery on artificial-characters

1299 runs - target_feature: Class

Subgroup Discovery on artificial-characters

1299 runs - target_feature: Class

Subgroup Discovery on artificial-characters

1299 runs - target_feature: Class

Subgroup Discovery on artificial-characters

1298 runs - target_feature: Class

Subgroup Discovery on artificial-characters

1298 runs - target_feature: Class

Subgroup Discovery on artificial-characters

1298 runs - target_feature: Class

Subgroup Discovery on artificial-characters

1297 runs - target_feature: Class

Subgroup Discovery on artificial-characters

1297 runs - target_feature: Class

Subgroup Discovery on artificial-characters

1297 runs - target_feature: Class

Subgroup Discovery on artificial-characters

1297 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Subgroup Discovery on artificial-characters

0 runs - target_feature: Class

Define a new task

Sign in

artificial-characters

8 features

107 properties

58 tasks