OpenML

gas-drift-different-concentrations

Status: active. Format: ARFF. Visibility: public. Uploaded 22-05-2015 by Rafael Gomes Mantovani.

Author: Alexander Vergara
Source: UCI

Please cite:
1. A Vergara, S Vembu, T Ayhan, M Ryan, M Homer, R Huerta. "Chemical gas sensor drift compensation using classifier ensembles." Sensors and Actuators B: Chemical 166 (2012): 320-329.
2. I Rodriguez-Lujan, J Fonollosa, A Vergara, M Homer, R Huerta. "On the calibration of sensor arrays for pattern recognition using the minimal number of experiments." Chemometrics and Intelligent Laboratory Systems 130 (2014): 123-134.

Source:

Creators:
Alexander Vergara (vergara '@' ucsd.edu), BioCircuits Institute, University of California San Diego, San Diego, California, USA

Donors of the Dataset:
Alexander Vergara (vergara '@' ucsd.edu)
Jordi Fonollosa (fonollosa '@' ucsd.edu)
Irene Rodriguez-Lujan (irrodriguezlujan '@' ucsd.edu)
Ramon Huerta (rhuerta '@' ucsd.edu)

Data Set Information:

This data set contains 13,910 measurements from 16 chemical sensors exposed to 6 gases at different concentration levels. It extends the Gas Sensor Array Drift Dataset by adding the concentration level to which the sensors were exposed in each measurement. The primary purpose of making the dataset freely accessible online is to give the sensor and artificial intelligence research communities an extensive dataset for developing and testing strategies for a wide variety of tasks, including sensor drift compensation, classification, and regression. The dataset may be used for research purposes only; commercial use is excluded. Citation of both Vergara et al., "Chemical gas sensor drift compensation using classifier ensembles", and Rodriguez-Lujan et al., "On the calibration of sensor arrays for pattern recognition using the minimal number of experiments", is required (see the citations above).

The dataset was gathered from January 2008 to February 2011 (36 months) in a gas delivery platform facility at the ChemoSignals Laboratory in the BioCircuits Institute, University of California San Diego. The measurement platform delivers the desired concentrations of the chemical substances of interest with high accuracy and in a highly reproducible manner, thereby minimizing the common mistakes caused by human intervention and making it possible to concentrate exclusively on the chemical sensors. See Vergara et al. (2012) for more details on the experimental setup.

The resulting dataset comprises recordings from six distinct pure gaseous substances, namely Ammonia, Acetaldehyde, Acetone, Ethylene, Ethanol, and Toluene, dosed at a wide variety of concentration levels in the intervals (50,1000), (5,500), (12,1000), (10,300), (10,600), and (10,100) ppmv, respectively.

Attribute Information:

The sensor responses are read as the resistance across the active layer of each sensor. Each measurement therefore produced a 16-channel time series, and each channel is represented by a set of features reflecting the dynamic processes occurring at the sensor surface in reaction to the chemical substance being evaluated. Two types of features were considered in the creation of this dataset:

(i) The so-called steady-state feature (DR), defined as the maximal resistance change with respect to the baseline, and its normalized version (DR divided by the acquired value when the chemical vapor is present in the test chamber).

(ii) A set of features reflecting the sensor dynamics of the increasing/decaying transient portion of the sensor response during the entire measurement. These features come from a transformation, borrowed from the field of econometrics and originally introduced to the chemo-sensing community by Muezzinoglu et al. (2009), that converts the transient portion of the sensor response into a real scalar by estimating the maximum/minimum value y[k] for the rising/decaying portion of the exponential moving average of the sensor response:

y[k] = (1 - α) y[k-1] + α (R[k] - R[k-1])

where R[k] is the sensor resistance measured at time k and α (written "Alfa" in the original description) is a scalar smoothing parameter between 0 and 1. Three values of α (0.1, 0.01, and 0.001) were used to obtain three feature values from the rising portion of the sensor response, and the same three values of α were used to obtain three additional features from the decaying portion, so that the features cover the entire sensor response dynamics.

Each sensor thus contributes 8 features, resulting in a 128-dimensional feature vector (8 features x 16 sensors) organized as follows:

DR_1, |DR|_1, EMAi0.001_1, EMAi0.01_1, EMAi0.1_1, EMAd0.001_1, EMAd0.01_1, EMAd0.1_1,
DR_2, |DR|_2, EMAi0.001_2, EMAi0.01_2, EMAi0.1_2, EMAd0.001_2, EMAd0.01_2, EMAd0.1_2,
...,
DR_16, |DR|_16, EMAi0.001_16, EMAi0.01_16, EMAi0.1_16, EMAd0.001_16, EMAd0.01_16, EMAd0.1_16

where:
DR_j and |DR|_j are the steady-state resistance change and its normalized version, respectively.
EMAi0.001_j, EMAi0.01_j, and EMAi0.1_j are the exponential moving average features of the rising transient portion of the sensor response for α = 0.001, 0.01, and 0.1, respectively.
EMAd0.001_j, EMAd0.01_j, and EMAd0.1_j are the exponential moving average features of the decaying transient portion of the sensor response for α = 0.001, 0.01, and 0.1, respectively.
The index j = 1, ..., 16 is the sensor number.

For processing purposes, the dataset is organized into ten batches, each containing the number of measurements per class and month indicated in the tables below. This reorganization was done so that each batch contains a sufficient number of experiments, distributed as uniformly as possible.

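The exponential moving average transform described above can be sketched in Python as follows. This is a minimal illustration, not the authors' extraction code: the function name `ema_transient_features`, the initial condition y[0] = 0, and taking the maximum and minimum over the whole recording (rather than over separately segmented rising and decaying portions) are assumptions made here.

```python
def ema_transient_features(resistance, alphas=(0.001, 0.01, 0.1)):
    """Sketch of y[k] = (1 - a) * y[k-1] + a * (R[k] - R[k-1]).

    Returns {alpha: (max_y, min_y)}. The maximum stands in for the
    rising-portion feature and the minimum for the decaying-portion
    feature. Assumption: y[0] = 0 and no explicit segmentation.
    """
    features = {}
    for alpha in alphas:
        y = 0.0          # assumed initial condition
        y_max = 0.0
        y_min = 0.0
        # Walk over consecutive resistance samples R[k-1], R[k].
        for previous, current in zip(resistance, resistance[1:]):
            y = (1.0 - alpha) * y + alpha * (current - previous)
            y_max = max(y_max, y)
            y_min = min(y_min, y)
        features[alpha] = (y_max, y_min)
    return features


# Made-up resistance trace: rises, plateaus, then decays back to baseline.
trace = [10.0, 10.0, 12.0, 15.0, 15.0, 13.0, 10.0, 10.0]
ema_features = ema_transient_features(trace)
for alpha, (rising, decaying) in ema_features.items():
    print(alpha, rising, decaying)
```

Running the function once per sensor channel would give the six transient features per sensor; together with the two steady-state features this accounts for the 8 features per sensor.
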
Batch ID	Month IDs
Batch 1	Months 1 and 2
Batch 2	Months 3, 4, 8, 9 and 10
Batch 3	Months 11, 12, and 13
Batch 4	Months 14 and 15
Batch 5	Month 16
Batch 6	Months 17, 18, 19, and 20
Batch 7	Month 21
Batch 8	Months 22 and 23
Batch 9	Months 24 and 30
Batch 10	Month 36

Number of measurements per batch and gas:

Batch ID	Ethanol	Ethylene	Ammonia	Acetaldehyde	Acetone	Toluene
Batch 1	83	30	70	98	90	74
Batch 2	100	109	532	334	164	5
Batch 3	216	240	275	490	365	0
Batch 4	12	30	12	43	64	0
Batch 5	20	46	63	40	28	0
Batch 6	110	29	606	574	514	467
Batch 7	360	744	630	662	649	568
Batch 8	40	33	143	30	30	18
Batch 9	100	75	78	55	61	101
Batch 10	600	600	600	600	600	600

The dataset is organized in files, each representing a different batch. Within the files, each line represents a measurement. The first character (1-6) codes the analyte and is followed by the concentration level:

1: Ethanol; 2: Ethylene; 3: Ammonia; 4: Acetaldehyde; 5: Acetone; 6: Toluene

The data format follows the libsvm coding style x:v, where x is the feature number and v is the value of that feature. For example, in

1;10.000000 1:15596.162100 2:1.868245 3:2.371604 4:2.803678 5:7.512213 … 128:-2.654529

the leading 1 is the class number (in this case Ethanol), the gas concentration level was 10 ppmv, and the remaining 128 columns list the feature values of the measurement, organized as described above.
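A line in the batch-file format described above could be parsed roughly as follows. This is a sketch under stated assumptions: the helper name `parse_measurement` and the `GAS_NAMES` table are introduced here for illustration, and the sample line keeps only the feature tokens that are actually shown in the description (the elided middle features are simply left out).

```python
# Gas codes as listed in the dataset description.
GAS_NAMES = {
    1: "Ethanol",
    2: "Ethylene",
    3: "Ammonia",
    4: "Acetaldehyde",
    5: "Acetone",
    6: "Toluene",
}


def parse_measurement(line):
    """Split 'class;concentration idx:value idx:value ...' into its parts."""
    label_part, *feature_tokens = line.split()
    gas_code, concentration = label_part.split(";")
    features = {}
    for token in feature_tokens:
        index, value = token.split(":")
        features[int(index)] = float(value)
    return int(gas_code), float(concentration), features


# Abbreviated sample line: only the tokens visible in the description.
sample = ("1;10.000000 1:15596.162100 2:1.868245 3:2.371604 "
          "4:2.803678 5:7.512213 128:-2.654529")
gas_code, concentration_ppmv, feature_values = parse_measurement(sample)
print(GAS_NAMES[gas_code], concentration_ppmv, feature_values[1])
```

Storing the features in a dictionary keyed by feature number keeps the sketch tolerant of omitted indices, which the libsvm convention permits.
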

130 features

Class (target)	nominal	6 unique values 0 missing
V1	numeric	13904 unique values 0 missing
V2	numeric	13890 unique values 0 missing
V3	numeric	13904 unique values 0 missing
V4	numeric	13905 unique values 0 missing
V5	numeric	13904 unique values 0 missing
V6	numeric	13897 unique values 0 missing
V7	numeric	13895 unique values 0 missing
V8	numeric	13907 unique values 0 missing
V9	numeric	13897 unique values 0 missing
V10	numeric	13888 unique values 0 missing
V11	numeric	13905 unique values 0 missing
V12	numeric	13909 unique values 0 missing
V13	numeric	13906 unique values 0 missing
V14	numeric	13906 unique values 0 missing
V15	numeric	13902 unique values 0 missing
V16	numeric	13908 unique values 0 missing
V17	numeric	13910 unique values 0 missing
V18	numeric	13892 unique values 0 missing
V19	numeric	13896 unique values 0 missing
V20	numeric	13903 unique values 0 missing
V21	numeric	13909 unique values 0 missing
V22	numeric	13883 unique values 0 missing
V23	numeric	13903 unique values 0 missing
V24	numeric	13899 unique values 0 missing
V25	numeric	13896 unique values 0 missing
V26	numeric	13885 unique values 0 missing
V27	numeric	13891 unique values 0 missing
V28	numeric	13892 unique values 0 missing
V29	numeric	13893 unique values 0 missing
V30	numeric	13872 unique values 0 missing
V31	numeric	13886 unique values 0 missing
V32	numeric	13891 unique values 0 missing
V33	numeric	13904 unique values 0 missing
V34	numeric	13874 unique values 0 missing
V35	numeric	13855 unique values 0 missing
V36	numeric	13894 unique values 0 missing
V37	numeric	13886 unique values 0 missing
V38	numeric	13835 unique values 0 missing
V39	numeric	13869 unique values 0 missing
V40	numeric	13891 unique values 0 missing
V41	numeric	13908 unique values 0 missing
V42	numeric	13877 unique values 0 missing
V43	numeric	13864 unique values 0 missing
V44	numeric	13891 unique values 0 missing
V45	numeric	13894 unique values 0 missing
V46	numeric	13820 unique values 0 missing
V47	numeric	13859 unique values 0 missing
V48	numeric	13882 unique values 0 missing
V49	numeric	13908 unique values 0 missing
V50	numeric	13898 unique values 0 missing
V51	numeric	13906 unique values 0 missing
V52	numeric	13908 unique values 0 missing
V53	numeric	13907 unique values 0 missing
V54	numeric	13893 unique values 0 missing
V55	numeric	13903 unique values 0 missing
V56	numeric	13903 unique values 0 missing
V57	numeric	13909 unique values 0 missing
V58	numeric	13897 unique values 0 missing
V59	numeric	13900 unique values 0 missing
V60	numeric	13905 unique values 0 missing
V61	numeric	13906 unique values 0 missing
V62	numeric	13902 unique values 0 missing
V63	numeric	13901 unique values 0 missing
V64	numeric	13904 unique values 0 missing
V65	numeric	13899 unique values 0 missing
V66	numeric	13889 unique values 0 missing
V67	numeric	13902 unique values 0 missing
V68	numeric	13906 unique values 0 missing
V69	numeric	13907 unique values 0 missing
V70	numeric	13891 unique values 0 missing
V71	numeric	13907 unique values 0 missing
V72	numeric	13906 unique values 0 missing
V73	numeric	13904 unique values 0 missing
V74	numeric	13887 unique values 0 missing
V75	numeric	13904 unique values 0 missing
V76	numeric	13903 unique values 0 missing
V77	numeric	13905 unique values 0 missing
V78	numeric	13897 unique values 0 missing
V79	numeric	13898 unique values 0 missing
V80	numeric	13900 unique values 0 missing
V81	numeric	13908 unique values 0 missing
V82	numeric	13888 unique values 0 missing
V83	numeric	13906 unique values 0 missing
V84	numeric	13906 unique values 0 missing
V85	numeric	13905 unique values 0 missing
V86	numeric	13892 unique values 0 missing
V87	numeric	13899 unique values 0 missing
V88	numeric	13903 unique values 0 missing
V89	numeric	13908 unique values 0 missing
V90	numeric	13900 unique values 0 missing
V91	numeric	13903 unique values 0 missing
V92	numeric	13905 unique values 0 missing
V93	numeric	13903 unique values 0 missing
V94	numeric	13886 unique values 0 missing
V95	numeric	13896 unique values 0 missing
V96	numeric	13902 unique values 0 missing
V97	numeric	13902 unique values 0 missing
V98	numeric	13882 unique values 0 missing
V99	numeric	13872 unique values 0 missing
V100	numeric	13905 unique values 0 missing
V101	numeric	13902 unique values 0 missing
V102	numeric	13854 unique values 0 missing
V103	numeric	13882 unique values 0 missing
V104	numeric	13895 unique values 0 missing
V105	numeric	13910 unique values 0 missing
V106	numeric	13885 unique values 0 missing
V107	numeric	13876 unique values 0 missing
V108	numeric	13894 unique values 0 missing
V109	numeric	13895 unique values 0 missing
V110	numeric	13850 unique values 0 missing
V111	numeric	13875 unique values 0 missing
V112	numeric	13875 unique values 0 missing
V113	numeric	13905 unique values 0 missing
V114	numeric	13898 unique values 0 missing
V115	numeric	13903 unique values 0 missing
V116	numeric	13908 unique values 0 missing
V117	numeric	13906 unique values 0 missing
V118	numeric	13898 unique values 0 missing
V119	numeric	13903 unique values 0 missing
V120	numeric	13907 unique values 0 missing
V121	numeric	13909 unique values 0 missing
V122	numeric	13898 unique values 0 missing
V123	numeric	13903 unique values 0 missing
V124	numeric	13907 unique values 0 missing
V125	numeric	13903 unique values 0 missing
V126	numeric	13898 unique values 0 missing
V127	numeric	13905 unique values 0 missing
V128	numeric	13907 unique values 0 missing
V129	numeric	59 unique values 0 missing
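The generic column names V1 to V128 can be related to the feature names given in the dataset description with a small helper. This is a hedged sketch: it assumes that V1 to V128 follow the documented order (8 features per sensor, sensors 1 to 16), which the page does not state explicitly, and the function name `feature_name` is hypothetical. V129, with only 59 unique values, is presumably the concentration level, but that is also an assumption.

```python
# Per-sensor feature order as given in the dataset description.
FEATURE_KINDS = (
    "DR", "|DR|",
    "EMAi0.001", "EMAi0.01", "EMAi0.1",
    "EMAd0.001", "EMAd0.01", "EMAd0.1",
)


def feature_name(v_index):
    """Map a column number 1..128 (as in V1..V128) to a descriptive name.

    Assumption: columns follow the documented sensor-major ordering.
    """
    if not 1 <= v_index <= 128:
        raise ValueError("expected a column number between 1 and 128")
    sensor, kind = divmod(v_index - 1, len(FEATURE_KINDS))
    return f"{FEATURE_KINDS[kind]}_{sensor + 1}"


for v in (1, 2, 9, 128):
    print(f"V{v}", feature_name(v))
```
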

107 properties

NumberOfInstances

13910

Number of instances (rows) of the dataset.

NumberOfFeatures

130

Number of attributes (columns) of the dataset.

NumberOfClasses

Number of distinct values of the target attribute (if it is nominal).

NumberOfMissingValues

Number of missing values in the dataset.

NumberOfInstancesWithMissingValues

Number of instances with at least one value missing.

NumberOfNumericFeatures

129

Number of numeric attributes.

NumberOfSymbolicFeatures

Number of nominal attributes.

Quartile3MutualInformation

Third quartile of mutual information between the nominal attributes and the target attribute.

CfsSubsetEval_DecisionStumpErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.00001.Kappa

0.96

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .00001

MeanAttributeEntropy

Average entropy of the attributes.

MinorityClassSize

1641

Number of instances belonging to the least frequent class.

PercentageOfSymbolicFeatures

0.77

Percentage of nominal attributes.

Quartile3SkewnessOfNumericAtts

2.59

Third quartile of skewness among attributes of the numeric type.

CfsSubsetEval_DecisionStumpKappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.0001.AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanKurtosisOfNumericAtts

1029.19

Mean kurtosis among attributes of the numeric type.

NaiveBayesAUC

0.84

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1AttributeEntropy

First quartile of entropy among attributes.

Quartile3StdDevOfNumericAtts

25.11

Third quartile of standard deviation of attributes of the numeric type.

CfsSubsetEval_NaiveBayesAUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth3AUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

J48.0001.ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanMeansOfNumericAtts

2771.04

Mean of means among attributes of the numeric type.

NaiveBayesErrRate

0.42

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1KurtosisOfNumericAtts

4.25

First quartile of kurtosis among attributes of the numeric type.

REPTreeDepth1AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_NaiveBayesErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth3ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

J48.0001.Kappa

0.96

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .0001

MeanMutualInformation

Average mutual information between the nominal attributes and the target attribute.

NaiveBayesKappa

0.5

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes

Quartile1MeansOfNumericAtts

-4.73

First quartile of means among attributes of the numeric type.

REPTreeDepth1ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_NaiveBayesKappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth3Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

J48.001.AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanNoiseToSignalRatio

An estimate of the amount of irrelevant information in the attributes regarding the class. Equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation.

NumberOfBinaryFeatures

Number of binary attributes.

Quartile1MutualInformation

First quartile of mutual information between the nominal attributes and the target attribute.

REPTreeDepth1Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 1

CfsSubsetEval_kNN1NAUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

StdvNominalAttDistinctValues

Standard deviation of the number of distinct values among attributes of the nominal type.

J48.001.ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanNominalAttDistinctValues

Average number of distinct values among the attributes of the nominal type.

Quartile1SkewnessOfNumericAtts

-2.27

First quartile of skewness among attributes of the numeric type.

REPTreeDepth2AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 2

CfsSubsetEval_kNN1NErrRate

0.05

Error rate achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

kNN1NAUC

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk

J48.001.Kappa

0.96

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .001

MeanSkewnessOfNumericAtts

4.61

Mean skewness among attributes of the numeric type.

Quartile1StdDevOfNumericAtts

4.36

First quartile of standard deviation of attributes of the numeric type.

REPTreeDepth2ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 2

CfsSubsetEval_kNN1NKappa

0.94

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

kNN1NErrRate

0.01

Error rate achieved by the landmarker weka.classifiers.lazy.IBk

MajorityClassPercentage

21.63

Percentage of instances belonging to the most frequent class.

MeanStdDevOfNumericAtts

2709.48

Mean standard deviation of attributes of the numeric type.

Quartile2AttributeEntropy

Second quartile (Median) of entropy among attributes.

Quartile2KurtosisOfNumericAtts

10.15

Second quartile (Median) of kurtosis among attributes of the numeric type.

REPTreeDepth2Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 2

ClassEntropy

2.55

Entropy of the target attribute values.

kNN1NKappa

0.99

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk

MajorityClassSize

3009

Number of instances belonging to the most frequent class.

MinAttributeEntropy

Minimal entropy among attributes.

Quartile2MeansOfNumericAtts

5.4

Second quartile (Median) of means among attributes of the numeric type.

REPTreeDepth3AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpAUC

0.71

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxAttributeEntropy

Maximum entropy among attributes.

MinKurtosisOfNumericAtts

-0.07

Minimum kurtosis among attributes of the numeric type.

Quartile2MutualInformation

Second quartile (Median) of mutual information between the nominal attributes and the target attribute.

REPTreeDepth3ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpErrRate

0.61

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxKurtosisOfNumericAtts

13909.09

Maximum kurtosis among attributes of the numeric type.

MinMeansOfNumericAtts

-72.75

Minimum of means among attributes of the numeric type.

Quartile2SkewnessOfNumericAtts

1.3

Second quartile (Median) of skewness among attributes of the numeric type.

REPTreeDepth3Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 3

DecisionStumpKappa

0.22

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump

MaxMeansOfNumericAtts

57340.1

Maximum of means among attributes of the numeric type.

MinMutualInformation

Minimal mutual information between the nominal attributes and the target attribute.

Quartile2StdDevOfNumericAtts

10.05

Second quartile (Median) of standard deviation of attributes of the numeric type.

RandomTreeDepth1AUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

Dimensionality

0.01

Number of attributes divided by the number of instances.

MaxMutualInformation

Maximum mutual information between the nominal attributes and the target attribute.

MinNominalAttDistinctValues

The minimal number of distinct values among attributes of the nominal type.

PercentageOfBinaryFeatures

Percentage of binary attributes.

Quartile3AttributeEntropy

Third quartile of entropy among attributes.

RandomTreeDepth1ErrRate

0.04

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

EquivalentNumberOfAtts

Number of attributes needed to optimally describe the class (under the assumption of independence among attributes). Equals ClassEntropy divided by MeanMutualInformation.

MaxNominalAttDistinctValues

The maximum number of distinct values among attributes of the nominal type.

MinSkewnessOfNumericAtts

-87.65

Minimum skewness among attributes of the numeric type.

PercentageOfInstancesWithMissingValues

Percentage of instances having missing values.

Quartile3KurtosisOfNumericAtts

80.48

Third quartile of kurtosis among attributes of the numeric type.

AutoCorrelation

0.59

Average class difference between consecutive instances.

RandomTreeDepth1Kappa

0.95

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

J48.00001.AUC

0.99

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .00001

MaxSkewnessOfNumericAtts

117.93

Maximum skewness among attributes of the numeric type.

MinStdDevOfNumericAtts

0.53

Minimum standard deviation of attributes of the numeric type.

PercentageOfMissingValues

Percentage of missing values.

Quartile3MeansOfNumericAtts

15.4

Third quartile of means among attributes of the numeric type.

CfsSubsetEval_DecisionStumpAUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

RandomTreeDepth2AUC

0.97

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

J48.00001.ErrRate

0.03

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .00001

MaxStdDevOfNumericAtts

69844.79

Maximum standard deviation of attributes of the numeric type.

MinorityClassPercentage

11.8

Percentage of instances belonging to the least frequent class.
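Several of the simple properties above can be cross-checked against the per-batch counts in the dataset description. The sketch below re-enters those counts by hand and recomputes the class sizes, class entropy, dimensionality, and percentage of nominal features. It assumes that entropy is measured in bits (base-2 logarithm) and that the only nominal feature is the class attribute (130 features minus 129 numeric ones); neither is stated on the page.

```python
import math

GASES = ("Ethanol", "Ethylene", "Ammonia", "Acetaldehyde", "Acetone", "Toluene")

# Measurements per batch (rows) and gas (columns), from the description.
BATCH_COUNTS = [
    (83, 30, 70, 98, 90, 74),
    (100, 109, 532, 334, 164, 5),
    (216, 240, 275, 490, 365, 0),
    (12, 30, 12, 43, 64, 0),
    (20, 46, 63, 40, 28, 0),
    (110, 29, 606, 574, 514, 467),
    (360, 744, 630, 662, 649, 568),
    (40, 33, 143, 30, 30, 18),
    (100, 75, 78, 55, 61, 101),
    (600, 600, 600, 600, 600, 600),
]

# Total number of measurements per gas across all batches.
class_totals = {
    gas: sum(row[i] for row in BATCH_COUNTS) for i, gas in enumerate(GASES)
}
n_instances = sum(class_totals.values())
majority_size = max(class_totals.values())
minority_size = min(class_totals.values())

# Shannon entropy of the class distribution, in bits (assumed unit).
class_entropy = -sum(
    (count / n_instances) * math.log2(count / n_instances)
    for count in class_totals.values()
)

n_features = 130   # NumberOfFeatures
n_numeric = 129    # NumberOfNumericFeatures
dimensionality = n_features / n_instances
pct_symbolic = 100 * (n_features - n_numeric) / n_features

print("instances:", n_instances)
print("majority / minority size:", majority_size, minority_size)
print("majority %:", round(100 * majority_size / n_instances, 2))
print("minority %:", round(100 * minority_size / n_instances, 2))
print("class entropy:", round(class_entropy, 2))
print("dimensionality:", round(dimensionality, 2))
print("symbolic features %:", round(pct_symbolic, 2))
```

Under these assumptions the recomputed figures agree with the listed NumberOfInstances, MajorityClassSize, MinorityClassSize, and their percentages, and land close to the listed ClassEntropy, Dimensionality, and PercentageOfSymbolicFeatures after rounding.
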