OpenML

REPTreeDepth1AUC

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 1

data quality

REPTreeDepth1ErrRate

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 1

data quality

REPTreeDepth1Kappa

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 1

data quality

REPTreeDepth2AUC

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 2

data quality

REPTreeDepth2ErrRate

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 2

data quality

REPTreeDepth2Kappa

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 2

data quality

REPTreeDepth3AUC

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 3

data quality

REPTreeDepth3ErrRate

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 3

data quality

REPTreeDepth3Kappa

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 3

data quality

RandomTreeDepth1AUC

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

data quality

RandomTreeDepth1ErrRate

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

data quality

RandomTreeDepth1Kappa

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

data quality

RandomTreeDepth2AUC

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

data quality

RandomTreeDepth2ErrRate

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

data quality

RandomTreeDepth2Kappa

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

data quality

RandomTreeDepth3AUC

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

data quality

RandomTreeDepth3ErrRate

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

data quality

RandomTreeDepth3Kappa

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

data quality

StdvNominalAttDistinctValues

Standard deviation of the number of distinct values among attributes of the nominal type.

data quality

kNN1NAUC

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk

data quality

kNN1NErrRate

Error rate achieved by the landmarker weka.classifiers.lazy.IBk

data quality

kNN1NKappa

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk

data quality

Interleaved Test then Train (Batch)

Description to be added

estimation procedure

binominal_test

Subgroup discovery measure.

evaluation measure

chi-squared

Subgroup discovery measure.

evaluation measure

cortana_quality

Subgroup discovery measure.

evaluation measure

coverage

The number of observations in the current subgroup.

evaluation measure

information_gain

Subgroup discovery measure.

evaluation measure

jaccard

Subgroup discovery measure.

evaluation measure

positives

The amount of positives in the subgroup

evaluation measure

probability

The probability of a subgroup.

evaluation measure

quality

The quality of the founded subgroup

evaluation measure

joint_entropy

Subgroup discovery measure.

evaluation measure

pattern_team_auroc10

Area under the ROC curve for the 10 best subgroups

evaluation measure

4-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

3-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

Custom 10-fold Crossvalidation

A custom holdout partitions a set of observations into a training set and a test set in a predefined way. This is typically done in order to compare the performance of different predictive algorithms…

estimation procedure

100 times 10-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

10-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

5 times 2-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

10 times 10-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

Leave one out

Leave-on-out is a special case of cross-validation where the number of folds equals the number of instances. Thus, models are always evaluated on one instance and trained on all others. Leave-one-out…

estimation procedure

10% Holdout set

Holdout or random subsampling is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In a k% holdout,…

estimation procedure

33% Holdout set

Holdout or random subsampling is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In a k% holdout,…

estimation procedure

10-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

5 times 2-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

10 times 10-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

Leave one out

Leave-on-out is a special case of cross-validation where the number of folds equals the number of instances. Thus, models are always evaluated on one instance and trained on all others. Leave-one-out…

estimation procedure

10% Holdout set

Holdout or random subsampling is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In a k% holdout,…

estimation procedure

33% Holdout set

Holdout or random subsampling is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In a k% holdout,…

estimation procedure

10-fold Learning Curve

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

10 times 10-fold Learning Curve

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

Interleaved Test then Train

Description to be added

estimation procedure

Custom Holdout

A custom holdout partitions a set of observations into a training set and a test set in a predefined way. This is typically done in order to compare the performance of different predictive algorithms…

estimation procedure

10-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

5 times 2-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

10 times 10-fold Crossvalidation

Cross-validation is a technique to evaluate predictive models by partitioning the original sample into a training set to train the model, and a test set to evaluate it. In k-fold cross-validation,…

estimation procedure

Leave one out

Leave-on-out is a special case of cross-validation where the number of folds equals the number of instances. Thus, models are always evaluated on one instance and trained on all others. Leave-one-out…

estimation procedure

BiasVarianceProfile

The weight of the bias component in the learning algorithm's error. I.e., the percentage of errors that can be attributed to bias error (underfitting) as opposed to variance error (overfitting).

flow quality

BiasWeightKohaviWolpert

empirically calculated average ratio of bias error in the total error, using Kohavi-Wolpert's definition of bias and variance

flow quality

BiasWeightWebb

empirically determined average ratio of bias error in the total error, using Webb's definition of bias and variance

flow quality

HandlesMissingValues

No data.

flow quality

HandlesNominalFeatures

No data.

flow quality

HandlesNominalTarget

No data.

flow quality

HandlesNonBinaryClasses

No data.

flow quality

HandlesNumericFeatures

No data.

flow quality

HandlesNumericTarget

No data.

flow quality

PerformsClassification

true if the algorithm can perform classification, false otherwise

flow quality

PerformsRegression

true if the algorithm can perform regression, false otherwise

flow quality

VarianceWeightKohaviWolpert

empirically calculated average ratio of variance error in the total error, using Kohavi-Wolpert's definition of bias and variance

flow quality

VarianceWeightWebb

empirically determined average ratio of variance error in the total error, using Webb's definition of bias and variance

flow quality

area_under_roc_curve

The area under the ROC curve (AUROC), calculated using the Mann-Whitney U-test. The curve is constructed by shifting the threshold for a positive prediction from 0 to 1, yielding a series of true…

evaluation measure

average_cost

No data.

evaluation measure

build_cpu_time

The time in seconds to build a single model on all data.

evaluation measure

build_memory

The memory, in bytes, needed to build a single model on all data.

evaluation measure

c_index

Used for survival Analysis

evaluation measure

class_complexity

Entropy, in bits, of the class distribution generated by the model's predictions. Calculated by taking the sum of -log2(predictedProb) over all instances, where predictedProb is the probability…

evaluation measure

class_complexity_gain

Entropy reduction, in bits, between the class distribution generated by the model's predictions, and the prior class distribution. Calculated by taking the difference of the prior_class_complexity and…

evaluation measure

confusion_matrix

The confusion matrix, or contingency table, is a table that summarizes the number of instances that were predicted to belong to a certain class, versus their actual class. It is an NxN matrix where N…

evaluation measure

correlation_coefficient

The sample Pearson correlation coefficient, or 'r': $$r = \frac{\sum ^n _{i=1}(X_i - \bar{X})(Y_i - \bar{Y})}{\sqrt{\sum ^n _{i=1}(X_i - \bar{X})^2} \sqrt{\sum ^n _{i=1}(Y_i - \bar{Y})^2}}$$ It…

evaluation measure

f_measure

The F-Measure is the harmonic mean of precision and recall, also known as the the traditional F-measure, balanced F-score, or F1-score: Formula: 2*Precision*Recall/(Precision+Recall) See:…

evaluation measure

kappa

Cohen's kappa coefficient is a statistical measure of agreement for qualitative (categorical) items: it measures the agreement of prediction with the true class – 1.0 signifies complete agreement.…

evaluation measure

kb_relative_information_score

The Kononenko and Bratko Information score, divided by the prior entropy of the class distribution. See: Kononenko, I., Bratko, I.: Information-based evaluation criterion for classi er's performance.…

evaluation measure

kohavi_wolpert_bias_squared

Bias component (squared) of the bias-variance decomposition as defined by Kohavi and Wolpert in: R. Kohavi & D. Wolpert (1996), Bias plus variance decomposition for zero-one loss functions, in Proc.…

evaluation measure

kohavi_wolpert_error

Error rate measured in the bias-variance decomposition as defined by Kohavi and Wolpert in: R. Kohavi & D. Wolpert (1996), Bias plus variance decomposition for zero-one loss functions, in Proc. of the…

evaluation measure

kohavi_wolpert_sigma_squared

Intrinsic error component (squared) of the bias-variance decomposition as defined by Kohavi and Wolpert in: R. Kohavi and D. Wolpert (1996), Bias plus variance decomposition for zero-one loss…

evaluation measure

kohavi_wolpert_variance

Variance component of the bias-variance decomposition as defined by Kohavi and Wolpert in: R. Kohavi and D. Wolpert (1996), Bias plus variance decomposition for zero-one loss functions, in Proc. of…

evaluation measure

kononenko_bratko_information_score

Kononenko and Bratko Information score. This measures predictive accuracy but eliminates the influence of prior probabilities. See: Kononenko, I., Bratko, I.: Information-based evaluation criterion…

evaluation measure

matthews_correlation_coefficient

The Matthews correlation coefficient takes into account true and false positives and negatives and is generally regarded as a balanced measure which can be used even if the classes are of very…

evaluation measure

mean_absolute_error

The mean absolute error (MAE) measures how close the model's predictions are to the actual target values. It is the sum of the absolute value of the difference of each instance prediction and the…

evaluation measure

mean_class_complexity

The entropy of the class distribution generated by the model (see class_complexity), divided by the number of instances in the input data.

evaluation measure

mean_class_complexity_gain

The entropy gain of the class distribution by the model over the prior distribution (see class_complexity_gain), divided by the number of instances in the input data.

evaluation measure

mean_f_measure

Unweighted(!) macro-average F-Measure. In macro-averaging, F-measure is computed locally over each category ?rst and then the average over all categories is taken.

evaluation measure

mean_kononenko_bratko_information_score

Kononenko and Bratko Information score, see kononenko_bratko_information_score, divided by the number of instances in the input data. See: Kononenko, I., Bratko, I.: Information-based evaluation…

evaluation measure

mean_precision

Unweighted(!) macro-average Precision. In macro-averaging, Precision is computed locally over each category ?rst and then the average over all categories is taken.

evaluation measure

mean_prior_absolute_error

The mean prior absolute error (MPAE) is the mean absolute error (see mean_absolute_error) of the prior (e.g., default class prediction). See: http://en.wikipedia.org/wiki/Mean_absolute_error

evaluation measure

mean_prior_class_complexity

The entropy of the class distribution of the prior (see prior_class_complexity), divided by the number of instances in the input data.

evaluation measure

mean_recall

Unweighted(!) macro-average Recall. In macro-averaging, Recall is computed locally over each category ?rst and then the average over all categories is taken.

evaluation measure

mean_weighted_area_under_roc_curve

The macro weighted (by class size) average area_under_ROC_curve (AUROC). In macro-averaging, AUROC is computed locally over each category ?rst and then the average over all categories is taken,…

evaluation measure

mean_weighted_f_measure

The macro weighted (by class size) average F-Measure. In macro-averaging, F-measure is computed locally over each category ?rst and then the average over all categories is taken, weighted by the…

evaluation measure

Sign in

Filter results by: