OpenML

mean_weighted_precision

The macro weighted (by class size) average Precision. In macro-averaging, Precision is computed locally over each category ?rst and then the average over all categories is taken, weighted by the…

evaluation measure

weighted_recall

The macro weighted (by class size) average Recall. In macro-averaging, Recall is computed locally over each category ?rst and then the average over all categories is taken, weighted by the number of…

evaluation measure

number_of_instances

The number of instances used for this evaluation.

evaluation measure

os_information

Default information about OS, JVM, installations, etc.

evaluation measure

precision

Precision is defined as the number of true positive (TP) predictions, divided by the sum of the number of true positives and false positives (TP+FP): $$\text{Precision}=\frac{tp}{tp+fp} \, $$ It is…

evaluation measure

predictive_accuracy

The Predictive Accuracy is the percentage of instances that are classified correctly. Is it 1 - ErrorRate.

evaluation measure

prior_class_complexity

Entropy, in bits, of the prior class distribution. Calculated by taking the sum of -log2(priorProb) over all instances, where priorProb is the prior probability of the actual class for that instance.…

evaluation measure

prior_entropy

Entropy, in bits, of the prior class distribution. Calculated by taking the sum of -log2(priorProb) over all instances, where priorProb is the prior probability of the actual class for that instance.…

evaluation measure

ram_hours

Every GB of RAM deployed for 1 hour equals one RAM-Hour.

evaluation measure

recall

Recall is defined as the number of true positive (TP) predictions, divided by the sum of the number of true positives and false negatives (TP+FN): $$\text{Recall}=\frac{tp}{tp+fn} \, $$ It is also…

evaluation measure

relative_absolute_error

The Relative Absolute Error (RAE) is the mean absolute error (MAE) divided by the mean prior absolute error (MPAE).

evaluation measure

root_mean_prior_squared_error

The Root Mean Prior Squared Error (RMPSE) is the Root Mean Squared Error (RMSE) of the prior (e.g., the default class prediction).

evaluation measure

root_mean_squared_error

The Root Mean Squared Error (RMSE) measures how close the model's predictions are to the actual target values. It is the square root of the Mean Squared Error (MSE), the sum of the squared differences…

evaluation measure

root_relative_squared_error

The Root Relative Squared Error (RRSE) is the Root Mean Squared Error (RMSE) divided by the Root Mean Prior Squared Error (RMPSE). See root_mean_squared_error and root_mean_prior_squared_error.

evaluation measure

run_cpu_time

Runtime in seconds of the entire run. In the case of cross-validation runs, this will include all iterations.

evaluation measure

run_memory

Amount of memory, in bytes, used during the entire run.

evaluation measure

run_virtual_memory

Amount of virtual memory, in bytes, used during the entire run.

evaluation measure

scimark_benchmark

A benchmark tool which measures (single core) CPU performance on the JVM.

evaluation measure

single_point_area_under_roc_curve

No data.

evaluation measure

total_cost

No data.

evaluation measure

unclassified_instance_count

Number of instances that were not classified by the model.

evaluation measure

usercpu_time_millis

The time in milliseconds to build and test a single model on all data.

evaluation measure

usercpu_time_millis_testing

The time in milliseconds to test a single model on all data.

evaluation measure

usercpu_time_millis_training

The time in milliseconds to build a single model on all data.

evaluation measure

webb_bias

Bias component (squared) of the bias-variance decomposition as defined by Webb in: Geoffrey I. Webb (2000), MultiBoosting: A Technique for Combining Boosting and Wagging, Machine Learning, 40(2),…

evaluation measure

webb_error

Intrinsic error component (squared) of the bias-variance decomposition as defined by Webb in: Geoffrey I. Webb (2000), MultiBoosting: A Technique for Combining Boosting and Wagging, Machine Learning,…

evaluation measure

webb_variance

Variance component of the bias-variance decomposition as defined by Webb in: Geoffrey I. Webb (2000), MultiBoosting: A Technique for Combining Boosting and Wagging, Machine Learning, 40(2), pages…

evaluation measure

unweighted_recall

The macro unweighted (ignoring class size) average Recall. In macro-averaging, Recall is computed locally over each category ?rst and then the average over all categories is taken, weighted by the…

evaluation measure

mean_weighted_recall

The macro weighted (by class size) average Recall. In macro-averaging, Recall is computed locally over each category ?rst and then the average over all categories is taken, weighted by the number of…

evaluation measure

Sign in

Filter results by: