OpenML
Underwater
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
251 instances - 3 features - classes - 0 missing values
Malaria is a life-threatening disease caused by parasites that are transmitted to people through the bites of infected female Anopheles mosquitoes. It is preventable and curable. In 2018, there were…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
856 instances - 11 features - classes - 1288 missing values
0 likes - 0 downloads - 0 reach - area_under_roc_curve: 0.9971, f_measure: 0.9667, kappa: 0.95, kb_relative_information_score: 0.9257, mean_absolute_error: 0.0413, mean_prior_absolute_error: 0.4444, weighted_recall: 0.9667, number_of_instances: 150, precision: 0.9668, predictive_accuracy: 0.9667, prior_entropy: 1.585, relative_absolute_error: 0.093, root_mean_prior_squared_error: 0.4714, root_mean_squared_error: 0.1332, root_relative_squared_error: 0.2825, unweighted_recall: 0.9667,
0 likes - 0 downloads - 0 reach - area_under_roc_curve: 0.9971, f_measure: 0.9667, kappa: 0.95, kb_relative_information_score: 0.9257, mean_absolute_error: 0.0413, mean_prior_absolute_error: 0.4444, weighted_recall: 0.9667, number_of_instances: 150, precision: 0.9668, predictive_accuracy: 0.9667, prior_entropy: 1.585, relative_absolute_error: 0.093, root_mean_prior_squared_error: 0.4714, root_mean_squared_error: 0.1332, root_relative_squared_error: 0.2825, unweighted_recall: 0.9667,
0 likes - 0 downloads - 0 reach - area_under_roc_curve: 0.9364, f_measure: 0.8798, kappa: 0.7332, kb_relative_information_score: 0.597, mean_absolute_error: 0.1945, mean_prior_absolute_error: 0.456, weighted_recall: 0.8818, number_of_instances: 19020, precision: 0.8815, predictive_accuracy: 0.8818, prior_entropy: 0.9355, relative_absolute_error: 0.4266, root_mean_prior_squared_error: 0.4775, root_mean_squared_error: 0.2991, root_relative_squared_error: 0.6264, unweighted_recall: 0.8563,
0 likes - 0 downloads - 0 reach - area_under_roc_curve: 0.9364, f_measure: 0.8798, kappa: 0.7332, kb_relative_information_score: 0.597, mean_absolute_error: 0.1945, mean_prior_absolute_error: 0.456, weighted_recall: 0.8818, number_of_instances: 19020, precision: 0.8815, predictive_accuracy: 0.8818, prior_entropy: 0.9355, relative_absolute_error: 0.4266, root_mean_prior_squared_error: 0.4775, root_mean_squared_error: 0.2991, root_relative_squared_error: 0.6264, unweighted_recall: 0.8563,
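The relative error figures in the run entries above appear to be the model's error divided by the error of a prior-only baseline. The sketch below checks that reading against the listed numbers. The helper name `relative_error` is hypothetical, and the ratio interpretation is an assumption inferred from the values rather than something the listing states.

```python
def relative_error(model_error: float, prior_error: float) -> float:
    """Ratio of a model's error to the error of a prior-only baseline (assumed definition)."""
    return model_error / prior_error


# Values copied from the 150-instance run entries.
rae_150 = relative_error(0.0413, 0.4444)    # mean_absolute_error / mean_prior_absolute_error
rrse_150 = relative_error(0.1332, 0.4714)   # root_mean_squared_error / root_mean_prior_squared_error

# Values copied from the first 19020-instance run entry.
rae_19020 = relative_error(0.1945, 0.456)
rrse_19020 = relative_error(0.2991, 0.4775)

for name, value in [
    ("relative_absolute_error (150)", rae_150),
    ("root_relative_squared_error (150)", rrse_150),
    ("relative_absolute_error (19020)", rae_19020),
    ("root_relative_squared_error (19020)", rrse_19020),
]:
    print(f"{name}: {value:.4f}")
```

The recomputed ratios agree with the listed relative_absolute_error and root_relative_squared_error values up to rounding of the inputs, which are shown to only four decimal places.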
A random forest classifier. A random forest is a meta estimator that fits a number of decision tree classifiers on various sub-samples of the dataset and uses averaging to improve the predictive…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
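The two ideas in the random forest description, fitting trees on sub-samples and averaging their predictions, can be sketched without any library. This is only an illustration: it assumes the sub-samples are bootstrap draws with replacement, the tree outputs are made-up numbers rather than fitted models, and the function names are hypothetical.

```python
import random


def bootstrap_sample(rows, rng):
    """Draw len(rows) rows with replacement (one sub-sample per tree)."""
    return [rng.choice(rows) for _ in rows]


def average_probabilities(per_tree_probs):
    """Average the class-probability vectors predicted by the individual trees."""
    n_trees = len(per_tree_probs)
    n_classes = len(per_tree_probs[0])
    return [sum(p[c] for p in per_tree_probs) / n_trees for c in range(n_classes)]


rng = random.Random(0)
rows = list(range(10))                 # stand-in for dataset row indices
sample = bootstrap_sample(rows, rng)   # what a single tree would be trained on

# Hypothetical outputs of three already-fitted trees for one instance.
tree_outputs = [[0.9, 0.1], [0.6, 0.4], [0.3, 0.7]]
averaged = average_probabilities(tree_outputs)
predicted_class = max(range(len(averaged)), key=averaged.__getitem__)
print(averaged, predicted_class)
```

Although the third tree prefers class 1, the averaged probabilities favor class 0, which is the smoothing effect that averaging is meant to provide.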
The Employee Information dataset is designed to provide a comprehensive overview of the workforce within an organization. It aims to support human resource management, strategic planning, and…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100000 instances - 16 features - classes - 1498 missing values
0 likes - 0 downloads - 0 reach - area_under_roc_curve: 0.9953, f_measure: 0.9797, kappa: 0.9532, kb_relative_information_score: 0.6199, mean_absolute_error: 0.1959, mean_prior_absolute_error: 0.4435, weighted_recall: 0.9797, number_of_instances: 640, precision: 0.9798, predictive_accuracy: 0.9797, prior_entropy: 0.9027, relative_absolute_error: 0.4418, root_mean_prior_squared_error: 0.4658, root_mean_squared_error: 0.2397, root_relative_squared_error: 0.5147, unweighted_recall: 0.9785,
A random forest classifier. A random forest is a meta estimator that fits a number of decision tree classifiers on various sub-samples of the dataset and uses averaging to improve the predictive…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
0 likes - 0 downloads - 0 reach - area_under_roc_curve: 0.9971, f_measure: 0.9667, kappa: 0.95, kb_relative_information_score: 0.9257, mean_absolute_error: 0.0413, mean_prior_absolute_error: 0.4444, weighted_recall: 0.9667, number_of_instances: 150, precision: 0.9668, predictive_accuracy: 0.9667, prior_entropy: 1.585, relative_absolute_error: 0.093, root_mean_prior_squared_error: 0.4714, root_mean_squared_error: 0.1332, root_relative_squared_error: 0.2825, unweighted_recall: 0.9667,
0 likes - 0 downloads - 0 reach - area_under_roc_curve: 0.9971, f_measure: 0.9667, kappa: 0.95, kb_relative_information_score: 0.9257, mean_absolute_error: 0.0413, mean_prior_absolute_error: 0.4444, weighted_recall: 0.9667, number_of_instances: 150, precision: 0.9668, predictive_accuracy: 0.9667, prior_entropy: 1.585, relative_absolute_error: 0.093, root_mean_prior_squared_error: 0.4714, root_mean_squared_error: 0.1332, root_relative_squared_error: 0.2825, unweighted_recall: 0.9667,
0 likes - 0 downloads - 0 reach - area_under_roc_curve: 0.9971, f_measure: 0.9667, kappa: 0.95, kb_relative_information_score: 0.9257, mean_absolute_error: 0.0413, mean_prior_absolute_error: 0.4444, weighted_recall: 0.9667, number_of_instances: 150, precision: 0.9668, predictive_accuracy: 0.9667, prior_entropy: 1.585, relative_absolute_error: 0.093, root_mean_prior_squared_error: 0.4714, root_mean_squared_error: 0.1332, root_relative_squared_error: 0.2825, unweighted_recall: 0.9667,
Classifier implementing the k-nearest neighbors vote.
5 runs - 0 likes - 0 downloads - 0 reach - 0 impact
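A k-nearest neighbors vote can be illustrated in a few lines: find the k training points closest to the query and return the most common label among them. This is a minimal sketch that assumes Euclidean distance and an unweighted majority vote. The function name and toy data are hypothetical and are not taken from the listed flow.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Return the majority label among the k training points nearest to `query`.

    `train` is a list of (features, label) pairs; distance is Euclidean.
    """
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy data: two points labeled "a" near the origin, three labeled "b" near (1, 1).
toy_train = [
    ((0.0, 0.0), "a"),
    ((0.1, 0.2), "a"),
    ((1.0, 1.0), "b"),
    ((0.9, 1.1), "b"),
    ((1.2, 0.8), "b"),
]
prediction = knn_predict(toy_train, (0.2, 0.1), k=3)
print(prediction)
```

With k=3 the query's neighbors are the two "a" points and one "b" point, so the vote returns "a".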
0 likes - 0 downloads - 0 reach - area_under_roc_curve: 0.9016, f_measure: 0.8422, kappa: 0.6809, kb_relative_information_score: 0.6192, mean_absolute_error: 0.1942, mean_prior_absolute_error: 0.494, weighted_recall: 0.842, number_of_instances: 690, precision: 0.8425, predictive_accuracy: 0.842, prior_entropy: 0.9912, relative_absolute_error: 0.3932, root_mean_prior_squared_error: 0.497, root_mean_squared_error: 0.3439, root_relative_squared_error: 0.692, unweighted_recall: 0.8412,
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45956 - estimation_procedure : 10-fold Crossvalidation - target_feature : loan_status
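The 10-fold cross-validation estimation procedure named in the task entries splits the instances into ten folds and uses each fold once as the test set. The sketch below shows only the index bookkeeping. It assumes contiguous, unshuffled folds with no stratification, which may differ from what the platform actually does. The function name is hypothetical, and 26257 is simply borrowed from a dataset entry in this listing as an example size.

```python
def k_fold_indices(n_instances, k=10):
    """Yield (train_idx, test_idx) pairs; fold sizes differ by at most one."""
    base, extra = divmod(n_instances, k)
    start = 0
    for fold in range(k):
        size = base + (1 if fold < extra else 0)
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_instances))
        yield train_idx, test_idx
        start += size


folds = list(k_fold_indices(26257, k=10))
test_sizes = [len(test_idx) for _, test_idx in folds]
print(test_sizes)
```

Every instance lands in exactly one test fold, so each is used for testing once and for training nine times.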
Loan Status Dataset
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
26257 instances - 14 features - 3 classes - 4 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44331 - estimation_procedure : 10-fold Crossvalidation - target_feature : class
Financial dataset preprocessed for automl benchmark. Name = dataset_credit-approval, target = class
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
690 instances - 16 features - 0 classes - 0 missing values
0 likes - 0 downloads - 0 reach - area_under_roc_curve: 0.9366, f_measure: 0.88, kappa: 0.7338, kb_relative_information_score: 0.5972, mean_absolute_error: 0.1943, mean_prior_absolute_error: 0.456, weighted_recall: 0.882, number_of_instances: 19020, precision: 0.8817, predictive_accuracy: 0.882, prior_entropy: 0.9355, relative_absolute_error: 0.4262, root_mean_prior_squared_error: 0.4775, root_mean_squared_error: 0.2991, root_relative_squared_error: 0.6264, unweighted_recall: 0.8567,
A random forest classifier. A random forest is a meta estimator that fits a number of decision tree classifiers on various sub-samples of the dataset and uses averaging to improve the predictive…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45956 - estimation_procedure : 10-fold Crossvalidation - target_feature : loan_status
Loan Status Dataset
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
45000 instances - 14 features - 2 classes - 0 missing values
Loan Status Dataset
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
45000 instances - 14 features - 2 classes - 0 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45956 - estimation_procedure : 10-fold Crossvalidation - target_feature : Rating
# Credit Ratings of Big US Firms and their Financials ## Context A corporate credit rating expresses the ability of a firm to repay its debt to creditors. Credit rating agencies assess companies'…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2029 instances - 31 features - 10 classes - 0 missing values
corporate rating
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2029 instances - 31 features - 10 classes - 0 missing values
Corporate_Credit dataset
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2029 instances - 31 features - 10 classes - 0 missing values
# Credit Ratings of Big US Firms and their Financials
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2029 instances - 31 features - 10 classes - 0 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44331 - estimation_procedure : 10-fold Crossvalidation - target_feature : y
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44331 - estimation_procedure : 10-fold Crossvalidation - target_feature : credit_score
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44331 - estimation_procedure : 10-fold Crossvalidation - target_feature : approved_flag
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44331 - estimation_procedure : 10-fold Crossvalidation - target_feature : class
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44331 - estimation_procedure : 10-fold Crossvalidation - target_feature : application_accepted
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44331 - estimation_procedure : 10-fold Crossvalidation - target_feature : seriousdlqin2yrs
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44331 - estimation_procedure : 10-fold Crossvalidation - target_feature : class
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44331 - estimation_procedure : 10-fold Crossvalidation - target_feature : five_categories
Financial dataset for automl benchmark. Name = dataset_default_of_credit_card_clients, target = y
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
30000 instances - 24 features - 2 classes - 0 missing values
Financial dataset for automl benchmark. Name = dataset_credit_score, target = credit_score
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100000 instances - 28 features - 3 classes - 16967 missing values
Financial dataset for automl benchmark. Name = dataset_credit_risk_file_2, target = approved_flag
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
51336 instances - 62 features - 4 classes - 0 missing values
Financial dataset for automl benchmark. Name = dataset_credit-approval, target = class
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
690 instances - 16 features - 2 classes - 25 missing values
Financial dataset for automl benchmark. Name = dataset_analcatdata_creditscore, target = application_accepted
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100 instances - 7 features - 2 classes - 0 missing values
Financial dataset for automl benchmark. Name = dataset_credit, target = seriousdlqin2yrs
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16714 instances - 11 features - 2 classes - 0 missing values
Financial dataset for automl benchmark. Name = dataset_credit-g, target = class
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000 instances - 21 features - 2 classes - 0 missing values
Financial dataset for automl benchmark. Name = dataset_china, target = five_categories
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
27522 instances - 28 features - 5 classes - 8792 missing values
Financial dataset for automl benchmark. Name = dataset_credit-g, target = class
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000 instances - 21 features - 2 classes - 0 missing values
Financial dataset for automl benchmark. Name = dataset_china, target = five_categories
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
27522 instances - 28 features - 5 classes - 8792 missing values
Financial dataset for automl benchmark. Name = dataset_default_of_credit_card_clients, target = y
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
30000 instances - 24 features - 0 classes - 0 missing values
Financial dataset for automl benchmark. Name = dataset_credit_score, target = credit_score
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100000 instances - 28 features - 3 classes - 16967 missing values
Financial dataset for automl benchmark. Name = dataset_credit_risk_file_2, target = approved_flag
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
51336 instances - 62 features - 4 classes - 0 missing values
Financial dataset for automl benchmark. Name = dataset_credit-approval, target = class
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
690 instances - 16 features - 2 classes - 25 missing values
Financial dataset for automl benchmark. Name = dataset_analcatdata_creditscore, target = application_accepted
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100 instances - 7 features - 0 classes - 0 missing values
Financial dataset for automl benchmark. Name = dataset_credit, target = seriousdlqin2yrs
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16714 instances - 11 features - 0 classes - 0 missing values
Financial dataset for automl benchmark. Name = dataset_credit-g, target = class
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000 instances - 21 features - 2 classes - 0 missing values
Financial dataset for automl benchmark. Name = dataset_china, target = five_categories
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
27522 instances - 28 features - 5 classes - 8792 missing values
# Credit Ratings of Big US Firms and their Financials ## Context A corporate credit rating expresses the ability of a firm to repay its debt to creditors. Credit rating agencies assess companies'…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2029 instances - 31 features - 10 classes - 0 missing values
# Credit Ratings
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2029 instances - 31 features - 10 classes - 0 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45575 - estimation_procedure : 10-fold Crossvalidation - target_feature : action_taken
Dataset is uploaded from Kaggle. https://www.kaggle.com/code/ambarish/eda-home-mortgage-ny-with-feature-analysis/script
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
43965 instances - 30 features - 6 classes - 49605 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45956 - estimation_procedure : 10-fold Crossvalidation - target_feature : class
# 200+ Financial Indicators of US Stocks (2018) ## Context Algorithmic trading space is buzzing with new strategies. Companies have spent billions in infrastructure and R&D to be able to jump ahead of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4392 instances - 231 features - 2 classes - 93949 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45956 - estimation_procedure : 10-fold Crossvalidation - target_feature : loan_status
# Loan Approval Classification Dataset This dataset is a synthetic version inspired by the original Credit Risk dataset on Kaggle and enriched with additional variables based on historical loan…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
45000 instances - 24 features - 2 classes - 0 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45956 - estimation_procedure : 10-fold Crossvalidation - target_feature : rating
# Credit Ratings of Big US Firms and their Financials ## Context A corporate credit rating expresses the ability of a firm to repay its debt to creditors. Credit rating agencies assess companies'…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2029 instances - 42 features - 10 classes - 0 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44331 - estimation_procedure : 10-fold Crossvalidation - target_feature : credit_score
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44331 - estimation_procedure : 10-fold Crossvalidation - target_feature : approved_flag
Financial dataset for automl benchmark. Target column: credit_score
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100000 instances - 28 features - 3 classes - 60080 missing values
Financial dataset for automl benchmark. Target column: approved_flag
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
51336 instances - 62 features - 4 classes - 0 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45575 - estimation_procedure : 10-fold Crossvalidation - target_feature : seriousdlqin2yrs
Dataset is uploaded from Kaggle, see citation for the link.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
45000 instances - 11 features - 2 classes - 10132 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45575 - estimation_procedure : 10-fold Crossvalidation - target_feature : bad
Predict clients who default on their loan. Dataset is uploaded from Kaggle, see citation for the link.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5960 instances - 21 features - 2 classes - 4740 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45575 - estimation_procedure : 10-fold Crossvalidation - target_feature : risk_rating
Dataset is uploaded from Kaggle. https://www.kaggle.com/datasets/preethamgouda/financial-risk
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
15000 instances - 22 features - 3 classes - 13500 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44101 - estimation_procedure : 10-fold Crossvalidation - target_feature : Loan_Type
The International Bank for Reconstruction and Development (IBRD) loans are public and publicly guaranteed debt extended by the World Bank Group. IBRD loans are made to, or guaranteed by,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9217 instances - 7 features - 11 classes - 100 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44101 - estimation_procedure : 10-fold Crossvalidation - target_feature : Interest_Rate
A loan is when you receive the money from a financial institution in exchange for future repayment of the principal, plus interest. Financial institutions provide loans to the industries, corporates…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
32862 instances - 13 features - 3 classes - 24111 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44101 - estimation_procedure : 10-fold Crossvalidation - target_feature : Credit_Score
This dataset is designed to evaluate companies based on quality and valuation metrics.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
50000 instances - 22 features - 3 classes - 15950 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44101 - estimation_procedure : 10-fold Crossvalidation - target_feature : total_score
This dataset is designed to evaluate companies based on quality and valuation metrics.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
719 instances - 11 features - 4 classes - 0 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44101 - estimation_procedure : 10-fold Crossvalidation - target_feature : is_fraud
A fraud detection dataset for binary classification. The target variable is 'is_fraud', indicating whether a transaction is fraudulent.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5227 instances - 21 features - 2 classes - 0 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44101 - estimation_procedure : 10-fold Crossvalidation - target_feature : class
A dataset for binary classification of credit approval status. Features include customer demographics, financial attributes, and credit history. The target variable class indicates whether the credit…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000 instances - 51 features - 2 classes - 0 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44101 - estimation_procedure : 10-fold Crossvalidation - target_feature : class
A dataset for binary classification of credit approval status. Features include customer demographics, financial attributes, and credit history. The target variable class indicates whether the credit…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000 instances - 51 features - 2 classes - 0 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 44101 - estimation_procedure : 10-fold Crossvalidation - target_feature : class
A dataset for binary classification of credit approval status. Features include customer demographics, financial attributes, and credit history. The target variable class indicates whether the credit…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000 instances - 51 features - 0 classes - 0 missing values
senior datasets
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2069 instances - 9 features - classes - 0 missing values
loan approval
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
614 instances - 13 features - classes - 149 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45575 - estimation_procedure : 10-fold Crossvalidation - target_feature : bad
Predict clients who default on their loan. Dataset is uploaded from Kaggle, see citation for the link.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5960 instances - 13 features - 2 classes - 5271 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45575 - estimation_procedure : 10-fold Crossvalidation - target_feature : creditability
Dataset is uploaded from Kaggle. https://www.kaggle.com/datasets/mpwolke/cusersmarildownloadsgermancsv/data
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000 instances - 21 features - 2 classes - 0 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45575 - estimation_procedure : 10-fold Crossvalidation - target_feature : risk_rating
Dataset is uploaded from Kaggle. https://www.kaggle.com/datasets/preethamgouda/financial-risk
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
15000 instances - 20 features - 3 classes - 13500 missing values
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
uploader_id : 45575 - estimation_procedure : 10-fold Crossvalidation - target_feature : action_taken