Data
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
Gross Domestic Product (GDP) is the monetary value of all finished goods and services made within a country during a specific period. GDP provides an economic snapshot of a country, used to estimate…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
260 instances - 32 features - classes - 1074 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
31 instances - 31 features - 0 classes - 33 missing values
The dataset contains transactions made by credit cards in September 2013 by European cardholders. This dataset presents transactions that occurred over two days, with 492 frauds out of 284,807…
357 runs - 0 likes - 0 downloads - 0 reach - 0 impact
284807 instances - 31 features - 2 classes - 0 missing values
The current dataset was adapted to ARFF format from the UCI version. Sample code IDs were removed. Note that there is also a related Breast Cancer Wisconsin (Original) Data Set with a different set of…
226893 runs - 0 likes - 0 downloads - 0 reach - 0 impact
569 instances - 31 features - 2 classes - 0 missing values
The YouTube personality dataset consists of a collection of behavioral features, speech transcriptions, and personality impression scores for a set of 404 YouTube vloggers that explicitly show…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
404 instances - 31 features - classes - 0 missing values
For testing algorithms
74 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14240 instances - 31 features - 2 classes - 0 missing values
Subsampling of the dataset PhishingWebsites (4534) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 31 features - 2 classes - 0 missing values
Subsampling of the dataset PhishingWebsites (4534) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 31 features - 2 classes - 0 missing values
One of the biggest challenges of an auto dealership purchasing a used car at an auto auction is the risk that the vehicle might have serious issues that prevent it from being sold to customers. The…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
67212 instances - 31 features - 0 classes - 0 missing values
Contaminant-detection-in-packaged-cocoa-hazelnut-spread-jars-using-Microwaves-Sensing-and-Machine-Learning-9.0GHz(Urbinati) ---------------- This dataset is part of a series of five different datasets…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2400 instances - 31 features - 0 classes - 0 missing values
The YouTube personality dataset consists of a collection of behavioral features, speech transcriptions, and personality impression scores for a set of 404 YouTube vloggers that explicitly show…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
404 instances - 31 features - classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2465 instances - 31 features - classes - 0 missing values
Subsampling of the dataset PhishingWebsites (4534) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 31 features - 2 classes - 0 missing values
Subsampling of the dataset PhishingWebsites (4534) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 31 features - 2 classes - 0 missing values
Subsampling of the dataset PhishingWebsites (4534) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 31 features - 2 classes - 0 missing values
This is the dataset from the Kaggle Higgs Boson Machine Learning Challenge 2014. The data was downloaded from the [CERN website](http://opendata.cern.ch/record/328), which also hosts the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
818238 instances - 31 features - 2 classes - 0 missing values
This is the dataset from the Kaggle Higgs Boson Machine Learning Challenge 2014. The data was downloaded from the [CERN website](http://opendata.cern.ch/record/328), which also hosts the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
818238 instances - 31 features - 2 classes - 5168486 missing values
Contaminant-detection-in-packaged-cocoa-hazelnut-spread-jars-using-Microwaves-Sensing-and-Machine-Learning-9.5GHz(Urbinati) ---------------- This dataset is part of a series of five different datasets…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2400 instances - 31 features - 0 classes - 0 missing values
Contaminant-detection-in-packaged-cocoa-hazelnut-spread-jars-using-Microwaves-Sensing-and-Machine-Learning-10.0GHz(Urbinati) ---------------- This dataset is part of a series of five different…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2400 instances - 31 features - 0 classes - 0 missing values
Contaminant-detection-in-packaged-cocoa-hazelnut-spread-jars-using-Microwaves-Sensing-and-Machine-Learning-10.5GHz(Urbinati) ---------------- This dataset is part of a series of five different…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2400 instances - 31 features - 0 classes - 0 missing values
Contaminant-detection-in-packaged-cocoa-hazelnut-spread-jars-using-Microwaves-Sensing-and-Machine-Learning-11.0GHz(Urbinati) ---------------- This dataset is part of a series of five different…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2400 instances - 31 features - 0 classes - 0 missing values
This is the dataset from the Kaggle Higgs Boson Machine Learning Challenge 2014. The data was downloaded from the [CERN website](http://opendata.cern.ch/record/328), which also hosts the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
800000 instances - 31 features - 2 classes - 5053446 missing values
Dataset includes date and time, opponent, score, rushing statistics, passing statistics, turnovers, Nebraska penalties, point spread, and weather information. All games are included. Penalty data is…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
733 instances - 31 features - classes - 1009 missing values
A modified version of the European Soccer Database by Hugo Mathien (https://www.kaggle.com/hugomathien). New players' performance indicators have been created from the original dataset. Data features:…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
25979 instances - 31 features - classes - 35756 missing values
It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged for items that they did not purchase.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
284807 instances - 31 features - 0 classes - 0 missing values
Context The official Goodreads API limits retrievable data, so I decided to scrape the actual HTTP pages and grab additional details on each book. Content Books are scraped from a list titles the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
52199 instances - 31 features - classes - 285951 missing values
Context This is home value data for the hot Nashville market. Content There are 56,000+ rows altogether. However, I'm missing home detail data for about half. So if anyone wants to track that down…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
56636 instances - 31 features - classes - 648773 missing values
Context This dataset provides a list of lyrics from 1950 to 2019, describing music metadata such as sadness, danceability, loudness, acousticness, etc. Authors also provide some information as lyrics which…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
28372 instances - 31 features - classes - 13 missing values
Context It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged for items that they did not purchase. Content The…
0 runs - 1 likes - 3 downloads - 4 reach - 9 impact
284807 instances - 31 features - 2 classes - 0 missing values
Testing dataset
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
134731 instances - 31 features - 2 classes - 0 missing values
Context A corporate credit rating expresses the ability of a firm to repay its debt to creditors. Credit rating agencies are the entities responsible for making the assessment and giving a verdict. When a…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2029 instances - 31 features - classes - 0 missing values
Context A corporate credit rating expresses the ability of a firm to repay its debt to creditors. Credit rating agencies are the entities responsible for making the assessment and giving a verdict. When a big corporation from the US or anywhere in the world wants to issue a new bond, it hires a credit agency to make an assessment so that investors can know how trustworthy the company is. The assessment is based mainly on the financial indicators that come from the balance sheet. Some of the most important agencies in the world are Moody's, Fitch, and Standard and Poor's. Content A list of 2029 credit ratings issued by major agencies such as Standard and Poor's to big US firms (traded on NYSE or Nasdaq) from 2010 to 2016. There are 30 features for every company, of which 25 are financial indicators. They can be divided into: Liquidity Measurement Ratios: currentRatio, quickRatio, cashRatio, daysOfSalesOutstanding. Profitability Indicator Ratios: grossProfitMargin, operatingProfitMargin, pretaxProfitMargin, netProfitMargin, effectiveTaxRate, returnOnAssets, returnOnEquity, returnOnCapitalEmployed. Debt Ratios: debtRatio, debtEquityRatio. Operating Performance Ratios: assetTurnover. Cash Flow Indicator Ratios: operatingCashFlowPerShare, freeCashFlowPerShare, cashPerShare, operatingCashFlowSalesRatio, freeCashFlowOperatingCashFlowRatio. For more information about financial indicators visit https://financialmodelingprep.com/market-indexes-major-markets. The additional features are Name, Symbol (for trading), Rating Agency Name, Date, and Sector. The dataset is unbalanced; here is the frequency of ratings: AAA: 7, AA: 89, A: 398, BBB: 671, BB: 490, B: 302, CCC: 64, CC: 5, C: 2, D: 1. Acknowledgements This dataset was possible thanks to financialmodelingprep and opendatasoft, the sources of the data. To see how the data was integrated and reshaped, check here. Inspiration Is it possible to forecast the rating an agency will give to a company based on its financials…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2029 instances - 31 features - classes - 0 missing values
Paulo Cortez, University of Minho, Guimaraes, Portugal, http://www3.dsi.uminho.pt/pcortez The dataset was obtained from the UCI Repository. This data addresses student achievement in secondary…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
649 instances - 31 features - 0 classes - 0 missing values
Context It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged for items that they did not purchase. Content The…
0 runs - 1 likes - 9 downloads - 10 reach - 8 impact
284807 instances - 31 features - 0 classes - 0 missing values
Anonymized data of dating profiles from OkCupid
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
59946 instances - 31 features - 0 classes - 273249 missing values
Context About the Rio-Niterói Bridge, according to Wikipedia: Presidente Costa e Silva Bridge, popularly known as Rio-Niterói Bridge, is a bridge that crosses Guanabara Bay, in the state of Rio de…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9367 instances - 31 features - classes - 44380 missing values
ABOUT This dataset identifies areas within a city where drivers are experiencing difficulty searching for parking. Cities can use this data to identify problem areas, adjust signage, and more. Only…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4750 instances - 31 features - classes - 1002 missing values
This data addresses student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social and school related features, and it was…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
649 instances - 31 features - 0 classes - 0 missing values
Source: Rami Mustafa A Mohammad (University of Huddersfield, rami.mohammad '@' hud.ac.uk, rami.mustafa.a '@' gmail.com) Lee McCluskey (University of Huddersfield, t.l.mccluskey '@' hud.ac.uk) Fadi…
51677 runs - 1 likes - 29 downloads - 30 reach - 29 impact
11055 instances - 31 features - 2 classes - 0 missing values
Thyroid disease records supplied by the Garavan Institute and J. Ross Quinlan, New South Wales Institute, Sydney, Australia. 1987. hypothyroid, primary hypothyroid, compensated…
883 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3772 instances - 30 features - 4 classes - 6064 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
756 runs - 0 likes - 0 downloads - 0 reach - 0 impact
121 instances - 30 features - 2 classes - 0 missing values
No data.
718 runs - 0 likes - 0 downloads - 0 reach - 0 impact
63 instances - 30 features - 2 classes - 0 missing values
No data.
794 runs - 0 likes - 0 downloads - 0 reach - 0 impact
107 instances - 30 features - 2 classes - 0 missing values
No data.
726 runs - 0 likes - 0 downloads - 0 reach - 0 impact
36 instances - 30 features - 2 classes - 0 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
789 runs - 0 likes - 0 downloads - 0 reach - 0 impact
101 instances - 30 features - 2 classes - 0 missing values
No data.
253 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1076790 instances - 30 features - 2 classes - 7275 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Water Quality dataset (Dzeroski et al. 2000) has 14 target attributes that refer to the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1060 instances - 30 features - classes - 0 missing values
Re-upload of the dataset as it is present in the Penn ML Benchmark (https://github.com/EpistasisLab/penn-ml-benchmarks/tree/master/datasets/classification/fars). It's a dataset on traffic accidents,…
1 runs - 0 likes - 4 downloads - 4 reach - 23 impact
100968 instances - 30 features - 8 classes - 0 missing values
Finding which features are most important to job profitability
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14480 instances - 30 features - 0 classes - 0 missing values
This dataset is for classification tasks, and has both continuous and categorical variables.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100959 instances - 30 features - 0 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2465 instances - 30 features - classes - 0 missing values
Sick dataset from the OpenML-CC18 with all textual binary variables label encoded.
1 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3772 instances - 30 features - 2 classes - 0 missing values
The sick dataset from the OpenML-CC18 with all categorical data label encoded so all data is numeric.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3772 instances - 30 features - classes - 0 missing values
Embedding of atoms for the HIV inhibitors dataset
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1069964 instances - 30 features - classes - 0 missing values
Embedding of molecule bonds in the HIV inhibitors dataset
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1151940 instances - 30 features - classes - 0 missing values
This dataset is based on dipam7/student-grade-prediction Dataset General information This data addresses student achievement in secondary education of two Portuguese schools. The data attributes…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
395 instances - 30 features - 0 classes - 0 missing values
Context It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged for items that they did not purchase. Content The…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
284807 instances - 30 features - classes - 0 missing values
This dataset is the resulting cleaned version of Panos Kostakos's Risk of being drawn into online sex work dataset. Context This database was used in the paper: "Covert online ethnography and machine…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
28831 instances - 30 features - classes - 54259 missing values
Attribute information: ``` sick, negative. | classes age: continuous. sex: M, F. on thyroxine: f, t. query on thyroxine: f, t. on antithyroid medication: f, t. sick: f, t. pregnant: f, t. thyroid…
19950 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3772 instances - 30 features - 2 classes - 6064 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
737 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3772 instances - 30 features - 2 classes - 6064 missing values
Consumer Price Indices (CPI) measure changes over time in the general level of prices of goods and services that households acquire for the purpose of consumption. CPI numbers are widely used as a…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
267 instances - 30 features - classes - 208 missing values
allbp-pmlb
31 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3772 instances - 30 features - 3 classes - 0 missing values
allrep-pmlb
31 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3772 instances - 30 features - 4 classes - 0 missing values
dis-pmlb
31 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3772 instances - 30 features - 2 classes - 0 missing values
No data.
73 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 30 features - 2 classes - 0 missing values
No data.
65 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 30 features - 4 classes - 0 missing values
ARFF version of UCI dataset 'flags'. Creators: Collected primarily from the "Collins Gem Guide to Flags": Collins Publishers (1986). Donor: Richard S. Forsyth. Date 5/15/1990 This data file contains…
108 runs - 0 likes - 0 downloads - 0 reach - 0 impact
194 instances - 29 features - 8 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
140 runs - 0 likes - 0 downloads - 0 reach - 0 impact
194 instances - 29 features - 2 classes - 0 missing values
Source: 1. Olcay KURSUN, PhD., Istanbul University, Department of Computer Engineering, 34320, Istanbul, Turkey Phone: +90 (212) 473 7070 - 17827 Email: okursun '@' istanbul.edu.tr 2. Betul ERDOGDU…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1039 instances - 29 features - classes - 0 missing values
### Attribute Information * The first column is the class label (1 for signal, 0 for background) * 21 low-level features (kinematic properties): lepton pT, lepton eta, lepton phi, missing energy…
14393 runs - 0 likes - 0 downloads - 0 reach - 0 impact
98050 instances - 29 features - 2 classes - 9 missing values
Data Set Information: The data has been produced using Monte Carlo simulations. The first 21 features (columns 2-22) are kinematic properties measured by the particle detectors in the accelerator. The…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
98050 instances - 29 features - 0 classes - 9 missing values
Source: 1. Muhammad Naeem, Centre of Research in Data Engineering(CORDE) & Department of Computer Science, MAJU Islamabad Pakistan(naeems.naeem '@' gmail.com). 2. Sohail Asghar, Director/Associate…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
65554 instances - 29 features - classes - 0 missing values
This is a classification problem to distinguish between a signal process which produces Higgs bosons and a background process which does not. ## Information The data has been produced using Monte…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
11000000 instances - 29 features - 2 classes - 0 missing values
Context Climate change is forcing cities to reimagine their transportation infrastructure. Shared mobility concepts, such as car sharing, bike sharing or scooter sharing, are becoming more and more popular.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2922 instances - 29 features - classes - 51510 missing values
Context This dataset deals with pollution in the U.S. Pollution in the U.S. has been well documented by the U.S. EPA but it is a pain to download all the data and arrange them in a format that…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1746661 instances - 29 features - classes - 1746230 missing values
Context At Shadow Robot, we are leaders in robotic grasping and manipulation. As part of our Smart Grasping System development, we're developing different algorithms using machine learning. This first…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
992641 instances - 29 features - classes - 0 missing values
See [https://github.com/slds-lmu/paper_2023_ci_for_ge](https://github.com/slds-lmu/paper_2023_ci_for_ge) for a description.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5100000 instances - 29 features - 0 classes - 0 missing values
The datasets provided include the player data for the Career Mode from FIFA 22. It only includes male football players with known wage. The goal is to predict the wage of the player based on his…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
19178 instances - 29 features - 0 classes - 0 missing values
This is a smaller version of the original dataset, containing 1M rows. ### Attribute Information * The first column is the class label (1 for signal, 0 for background) * 21 low-level features…
0 runs - 0 likes - 0 downloads - 0 reach - 2 impact
1000000 instances - 29 features - 2 classes - 0 missing values
We use the following representation to collect the dataset: age - age; bp - blood pressure; sg - specific gravity; al - albumin; su - sugar; rbc - red blood cells; pc - pus cell; pcc - pus cell clumps; ba -…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
250 instances - 28 features - classes - 0 missing values
The task consists of Learning Quantitative Structure Activity Relationships (QSARs). The Inhibition of Dihydrofolate Reductase by Pyrimidines. The data are described in: King, Ross D., Muggleton,…
6 runs - 0 likes - 0 downloads - 0 reach - 0 impact
74 instances - 28 features - 0 classes - 0 missing values
Survey to know if people self-identify as Midwesterners.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2778 instances - 28 features - 10 classes - 1737 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2465 instances - 28 features - classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2465 instances - 28 features - classes - 0 missing values
Survey to know if people self-identify as Midwesterners.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2778 instances - 28 features - 10 classes - 1737 missing values
Survey to know if people self-identify as Midwesterners.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2494 instances - 28 features - 9 classes - 99 missing values
Context This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100000 instances - 28 features - classes - 1985 missing values
Subsampling of the dataset helena (41169) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 28 features - 100 classes - 0 missing values
Subsampling of the dataset helena (41169) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 28 features - 100 classes - 0 missing values
Subsampling of the dataset helena (41169) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 28 features - 100 classes - 0 missing values
Subsampling of the dataset helena (41169) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 28 features - 100 classes - 0 missing values
The midwest survey dataset contains individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2778 instances - 28 features - 10 classes - 1744 missing values
The midwest survey dataset contains individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2778 instances - 28 features - 10 classes - 1744 missing values
Subsampling of the dataset steel-plates-fault (40982) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1941 instances - 28 features - 7 classes - 0 missing values
Subsampling of the dataset steel-plates-fault (40982) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1941 instances - 28 features - 7 classes - 0 missing values
Subsampling of the dataset steel-plates-fault (40982) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1941 instances - 28 features - 7 classes - 0 missing values
Subsampling of the dataset steel-plates-fault (40982) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1941 instances - 28 features - 7 classes - 0 missing values