OpenML
Filter results by:
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-verylarge.html#pathfinder) - Number of nodes: 109 - Number of arcs: 195 - Number of parameters: 72079…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 109 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-verylarge.html#pathfinder) - Number of nodes: 109 - Number of arcs: 195 - Number of parameters: 72079…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 109 features - classes - 0 missing values
Data are collected from Kickstarter Platform You'll find most useful data for project analysis. Columns are self explanatory except: usd_pledged: conversion in US dollars of the pledged column…
0 runs0 likes0 downloads0 reach0 impact
331675 instances - 14 features - classes - 210 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
https://www.kaggle.com/harlfoxem/ This dataset contains house sale prices for King County, which includes Seattle. It includes homes sold between May 2014 and May 2015. It contains 19 house features…
0 runs0 likes0 downloads0 reach0 impact
21613 instances - 20 features - classes - 0 missing values
General Description 2015-current: greater than $200.00. The Commission categorizes contributions from individuals using the calendar year-to-date amount for political action committee (PAC) and party…
0 runs0 likes0 downloads0 reach0 impact
3348209 instances - 21 features - 0 classes - 10786577 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 3 classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 3 classes - 0 missing values
The BoT-IoT dataset was created by designing a realistic network environment in the Cyber Range Lab of The center of UNSW Canberra Cyber. The environment incorporates a combination of normal and…
0 runs0 likes0 downloads0 reach0 impact
3668522 instances - 45 features - 0 classes - 0 missing values
Wine data gathered by https://www.kaggle.com/zynicideThe data was scraped from WineEnthusiast during the week of June 15th, 2017. The code for the scraper can be found at…
0 runs0 likes0 downloads0 reach0 impact
150930 instances - 10 features - classes - 174477 missing values
iris data with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach8 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 3 classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 3 classes - 0 missing values
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs0 likes0 downloads0 reach0 impact
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs0 likes0 downloads0 reach0 impact
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 3 classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 3 classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 3 classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 3 classes - 0 missing values
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs0 likes0 downloads0 reach0 impact
Daily air quality measurements in New York, May to September 1973. This data is taken from R.
0 runs0 likes0 downloads0 reach0 impact
Chocolate Bar Ratings. Expert ratings of over 1,700 chocolate bars. Each chocolate is evaluated from a combination of both objective qualities and subjective interpretation. A rating here only…
0 runs0 likes0 downloads0 reach0 impact
1795 instances - 9 features - 42 classes - 1 missing values
Regroups information for about 7800 different US colleges. Including geographical information, stats about the population attending and post graduation career earnings.
0 runs0 likes0 downloads0 reach0 impact
This dataset reflects incidents of crime in the City of Los Angeles dating back to 2010. This data is transcribed from original crime reports that are typed on paper and therefore there may be some…
0 runs0 likes0 downloads0 reach0 impact
Public procurement data for the European Economic Area, Switzerland, and the Macedonia. 2015
0 runs0 likes0 downloads0 reach0 impact
kaggle_santander_p
0 runs0 likes0 downloads0 reach0 impact
200000 instances - 203 features - classes - 0 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 3 features - 0 classes - 0 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 3 features - 0 classes - 0 missing values
The Inpatient Utilization and Payment Public Use File (Inpatient PUF) provides information on inpatient discharges for Medicare fee-for-service beneficiaries. The Inpatient PUF includes information on…
0 runs0 likes0 downloads0 reach0 impact
163065 instances - 12 features - 0 classes - 0 missing values
The Inpatient Utilization and Payment Public Use File (Inpatient PUF) provides information on inpatient discharges for Medicare fee-for-service beneficiaries. The Inpatient PUF includes information on…
0 runs0 likes0 downloads0 reach0 impact
163065 instances - 12 features - 0 classes - 0 missing values
This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle owner or…
0 runs0 likes0 downloads0 reach0 impact
1578154 instances - 43 features - 4 classes - 8006541 missing values
Regroups information for about 7800 different US colleges. Including geographical information, stats about the population attending and post graduation career earnings.
0 runs0 likes0 downloads0 reach0 impact
Estimated article influence scores in 2015
0 runs0 likes0 downloads0 reach0 impact
3615 instances - 7 features - 3169 classes - 48 missing values
Annual salary information including gross pay and overtime pay for all active, permanent employees of Montgomery County, MD paid in calendar year 2016. This information will be published annually each…
0 runs0 likes0 downloads0 reach0 impact
9228 instances - 13 features - 0 classes - 11169 missing values
Public procurement data for the European Economic Area, Switzerland, and the Macedonia. 2015
0 runs0 likes0 downloads0 reach0 impact
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 3 features - 0 classes - 0 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 3 features - 0 classes - 0 missing values
Public procurement data for the European Economic Area, Switzerland, and the Macedonia. 2015
0 runs0 likes0 downloads0 reach0 impact
uci
0 runs0 likes0 downloads0 reach0 impact
30000 instances - 27 features - classes - 0 missing values
uci
0 runs0 likes0 downloads0 reach0 impact
101766 instances - 52 features - classes - 192849 missing values
hmeq_p,BAD,binary
0 runs0 likes0 downloads0 reach0 impact
5960 instances - 15 features - classes - 5271 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
Employee remuneration and expenses (earning over 75,000CAD per year). This data set includes remuneration and expenses from employees earning over 75,000CAD per year. Attributes: NAME: Name of…
0 runs0 likes0 downloads0 reach0 impact
classification
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - classes - 0 missing values
iris with ignored features Sepal.Width and Petal.Length
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 3 classes - 0 missing values
Israeli lottery
0 runs0 likes0 downloads0 reach0 impact
1153 instances - 11 features - classes - 0 missing values
uci adult partitioned
0 runs0 likes0 downloads0 reach0 impact
48844 instances - 17 features - classes - 6495 missing values
Auto MPG (6 variables) dataset The data concerns city-cycle fuel consumption in miles per gallon (Mpg), to be predicted in terms of 1 multivalued discrete and 5 continuous attributes (two multivalued…
0 runs0 likes0 downloads0 reach0 impact
392 instances - 6 features - 0 classes - 0 missing values
At Santander our mission is to help people and businesses prosper. We are always looking for ways to help our customers understand their financial health and identify which products and services might…
0 runs0 likes0 downloads0 reach0 impact
200000 instances - 202 features - 2 classes - 0 missing values
Multiclass from binary: Expanding one-vs-all, one-vs-one and ECOC-based approaches. Dataset taken from LIBSVM: https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/multiclass.html In this dataset…
0 runs0 likes0 downloads0 reach8 impact
108000 instances - 129 features - 1000 classes - 0 missing values
Dataset includes construction cost, sale prices, project variables, and economic variables corresponding to real estate single-family residential apartments in Tehran, Iran. Totally 105: 8 project…
0 runs0 likes0 downloads0 reach0 impact
372 instances - 109 features - 0 classes - 0 missing values
This file contains the Economic data information of USA from 01/04/1980 to 02/04/2000 on a weekly basis. From given features, the goal is to predict 1 Month CD Rate. 1. 1Y-CMaturityRate real [77.055,…
0 runs0 likes0 downloads0 reach0 impact
1049 instances - 16 features - 0 classes - 0 missing values
This file contains the weather information of Ankara from 01/01/1994 to 28/05/1998. From given features, the goal is to predict the mean temperature. 1. Max_temperature real [23.0, 100.0] 2.…
0 runs0 likes0 downloads0 reach0 impact
321 instances - 10 features - 0 classes - 0 missing values
Electrical-Maintenance data set This problem consists of four input variables and the available data set is comprised of a representative number of well distributed examples. In this case, the…
0 runs0 likes0 downloads0 reach0 impact
1056 instances - 5 features - 0 classes - 0 missing values
Forest Fires Data Set This is a difficult regression task, where the aim is to predict the burned area of forest fires, in the northeast region of Portugal, by using meteorological and other data.…
0 runs0 likes0 downloads0 reach0 impact
517 instances - 13 features - 0 classes - 0 missing values
This data set was originally a univariate time record of a single observed quantity, recorded from a Far-Infrared-Laser in a chaotic state. The original set 1000 points has been adapted for regression…
0 runs0 likes0 downloads0 reach0 impact
993 instances - 5 features - 0 classes - 0 missing values
``**Author**: Cigdem Inan Aci","Mehmet Fatih Akay ### Data Set Information All simulations have done under the software named OPNET Modeler. Message passing is used as the communication mechanism in…
0 runs0 likes0 downloads0 reach0 impact
640 instances - 10 features - classes - 0 missing values
This file contains the weather information of Izmir from 01/01/1994 to 31/12/1997. From given features, the goal is to predict the mean temperature. 1. Max_temperature real[36.7,105.0] 2.…
0 runs0 likes0 downloads0 reach0 impact
1461 instances - 10 features - 0 classes - 0 missing values
Prediction of residuary resistance of sailing yachts at the initial design stage is of a great value for evaluating the ship’s performance and for estimating the required propulsive…
0 runs0 likes0 downloads0 reach0 impact
308 instances - 7 features - 0 classes - 0 missing values
Conventional and Social Media Movies (CSM) - Dataset 2014 and 2015 Data Set 12 features categorized as conventional and social media features. Both conventional features, collected from movies…
0 runs0 likes0 downloads0 reach0 impact
231 instances - 13 features - classes - 46 missing values
Context It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged for items that they did not purchase. Content The…
0 runs1 likes3 downloads4 reach9 impact
284807 instances - 31 features - 2 classes - 0 missing values
Test
0 runs0 likes0 downloads0 reach0 impact
958 instances - 10 features - classes - 0 missing values
Test
0 runs0 likes0 downloads0 reach0 impact
2 instances - 1 features - classes - 0 missing values
qsqs
0 runs0 likes0 downloads0 reach0 impact
2 instances - 1 features - classes - 0 missing values
swdw
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
werr
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
efef fdfef
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
zaxa xcdc
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
dedfef
0 runs0 likes0 downloads0 reach0 impact
2 instances - 1 features - classes - 0 missing values
Uploead test
0 runs0 likes0 downloads0 reach0 impact
958 instances - 10 features - classes - 0 missing values
ddfef fvdf
0 runs0 likes0 downloads0 reach0 impact
8 instances - 1 features - classes - 0 missing values
wdwd cd
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
scs
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
wdede
0 runs0 likes0 downloads0 reach0 impact
2 instances - 2 features - classes - 0 missing values
wdwd
0 runs0 likes0 downloads0 reach0 impact
2 instances - 1 features - classes - 0 missing values
This data represents crime reported to the Seattle Police Department (SPD). Each row contains the record of a unique event where at least one criminal offense was reported by a member of the community…
0 runs0 likes0 downloads0 reach0 impact
52358 instances - 8 features - 0 classes - 650 missing values
ede wey
0 runs0 likes0 downloads0 reach0 impact
589 instances - 2909 features - classes - 0 missing values
eevrr der
0 runs0 likes0 downloads0 reach0 impact
1557 instances - 5629 features - classes - 0 missing values
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs0 likes0 downloads0 reach0 impact
2778 instances - 28 features - 10 classes - 1744 missing values
The midwest survey dataset contain individual responses from surveys about regional identification conducted for FiveThirtyEight by SurveyMonkey.
0 runs0 likes0 downloads0 reach0 impact
2778 instances - 28 features - 10 classes - 1744 missing values
sqs efrf
0 runs0 likes0 downloads0 reach0 impact
4 instances - 5 features - classes - 0 missing values
b gtrg
0 runs0 likes0 downloads0 reach0 impact
4 instances - 7 features - classes - 0 missing values
Airlines Dataset Inspired in the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure. For this…
0 runs0 likes0 downloads0 reach0 impact
26969 instances - 8 features - 2 classes - 0 missing values
ssc vdv
0 runs0 likes0 downloads0 reach0 impact
1556 instances - 2 features - classes - 0 missing values
swd dced
0 runs0 likes0 downloads0 reach0 impact
589 instances - 3 features - classes - 0 missing values
sdsw frfr
0 runs0 likes0 downloads0 reach0 impact
1556 instances - 3 features - classes - 0 missing values
efe rgrg
0 runs0 likes0 downloads0 reach0 impact
efef ffrf
0 runs0 likes0 downloads0 reach0 impact
9 instances - 3 features - classes - 0 missing values
Water stress dataset for Indian variety of wheat crop: The data consist of a file system-based data of Raj 3765 variety of wheat. There are twenty-four chlorophyll fluorescence images captured every…
0 runs0 likes0 downloads0 reach0 impact
1188 instances - 23 features - 0 classes - 0 missing values
Identify jets of particles from the LHC, created for the study of ultra low latency inference with hls4ml. Use 16 high level features to identify the 5 jet classes: quark (q), gluon (g), W boson (w),…
0 runs0 likes0 downloads0 reach0 impact
830000 instances - 17 features - 5 classes - 0 missing values