Data
Filter results by:
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#mildew) - Number of nodes: 35 - Number of arcs: 46 - Number of parameters: 540150 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 35 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#mildew) - Number of nodes: 35 - Number of arcs: 46 - Number of parameters: 540150 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 35 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#mildew) - Number of nodes: 35 - Number of arcs: 46 - Number of parameters: 540150 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 35 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#mildew) - Number of nodes: 35 - Number of arcs: 46 - Number of parameters: 540150 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 35 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#mildew) - Number of nodes: 35 - Number of arcs: 46 - Number of parameters: 540150 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 35 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#mildew) - Number of nodes: 35 - Number of arcs: 46 - Number of parameters: 540150 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 35 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#mildew) - Number of nodes: 35 - Number of arcs: 46 - Number of parameters: 540150 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 35 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#mildew) - Number of nodes: 35 - Number of arcs: 46 - Number of parameters: 540150 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 35 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#mildew) - Number of nodes: 35 - Number of arcs: 46 - Number of parameters: 540150 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 35 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#mildew) - Number of nodes: 35 - Number of arcs: 46 - Number of parameters: 540150 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 35 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#insurance) - Number of nodes: 27 - Number of arcs: 52 - Number of parameters: 1008 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 27 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#insurance) - Number of nodes: 27 - Number of arcs: 52 - Number of parameters: 1008 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 27 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#insurance) - Number of nodes: 27 - Number of arcs: 52 - Number of parameters: 1008 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 27 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#insurance) - Number of nodes: 27 - Number of arcs: 52 - Number of parameters: 1008 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 27 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#insurance) - Number of nodes: 27 - Number of arcs: 52 - Number of parameters: 1008 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 27 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#insurance) - Number of nodes: 27 - Number of arcs: 52 - Number of parameters: 1008 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 27 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#insurance) - Number of nodes: 27 - Number of arcs: 52 - Number of parameters: 1008 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 27 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#insurance) - Number of nodes: 27 - Number of arcs: 52 - Number of parameters: 1008 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 27 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#insurance) - Number of nodes: 27 - Number of arcs: 52 - Number of parameters: 1008 -…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 27 features - classes - 0 missing values
This dataset consists of 6,642 question/answer pairs. The questions are supposed to be answerable by Freebase, a large knowledge graph. The questions are mostly centered around a single named entity.…
0 runs0 likes0 downloads0 reach0 impact
5810 instances - 3 features - classes - 0 missing values
Classify a chess game based on the position of the white king, the white rook and the black king.
1777 runs0 likes0 downloads0 reach0 impact
28056 instances - 7 features - 18 classes - 0 missing values
Citation Request: This primary tumor domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
1261 runs0 likes0 downloads0 reach0 impact
339 instances - 18 features - 21 classes - 225 missing values
# Space Shuttle Autolanding Domain NASA: Mr. Roger Burke's autolander design team ##### Past Usage: (several, it appears) Example: Michie,D. (1988). The Fifth Generation's Unbridged Gap. In Rolf…
1466 runs0 likes0 downloads0 reach0 impact
15 instances - 7 features - 2 classes - 26 missing values
1. Title: Lung Cancer Data 2. Source Information: - Data was published in : Hong, Z.Q. and Yang, J.Y. "Optimal Discriminant Plane for a Small Number of Samples and Design Method of Classifier on the…
1238 runs0 likes0 downloads0 reach0 impact
32 instances - 57 features - 3 classes - 5 missing values
Compilation of promoters with known transcriptional start points for E. coli genes. The task is to recognize promoters in strings that represent nucleotides (one of A, G, T, or C). A promoter is a…
138 runs0 likes0 downloads0 reach0 impact
106 instances - 58 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1058 runs0 likes0 downloads0 reach0 impact
167 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1252 runs0 likes0 downloads0 reach0 impact
130 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1202 runs0 likes0 downloads0 reach0 impact
100 instances - 4 features - 2 classes - 0 missing values
This is the large soybean database from the UCI repository, with its training and test database combined into a single file. There are 19 classes, only the first 15 of which have been used in prior…
41045 runs0 likes0 downloads0 reach0 impact
683 instances - 36 features - 19 classes - 2337 missing values
1. Title: Postoperative Patient Data 2. Source Information: -- Creators: Sharon Summers, School of Nursing, University of Kansas Medical Center, Kansas City, KS 66160 Linda Woolery, School of Nursing,…
1758 runs0 likes0 downloads0 reach0 impact
90 instances - 9 features - 3 classes - 3 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
366 runs0 likes0 downloads0 reach0 impact
8844 instances - 61 features - 7 classes - 51515 missing values
This database contains the HTML source of web pages plus the ratings of a single user on these web pages. The web pages are on four separate subjects (Bands- recording artists; Goats; Sheep; and…
0 runs0 likes0 downloads0 reach0 impact
70 instances - 3 features - 3 classes - 0 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
434 runs0 likes0 downloads0 reach0 impact
7019 instances - 61 features - 8 classes - 48089 missing values
This database contains the HTML source of web pages plus the ratings of a single user on these web pages. The web pages are on four separate subjects (Bands- recording artists; Goats; Sheep; and…
0 runs0 likes0 downloads0 reach0 impact
61 instances - 3 features - 3 classes - 0 missing values
This data set contains unweighted PUMS census data from the Los Angeles and Long Beach areas for the years 1970, 1980, and 1990. The coding schemes have been standardized (by the IPUMS project) to be…
354 runs0 likes0 downloads0 reach0 impact
7485 instances - 61 features - 7 classes - 52048 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. The maps were scanned in 8 bit grey value at density of 400dpi,…
26538 runs0 likes0 downloads0 reach0 impact
2000 instances - 241 features - 10 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
622 runs0 likes0 downloads0 reach0 impact
10108 instances - 69 features - 2 classes - 2699 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
106 runs0 likes0 downloads0 reach0 impact
76 instances - 45 features - 2 classes - 22 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
752 runs0 likes0 downloads0 reach0 impact
339 instances - 18 features - 2 classes - 225 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
736 runs0 likes0 downloads0 reach0 impact
364 instances - 33 features - 2 classes - 80 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
721 runs0 likes0 downloads0 reach0 impact
226 instances - 70 features - 2 classes - 317 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
765 runs0 likes0 downloads0 reach0 impact
1728 instances - 7 features - 2 classes - 0 missing values
No data.
337 runs0 likes0 downloads0 reach0 impact
1000000 instances - 13 features - 3 classes - 0 missing values
No data.
45 runs0 likes0 downloads0 reach0 impact
1000000 instances - 23 features - 2 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
60197 instances - 6 features - classes - 42138 missing values
Insurance is a network for evaluating car insurance risks. The insurance data set contains the following 27 variables: GoodStudent (good student): a two-level factor with levels False and True. Age…
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 27 features - classes - 0 missing values
A Gold Standard of ~4,400 questions, answers, and comments from Stack Overflow, manually annotated for polarity. The dataset has been used for developing the EMTk toolkit for polarity detection from…
0 runs0 likes0 downloads0 reach0 impact
4423 instances - 2 features - 3 classes - 0 missing values
Real-world data set about the perching behaviour of two species of lizards in the South Bimini island, from Shoener (1968). The lizards data set contains the following variables: Species (the species…
0 runs0 likes0 downloads0 reach0 impact
409 instances - 3 features - classes - 0 missing values
A synthetic dataset from Lauritzen and Spiegelhalter (1988) about lung diseases (tuberculosis, lung cancer or bronchitis) and visits to Asia. A data frame with 5000 rows and 8 binary variables: D…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 8 features - classes - 0 missing values
Probable risk factors for coronary thrombosis, comprising data from 1841 men. The coronary data set contains the following 6 variables: Smoking (smoking): a two-level factor with levels no and yes. M.…
0 runs0 likes0 downloads0 reach0 impact
1841 instances - 6 features - classes - 0 missing values
Hailfinder is a Bayesian network designed to forecast severe summer hail in northeastern Colorado. The hailfinder data set contains the following 56 variables: N07muVerMo (10.7mu vertical motion): a…
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 56 features - classes - 0 missing values
A Gold Standard of ~4,400 questions, answers, and comments from Stack Overflow, manually annotated for polarity. The dataset has been used for developing the EMTk toolkit for polarity detection from…
0 runs0 likes0 downloads0 reach0 impact
3097 instances - 2 features - 3 classes - 0 missing values
A Gold Standard of ~4,400 questions, answers, and comments from Stack Overflow, manually annotated for polarity. The dataset has been used for developing the EMTk toolkit for polarity detection from…
0 runs0 likes0 downloads0 reach0 impact
1326 instances - 2 features - 3 classes - 0 missing values
A Gold Standard of ~4,400 questions, answers, and comments from Stack Overflow, manually annotated for polarity. The dataset has been used for developing the EMTk toolkit for polarity detection from…
0 runs0 likes0 downloads0 reach0 impact
4423 instances - 2 features - 3 classes - 0 missing values
The ALARM ("A Logical Alarm Reduction Mechanism") is a Bayesian network designed to provide an alarm message system for patient monitoring. The alarm data set contains the following 37 variables: CVP…
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 37 features - classes - 0 missing values
Car buying information
0 runs0 likes0 downloads0 reach0 impact
1750 instances - 7 features - classes - 0 missing values
Dutch News Articles This dataset contains all the articles published by the NOS as of the 1st of January 2010. The data is obtained by scraping the NOS website. The NOS is one of the biggest (online)…
0 runs0 likes0 downloads0 reach0 impact
237861 instances - 5 features - classes - 0 missing values
Context and Content The COVID-19 case surveillance system database includes individual-level data reported to U.S. states and autonomous reporting entities, including New York City and the District of…
0 runs0 likes0 downloads0 reach0 impact
8405079 instances - 11 features - classes - 9543526 missing values
Context Throughout the world of data science, there are many languages and tools that can be used to complete a given task. While you are often able to use whichever tool you prefer, it is often…
0 runs0 likes0 downloads0 reach0 impact
10153 instances - 4 features - classes - 9824 missing values
Context This dataset was scraped from http://www.asapsports.com/, using the code in this repository. I designed the webscraping code to account for most of the variance in the website's formatting,…
0 runs0 likes0 downloads0 reach0 impact
2096 instances - 6 features - classes - 0 missing values
This dataset contains moonrise, moonset and lunar phase timings for every date from 2005 to 2017 for London, UK*; collected from timeanddate. Inspiration This data can be used to study the effects of…
0 runs0 likes0 downloads0 reach0 impact
4748 instances - 8 features - classes - 13280 missing values
The number of children, youth and adults not attending schools or universities because of COVID-19 is soaring. Governments all around the world have closed educational institutions in an attempt to…
0 runs0 likes0 downloads0 reach0 impact
41714 instances - 4 features - classes - 0 missing values
The coronavirus pandemic has affected the entire world and many families have been destroyed. The stock exchange was also affected, but vaccine companies took advantage of this moment and leveraged…
0 runs0 likes0 downloads0 reach0 impact
255 instances - 36 features - classes - 35 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#alarm) - Number of nodes: 37 - Number of arcs: 46 - Number of parameters: 509 - Average…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 37 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#alarm) - Number of nodes: 37 - Number of arcs: 46 - Number of parameters: 509 - Average…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 37 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#alarm) - Number of nodes: 37 - Number of arcs: 46 - Number of parameters: 509 - Average…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 37 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#alarm) - Number of nodes: 37 - Number of arcs: 46 - Number of parameters: 509 - Average…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 37 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#alarm) - Number of nodes: 37 - Number of arcs: 46 - Number of parameters: 509 - Average…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 37 features - classes - 0 missing values
Dataset used by Buntine and Niblett (1992). Composed of 10 features, one of which is irrelevant. The target is a disjunctive normal form formula over the nine other attributes, with additional…
31 runs0 likes0 downloads0 reach22 impact
973 instances - 10 features - 2 classes - 0 missing values
flare-pmlb
32 runs0 likes0 downloads0 reach0 impact
1066 instances - 11 features - 2 classes - 0 missing values
Relevant Information: -- The database contains 3 potential classes, one for the number of times a certain type of solar flare occured in a 24 hour period. -- Each instance represents captured features…
31 runs0 likes0 downloads0 reach0 impact
1066 instances - 13 features - 6 classes - 0 missing values
threeOf9-pmlb
31 runs0 likes0 downloads0 reach0 impact
512 instances - 10 features - 2 classes - 0 missing values
parity5_plus_5-pmlb
31 runs0 likes0 downloads0 reach22 impact
1124 instances - 11 features - 2 classes - 0 missing values
The origin is not clear, but presumably this is an artificial problem representing M-of-N rules. The target is 1 if a certain M 'bits' are '1'? (Joaquin Vanschoren)
31 runs0 likes1 downloads1 reach22 impact
1324 instances - 11 features - 2 classes - 0 missing values
mux6-pmlb
31 runs0 likes0 downloads0 reach0 impact
128 instances - 7 features - 2 classes - 0 missing values
postoperative-patient-data-pmlb
26 runs0 likes0 downloads0 reach0 impact
88 instances - 9 features - 2 classes - 0 missing values
Relevant Information: -- The database contains 3 potential classes, one for the number of times a certain type of solar flare occured in a 24 hour period. -- Each instance represents captured features…
31 runs0 likes0 downloads0 reach0 impact
315 instances - 13 features - 5 classes - 0 missing values
cleveland-nominal-pmlb
31 runs0 likes0 downloads0 reach0 impact
303 instances - 8 features - 5 classes - 0 missing values
parity5-pmlb
32 runs0 likes0 downloads0 reach0 impact
32 instances - 6 features - 2 classes - 0 missing values
fake dataset without any value
0 runs0 likes0 downloads0 reach0 impact
73503 instances - 4 features - classes - 0 missing values
No data.
71 runs0 likes5 downloads5 reach12 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
63 runs0 likes0 downloads0 reach0 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
63 runs0 likes0 downloads0 reach0 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
68 runs0 likes0 downloads0 reach0 impact
1000000 instances - 10 features - 2 classes - 0 missing values
No data.
48 runs0 likes0 downloads0 reach0 impact
1000000 instances - 77 features - 10 classes - 0 missing values
No data.
50 runs0 likes0 downloads0 reach0 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
167 runs0 likes0 downloads0 reach0 impact
399940 instances - 1002 features - 2 classes - 0 missing values
No data.
73 runs0 likes0 downloads0 reach0 impact
1000000 instances - 30 features - 2 classes - 0 missing values
No data.
50 runs0 likes0 downloads0 reach0 impact
1000000 instances - 61 features - 2 classes - 0 missing values
No data.
90 runs0 likes0 downloads0 reach0 impact
137781 instances - 10 features - 7 classes - 0 missing values
No data.
314 runs0 likes0 downloads0 reach0 impact
1000000 instances - 36 features - 19 classes - 0 missing values
No data.
219 runs0 likes0 downloads0 reach0 impact
1000000 instances - 58 features - 2 classes - 0 missing values
No data.
66 runs0 likes0 downloads0 reach0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
1457 runs0 likes0 downloads0 reach0 impact
39366 instances - 10 features - 2 classes - 0 missing values
No data.
66 runs0 likes0 downloads0 reach0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
334 runs0 likes0 downloads0 reach0 impact
1000000 instances - 33 features - 2 classes - 0 missing values
No data.
70 runs0 likes0 downloads0 reach0 impact
1000000 instances - 14 features - 2 classes - 0 missing values
No data.
68 runs0 likes0 downloads0 reach0 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
69 runs0 likes0 downloads0 reach0 impact
1000000 instances - 20 features - 2 classes - 0 missing values
No data.
356 runs0 likes0 downloads0 reach0 impact
131072 instances - 17 features - 2 classes - 0 missing values