Data
This is a preprocessed version of the anneal dataset (version 1). All missing values are treated as a nominal value with label '?'. (Quotes for clarity). Because this is not good…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
898 instances - 39 features - 5 classes - 0 missing values
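A minimal sketch of the preprocessing described for this version: every missing entry is replaced with the literal label '?' so that it behaves like any other nominal category. The attribute names and rows below are hypothetical and are not taken from the anneal data.

```python
# Hypothetical rows; None marks a missing entry.
rows = [
    {"attr_a": "A", "attr_b": None},
    {"attr_a": None, "attr_b": "smooth"},
]

MISSING_LABEL = "?"


def fill_missing(rows, label=MISSING_LABEL):
    """Return copies of the rows with every None replaced by the nominal label."""
    return [
        {key: (label if value is None else value) for key, value in row.items()}
        for row in rows
    ]


filled = fill_missing(rows)
# Count how many missing entries remain after filling.
missing_left = sum(value is None for row in filled for value in row.values())
```

After this step the table reports zero missing values, which is consistent with the summary line of this version, although '?' is now an ordinary category that a learner cannot tell apart from a real value.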
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
898 instances - 39 features - 2 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
898 instances - 39 features - 2 classes - 0 missing values
The original Annealing dataset from UCI. The exact meaning of the features and classes is largely unknown. Annealing, in metallurgy and materials science, is a heat treatment that alters the physical…
13779 runs - 0 likes - 16 downloads - 16 reach - 14 impact
898 instances - 39 features - 5 classes - 22175 missing values
Author: Alen Shapiro Source: [UCI](https://archive.ics.uci.edu/ml/datasets/Chess+(King-Rook+vs.+King-Pawn)) Please cite: [UCI citation policy](https://archive.ics.uci.edu/ml/citation_policy.html) 1.…
274205 runs - 1 likes - 44 downloads - 45 reach - 16 impact
3196 instances - 37 features - 2 classes - 0 missing values
Date: Tue, 15 Nov 88 15:44:08 EST From: stan To: aha@ICS.UCI.EDU 1. Title: Final settlements in labor negotiations in Canadian industry 2. Source Information -- Creators:…
7681 runs - 0 likes - 17 downloads - 17 reach - 13 impact
57 instances - 17 features - 2 classes - 326 missing values
The aim is to determine the type of arrhythmia from the ECG recordings. This database contains 279 attributes, 206 of which are linear valued and the rest are nominal. Concerning the study of H. Altay…
4430 runs - 0 likes - 50 downloads - 50 reach - 15 impact
452 instances - 280 features - 13 classes - 408 missing values
1. TITLE: Letter Image Recognition Data The objective is to identify each of a large number of black-and-white rectangular pixel displays as one of the 26 capital letters in the English alphabet. The…
69578 runs - 3 likes - 75 downloads - 78 reach - 12 impact
20000 instances - 17 features - 26 classes - 0 missing values
This database is a standardized version of the original audiology database (see audiology.* in this directory). The non-standard set of attributes have been converted to a standard set of attributes…
7303 runs - 0 likes - 14 downloads - 14 reach - 12 impact
226 instances - 70 features - 24 classes - 317 missing values
The first 5 variables are all blood tests which are thought to be sensitive to liver disorders that might arise from excessive alcohol consumption. Each line in the dataset constitutes the record of a…
193 runs - 0 likes - 0 downloads - 0 reach - 0 impact
345 instances - 6 features - 0 classes - 0 missing values
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics, (b) its assigned insurance risk rating, (c) its normalized losses in use as…
3252 runs - 2 likes - 26 downloads - 28 reach - 10 impact
205 instances - 26 features - 6 classes - 59 missing values
Citation Request: This lymphography domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
1972 runs - 0 likes - 32 downloads - 32 reach - 12 impact
148 instances - 19 features - 4 classes - 0 missing values
This data set was generated to model psychological experimental results. Each example is classified as having the balance scale tip to the right, tip to the left, or be balanced. The attributes are…
29968 runs - 2 likes - 18 downloads - 20 reach - 15 impact
625 instances - 5 features - 3 classes - 0 missing values
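For illustration, the labeling can be sketched by comparing the torque on each side of the scale. This assumes the four attributes are left weight, left distance, right weight, and right distance, each taking values 1 to 5 (which would account for 5^4 = 625 instances); the function name and the single-letter labels are hypothetical.

```python
from collections import Counter
from itertools import product


def balance_class(left_weight, left_distance, right_weight, right_distance):
    """Label a configuration by comparing weight * distance on each side."""
    left_torque = left_weight * left_distance
    right_torque = right_weight * right_distance
    if left_torque > right_torque:
        return "L"  # tips to the left
    if right_torque > left_torque:
        return "R"  # tips to the right
    return "B"      # balanced


# Enumerate every configuration under the assumed 1..5 value range.
counts = Counter(balance_class(*config) for config in product(range(1, 6), repeat=4))
```

Under these assumptions the balanced class is much rarer than the other two, so plain accuracy can hide poor performance on it.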
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
37684 runs - 0 likes - 18 downloads - 18 reach - 15 impact
2000 instances - 217 features - 10 classes - 0 missing values
Citation Request: This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
2009 runs - 1 likes - 37 downloads - 38 reach - 9 impact
286 instances - 10 features - 2 classes - 9 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
38337 runs - 0 likes - 13 downloads - 13 reach - 13 impact
2000 instances - 77 features - 10 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code IDs were removed. Note that there is also a related Breast Cancer Wisconsin (Diagnosis) Data Set with a different set of…
28632 runs - 2 likes - 22 downloads - 24 reach - 9 impact
699 instances - 10 features - 2 classes - 16 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
38799 runs - 0 likes - 20 downloads - 20 reach - 12 impact
2000 instances - 65 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
35656 runs - 1 likes - 19 downloads - 20 reach - 12 impact
2000 instances - 7 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. The maps were scanned in 8 bit grey value at density of 400dpi,…
26529 runs - 0 likes - 18 downloads - 18 reach - 13 impact
2000 instances - 241 features - 10 classes - 0 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. Corresponding patterns in different datasets correspond to the same…
34868 runs - 0 likes - 23 downloads - 23 reach - 12 impact
2000 instances - 48 features - 10 classes - 0 missing values
1. Title: Contraceptive Method Choice 2. Sources: (a) Origin: This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey (b) Creator: Tjen-Sien Lim (limt@stat.wisc.edu)…
24219 runs - 0 likes - 21 downloads - 21 reach - 10 impact
1473 instances - 10 features - 3 classes - 0 missing values
### Description This dataset describes mushrooms in terms of their physical characteristics. They are classified into: poisonous or edible. ### Source ``` (a) Origin: Mushroom records are drawn from…
16692 runs - 1 likes - 42 downloads - 43 reach - 13 impact
8124 instances - 23 features - 2 classes - 2480 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) Database of surgeries on horses. Possible class attributes: 24 (whether lesion is surgical), others include: 23, 25, 26, and 27 Notes: * Hospital_Number…
236 runs - 0 likes - 9 downloads - 9 reach - 9 impact
368 instances - 27 features - 2 classes - 1927 missing values
1. Title: Nursery Database 2. Sources: (a) Creator: Vladislav Rajkovic et al. (13 experts) (b) Donors: Marko Bohanec (marko.bohanec@ijs.si) Blaz Zupan (blaz.zupan@ijs.si) (c) Date: June, 1997 3. Past…
2210 runs - 0 likes - 18 downloads - 18 reach - 12 impact
12960 instances - 9 features - 5 classes - 0 missing values
Donor: Will Taylor (taylor@pluto.arc.nasa.gov) In this version (version 2), some features were removed. It is unclear why or how this was done.
1883 runs - 1 likes - 10 downloads - 11 reach - 9 impact
368 instances - 23 features - 2 classes - 1927 missing values
1. Title of Database: Optical Recognition of Handwritten Digits 2. Source: E. Alpaydin, C. Kaynak Department of Computer Engineering Bogazici University, 80815 Istanbul Turkey alpaydin@boun.edu.tr…
36107 runs - 3 likes - 22 downloads - 25 reach - 12 impact
5620 instances - 65 features - 10 classes - 0 missing values
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data. This dataset is interesting because…
25377 runs - 1 likes - 36 downloads - 37 reach - 12 impact
690 instances - 16 features - 2 classes - 67 missing values
1. Title of Database: Blocks Classification 2. Sources: (a) Donato Malerba Dipartimento di Informatica University of Bari via Orabona 4 70126 Bari - Italy phone: +39 - 80 - 5443269 fax: +39 - 80 -…
2719 runs - 0 likes - 18 downloads - 18 reach - 12 impact
5473 instances - 11 features - 5 classes - 0 missing values
This dataset classifies people described by a set of attributes as good or bad credit risks. This dataset comes with a cost matrix:
```
      Good  Bad  (predicted)
Good     0    1  (actual)
Bad      5    0
```
It is worse…
506285 runs - 28 likes - 311 downloads - 339 reach - 32 impact
1000 instances - 21 features - 2 classes - 0 missing values
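A minimal sketch of how the cost matrix from the description could be applied when scoring a classifier. Only the four cost values come from the description; the label strings, the function name, and the example predictions are hypothetical.

```python
# Cost matrix from the description, indexed as COST[actual][predicted].
COST = {
    "good": {"good": 0, "bad": 1},
    "bad": {"good": 5, "bad": 0},
}


def total_cost(actual, predicted):
    """Sum the misclassification cost over paired actual/predicted labels."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    return sum(COST[a][p] for a, p in zip(actual, predicted))


# Hypothetical labels: one good customer rejected, one bad customer accepted.
actual = ["good", "bad", "bad", "good"]
predicted = ["bad", "good", "bad", "good"]
cost = total_cost(actual, predicted)
```

With these values, accepting a bad customer costs five times as much as rejecting a good one, so a classifier tuned for plain accuracy is not necessarily the cheapest.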
We create a digit database by collecting 250 samples from 44 writers. The samples, written by 30 writers, are used for training, cross-validation and writer dependent testing, and the digits written…
37564 runs - 0 likes - 21 downloads - 21 reach - 12 impact
10992 instances - 17 features - 10 classes - 0 missing values
1. Title: Postoperative Patient Data 2. Source Information: -- Creators: Sharon Summers, School of Nursing, University of Kansas Medical Center, Kansas City, KS 66160 Linda Woolery, School of Nursing,…
1758 runs - 0 likes - 10 downloads - 10 reach - 9 impact
90 instances - 9 features - 3 classes - 3 missing values
1. Title: Dermatology Database 2. Source Information: (a) Original owners: -- 1. Nilsel Ilter, M.D., Ph.D., Gazi University, School of Medicine 06510 Ankara, Turkey Phone: +90 (312) 214 1080 -- 2. H.…
1756 runs - 0 likes - 14 downloads - 14 reach - 12 impact
366 instances - 35 features - 6 classes - 8 missing values
The instances were drawn randomly from a database of 7 outdoor images. The images were hand-segmented to create a classification for every pixel. Each instance is a 3x3 region. ### Attribute…
23424 runs - 0 likes - 25 downloads - 25 reach - 12 impact
2310 instances - 20 features - 7 classes - 0 missing values
1. Title: Pima Indians Diabetes Database 2. Sources: (a) Original owners: National Institute of Diabetes and Digestive and Kidney Diseases (b) Donor of database: Vincent Sigillito…
202456 runs - 8 likes - 107 downloads - 115 reach - 19 impact
768 instances - 9 features - 2 classes - 0 missing values
Attribute information: ``` sick, negative. | classes age: continuous. sex: M, F. on thyroxine: f, t. query on thyroxine: f, t. on antithyroid medication: f, t. sick: f, t. pregnant: f, t. thyroid…
19943 runs - 0 likes - 32 downloads - 32 reach - 9 impact
3772 instances - 30 features - 2 classes - 6064 missing values
1. Title: Protein Localization Sites 2. Creator and Maintainer: Kenta Nakai Institute of Molecular and Cellular Biology Osaka University 1-3 Yamada-oka, Suita 565 Japan nakai@imcb.osaka-u.ac.jp…
1806 runs - 0 likes - 15 downloads - 15 reach - 12 impact
336 instances - 8 features - 8 classes - 0 missing values
NAME: Sonar, Mines vs. Rocks SUMMARY: This is the data set used by Gorman and Sejnowski in their study of the classification of sonar signals using a neural network [1]. The task is to train a network…
2372 runs - 1 likes - 25 downloads - 26 reach - 9 impact
208 instances - 61 features - 2 classes - 0 missing values
1. Title: Glass Identification Database 2. Sources: (a) Creator: B. German -- Central Research Establishment Home Office Forensic Science Service Aldermaston, Reading, Berkshire RG7 4PN (b) Donor:…
1776 runs - 1 likes - 52 downloads - 53 reach - 9 impact
214 instances - 10 features - 6 classes - 0 missing values
This is the large soybean database from the UCI repository, with its training and test database combined into a single file. There are 19 classes, only the first 15 of which have been used in prior…
41045 runs - 1 likes - 56 downloads - 57 reach - 13 impact
683 instances - 36 features - 19 classes - 2337 missing values
1. Title: Haberman's Survival Data 2. Sources: (a) Donor: Tjen-Sien Lim (limt@stat.wisc.edu) (b) Date: March 4, 1999 3. Past Usage: 1. Haberman, S. J. (1976). Generalized Residuals for Log-Linear…
3241 runs - 1 likes - 19 downloads - 20 reach - 9 impact
306 instances - 4 features - 2 classes - 0 missing values
SPAM E-mail Database The "spam" concept is diverse: advertisements for products/websites, make money fast schemes, chain letters, pornography... Our collection of spam e-mails came from our postmaster…
161987 runs - 5 likes - 93 downloads - 98 reach - 12 impact
4601 instances - 58 features - 2 classes - 0 missing values
Primate splice-junction gene sequences (DNA) with associated imperfect domain theory. Splice junctions are points on a DNA sequence at which 'superfluous' DNA is removed during the process of protein…
24639 runs - 1 likes - 18 downloads - 19 reach - 9 impact
3190 instances - 61 features - 3 classes - 0 missing values
1. Title: Teaching Assistant Evaluation 2. Sources: (a) Collector: Wei-Yin Loh (Department of Statistics, UW-Madison) (b) Donor: Tjen-Sien Lim (limt@stat.wisc.edu) (c) Date: June 7, 1997 3. Past…
2028 runs - 0 likes - 13 downloads - 13 reach - 9 impact
151 instances - 6 features - 3 classes - 0 missing values
Publication Request: This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1763 runs - 0 likes - 10 downloads - 10 reach - 10 impact
303 instances - 14 features - 2 classes - 7 missing values
This database encodes the complete set of possible board configurations at the end of tic-tac-toe games, where "x" is assumed to have played first. The target concept is "win for x" (i.e., true when…
386781 runs - 2 likes - 97 downloads - 99 reach - 10 impact
958 instances - 10 features - 2 classes - 0 missing values
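The target concept "win for x" can be sketched as a check over the eight possible three-in-a-row lines. The board encoding here (a 9-character string in row-major order, with 'x', 'o', and 'b' for blank) is an assumption made for illustration.

```python
# Index triples for the 3 rows, 3 columns, and 2 diagonals of a 3x3 board.
LINES = [
    (0, 1, 2), (3, 4, 5), (6, 7, 8),
    (0, 3, 6), (1, 4, 7), (2, 5, 8),
    (0, 4, 8), (2, 4, 6),
]


def win_for_x(board):
    """Return True when 'x' occupies all three squares of at least one line."""
    if len(board) != 9:
        raise ValueError("board must have exactly 9 squares")
    return any(all(board[i] == "x" for i in line) for line in LINES)


top_row_win = win_for_x("xxxoobbbb")   # x fills the top row
diagonal_win = win_for_x("xoxoxoxbb")  # x fills the anti-diagonal
no_win = win_for_x("xoxxoooxx")        # full board, no x line
```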
Publication Request: This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1792 runs - 0 likes - 12 downloads - 12 reach - 9 impact
294 instances - 14 features - 2 classes - 782 missing values
1. Title: INDUCE Trains Data set 2. Sources: - Donor: GMU, Center for AI, Software Librarian, Eric E. Bloedorn (bloedorn@aic.gmu.edu) - Original owners: Ryszard S. Michalski (michalski@aic.gmu.edu)…
1973 runs - 0 likes - 9 downloads - 9 reach - 15 impact
10 instances - 33 features - 2 classes - 51 missing values
This database contains 13 attributes (which have been extracted from a larger set of 75). Attribute Information: -- 1. age -- 2. sex -- 3. chest pain type (4 values) -- 4.…
3214 runs - 0 likes - 19 downloads - 19 reach - 12 impact
270 instances - 14 features - 2 classes - 0 missing values
NAME vehicle silhouettes PURPOSE to classify a given silhouette as one of four types of vehicle, using a set of features extracted from the silhouette. The vehicle may be viewed from one of many…
31958 runs - 2 likes - 35 downloads - 37 reach - 11 impact
846 instances - 19 features - 4 classes - 0 missing values
1. Title: Hepatitis Domain 2. Sources: (a) unknown (b) Donor: G.Gong (Carnegie-Mellon University) via Bojan Cestnik Jozef Stefan Institute Jamova 39 61000 Ljubljana Yugoslavia (tel.: (38)(+61) 214-399…
2134 runs - 1 likes - 13 downloads - 14 reach - 9 impact
155 instances - 20 features - 2 classes - 167 missing values
1. Title: 1984 United States Congressional Voting Records Database 2. Source Information: (a) Source: Congressional Quarterly Almanac, 98th Congress, 2nd session 1984, Volume XL: Congressional…
2262 runs - 0 likes - 17 downloads - 17 reach - 9 impact
435 instances - 17 features - 2 classes - 392 missing values
Thyroid disease records supplied by the Garavan Institute and J. Ross Quinlan, New South Wales Institute, Sydney, Australia. 1987. hypothyroid, primary hypothyroid, compensated…
883 runs - 0 likes - 13 downloads - 13 reach - 9 impact
3772 instances - 30 features - 4 classes - 6064 missing values
This radar data was collected by a system in Goose Bay, Labrador. This system consists of a phased array of 16 high-frequency antennas with a total transmitted power on the order of 6.4 kilowatts. See…
2490 runs - 3 likes - 28 downloads - 31 reach - 12 impact
351 instances - 35 features - 2 classes - 0 missing values
Generator generating 3 classes of waves. Each class is generated from a combination of 2 of 3 "base" waves. For details, see Breiman, L., Friedman, J.H., Olshen, R.A., and Stone, C.J. (1984).…
20125 runs - 1 likes - 54 downloads - 55 reach - 12 impact
5000 instances - 41 features - 3 classes - 0 missing values
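A rough sketch of how each class could combine 2 of 3 base waves. The description does not give the details, so the triangular base shapes, the 21 sample points, the uniform mixing weight, and the unit Gaussian noise below are all assumptions made for illustration, as are the function and constant names.

```python
import random

N_POINTS = 21  # assumed number of sample points per wave


def base_wave(center):
    """Triangular wave of height 6 centered at the given point (assumed shape)."""
    return [max(6 - abs(i - center), 0) for i in range(1, N_POINTS + 1)]


# Three assumed base waves: the same triangle shifted left and right.
BASES = [base_wave(11), base_wave(15), base_wave(7)]

# Each class mixes a different pair of base waves.
CLASS_PAIRS = {0: (0, 1), 1: (0, 2), 2: (1, 2)}


def sample_wave(label, rng):
    """Draw one noisy wave for a class as a random convex mix of its two bases."""
    first, second = CLASS_PAIRS[label]
    weight = rng.random()
    return [
        weight * BASES[first][i] + (1 - weight) * BASES[second][i] + rng.gauss(0, 1)
        for i in range(N_POINTS)
    ]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
wave = sample_wave(0, rng)
```

The 41 features reported above would need more columns than this sketch produces; any extra noise attributes are not modeled here.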
This is perhaps the best known database to be found in the pattern recognition literature. Fisher's paper is a classic in the field and is referenced frequently to this day. (See Duda & Hart, for…
7545 runs - 11 likes - 162 downloads - 173 reach - 27 impact
150 instances - 5 features - 3 classes - 0 missing values
A simple database containing 17 Boolean-valued attributes describing animals. The "type" attribute appears to be the class attribute. Notes: * I find it unusual that there are 2 instances of "frog"…
179 runs - 4 likes - 22 downloads - 26 reach - 9 impact
101 instances - 17 features - 7 classes - 0 missing values
No data.
143 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
66 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
324 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
71 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
60 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
63 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
63 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
68 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 10 features - 2 classes - 0 missing values
No data.
48 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 77 features - 10 classes - 0 missing values
No data.
50 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
67 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
66 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
51 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 48 features - 10 classes - 0 missing values
No data.
1038 runs - 0 likes - 10 downloads - 10 reach - 9 impact
55296 instances - 10 features - 3 classes - 0 missing values
No data.
326 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
70 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 28 features - 2 classes - 0 missing values
No data.
72 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
194 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
73 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 16 features - 2 classes - 0 missing values
No data.
87 runs - 0 likes - 0 downloads - 0 reach - 0 impact
295245 instances - 11 features - 5 classes - 0 missing values
No data.
68 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 21 features - 2 classes - 0 missing values
No data.
67 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 10 classes - 0 missing values
This data set consists of the petal and sepal lengths of 3 different types of irises (Setosa, Versicolour, and Virginica), stored in a 150x4 numpy.ndarray
65 runs - 0 likes - 4 downloads - 4 reach - 10 impact
1000000 instances - 40 features - 2 classes - 0 missing values
No data.
66 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 35 features - 6 classes - 0 missing values
No data.
211 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 20 features - 7 classes - 0 missing values
No data.
73 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 30 features - 2 classes - 0 missing values
No data.
50 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 61 features - 2 classes - 0 missing values
No data.
90 runs - 0 likes - 0 downloads - 0 reach - 0 impact
137781 instances - 10 features - 7 classes - 0 missing values
No data.
314 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 36 features - 19 classes - 0 missing values
No data.
219 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 58 features - 2 classes - 0 missing values
No data.
66 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
1457 runs - 0 likes - 13 downloads - 13 reach - 9 impact
39366 instances - 10 features - 2 classes - 0 missing values
No data.
66 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
334 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 33 features - 2 classes - 0 missing values
No data.
70 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 14 features - 2 classes - 0 missing values
No data.
68 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
69 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 20 features - 2 classes - 0 missing values
No data.
356 runs - 0 likes - 0 downloads - 0 reach - 0 impact
131072 instances - 17 features - 2 classes - 0 missing values
No data.
65 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 30 features - 4 classes - 0 missing values
No data.
230 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 35 features - 2 classes - 0 missing values
No data.
63 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 41 features - 3 classes - 0 missing values
No data.
65 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 18 features - 7 classes - 0 missing values
Dataset created to study concept drift in stream mining. It is constructed by combining the Covertype, Poker-Hand, and Electricity datasets. More details can be found in: Albert Bifet, Geoff Holmes,…
332 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1455525 instances - 73 features - 10 classes - 0 missing values