Data
No data.
108 runs - 0 likes - 0 downloads - 0 reach - 0 impact
927 instances - 10129 features - 7 classes - 0 missing values
ARCENE's task is to distinguish cancer versus normal patterns from mass-spectrometric data. This is a two-class classification problem with continuous input variables. This dataset is one of 5…
17 runs - 0 likes - 0 downloads - 0 reach - 0 impact
200 instances - 10001 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
0 runs - 0 likes - 1 downloads - 1 reach - 17 impact
100 instances - 10001 features - 2 classes - 0 missing values
Dataset creator and donor: Zhi Liu, e-mail: liuzhi8673 '@' gmail.com, institution: National Engineering Research Center for E-Learning, Hubei Wuhan, China. Data Set Information: dataset are derived…
65168 runs - 2 likes - 49 downloads - 51 reach - 217 impact
1500 instances - 10001 features - 50 classes - 0 missing values
Domain dataset
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1637 instances - 9839 features - 3 classes - 13231887 missing values
No data.
163 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1560 instances - 8461 features - 20 classes - 0 missing values
No data.
414 runs - 0 likes - 0 downloads - 0 reach - 0 impact
690 instances - 8262 features - 10 classes - 0 missing values
No data.
220 runs - 0 likes - 0 downloads - 0 reach - 0 impact
336 instances - 7903 features - 6 classes - 0 missing values
MEG data from auditory stimulation experiment using 305 sensors. The design matrix/forward operator is `data[:, :7498]`. The measurements for left stimulation are `data[:, 7498:7583]`. The…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
305 instances - 7668 features - classes - 0 missing values
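The column ranges quoted in this description can be applied as plain slices. The following is a minimal sketch, assuming the data is held as 305 rows of 7668 columns each (the shape listed for this entry). The all-zero placeholder matrix and the function name `split_meg_columns` are illustrative assumptions, not part of the dataset. The description is truncated after the left-stimulation range, so the remaining columns are returned unlabelled rather than guessed at.

```python
def split_meg_columns(rows, design_end=7498, left_end=7583):
    """Split each row into the column ranges named in the description.

    Mirrors data[:, :7498] and data[:, 7498:7583]. Columns after
    left_end are returned as-is, because the listing is truncated there.
    """
    design = [row[:design_end] for row in rows]
    left = [row[design_end:left_end] for row in rows]
    rest = [row[left_end:] for row in rows]
    return design, left, rest


# Placeholder matrix with the listed shape (305 rows x 7668 columns).
data = [[0.0] * 7668 for _ in range(305)]
design, left, rest = split_meg_columns(data)
print(len(design), len(design[0]), len(left[0]), len(rest[0]))
```

With a NumPy array the same split is simply the two slice expressions given in the description; the list version above only avoids the extra dependency.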
MEG data from auditory stimulation experiment using 305 sensors. The design matrix/forward operator is `data[:, :7498]`. The measurements for left stimulation are `data[:, 7498:7583]`. The…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
305 instances - 7498 features - classes - 0 missing values
No data.
203 runs - 0 likes - 0 downloads - 0 reach - 0 impact
878 instances - 7455 features - 10 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
15 runs - 0 likes - 1 downloads - 1 reach - 21 impact
10000 instances - 7201 features - 10 classes - 0 missing values
libSVM, AAD group. A simple and efficient algorithm for gene selection using sparse logistic regression. Bioinformatics, 19(17):2246-2253, 2003. Dataset from the LIBSVM data repository.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
86 instances - 7130 features - 0 classes - 0 missing values
Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring. Science, VOL 286, pp. 531-537, 15 October 1999. Web supplement to the article T.R. Golub, D. K.…
451 runs - 0 likes - 0 downloads - 0 reach - 0 impact
72 instances - 7130 features - 2 classes - 0 missing values
Embryonal tumours of the central nervous system. Prediction of Central Nervous System Embryonal Tumour Outcome based on Gene Expression. Nature, VOL 415, pp. 436-442, 24 January 2002. Scott L. Pomeroy,…
343 runs - 0 likes - 0 downloads - 0 reach - 0 impact
60 instances - 7130 features - 2 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
60 instances - 7130 features - 2 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
72 instances - 7130 features - 2 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
72 instances - 7130 features - 3 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
72 instances - 7130 features - 4 classes - 0 missing values
No data.
219 runs - 0 likes - 0 downloads - 0 reach - 0 impact
414 instances - 6430 features - 9 classes - 0 missing values
eating
9413 runs - 0 likes - 0 downloads - 0 reach - 0 impact
945 instances - 6374 features - 7 classes - 0 missing values
No data.
215 runs - 0 likes - 0 downloads - 0 reach - 0 impact
204 instances - 5833 features - 6 classes - 0 missing values
No data.
211 runs - 0 likes - 0 downloads - 0 reach - 0 impact
313 instances - 5805 features - 8 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
60 instances - 5727 features - 9 classes - 0 missing values
eevrr der
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1557 instances - 5629 features - classes - 0 missing values
dd efrg
15 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1556 instances - 5629 features - classes - 0 missing values
sde c
5 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1556 instances - 5629 features - classes - 0 missing values
as cscs
3 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1557 instances - 5629 features - classes - 0 missing values
de d
3 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1556 instances - 5628 features - classes - 0 missing values
fr frf
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1556 instances - 5629 features - classes - 0 missing values
ef r
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1557 instances - 5629 features - classes - 0 missing values
as dwd
1 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1557 instances - 5629 features - classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
77 instances - 5470 features - 2 classes - 0 missing values
GISETTE is a handwritten digit recognition problem. The problem is to separate the highly confusable digits '4' and '9'. This dataset is one of five datasets of the NIPS 2003 feature selection…
466 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7000 instances - 5001 features - 2 classes - 0 missing values
According to Epsilon research, 80% of customers are more likely to do business with you if you provide personalized service. Banking is no exception. The digitalization of everyday lives means that…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4459 instances - 4992 features - 0 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
14 runs - 0 likes - 1 downloads - 1 reach - 21 impact
20000 instances - 4297 features - 2 classes - 0 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
13 runs - 1 likes - 1 downloads - 2 reach - 21 impact
20000 instances - 4297 features - 2 classes - 0 missing values
This dataset contains a set of face images taken between April 1992 and April 1994 at AT&T Laboratories Cambridge. As described on the original website: There are ten different images of each of 40…
53 runs - 0 likes - 0 downloads - 0 reach - 0 impact
400 instances - 4097 features - 40 classes - 0 missing values
No data.
283 runs - 0 likes - 0 downloads - 0 reach - 0 impact
96 instances - 4027 features - 11 classes - 19667 missing values
No data.
496 runs - 0 likes - 0 downloads - 0 reach - 0 impact
45 instances - 4027 features - 2 classes - 5948 missing values
No data.
296 runs - 0 likes - 0 downloads - 0 reach - 0 impact
96 instances - 4027 features - 9 classes - 19667 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
66 instances - 4027 features - 3 classes - 12269 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
96 instances - 4027 features - 9 classes - 19667 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
96 instances - 4027 features - 11 classes - 19667 missing values
No data.
159 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1657 instances - 3759 features - 25 classes - 0 missing values
No data.
416 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1050 instances - 3239 features - 10 classes - 0 missing values
No data.
428 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1003 instances - 3183 features - 10 classes - 0 missing values
No data.
377 runs - 0 likes - 0 downloads - 0 reach - 0 impact
913 instances - 3101 features - 10 classes - 0 missing values
SVHN is a real-world image dataset for developing machine learning and object recognition algorithms with minimal requirement on data preprocessing and formatting. It can be seen as similar in flavor…
52 runs - 0 likes - 0 downloads - 0 reach - 0 impact
99289 instances - 3073 features - 10 classes - 0 missing values
This is a 20,000 instance sample of the original CIFAR-10 dataset. Sampled randomly and stratified, with 2000 examples per class. Training and test set are merged. Find the corresponding task for the…
380 runs - 0 likes - 0 downloads - 0 reach - 0 impact
20000 instances - 3073 features - 10 classes - 0 missing values
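A stratified sample of this kind (a fixed number of randomly chosen examples per class) can be sketched as follows. This is an illustrative sketch only, not the procedure actually used to build the dataset: the function name `stratified_sample`, the seed, and the toy label list are assumptions.

```python
import random
from collections import defaultdict


def stratified_sample(labels, per_class, seed=0):
    """Return sorted indices with exactly `per_class` items for each label."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    rng = random.Random(seed)  # fixed seed so the draw is repeatable
    chosen = []
    for label, indices in by_class.items():
        if len(indices) < per_class:
            raise ValueError(f"class {label!r} has only {len(indices)} items")
        chosen.extend(rng.sample(indices, per_class))
    return sorted(chosen)


# Toy labels: 10 classes with 6000 items each (merged train and test set).
labels = [i % 10 for i in range(60000)]
sample = stratified_sample(labels, per_class=2000)
print(len(sample))
```

Sampling within each class separately is what keeps the class proportions exact; drawing 20,000 indices from the whole set at once would only match them approximately.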
Classes: 0. airplane, 1. automobile, 2. bird, 3. cat, 4. deer, 5. dog, 6. frog, 7. horse, 8. ship, 9. truck. CIFAR-10 contains 6000 images per class. The original train-test split randomly divided these into 5000 train…
160 runs - 0 likes - 6 downloads - 6 reach - 21 impact
60000 instances - 3073 features - 10 classes - 0 missing values
This dataset is just like the CIFAR-10, except it has 100 classes containing 600 images each. There are 500 training images and 100 testing images per class. The 100 classes in the CIFAR-100 are…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
60000 instances - 3073 features - 100 classes - 0 missing values
10% stratified subsample of the original SVHN data
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9927 instances - 3073 features - 10 classes - 0 missing values
50% stratified subsample of the original SVHN data
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
49644 instances - 3073 features - 10 classes - 0 missing values
admet-bbb training data
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2054 instances - 3025 features - classes - 0 missing values
No data.
373 runs - 0 likes - 0 downloads - 0 reach - 0 impact
918 instances - 3013 features - 10 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
1 runs - 0 likes - 0 downloads - 0 reach - 13 impact
51839 instances - 2917 features - 43 classes - 0 missing values
ede wey
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
589 instances - 2909 features - classes - 0 missing values
No data.
222 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1504 instances - 2887 features - 13 classes - 0 missing values
Systematic determination of genetic network architecture. Nature Genetics, 1999 Jul;22(3):281-5. Data also used in Biclustering of Expression Data, by Yizong Cheng and George M. Church (web…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
17 instances - 2884 features - 0 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
83 instances - 2309 features - 4 classes - 0 missing values
libSVM, AAD group. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Cell Biology, 96:6745-6750, 1999. Dataset from…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
62 instances - 2001 features - 0 classes - 0 missing values
No data.
426 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2463 instances - 2001 features - 17 classes - 0 missing values
Data from the PASCAL Challenge 2008, as available on the LIBSVM repository. Description: Preprocessing: the raw data set (epsilon_train) is scaled instance-wise to unit length and split into two…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
500000 instances - 2001 features - 2 classes - 0 missing values
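Scaling each instance to unit length, as the preprocessing note describes, can be sketched as below. This assumes "unit length" means unit Euclidean (L2) norm, which the truncated description does not state explicitly. The function name `scale_to_unit_length` and the toy rows are illustrative.

```python
import math


def scale_to_unit_length(row):
    """Divide a feature vector by its Euclidean (L2) norm.

    All-zero vectors have no direction to preserve, so they are
    returned unchanged instead of dividing by zero.
    """
    norm = math.sqrt(sum(value * value for value in row))
    if norm == 0.0:
        return list(row)
    return [value / norm for value in row]


# Toy rows, scaled independently of one another.
rows = [[3.0, 4.0], [0.0, 0.0], [1.0, 1.0, 1.0, 1.0]]
scaled = [scale_to_unit_length(row) for row in rows]
print(scaled)
```

Because each row is scaled on its own, the operation needs no statistics from the rest of the data and can be applied identically to training and test instances.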
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
62 instances - 2001 features - 2 classes - 0 missing values
SOURCE: [ChaLearn Automatic Machine Learning Challenge (AutoML)](https://competitions.codalab.org/competitions/2321), [ChaLearn](https://automl.chalearn.org/data) This is a "supervised learning"…
8 runs - 0 likes - 2 downloads - 2 reach - 19 impact
10000 instances - 2001 features - 5 classes - 0 missing values
Context: Tumor microRNA expression profiling identifies circulating microRNAs for earlier breast cancer detection. Due to the role of microRNA in tumorigenesis and its remarkable stability in body fluids,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
133 instances - 1928 features - classes - 0 missing values
Context: Tumor microRNA expression profiling identifies circulating microRNAs for earlier breast cancer detection. Due to the role of microRNA in tumorigenesis and its remarkable stability in body fluids,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
133 instances - 1928 features - classes - 0 missing values
Predict a biological response of molecules from their chemical properties. Each row in this data set represents a molecule. The first column contains experimental data describing an actual biological…
48680 runs - 2 likes - 40 downloads - 42 reach - 36 impact
3751 instances - 1777 features - 2 classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch). Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php. Modified by TunedIT (converted to ARFF…
406 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4229 instances - 1618 features - 2 classes - 0 missing values
SOURCE: [ChaLearn Automatic Machine Learning Challenge (AutoML)](https://competitions.codalab.org/competitions/2321), [ChaLearn](https://automl.chalearn.org/data) This is a "supervised learning"…
8 runs - 1 likes - 2 downloads - 3 reach - 20 impact
5418 instances - 1637 features - 2 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
0 runs - 0 likes - 1 downloads - 1 reach - 11 impact
51839 instances - 1569 features - 43 classes - 0 missing values
The German Traffic Sign Benchmark is a multi-class, single-image classification challenge held at the International Joint Conference on Neural Networks (IJCNN) 2011. We cordially invite researchers…
0 runs - 0 likes - 1 downloads - 1 reach - 11 impact
51839 instances - 1569 features - 43 classes - 0 missing values
* Dataset Title: MicroMass - Mixed (mixed spectra version) * Abstract: A dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data. * Source:…
64 runs - 0 likes - 0 downloads - 0 reach - 0 impact
360 instances - 1301 features - 10 classes - 0 missing values
### Description MicroMass (pure spectra version) is a dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data. ### Source ``` Pierre Mahé,…
39941 runs - 1 likes - 17 downloads - 18 reach - 100 impact
571 instances - 1301 features - 20 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
6 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
34 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
34 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
274 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
5 runs - 0 likes - 0 downloads - 0 reach - 0 impact
32 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
12 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
26 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
37 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
11 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
30 instances - 1143 features - 0 classes - 0 missing values
Multi-label dataset for text-classification. It consists of article titles and partial blurbs. Blurbs can be assigned to several categories (e.g. Science, News, Games) based on word predictors.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3782 instances - 1101 features - classes - 0 missing values
Multi-label dataset for text-classification. It consists of article titles and partial blurbs. Blurbs can be assigned to several categories (e.g. Science, News, Games) based on word predictors.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3782 instances - 1101 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10402, and it has 65 rows and 1026 features (including…
1 runs - 0 likes - 0 downloads - 0 reach - 0 impact
65 instances - 1026 features - 0 classes - 0 missing values