Data
Filter results by:
Amazon Reviews data (data source) The repository has several datasets. For this case study, we are using the Electronics dataset.
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 3 features - classes - 0 missing values
This dataset contains all the player names and player ids, taken from Sofifa
0 runs0 likes0 downloads0 reach0 impact
11009 instances - 3 features - classes - 0 missing values
testing
0 runs0 likes0 downloads0 reach0 impact
366 instances - 3 features - classes - 0 missing values
Outliers data set extracted from the Illustration (Fig. 3) in "Novelty detection with application to data streams"
0 runs0 likes0 downloads0 reach0 impact
75 instances - 3 features - 4 classes - 0 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 3 features - 0 classes - 0 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 3 features - 0 classes - 0 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 3 features - 0 classes - 0 missing values
Synthetic 2-d data with N=5000 vectors and k=15 Gaussian clusters with different degree of cluster overlap P. Fränti and O. Virmajoki, "Iterative shrinking method for clustering…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 3 features - 0 classes - 0 missing values
swd dced
0 runs0 likes0 downloads0 reach0 impact
589 instances - 3 features - classes - 0 missing values
sdsw frfr
0 runs0 likes0 downloads0 reach0 impact
1556 instances - 3 features - classes - 0 missing values
efef ffrf
0 runs0 likes0 downloads0 reach0 impact
9 instances - 3 features - classes - 0 missing values
frf r
0 runs0 likes0 downloads0 reach0 impact
2 instances - 3 features - classes - 0 missing values
dd fgrfg
0 runs0 likes0 downloads0 reach0 impact
2 instances - 3 features - classes - 0 missing values
Author: Francesca Grisoni, Claudia S. Neuhaus, Miyabi Hishinuma, Gisela Gabernet, Jan A. Hiss, - Masaaki Kotera, Gisbert Schneider Source:…
0 runs0 likes0 downloads0 reach0 impact
901 instances - 3 features - classes - 0 missing values
Author: Francesca Grisoni, Claudia S. Neuhaus, Miyabi Hishinuma, Gisela Gabernet, Jan A. Hiss, - Masaaki Kotera, Gisbert Schneider Source:…
0 runs0 likes0 downloads0 reach0 impact
949 instances - 3 features - classes - 0 missing values
Arbres urbains
0 runs0 likes0 downloads0 reach0 impact
421 instances - 3 features - 1 classes - 0 missing values
Online advertisement clicking rates, where the metrics are cost-per-click (CPC) and cost per thousand impressions (CPM).
0 runs0 likes0 downloads0 reach0 impact
1643 instances - 3 features - classes - 0 missing values
artificial with anomaly
0 runs0 likes0 downloads0 reach0 impact
4032 instances - 3 features - classes - 0 missing values
artificial with anomaly
0 runs0 likes0 downloads0 reach0 impact
4032 instances - 3 features - 0 classes - 0 missing values
artificial with anomaly
5 runs0 likes0 downloads0 reach0 impact
4032 instances - 3 features - 2 classes - 0 missing values
Online advertisement clicking rates, where the metrics are cost-per-click (CPC) and cost per thousand impressions (CPM).
0 runs0 likes0 downloads0 reach0 impact
1624 instances - 3 features - classes - 0 missing values
Online advertisement clicking rates, where the metrics are cost-per-click (CPC) and cost per thousand impressions (CPM).
0 runs0 likes0 downloads0 reach0 impact
1538 instances - 3 features - classes - 0 missing values
MINI - Birds dataset
0 runs0 likes0 downloads0 reach0 impact
500 instances - 3 features - 20 classes - 0 missing values
MINI - Birds dataset
0 runs0 likes0 downloads0 reach0 impact
500 instances - 3 features - 20 classes - 0 missing values
## **Meta-Album PlantNet Dataset (Micro)** Meta-Album PlantNet dataset is created by sampling the Pl@ntNet-300k dataset (https://openreview.net/forum?id=eLYinD0TtIt), itself a sampling of the Pl@ntNet…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Textures-DTD Dataset (Micro)** The Textures DTD dataset(https://www.robots.ox.ac.uk/~vgg/data/dtd/index.html) is a large textures dataset which consists of 5 640 images. The data is…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Airplanes Dataset (Micro)** The original Airplanes dataset (https://zenodo.org/record/3464319) comprises more than 9 000 remote sensing images acquired from Google Earth satellite…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Textures Dataset (Micro)** The original Textures dataset is a combination of 4 texture datasets: KTH-TIPS and KTH-TIPS 2 (https://www.csc.kth.se/cvap/databases/kth-tips/index.html),…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Cars Dataset (Micro)** The original Cars dataset (https://ai.stanford.edu/~jkrause/cars/car_dataset.html) was collected in 2013, and it contains more than 16 000 images from 196…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album RESISC Dataset (Micro)** RESISC45 dataset(https://gcheng-nwpu.github.io/) gathers 700 RGB images of size 256x256 px for each of 45 scene categories. The data authors strive to provide…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Stanford 40 Actions Dataset (Micro)** The Stanford 40 Actions dataset(http://vision.stanford.edu/Datasets/40actions.html) contains images of humans performing 40 actions. There are 9…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Insects2 Dataset (Micro)** The pest insects dataset was originally created as a large scale benchmark dataset for Insect Pest Recognition (https://github.com/xpwu95/IP102). It contains…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album MPII Human Pose Dataset Dataset (Micro)** The MPII Human Pose dataset (http://human-pose.mpi-inf.mpg.de/#download) is a state of the art benchmark for evaluation of articulated human…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Fungi Dataset (Micro)** Meta-Album Fungi dataset is created by sampling the Danish Fungi 2020 dataset(https://arxiv.org/abs/2103.10107), itself a sampling of the Atlas of Danish Fungi…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 0 missing values
## **Meta-Album Plant Doc Dataset (Micro)** The PlantDoc dataset(https://github.com/pratikkayal/PlantDoc-Dataset) is made up of images of leaves of healthy and unhealthy plants. The images were…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Textures-ALOT Dataset (Micro)** Textures ALOT dataset (https://aloi.science.uva.nl/public_alot/) consists of 27 500 images from 250 categories. The images in the dataset are captured…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Animals with Attributes Dataset (Micro)** The original Animals with Attributes 2 (AWA) dataset (https://cvml.ist.ac.at/AwA2/) was designed to benchmark transfer-learning algorithms, in…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Insects Dataset (Micro)** The original Insects dataset is created by the National Museum of Natural History, Paris (https://www.mnhn.fr/fr). It has more than 290 000 images in…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 0 missing values
## **Meta-Album RSD Dataset (Micro)** RSD46 dataset (https://github.com/RSIA-LIESMARS-WHU/RSD46-WHU) is collected from Google Earth and Tianditu. The collection contains 46 scene categories, with a…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Subcellular Human Protein Dataset (Micro)** This dataset is a subset of the Subcellular dataset in the Protein Atlas project(https://www.proteinatlas.org/). The original dataset, which…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Boats Dataset (Micro)** The original version of the Meta-Album boats dataset is called MARVEL dataset (https://github.com/avaapm/marveldataset2016). It has more than 138 000 images of…
0 runs0 likes0 downloads0 reach1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album DIBaS Dataset (Mini)** The Digital Images of Bacteria Species dataset (DIBaS) (https://github.com/gallardorafael/DIBaS-Dataset) is a dataset of 33 bacterial species with around 20…
0 runs0 likes0 downloads0 reach1 impact
1320 instances - 3 features - 33 classes - 1320 missing values
This dataset consists of 6,642 question/answer pairs. The questions are supposed to be answerable by Freebase, a large knowledge graph. The questions are mostly centered around a single named entity.…
0 runs0 likes0 downloads0 reach0 impact
5810 instances - 3 features - classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag.
7 runs0 likes0 downloads0 reach0 impact
61 instances - 3 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
746 runs0 likes0 downloads0 reach0 impact
1024 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
572 runs0 likes0 downloads0 reach0 impact
100 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1252 runs0 likes0 downloads0 reach0 impact
130 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
963 runs0 likes0 downloads0 reach0 impact
380 instances - 3 features - 2 classes - 0 missing values
This database contains the HTML source of web pages plus the ratings of a single user on these web pages. The web pages are on four separate subjects (Bands- recording artists; Goats; Sheep; and…
0 runs0 likes0 downloads0 reach0 impact
70 instances - 3 features - 3 classes - 0 missing values
This database contains the HTML source of web pages plus the ratings of a single user on these web pages. The web pages are on four separate subjects (Bands- recording artists; Goats; Sheep; and…
0 runs0 likes0 downloads0 reach0 impact
61 instances - 3 features - 3 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
511 runs0 likes0 downloads0 reach0 impact
185 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
766 runs0 likes0 downloads0 reach0 impact
55 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
775 runs0 likes0 downloads0 reach0 impact
43 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1823 runs0 likes0 downloads0 reach0 impact
120 instances - 3 features - 2 classes - 0 missing values
Real-world data set about the perching behaviour of two species of lizards in the South Bimini island, from Shoener (1968). The lizards data set contains the following variables: Species (the species…
0 runs0 likes0 downloads0 reach0 impact
409 instances - 3 features - classes - 0 missing values
Do two sentences come from the same article? We randomly sampled sentences from across Wikipedia. Some sentences came from the same articles, others do not. Sentences from the Same Article These two…
0 runs0 likes0 downloads0 reach0 impact
129156 instances - 3 features - classes - 0 missing values
A copy of PLK_Mini dataset from Meta album Set0
0 runs0 likes0 downloads0 reach0 impact
3440 instances - 3 features - 86 classes - 3440 missing values
A brief description of your dataset.
0 runs0 likes0 downloads0 reach0 impact
3 instances - 3 features - 3 classes - 0 missing values
## **Meta-Album DIBaS Dataset (Extended)** The Digital Images of Bacteria Species dataset (DIBaS) (https://github.com/gallardorafael/DIBaS-Dataset) is a dataset of 33 bacterial species with around 20…
0 runs0 likes0 downloads0 reach1 impact
4060 instances - 3 features - 33 classes - 4060 missing values
This is the description of a test dataset
0 runs0 likes0 downloads0 reach0 impact
73503 instances - 3 features - 2 classes - 0 missing values
Big dataset of russian university reviews in russian
0 runs0 likes0 downloads0 reach0 impact
5 instances - 3 features - classes - 0 missing values
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
899 runs0 likes8 downloads8 reach14 impact
130 instances - 3 features - 5 classes - 0 missing values
test rats
0 runs0 likes0 downloads0 reach0 impact
100 instances - 3 features - classes - 0 missing values
Indoor scene recognition is a challenging open problem in high level vision. Most scene recognition models that work well for outdoor scenes perform poorly in the indoor domain. The main difficulty is…
0 runs0 likes0 downloads0 reach0 impact
15620 instances - 3 features - 67 classes - 0 missing values
Indoor scene recognition is a challenging open problem in high level vision. Most scene recognition models that work well for outdoor scenes perform poorly in the indoor domain. The main difficulty is…
1 runs0 likes0 downloads0 reach0 impact
15620 instances - 3 features - 67 classes - 0 missing values
Description: The dataset, named "clean_tweet_Dec19ToDec20.csv," comprises a collection of tweets post-processed for clarity and analysis, spanning from December 2019 to December 2020. It is designed…
0 runs0 likes0 downloads0 reach0 impact
134348 instances - 3 features - classes - 18 missing values
Description: The dataset, named "clean_tweet_Dec19ToDec20.csv," comprises a collection of tweets post-processed for clarity and analysis, spanning from December 2019 to December 2020. It is designed…
0 runs0 likes0 downloads0 reach0 impact
134348 instances - 3 features - classes - 18 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
Description: The 'heartrate_seconds_merged.csv' dataset is a comprehensive compilation of heart rate measurements collected over varying days and times, intended to provide insights into heart rate…
0 runs0 likes0 downloads0 reach0 impact
1154681 instances - 3 features - classes - 0 missing values
The dataset 'minuteStepsNarrow_merged.csv' provides detailed insights into step activity recorded for various users, identified by their unique IDs, over specific minutes of different days. The data…
0 runs0 likes0 downloads0 reach0 impact
1445040 instances - 3 features - classes - 0 missing values
car data
0 runs0 likes0 downloads0 reach0 impact
22 instances - 3 features - classes - 0 missing values
car data
0 runs0 likes0 downloads0 reach0 impact
22 instances - 3 features - classes - 0 missing values
car data
0 runs0 likes0 downloads0 reach0 impact
22 instances - 3 features - classes - 0 missing values
car data
0 runs0 likes0 downloads0 reach0 impact
22 instances - 3 features - classes - 0 missing values
CIF 2016 time series forecasting competition , monthly data. From original source: ----- Competition Data Format Data file containing time series to be predicted is a text file having the following…
0 runs0 likes0 downloads0 reach0 impact
7108 instances - 3 features - classes - 0 missing values
Dominick dataset forecasting data, weekly data. From original source: ----- This dataset contains 115704 weekly time series representing the profit of individual stock keeping units from a retailer.…
0 runs0 likes0 downloads0 reach0 impact
19092987 instances - 3 features - classes - 0 missing values
car data
0 runs0 likes0 downloads0 reach0 impact
22 instances - 3 features - classes - 0 missing values
Tiny ImageNet contains 100000 images of 200 classes (500 for each class) downsized to 64 x 64 colored images. Each class has 500 training images, 50 validation images, and 50 test images. The dataset…
0 runs0 likes0 downloads0 reach0 impact
100000 instances - 3 features - 200 classes - 0 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
2 runs0 likes0 downloads0 reach0 impact
108 instances - 4 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
450 instances - 4 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
475 instances - 4 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
475 instances - 4 features - 0 classes - 0 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
2 runs0 likes0 downloads0 reach0 impact
100 instances - 4 features - 0 classes - 0 missing values
File README ----------- smoothmeth A collection of the data sets used in the book "Smoothing Methods in Statistics," by Jeffrey S. Simonoff, Springer-Verlag, New York, 1996. Submitted by Jeff Simonoff…
3 runs0 likes0 downloads0 reach0 impact
2178 instances - 4 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach0 impact
365 instances - 4 features - 0 classes - 30 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
694 runs0 likes0 downloads0 reach0 impact
83 instances - 4 features - 2 classes - 0 missing values
87 persons with lupus nephritis. Followed up 15+ years. 35 deaths. Var = duration of disease. Over 40 baseline variables avaiable from authors. Description : For description of this data set arising…
737 runs0 likes0 downloads0 reach0 impact
87 instances - 4 features - 2 classes - 0 missing values
CODING: ITEM 1 = BUSINESS CONDIDIONS 6 MONTHS FROM NOW (CONFERENCE BOARD) ITEM 2 = JOBS 6 MONTHS FROM NOW (CONFERENCE BOARD) ITEM 3 = FAMILY INCOME 6 MONTHS FROM NOW (CONFERENCE BOARD) ITEM 4 =…
560 runs0 likes0 downloads0 reach0 impact
72 instances - 4 features - 6 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1030 runs0 likes0 downloads0 reach0 impact
132 instances - 4 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1116 runs0 likes0 downloads0 reach0 impact
120 instances - 4 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
132 instances - 4 features - 0 classes - 0 missing values