{ "data_id": "44237", "name": "Meta_Album_BCT_Micro", "exact_name": "Meta_Album_BCT_Micro", "version": 1, "version_label": null, "description": "## **Meta-Album DIBaS Dataset (Micro)**\n***\nThe Digital Images of Bacteria Species dataset (DIBaS) (https:\/\/github.com\/gallardorafael\/DIBaS-Dataset) is a dataset of 33 bacterial species with around 20 images for each species. For the Meta-Album, since the images were large (2 048x1 532) with very few samples in each class, we decided to split each image into several smaller images before resizing them to 128x128. We then obtained a preprocessed dataset of 4 060 images with at least 108 images for each class. This dataset was also preprocessed with blob normalization techniques, which is quite unusual for this type of image. The goal of this transformation was to reduce the importance of color in decision-making for a bias-aware challenge. \n\n\n\n### **Dataset Details**\n![](https:\/\/meta-album.github.io\/assets\/img\/samples\/BCT.png)\n\n**Meta Album ID**: MCR.BCT \n**Meta Album URL**: [https:\/\/meta-album.github.io\/datasets\/BCT.html](https:\/\/meta-album.github.io\/datasets\/BCT.html) \n**Domain ID**: MCR \n**Domain Name**: Microscopic \n**Dataset ID**: BCT \n**Dataset Name**: DIBaS \n**Short Description**: Digital Image of Bacterial Species (DIBaS) \n**\\# Classes**: 20 \n**\\# Images**: 800 \n**Keywords**: microscopic, bacteria \n**Data Format**: images \n**Image size**: 128x128 \n\n**License (original data release)**: Public for researchers \n**License URL(original data release)**: \n**License (Meta-Album data release)**: CC BY-NC 4.0 \n**License URL (Meta-Album data release)**: [https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/](https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/) \n\n**Source**: Digital Image of Bacterial Species (DIBaS) \n**Source URL**: http:\/\/misztal.edu.pl\/software\/databases\/dibas\/ \nhttps:\/\/journals.plos.org\/plosone\/article?id=10.1371\/journal.pone.0184554 \nhttps:\/\/github.com\/gallardorafael\/DIBaS-Dataset \n \n**Original Author**: Bartosz Zielinski, Anna Plichta, Krzysztof Misztal, Przemyslaw Spurek, Monika Brzychczy-Wloch, Dorota Ochonska \n**Original contact**: krzysztof.misztal@uj.edu.pl \n\n**Meta Album author**: Romain Mussard \n**Created Date**: 01 March 2022 \n**Contact Name**: Ihsan Ullah \n**Contact Email**: meta-album@chalearn.org \n**Contact URL**: [https:\/\/meta-album.github.io\/](https:\/\/meta-album.github.io\/) \n\n\n\n### **Cite this dataset**\n```\n@article{10.1371\/journal.pone.0184554,\n doi = {10.1371\/journal.pone.0184554},\n author = {Zielinski, Bartosz AND Plichta, Anna AND Misztal, Krzysztof AND Spurek, Przemyslaw AND Brzychczy-Wloch, Monika AND Ochonska, Dorota},\n journal = {PLOS ONE},\n publisher = {Public Library of Science},\n title = {Deep learning approach to bacterial colony classification},\n year = {2017},\n month = {09},\n volume = {12},\n url = {https:\/\/doi.org\/10.1371\/journal.pone.0184554},\n pages = {1-14},\n number = {9}\n}\n```\n\n\n### **Cite Meta-Album**\n```\n@inproceedings{meta-album-2022,\n title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},\n author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},\n booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},\n url = {https:\/\/meta-album.github.io\/},\n year = {2022}\n }\n```\n\n\n### **More**\nFor more information on the Meta-Album dataset, please see the [[NeurIPS 2022 paper]](https:\/\/meta-album.github.io\/paper\/Meta-Album.pdf) \nFor details on the dataset preprocessing, please see the [[supplementary materials]](https:\/\/openreview.net\/attachment?id=70_Wx-dON3q&name=supplementary_material) \nSupporting code can be found on our [[GitHub repo]](https:\/\/github.com\/ihsaan-ullah\/meta-album) \nMeta-Album on Papers with Code [[Meta-Album]](https:\/\/paperswithcode.com\/dataset\/meta-album) \n\n\n\n### **Other versions of this dataset**\n[[Mini]](https:\/\/www.openml.org\/d\/44281) [[Extended]](https:\/\/www.openml.org\/d\/44316) ", "format": "arff", "uploader": "Meta Album", "uploader_id": 30980, "visibility": "public", "creator": "\"Ihsan Ullah\"", "contributor": null, "date": "2022-10-11 17:05:08", "update_comment": null, "last_update": "2022-10-11 17:05:08", "licence": "CC BY-NC 4.0", "status": "active", "error_message": null, "url": "https:\/\/api.openml.org\/data\/download\/22109120\/dataset", "default_target_attribute": "CATEGORY", "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "Meta_Album_BCT_Micro", "## **Meta-Album DIBaS Dataset (Micro)** The Digital Images of Bacteria Species dataset (DIBaS) (https:\/\/github.com\/gallardorafael\/DIBaS-Dataset) is a dataset of 33 bacterial species with around 20 images for each species. For the Meta-Album, since the images were large (2 048x1 532) with very few samples in each class, we decided to split each image into several smaller images before resizing them to 128x128. We then obtained a preprocessed dataset of 4 060 images with at least 108 images for ea " ], "weight": 5 }, "qualities": { "NumberOfInstances": 800, "NumberOfFeatures": 3, "NumberOfClasses": 20, "NumberOfMissingValues": 800, "NumberOfInstancesWithMissingValues": 800, "NumberOfNumericFeatures": 1, "NumberOfSymbolicFeatures": 0, "PercentageOfBinaryFeatures": 0, "PercentageOfInstancesWithMissingValues": 100, "AutoCorrelation": 1, "PercentageOfMissingValues": 33.33333333333333, "Dimensionality": 0.00375, "PercentageOfNumericFeatures": 33.33333333333333, "MajorityClassPercentage": 5, "PercentageOfSymbolicFeatures": 0, "MajorityClassSize": 40, "MinorityClassPercentage": 5, "MinorityClassSize": 40, "NumberOfBinaryFeatures": 0 }, "tags": [ { "uploader": "38960", "tag": "Machine Learning" }, { "uploader": "38960", "tag": "Meteorology" } ], "features": [ { "name": "CATEGORY", "index": "1", "type": "string", "distinct": "20", "missing": "0", "target": "1" }, { "name": "FILE_NAME", "index": "0", "type": "string", "distinct": "800", "missing": "0" }, { "name": "SUPER_CATEGORY", "index": "2", "type": "numeric", "distinct": "0", "missing": "800", "min": "2147483647", "max": "0", "mean": "0", "stdev": "0" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 1, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 1 }