{ "data_id": "44241", "name": "Meta_Album_BRD_Micro", "exact_name": "Meta_Album_BRD_Micro", "version": 1, "version_label": null, "description": "## **Meta-Album Birds Dataset (Micro)**\n***\nWhen Meta-Album was created, the Birds dataset(https:\/\/www.kaggle.com\/datasets\/gpiosenka\/100-bird-species) contained images of 315 bird species, but now it has increased the number of species to 450. It has more than 49 000 images, each with a resolution of 224x224 px. All the images have their natural background, which can lead to bias since, for example, some birds are frequently found in water backgrounds. Additionally, the dataset is imbalanced regarding the ratio of male species images to female species images. The preprocessed version distributed in Meta-Album is made from the original dataset by resizing all the images to a resolution of 128x128 px using an anti-aliasing filter. \n\n\n\n### **Dataset Details**\n![](https:\/\/meta-album.github.io\/assets\/img\/samples\/BRD.png)\n\n**Meta Album ID**: LR_AM.BRD \n**Meta Album URL**: [https:\/\/meta-album.github.io\/datasets\/BRD.html](https:\/\/meta-album.github.io\/datasets\/BRD.html) \n**Domain ID**: LR_AM \n**Domain Name**: Large Animals \n**Dataset ID**: BRD \n**Dataset Name**: Birds \n**Short Description**: Birds dataset for image classification \n**\\# Classes**: 20 \n**\\# Images**: 800 \n**Keywords**: birds, animals \n**Data Format**: images \n**Image size**: 128x128 \n\n**License (original data release)**: CC0 Public Domain \n**License URL(original data release)**: https:\/\/www.kaggle.com\/gpiosenka\/100-bird-species\nhttps:\/\/creativecommons.org\/publicdomain\/zero\/1.0\/\n \n**License (Meta-Album data release)**: CC0 Public Domain \n**License URL (Meta-Album data release)**: [https:\/\/creativecommons.org\/publicdomain\/zero\/1.0\/](https:\/\/creativecommons.org\/publicdomain\/zero\/1.0\/) \n\n**Source**: BIRDS 400 - SPECIES IMAGE CLASSIFICATION \n**Source URL**: https:\/\/www.kaggle.com\/gpiosenka\/100-bird-species \n \n**Original Author**: Gerald Piosenka \n**Original contact**: https:\/\/www.kaggle.com\/gpiosenka\/contact \n\n**Meta Album author**: Dustin Carrion \n**Created Date**: 01 March 2022 \n**Contact Name**: Ihsan Ullah \n**Contact Email**: meta-album@chalearn.org \n**Contact URL**: [https:\/\/meta-album.github.io\/](https:\/\/meta-album.github.io\/) \n\n\n\n### **Cite this dataset**\n```\n@article{birds,\n title={BIRDS 400 - SPECIES IMAGE CLASSIFICATION},\n author={Gerald Piosenka},\n url={https:\/\/www.kaggle.com\/datasets\/gpiosenka\/100-bird-species},\n publisher= {Kaggle}\n}\n```\n\n\n### **Cite Meta-Album**\n```\n@inproceedings{meta-album-2022,\n title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},\n author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},\n booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},\n url = {https:\/\/meta-album.github.io\/},\n year = {2022}\n }\n```\n\n\n### **More**\nFor more information on the Meta-Album dataset, please see the [[NeurIPS 2022 paper]](https:\/\/meta-album.github.io\/paper\/Meta-Album.pdf) \nFor details on the dataset preprocessing, please see the [[supplementary materials]](https:\/\/openreview.net\/attachment?id=70_Wx-dON3q&name=supplementary_material) \nSupporting code can be found on our [[GitHub repo]](https:\/\/github.com\/ihsaan-ullah\/meta-album) \nMeta-Album on Papers with Code [[Meta-Album]](https:\/\/paperswithcode.com\/dataset\/meta-album) \n\n\n\n### **Other versions of this dataset**\n[[Mini]](https:\/\/www.openml.org\/d\/44285) [[Extended]](https:\/\/www.openml.org\/d\/44320) ", "format": "arff", "uploader": "Meta Album", "uploader_id": 30980, "visibility": "public", "creator": "\"Ihsan Ullah\"", "contributor": null, "date": "2022-10-11 17:05:26", "update_comment": null, "last_update": "2022-10-11 17:05:26", "licence": "CC BY-NC 4.0", "status": "active", "error_message": null, "url": "https:\/\/api.openml.org\/data\/download\/22109124\/dataset", "default_target_attribute": "CATEGORY", "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "Meta_Album_BRD_Micro", "## **Meta-Album Birds Dataset (Micro)** When Meta-Album was created, the Birds dataset(https:\/\/www.kaggle.com\/datasets\/gpiosenka\/100-bird-species) contained images of 315 bird species, but now it has increased the number of species to 450. It has more than 49 000 images, each with a resolution of 224x224 px. All the images have their natural background, which can lead to bias since, for example, some birds are frequently found in water backgrounds. Additionally, the dataset is imbalanced regardi " ], "weight": 5 }, "qualities": { "NumberOfInstances": 800, "NumberOfFeatures": 3, "NumberOfClasses": 20, "NumberOfMissingValues": 800, "NumberOfInstancesWithMissingValues": 800, "NumberOfNumericFeatures": 1, "NumberOfSymbolicFeatures": 0, "PercentageOfInstancesWithMissingValues": 100, "PercentageOfMissingValues": 33.33333333333333, "AutoCorrelation": 1, "PercentageOfNumericFeatures": 33.33333333333333, "Dimensionality": 0.00375, "PercentageOfSymbolicFeatures": 0, "MajorityClassPercentage": 5, "MajorityClassSize": 40, "MinorityClassPercentage": 5, "MinorityClassSize": 40, "NumberOfBinaryFeatures": 0, "PercentageOfBinaryFeatures": 0 }, "tags": [ { "uploader": "38960", "tag": "Economics" }, { "uploader": "38960", "tag": "Machine Learning" } ], "features": [ { "name": "CATEGORY", "index": "1", "type": "string", "distinct": "20", "missing": "0", "target": "1" }, { "name": "FILE_NAME", "index": "0", "type": "string", "distinct": "800", "missing": "0" }, { "name": "SUPER_CATEGORY", "index": "2", "type": "numeric", "distinct": "0", "missing": "800", "min": "2147483647", "max": "0", "mean": "0", "stdev": "0" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 1, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 1 }