{ "data_id": "44335", "name": "Meta_Album_FNG_Extended", "exact_name": "Meta_Album_FNG_Extended", "version": 1, "version_label": null, "description": "## **Meta-Album Fungi Dataset (Extended)**\n***\nThe Meta-Album Fungi dataset is created by sampling the Danish Fungi 2020 dataset (https:\/\/arxiv.org\/abs\/2103.10107), itself a sample of the Atlas of Danish Fungi repository. The images and labels that enter this database are sourced from a group of 3 300 citizen botanists, then verified by their peers using a ranking of each person's reliability, and finally verified by experts working at the Atlas. Of the 128 classes in the original Danish Fungi 2020 dataset, FNG retains the 25 most populous classes, belonging to six genera, for a total of 15 122 images, with a minimum of 372 and a maximum of 1 221 images per class. Each sample is a 128x128 color image of a fungus, or part of a fungus, from the corresponding class. Because the initial images varied widely in size, we cropped a significant portion of them by taking the largest possible square centered on the middle of the original image. We then scaled each square image to the 128x128 standard using the INTER_AREA anti-aliasing filter from OpenCV. 
\n\n\n\n### **Dataset Details**\n![](https:\/\/meta-album.github.io\/assets\/img\/samples\/FNG.png)\n\n**Meta Album ID**: PLT.FNG \n**Meta Album URL**: [https:\/\/meta-album.github.io\/datasets\/FNG.html](https:\/\/meta-album.github.io\/datasets\/FNG.html) \n**Domain ID**: PLT \n**Domain Name**: Plants \n**Dataset ID**: FNG \n**Dataset Name**: Fungi \n**Short Description**: Fungi dataset from Denmark \n**\\# Classes**: 25 \n**\\# Images**: 15122 \n**Keywords**: fungi, ecology, plants \n**Data Format**: images \n**Image size**: 128x128 \n\n**License (original data release)**: BSD-3-Clause License \n**License URL(original data release)**: https:\/\/github.com\/picekl\/DanishFungiDataset\/blob\/main\/LICENSE\n \n**License (Meta-Album data release)**: BSD-3-Clause License \n**License URL (Meta-Album data release)**: [https:\/\/github.com\/picekl\/DanishFungiDataset\/blob\/main\/LICENSE](https:\/\/github.com\/picekl\/DanishFungiDataset\/blob\/main\/LICENSE) \n\n**Source**: Danish Fungi Dataset \n**Source URL**: https:\/\/sites.google.com\/view\/danish-fungi-dataset \n \n**Original Author**: Lukas Picek, Milan Sulc, Jiri Matas, Jacob Heilmann-Clausen, Thomas S. Jeppesen, Thomas Laessoe, Tobias Froslev \n**Original contact**: lukaspicek@gmail.com \n\n**Meta Album author**: Felix Herron \n**Created Date**: 01 March 2022 \n**Contact Name**: Ihsan Ullah \n**Contact Email**: meta-album@chalearn.org \n**Contact URL**: [https:\/\/meta-album.github.io\/](https:\/\/meta-album.github.io\/) \n\n\n\n### **Cite this dataset**\n```\n@article{picek2021danish,\n title={Danish Fungi 2020 - Not Just Another Image Recognition Dataset},\n author={Lukas Picek and Milan Sulc and Jiri Matas and Jacob Heilmann-Clausen and Thomas S. 
Jeppesen and Thomas Laessoe and Tobias Froslev},\n year={2021},\n eprint={2103.10107},\n archivePrefix={arXiv},\n primaryClass={cs.CV}\n}\n```\n\n\n### **Cite Meta-Album**\n```\n@inproceedings{meta-album-2022,\n title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},\n author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},\n booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},\n url = {https:\/\/meta-album.github.io\/},\n year = {2022}\n }\n```\n\n\n### **More**\nFor more information on the Meta-Album dataset, please see the [[NeurIPS 2022 paper]](https:\/\/meta-album.github.io\/paper\/Meta-Album.pdf) \nFor details on the dataset preprocessing, please see the [[supplementary materials]](https:\/\/openreview.net\/attachment?id=70_Wx-dON3q&name=supplementary_material) \nSupporting code can be found on our [[GitHub repo]](https:\/\/github.com\/ihsaan-ullah\/meta-album) \nMeta-Album on Papers with Code [[Meta-Album]](https:\/\/paperswithcode.com\/dataset\/meta-album) \n\n\n\n### **Other versions of this dataset**\n[[Micro]](https:\/\/www.openml.org\/d\/44272) [[Mini]](https:\/\/www.openml.org\/d\/44302) ", "format": "arff", "uploader": "Meta Album", "uploader_id": 30980, "visibility": "public", "creator": "\"Ihsan Ullah\"", "contributor": null, "date": "2022-11-08 18:59:45", "update_comment": null, "last_update": "2022-11-08 18:59:45", "licence": "CC BY-NC 4.0", "status": "active", "error_message": null, "url": "https:\/\/api.openml.org\/data\/download\/22111049\/dataset", "default_target_attribute": "CATEGORY", "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "Meta_Album_FNG_Extended", "## **Meta-Album Fungi Dataset (Extended)** Meta-Album Fungi dataset is created by sampling the Danish Fungi 2020 
dataset (https:\/\/arxiv.org\/abs\/2103.10107), itself a sample of the Atlas of Danish Fungi repository. The images and labels that enter this database are sourced from a group of 3 300 citizen botanists, then verified by their peers using a ranking of each person's reliability, and finally verified by experts working at the Atlas. Of the 128 classes in the original Danish Fungi " ], "weight": 5 }, "qualities": { "NumberOfInstances": 15122, "NumberOfFeatures": 3, "NumberOfClasses": 25, "NumberOfMissingValues": 0, "NumberOfInstancesWithMissingValues": 0, "NumberOfNumericFeatures": 0, "NumberOfSymbolicFeatures": 0, "NumberOfBinaryFeatures": 0, "PercentageOfBinaryFeatures": 0, "PercentageOfInstancesWithMissingValues": 0, "AutoCorrelation": 1, "PercentageOfMissingValues": 0, "Dimensionality": 0.00019838645681788123, "PercentageOfNumericFeatures": 0, "MajorityClassPercentage": 8.074328792487766, "PercentageOfSymbolicFeatures": 0, "MajorityClassSize": 1221, "MinorityClassPercentage": 2.4599920645417273, "MinorityClassSize": 372 }, "tags": [ { "uploader": "38960", "tag": "Health" } ], "features": [ { "name": "CATEGORY", "index": "1", "type": "string", "distinct": "25", "missing": "0", "target": "1" }, { "name": "FILE_NAME", "index": "0", "type": "string", "distinct": "15122", "missing": "0" }, { "name": "SUPER_CATEGORY", "index": "2", "type": "string", "distinct": "6", "missing": "0" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 1, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 1 }