{ "data_id": "44326", "name": "Meta_Album_INS_2_Extended", "exact_name": "Meta_Album_INS_2_Extended", "version": 1, "version_label": null, "description": "## **Meta-Album Insects2 Dataset (Extended)**\n***\nThe pest insects dataset was originally created as a large scale benchmark dataset for Insect Pest Recognition (https:\/\/github.com\/xpwu95\/IP102). It contains more than 75 000 images belongs to 102 categories. It also has a hierarchical taxonomy and the insect pests which mainly affect one specific agricultural product are grouped into the same upper-level category. The preprocessed version is made from the original dataset by cropping the images in perfect squares and then resizing them into the required images size of 128x128. \n\n\n\n### **Dataset Details**\n![](https:\/\/meta-album.github.io\/assets\/img\/samples\/INS_2.png)\n\n**Meta Album ID**: SM_AM.INS2 \n**Meta Album URL**: [https:\/\/meta-album.github.io\/datasets\/INS_2.html](https:\/\/meta-album.github.io\/datasets\/INS_2.html) \n**Domain ID**: SM_AM \n**Domain Name**: Small Aninamls \n**Dataset ID**: INS_2 \n**Dataset Name**: Insects2 \n**Short Description**: Insects dataset for Insect Pest Recognition \n**\\# Classes**: 102 \n**\\# Images**: 75222 \n**Keywords**: insects, ecology \n**Data Format**: images \n**Image size**: 128x128 \n\n**License (original data release)**: Free for academic usage, cite to use dataset \n**License URL(original data release)**: https:\/\/github.com\/xpwu95\/IP102\n \n**License (Meta-Album data release)**: CC BY-NC 4.0 \n**License URL (Meta-Album data release)**: [https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/](https:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/) \n\n**Source**: IP102: A Large-Scale Benchmark Dataset for Insect Pest Recognition \n**Source URL**: https:\/\/github.com\/xpwu95\/IP102 \n \n**Original Author**: Xiaoping Wu, Chi Zhan, Yukun Lai, Ming-Ming Cheng, Jufeng Yang \n**Original contact**: xpwu95@163.com \n\n**Meta Album author**: Ihsan Ullah \n**Created Date**: 01 March 2022 \n**Contact Name**: Ihsan Ullah \n**Contact Email**: meta-album@chalearn.org \n**Contact URL**: [https:\/\/meta-album.github.io\/](https:\/\/meta-album.github.io\/) \n\n\n\n### **Cite this dataset**\n```\n@inproceedings{Wu2019Insect,\n title={IP102: A Large-Scale Benchmark Dataset for Insect Pest Recognition},\n author={Xiaoping Wu and Chi Zhan and Yukun Lai and Ming-Ming Cheng and Jufeng Yang},\n booktitle={IEEE CVPR},\n pages={8787--8796},\n year={2019},\n}\n```\n\n\n### **Cite Meta-Album**\n```\n@inproceedings{meta-album-2022,\n title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},\n author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},\n booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},\n url = {https:\/\/meta-album.github.io\/},\n year = {2022}\n }\n```\n\n\n### **More**\nFor more information on the Meta-Album dataset, please see the [[NeurIPS 2022 paper]](https:\/\/meta-album.github.io\/paper\/Meta-Album.pdf) \nFor details on the dataset preprocessing, please see the [[supplementary materials]](https:\/\/openreview.net\/attachment?id=70_Wx-dON3q&name=supplementary_material) \nSupporting code can be found on our [[GitHub repo]](https:\/\/github.com\/ihsaan-ullah\/meta-album) \nMeta-Album on Papers with Code [[Meta-Album]](https:\/\/paperswithcode.com\/dataset\/meta-album) \n\n\n\n### **Other versions of this dataset**\n[[Micro]](https:\/\/www.openml.org\/d\/44248) [[Mini]](https:\/\/www.openml.org\/d\/44292) ", "format": "arff", "uploader": "Meta Album", "uploader_id": 30980, "visibility": "public", "creator": "\"Ihsan Ullah\"", "contributor": null, "date": "2022-11-08 18:27:40", "update_comment": null, "last_update": "2022-11-08 18:27:40", "licence": "CC BY-NC 4.0", "status": "active", "error_message": null, "url": "https:\/\/api.openml.org\/data\/download\/22111040\/dataset", "kaggle_url": null, "default_target_attribute": "CATEGORY", "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "Meta_Album_INS_2_Extended", "## **Meta-Album Insects2 Dataset (Extended)** The pest insects dataset was originally created as a large scale benchmark dataset for Insect Pest Recognition (https:\/\/github.com\/xpwu95\/IP102). It contains more than 75 000 images belongs to 102 categories. It also has a hierarchical taxonomy and the insect pests which mainly affect one specific agricultural product are grouped into the same upper-level category. The preprocessed version is made from the original dataset by cropping the images in p " ], "weight": 5 }, "qualities": { "NumberOfInstances": 75222, "NumberOfFeatures": 3, "NumberOfClasses": 102, "NumberOfMissingValues": 75222, "NumberOfInstancesWithMissingValues": 75222, "NumberOfNumericFeatures": 1, "NumberOfSymbolicFeatures": 0, "PercentageOfBinaryFeatures": 0, "PercentageOfInstancesWithMissingValues": 100, "PercentageOfMissingValues": 33.33333333333333, "AutoCorrelation": 1, "PercentageOfNumericFeatures": 33.33333333333333, "Dimensionality": 3.988194942968812e-5, "PercentageOfSymbolicFeatures": 0, "MajorityClassPercentage": 7.630746324213661, "MajorityClassSize": 5740, "MinorityClassPercentage": 0.09438728031692856, "MinorityClassSize": 71, "NumberOfBinaryFeatures": 0 }, "tags": [], "features": [ { "name": "CATEGORY", "index": "1", "type": "string", "distinct": "102", "missing": "0", "target": "1" }, { "name": "FILE_NAME", "index": "0", "type": "string", "distinct": "75222", "missing": "0" }, { "name": "SUPER_CATEGORY", "index": "2", "type": "numeric", "distinct": "0", "missing": "75222", "min": "2147483647", "max": "0", "mean": "0", "stdev": "0" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 0, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 0 }