{ "data_id": "44238", "name": "Meta_Album_PLK_Micro", "exact_name": "Meta_Album_PLK_Micro", "version": 1, "version_label": null, "description": "## **Meta-Album Plankton Dataset (Micro)**\n***\nThe Plankton dataset was created by researchers at the Woods Hole Oceanographic Institution (https:\/\/www.whoi.edu\/). The Imaging FlowCytobot (IFCB) was used for data collection. The complete process and mechanism are described in the paper [31]. Each image in the dataset contains one or more plankton organisms. The images are captured in a controlled environment and have different orientations depending on the flow of the fluid in which they are captured and on the size and shape of the plankton. The preprocessed plankton dataset is prepared from the original WHOI Plankton dataset. Preprocessing first creates a square background image by duplicating either the top-most and bottom-most 3 rows or the left-most and right-most 3 columns, depending on the orientation of the original image, to match the width or height of the image, respectively. A Gaussian kernel of size 29x29 is then applied to blur the background image. Finally, the original plankton image is pasted at the center of the background image, and the combined square image is resized to 128x128 with anti-aliasing. 
\n\n\n\n### **Dataset Details**\n![](https:\/\/meta-album.github.io\/assets\/img\/samples\/PLK.png)\n\n**Meta Album ID**: SM_AM.PLK \n**Meta Album URL**: [https:\/\/meta-album.github.io\/datasets\/PLK.html](https:\/\/meta-album.github.io\/datasets\/PLK.html) \n**Domain ID**: SM_AM \n**Domain Name**: Small Animals \n**Dataset ID**: PLK \n**Dataset Name**: Plankton \n**Short Description**: Plankton dataset from WHOI \n**\\# Classes**: 20 \n**\\# Images**: 800 \n**Keywords**: plankton, ecology \n**Data Format**: images \n**Image size**: 128x128 \n\n**License (original data release)**: MIT License \n**License URL(original data release)**: https:\/\/github.com\/hsosik\/WHOI-Plankton\/blob\/master\/LICENSE\n \n**License (Meta-Album data release)**: MIT License \n**License URL (Meta-Album data release)**: [https:\/\/github.com\/hsosik\/WHOI-Plankton\/blob\/master\/LICENSE](https:\/\/github.com\/hsosik\/WHOI-Plankton\/blob\/master\/LICENSE) \n\n**Source**: Woods Hole Oceanographic Institution \n**Source URL**: https:\/\/github.com\/hsosik\/WHOI-Plankton \n \n**Original Author**: Heidi M. Sosik, Emily E. Peacock, Emily F. Brownlee, Eric Orenstein \n**Original contact**: hsosik@whoi.edu \n\n**Meta Album author**: Ihsan Ullah \n**Created Date**: 01 March 2022 \n**Contact Name**: Ihsan Ullah \n**Contact Email**: meta-album@chalearn.org \n**Contact URL**: [https:\/\/meta-album.github.io\/](https:\/\/meta-album.github.io\/) \n\n\n\n### **Cite this dataset**\n```\n@misc{whoiplankton,\n title={Annotated Plankton Images - Data Set for Developing and Evaluating Classification Methods.}, \n author={Heidi M. Sosik, Emily E. Peacock, Emily F. 
Brownlee},\n year={2015},\n DOI = {10.1575\/1912\/7341},\n url={https:\/\/hdl.handle.net\/10.1575\/1912\/7341}\n}\n```\n\n\n### **Cite Meta-Album**\n```\n@inproceedings{meta-album-2022,\n title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},\n author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},\n booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},\n url = {https:\/\/meta-album.github.io\/},\n year = {2022}\n }\n```\n\n\n### **More**\nFor more information on the Meta-Album dataset, please see the [[NeurIPS 2022 paper]](https:\/\/meta-album.github.io\/paper\/Meta-Album.pdf) \nFor details on the dataset preprocessing, please see the [[supplementary materials]](https:\/\/openreview.net\/attachment?id=70_Wx-dON3q&name=supplementary_material) \nSupporting code can be found on our [[GitHub repo]](https:\/\/github.com\/ihsaan-ullah\/meta-album) \nMeta-Album on Papers with Code [[Meta-Album]](https:\/\/paperswithcode.com\/dataset\/meta-album) \n\n\n\n### **Other versions of this dataset**\n[[Mini]](https:\/\/www.openml.org\/d\/44282) [[Extended]](https:\/\/www.openml.org\/d\/44317) ", "format": "arff", "uploader": "Meta Album", "uploader_id": 30980, "visibility": "public", "creator": "\"Ihsan Ullah\"", "contributor": null, "date": "2022-10-11 17:05:13", "update_comment": null, "last_update": "2022-10-11 17:05:13", "licence": "CC BY-NC 4.0", "status": "active", "error_message": null, "url": "https:\/\/api.openml.org\/data\/download\/22109121\/dataset", "default_target_attribute": "CATEGORY", "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "Meta_Album_PLK_Micro", "## **Meta-Album Plankton Dataset (Micro)** The Plankton dataset is created by researchers at the Woods Hole Oceanographic Institution 
(https:\/\/www.whoi.edu\/). Imaging FlowCytobot (IFCB) was used for the data collection. The Complete process and mechanism are described in the paper [31]. Each image in the dataset contains one or multiple planktons. The images are captured in a controlled environment and have different orientations based on the flow of the fluid in which the images are captured an " ], "weight": 5 }, "qualities": { "NumberOfInstances": 800, "NumberOfFeatures": 3, "NumberOfClasses": 20, "NumberOfMissingValues": 800, "NumberOfInstancesWithMissingValues": 800, "NumberOfNumericFeatures": 1, "NumberOfSymbolicFeatures": 0, "PercentageOfBinaryFeatures": 0, "PercentageOfInstancesWithMissingValues": 100, "AutoCorrelation": 1, "PercentageOfMissingValues": 33.33333333333333, "Dimensionality": 0.00375, "PercentageOfNumericFeatures": 33.33333333333333, "MajorityClassPercentage": 5, "PercentageOfSymbolicFeatures": 0, "MajorityClassSize": 40, "MinorityClassPercentage": 5, "MinorityClassSize": 40, "NumberOfBinaryFeatures": 0 }, "tags": [ { "uploader": "38960", "tag": "Machine Learning" }, { "uploader": "38960", "tag": "Meteorology" } ], "features": [ { "name": "CATEGORY", "index": "1", "type": "string", "distinct": "20", "missing": "0", "target": "1" }, { "name": "FILE_NAME", "index": "0", "type": "string", "distinct": "800", "missing": "0" }, { "name": "SUPER_CATEGORY", "index": "2", "type": "numeric", "distinct": "0", "missing": "800", "min": "2147483647", "max": "0", "mean": "0", "stdev": "0" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 1, "nr_of_downloads": 1, "total_downloads": 1, "reach": 2, "reuse": 1, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 1 }