{ "data_id": "44310", "name": "Meta_Album_MD_6_Mini", "exact_name": "Meta_Album_MD_6_Mini", "version": 1, "version_label": null, "description": "## **Meta-Album OmniPrint-MD-6 Dataset (Mini)**\n***\nOmniPrint-MD-6 dataset consists of 28 120 images (128x128, RGB) from 703 categories. The images are synthesized with OmniPrint, no further processing was done. The OmniPrint synthesis parameters are stated as follows: font size is 192, image size is 128, the strength of random perspective transformation is 0.04, left\/right\/top\/bottom margins are all 20% of the image size, the strength of pre-rasterization elastic transformation is 0.035, random translation is activated both horizontally and vertically, image blending method is Poisson Image Editing, rotation is within -60 and 60 degrees, horizontal shear is within -0.5 and 0.5, both foreground and background are images taken from a personal mobile phone. \n\n\n\n### **Dataset Details**\n![](https:\/\/meta-album.github.io\/assets\/img\/samples\/MD_6.png)\n\n**Meta Album ID**: OCR.MD_6 \n**Meta Album URL**: [https:\/\/meta-album.github.io\/datasets\/MD_6.html](https:\/\/meta-album.github.io\/datasets\/MD_6.html) \n**Domain ID**: OCR \n**Domain Name**: Optical Character Recognition \n**Dataset ID**: MD_6 \n**Dataset Name**: OmniPrint-MD-6 \n**Short Description**: Character images with a specific set of nuisance parameters \n**\\# Classes**: 703 \n**\\# Images**: 28120 \n**Keywords**: ocr \n**Data Format**: images \n**Image size**: 128x128 \n\n**License (original data release)**: CC BY 4.0 \n**License URL(original data release)**: https:\/\/creativecommons.org\/licenses\/by\/4.0\/\n \n**License (Meta-Album data release)**: CC BY 4.0 \n**License URL (Meta-Album data release)**: [https:\/\/creativecommons.org\/licenses\/by\/4.0\/](https:\/\/creativecommons.org\/licenses\/by\/4.0\/) \n\n**Source**: OmniPrint \n**Source URL**: https:\/\/github.com\/SunHaozhe\/OmniPrint \n \n**Original Author**: Haozhe Sun \n**Original contact**: sunhaozhe275940200@gmail.com \n\n**Meta Album author**: Haozhe Sun \n**Created Date**: 25 June 2021 \n**Contact Name**: Haozhe Sun \n**Contact Email**: meta-album@chalearn.org \n**Contact URL**: [https:\/\/meta-album.github.io\/](https:\/\/meta-album.github.io\/) \n\n\n\n### **Cite this dataset**\n```\n@inproceedings{sun2021omniprint,\n title={OmniPrint: A Configurable Printed Character Synthesizer},\n author={Haozhe Sun and Wei-Wei Tu and Isabelle M Guyon},\n booktitle={Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 1)},\n year={2021},\n url={https:\/\/openreview.net\/forum?id=R07XwJPmgpl}\n}\n```\n\n\n### **Cite Meta-Album**\n```\n@inproceedings{meta-album-2022,\n title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},\n author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},\n booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},\n url = {https:\/\/meta-album.github.io\/},\n year = {2022}\n }\n```\n\n\n### **More**\nFor more information on the Meta-Album dataset, please see the [[NeurIPS 2022 paper]](https:\/\/meta-album.github.io\/paper\/Meta-Album.pdf) \nFor details on the dataset preprocessing, please see the [[supplementary materials]](https:\/\/openreview.net\/attachment?id=70_Wx-dON3q&name=supplementary_material) \nSupporting code can be found on our [[GitHub repo]](https:\/\/github.com\/ihsaan-ullah\/meta-album) \nMeta-Album on Papers with Code [[Meta-Album]](https:\/\/paperswithcode.com\/dataset\/meta-album) \n\n\n\n### **Other versions of this dataset**\n[[Micro]](https:\/\/www.openml.org\/d\/44280) ", "format": "arff", "uploader": "Meta Album", "uploader_id": 30980, "visibility": "public", "creator": "\"Ihsan Ullah\"", "contributor": null, "date": "2022-10-28 16:28:54", "update_comment": null, "last_update": "2022-10-28 16:28:54", "licence": "CC BY-NC 4.0", "status": "active", "error_message": null, "url": "https:\/\/api.openml.org\/data\/download\/22111010\/dataset", "default_target_attribute": "CATEGORY", "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "Meta_Album_MD_6_Mini", "## **Meta-Album OmniPrint-MD-6 Dataset (Mini)** OmniPrint-MD-6 dataset consists of 28 120 images (128x128, RGB) from 703 categories. The images are synthesized with OmniPrint, no further processing was done. The OmniPrint synthesis parameters are stated as follows: font size is 192, image size is 128, the strength of random perspective transformation is 0.04, left\/right\/top\/bottom margins are all 20% of the image size, the strength of pre-rasterization elastic transformation is 0.035, random tra " ], "weight": 5 }, "qualities": { "NumberOfInstances": 28120, "NumberOfFeatures": 47, "NumberOfClasses": 0, "NumberOfMissingValues": 56, "NumberOfInstancesWithMissingValues": 56, "NumberOfNumericFeatures": 32, "NumberOfSymbolicFeatures": 1, "PercentageOfBinaryFeatures": 2.127659574468085, "PercentageOfInstancesWithMissingValues": 0.1991465149359886, "AutoCorrelation": -3260.1489028770584, "PercentageOfMissingValues": 0.004237159892255077, "Dimensionality": 0.0016714082503556187, "PercentageOfNumericFeatures": 68.08510638297872, "MajorityClassPercentage": null, "PercentageOfSymbolicFeatures": 2.127659574468085, "MajorityClassSize": null, "MinorityClassPercentage": null, "MinorityClassSize": null, "NumberOfBinaryFeatures": 1 }, "tags": [ { "uploader": "38960", "tag": "Chemistry" }, { "uploader": "38960", "tag": "Computer Systems" } ], "features": [ { "name": "CATEGORY", "index": "2", "type": "numeric", "distinct": "703", "missing": "0", "target": "1", "min": "33", "max": "12538", "mean": "3994", "stdev": "3343" }, { "name": "FILE_NAME", "index": "0", "type": "string", "distinct": "28120", "missing": "0" }, { "name": "text", "index": "1", "type": "string", "distinct": "702", "missing": "40" }, { "name": "font_file", "index": "3", "type": "string", "distinct": "858", "missing": "0" }, { "name": "background", "index": "4", "type": "string", "distinct": "1", "missing": "0" }, { "name": "background_image_crop_x", "index": "5", "type": "numeric", "distinct": "2577", "missing": "0", "min": "2", "max": "3840", "mean": "1657", "stdev": "1030" }, { "name": "background_image_crop_x_plus_width", "index": "6", "type": "numeric", "distinct": "2577", "missing": "0", "min": "130", "max": "3968", "mean": "1785", "stdev": "1030" }, { "name": "background_image_crop_y", "index": "7", "type": "numeric", "distinct": "2469", "missing": "0", "min": "0", "max": "3839", "mean": "1492", "stdev": "967" }, { "name": "background_image_crop_y_plus_height", "index": "8", "type": "numeric", "distinct": "2469", "missing": "0", "min": "128", "max": "3967", "mean": "1620", "stdev": "967" }, { "name": "background_image_name", "index": "9", "type": "string", "distinct": "26", "missing": "0" }, { "name": "background_image_original_height", "index": "10", "type": "numeric", "distinct": "7", "missing": "0", "min": "1079", "max": "3968", "mean": "3144", "stdev": "738" }, { "name": "background_image_original_width", "index": "11", "type": "numeric", "distinct": "5", "missing": "0", "min": "1633", "max": "3968", "mean": "3480", "stdev": "655" }, { "name": "background_image_resized_height", "index": "12", "type": "numeric", "distinct": "7", "missing": "0", "min": "1079", "max": "3968", "mean": "3144", "stdev": "738" }, { "name": "background_image_resized_width", "index": "13", "type": "numeric", "distinct": "5", "missing": "0", "min": "1633", "max": "3968", "mean": "3480", "stdev": "655" }, { "name": "font_size", "index": "14", "type": "numeric", "distinct": "1", "missing": "0", "min": "192", "max": "192", "mean": "192", "stdev": "0" }, { "name": "font_weight", "index": "15", "type": "numeric", "distinct": "1", "missing": "0", "min": "400", "max": "400", "mean": "400", "stdev": "0" }, { "name": "foreground", "index": "16", "type": "string", "distinct": "1", "missing": "0" }, { "name": "foreground_image_crop_x", "index": "17", "type": "numeric", "distinct": "2636", "missing": "0", "min": "0", "max": "3839", "mean": "1665", "stdev": "1040" }, { "name": "foreground_image_crop_x_plus_width", "index": "18", "type": "numeric", "distinct": "2636", "missing": "0", "min": "128", "max": "3967", "mean": "1793", "stdev": "1040" }, { "name": "foreground_image_crop_y", "index": "19", "type": "numeric", "distinct": "2541", "missing": "0", "min": "0", "max": "3839", "mean": "1517", "stdev": "967" }, { "name": "foreground_image_crop_y_plus_height", "index": "20", "type": "numeric", "distinct": "2541", "missing": "0", "min": "128", "max": "3967", "mean": "1645", "stdev": "967" }, { "name": "foreground_image_name", "index": "21", "type": "string", "distinct": "26", "missing": "0" }, { "name": "foreground_image_original_height", "index": "22", "type": "numeric", "distinct": "7", "missing": "0", "min": "1079", "max": "3968", "mean": "3169", "stdev": "725" }, { "name": "foreground_image_original_width", "index": "23", "type": "numeric", "distinct": "5", "missing": "0", "min": "1633", "max": "3968", "mean": "3486", "stdev": "640" }, { "name": "foreground_image_resized_height", "index": "24", "type": "numeric", "distinct": "7", "missing": "0", "min": "1079", "max": "3968", "mean": "3169", "stdev": "725" }, { "name": "foreground_image_resized_width", "index": "25", "type": "numeric", "distinct": "5", "missing": "0", "min": "1633", "max": "3968", "mean": "3486", "stdev": "640" }, { "name": "image_blending_method", "index": "26", "type": "string", "distinct": "1", "missing": "0" }, { "name": "image_height_resolution", "index": "27", "type": "numeric", "distinct": "1", "missing": "0", "min": "128", "max": "128", "mean": "128", "stdev": "0" }, { "name": "image_mode", "index": "28", "type": "string", "distinct": "1", "missing": "0" }, { "name": "image_width_resolution", "index": "29", "type": "numeric", "distinct": "1", "missing": "0", "min": "128", "max": "128", "mean": "128", "stdev": "0" }, { "name": "margin_bottom", "index": "30", "type": "numeric", "distinct": "1", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "margin_left", "index": "31", "type": "numeric", "distinct": "1", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "margin_right", "index": "32", "type": "numeric", "distinct": "1", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "margin_top", "index": "33", "type": "numeric", "distinct": "1", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "offset_horizontal", "index": "34", "type": "numeric", "distinct": "301", "missing": "0", "min": "0", "max": "479", "mean": "72", "stdev": "50" }, { "name": "offset_vertical", "index": "35", "type": "numeric", "distinct": "317", "missing": "0", "min": "0", "max": "552", "mean": "72", "stdev": "51" }, { "name": "original_image_height_resolution", "index": "36", "type": "numeric", "distinct": "363", "missing": "0", "min": "69", "max": "972", "mean": "303", "stdev": "71" }, { "name": "original_image_width_resolution", "index": "37", "type": "numeric", "distinct": "363", "missing": "0", "min": "69", "max": "972", "mean": "303", "stdev": "71" }, { "name": "perspective_params", "index": "38", "type": "string", "distinct": "26552", "missing": "0" }, { "name": "pre_elastic", "index": "39", "type": "numeric", "distinct": "1", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "rotation", "index": "40", "type": "numeric", "distinct": "3120", "missing": "0", "min": "-60", "max": "60", "mean": "-1", "stdev": "35" }, { "name": "shear_x", "index": "41", "type": "numeric", "distinct": "3120", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "SUPER_CATEGORY", "index": "42", "type": "string", "distinct": "31", "missing": "0" }, { "name": "family_name", "index": "43", "type": "string", "distinct": "520", "missing": "0" }, { "name": "style_name", "index": "44", "type": "string", "distinct": "44", "missing": "0" }, { "name": "postscript_name", "index": "45", "type": "string", "distinct": "815", "missing": "16" }, { "name": "variable_font_weight", "index": "46", "type": "nominal", "distinct": "2", "missing": "0", "distr": [] } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 1, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 1 }