Data
Meta_Album_MD_MIX_Mini

Meta_Album_MD_MIX_Mini

active ARFF CC BY-NC 4.0 Visibility: public Uploaded 28-10-2022 by Meta Album
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
## Meta-Album OmniPrint-MD-mix Dataset (Mini) * OmniPrint-MD-mix dataset consists of 28 240 images (128x128, RGB) from 706 categories. The images are synthesized with OmniPrint, and no further processing was done. The OmniPrint synthesis parameters are stated as follows: font size is 192, image size is 128, the strength of random perspective transformation is 0.04, left/right/top/bottom margins are all 20% of the image size, the strength of pre-rasterization elastic transformation is 0.035, random translation is activated both horizontally and vertically, rotation is within -60 and 60 degrees, horizontal shear is within -0.5 and 0.5, brightness is within 0.8333 and 1.2, contrast is within 0.8333 and 1.2, color enhancement is within 0.8333 and 1.2. The other parameters vary between images. We designed 20 settings, each setting is used to synthesize 2 images. All images/textures consists of photos taken by a personal mobile phone. ### Dataset Details ![](https://meta-album.github.io/assets/img/samples/MD_MIX.png) Meta Album ID: OCR.MD_MIX Meta Album URL: [https://meta-album.github.io/datasets/MD_MIX.html](https://meta-album.github.io/datasets/MD_MIX.html) Domain ID: OCR Domain Name: Optical Character Recognition Dataset ID: MD_MIX Dataset Name: OmniPrint-MD-mix Short Description: Character images with a specific set of nuisance parameters \# Classes: 706 \# Images: 28240 Keywords: ocr Data Format: images Image size: 128x128 License (original data release): CC BY 4.0 License URL(original data release): https://creativecommons.org/licenses/by/4.0/ License (Meta-Album data release): CC BY 4.0 License URL (Meta-Album data release): [https://creativecommons.org/licenses/by/4.0/](https://creativecommons.org/licenses/by/4.0/) Source: OmniPrint Source URL: https://github.com/SunHaozhe/OmniPrint Original Author: Haozhe Sun Original contact: sunhaozhe275940200@gmail.com Meta Album author: Haozhe Sun Created Date: 25 June 2021 Contact Name: Haozhe Sun Contact Email: meta-album@chalearn.org Contact URL: [https://meta-album.github.io/](https://meta-album.github.io/) ### Cite this dataset ``` @inproceedings{sun2021omniprint, title={OmniPrint: A Configurable Printed Character Synthesizer}, author={Haozhe Sun and Wei-Wei Tu and Isabelle M Guyon}, booktitle={Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 1)}, year={2021}, url={https://openreview.net/forum?id=R07XwJPmgpl} } ``` ### Cite Meta-Album ``` @inproceedings{meta-album-2022, title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification}, author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh}, booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track}, url = {https://meta-album.github.io/}, year = {2022} } ``` ### More For more information on the Meta-Album dataset, please see the [[NeurIPS 2022 paper]](https://meta-album.github.io/paper/Meta-Album.pdf) For details on the dataset preprocessing, please see the [[supplementary materials]](https://openreview.net/attachment?id=70_Wx-dON3q&name=supplementary_material) Supporting code can be found on our [[GitHub repo]](https://github.com/ihsaan-ullah/meta-album) Meta-Album on Papers with Code [[Meta-Album]](https://paperswithcode.com/dataset/meta-album) ### Other versions of this dataset [[Micro]](https://www.openml.org/d/44243)

69 features

CATEGORY (target)numeric706 unique values
0 missing
FILE_NAMEstring28240 unique values
0 missing
textstring706 unique values
0 missing
font_filestring845 unique values
0 missing
backgroundstring4 unique values
0 missing
background_colorstring763 unique values
14120 missing
brightnessnumeric26247 unique values
0 missing
color_enhancenumeric26247 unique values
0 missing
contrastnumeric26247 unique values
0 missing
font_sizenumeric1 unique values
0 missing
font_weightnumeric1 unique values
0 missing
foregroundstring2 unique values
0 missing
image_blending_methodstring2 unique values
0 missing
image_height_resolutionnumeric1 unique values
0 missing
image_modestring1 unique values
0 missing
image_width_resolutionnumeric1 unique values
0 missing
margin_bottomnumeric1 unique values
0 missing
margin_leftnumeric1 unique values
0 missing
margin_rightnumeric1 unique values
0 missing
margin_topnumeric1 unique values
0 missing
offset_horizontalnumeric251 unique values
0 missing
offset_verticalnumeric281 unique values
0 missing
original_image_height_resolutionnumeric309 unique values
0 missing
original_image_width_resolutionnumeric309 unique values
0 missing
perspective_paramsstring22224 unique values
0 missing
pre_elasticnumeric1 unique values
0 missing
rotationnumeric457 unique values
0 missing
shear_xnumeric412 unique values
4236 missing
stroke_fillstring679 unique values
14120 missing
SUPER_CATEGORYstring23 unique values
0 missing
family_namestring526 unique values
0 missing
style_namestring45 unique values
0 missing
postscript_namestring803 unique values
1 missing
variable_font_weightnominal2 unique values
0 missing
outlinestring2 unique values
15532 missing
outline_sizenumeric2 unique values
15532 missing
morph_gradient_kernel_shapestring1 unique values
19768 missing
morph_gradient_kernel_sizenumeric1 unique values
19768 missing
foreground_image_crop_xnumeric745 unique values
14120 missing
foreground_image_crop_x_plus_widthnumeric745 unique values
14120 missing
foreground_image_crop_ynumeric702 unique values
14120 missing
foreground_image_crop_y_plus_heightnumeric702 unique values
14120 missing
foreground_image_namestring26 unique values
14120 missing
foreground_image_original_heightnumeric7 unique values
14120 missing
foreground_image_original_widthnumeric5 unique values
14120 missing
foreground_image_resized_heightnumeric7 unique values
14120 missing
foreground_image_resized_widthnumeric5 unique values
14120 missing
outline_image_crop_xnumeric582 unique values
22592 missing
outline_image_crop_x_plus_widthnumeric582 unique values
22592 missing
outline_image_crop_ynumeric559 unique values
22592 missing
outline_image_crop_y_plus_heightnumeric559 unique values
22592 missing
outline_image_namestring26 unique values
22592 missing
outline_image_original_heightnumeric7 unique values
22592 missing
outline_image_original_widthnumeric5 unique values
22592 missing
outline_image_resized_heightnumeric7 unique values
22592 missing
outline_image_resized_widthnumeric5 unique values
22592 missing
background_polygon_fill_colorstring268 unique values
26828 missing
background_polygon_outline_colorstring337 unique values
26828 missing
background_random_color_composition_paramsstring1377 unique values
26828 missing
background_image_crop_xnumeric1006 unique values
14120 missing
background_image_crop_x_plus_widthnumeric1006 unique values
14120 missing
background_image_crop_ynumeric955 unique values
14120 missing
background_image_crop_y_plus_heightnumeric955 unique values
14120 missing
background_image_namestring26 unique values
14120 missing
background_image_original_heightnumeric7 unique values
14120 missing
background_image_original_widthnumeric5 unique values
14120 missing
background_image_resized_heightnumeric7 unique values
14120 missing
background_image_resized_widthnumeric5 unique values
14120 missing
shear_ynumeric202 unique values
24004 missing

19 properties

28240
Number of instances (rows) of the dataset.
69
Number of attributes (columns) of the dataset.
0
Number of distinct values of the target attribute (if it is nominal).
665053
Number of missing values in the dataset.
28240
Number of instances with at least one value missing.
46
Number of numeric attributes.
1
Number of nominal attributes.
100
Percentage of instances having missing values.
34.13
Percentage of missing values.
-3444.96
Average class difference between consecutive instances.
66.67
Percentage of numeric attributes.
0
Number of attributes divided by the number of instances.
1.45
Percentage of nominal attributes.
Percentage of instances belonging to the most frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
1
Number of binary attributes.
1.45
Percentage of binary attributes.

3 tasks

0 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: CATEGORY
0 runs - estimation_procedure: 10% Holdout set - evaluation_measure: predictive_accuracy - target_feature: CATEGORY
0 runs - estimation_procedure: 33% Holdout set - evaluation_measure: predictive_accuracy - target_feature: CATEGORY
Define a new task