{
    "data_id": "58",
    "name": "vowel",
    "exact_name": "vowel",
    "version": 1,
    "version_label": "1",
    "description": "**Author**:   \n**Source**: Unknown -   \n**Please cite**:   \n\n*** Please use version 2 of this dataset. This version has a train\/test feature that should be ignored in OpenML.\n\nIntroduction\n                 ============\n \n In my work on context-sensitive learning, I used the \"Deterding Vowel\n Recognition Data\", but I found it necessary to reformulate the data.\n Implicit in the original data is contextual information on the\n speaker's gender and identity. For my work, it was necessary to make\n this information explicit. The file \"vowel-context.data\" adds the\n speaker's sex and identity as new features. The format of the data file\n is described below.\n \n \n Peter Turney\n peter@ai.iit.nrc.ca\n \n \n \n                 References\n                 ==========\n \n P. Turney. \"Robust Classification With Context-Sensitive Features.\"\n Proceedings of the Sixth International Conference on Industrial\n and Engineering Applications of Artificial Intelligence and Expert\n Systems (IEA\/AIE-93): 268-276. 1993.\n \n URL: ftp:\/\/ai.iit.nrc.ca\/pub\/ksl-papers\/NRC-35074.ps.Z\n \n \n P. Turney. \"Exploiting Context When Learning to Classify.\"\n Proceedings of the European Conference on Machine Learning\n (ECML-93): 402-407. 1993.\n \n URL: ftp:\/\/ai.iit.nrc.ca\/pub\/ksl-papers\/NRC-35058.ps.Z\n \n \n \n                 File Structure\n                 ==============\n \n \n         Column          Description\n         -------------------------------\n         0               Train or Test\n         1               Speaker Number\n         2               Sex\n         3               Feature 0\n         4               Feature 1\n         5               Feature 2\n         6               Feature 3\n         7               Feature 4\n         8               Feature 5\n         9               Feature 6\n         10              Feature 7\n         11              Feature 8\n         12              Feature 9\n         13              Class\n \n \n \n \n                 Numerical Codes\n                 ===============\n \n \n         Speaker         Code Number\n         ---------------------------\n         Andrew          0\n         Bill            1\n         David           2\n         Mark            3\n         Jo              4\n         Kate            5\n         Penny           6\n         Rose            7\n         Mike            8\n         Nick            9\n         Rich            10\n         Tim             11\n         Sarah           12\n         Sue             13\n         Wendy           14\n \n \n \n         Set             Number\n         ---------------------------\n         Train           0\n         Test            1\n \n \n \n         Sex             Number\n         ---------------------------\n         Male            0\n         Female          1\n \n \n \n         Class           Number\n         ---------------------------\n         hid             0\n         hId             1\n         hEd             2\n         hAd             3\n         hYd             4\n         had             5\n         hOd             6\n         hod             7\n         hUd             8\n         hud             9\n         hed             10\n \n \n \n \n \n         Speaker         Code Number     Sex             Train\/Test\n         ---------------------------------------------------------------\n         Andrew          0               0               0\n         Bill            1               0               0\n         David           2               0               0\n         Mark            3               0               0\n         Jo              4               1               0\n         Kate            5               1               0\n         Penny           6               1               0\n         Rose            7               1               0\n         Mike            8               0               1\n         Nick            9               0               1\n         Rich            10              0               1\n         Tim             11              0               1\n         Sarah           12              1               1\n         Sue             13              1               1\n         Wendy           14              1               1\n \n \n Num Instances:  990\n Num Attributes: 14\n Num missing:    0 \/  0.0%\n\n     name                      type enum ints real     missing    distinct  (1)\n   1 'Train or Test'           Enum 100%   0%   0%     0 \/  0%     2 \/  0%   0% \n   2 'Speaker Number'          Enum   0% 100%   0%     0 \/  0%    15 \/  2%   0% \n   3 'Sex'                     Enum   0% 100%   0%     0 \/  0%     2 \/  0%   0% \n   4 'Feature 0'               Real   0%   0% 100%     0 \/  0%   853 \/ 86%  74% \n   5 'Feature 1'               Real   0%   0% 100%     0 \/  0%   877 \/ 89%  78% \n   6 'Feature 2'               Real   0%   0% 100%     0 \/  0%   815 \/ 82%  67% \n   7 'Feature 3'               Real   0%   0% 100%     0 \/  0%   836 \/ 84%  71% \n   8 'Feature 4'               Real   0%   0% 100%     0 \/  0%   803 \/ 81%  66% \n   9 'Feature 5'               Real   0%   0% 100%     0 \/  0%   798 \/ 81%  64% \n  10 'Feature 6'               Real   0%   0% 100%     0 \/  0%   748 \/ 76%  57% \n  11 'Feature 7'               Real   0%   0% 100%     0 \/  0%   794 \/ 80%  64% \n  12 'Feature 8'               Real   0%   0% 100%     0 \/  0%   788 \/ 80%  63% \n  13 'Feature 9'               Real   0%   0% 100%     0 \/  0%   775 \/ 78%  60% \n  14 'Class'                   Enum   0% 100%   0%     0 \/  0%    11 \/  1%   0% \n\n\n\n\n Relabeled values in attribute 'Speaker Number'\n    From: 0                       To: Andrew              \n    From: 1                       To: Bill                \n    From: 2                       To: David               \n    From: 3                       To: Mark                \n    From: 4                       To: Jo                  \n    From: 5                       To: Kate                \n    From: 6                       To: Penny               \n    From: 7                       To: Rose                \n    From: 8                       To: Mike                \n    From: 9                       To: Nick                \n    From: 10                      To: Rich                \n    From: 11                      To: Tim                 \n    From: 12                      To: Sarah               \n    From: 13                      To: Sue                 \n    From: 14                      To: Wendy               \n\n\n Relabeled values in attribute 'Sex'\n    From: 0                       To: Male                \n    From: 1                       To: Female              \n\n\n Relabeled values in attribute 'Class'\n    From: 0                       To: hid                 \n    From: 1                       To: hId                 \n    From: 2                       To: hEd                 \n    From: 3                       To: hAd                 \n    From: 4                       To: hYd                 \n    From: 5                       To: had                 \n    From: 6                       To: hOd                 \n    From: 7                       To: hod                 \n    From: 8                       To: hUd                 \n    From: 9                       To: hud                 \n    From: 10                      To: hed",
    "format": "ARFF",
    "uploader": "Jan van Rijn",
    "uploader_id": 1,
    "visibility": "public",
    "creator": null,
    "contributor": null,
    "date": "2014-04-06 23:23:26",
    "update_comment": "dataset should not be used (has train\/test split feature) and is set to private for now",
    "last_update": "2014-09-19 17:45:05",
    "licence": "Public",
    "status": "deactivated",
    "error_message": null,
    "url": "https:\/\/www.openml.org\/data\/download\/58\/dataset_58_vowel.arff",
    "kaggle_url": null,
    "default_target_attribute": "Class",
    "row_id_attribute": null,
    "ignore_attribute": null,
    "runs": 458,
    "suggest": {
        "input": [
            "vowel",
            "Introduction ============ In my work on context-sensitive learning, I used the \"Deterding Vowel Recognition Data\", but I found it necessary to reformulate the data. Implicit in the original data is contextual information on the speaker's gender and identity. For my work, it was necessary to make this information explicit. The file \"vowel-context.data\" adds the speaker's sex and identity as new features. The format of the data file is described below. Peter Turney peter@ai.iit.nrc.ca References = "
        ],
        "weight": 5
    },
    "qualities": {
        "NumberOfInstances": 990,
        "NumberOfFeatures": 14,
        "NumberOfClasses": 11,
        "NumberOfMissingValues": 0,
        "NumberOfInstancesWithMissingValues": 0,
        "NumberOfNumericFeatures": 10,
        "NumberOfSymbolicFeatures": 4,
        "MaxStdDevOfNumericAtts": 1.1752720041815217,
        "MinorityClassPercentage": 9.090909090909092,
        "PercentageOfNumericFeatures": 71.42857142857143,
        "Quartile3MeansOfNumericAtts": 0.5441732323232322,
        "CfsSubsetEval_DecisionStumpAUC": 0.8608742985409651,
        "RandomTreeDepth2AUC": 0.8294657687991022,
        "J48.00001.ErrRate": 0.31313131313131315,
        "MeanAttributeEntropy": 1.9668246198572636,
        "MinorityClassSize": 90,
        "PercentageOfSymbolicFeatures": 28.57142857142857,
        "Quartile3MutualInformation": 0,
        "CfsSubsetEval_DecisionStumpErrRate": 0.30505050505050507,
        "RandomTreeDepth2ErrRate": 0.3212121212121212,
        "J48.00001.Kappa": 0.6555555555555556,
        "MeanKurtosisOfNumericAtts": -0.39142721501684635,
        "NaiveBayesAUC": 0.9440538720538719,
        "Quartile1AttributeEntropy": 0.9967916319816366,
        "Quartile3SkewnessOfNumericAtts": 0.25062290775439544,
        "CfsSubsetEval_DecisionStumpKappa": 0.6644444444444445,
        "RandomTreeDepth2Kappa": 0.6466666666666667,
        "J48.0001.AUC": 0.8742699214365881,
        "MeanMeansOfNumericAtts": -0.10318040404040407,
        "NaiveBayesErrRate": 0.4171717171717172,
        "Quartile1KurtosisOfNumericAtts": -0.5598569198966739,
        "Quartile3StdDevOfNumericAtts": 0.7866928081860705,
        "CfsSubsetEval_NaiveBayesAUC": 0.8608742985409651,
        "RandomTreeDepth3AUC": 0.8294657687991022,
        "J48.0001.ErrRate": 0.31313131313131315,
        "MeanMutualInformation": 0,
        "NaiveBayesKappa": 0.5411111111111111,
        "Quartile1MeansOfNumericAtts": -0.356185606060606,
        "REPTreeDepth1AUC": 0.8406290684624018,
        "CfsSubsetEval_NaiveBayesErrRate": 0.30505050505050507,
        "RandomTreeDepth3ErrRate": 0.3212121212121212,
        "J48.0001.Kappa": 0.6555555555555556,
        "MeanNoiseToSignalRatio": null,
        "NumberOfBinaryFeatures": 2,
        "Quartile1MutualInformation": 0,
        "REPTreeDepth1ErrRate": 0.6393939393939394,
        "CfsSubsetEval_NaiveBayesKappa": 0.6644444444444445,
        "RandomTreeDepth3Kappa": 0.6466666666666667,
        "J48.001.AUC": 0.8742699214365881,
        "MeanNominalAttDistinctValues": 7.5,
        "Quartile1SkewnessOfNumericAtts": -0.006222604957687408,
        "REPTreeDepth1Kappa": 0.29666666666666663,
        "CfsSubsetEval_kNN1NAUC": 0.8608742985409651,
        "StdvNominalAttDistinctValues": 6.557438524302,
        "J48.001.ErrRate": 0.31313131313131315,
        "MeanSkewnessOfNumericAtts": 0.09111527285062525,
        "Quartile1StdDevOfNumericAtts": 0.5725169123224305,
        "REPTreeDepth2AUC": 0.8406290684624018,
        "CfsSubsetEval_kNN1NErrRate": 0.30505050505050507,
        "kNN1NAUC": 0.9572222222222221,
        "J48.001.Kappa": 0.6555555555555556,
        "MeanStdDevOfNumericAtts": 0.699331810942439,
        "Quartile2AttributeEntropy": 0.9967916319816366,
        "REPTreeDepth2ErrRate": 0.6393939393939394,
        "CfsSubsetEval_kNN1NKappa": 0.6644444444444445,
        "kNN1NErrRate": 0.07777777777777778,
        "MajorityClassPercentage": 9.090909090909092,
        "MinAttributeEntropy": 0.9967916319816366,
        "Quartile2KurtosisOfNumericAtts": -0.42609939747851944,
        "REPTreeDepth2Kappa": 0.29666666666666663,
        "ClassEntropy": 3.459431618637298,
        "kNN1NKappa": 0.9144444444444445,
        "MajorityClassSize": 90,
        "MinKurtosisOfNumericAtts": -0.7621395184642852,
        "Quartile2MeansOfNumericAtts": -0.03785202020202025,
        "REPTreeDepth3AUC": 0.8406290684624018,
        "DecisionStumpAUC": 0.6774680134680136,
        "MaxAttributeEntropy": 3.906890595608518,
        "MinMeansOfNumericAtts": -3.203740404040404,
        "Quartile2MutualInformation": 0,
        "REPTreeDepth3ErrRate": 0.6393939393939394,
        "DecisionStumpErrRate": 0.8242424242424242,
        "MaxKurtosisOfNumericAtts": 0.1524368802477145,
        "MinMutualInformation": 0,
        "Quartile2SkewnessOfNumericAtts": 0.06013385313957622,
        "REPTreeDepth3Kappa": 0.29666666666666663,
        "DecisionStumpKappa": 0.09333333333333332,
        "MaxMeansOfNumericAtts": 1.881763636363636,
        "MinNominalAttDistinctValues": 2,
        "PercentageOfBinaryFeatures": 14.285714285714285,
        "Quartile2StdDevOfNumericAtts": 0.6342938790535619,
        "RandomTreeDepth1AUC": 0.8294657687991022,
        "Dimensionality": 0.014141414141414142,
        "MaxMutualInformation": 0,
        "MinSkewnessOfNumericAtts": -0.20615204556382144,
        "PercentageOfInstancesWithMissingValues": 0,
        "Quartile3AttributeEntropy": 3.906890595608518,
        "RandomTreeDepth1ErrRate": 0.3212121212121212,
        "EquivalentNumberOfAtts": null,
        "MaxNominalAttDistinctValues": 15,
        "MinStdDevOfNumericAtts": 0.46192683815131974,
        "PercentageOfMissingValues": 0,
        "Quartile3KurtosisOfNumericAtts": -0.24255761544063992,
        "AutoCorrelation": 0,
        "RandomTreeDepth1Kappa": 0.6466666666666667,
        "J48.00001.AUC": 0.8742699214365881,
        "MaxSkewnessOfNumericAtts": 0.3570604221210384
    },
    "tags": [
        {
            "tag": "study_37",
            "uploader": "1"
        },
        {
            "tag": "uci",
            "uploader": "1"
        }
    ],
    "features": [
        {
            "name": "Class",
            "index": "13",
            "type": "nominal",
            "distinct": "11",
            "missing": "0",
            "target": "1",
            "distr": [
                [
                    "hid",
                    "hId",
                    "hEd",
                    "hAd",
                    "hYd",
                    "had",
                    "hOd",
                    "hod",
                    "hUd",
                    "hud",
                    "hed"
                ],
                [
                    [
                        "90",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0"
                    ],
                    [
                        "0",
                        "90",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0"
                    ],
                    [
                        "0",
                        "0",
                        "90",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0"
                    ],
                    [
                        "0",
                        "0",
                        "0",
                        "90",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0"
                    ],
                    [
                        "0",
                        "0",
                        "0",
                        "0",
                        "90",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0"
                    ],
                    [
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "90",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0"
                    ],
                    [
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "90",
                        "0",
                        "0",
                        "0",
                        "0"
                    ],
                    [
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "90",
                        "0",
                        "0",
                        "0"
                    ],
                    [
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "90",
                        "0",
                        "0"
                    ],
                    [
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "90",
                        "0"
                    ],
                    [
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "0",
                        "90"
                    ]
                ]
            ]
        },
        {
            "name": "Train_or_Test",
            "index": "0",
            "type": "nominal",
            "distinct": "2",
            "missing": "0",
            "distr": [
                [
                    "Train",
                    "Test"
                ],
                [
                    [
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48"
                    ],
                    [
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42"
                    ]
                ]
            ]
        },
        {
            "name": "Speaker_Number",
            "index": "1",
            "type": "nominal",
            "distinct": "15",
            "missing": "0",
            "distr": [
                [
                    "Andrew",
                    "Bill",
                    "David",
                    "Mark",
                    "Jo",
                    "Kate",
                    "Penny",
                    "Rose",
                    "Mike",
                    "Nick",
                    "Rich",
                    "Tim",
                    "Sarah",
                    "Sue",
                    "Wendy"
                ],
                [
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ],
                    [
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6",
                        "6"
                    ]
                ]
            ]
        },
        {
            "name": "Sex",
            "index": "2",
            "type": "nominal",
            "distinct": "2",
            "missing": "0",
            "distr": [
                [
                    "Male",
                    "Female"
                ],
                [
                    [
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48",
                        "48"
                    ],
                    [
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42",
                        "42"
                    ]
                ]
            ]
        },
        {
            "name": "Feature_0",
            "index": "3",
            "type": "numeric",
            "distinct": "853",
            "missing": "0",
            "min": "-5",
            "max": "0",
            "mean": "-3",
            "stdev": "1"
        },
        {
            "name": "Feature_1",
            "index": "4",
            "type": "numeric",
            "distinct": "877",
            "missing": "0",
            "min": "-1",
            "max": "5",
            "mean": "2",
            "stdev": "1"
        },
        {
            "name": "Feature_2",
            "index": "5",
            "type": "numeric",
            "distinct": "815",
            "missing": "0",
            "min": "-2",
            "max": "1",
            "mean": "-1",
            "stdev": "1"
        },
        {
            "name": "Feature_3",
            "index": "6",
            "type": "numeric",
            "distinct": "836",
            "missing": "0",
            "min": "-1",
            "max": "2",
            "mean": "1",
            "stdev": "1"
        },
        {
            "name": "Feature_4",
            "index": "7",
            "type": "numeric",
            "distinct": "803",
            "missing": "0",
            "min": "-2",
            "max": "2",
            "mean": "0",
            "stdev": "1"
        },
        {
            "name": "Feature_5",
            "index": "8",
            "type": "numeric",
            "distinct": "798",
            "missing": "0",
            "min": "-1",
            "max": "2",
            "mean": "1",
            "stdev": "1"
        },
        {
            "name": "Feature_6",
            "index": "9",
            "type": "numeric",
            "distinct": "748",
            "missing": "0",
            "min": "-2",
            "max": "1",
            "mean": "0",
            "stdev": "0"
        },
        {
            "name": "Feature_7",
            "index": "10",
            "type": "numeric",
            "distinct": "794",
            "missing": "0",
            "min": "-1",
            "max": "2",
            "mean": "0",
            "stdev": "1"
        },
        {
            "name": "Feature_8",
            "index": "11",
            "type": "numeric",
            "distinct": "788",
            "missing": "0",
            "min": "-2",
            "max": "1",
            "mean": "0",
            "stdev": "1"
        },
        {
            "name": "Feature_9",
            "index": "12",
            "type": "numeric",
            "distinct": "775",
            "missing": "0",
            "min": "-2",
            "max": "1",
            "mean": "0",
            "stdev": "1"
        }
    ],
    "nr_of_issues": 0,
    "nr_of_downvotes": 0,
    "nr_of_likes": 0,
    "nr_of_downloads": 0,
    "total_downloads": 0,
    "reach": 0,
    "reuse": 0,
    "impact_of_reuse": 0,
    "reach_of_reuse": 0,
    "impact": 0
}