{ "data_id": "24", "name": "mushroom", "exact_name": "mushroom", "version": 1, "version_label": "1", "description": "**Author**: [Jeff Schlimmer](Jeffrey.Schlimmer@a.gp.cs.cmu.edu) \r\n**Source**: [UCI](https:\/\/archive.ics.uci.edu\/ml\/datasets\/mushroom) - 1981 \r\n**Please cite**: The Audubon Society Field Guide to North American Mushrooms (1981). G. H. Lincoff (Pres.), New York: Alfred A. Knopf \r\n\r\n\r\n### Description\r\n\r\nThis dataset describes mushrooms in terms of their physical characteristics. They are classified into: poisonous or edible.\r\n\r\n### Source\r\n```\r\n(a) Origin: \r\nMushroom records are drawn from The Audubon Society Field Guide to North American Mushrooms (1981). G. H. Lincoff (Pres.), New York: Alfred A. Knopf \r\n\r\n(b) Donor: \r\nJeff Schlimmer (Jeffrey.Schlimmer '@' a.gp.cs.cmu.edu)\r\n```\r\n\r\n### Dataset description\r\n\r\nThis dataset includes descriptions of hypothetical samples corresponding to 23 species of gilled mushrooms in the Agaricus and Lepiota Family. Each species is identified as definitely edible, definitely poisonous, or of unknown edibility and not recommended. This latter class was combined with the poisonous one. The Guide clearly states that there is no simple rule for determining the edibility of a mushroom; no rule like ``leaflets three, let it be'' for Poisonous Oak and Ivy.\r\n\r\n### Attributes Information\r\n```\r\n1. cap-shape: bell=b,conical=c,convex=x,flat=f, knobbed=k,sunken=s \r\n2. cap-surface: fibrous=f,grooves=g,scaly=y,smooth=s \r\n3. cap-color: brown=n,buff=b,cinnamon=c,gray=g,green=r, pink=p,purple=u,red=e,white=w,yellow=y \r\n4. bruises?: bruises=t,no=f \r\n5. odor: almond=a,anise=l,creosote=c,fishy=y,foul=f, musty=m,none=n,pungent=p,spicy=s \r\n6. gill-attachment: attached=a,descending=d,free=f,notched=n \r\n7. gill-spacing: close=c,crowded=w,distant=d \r\n8. gill-size: broad=b,narrow=n \r\n9. 
gill-color: black=k,brown=n,buff=b,chocolate=h,gray=g, green=r,orange=o,pink=p,purple=u,red=e, white=w,yellow=y \r\n10. stalk-shape: enlarging=e,tapering=t \r\n11. stalk-root: bulbous=b,club=c,cup=u,equal=e, rhizomorphs=z,rooted=r,missing=? \r\n12. stalk-surface-above-ring: fibrous=f,scaly=y,silky=k,smooth=s \r\n13. stalk-surface-below-ring: fibrous=f,scaly=y,silky=k,smooth=s \r\n14. stalk-color-above-ring: brown=n,buff=b,cinnamon=c,gray=g,orange=o, pink=p,red=e,white=w,yellow=y \r\n15. stalk-color-below-ring: brown=n,buff=b,cinnamon=c,gray=g,orange=o, pink=p,red=e,white=w,yellow=y \r\n16. veil-type: partial=p,universal=u \r\n17. veil-color: brown=n,orange=o,white=w,yellow=y \r\n18. ring-number: none=n,one=o,two=t \r\n19. ring-type: cobwebby=c,evanescent=e,flaring=f,large=l, none=n,pendant=p,sheathing=s,zone=z \r\n20. spore-print-color: black=k,brown=n,buff=b,chocolate=h,green=r, orange=o,purple=u,white=w,yellow=y \r\n21. population: abundant=a,clustered=c,numerous=n, scattered=s,several=v,solitary=y \r\n22. habitat: grasses=g,leaves=l,meadows=m,paths=p, urban=u,waste=w,woods=d\r\n```\r\n\r\n### Relevant papers\r\n\r\nSchlimmer,J.S. (1987). Concept Acquisition Through Representational Adjustment (Technical Report 87-19). Doctoral dissertation, Department of Information and Computer Science, University of California, Irvine. \r\n\r\nIba,W., Wogulis,J., & Langley,P. (1988). Trading off Simplicity and Coverage in Incremental Concept Learning. In Proceedings of the 5th International Conference on Machine Learning, 73-79. Ann Arbor, Michigan: Morgan Kaufmann. \r\n\r\nDuch W, Adamczak R, Grabczewski K (1996) Extraction of logical rules from training data using backpropagation networks, in: Proc. of the 1st Online Workshop on Soft Computing, 19-30.Aug.1996, pp. 25-30, [Web Link] \r\n\r\nDuch W, Adamczak R, Grabczewski K, Ishikawa M, Ueda H, Extraction of crisp logical rules using constrained backpropagation networks - comparison of two new approaches, in: Proc. 
of the European Symposium on Artificial Neural Networks (ESANN'97), Bruges, Belgium 16-18.4.1997. \r\n\r\n\r\n\r\n", "format": "ARFF", "uploader": "Jan van Rijn", "uploader_id": 1, "visibility": "public", "creator": "Jeff Schlimmer", "contributor": null, "date": "2014-04-06 23:21:11", "update_comment": null, "last_update": "2014-04-06 23:21:11", "licence": "Public", "status": "active", "error_message": null, "url": "https:\/\/www.openml.org\/data\/download\/24\/dataset_24_mushroom.arff", "default_target_attribute": "class", "row_id_attribute": null, "ignore_attribute": null, "runs": 16692, "suggest": { "input": [ "mushroom", "### Description This dataset describes mushrooms in terms of their physical characteristics. They are classified into: poisonous or edible. ### Source ``` (a) Origin: Mushroom records are drawn from The Audubon Society Field Guide to North American Mushrooms (1981). G. H. Lincoff (Pres.), New York: Alfred A. Knopf (b) Donor: Jeff Schlimmer (Jeffrey.Schlimmer '@' a.gp.cs.cmu.edu) ``` ### Dataset description This dataset includes descriptions of hypothetical samples corresponding to 23 species of " ], "weight": 5 }, "qualities": { "NumberOfInstances": 8124, "NumberOfFeatures": 23, "NumberOfClasses": 2, "NumberOfMissingValues": 2480, "NumberOfInstancesWithMissingValues": 2480, "NumberOfNumericFeatures": 0, "NumberOfSymbolicFeatures": 23, "Quartile2AttributeEntropy": 1.467128011861462, "REPTreeDepth2ErrRate": 0.00036927621861152144, "CfsSubsetEval_kNN1NKappa": 0.9738461616958994, "kNN1NErrRate": 0, "MajorityClassPercentage": 51.7971442639094, "MeanStdDevOfNumericAtts": null, "Quartile2KurtosisOfNumericAtts": null, "REPTreeDepth2Kappa": 0.9992605118549308, "ClassEntropy": 0.9990678968724604, "kNN1NKappa": 1, "MajorityClassSize": 4208, "MinAttributeEntropy": -0, "Quartile2MeansOfNumericAtts": null, "REPTreeDepth3AUC": 0.9999987256143267, "DecisionStumpAUC": 0.8894935275772204, "MaxAttributeEntropy": 3.030432883772633, "MinKurtosisOfNumericAtts": 
null, "Quartile2MutualInformation": 0.174606545183155, "REPTreeDepth3ErrRate": 0.00036927621861152144, "DecisionStumpErrRate": 0.11324470704086657, "MaxKurtosisOfNumericAtts": null, "MinMeansOfNumericAtts": null, "Quartile2SkewnessOfNumericAtts": null, "REPTreeDepth3Kappa": 0.9992605118549308, "DecisionStumpKappa": 0.77457574608175, "MaxMeansOfNumericAtts": null, "MinMutualInformation": 0, "PercentageOfBinaryFeatures": 21.73913043478261, "Quartile2StdDevOfNumericAtts": null, "RandomTreeDepth1AUC": 0.9995247148288974, "Dimensionality": 0.002831117676021664, "MaxMutualInformation": 0.906074977384, "MinNominalAttDistinctValues": 1, "PercentageOfInstancesWithMissingValues": 30.526834071885773, "Quartile3AttributeEntropy": 2.0533554351937426, "RandomTreeDepth1ErrRate": 0.0004923682914820286, "EquivalentNumberOfAtts": 5.0393135801657, "MaxNominalAttDistinctValues": 12, "MinSkewnessOfNumericAtts": null, "PercentageOfMissingValues": 1.3272536552993814, "Quartile3KurtosisOfNumericAtts": null, "AutoCorrelation": 0.726332635725717, "RandomTreeDepth1Kappa": 0.9990140245420991, "J48.00001.AUC": 1, "MaxSkewnessOfNumericAtts": null, "MinStdDevOfNumericAtts": null, "PercentageOfNumericFeatures": 0, "Quartile3MeansOfNumericAtts": null, "CfsSubsetEval_DecisionStumpAUC": 0.9910519616800724, "RandomTreeDepth2AUC": 0.9995247148288974, "J48.00001.ErrRate": 0, "MaxStdDevOfNumericAtts": null, "MinorityClassPercentage": 48.20285573609059, "PercentageOfSymbolicFeatures": 100, "Quartile3MutualInformation": 0.27510225484918505, "CfsSubsetEval_DecisionStumpErrRate": 0.013047759724273756, "RandomTreeDepth2ErrRate": 0.0004923682914820286, "J48.00001.Kappa": 1, "MeanAttributeEntropy": 1.4092554739602103, "MinorityClassSize": 3916, "Quartile1AttributeEntropy": 0.8286618104993447, "Quartile3SkewnessOfNumericAtts": null, "CfsSubsetEval_DecisionStumpKappa": 0.9738461616958994, "RandomTreeDepth2Kappa": 0.9990140245420991, "J48.0001.AUC": 1, "MeanKurtosisOfNumericAtts": null, "NaiveBayesAUC": 
0.9976229672941662, "Quartile1KurtosisOfNumericAtts": null, "Quartile3StdDevOfNumericAtts": null, "CfsSubsetEval_NaiveBayesAUC": 0.9910519616800724, "RandomTreeDepth3AUC": 0.9995247148288974, "J48.0001.ErrRate": 0, "MeanMeansOfNumericAtts": null, "NaiveBayesErrRate": 0.04899064500246184, "Quartile1MeansOfNumericAtts": null, "REPTreeDepth1AUC": 0.9999987256143267, "CfsSubsetEval_NaiveBayesErrRate": 0.013047759724273756, "RandomTreeDepth3ErrRate": 0.0004923682914820286, "J48.0001.Kappa": 1, "MeanMutualInformation": 0.19825475850613955, "NaiveBayesKappa": 0.9015972799616292, "Quartile1MutualInformation": 0.034184520425602494, "REPTreeDepth1ErrRate": 0.00036927621861152144, "CfsSubsetEval_NaiveBayesKappa": 0.9738461616958994, "RandomTreeDepth3Kappa": 0.9990140245420991, "J48.001.AUC": 1, "MeanNoiseToSignalRatio": 6.108305922031972, "NumberOfBinaryFeatures": 5, "Quartile1SkewnessOfNumericAtts": null, "REPTreeDepth1Kappa": 0.9992605118549308, "CfsSubsetEval_kNN1NAUC": 0.9910519616800724, "StdvNominalAttDistinctValues": 3.1809710899501766, "J48.001.ErrRate": 0, "MeanNominalAttDistinctValues": 5.130434782608695, "Quartile1StdDevOfNumericAtts": null, "REPTreeDepth2AUC": 0.9999987256143267, "CfsSubsetEval_kNN1NErrRate": 0.013047759724273756, "kNN1NAUC": 1, "J48.001.Kappa": 1, "MeanSkewnessOfNumericAtts": null }, "tags": [ { "uploader": "38960", "tag": "Life Science" }, { "uploader": "38960", "tag": "Machine Learning" }, { "uploader": "1", "tag": "mythbusting_1" }, { "uploader": "348", "tag": "OpenML100" }, { "uploader": "2", "tag": "study_1" }, { "uploader": "3886", "tag": "study_123" }, { "uploader": "64", "tag": "study_14" }, { "uploader": "5824", "tag": "study_144" }, { "uploader": "939", "tag": "study_15" }, { "uploader": "7272", "tag": "study_190" }, { "uploader": "939", "tag": "study_20" }, { "uploader": "1", "tag": "study_34" }, { "uploader": "1", "tag": "study_37" }, { "uploader": "1", "tag": "study_41" }, { "uploader": "64", "tag": "study_50" }, { "uploader": 
"1856", "tag": "study_70" }, { "uploader": "1140", "tag": "trivial" }, { "uploader": "1", "tag": "uci" } ], "features": [ { "name": "class", "index": "22", "type": "nominal", "distinct": "2", "missing": "0", "target": "1", "distr": [ [ "e", "p" ], [ [ "4208", "0" ], [ "0", "3916" ] ] ] }, { "name": "cap-shape", "index": "0", "type": "nominal", "distinct": "6", "missing": "0", "distr": [ [ "b", "c", "f", "k", "s", "x" ], [ [ "404", "48" ], [ "0", "4" ], [ "1596", "1556" ], [ "228", "600" ], [ "32", "0" ], [ "1948", "1708" ] ] ] }, { "name": "cap-surface", "index": "1", "type": "nominal", "distinct": "4", "missing": "0", "distr": [ [ "f", "g", "s", "y" ], [ [ "1560", "760" ], [ "0", "4" ], [ "1144", "1412" ], [ "1504", "1740" ] ] ] }, { "name": "cap-color", "index": "2", "type": "nominal", "distinct": "10", "missing": "0", "distr": [ [ "b", "c", "e", "g", "n", "p", "r", "u", "w", "y" ], [ [ "48", "120" ], [ "32", "12" ], [ "624", "876" ], [ "1032", "808" ], [ "1264", "1020" ], [ "56", "88" ], [ "16", "0" ], [ "16", "0" ], [ "720", "320" ], [ "400", "672" ] ] ] }, { "name": "bruises%3F", "index": "3", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "f", "t" ], [ [ "1456", "3292" ], [ "2752", "624" ] ] ] }, { "name": "odor", "index": "4", "type": "nominal", "distinct": "9", "missing": "0", "distr": [ [ "a", "c", "f", "l", "m", "n", "p", "s", "y" ], [ [ "400", "0" ], [ "0", "192" ], [ "0", "2160" ], [ "400", "0" ], [ "0", "36" ], [ "3408", "120" ], [ "0", "256" ], [ "0", "576" ], [ "0", "576" ] ] ] }, { "name": "gill-attachment", "index": "5", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "a", "d", "f", "n" ], [ [ "192", "18" ], [ "0", "0" ], [ "4016", "3898" ], [ "0", "0" ] ] ] }, { "name": "gill-spacing", "index": "6", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "c", "d", "w" ], [ [ "3008", "3804" ], [ "0", "0" ], [ "1200", "112" ] ] ] }, { "name": "gill-size", "index": "7", "type": "nominal", "distinct": "2", 
"missing": "0", "distr": [ [ "b", "n" ], [ [ "3920", "1692" ], [ "288", "2224" ] ] ] }, { "name": "gill-color", "index": "8", "type": "nominal", "distinct": "12", "missing": "0", "distr": [ [ "b", "e", "g", "h", "k", "n", "o", "p", "r", "u", "w", "y" ], [ [ "0", "1728" ], [ "96", "0" ], [ "248", "504" ], [ "204", "528" ], [ "344", "64" ], [ "936", "112" ], [ "64", "0" ], [ "852", "640" ], [ "0", "24" ], [ "444", "48" ], [ "956", "246" ], [ "64", "22" ] ] ] }, { "name": "stalk-shape", "index": "9", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "e", "t" ], [ [ "1616", "1900" ], [ "2592", "2016" ] ] ] }, { "name": "stalk-root", "index": "10", "type": "nominal", "distinct": "4", "missing": "2480", "distr": [ [ "b", "c", "e", "r", "u", "z" ], [ [ "1920", "1856" ], [ "512", "44" ], [ "864", "256" ], [ "192", "0" ], [ "0", "0" ], [ "0", "0" ] ] ] }, { "name": "stalk-surface-above-ring", "index": "11", "type": "nominal", "distinct": "4", "missing": "0", "distr": [ [ "f", "k", "s", "y" ], [ [ "408", "144" ], [ "144", "2228" ], [ "3640", "1536" ], [ "16", "8" ] ] ] }, { "name": "stalk-surface-below-ring", "index": "12", "type": "nominal", "distinct": "4", "missing": "0", "distr": [ [ "f", "k", "s", "y" ], [ [ "456", "144" ], [ "144", "2160" ], [ "3400", "1536" ], [ "208", "76" ] ] ] }, { "name": "stalk-color-above-ring", "index": "13", "type": "nominal", "distinct": "9", "missing": "0", "distr": [ [ "b", "c", "e", "g", "n", "o", "p", "w", "y" ], [ [ "0", "432" ], [ "0", "36" ], [ "96", "0" ], [ "576", "0" ], [ "16", "432" ], [ "192", "0" ], [ "576", "1296" ], [ "2752", "1712" ], [ "0", "8" ] ] ] }, { "name": "stalk-color-below-ring", "index": "14", "type": "nominal", "distinct": "9", "missing": "0", "distr": [ [ "b", "c", "e", "g", "n", "o", "p", "w", "y" ], [ [ "0", "432" ], [ "0", "36" ], [ "96", "0" ], [ "576", "0" ], [ "64", "448" ], [ "192", "0" ], [ "576", "1296" ], [ "2704", "1680" ], [ "0", "24" ] ] ] }, { "name": "veil-type", "index": "15", 
"type": "nominal", "distinct": "1", "missing": "0", "distr": [ [ "p", "u" ], [ [ "4208", "3916" ], [ "0", "0" ] ] ] }, { "name": "veil-color", "index": "16", "type": "nominal", "distinct": "4", "missing": "0", "distr": [ [ "n", "o", "w", "y" ], [ [ "96", "0" ], [ "96", "0" ], [ "4016", "3908" ], [ "0", "8" ] ] ] }, { "name": "ring-number", "index": "17", "type": "nominal", "distinct": "3", "missing": "0", "distr": [ [ "n", "o", "t" ], [ [ "0", "36" ], [ "3680", "3808" ], [ "528", "72" ] ] ] }, { "name": "ring-type", "index": "18", "type": "nominal", "distinct": "5", "missing": "0", "distr": [ [ "c", "e", "f", "l", "n", "p", "s", "z" ], [ [ "0", "0" ], [ "1008", "1768" ], [ "48", "0" ], [ "0", "1296" ], [ "0", "36" ], [ "3152", "816" ], [ "0", "0" ], [ "0", "0" ] ] ] }, { "name": "spore-print-color", "index": "19", "type": "nominal", "distinct": "9", "missing": "0", "distr": [ [ "b", "h", "k", "n", "o", "r", "u", "w", "y" ], [ [ "48", "0" ], [ "48", "1584" ], [ "1648", "224" ], [ "1744", "224" ], [ "48", "0" ], [ "0", "72" ], [ "48", "0" ], [ "576", "1812" ], [ "48", "0" ] ] ] }, { "name": "population", "index": "20", "type": "nominal", "distinct": "6", "missing": "0", "distr": [ [ "a", "c", "n", "s", "v", "y" ], [ [ "384", "0" ], [ "288", "52" ], [ "400", "0" ], [ "880", "368" ], [ "1192", "2848" ], [ "1064", "648" ] ] ] }, { "name": "habitat", "index": "21", "type": "nominal", "distinct": "7", "missing": "0", "distr": [ [ "d", "g", "l", "m", "p", "u", "w" ], [ [ "1880", "1268" ], [ "1408", "740" ], [ "240", "592" ], [ "256", "36" ], [ "136", "1008" ], [ "96", "272" ], [ "192", "0" ] ] ] } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 0, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 0 }