{ "data_id": "340", "name": "squash-stored", "exact_name": "squash-stored", "version": 1, "version_label": null, "description": "**Author**: Winna Harvey \n**Source**: [original](http:\/\/www.cs.waikato.ac.nz\/ml\/weka\/datasets.html) - \n**Please cite**: \n\nSquash Harvest Stored\n\nData source: Winna Harvey\nCrop and Food Research, Christchurch, New Zealand\n\nThe purpose of the research was to determine the changes taking place in squash fruit during the maturation and ripening so as to pinpoint the best time to give the best quality at the marketplace (Japan). The squash is transported to Japan by refrigerated cargo vessels and takes three to four weeks to reach the market. Evaluations were carried out at a stage representing the quality inspection stage prior to export and also at the stage it would reach on arriving at the market place. \n\nThe original objectives were to determine which pre-harvest variables contribute to good tasting squash after different periods of storage time. This is determined by whether a measure of acceptability found by categorising each squash as either unacceptable, acceptable or excellent.\n\nThe fruit in this dataset were stored before being measured, and they have an extra attribute that squash-unstored lacks - the weight of the fruit after storage.\n\nAttribute Information:\n1. site - where fruit is located - enumerated\n2. daf - number of days after flowering - enumerated\n3. fruit - individual number of the fruit (not unique) - enumerated\n4. weight - weight of whole fruit in grams - real\n5. storewt - weight of fruit after storage - real\n6. pene - penetrometer indicates maturity of fruit at harvest - integer\n7. solids_% - a test for dry matter - integer\n8. brix - a refractometer measurement used to indicate sweetness or ripeness of the fruit - integer \n9. a - the a coordinate of the HunterLab L-a-b notation of colour measurement - integer\n10. egdd - the heat accumulation above a base of 8c from emergence of the plant to harvest of the fruit - real\n11. fgdd - the heat accumulation above a base of 8c from flowering to harvesting - real\n12. groundspot_a - the number indicating colour of skin where the fruit rested on the ground - integer\n13. glucose - measured in mg\/100g of fresh weight - integer\n14. fructose - measured in mg\/100g of fresh weight - integer\n15. sucrose - measured in mg\/100g of fresh weight - integer\n16. total - measured in mg\/100g of fresh weight - integer\n17. glucose+fructose - measured in mg\/100g of fresh weight - integer\n18. starch - measured in mg\/100g of fresh weight - integer\n19. sweetness - the mean of eight taste panel scores; out of 1500 - integer\n20. flavour - the mean of eight taste panel scores; out of 1500 - integer\n21. dry\/moist - the mean of eight taste panel scores; out of 1500 - integer\n22. fibre - the mean of eight taste panel scores; out of 1500 - integer\n23. heat_input_emerg - the amount of heat emergence after harvest - real\n24. heat_input_flower - the amount of heat input before flowering - real\n25. Acceptability - the acceptability of the fruit - enumerated", "format": "ARFF", "uploader": "Joaquin Vanschoren", "uploader_id": 2, "visibility": "public", "creator": "\"Winna Harvey\"", "contributor": null, "date": "2014-08-26 22:02:38", "update_comment": null, "last_update": "2014-08-26 22:02:38", "licence": "Public", "status": "active", "error_message": null, "url": "https:\/\/www.openml.org\/data\/download\/52243\/phpH2fvsy", "kaggle_url": null, "default_target_attribute": "Acceptability", "row_id_attribute": null, "ignore_attribute": null, "runs": 867, "suggest": { "input": [ "squash-stored", "Squash Harvest Stored Data source: Winna Harvey Crop and Food Research, Christchurch, New Zealand The purpose of the research was to determine the changes taking place in squash fruit during the maturation and ripening so as to pinpoint the best time to give the best quality at the marketplace (Japan). The squash is transported to Japan by refrigerated cargo vessels and takes three to four weeks to reach the market. Evaluations were carried out at a stage representing the quality inspection stag " ], "weight": 5 }, "qualities": { "NumberOfInstances": 52, "NumberOfFeatures": 25, "NumberOfClasses": 3, "NumberOfMissingValues": 7, "NumberOfInstancesWithMissingValues": 2, "NumberOfNumericFeatures": 21, "NumberOfSymbolicFeatures": 4, "MaxStdDevOfNumericAtts": 425.9121464406377, "MinorityClassPercentage": 15.384615384615385, "PercentageOfNumericFeatures": 84, "Quartile3MeansOfNumericAtts": 703.1923076923076, "CfsSubsetEval_DecisionStumpAUC": 0.7042831681666498, "RandomTreeDepth2AUC": 0.5847406208919002, "J48.00001.ErrRate": 0.36538461538461536, "MeanAttributeEntropy": 2.685752542223271, "MinorityClassSize": 8, "PercentageOfSymbolicFeatures": 16, "Quartile3MutualInformation": 0.62754806675663, "CfsSubsetEval_DecisionStumpErrRate": 0.40384615384615385, "RandomTreeDepth2ErrRate": 0.4807692307692308, "J48.00001.Kappa": 0.371900826446281, "MeanKurtosisOfNumericAtts": -0.3067974601828094, "NaiveBayesAUC": 0.7184726424854345, "Quartile1AttributeEntropy": 1.5766212201074912, "Quartile3SkewnessOfNumericAtts": 0.5407755757069411, "CfsSubsetEval_DecisionStumpKappa": 0.30929791271347246, "RandomTreeDepth2Kappa": 0.21259842519685043, "J48.0001.AUC": 0.7209042000824536, "MeanMeansOfNumericAtts": 412.4587287222581, "NaiveBayesErrRate": 0.38461538461538464, "Quartile1KurtosisOfNumericAtts": -0.7088661713886617, "Quartile3StdDevOfNumericAtts": 156.192083087038, "CfsSubsetEval_NaiveBayesAUC": 0.7042831681666498, "RandomTreeDepth3AUC": 0.5847406208919002, "J48.0001.ErrRate": 0.36538461538461536, "MeanMutualInformation": 0.3768398731472633, "NaiveBayesKappa": 0.377245508982036, "Quartile1MeansOfNumericAtts": 17.292922322775265, "REPTreeDepth1AUC": 0.4833147942157953, "CfsSubsetEval_NaiveBayesErrRate": 0.40384615384615385, "RandomTreeDepth3ErrRate": 0.4807692307692308, "J48.0001.Kappa": 0.371900826446281, "MeanNoiseToSignalRatio": 6.127039184554973, "NumberOfBinaryFeatures": 0, "Quartile1MutualInformation": 0.23962744768583, "REPTreeDepth1ErrRate": 0.5576923076923077, "CfsSubsetEval_NaiveBayesKappa": 0.30929791271347246, "RandomTreeDepth3Kappa": 0.21259842519685043, "J48.001.AUC": 0.7209042000824536, "MeanNominalAttDistinctValues": 8.25, "Quartile1SkewnessOfNumericAtts": -0.5239589317341413, "REPTreeDepth1Kappa": 0, "CfsSubsetEval_kNN1NAUC": 0.7042831681666498, "StdvNominalAttDistinctValues": 9.215023964519391, "J48.001.ErrRate": 0.36538461538461536, "MeanSkewnessOfNumericAtts": -0.017664249746357514, "Quartile1StdDevOfNumericAtts": 4.666952749074117, "REPTreeDepth2AUC": 0.4833147942157953, "CfsSubsetEval_kNN1NErrRate": 0.40384615384615385, "kNN1NAUC": 0.6366543634341186, "J48.001.Kappa": 0.371900826446281, "MeanStdDevOfNumericAtts": 94.86049223849403, "Quartile2AttributeEntropy": 2.237397409783102, "REPTreeDepth2ErrRate": 0.5576923076923077, "CfsSubsetEval_kNN1NKappa": 0.30929791271347246, "kNN1NErrRate": 0.4423076923076923, "MajorityClassPercentage": 44.230769230769226, "MinAttributeEntropy": 1.5766212201074912, "Quartile2KurtosisOfNumericAtts": -0.36530084838997157, "REPTreeDepth2Kappa": 0, "ClassEntropy": 1.4642745091475293, "kNN1NKappa": 0.2880952380952381, "MajorityClassSize": 23, "MinKurtosisOfNumericAtts": -1.047318179797295, "Quartile2MeansOfNumericAtts": 89.25254901960784, "REPTreeDepth3AUC": 0.4833147942157953, "DecisionStumpAUC": 0.6869294845866035, "MaxAttributeEntropy": 4.24323899677922, "MinMeansOfNumericAtts": 8.026923076923078, "Quartile2MutualInformation": 0.26334410499933, "REPTreeDepth3ErrRate": 0.5576923076923077, "DecisionStumpErrRate": 0.38461538461538464, "MaxKurtosisOfNumericAtts": 1.2412162845303851, "MinMutualInformation": 0.23962744768583, "Quartile2SkewnessOfNumericAtts": -0.10174508875447302, "REPTreeDepth3Kappa": 0, "DecisionStumpKappa": 0.3316195372750643, "MaxMeansOfNumericAtts": 1859.480769230769, "MinNominalAttDistinctValues": 3, "PercentageOfBinaryFeatures": 0, "Quartile2StdDevOfNumericAtts": 34.99794438781439, "RandomTreeDepth1AUC": 0.5847406208919002, "Dimensionality": 0.4807692307692308, "MaxMutualInformation": 0.62754806675663, "MinSkewnessOfNumericAtts": -1.1725728555295991, "PercentageOfInstancesWithMissingValues": 3.8461538461538463, "Quartile3AttributeEntropy": 4.24323899677922, "RandomTreeDepth1ErrRate": 0.4807692307692308, "EquivalentNumberOfAtts": 3.885667662814739, "MaxNominalAttDistinctValues": 22, "MinStdDevOfNumericAtts": 1.837971987175274, "PercentageOfMissingValues": 0.5384615384615384, "Quartile3KurtosisOfNumericAtts": -0.06581672820996043, "AutoCorrelation": 0.5098039215686274, "RandomTreeDepth1Kappa": 0.21259842519685043, "J48.00001.AUC": 0.7209042000824536, "MaxSkewnessOfNumericAtts": 0.9681279603692765 }, "tags": [ { "uploader": "38960", "tag": "Agriculture" }, { "uploader": "38960", "tag": "Food Science" }, { "uploader": "2", "tag": "study_1" }, { "uploader": "1", "tag": "study_41" } ], "features": [ { "name": "Acceptability", "index": "24", "type": "nominal", "distinct": "3", "missing": "0", "target": "1", "distr": [ [ "excellent", "ok", "not_acceptable" ], [ [ "23", "0", "0" ], [ "0", "21", "0" ], [ "0", "0", "8" ] ] ] }, { "name": "site", "index": "0", "type": "nominal", "distinct": "3", "missing": "0", "distr": [ [ "P", "HB", "LINC" ], [ [ "10", "6", "0" ], [ "8", "8", "0" ], [ "5", "7", "8" ] ] ] }, { "name": "daf", "index": "1", "type": "nominal", "distinct": "5", "missing": "0", "distr": [ [ "30", "40", "50", "60", "70" ], [ [ "2", "6", "4" ], [ "5", "7", "0" ], [ "5", "5", "2" ], [ "7", "3", "2" ], [ "4", "0", "0" ] ] ] }, { "name": "fruit", "index": "2", "type": "nominal", "distinct": "22", "missing": "0", "distr": [ [ "1", "2", "9", "10", "7", "11", "17", "3", "4", "12", "8", "13", "5", "15", "6", "20", "14", "23", "27", "16", "19", "21" ], [ [ "1", "2", "0" ], [ "1", "1", "0" ], [ "0", "1", "2" ], [ "0", "1", "0" ], [ "2", "3", "1" ], [ "3", "1", "0" ], [ "1", "1", "0" ], [ "1", "1", "2" ], [ "3", "1", "0" ], [ "2", "0", "0" ], [ "2", "2", "0" ], [ "1", "1", "0" ], [ "1", "1", "0" ], [ "0", "2", "0" ], [ "0", "2", "1" ], [ "1", "0", "0" ], [ "1", "0", "1" ], [ "1", "0", "0" ], [ "1", "0", "0" ], [ "0", "1", "0" ], [ "0", "0", "1" ], [ "1", "0", "0" ] ] ] }, { "name": "weight", "index": "3", "type": "numeric", "distinct": "50", "missing": "0", "min": "1156", "max": "2872", "mean": "1859", "stdev": "426" }, { "name": "storewt", "index": "4", "type": "numeric", "distinct": "52", "missing": "0", "min": "1067", "max": "2607", "mean": "1724", "stdev": "389" }, { "name": "pene", "index": "5", "type": "numeric", "distinct": "37", "missing": "0", "min": "3", "max": "11", "mean": "8", "stdev": "2" }, { "name": "solids", "index": "6", "type": "numeric", "distinct": "43", "missing": "0", "min": "14", "max": "31", "mean": "23", "stdev": "4" }, { "name": "brix", "index": "7", "type": "numeric", "distinct": "31", "missing": "0", "min": "8", "max": "15", "mean": "12", "stdev": "2" }, { "name": "a*", "index": "8", "type": "numeric", "distinct": "45", "missing": "0", "min": "10", "max": "29", "mean": "20", "stdev": "5" }, { "name": "egdd", "index": "9", "type": "numeric", "distinct": "13", "missing": "0", "min": "601", "max": "953", "mean": "721", "stdev": "105" }, { "name": "fgdd", "index": "10", "type": "numeric", "distinct": "13", "missing": "0", "min": "190", "max": "542", "mean": "342", "stdev": "95" }, { "name": "groundspot_a*", "index": "11", "type": "numeric", "distinct": "51", "missing": "1", "min": "-7", "max": "19", "mean": "8", "stdev": "7" }, { "name": "glucose", "index": "12", "type": "numeric", "distinct": "51", "missing": "1", "min": "5", "max": "24", "mean": "14", "stdev": "5" }, { "name": "fructose", "index": "13", "type": "numeric", "distinct": "50", "missing": "1", "min": "4", "max": "21", "mean": "13", "stdev": "4" }, { "name": "sucrose", "index": "14", "type": "numeric", "distinct": "51", "missing": "1", "min": "5", "max": "46", "mean": "28", "stdev": "11" }, { "name": "total", "index": "15", "type": "numeric", "distinct": "51", "missing": "1", "min": "29", "max": "70", "mean": "55", "stdev": "9" }, { "name": "glucose+fructose", "index": "16", "type": "numeric", "distinct": "50", "missing": "1", "min": "9", "max": "45", "mean": "27", "stdev": "9" }, { "name": "starch", "index": "17", "type": "numeric", "distinct": "51", "missing": "1", "min": "28", "max": "175", "mean": "89", "stdev": "35" }, { "name": "sweetness", "index": "18", "type": "numeric", "distinct": "51", "missing": "0", "min": "320", "max": "956", "mean": "686", "stdev": "182" }, { "name": "flavour", "index": "19", "type": "numeric", "distinct": "52", "missing": "0", "min": "416", "max": "1009", "mean": "752", "stdev": "157" }, { "name": "dry\/moist", "index": "20", "type": "numeric", "distinct": "51", "missing": "0", "min": "188", "max": "949", "mean": "595", "stdev": "207" }, { "name": "fibre", "index": "21", "type": "numeric", "distinct": "52", "missing": "0", "min": "68", "max": "686", "mean": "245", "stdev": "155" }, { "name": "heat_input_emerg", "index": "22", "type": "numeric", "distinct": "14", "missing": "0", "min": "721", "max": "1087", "mean": "905", "stdev": "91" }, { "name": "heat_input_flower", "index": "23", "type": "numeric", "distinct": "14", "missing": "0", "min": "386", "max": "738", "mean": "536", "stdev": "93" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 0, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 0 }