{ "data_id": "1487", "name": "ozone-level-8hr", "exact_name": "ozone-level-8hr", "version": 1, "version_label": null, "description": "**Author**: Kun Zhang, Wei Fan, XiaoJing Yuan\r\n\r\n**Source**: [UCI](https:\/\/archive.ics.uci.edu\/ml\/datasets\/ozone+level+detection)\r\n\r\n**Please cite**: \r\n\r\nForecasting skewed biased stochastic ozone days: analyses, solutions and beyond, Knowledge and Information Systems, Vol. 14, No. 3, 2008. \r\n\r\n\r\n1 . Abstract: \r\nTwo ground ozone level data sets are included in this collection. One is the eight hour peak set (eighthr.data), the other is the one hour peak set (onehr.data). Those data were collected from 1998 to 2004 at the Houston, Galveston and Brazoria area.\r\n\r\n2. Source:\r\n\r\nKun Zhang, zhang.kun05 '@' gmail.com, Department of Computer Science, Xavier University of Lousiana \r\nWei Fan, wei.fan '@' gmail.com, IBM T.J.Watson Research \r\nXiaoJing Yuan, xyuan '@' uh.edu, Engineering Technology Department, College of Technology, University of Houston \r\n\r\n\r\n3. Data Set Information:\r\n\r\nAll the attribute start with T means the temperature measured at different time throughout the day; and those starts with WS indicate the wind speed at various time. \r\n\r\nWSR_PK: continuous. peek wind speed -- resultant (meaning average of wind vector) \r\nWSR_AV: continuous. average wind speed \r\nT_PK: continuous. Peak T \r\nT_AV: continuous. Average T \r\nT85: continuous. T at 850 hpa level (or about 1500 m height) \r\nRH85: continuous. Relative Humidity at 850 hpa \r\nU85: continuous. (U wind - east-west direction wind at 850 hpa) \r\nV85: continuous. V wind - N-S direction wind at 850 \r\nHT85: continuous. Geopotential height at 850 hpa, it is about the same as height at low altitude \r\nT70: continuous. T at 700 hpa level (roughly 3100 m height) \r\n\r\nRH70: continuous. \r\nU70: continuous. \r\nV70: continuous. \r\nHT70: continuous. \r\n\r\nT50: continuous. T at 500 hpa level (roughly at 5500 m height) \r\n\r\nRH50: continuous. \r\nU50: continuous. \r\nV50: continuous. \r\nHT50: continuous. \r\n\r\nKI: continuous. K-Index [Web Link] \r\nTT: continuous. T-Totals [Web Link] \r\nSLP: continuous. Sea level pressure \r\nSLP_: continuous. SLP change from previous day \r\n\r\nPrecp: continuous. -- precipitation\r\n\r\n\r\n4. Attribute Information:\r\n\r\nThe following are specifications for several most important attributes that are highly valued by Texas Commission on Environmental Quality (TCEQ). More details can be found in the two relevant papers. \r\n\r\nO 3 - Local ozone peak prediction \r\nUpwind - Upwind ozone background level \r\nEmFactor - Precursor emissions related factor \r\nTmax - Maximum temperature in degrees F \r\nTb - Base temperature where net ozone production begins (50 F) \r\nSRd - Solar radiation total for the day \r\nWSa - Wind speed near sunrise (using 09-12 UTC forecast mode) \r\nWSp - Wind speed mid-day (using 15-21 UTC forecast mode) \r\n\r\n\r\n5. Relevant Papers:\r\n\r\nForecasting skewed biased stochastic ozone days: analyses, solutions and beyond, Knowledge and Information Systems, Vol. 14, No. 3, 2008. \r\n\r\nIt Discusses details about the dataset, its use as well as various experiments (both cross-validation and streaming) using many state-of-the-art methods. \r\nA shorter version of the paper (does not contain some detailed experiments as the journal paper above) is in: \r\nForecasting Skewed Biased Stochastic Ozone Days: Analyses and Solutions. ICDM 2006: 753-764 \r\n\r\n", "format": "ARFF", "uploader": "Rafael Gomes Mantovani", "uploader_id": 64, "visibility": "public", "creator": null, "contributor": null, "date": "2015-05-25 19:22:29", "update_comment": null, "last_update": "2015-11-09 20:25:43", "licence": "Public", "status": "active", "error_message": null, "url": "https:\/\/www.openml.org\/data\/download\/1592279\/phpdReP6S", "default_target_attribute": "Class", "row_id_attribute": null, "ignore_attribute": null, "runs": 188264, "suggest": { "input": [ "ozone-level-8hr", "Forecasting skewed biased stochastic ozone days: analyses, solutions and beyond, Knowledge and Information Systems, Vol. 14, No. 3, 2008. 1 . Abstract: Two ground ozone level data sets are included in this collection. One is the eight hour peak set (eighthr.data), the other is the one hour peak set (onehr.data). Those data were collected from 1998 to 2004 at the Houston, Galveston and Brazoria area. 2. Source: Kun Zhang, zhang.kun05 '@' gmail.com, Department of Computer Science, Xavier Universit " ], "weight": 5 }, "qualities": { "NumberOfInstances": 2534, "NumberOfFeatures": 73, "NumberOfClasses": 2, "NumberOfMissingValues": 0, "NumberOfInstancesWithMissingValues": 0, "NumberOfNumericFeatures": 72, "NumberOfSymbolicFeatures": 1, "MajorityClassPercentage": 93.68587213891081, "MeanStdDevOfNumericAtts": 7.570160866371553, "Quartile2AttributeEntropy": null, "REPTreeDepth2ErrRate": 0.06314127861089187, "CfsSubsetEval_kNN1NKappa": 0.27471774377773167, "kNN1NErrRate": 0.07971586424625099, "MajorityClassSize": 2374, "MinAttributeEntropy": null, "Quartile2KurtosisOfNumericAtts": 0.2058839276126998, "REPTreeDepth2Kappa": 0, "ClassEntropy": 0.3397904253160801, "kNN1NKappa": 0.31421390253967035, "MaxAttributeEntropy": null, "MinKurtosisOfNumericAtts": -0.9765331961089503, "Quartile2MeansOfNumericAtts": 4.815862994080504, "REPTreeDepth3AUC": 0.5, "DecisionStumpAUC": 0.7327664279696714, "MaxKurtosisOfNumericAtts": 77.65898710476209, "MinMeansOfNumericAtts": -10.511409688239937, "Quartile2MutualInformation": null, "REPTreeDepth3ErrRate": 0.06314127861089187, "DecisionStumpErrRate": 0.06314127861089187, "MaxMeansOfNumericAtts": 10164.198441985003, "MinMutualInformation": null, "Quartile2SkewnessOfNumericAtts": 0.0471494119269262, "REPTreeDepth3Kappa": 0, "DecisionStumpKappa": 0, "MaxMutualInformation": null, "MinNominalAttDistinctValues": 2, "PercentageOfBinaryFeatures": 1.36986301369863, "Quartile2StdDevOfNumericAtts": 6.228337109415218, "RandomTreeDepth1AUC": 0.6752435235888795, "Dimensionality": 0.028808208366219414, "MaxNominalAttDistinctValues": 2, "MinSkewnessOfNumericAtts": -1.3083589753871372, "PercentageOfInstancesWithMissingValues": 0, "Quartile3AttributeEntropy": null, "RandomTreeDepth1ErrRate": 0.06314127861089187, "EquivalentNumberOfAtts": null, "MaxSkewnessOfNumericAtts": 7.37646091154844, "MinStdDevOfNumericAtts": 0.24285789396290183, "PercentageOfMissingValues": 0, "Quartile3KurtosisOfNumericAtts": 1.0593317075159525, "AutoCorrelation": 0.9155151993683379, "RandomTreeDepth1Kappa": 0, "J48.00001.AUC": 0.5, "MaxStdDevOfNumericAtts": 77.4101100876053, "MinorityClassPercentage": 6.3141278610891876, "PercentageOfNumericFeatures": 98.63013698630137, "Quartile3MeansOfNumericAtts": 20.78420038506314, "CfsSubsetEval_DecisionStumpAUC": 0.7327664279696714, "RandomTreeDepth2AUC": 0.7445108466722831, "J48.00001.ErrRate": 0.06314127861089187, "MeanAttributeEntropy": null, "MinorityClassSize": 160, "PercentageOfSymbolicFeatures": 1.36986301369863, "Quartile3MutualInformation": null, "CfsSubsetEval_DecisionStumpErrRate": 0.06314127861089187, "RandomTreeDepth2ErrRate": 0.06314127861089187, "J48.00001.Kappa": 0, "MeanKurtosisOfNumericAtts": 1.4930547521290494, "NaiveBayesAUC": 0.8217510278493866, "Quartile1AttributeEntropy": null, "Quartile3SkewnessOfNumericAtts": 0.5116487297426875, "CfsSubsetEval_DecisionStumpKappa": 0, "RandomTreeDepth2Kappa": 0, "J48.0001.AUC": 0.5, "MeanMeansOfNumericAtts": 296.494606499841, "NaiveBayesErrRate": 0.30386740331491713, "Quartile1KurtosisOfNumericAtts": -0.260168431307105, "Quartile3StdDevOfNumericAtts": 7.043592907600336, "CfsSubsetEval_NaiveBayesAUC": 0.8579034669356066, "RandomTreeDepth3AUC": 0.7811354780960404, "J48.0001.ErrRate": 0.06314127861089187, "MeanMutualInformation": null, "NaiveBayesKappa": 0.16475175167633535, "Quartile1MeansOfNumericAtts": 1.8374475797158643, "REPTreeDepth1AUC": 0.5, "CfsSubsetEval_NaiveBayesErrRate": 0.2490134175217048, "RandomTreeDepth3ErrRate": 0.06393054459352802, "J48.0001.Kappa": 0, "MeanNoiseToSignalRatio": null, "NumberOfBinaryFeatures": 1, "Quartile1MutualInformation": null, "REPTreeDepth1ErrRate": 0.06314127861089187, "CfsSubsetEval_NaiveBayesKappa": 0.20648153018980478, "RandomTreeDepth3Kappa": 0.009143221271747803, "J48.001.AUC": 0.5, "MeanNominalAttDistinctValues": 2, "Quartile1SkewnessOfNumericAtts": -0.6536046299943934, "REPTreeDepth1Kappa": 0, "CfsSubsetEval_kNN1NAUC": 0.6374526116259478, "StdvNominalAttDistinctValues": 0, "J48.001.ErrRate": 0.06314127861089187, "J48.001.Kappa": 0, "MeanSkewnessOfNumericAtts": 0.08595601481421308, "Quartile1StdDevOfNumericAtts": 1.1660936206969923, "REPTreeDepth2AUC": 0.5, "CfsSubsetEval_kNN1NErrRate": 0.0840568271507498, "kNN1NAUC": 0.6566870261162595 }, "tags": [ { "tag": "Chemistry", "uploader": "38960" }, { "tag": "Machine Learning", "uploader": "38960" }, { "tag": "OpenML-CC18", "uploader": "1" }, { "tag": "OpenML100", "uploader": "348" }, { "tag": "study_123", "uploader": "3886" }, { "tag": "study_14", "uploader": "64" }, { "tag": "study_34", "uploader": "1" }, { "tag": "study_50", "uploader": "64" }, { "tag": "study_52", "uploader": "64" }, { "tag": "study_7", "uploader": "64" }, { "tag": "study_98", "uploader": "1935" }, { "tag": "study_99", "uploader": "1" }, { "tag": "study_225", "uploader": "0" }, { "tag": "study_236", "uploader": "0" }, { "tag": "study_293", "uploader": "0" }, { "tag": "study_270", "uploader": "0" }, { "tag": "study_271", "uploader": "0" }, { "tag": "study_253", "uploader": "0" }, { "tag": "study_446", "uploader": "0" }, { "tag": "study_447", "uploader": "0" }, { "tag": "study_448", "uploader": "0" }, { "tag": "study_449", "uploader": "0" }, { "tag": "study_275", "uploader": "0" } ], "features": [ { "name": "Class", "index": "72", "type": "nominal", "distinct": "2", "missing": "0", "target": "1", "distr": [ [ "1", "2" ], [ [ "2374", "0" ], [ "0", "160" ] ] ] }, { "name": "V1", "index": "0", "type": "numeric", "distinct": "69", "missing": "0", "min": "0", "max": "8", "mean": "2", "stdev": "1" }, { "name": "V2", "index": "1", "type": "numeric", "distinct": "71", "missing": "0", "min": "0", "max": "8", "mean": "2", "stdev": "1" }, { "name": "V3", "index": "2", "type": "numeric", "distinct": "66", "missing": "0", "min": "0", "max": "7", "mean": "2", "stdev": "1" }, { "name": "V4", "index": "3", "type": "numeric", "distinct": "67", "missing": "0", "min": "0", "max": "7", "mean": "2", "stdev": "1" }, { "name": "V5", "index": "4", "type": "numeric", "distinct": "65", "missing": "0", "min": "0", "max": "7", "mean": "2", "stdev": "1" }, { "name": "V6", "index": "5", "type": "numeric", "distinct": "64", "missing": "0", "min": "0", "max": "7", "mean": "2", "stdev": "1" }, { "name": "V7", "index": "6", "type": "numeric", "distinct": "67", "missing": "0", "min": "0", "max": "7", "mean": "2", "stdev": "1" }, { "name": "V8", "index": "7", "type": "numeric", "distinct": "68", "missing": "0", "min": "0", "max": "8", "mean": "2", "stdev": "1" }, { "name": "V9", "index": "8", "type": "numeric", "distinct": "70", "missing": "0", "min": "0", "max": "9", "mean": "3", "stdev": "1" }, { "name": "V10", "index": "9", "type": "numeric", "distinct": "71", "missing": "0", "min": "0", "max": "9", "mean": "3", "stdev": "1" }, { "name": "V11", "index": "10", "type": "numeric", "distinct": "77", "missing": "0", "min": "0", "max": "9", "mean": "3", "stdev": "1" }, { "name": "V12", "index": "11", "type": "numeric", "distinct": "78", "missing": "0", "min": "0", "max": "9", "mean": "3", "stdev": "1" }, { "name": "V13", "index": "12", "type": "numeric", "distinct": "78", "missing": "0", "min": "0", "max": "9", "mean": "3", "stdev": "1" }, { "name": "V14", "index": "13", "type": "numeric", "distinct": "79", "missing": "0", "min": "0", "max": "10", "mean": "3", "stdev": "1" }, { "name": "V15", "index": "14", "type": "numeric", "distinct": "78", "missing": "0", "min": "0", "max": "9", "mean": "3", "stdev": "1" }, { "name": "V16", "index": "15", "type": "numeric", "distinct": "79", "missing": "0", "min": "0", "max": "9", "mean": "3", "stdev": "1" }, { "name": "V17", "index": "16", "type": "numeric", "distinct": "73", "missing": "0", "min": "0", "max": "9", "mean": "3", "stdev": "1" }, { "name": "V18", "index": "17", "type": "numeric", "distinct": "74", "missing": "0", "min": "0", "max": "8", "mean": "3", "stdev": "1" }, { "name": "V19", "index": "18", "type": "numeric", "distinct": "71", "missing": "0", "min": "0", "max": "8", "mean": "3", "stdev": "1" }, { "name": "V20", "index": "19", "type": "numeric", "distinct": "66", "missing": "0", "min": "0", "max": "7", "mean": "2", "stdev": "1" }, { "name": "V21", "index": "20", "type": "numeric", "distinct": "69", "missing": "0", "min": "0", "max": "9", "mean": "2", "stdev": "1" }, { "name": "V22", "index": "21", "type": "numeric", "distinct": "70", "missing": "0", "min": "0", "max": "9", "mean": "2", "stdev": "1" }, { "name": "V23", "index": "22", "type": "numeric", "distinct": "69", "missing": "0", "min": "0", "max": "8", "mean": "2", "stdev": "1" }, { "name": "V24", "index": "23", "type": "numeric", "distinct": "66", "missing": "0", "min": "0", "max": "8", "mean": "2", "stdev": "1" }, { "name": "V25", "index": "24", "type": "numeric", "distinct": "75", "missing": "0", "min": "1", "max": "10", "mean": "4", "stdev": "1" }, { "name": "V26", "index": "25", "type": "numeric", "distinct": "56", "missing": "0", "min": "0", "max": "6", "mean": "2", "stdev": "1" }, { "name": "V27", "index": "26", "type": "numeric", "distinct": "283", "missing": "0", "min": "-2", "max": "30", "mean": "19", "stdev": "7" }, { "name": "V28", "index": "27", "type": "numeric", "distinct": "285", "missing": "0", "min": "-2", "max": "29", "mean": "18", "stdev": "7" }, { "name": "V29", "index": "28", "type": "numeric", "distinct": "288", "missing": "0", "min": "-3", "max": "29", "mean": "18", "stdev": "7" }, { "name": "V30", "index": "29", "type": "numeric", "distinct": "284", "missing": "0", "min": "-3", "max": "28", "mean": "18", "stdev": "7" }, { "name": "V31", "index": "30", "type": "numeric", "distinct": "284", "missing": "0", "min": "-3", "max": "28", "mean": "18", "stdev": "7" }, { "name": "V32", "index": "31", "type": "numeric", "distinct": "292", "missing": "0", "min": "-4", "max": "28", "mean": "17", "stdev": "7" }, { "name": "V33", "index": "32", "type": "numeric", "distinct": "296", "missing": "0", "min": "-3", "max": "29", "mean": "18", "stdev": "7" }, { "name": "V34", "index": "33", "type": "numeric", "distinct": "312", "missing": "0", "min": "-3", "max": "30", "mean": "18", "stdev": "8" }, { "name": "V35", "index": "34", "type": "numeric", "distinct": "314", "missing": "0", "min": "-2", "max": "31", "mean": "20", "stdev": "8" }, { "name": "V36", "index": "35", "type": "numeric", "distinct": "315", "missing": "0", "min": "-1", "max": "34", "mean": "21", "stdev": "7" }, { "name": "V37", "index": "36", "type": "numeric", "distinct": "328", "missing": "0", "min": "-1", "max": "36", "mean": "22", "stdev": "7" }, { "name": "V38", "index": "37", "type": "numeric", "distinct": "331", "missing": "0", "min": "0", "max": "39", "mean": "23", "stdev": "7" }, { "name": "V39", "index": "38", "type": "numeric", "distinct": "335", "missing": "0", "min": "0", "max": "40", "mean": "24", "stdev": "7" }, { "name": "V40", "index": "39", "type": "numeric", "distinct": "336", "missing": "0", "min": "1", "max": "41", "mean": "24", "stdev": "7" }, { "name": "V41", "index": "40", "type": "numeric", "distinct": "336", "missing": "0", "min": "2", "max": "42", "mean": "25", "stdev": "7" }, { "name": "V42", "index": "41", "type": "numeric", "distinct": "340", "missing": "0", "min": "2", "max": "41", "mean": "25", "stdev": "7" }, { "name": "V43", "index": "42", "type": "numeric", "distinct": "338", "missing": "0", "min": "1", "max": "41", "mean": "24", "stdev": "7" }, { "name": "V44", "index": "43", "type": "numeric", "distinct": "330", "missing": "0", "min": "-1", "max": "40", "mean": "24", "stdev": "7" }, { "name": "V45", "index": "44", "type": "numeric", "distinct": "322", "missing": "0", "min": "0", "max": "38", "mean": "23", "stdev": "7" }, { "name": "V46", "index": "45", "type": "numeric", "distinct": "307", "missing": "0", "min": "0", "max": "36", "mean": "21", "stdev": "7" }, { "name": "V47", "index": "46", "type": "numeric", "distinct": "303", "missing": "0", "min": "0", "max": "35", "mean": "21", "stdev": "7" }, { "name": "V48", "index": "47", "type": "numeric", "distinct": "295", "missing": "0", "min": "0", "max": "33", "mean": "20", "stdev": "7" }, { "name": "V49", "index": "48", "type": "numeric", "distinct": "288", "missing": "0", "min": "-1", "max": "33", "mean": "20", "stdev": "7" }, { "name": "V50", "index": "49", "type": "numeric", "distinct": "285", "missing": "0", "min": "-1", "max": "31", "mean": "19", "stdev": "7" }, { "name": "V51", "index": "50", "type": "numeric", "distinct": "331", "missing": "0", "min": "2", "max": "42", "mean": "26", "stdev": "7" }, { "name": "V52", "index": "51", "type": "numeric", "distinct": "297", "missing": "0", "min": "0", "max": "34", "mean": "21", "stdev": "7" }, { "name": "V53", "index": "52", "type": "numeric", "distinct": "252", "missing": "0", "min": "-7", "max": "25", "mean": "14", "stdev": "5" }, { "name": "V54", "index": "53", "type": "numeric", "distinct": "101", "missing": "0", "min": "0", "max": "1", "mean": "1", "stdev": "0" }, { "name": "V55", "index": "54", "type": "numeric", "distinct": "1289", "missing": "0", "min": "-16", "max": "19", "mean": "2", "stdev": "5" }, { "name": "V56", "index": "55", "type": "numeric", "distinct": "1462", "missing": "0", "min": "-18", "max": "22", "mean": "2", "stdev": "6" }, { "name": "V57", "index": "56", "type": "numeric", "distinct": "369", "missing": "0", "min": "1351", "max": "1642", "mean": "1531", "stdev": "36" }, { "name": "V58", "index": "57", "type": "numeric", "distinct": "246", "missing": "0", "min": "-10", "max": "16", "mean": "6", "stdev": "4" }, { "name": "V59", "index": "58", "type": "numeric", "distinct": "101", "missing": "0", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "V60", "index": "59", "type": "numeric", "distinct": "1538", "missing": "0", "min": "-14", "max": "28", "mean": "5", "stdev": "6" }, { "name": "V61", "index": "60", "type": "numeric", "distinct": "1430", "missing": "0", "min": "-24", "max": "26", "mean": "1", "stdev": "6" }, { "name": "V62", "index": "61", "type": "numeric", "distinct": "442", "missing": "0", "min": "2919", "max": "3249", "mean": "3145", "stdev": "48" }, { "name": "V63", "index": "62", "type": "numeric", "distinct": "187", "missing": "0", "min": "-25", "max": "0", "mean": "-11", "stdev": "4" }, { "name": "V64", "index": "63", "type": "numeric", "distinct": "101", "missing": "0", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "V65", "index": "64", "type": "numeric", "distinct": "1688", "missing": "0", "min": "-15", "max": "42", "mean": "10", "stdev": "9" }, { "name": "V66", "index": "65", "type": "numeric", "distinct": "1510", "missing": "0", "min": "-26", "max": "30", "mean": "1", "stdev": "7" }, { "name": "V67", "index": "66", "type": "numeric", "distinct": "86", "missing": "0", "min": "5480", "max": "5965", "mean": "5819", "stdev": "77" }, { "name": "V68", "index": "67", "type": "numeric", "distinct": "1048", "missing": "0", "min": "-57", "max": "42", "mean": "11", "stdev": "20" }, { "name": "V69", "index": "68", "type": "numeric", "distinct": "658", "missing": "0", "min": "-10", "max": "59", "mean": "37", "stdev": "11" }, { "name": "V70", "index": "69", "type": "numeric", "distinct": "72", "missing": "0", "min": "9975", "max": "10350", "mean": "10164", "stdev": "51" }, { "name": "V71", "index": "70", "type": "numeric", "distinct": "57", "missing": "0", "min": "-135", "max": "140", "mean": "0", "stdev": "35" }, { "name": "V72", "index": "71", "type": "numeric", "distinct": "175", "missing": "0", "min": "0", "max": "21", "mean": "0", "stdev": "1" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 1, "nr_of_downloads": 20, "total_downloads": 27, "reach": 21, "reuse": 31, "impact_of_reuse": 0, "reach_of_reuse": 1, "impact": 31 }