{ "data_id": "42897", "name": "bias-correction", "exact_name": "bias-correction", "version": 1, "version_label": "1", "description": "**Author**: Dongjin Cho,Cheolhee Yoo\n**Source**: [UCI](https:\/\/archive.ics.uci.edu\/ml\/datasets\/Bias+correction+of+numerical+prediction+model+temperature+forecast) - 2020\n**Please cite**: [Paper](https:\/\/www.semanticscholar.org\/paper\/Comparative-Assessment-of-Various-Machine-Bias-for-Cho-Yoo\/a24c181f054e35e8caf4fff78949f20569cb11b2\n\n**Bias correction of numerical prediction model temperature forecast dataset**\n\nThis data is for the purpose of bias correction of next-day maximum and minimum air temperatures forecast of the LDAPS model operated by the Korea Meteorological Administration over Seoul, South Korea. This data consists of summer data from 2013 to 2017. The input data is largely composed of the LDAPS model's next-day forecast data, in-situ maximum and minimum temperatures of present-day, and geographic auxiliary variables. There are two outputs (i.e. next-day maximum and minimum air temperatures) in this data. Hindcast validation was conducted for the period from 2015 to 2017.\n\n### Attribute information\n\n1. station - used weather station number: 1 to 25\n2. Date - Present day: yyyy-mm-dd ('2013-06-30' to '2017-08-30')\n3. Present_Tmax - Maximum air temperature between 0 and 21 h on the present day (degrees Celsius): 20 to 37.6\n4. Present_Tmin - Minimum air temperature between 0 and 21 h on the present day (degrees Celsius): 11.3 to 29.9\n5. LDAPS_RHmin - LDAPS model forecast of next-day minimum relative humidity (%): 19.8 to 98.5\n6. LDAPS_RHmax - LDAPS model forecast of next-day maximum relative humidity (%): 58.9 to 100\n7. LDAPS_Tmax_lapse - LDAPS model forecast of next-day maximum air temperature applied lapse rate (degrees Celsius): 17.6 to 38.5\n8. LDAPS_Tmin_lapse - LDAPS model forecast of next-day minimum air temperature applied lapse rate (degrees Celsius): 14.3 to 29.6\n9. LDAPS_WS - LDAPS model forecast of next-day average wind speed (m\/s): 2.9 to 21.9\n10. LDAPS_LH - LDAPS model forecast of next-day average latent heat flux (W\/m2): -13.6 to 213.4\n11. LDAPS_CC1 - LDAPS model forecast of next-day 1st 6-hour split average cloud cover (0-5 h) (%): 0 to 0.97\n12. LDAPS_CC2 - LDAPS model forecast of next-day 2nd 6-hour split average cloud cover (6-11 h) (%): 0 to 0.97\n13. LDAPS_CC3 - LDAPS model forecast of next-day 3rd 6-hour split average cloud cover (12-17 h) (%): 0 to 0.98\n14. LDAPS_CC4 - LDAPS model forecast of next-day 4th 6-hour split average cloud cover (18-23 h) (%): 0 to 0.97\n15. LDAPS_PPT1 - LDAPS model forecast of next-day 1st 6-hour split average precipitation (0-5 h) (%): 0 to 23.7\n16. LDAPS_PPT2 - LDAPS model forecast of next-day 2nd 6-hour split average precipitation (6-11 h) (%): 0 to 21.6\n17. LDAPS_PPT3 - LDAPS model forecast of next-day 3rd 6-hour split average precipitation (12-17 h) (%): 0 to 15.8\n18. LDAPS_PPT4 - LDAPS model forecast of next-day 4th 6-hour split average precipitation (18-23 h) (%): 0 to 16.7\n19. lat - Latitude (degrees): 37.456 to 37.645\n20. lon - Longitude (degrees): 126.826 to 127.135\n21. DEM - Elevation (m): 12.4 to 212.3\n22. Slope - Slope (degrees): 0.1 to 5.2\n23. Solar radiation - Daily incoming solar radiation (wh\/m2): 4329.5 to 5992.9\n24. Next_Tmax - The next-day maximum air temperature (degrees Celsius): 17.4 to 38.9\n25. Next_Tmin - The next-day minimum air temperature (degrees Celsius): 11.3 to 29.8\n\n", "format": "arff", "uploader": "Meilina Reksoprodjo", "uploader_id": 24140, "visibility": "public", "creator": "Dongjin Cho,Cheolhee Yoo", "contributor": null, "date": "2021-05-19 18:26:05", "update_comment": null, "last_update": "2021-05-19 18:26:05", "licence": "CC-BY", "status": "active", "error_message": null, "url": "https:\/\/www.openml.org\/data\/download\/22045528\/dataset", "default_target_attribute": null, "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "bias-correction", "This data is for the purpose of bias correction of next-day maximum and minimum air temperatures forecast of the LDAPS model operated by the Korea Meteorological Administration over Seoul, South Korea. This data consists of summer data from 2013 to 2017. The input data is largely composed of the LDAPS model's next-day forecast data, in-situ maximum and minimum temperatures of present-day, and geographic auxiliary variables. There are two outputs (i.e. next-day maximum and minimum air temperature " ], "weight": 5 }, "qualities": { "NumberOfInstances": 7752, "NumberOfFeatures": 25, "NumberOfClasses": null, "NumberOfMissingValues": 1248, "NumberOfInstancesWithMissingValues": 164, "NumberOfNumericFeatures": 24, "NumberOfSymbolicFeatures": 0, "Dimensionality": 0.003224974200206398, "PercentageOfNumericFeatures": 96, "MajorityClassPercentage": null, "PercentageOfSymbolicFeatures": 0, "MajorityClassSize": null, "MinorityClassPercentage": null, "MinorityClassSize": null, "NumberOfBinaryFeatures": 0, "PercentageOfBinaryFeatures": 0, "PercentageOfInstancesWithMissingValues": 2.1155830753353975, "AutoCorrelation": null, "PercentageOfMissingValues": 0.6439628482972136 }, "tags": [ { "uploader": "38960", "tag": "Machine Learning" }, { "uploader": "38960", "tag": "Manufacturing" } ], "features": [ { "name": "station", "index": "0", "type": "numeric", "distinct": "25", "missing": "2", "min": "1", "max": "25", "mean": "13", "stdev": "7" }, { "name": "Date", "index": "1", "type": "string", "distinct": "310", "missing": "2" }, { "name": "Present_Tmax", "index": "2", "type": "numeric", "distinct": "167", "missing": "70", "min": "20", "max": "38", "mean": "30", "stdev": "3" }, { "name": "Present_Tmin", "index": "3", "type": "numeric", "distinct": "155", "missing": "70", "min": "11", "max": "30", "mean": "23", "stdev": "2" }, { "name": "LDAPS_RHmin", "index": "4", "type": "numeric", "distinct": "7672", "missing": "75", "min": "20", "max": "99", "mean": "57", "stdev": "15" }, { "name": "LDAPS_RHmax", "index": "5", "type": "numeric", "distinct": "7664", "missing": "75", "min": "59", "max": "100", "mean": "88", "stdev": "7" }, { "name": "LDAPS_Tmax_lapse", "index": "6", "type": "numeric", "distinct": "7675", "missing": "75", "min": "18", "max": "39", "mean": "30", "stdev": "3" }, { "name": "LDAPS_Tmin_lapse", "index": "7", "type": "numeric", "distinct": "7675", "missing": "75", "min": "14", "max": "30", "mean": "24", "stdev": "2" }, { "name": "LDAPS_WS", "index": "8", "type": "numeric", "distinct": "7675", "missing": "75", "min": "3", "max": "22", "mean": "7", "stdev": "2" }, { "name": "LDAPS_LH", "index": "9", "type": "numeric", "distinct": "7675", "missing": "75", "min": "-14", "max": "213", "mean": "63", "stdev": "34" }, { "name": "LDAPS_CC1", "index": "10", "type": "numeric", "distinct": "7569", "missing": "75", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "LDAPS_CC2", "index": "11", "type": "numeric", "distinct": "7582", "missing": "75", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "LDAPS_CC3", "index": "12", "type": "numeric", "distinct": "7599", "missing": "75", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "LDAPS_CC4", "index": "13", "type": "numeric", "distinct": "7524", "missing": "75", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "LDAPS_PPT1", "index": "14", "type": "numeric", "distinct": "2812", "missing": "75", "min": "0", "max": "24", "mean": "1", "stdev": "2" }, { "name": "LDAPS_PPT2", "index": "15", "type": "numeric", "distinct": "2510", "missing": "75", "min": "0", "max": "22", "mean": "0", "stdev": "2" }, { "name": "LDAPS_PPT3", "index": "16", "type": "numeric", "distinct": "2356", "missing": "75", "min": "0", "max": "16", "mean": "0", "stdev": "1" }, { "name": "LDAPS_PPT4", "index": "17", "type": "numeric", "distinct": "1918", "missing": "75", "min": "0", "max": "17", "mean": "0", "stdev": "1" }, { "name": "lat", "index": "18", "type": "numeric", "distinct": "12", "missing": "0", "min": "37", "max": "38", "mean": "38", "stdev": "0" }, { "name": "lon", "index": "19", "type": "numeric", "distinct": "25", "missing": "0", "min": "127", "max": "127", "mean": "127", "stdev": "0" }, { "name": "DEM", "index": "20", "type": "numeric", "distinct": "25", "missing": "0", "min": "12", "max": "212", "mean": "62", "stdev": "54" }, { "name": "Slope", "index": "21", "type": "numeric", "distinct": "27", "missing": "0", "min": "0", "max": "5", "mean": "1", "stdev": "1" }, { "name": "Solar radiation", "index": "22", "type": "numeric", "distinct": "1575", "missing": "0", "min": "4330", "max": "5993", "mean": "5342", "stdev": "429" }, { "name": "Next_Tmax", "index": "23", "type": "numeric", "distinct": "183", "missing": "27", "min": "17", "max": "39", "mean": "30", "stdev": "3" }, { "name": "Next_Tmin", "index": "24", "type": "numeric", "distinct": "157", "missing": "27", "min": "11", "max": "30", "mean": "23", "stdev": "2" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 0, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 0 }