{ "data_id": "44191", "name": "Diabetes(scikit-learn)", "exact_name": "Diabetes(scikit-learn)", "version": 12, "version_label": "test", "description": ".. _diabetes_dataset:\n\nDiabetes dataset\n----------------\n\nTen baseline variables, age, sex, body mass index, average blood\npressure, and six blood serum measurements were obtained for each of n =\n442 diabetes patients, as well as the response of interest, a\nquantitative measure of disease progression one year after baseline.\n\n**Data Set Characteristics:**\n\n :Number of Instances: 442\n\n :Number of Attributes: First 10 columns are numeric predictive values\n\n :Target: Column 11 is a quantitative measure of disease progression one year after baseline\n\n :Attribute Information:\n - age age in years\n - sex\n - bmi body mass index\n - bp average blood pressure\n - s1 tc, total serum cholesterol\n - s2 ldl, low-density lipoproteins\n - s3 hdl, high-density lipoproteins\n - s4 tch, total cholesterol \/ HDL\n - s5 ltg, possibly log of serum triglycerides level\n - s6 glu, blood sugar level\n\nNote: Each of these 10 feature variables have been mean centered and scaled by the standard deviation times the square root of `n_samples` (i.e. the sum of squares of each column totals 1).\n\nSource URL:\nhttps:\/\/www4.stat.ncsu.edu\/~boos\/var.select\/diabetes.html\n\nFor more information see:\nBradley Efron, Trevor Hastie, Iain Johnstone and Robert Tibshirani (2004) \"Least Angle Regression,\" Annals of Statistics (with discussion), 407-499.\n(https:\/\/web.stanford.edu\/~hastie\/Papers\/LARS\/LeastAngle_2002.pdf)", "format": "arff", "uploader": "Ilin Tolovski", "uploader_id": 31897, "visibility": "public", "creator": "\"Bradley Efron, Trevor Hastie, Iain Johnstone and Robert Tibshirani\"", "contributor": null, "date": "2022-08-26 16:23:16", "update_comment": null, "last_update": "2022-08-26 16:23:16", "licence": "BSD (from scikit-learn)", "status": "active", "error_message": null, "url": "https:\/\/api.openml.org\/data\/download\/22103384\/dataset", "default_target_attribute": "class", "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "Diabetes(scikit-learn)", ".. _diabetes_dataset: Diabetes dataset ---------------- Ten baseline variables, age, sex, body mass index, average blood pressure, and six blood serum measurements were obtained for each of n = 442 diabetes patients, as well as the response of interest, a quantitative measure of disease progression one year after baseline. :Number of Instances: 442 :Number of Attributes: First 10 columns are numeric predictive values :Target: Column 11 is a quantitative measure of disease progression one year af " ], "weight": 5 }, "qualities": { "NumberOfInstances": 442, "NumberOfFeatures": 11, "NumberOfClasses": 0, "NumberOfMissingValues": 0, "NumberOfInstancesWithMissingValues": 0, "NumberOfNumericFeatures": 11, "NumberOfSymbolicFeatures": 0, "PercentageOfInstancesWithMissingValues": 0, "AutoCorrelation": -84.63718820861678, "PercentageOfMissingValues": 0, "Dimensionality": 0.024886877828054297, "PercentageOfNumericFeatures": 100, "MajorityClassPercentage": null, "PercentageOfSymbolicFeatures": 0, "MajorityClassSize": null, "MinorityClassPercentage": null, "MinorityClassSize": null, "NumberOfBinaryFeatures": 0, "PercentageOfBinaryFeatures": 0 }, "tags": [ { "uploader": "38960", "tag": "Machine Learning" }, { "uploader": "38960", "tag": "Meteorology" } ], "features": [ { "name": "class", "index": "10", "type": "numeric", "distinct": "214", "missing": "0", "target": "1", "min": "25", "max": "346", "mean": "152", "stdev": "77" }, { "name": "age", "index": "0", "type": "numeric", "distinct": "58", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "sex", "index": "1", "type": "numeric", "distinct": "2", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "bmi", "index": "2", "type": "numeric", "distinct": "163", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "bp", "index": "3", "type": "numeric", "distinct": "100", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "s1", "index": "4", "type": "numeric", "distinct": "141", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "s2", "index": "5", "type": "numeric", "distinct": "302", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "s3", "index": "6", "type": "numeric", "distinct": "63", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "s4", "index": "7", "type": "numeric", "distinct": "66", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "s5", "index": "8", "type": "numeric", "distinct": "184", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "s6", "index": "9", "type": "numeric", "distinct": "56", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 0, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 0 }