{ "data_id": "524", "name": "pbc", "exact_name": "pbc", "version": 2, "version_label": null, "description": "**Author**: \n**Source**: Unknown - Date unknown \n**Please cite**: \n\n------------------------------------------------------------------------\nPrimary Biliary Cirrhosis\n\nThe data set found in appendix D of Fleming and Harrington, Counting\nProcesses and Survival Analysis, Wiley, 1991. The only differences are:\nage is in days\nstatus is coded as 0=censored, 1=censored due to liver tx, 2=death\nthe sex and stage variables are not missing for obs 313-418\n\nQuoting from F&H. \"The following pages contain the data from the Mayo Clinic\ntrial in primary biliary cirrhosis (PBC) of the liver conducted between 1974\nand 1984. A description of the clinical background for the trial and the\ncovariates recorded here is in Chapter 0, especially Section 0.2. A more\nextended discussion can be found in Dickson, et al., Hepatology 10:1-7 (1989)\nand in Markus, et al., N Eng J of Med 320:1709-13 (1989).\n\"A total of 424 PBC patients, referred to Mayo Clinic during that ten-year\ninterval, met eligibility criteria for the randomized placebo controlled\ntrial of the drug D-penicillamine. The first 312 cases in the data set\nparticipated in the randomized trial and contain largely complete data. The\nadditional 112 cases did not participate in the clinical trial, but consented\nto have basic measurements recorded and to be followed for survival. Six of\nthose cases were lost to follow-up shortly after diagnosis, so the data here\nare on an additional 106 cases as well as the 312 randomized participants.\nMissing data items are denoted by `.'. \"\n\nVariables:\ncase number\nnumber of days between registration and the earlier of death,\ntransplantion, or study analysis time in July, 1986\nstatus\ndrug: 1= D-penicillamine, 2=placebo\nage in days\nsex: 0=male, 1=female\npresence of asictes: 0=no 1=yes\npresence of hepatomegaly 0=no 1=yes\npresence of spiders 0=no 1=yes\npresence of edema 0=no edema and no diuretic therapy for edema;\n.5 = edema present without diuretics, or edema resolved by diuretics;\n1 = edema despite diuretic therapy\nserum bilirubin in mg\/dl\nserum cholesterol in mg\/dl\nalbumin in gm\/dl\nurine copper in ug\/day\nalkaline phosphatase in U\/liter\nSGOT in U\/ml\ntriglicerides in mg\/dl\nplatelets per cubic ml \/ 1000\nprothrombin time in seconds\nhistologic stage of disease\n\n\nInformation about the dataset\nCLASSTYPE: numeric\nCLASSINDEX: 3", "format": "ARFF", "uploader": "Joaquin Vanschoren", "uploader_id": 2, "visibility": "public", "creator": null, "contributor": null, "date": "2014-09-29 00:07:54", "update_comment": null, "last_update": "2014-09-29 00:07:54", "licence": "Public", "status": "active", "error_message": null, "url": "https:\/\/www.openml.org\/data\/download\/52636\/pbc.arff", "kaggle_url": null, "default_target_attribute": "status", "row_id_attribute": null, "ignore_attribute": null, "runs": 18, "suggest": { "input": [ "pbc", "------------------------------------------------------------------------ Primary Biliary Cirrhosis The data set found in appendix D of Fleming and Harrington, Counting Processes and Survival Analysis, Wiley, 1991. The only differences are: age is in days status is coded as 0=censored, 1=censored due to liver tx, 2=death the sex and stage variables are not missing for obs 313-418 Quoting from F&H. \"The following pages contain the data from the Mayo Clinic trial in primary biliary cirrhosis (PBC) " ], "weight": 5 }, "qualities": { "NumberOfInstances": 418, "NumberOfFeatures": 20, "NumberOfClasses": 0, "NumberOfMissingValues": 1033, "NumberOfInstancesWithMissingValues": 142, "NumberOfNumericFeatures": 14, "NumberOfSymbolicFeatures": 6, "Quartile2AttributeEntropy": null, "REPTreeDepth2ErrRate": null, "CfsSubsetEval_kNN1NKappa": null, "kNN1NErrRate": null, "MajorityClassPercentage": null, "MeanStdDevOfNumericAtts": 551.9387034146282, "Quartile2KurtosisOfNumericAtts": 2.5875102077483723, "REPTreeDepth2Kappa": null, "ClassEntropy": null, "kNN1NKappa": null, "MajorityClassSize": null, "MinAttributeEntropy": null, "Quartile2MeansOfNumericAtts": 123.6292369067103, "REPTreeDepth3AUC": null, "DecisionStumpAUC": null, "MaxAttributeEntropy": null, "MinKurtosisOfNumericAtts": -1.8213677692237562, "Quartile2MutualInformation": null, "REPTreeDepth3ErrRate": null, "DecisionStumpErrRate": null, "MaxKurtosisOfNumericAtts": 14.337869865983695, "MinMeansOfNumericAtts": 0.8301435406698554, "Quartile2SkewnessOfNumericAtts": 1.0381473419798883, "REPTreeDepth3Kappa": null, "DecisionStumpKappa": null, "MaxMeansOfNumericAtts": 18533.351674641148, "MinMutualInformation": null, "PercentageOfBinaryFeatures": 25, "Quartile2StdDevOfNumericAtts": 75.38127928740545, "RandomTreeDepth1AUC": null, "Dimensionality": 0.04784688995215311, "MaxMutualInformation": null, "MinNominalAttDistinctValues": 2, "PercentageOfInstancesWithMissingValues": 33.97129186602871, "Quartile3AttributeEntropy": null, "RandomTreeDepth1ErrRate": null, "EquivalentNumberOfAtts": null, "MaxNominalAttDistinctValues": 3, "MinSkewnessOfNumericAtts": -0.49627341798543095, "PercentageOfMissingValues": 12.35645933014354, "Quartile3KurtosisOfNumericAtts": 9.757108279495345, "AutoCorrelation": 0.09352517985611511, "RandomTreeDepth1Kappa": null, "J48.00001.AUC": null, "MaxSkewnessOfNumericAtts": 3.408525855721415, "MinStdDevOfNumericAtts": 0.4249716057796193, "PercentageOfNumericFeatures": 70, "Quartile3MeansOfNumericAtts": 756.5784966978904, "CfsSubsetEval_DecisionStumpAUC": null, "RandomTreeDepth2AUC": null, "J48.00001.ErrRate": null, "MaxStdDevOfNumericAtts": 3815.8450545514706, "MinorityClassPercentage": null, "PercentageOfSymbolicFeatures": 30, "Quartile3MutualInformation": null, "CfsSubsetEval_DecisionStumpErrRate": null, "RandomTreeDepth2ErrRate": null, "J48.00001.Kappa": null, "MeanAttributeEntropy": null, "MinorityClassSize": null, "Quartile1AttributeEntropy": null, "Quartile3SkewnessOfNumericAtts": 2.5723295270284483, "CfsSubsetEval_DecisionStumpKappa": null, "RandomTreeDepth2Kappa": null, "J48.0001.AUC": null, "MeanKurtosisOfNumericAtts": 4.465463039070742, "NaiveBayesAUC": null, "Quartile1KurtosisOfNumericAtts": -0.6221360947775739, "Quartile3StdDevOfNumericAtts": 450.1266568760886, "CfsSubsetEval_NaiveBayesAUC": null, "RandomTreeDepth3AUC": null, "J48.0001.ErrRate": null, "MeanMeansOfNumericAtts": 1688.288295327197, "NaiveBayesErrRate": null, "Quartile1MeansOfNumericAtts": 3.4282834928229664, "REPTreeDepth1AUC": null, "CfsSubsetEval_NaiveBayesErrRate": null, "RandomTreeDepth3ErrRate": null, "J48.0001.Kappa": null, "MeanMutualInformation": null, "NaiveBayesKappa": null, "Quartile1MutualInformation": null, "REPTreeDepth1ErrRate": null, "CfsSubsetEval_NaiveBayesKappa": null, "RandomTreeDepth3Kappa": null, "J48.001.AUC": null, "MeanNoiseToSignalRatio": null, "NumberOfBinaryFeatures": 5, "Quartile1SkewnessOfNumericAtts": 0.06513761706304133, "REPTreeDepth1Kappa": null, "CfsSubsetEval_kNN1NAUC": null, "StdvNominalAttDistinctValues": 0.408248290463863, "J48.001.ErrRate": null, "MeanNominalAttDistinctValues": 2.1666666666666665, "Quartile1StdDevOfNumericAtts": 1.0054465977713705, "REPTreeDepth2AUC": null, "CfsSubsetEval_kNN1NErrRate": null, "kNN1NAUC": null, "J48.001.Kappa": null, "MeanSkewnessOfNumericAtts": 1.299082454333576 }, "tags": [ { "uploader": "38960", "tag": "Chronic Diseases" }, { "uploader": "38960", "tag": "Health" }, { "uploader": "38960", "tag": "Medicine" }, { "uploader": "7210", "tag": "survival" } ], "features": [ { "name": "status", "index": "2", "type": "numeric", "distinct": "3", "missing": "0", "target": "1", "min": "0", "max": "2", "mean": "1", "stdev": "1" }, { "name": "case_number", "index": "0", "type": "numeric", "distinct": "418", "missing": "0", "min": "1", "max": "418", "mean": "210", "stdev": "121" }, { "name": "number_of_days", "index": "1", "type": "numeric", "distinct": "399", "missing": "0", "min": "41", "max": "4795", "mean": "1918", "stdev": "1105" }, { "name": "drug", "index": "3", "type": "nominal", "distinct": "2", "missing": "106", "distr": [] }, { "name": "age", "index": "4", "type": "numeric", "distinct": "344", "missing": "0", "min": "9598", "max": "28650", "mean": "18533", "stdev": "3816" }, { "name": "sex", "index": "5", "type": "nominal", "distinct": "2", "missing": "0", "distr": [] }, { "name": "presence_of_asictes", "index": "6", "type": "nominal", "distinct": "2", "missing": "106", "distr": [] }, { "name": "presence_of_hepatomegaly", "index": "7", "type": "nominal", "distinct": "2", "missing": "106", "distr": [] }, { "name": "presence_of_spiders", "index": "8", "type": "nominal", "distinct": "2", "missing": "106", "distr": [] }, { "name": "presence_of_edema", "index": "9", "type": "nominal", "distinct": "3", "missing": "0", "distr": [] }, { "name": "serum_bilirubin", "index": "10", "type": "numeric", "distinct": "98", "missing": "0", "min": "0", "max": "28", "mean": "3", "stdev": "4" }, { "name": "serum_cholesterol", "index": "11", "type": "numeric", "distinct": "201", "missing": "134", "min": "120", "max": "1775", "mean": "370", "stdev": "232" }, { "name": "albumin", "index": "12", "type": "numeric", "distinct": "154", "missing": "0", "min": "2", "max": "5", "mean": "3", "stdev": "0" }, { "name": "urine_copper", "index": "13", "type": "numeric", "distinct": "158", "missing": "108", "min": "4", "max": "588", "mean": "98", "stdev": "86" }, { "name": "alkaline_phosphatase", "index": "14", "type": "numeric", "distinct": "295", "missing": "106", "min": "289", "max": "13862", "mean": "1983", "stdev": "2140" }, { "name": "SGOT", "index": "15", "type": "numeric", "distinct": "179", "missing": "106", "min": "26", "max": "457", "mean": "123", "stdev": "57" }, { "name": "triglicerides", "index": "16", "type": "numeric", "distinct": "146", "missing": "136", "min": "33", "max": "598", "mean": "125", "stdev": "65" }, { "name": "platelets", "index": "17", "type": "numeric", "distinct": "243", "missing": "11", "min": "62", "max": "721", "mean": "257", "stdev": "98" }, { "name": "prothrombin_time", "index": "18", "type": "numeric", "distinct": "48", "missing": "2", "min": "9", "max": "18", "mean": "11", "stdev": "1" }, { "name": "histologic_stage_of_disease", "index": "19", "type": "numeric", "distinct": "4", "missing": "6", "min": "1", "max": "4", "mean": "3", "stdev": "1" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 0, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 0 }