{ "data_id": "10", "name": "lymph", "exact_name": "lymph", "version": 1, "version_label": "1", "description": "**Author**: \n**Source**: Unknown - \n**Please cite**: \n\nCitation Request:\n This lymphography domain was obtained from the University Medical Centre,\n Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and \n M. Soklic for providing the data. Please include this citation if you plan\n to use this database.\n \n 1. Title: Lymphography Domain\n \n 2. Sources: \n (a) See Above.\n (b) Donors: Igor Kononenko, \n University E.Kardelj\n Faculty for electrical engineering\n Trzaska 25\n 61000 Ljubljana (tel.: (38)(+61) 265-161)\n \n Bojan Cestnik\n Jozef Stefan Institute\n Jamova 39\n 61000 Ljubljana\n Yugoslavia (tel.: (38)(+61) 214-399 ext.287) \n (c) Date: November 1988\n \n 3. Past Usage: (several)\n 1. Cestnik,G., Kononenko,I, & Bratko,I. (1987). Assistant-86: A\n Knowledge-Elicitation Tool for Sophisticated Users. In I.Bratko\n & N.Lavrac (Eds.) Progress in Machine Learning, 31-45, Sigma Press.\n -- Assistant-86: 76% accuracy\n 2. Clark,P. & Niblett,T. (1987). Induction in Noisy Domains. In\n I.Bratko & N.Lavrac (Eds.) Progress in Machine Learning, 11-30,\n Sigma Press.\n -- Simple Bayes: 83% accuracy\n -- CN2 (99% threshold): 82%\n 3. Michalski,R., Mozetic,I. Hong,J., & Lavrac,N. (1986). The Multi-Purpose\n Incremental Learning System AQ15 and its Testing Applications to Three\n Medical Domains. In Proceedings of the Fifth National Conference on\n Artificial Intelligence, 1041-1045. Philadelphia, PA: Morgan Kaufmann.\n -- Experts: 85% accuracy (estimate)\n -- AQ15: 80-82%\n \n 4. Relevant Information:\n This is one of three domains provided by the Oncology Institute\n that has repeatedly appeared in the machine learning literature.\n (See also breast-cancer and primary-tumor.)\n \n 5. Number of Instances: 148\n \n 6. Number of Attributes: 19 including the class attribute\n \n 7. 
Attribute information:\n --- NOTE: All attribute values in the database have been entered as\n numeric values corresponding to their index in the list\n of attribute values for that attribute domain as given below.\n 1. class: normal find, metastases, malign lymph, fibrosis\n 2. lymphatics: normal, arched, deformed, displaced\n 3. block of affere: no, yes\n 4. bl. of lymph. c: no, yes\n 5. bl. of lymph. s: no, yes\n 6. by pass: no, yes\n 7. extravasates: no, yes\n 8. regeneration of: no, yes\n 9. early uptake in: no, yes\n 10. lym.nodes dimin: 0-3\n 11. lym.nodes enlar: 1-4\n 12. changes in lym.: bean, oval, round\n 13. defect in node: no, lacunar, lac. marginal, lac. central\n 14. changes in node: no, lacunar, lac. margin, lac. central\n 15. changes in stru: no, grainy, drop-like, coarse, diluted, reticular, \n stripped, faint\n 16. special forms: no, chalices, vesicles\n 17. dislocation of: no, yes\n 18. exclusion of no: no, yes\n 19. no. of nodes in: 0-9, 10-19, 20-29, 30-39, 40-49, 50-59, 60-69, >=70\n \n 8. Missing Attribute Values: None\n \n 9. 
Class Distribution: \n Class: Number of Instances:\n normal find: 2\n metastases: 81\n malign lymph: 61\n fibrosis: 4\n \n \n\n\n\n\n Relabeled values in attribute 'lymphatics'\n From: '1' To: normal \n From: '2' To: arched \n From: '3' To: deformed \n From: '4' To: displaced \n\n\n Relabeled values in attribute 'block_of_affere'\n From: '1' To: no \n From: '2' To: yes \n\n\n Relabeled values in attribute 'bl_of_lymph_c'\n From: '1' To: no \n From: '2' To: yes \n\n\n Relabeled values in attribute 'bl_of_lymph_s'\n From: '1' To: no \n From: '2' To: yes \n\n\n Relabeled values in attribute 'by_pass'\n From: '1' To: no \n From: '2' To: yes \n\n\n Relabeled values in attribute 'extravasates'\n From: '1' To: no \n From: '2' To: yes \n\n\n Relabeled values in attribute 'regeneration_of'\n From: '1' To: no \n From: '2' To: yes \n\n\n Relabeled values in attribute 'early_uptake_in'\n From: '1' To: no \n From: '2' To: yes \n\n\n Relabeled values in attribute 'changes_in_lym'\n From: '1' To: bean \n From: '2' To: oval \n From: '3' To: round \n\n\n Relabeled values in attribute 'defect_in_node'\n From: '1' To: no \n From: '2' To: lacunar \n From: '3' To: lac_margin \n From: '4' To: lac_central \n\n\n Relabeled values in attribute 'changes_in_node'\n From: '1' To: no \n From: '2' To: lacunar \n From: '3' To: lac_margin \n From: '4' To: lac_central \n\n\n Relabeled values in attribute 'changes_in_stru'\n From: '1' To: no \n From: '2' To: grainy \n From: '3' To: drop_like \n From: '4' To: coarse \n From: '5' To: diluted \n From: '6' To: reticular \n From: '7' To: stripped \n From: '8' To: faint \n\n\n Relabeled values in attribute 'special_forms'\n From: '1' To: no \n From: '2' To: chalices \n From: '3' To: vesicles \n\n\n Relabeled values in attribute 'dislocation_of'\n From: '1' To: no \n From: '2' To: yes \n\n\n Relabeled values in attribute 'exclusion_of_no'\n From: '1' To: no \n From: '2' To: yes \n\n\n Relabeled values in attribute 'class'\n From: '1' To: normal \n From: 
'2' To: metastases \n From: '3' To: malign_lymph \n From: '4' To: fibrosis", "format": "ARFF", "uploader": "Jan van Rijn", "uploader_id": 1, "visibility": "public", "creator": "Institute of Oncology Ljubljana", "contributor": "Igor Kononenko, Bojan Cestnik", "date": "2014-04-06 23:19:52", "update_comment": null, "last_update": "2014-04-06 23:19:52", "licence": "Public", "status": "active", "error_message": null, "url": "https:\/\/www.openml.org\/data\/download\/10\/dataset_10_lymph.arff", "default_target_attribute": "class", "row_id_attribute": null, "ignore_attribute": null, "runs": 1973, "suggest": { "input": [ "lymph", "Citation Request: This lymphography domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data. Please include this citation if you plan to use this database. 1. Title: Lymphography Domain 2. Sources: (a) See Above. (b) Donors: Igor Kononenko, University E.Kardelj Faculty for electrical engineering Trzaska 25 61000 Ljubljana (tel.: (38)(+61) 265-161 Bojan Cestnik Jozef Stefan Institute Jamova " ], "weight": 5 }, "qualities": { "NumberOfInstances": 148, "NumberOfFeatures": 19, "NumberOfClasses": 4, "NumberOfMissingValues": 0, "NumberOfInstancesWithMissingValues": 0, "NumberOfNumericFeatures": 3, "NumberOfSymbolicFeatures": 16, "Quartile2AttributeEntropy": 0.9915528503834039, "REPTreeDepth2ErrRate": 0.2905405405405405, "CfsSubsetEval_kNN1NKappa": 0.5474401537655076, "kNN1NErrRate": 0.19594594594594594, "MajorityClassPercentage": 54.729729729729726, "MeanStdDevOfNumericAtts": 1.0184023049334343, "MinAttributeEntropy": 0.2748031957462935, "Quartile2KurtosisOfNumericAtts": 0.40502113265619144, "REPTreeDepth2Kappa": 0.430361618331543, "ClassEntropy": 1.2276775019465804, "kNN1NKappa": 0.6237068209714186, "MajorityClassSize": 81, "MinKurtosisOfNumericAtts": -0.5040960482425287, "Quartile2MeansOfNumericAtts": 2.472972972972973, "REPTreeDepth3AUC": 
0.7466579226863443, "DecisionStumpAUC": 0.7715656536027917, "MaxAttributeEntropy": 2.527125737973009, "MinMeansOfNumericAtts": 1.060810810810811, "Quartile2MutualInformation": 0.135651202733, "REPTreeDepth3ErrRate": 0.2905405405405405, "DecisionStumpErrRate": 0.24324324324324326, "MaxKurtosisOfNumericAtts": 29.749465128075876, "MinMutualInformation": 0.02911996300275, "Quartile2SkewnessOfNumericAtts": 1.2033115898658382, "REPTreeDepth3Kappa": 0.430361618331543, "DecisionStumpKappa": 0.5316455696202532, "MaxMeansOfNumericAtts": 2.6013513513513518, "MinNominalAttDistinctValues": 2, "PercentageOfBinaryFeatures": 47.368421052631575, "Quartile2StdDevOfNumericAtts": 0.8366270631541782, "RandomTreeDepth1AUC": 0.7576210719961072, "Dimensionality": 0.12837837837837837, "MaxMutualInformation": 0.40188387586188, "MinSkewnessOfNumericAtts": 0.33379516180165014, "PercentageOfInstancesWithMissingValues": 0, "Quartile3AttributeEntropy": 1.6082585569929884, "RandomTreeDepth1ErrRate": 0.24324324324324326, "EquivalentNumberOfAtts": 9.37680223405617, "MaxNominalAttDistinctValues": 8, "MinStdDevOfNumericAtts": 0.3135565426849874, "PercentageOfMissingValues": 0, "Quartile3KurtosisOfNumericAtts": 29.749465128075876, "AutoCorrelation": 0.5034013605442177, "RandomTreeDepth1Kappa": 0.5295364238410597, "J48.00001.AUC": 0.8035040133716935, "MaxSkewnessOfNumericAtts": 5.442361694493849, "MinorityClassPercentage": 1.3513513513513513, "PercentageOfNumericFeatures": 15.789473684210526, "Quartile3MeansOfNumericAtts": 2.6013513513513518, "CfsSubsetEval_DecisionStumpAUC": 0.7924545850419331, "RandomTreeDepth2AUC": 0.7576210719961072, "J48.00001.ErrRate": 0.24324324324324326, "MaxStdDevOfNumericAtts": 1.9050233089611373, "MinorityClassSize": 2, "PercentageOfSymbolicFeatures": 84.21052631578947, "Quartile3MutualInformation": 0.17368798992783, "CfsSubsetEval_DecisionStumpErrRate": 0.23648648648648649, "RandomTreeDepth2ErrRate": 0.24324324324324326, "J48.00001.Kappa": 0.55, "MeanAttributeEntropy": 
1.1174061851513224, "NaiveBayesAUC": 0.9083282647773021, "Quartile1AttributeEntropy": 0.7404482452691425, "Quartile3SkewnessOfNumericAtts": 5.442361694493849, "CfsSubsetEval_DecisionStumpKappa": 0.5474401537655076, "RandomTreeDepth2Kappa": 0.5295364238410597, "J48.0001.AUC": 0.8035040133716935, "MeanKurtosisOfNumericAtts": 9.883463404163178, "NaiveBayesErrRate": 0.1554054054054054, "Quartile1KurtosisOfNumericAtts": -0.5040960482425287, "Quartile3StdDevOfNumericAtts": 1.9050233089611373, "CfsSubsetEval_NaiveBayesAUC": 0.7924545850419331, "RandomTreeDepth3AUC": 0.7576210719961072, "J48.0001.ErrRate": 0.24324324324324326, "MeanMeansOfNumericAtts": 2.045045045045045, "NaiveBayesKappa": 0.7014820661229503, "Quartile1MeansOfNumericAtts": 1.060810810810811, "REPTreeDepth1AUC": 0.7466579226863443, "CfsSubsetEval_NaiveBayesErrRate": 0.23648648648648649, "RandomTreeDepth3ErrRate": 0.24324324324324326, "J48.0001.Kappa": 0.55, "MeanMutualInformation": 0.13092709767170999, "NumberOfBinaryFeatures": 9, "Quartile1MutualInformation": 0.0637948721468, "REPTreeDepth1ErrRate": 0.2905405405405405, "CfsSubsetEval_NaiveBayesKappa": 0.5474401537655076, "RandomTreeDepth3Kappa": 0.5295364238410597, "J48.001.AUC": 0.8035040133716935, "MeanNoiseToSignalRatio": 7.534567748176438, "Quartile1SkewnessOfNumericAtts": 0.33379516180165014, "REPTreeDepth1Kappa": 0.430361618331543, "CfsSubsetEval_kNN1NAUC": 0.7924545850419331, "StdvNominalAttDistinctValues": 1.591644851508443, "J48.001.ErrRate": 0.24324324324324326, "MeanNominalAttDistinctValues": 3, "Quartile1StdDevOfNumericAtts": 0.3135565426849874, "REPTreeDepth2AUC": 0.7466579226863443, "CfsSubsetEval_kNN1NErrRate": 0.23648648648648649, "kNN1NAUC": 0.8277376333822596, "J48.001.Kappa": 0.55, "MeanSkewnessOfNumericAtts": 2.326489482053779 }, "tags": [ { "uploader": "38960", "tag": "Machine Learning" }, { "uploader": "38960", "tag": "Medicine" }, { "uploader": "2", "tag": "study_1" }, { "uploader": "1", "tag": "study_41" }, { "uploader": "64", 
"tag": "study_7" }, { "uploader": "4209", "tag": "study_88" }, { "uploader": "1", "tag": "uci" } ], "features": [ { "name": "class", "index": "18", "type": "nominal", "distinct": "4", "missing": "0", "target": "1", "distr": [ [ "normal", "metastases", "malign_lymph", "fibrosis" ], [ [ "2", "0", "0", "0" ], [ "0", "81", "0", "0" ], [ "0", "0", "61", "0" ], [ "0", "0", "0", "4" ] ] ] }, { "name": "lymphatics", "index": "0", "type": "nominal", "distinct": "4", "missing": "0", "distr": [ [ "normal", "arched", "deformed", "displaced" ], [ [ "2", "0", "0", "0" ], [ "0", "39", "28", "0" ], [ "0", "26", "16", "4" ], [ "0", "16", "17", "0" ] ] ] }, { "name": "block_of_affere", "index": "1", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "no", "yes" ], [ [ "2", "19", "43", "2" ], [ "0", "62", "18", "2" ] ] ] }, { "name": "bl_of_lymph_c", "index": "2", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "no", "yes" ], [ [ "2", "63", "55", "2" ], [ "0", "18", "6", "2" ] ] ] }, { "name": "bl_of_lymph_s", "index": "3", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "no", "yes" ], [ [ "2", "79", "58", "2" ], [ "0", "2", "3", "2" ] ] ] }, { "name": "by_pass", "index": "4", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "no", "yes" ], [ [ "2", "59", "51", "0" ], [ "0", "22", "10", "4" ] ] ] }, { "name": "extravasates", "index": "5", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "no", "yes" ], [ [ "1", "43", "29", "0" ], [ "1", "38", "32", "4" ] ] ] }, { "name": "regeneration_of", "index": "6", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "no", "yes" ], [ [ "2", "80", "56", "0" ], [ "0", "1", "5", "4" ] ] ] }, { "name": "early_uptake_in", "index": "7", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "no", "yes" ], [ [ "1", "35", "5", "3" ], [ "1", "46", "56", "1" ] ] ] }, { "name": "lym_nodes_dimin", "index": "8", "type": "numeric", "distinct": "3", "missing": 
"0", "min": "1", "max": "3", "mean": "1", "stdev": "0" }, { "name": "lym_nodes_enlar", "index": "9", "type": "numeric", "distinct": "4", "missing": "0", "min": "1", "max": "4", "mean": "2", "stdev": "1" }, { "name": "changes_in_lym", "index": "10", "type": "nominal", "distinct": "3", "missing": "0", "distr": [ [ "bean", "oval", "round" ], [ [ "1", "2", "0", "3" ], [ "1", "37", "38", "1" ], [ "0", "42", "23", "0" ] ] ] }, { "name": "defect_in_node", "index": "11", "type": "nominal", "distinct": "4", "missing": "0", "distr": [ [ "no", "lacunar", "lac_margin", "lac_central" ], [ [ "2", "1", "0", "0" ], [ "0", "24", "23", "2" ], [ "0", "34", "12", "0" ], [ "0", "22", "26", "2" ] ] ] }, { "name": "changes_in_node", "index": "12", "type": "nominal", "distinct": "4", "missing": "0", "distr": [ [ "no", "lacunar", "lac_margin", "lac_central" ], [ [ "2", "1", "2", "1" ], [ "0", "15", "24", "3" ], [ "0", "63", "12", "0" ], [ "0", "2", "23", "0" ] ] ] }, { "name": "changes_in_stru", "index": "13", "type": "nominal", "distinct": "8", "missing": "0", "distr": [ [ "no", "grainy", "drop_like", "coarse", "diluted", "reticular", "stripped", "faint" ], [ [ "1", "1", "0", "0" ], [ "1", "11", "2", "0" ], [ "0", "12", "7", "0" ], [ "0", "20", "10", "1" ], [ "0", "16", "10", "2" ], [ "0", "1", "1", "0" ], [ "0", "0", "7", "0" ], [ "0", "20", "24", "1" ] ] ] }, { "name": "special_forms", "index": "14", "type": "nominal", "distinct": "3", "missing": "0", "distr": [ [ "no", "chalices", "vesicles" ], [ [ "2", "19", "6", "1" ], [ "0", "35", "8", "0" ], [ "0", "27", "47", "3" ] ] ] }, { "name": "dislocation_of", "index": "15", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "no", "yes" ], [ [ "2", "34", "12", "2" ], [ "0", "47", "49", "2" ] ] ] }, { "name": "exclusion_of_no", "index": "16", "type": "nominal", "distinct": "2", "missing": "0", "distr": [ [ "no", "yes" ], [ [ "2", "22", "6", "1" ], [ "0", "59", "55", "3" ] ] ] }, { "name": "no_of_nodes_in", "index": "17", "type": 
"numeric", "distinct": "8", "missing": "0", "min": "1", "max": "8", "mean": "3", "stdev": "2" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 0, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 0 }