{ "data_id": "4540", "name": "ParkinsonSpeechDatasetwithMultipleTypesofSoundRecordings", "exact_name": "ParkinsonSpeechDatasetwithMultipleTypesofSoundRecordings", "version": 1, "version_label": null, "description": "Source:\n\n1. Olcay KURSUN, PhD., \nIstanbul University, \nDepartment of Computer Engineering, \n34320, Istanbul, Turkey \nPhone: +90 (212) 473 7070 - 17827 \nEmail: okursun '@' istanbul.edu.tr \n\n2. Betul ERDOGDU SAKAR, PhD., \nBahcesehir University, \nDepartment of Software Engineering, \n34381, Istanbul, Turkey \nPhone: +90 (212) 381 0589 \nEmail: betul.erdogdu '@' eng.bahcesehir.edu.tr \n\n3. M. Erdem ISENKUL, M.S., \nIstanbul University, \nDepartment of Computer Engineering, \n34320, Istanbul, Turkey \nEmail: eisenkul '@' istanbul.edu.tr \n\n4. C. Okan SAKAR, PhD., \nBahcesehir University, \nDepartment of Computer Engineering, \n34381, Istanbul, Turkey \nPhone: +90 (212) 381 0571 \nEmail: okan.sakar '@' eng.bahcesehir.edu.tr \n\n5. Ahmet SERTBAS, PhD, \nIstanbul University, \nDepartment of Computer Engineering, \n34320, Istanbul, Turkey \nEmail: asertbas '@' istanbul.edu.tr \n\n6. Fikret GURGEN, PhD., \nBogazici University, \nDepartment of Computer Engineering, \n34342, Istanbul, Turkey \nEmail: gurgen '@' boun.edu.tr \n\n7. Sakir DELIL, M.D., PhD., \nIstanbul University, \nCerrahpaşa Faculty of Medicine, \nDepartment of Neurology, \n34098, Istanbul, Turkey \nEmail: sakir.delil '@' ctf.edu.tr \n\n8. Hulya APAYDIN, M.D., PhD., \nIstanbul University, \nCerrahpaşa Faculty of Medicine, \nDepartment of Neurology, \n34098, Istanbul, Turkey \nEmail: hulya.apaydin '@' ctf.edu.tr \n\nDonor: \nC. Okan SAKAR, PhD., \nBahcesehir University, \nDepartment of Computer Engineering, \n34381, Istanbul, Turkey \nPhone: +90 (212) 381 0571 \nEmail: okan.sakar '@' eng.bahcesehir.edu.tr \n\n\nData Set Information:\n\nThe PD database consists of training and test files. 
The training data belongs to 20 PWP (6 female, 14 male) and 20 healthy individuals (10 female, 10 male) who presented at the Department of Neurology in Cerrahpaşa Faculty of Medicine, Istanbul University. From all subjects, multiple types of sound recordings (26 voice samples including sustained vowels, numbers, words and short sentences) were taken. A group of 26 linear and time–frequency based features is extracted from each voice sample. The UPDRS (Unified Parkinson’s Disease Rating Scale) score of each patient, determined by an expert physician, is also available in this dataset. Therefore, this dataset can also be used for regression. \n\nAfter collecting the training dataset, which consists of multiple types of sound recordings, and performing our experiments, we continued, in line with the obtained findings, by collecting an independent test set from PWP via the same physician’s examination process under the same conditions. During the collection of this dataset, 28 PD patients were asked to say only the sustained vowels 'a' and 'o' three times each, which makes a total of 168 recordings. The same 26 features are extracted from the voice samples of this dataset. This dataset can be used as an independent test set to validate the results obtained on the training set. \n\nFurther details are contained in the following reference -- if you use this dataset, please cite: \nErdogdu Sakar, B., Isenkul, M., Sakar, C.O., Sertbas, A., Gurgen, F., Delil, S., Apaydin, H., Kursun, \nO., 'Collection and Analysis of a Parkinson Speech Dataset with Multiple Types of Sound \nRecordings', IEEE Journal of Biomedical and Health Informatics, vol. 17(4), pp. 828-834, 2013 \n\nTraining Data File: \nEach subject has 26 voice samples including sustained vowels, numbers, words and short \nsentences. The voice samples in the training data file are given in the \nfollowing order: \n\nsample# - corresponding voice samples \n1: sustained vowel (aaa……) \n2: sustained vowel (ooo……) 
\n3: sustained vowel (uuu……) \n4-13: numbers from 1 to 10 \n14-17: short sentences \n18-26: words \n\nTest Data File: \n28 PD patients were asked to say only the sustained vowels 'a' and 'o' three times each, which makes a total of 168 recordings (each subject has 6 voice samples). The voice samples in the test data file are given in the following order: \n\nsample# - corresponding voice samples \n1-3: sustained vowel (aaa……) \n4-6: sustained vowel (ooo……) \n\n\nAttribute Information:\n\nTraining Data File: \ncolumn 1: Subject id \n\ncolumns 2-27: features \nfeatures 1-5: Jitter (local),Jitter (local, absolute),Jitter (rap),Jitter (ppq5),Jitter (ddp), \nfeatures 6-11: Shimmer (local),Shimmer (local, dB),Shimmer (apq3),Shimmer (apq5), Shimmer (apq11),Shimmer (dda), \nfeatures 12-14: AC,NTH,HTN, \nfeatures 15-19: Median pitch,Mean pitch,Standard deviation,Minimum pitch,Maximum pitch, \nfeatures 20-23: Number of pulses,Number of periods,Mean period,Standard deviation of period, \nfeatures 24-26: Fraction of locally unvoiced frames,Number of voice breaks,Degree of voice breaks \n\ncolumn 28: UPDRS \ncolumn 29: class information \n\nTest Data File: \ncolumn 1: Subject id \n\ncolumns 2-27: features \nfeatures 1-5: Jitter (local),Jitter (local, absolute),Jitter (rap),Jitter (ppq5),Jitter (ddp), \nfeatures 6-11: Shimmer (local),Shimmer (local, dB),Shimmer (apq3),Shimmer (apq5), Shimmer (apq11),Shimmer (dda), \nfeatures 12-14: AC,NTH,HTN, \nfeatures 15-19: Median pitch,Mean pitch,Standard deviation,Minimum pitch,Maximum pitch, \nfeatures 20-23: Number of pulses,Number of periods,Mean period,Standard deviation of period, \nfeatures 24-26: Fraction of locally unvoiced frames,Number of voice breaks,Degree of voice breaks \n\ncolumn 28: class information \n\n\nRelevant Papers:\n\nErdogdu Sakar, B., Isenkul, M., Sakar, C.O., Sertbas, A., Gurgen, F., Delil, S., Apaydin, H., Kursun, O., 'Collection and Analysis of a Parkinson Speech Dataset with Multiple Types of Sound 
Recordings', IEEE Journal of Biomedical and Health Informatics, vol. 17(4), pp. 828-834, 2013. \n\nIsenkul, M.E., Erdoğdu, B., Sakar, C.O., Gümüs, E., Delil, M.S., Gürgen, F., Sertbas, A., Kursun, O., \nParkinson Hastalığının Ses Disfonilerinden Teşhisi için bir Ses Veritabanı Olusturulması ve \nÖrüntülerinin Kullanımı, 16. Biyomedikal Mühendisliği Ulusal Toplantısı (BİYOMUT 2011), \nAntalya, Turkey, October, 2011.\n\n\n\nCitation Request:\n\nPlease cite the following paper if you use this dataset: \nErdogdu Sakar, B., Isenkul, M., Sakar, C.O., Sertbas, A., Gurgen, F., Delil, S., Apaydin, H., Kursun, O., 'Collection and Analysis of a Parkinson Speech Dataset with Multiple Types of Sound Recordings', IEEE Journal of Biomedical and Health Informatics, vol. 17(4), pp. 828-834, 2013.", "format": "ARFF", "uploader": "Hilda Fabiola Bernard", "uploader_id": 874, "visibility": "public", "creator": "\"1. Olcay KURSUN\",\"PhD.\",\"Istanbul University\",\"Department of Computer Engineering\",\"34320\",\"Istanbul\",\"Turkey Phone: +90 (212) 473 7070 - 17827 Email: okursun '@' istanbul.edu.tr 2. Betul ERDOGDU SAKAR\",\"PhD.\",\"Bahcesehir University\",\"Department of Software Engineering\",\"34381\",\"Istanbul\",\"Turkey Phone: +90 (212) 381 0589 Email: betul.erdogdu '@' eng.bahcesehir.edu.tr 3. M. Erdem ISENKUL\",\"M.S.\",\"Istanbul University\",\"Department of Computer Engineering\",\"34320\",\"Istanbul\",\"Turkey Email: eisenkul '@' istanbul.edu.tr 4. C. Okan SAKAR\",\"PhD.\",\"Bahcesehir University\",\"Department of Computer Engineering\",\"34381\",\"Istanbul\",\"Turkey Phone: +90 (212) 381 0571 Email: okan.sakar '@' eng.bahcesehir.edu.tr 5. Ahmet SERTBAS\",\"PhD\",\"Istanbul University\",\"Department of Computer Engineering\",\"34320\",\"Istanbul\",\"Turkey Email: asertbas '@' istanbul.edu.tr 6. Fikret GURGEN\",\"PhD.\",\"Bogazici University\",\"Department of Computer Engineering\",\"34342\",\"Istanbul\",\"Turkey Email: gurgen '@' boun.edu.tr 7. 
Sakir DELIL\",\"M.D.\",\"PhD.\",\"Istanbul University\",\"Cerrahpaşa Faculty of Medicine\",\"Department of Neurology\",\"34098\",\"Istanbul\",\"Turkey Email: sakir.delil '@' ctf.edu.tr 8. Hulya APAYDIN\",\"M.D.\",\"PhD.\",\"Istanbul University\",\"Cerrahpaşa Faculty of Medicine\",\"Department of Neurology\",\"34098\",\"Istanbul\",\"Turkey Email: hulya.apaydin '@' ctf.edu.tr\"", "contributor": null, "date": "2016-02-17 11:55:26", "update_comment": null, "last_update": "2016-02-17 11:55:26", "licence": "Public", "status": "active", "error_message": null, "url": "https:\/\/www.openml.org\/data\/download\/1798767\/php6jsECG", "default_target_attribute": null, "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "ParkinsonSpeechDatasetwithMultipleTypesofSoundRecordings", "Source: 1. Olcay KURSUN, PhD., Istanbul University, Department of Computer Engineering, 34320, Istanbul, Turkey Phone: +90 (212) 473 7070 - 17827 Email: okursun '@' istanbul.edu.tr 2. Betul ERDOGDU SAKAR, PhD., Bahcesehir University, Department of Software Engineering, 34381, Istanbul, Turkey Phone: +90 (212) 381 0589 Email: betul.erdogdu '@' eng.bahcesehir.edu.tr 3. M. 
Erdem ISENKUL, M.S., Istanbul University, Department of Computer Engineering, 34320, Istanbul, Turkey Email: eisenkul '@' istan " ], "weight": 5 }, "qualities": { "NumberOfInstances": 1039, "NumberOfFeatures": 29, "NumberOfClasses": null, "NumberOfMissingValues": 0, "NumberOfInstancesWithMissingValues": 0, "NumberOfNumericFeatures": 29, "NumberOfSymbolicFeatures": 0, "REPTreeDepth3Kappa": null, "DecisionStumpKappa": null, "MaxMeansOfNumericAtts": 234.92151441770935, "MinMutualInformation": null, "Quartile2SkewnessOfNumericAtts": 1.4279334775091523, "RandomTreeDepth1AUC": null, "Dimensionality": 0.02791145332050048, "MaxMutualInformation": null, "MinNominalAttDistinctValues": null, "PercentageOfBinaryFeatures": 0, "Quartile2StdDevOfNumericAtts": 4.8420894178950045, "RandomTreeDepth1ErrRate": null, "EquivalentNumberOfAtts": null, "MaxNominalAttDistinctValues": null, "MinSkewnessOfNumericAtts": -0.8556663473769629, "PercentageOfInstancesWithMissingValues": 0, "Quartile3AttributeEntropy": null, "RandomTreeDepth1Kappa": null, "J48.00001.AUC": null, "MaxSkewnessOfNumericAtts": 4.899314231854812, "MinStdDevOfNumericAtts": 0.00010647638546233881, "PercentageOfMissingValues": 0, "Quartile3KurtosisOfNumericAtts": 9.063302563984637, "AutoCorrelation": null, "RandomTreeDepth2AUC": null, "J48.00001.ErrRate": null, "MaxStdDevOfNumericAtts": 150.0918405170379, "MinorityClassPercentage": null, "PercentageOfNumericFeatures": 100, "Quartile3MeansOfNumericAtts": 27.63680629451395, "CfsSubsetEval_DecisionStumpAUC": null, "RandomTreeDepth2ErrRate": null, "J48.00001.Kappa": null, "MeanAttributeEntropy": null, "MinorityClassSize": null, "PercentageOfSymbolicFeatures": 0, "Quartile3MutualInformation": null, "CfsSubsetEval_DecisionStumpErrRate": null, "RandomTreeDepth2Kappa": null, "J48.0001.AUC": null, "MeanKurtosisOfNumericAtts": 7.366918095387811, "NaiveBayesAUC": null, "Quartile1AttributeEntropy": null, "Quartile3SkewnessOfNumericAtts": 2.5086745550111735, 
"CfsSubsetEval_DecisionStumpKappa": null, "RandomTreeDepth3AUC": null, "J48.0001.ErrRate": null, "MeanMeansOfNumericAtts": 37.83427026899721, "NaiveBayesErrRate": null, "Quartile1KurtosisOfNumericAtts": 0.6670492361572509, "Quartile3StdDevOfNumericAtts": 28.82710532443652, "CfsSubsetEval_NaiveBayesAUC": null, "RandomTreeDepth3ErrRate": null, "J48.0001.Kappa": null, "MeanMutualInformation": null, "NaiveBayesKappa": null, "Quartile1MeansOfNumericAtts": 1.1655014629451397, "REPTreeDepth1AUC": null, "CfsSubsetEval_NaiveBayesErrRate": null, "RandomTreeDepth3Kappa": null, "J48.001.AUC": null, "MeanNoiseToSignalRatio": null, "NumberOfBinaryFeatures": 0, "Quartile1MutualInformation": null, "REPTreeDepth1ErrRate": null, "CfsSubsetEval_NaiveBayesKappa": null, "CfsSubsetEval_kNN1NAUC": null, "StdvNominalAttDistinctValues": null, "J48.001.ErrRate": null, "MeanNominalAttDistinctValues": null, "Quartile1SkewnessOfNumericAtts": 0.9487571288419119, "REPTreeDepth1Kappa": null, "CfsSubsetEval_kNN1NErrRate": null, "kNN1NAUC": null, "J48.001.Kappa": null, "MeanSkewnessOfNumericAtts": 1.7199419684031358, "Quartile1StdDevOfNumericAtts": 0.7400575055430636, "REPTreeDepth2AUC": null, "CfsSubsetEval_kNN1NKappa": null, "kNN1NErrRate": null, "MajorityClassPercentage": null, "MeanStdDevOfNumericAtts": 24.9246507551552, "Quartile2AttributeEntropy": null, "REPTreeDepth2ErrRate": null, "ClassEntropy": null, "kNN1NKappa": null, "MajorityClassSize": null, "MinAttributeEntropy": null, "Quartile2KurtosisOfNumericAtts": 3.6016018105282557, "REPTreeDepth2Kappa": null, "REPTreeDepth3AUC": null, "DecisionStumpAUC": null, "MaxAttributeEntropy": null, "MinKurtosisOfNumericAtts": -2.003857280617148, "Quartile2MeansOfNumericAtts": 9.998453666987489, "REPTreeDepth3ErrRate": null, "DecisionStumpErrRate": null, "MaxKurtosisOfNumericAtts": 35.80202572289484, "MinMeansOfNumericAtts": 0.0001704751511068335, "Quartile2MutualInformation": null }, "tags": [ { "uploader": "38960", "tag": "Transportation" } ], 
"features": [ { "name": "X1", "index": "0", "type": "numeric", "distinct": "40", "missing": "0", "min": "1", "max": "40", "mean": "21", "stdev": "12" }, { "name": "X1.488", "index": "1", "type": "numeric", "distinct": "943", "missing": "0", "min": "0", "max": "14", "mean": "3", "stdev": "2" }, { "name": "X0.000090213", "index": "2", "type": "numeric", "distinct": "1036", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "X0.9", "index": "3", "type": "numeric", "distinct": "838", "missing": "0", "min": "0", "max": "8", "mean": "1", "stdev": "1" }, { "name": "X0.794", "index": "4", "type": "numeric", "distinct": "873", "missing": "0", "min": "0", "max": "14", "mean": "1", "stdev": "1" }, { "name": "X2.699", "index": "5", "type": "numeric", "distinct": "960", "missing": "0", "min": "0", "max": "24", "mean": "4", "stdev": "3" }, { "name": "X8.334", "index": "6", "type": "numeric", "distinct": "1006", "missing": "0", "min": "1", "max": "41", "mean": "13", "stdev": "5" }, { "name": "X0.779", "index": "7", "type": "numeric", "distinct": "722", "missing": "0", "min": "0", "max": "3", "mean": "1", "stdev": "0" }, { "name": "X4.517", "index": "8", "type": "numeric", "distinct": "979", "missing": "0", "min": "0", "max": "26", "mean": "6", "stdev": "3" }, { "name": "X4.609", "index": "9", "type": "numeric", "distinct": "979", "missing": "0", "min": "1", "max": "73", "mean": "8", "stdev": "5" }, { "name": "X6.802", "index": "10", "type": "numeric", "distinct": "943", "missing": "0", "min": "1", "max": "45", "mean": "12", "stdev": "6" }, { "name": "X13.551", "index": "11", "type": "numeric", "distinct": "1017", "missing": "0", "min": "1", "max": "77", "mean": "17", "stdev": "9" }, { "name": "X0.905905", "index": "12", "type": "numeric", "distinct": "1038", "missing": "0", "min": "1", "max": "1", "mean": "1", "stdev": "0" }, { "name": "X0.119116", "index": "13", "type": "numeric", "distinct": "1034", "missing": "0", "min": "0", "max": "1", "mean": 
"0", "stdev": "0" }, { "name": "X11.13", "index": "14", "type": "numeric", "distinct": "992", "missing": "0", "min": "1", "max": "28", "mean": "10", "stdev": "4" }, { "name": "X166.533", "index": "15", "type": "numeric", "distinct": "1037", "missing": "0", "min": "81", "max": "469", "mean": "163", "stdev": "56" }, { "name": "X164.781", "index": "16", "type": "numeric", "distinct": "1036", "missing": "0", "min": "82", "max": "470", "mean": "169", "stdev": "56" }, { "name": "X10.421", "index": "17", "type": "numeric", "distinct": "1028", "missing": "0", "min": "1", "max": "294", "mean": "28", "stdev": "37" }, { "name": "X142.229", "index": "18", "type": "numeric", "distinct": "1036", "missing": "0", "min": "68", "max": "452", "mean": "135", "stdev": "47" }, { "name": "X187.576", "index": "19", "type": "numeric", "distinct": "1037", "missing": "0", "min": "86", "max": "598", "mean": "235", "stdev": "122" }, { "name": "X160", "index": "20", "type": "numeric", "distinct": "274", "missing": "0", "min": "0", "max": "1490", "mean": "110", "stdev": "150" }, { "name": "X159", "index": "21", "type": "numeric", "distinct": "270", "missing": "0", "min": "0", "max": "1489", "mean": "106", "stdev": "149" }, { "name": "X0.006064725", "index": "22", "type": "numeric", "distinct": "1039", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "X0.000416276", "index": "23", "type": "numeric", "distinct": "1038", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "X0", "index": "24", "type": "numeric", "distinct": "669", "missing": "0", "min": "0", "max": "88", "mean": "28", "stdev": "21" }, { "name": "X0.1", "index": "25", "type": "numeric", "distinct": "13", "missing": "0", "min": "0", "max": "12", "mean": "1", "stdev": "2" }, { "name": "X0.2", "index": "26", "type": "numeric", "distinct": "594", "missing": "0", "min": "0", "max": "69", "mean": "12", "stdev": "15" }, { "name": "X23", "index": "27", "type": "numeric", "distinct": 
"15", "missing": "0", "min": "1", "max": "55", "mean": "13", "stdev": "16" }, { "name": "X1.1", "index": "28", "type": "numeric", "distinct": "2", "missing": "0", "min": "0", "max": "1", "mean": "0", "stdev": "1" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 0, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 0 }