OpenML

Parkinson_Dataset

Status: active
Format: ARFF
License: Public Domain (CC0)
Visibility: public
Uploaded: 31-05-2024 by Iwo Godzwon

Description:
This dataset, named "parkinsons data.csv", is a collection of voice measurements from individuals, some of whom have Parkinson's disease. Its voice signal attributes are intended to support early diagnosis and tracking of Parkinson's disease through non-invasive methods.

Columns such as 'MDVP:Fo(Hz)', 'MDVP:Fhi(Hz)', and 'MDVP:Flo(Hz)' record vocal frequency measurements. Others, such as 'MDVP:Jitter(%)' and 'MDVP:Shimmer', capture variation in voice frequency and amplitude, while 'NHR' and 'HNR' describe the noise content of the voice. 'status' is a binary indicator where '1' denotes the presence and '0' the absence of Parkinson's disease. Additional metrics relevant to voice disorders, 'RPDE', 'DFA', 'spread1', 'spread2', 'D2', and 'PPE', cover further aspects of voice quality and dynamics.

Attribute Description:
1. MDVP:Fo(Hz): Average vocal fundamental frequency.
2. MDVP:Fhi(Hz): Maximum vocal fundamental frequency.
3. MDVP:Flo(Hz): Minimum vocal fundamental frequency.
4. MDVP:Jitter(%), MDVP:Jitter(Abs), MDVP:RAP, MDVP:PPQ, Jitter:DDP: Various measures of variation in frequency.
5. MDVP:Shimmer, MDVP:Shimmer(dB), Shimmer:APQ3, Shimmer:APQ5, MDVP:APQ, Shimmer:DDA: Different measures of variation in amplitude.
6. NHR, HNR: Ratios depicting noise components in the voice.
7. status: Binary status for the presence of Parkinson's disease.
8. RPDE, DFA, spread1, spread2, D2, PPE: Nonlinear dynamical measurements.

Use Case:
This dataset can support academic research in biomedical voice signal processing as well as the practical development of diagnostic tools for early detection of Parkinson's disease. Data scientists and researchers can use it to build machine learning models that distinguish healthy individuals from those affected by Parkinson's disease based on voice measurements alone. It can also contribute to understanding how Parkinson's disease affects voice characteristics, which may aid the development of new therapies and treatments.
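
As a starting point for the modelling use case, the sketch below shows one way to separate the predictors from the target before training a classifier. It assumes the data is available as CSV text with the column names listed in this description. The two sample rows and the helper names `split_row` and `load_examples` are hypothetical and serve only as an illustration; the values are placeholders, not records from the dataset.

```python
import csv
import io

# Placeholder rows for illustration only; the values are NOT taken from the dataset.
# Only a few of the measurement columns are shown to keep the sketch short.
SAMPLE_CSV = """name,MDVP:Fo(Hz),MDVP:Jitter(%),HNR,status,PPE
rec_a,120.0,0.005,21.5,1,0.25
rec_b,200.0,0.002,26.0,0,0.10
"""

ID_COLUMN = "name"        # recording identifier, not a predictor
TARGET_COLUMN = "status"  # 1 = Parkinson's disease present, 0 = absent


def split_row(row):
    """Return (feature_dict, label) for one CSV row."""
    label = int(row[TARGET_COLUMN])
    features = {
        key: float(value)
        for key, value in row.items()
        if key not in (ID_COLUMN, TARGET_COLUMN)
    }
    return features, label


def load_examples(text):
    """Parse CSV text into parallel lists of feature dicts and labels."""
    reader = csv.DictReader(io.StringIO(text))
    pairs = [split_row(row) for row in reader]
    X = [features for features, _ in pairs]
    y = [label for _, label in pairs]
    return X, y


X, y = load_examples(SAMPLE_CSV)
print(y)             # labels, one per row
print(sorted(X[0]))  # feature names kept for the first row
```

Dropping 'name' keeps an identifier with one distinct value per row out of the feature set, and casting every remaining column to float treats all measurements uniformly regardless of how their types were inferred on upload.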

24 features

Feature	Type	Unique values	Missing values
name	string	195	0
MDVP:Fo(Hz)	numeric	195	0
MDVP:Fhi(Hz)	numeric	195	0
MDVP:Flo(Hz)	numeric	195	0
MDVP:Jitter(%)	numeric	173	0
MDVP:Jitter(Abs)	nominal	19	0
MDVP:RAP	numeric	155	0
MDVP:PPQ	numeric	165	0
Jitter:DDP	numeric	180	0
MDVP:Shimmer	numeric	188	0
MDVP:Shimmer(dB)	numeric	149	0
Shimmer:APQ3	numeric	184	0
Shimmer:APQ5	numeric	189	0
MDVP:APQ	numeric	189	0
Shimmer:DDA	numeric	189	0
NHR	numeric	185	0
HNR	numeric	195	0
status	nominal	2	0
RPDE	numeric	195	0
DFA	numeric	195	0
spread1	numeric	195	0
spread2	numeric	194	0
D2	numeric	195	0
PPE	numeric	195	0
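
The feature summary lists 'MDVP:Jitter(Abs)' as nominal with 19 unique values, although the attribute description treats it as a measure of frequency variation. The sketch below shows one way to flag such columns for review before modelling. The tuples are copied from a few rows of the summary; the helper name `nominal_columns_to_review` and the threshold of two categories are assumptions made for illustration, and whether a flagged column should be cast to a number depends on its actual values.

```python
# (name, type, unique_values) copied from a few rows of the feature summary.
FEATURES = [
    ("name", "string", 195),
    ("MDVP:Jitter(%)", "numeric", 173),
    ("MDVP:Jitter(Abs)", "nominal", 19),
    ("HNR", "numeric", 195),
    ("status", "nominal", 2),
]


def nominal_columns_to_review(features, max_categories=2):
    """Return names of nominal columns with more categories than expected.

    max_categories is a hypothetical threshold: the binary target 'status'
    has exactly two categories, so anything above that is worth a look.
    """
    return [
        name
        for name, kind, unique in features
        if kind == "nominal" and unique > max_categories
    ]


flagged = nominal_columns_to_review(FEATURES)
print(flagged)
```

With the rows above, only 'MDVP:Jitter(Abs)' is flagged; 'status' stays nominal because it is the binary class label.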