Data
Diabetes_Dataset

Diabetes_Dataset

active ARFF Public Domain (CC0) Visibility: public Uploaded 30-06-2024 by Iwo Godzwon
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Description: The dataset, named 'diabetes.csv', serves as a comprehensive resource for understanding various factors that may influence the occurrence of diabetes in individuals. Consisting of several medically relevant parameters, the dataset captures key details across 9 columns, namely Pregnancies, Glucose, BloodPressure, SkinThickness, Insulin, BMI (Body Mass Index), DiabetesPedigreeFunction, Age, and Outcome. Each column reflects a distinct attribute significant to diabetes research and potential predictive modeling. Attribute Description: 1. Pregnancies: Number of times pregnant (Example values: 2, 1) 2. Glucose: Plasma glucose concentration over 2 hours in an oral glucose tolerance test (Example values: 82, 142) 3. BloodPressure: Diastolic blood pressure (mm Hg) (Example values: 70, 64) 4. SkinThickness: Triceps skin fold thickness (mm) (Example values: 27, 0) 5. Insulin: 2-Hour serum insulin (mu U/ml) (Example values: 168, 0) 6. BMI: Body mass index (weight in kg/(height in m)^2) (Example values: 36.8, 30.1) 7. DiabetesPedigreeFunction: Diabetes pedigree function (Example values: 0.34, 0.396) 8. Age: Age in years (Example values: 54, 24) 9. Outcome: Class variable (0 or 1) where 1 denotes the presence of diabetes and 0 denotes absence (Example values: 1, 0) Use Case: This dataset is particularly useful for medical researchers, data scientists, and healthcare providers seeking to identify patterns or factors that significantly contribute to diabetes. By employing statistical analysis or machine learning models, one can predict the likelihood of diabetes occurrence based on the dataset's parameters. Furthermore, this dataset can facilitate a better understanding of how various factors, such as pregnancy, BMI, and age, interact with each other in the context of diabetes, thereby aiding in preventative healthcare planning and patient education.

9 features

Pregnanciesnumeric17 unique values
0 missing
Glucosenumeric136 unique values
0 missing
BloodPressurenumeric47 unique values
0 missing
SkinThicknessnumeric51 unique values
0 missing
Insulinnumeric186 unique values
0 missing
BMInumeric248 unique values
0 missing
DiabetesPedigreeFunctionnumeric517 unique values
0 missing
Agenumeric52 unique values
0 missing
Outcomenumeric2 unique values
0 missing

19 properties

768
Number of instances (rows) of the dataset.
9
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
9
Number of numeric attributes.
0
Number of nominal attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
0
Percentage of missing values.
Average class difference between consecutive instances.
100
Percentage of numeric attributes.
0.01
Number of attributes divided by the number of instances.
0
Percentage of nominal attributes.
Percentage of instances belonging to the most frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.

0 tasks

Define a new task