Data
DiabetesDataset2019

DiabetesDataset2019

active ARFF Attribution40InternationalCCBY40 Visibility: public Uploaded 23-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
ContextThisdatasetwascollectedbyNehaPrernaTiggaandDrShrutiGargoftheDepartmentofComputerScienceandEngineeringBITMesraRanchi835215forresearchnoncommercialpurposesonlyAnarticleisalsopublishedimplementingthisdatasetFormoreinformationandcitationofthisdatasetpleasereferTiggaNPGargS2020PredictionofType2DiabetesusingMachineLearningClassificationMethodsProcediaComputerScience167706716DOIhttpsdoiorg101016jprocs202003336ContentThereisatotalof952instanceswith17independentpredictorvariablesandonebinarytargetordependentvariableDiabetesAcknowledgementsWewouldliketothankalltheparticipantswhocontributedtowardsthebuildingofthisdatasetInspirationTobuildamachinelearningalgorithmtopredictifapersonhasdiabetesornot

18 features

Agestring4 unique values
0 missing
Genderstring2 unique values
0 missing
Family_Diabetesstring2 unique values
0 missing
highBPstring2 unique values
0 missing
PhysicallyActivestring4 unique values
0 missing
BMInumeric26 unique values
4 missing
Smokingstring2 unique values
0 missing
Alcoholstring2 unique values
0 missing
Sleepnumeric8 unique values
0 missing
SoundSleepnumeric12 unique values
0 missing
RegularMedicinestring3 unique values
0 missing
JunkFoodstring4 unique values
0 missing
Stressstring4 unique values
0 missing
BPLevelstring6 unique values
0 missing
Preganciesnumeric5 unique values
42 missing
Pdiabetesstring3 unique values
1 missing
UriationFreqstring2 unique values
0 missing
Diabeticstring3 unique values
1 missing

19 properties

952
Number of instances (rows) of the dataset.
18
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
48
Number of missing values in the dataset.
47
Number of instances with at least one value missing.
4
Number of numeric attributes.
0
Number of nominal attributes.
0
Percentage of binary attributes.
4.94
Percentage of instances having missing values.
0.28
Percentage of missing values.
Average class difference between consecutive instances.
22.22
Percentage of numeric attributes.
0.02
Number of attributes divided by the number of instances.
0
Percentage of nominal attributes.
Percentage of instances belonging to the most frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.

0 tasks

Define a new task