risk-factors-cervical


Status: active; Format: ARFF; License: CC-BY; Visibility: public; Uploaded 19-05-2021 by Meilina Reksoprodjo
Author: Kelwin Fernandes, Jaime S. Cardoso, Jessica Fernandes

Source: [UCI](https://archive.ics.uci.edu/ml/datasets/Cervical+cancer+%28Risk+Factors%29) - 2017

Please cite: [Paper](https://link.springer.com/chapter/10.1007/978-3-319-58838-4_27)

Cervical cancer (Risk Factors) Data Set

The dataset was collected at 'Hospital Universitario de Caracas' in Caracas, Venezuela. The dataset comprises demographic information, habits, and historic medical records of 858 patients. Several patients decided not to answer some of the questions because of privacy concerns (missing values).

### Attribute information

- (int) Age
- (int) Number of sexual partners
- (int) First sexual intercourse (age)
- (int) Num of pregnancies
- (bool) Smokes
- (bool) Smokes (years)
- (bool) Smokes (packs/year)
- (bool) Hormonal Contraceptives
- (int) Hormonal Contraceptives (years)
- (bool) IUD
- (int) IUD (years)
- (bool) STDs
- (int) STDs (number)
- (bool) STDs:condylomatosis
- (bool) STDs:cervical condylomatosis
- (bool) STDs:vaginal condylomatosis
- (bool) STDs:vulvo-perineal condylomatosis
- (bool) STDs:syphilis
- (bool) STDs:pelvic inflammatory disease
- (bool) STDs:genital herpes
- (bool) STDs:molluscum contagiosum
- (bool) STDs:AIDS
- (bool) STDs:HIV
- (bool) STDs:Hepatitis B
- (bool) STDs:HPV
- (int) STDs: Number of diagnosis
- (int) STDs: Time since first diagnosis
- (int) STDs: Time since last diagnosis
- (bool) Dx:Cancer
- (bool) Dx:CIN
- (bool) Dx:HPV
- (bool) Dx
- (bool) Hinselmann: target variable
- (bool) Schiller: target variable
- (bool) Cytology: target variable
- (bool) Biopsy: target variable
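Because several patients left questions unanswered, any code that reads this data has to treat missing cells explicitly. The following is a minimal sketch of one way to do that with the Python standard library. It assumes that missing cells are written as `?` (the usual ARFF convention; check the actual file before relying on it), and the three sample rows are made up for illustration and are not taken from the dataset. The names `parse_cell` and `load_rows` are hypothetical helpers.

```python
import csv
import io

MISSING = "?"  # assumption: marker used for unanswered questions in the export

# Made-up rows for illustration only; they are NOT records from the dataset.
SAMPLE = """Age,Number of sexual partners,Smokes,Biopsy
30,2.0,0.0,0
41,?,?,0
25,1.0,1.0,1
"""


def parse_cell(text):
    """Return None for a missing cell, otherwise the value as a float."""
    text = text.strip()
    return None if text == MISSING else float(text)


def load_rows(handle):
    """Read CSV rows into dicts, mapping each column name to a float or None."""
    reader = csv.DictReader(handle)
    return [
        {name: parse_cell(value) for name, value in row.items()}
        for row in reader
    ]


rows = load_rows(io.StringIO(SAMPLE))

# Rows where the patient skipped at least one question.
incomplete = [row for row in rows if any(v is None for v in row.values())]
print(f"{len(incomplete)} of {len(rows)} sample rows have a missing value")
```

Keeping missing cells as `None` rather than substituting a number such as 0 avoids confusing "did not answer" with a genuine answer of zero; how to impute or drop those cells is a separate modelling decision.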

36 features

| Feature | Type | Unique values | Missing |
|---|---|---|---|
| Age | numeric | 44 | 0 |
| Number of sexual partners | string | 12 | 26 |
| First sexual intercourse | string | 21 | 7 |
| Num of pregnancies | string | 11 | 56 |
| Smokes | string | 2 | 13 |
| Smokes (years) | string | 30 | 13 |
| Smokes (packs/year) | string | 62 | 13 |
| Hormonal Contraceptives | string | 2 | 108 |
| Hormonal Contraceptives (years) | string | 40 | 108 |
| IUD | string | 2 | 117 |
| IUD (years) | string | 26 | 117 |
| STDs | string | 2 | 105 |
| STDs (number) | string | 5 | 105 |
| STDs:condylomatosis | string | 2 | 105 |
| STDs:cervical condylomatosis | string | 1 | 105 |
| STDs:vaginal condylomatosis | string | 2 | 105 |
| STDs:vulvo-perineal condylomatosis | string | 2 | 105 |
| STDs:syphilis | string | 2 | 105 |
| STDs:pelvic inflammatory disease | string | 2 | 105 |
| STDs:genital herpes | string | 2 | 105 |
| STDs:molluscum contagiosum | string | 2 | 105 |
| STDs:AIDS | string | 1 | 105 |
| STDs:HIV | string | 2 | 105 |
| STDs:Hepatitis B | string | 2 | 105 |
| STDs:HPV | string | 2 | 105 |
| STDs: Number of diagnosis | numeric | 4 | 0 |
| STDs: Time since first diagnosis | string | 18 | 787 |
| STDs: Time since last diagnosis | string | 18 | 787 |
| Dx:Cancer | numeric | 2 | 0 |
| Dx:CIN | numeric | 2 | 0 |
| Dx:HPV | numeric | 2 | 0 |
| Dx | numeric | 2 | 0 |
| Hinselmann | numeric | 2 | 0 |
| Schiller | numeric | 2 | 0 |
| Citology | numeric | 2 | 0 |
| Biopsy | numeric | 2 | 0 |
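The per-feature missing counts above can be cross-checked against the dataset-level total. The sketch below copies the counts from the feature list (features with 0 missing are omitted) and sums them; the variable names are illustrative, and the row count of 858 comes from the dataset description.

```python
# Missing-value counts copied from the feature list; zero-missing features omitted.
missing_by_feature = {
    "Number of sexual partners": 26,
    "First sexual intercourse": 7,
    "Num of pregnancies": 56,
    "Smokes": 13,
    "Smokes (years)": 13,
    "Smokes (packs/year)": 13,
    "Hormonal Contraceptives": 108,
    "Hormonal Contraceptives (years)": 108,
    "IUD": 117,
    "IUD (years)": 117,
    "STDs: Time since first diagnosis": 787,
    "STDs: Time since last diagnosis": 787,
}

# The 14 STD-related features from "STDs" through "STDs:HPV" each list 105 missing.
std_features = [
    "STDs", "STDs (number)", "STDs:condylomatosis",
    "STDs:cervical condylomatosis", "STDs:vaginal condylomatosis",
    "STDs:vulvo-perineal condylomatosis", "STDs:syphilis",
    "STDs:pelvic inflammatory disease", "STDs:genital herpes",
    "STDs:molluscum contagiosum", "STDs:AIDS", "STDs:HIV",
    "STDs:Hepatitis B", "STDs:HPV",
]
missing_by_feature.update({name: 105 for name in std_features})

N_INSTANCES = 858  # number of patients stated in the description

total_missing = sum(missing_by_feature.values())
print("features with missing values:", len(missing_by_feature))
print("total missing values:", total_missing)

# Rank features by the share of rows that lack a value.
for name, count in sorted(missing_by_feature.items(), key=lambda kv: -kv[1])[:3]:
    print(f"{name}: {100 * count / N_INSTANCES:.1f}% missing")
```

The two "Time since ... diagnosis" features are missing for 787 of 858 rows, so they are natural candidates to drop or to handle separately before modelling.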

19 properties

| Property | Value |
|---|---|
| Number of instances (rows) of the dataset. | 858 |
| Number of attributes (columns) of the dataset. | 36 |
| Number of distinct values of the target attribute (if it is nominal). | |
| Number of missing values in the dataset. | 3622 |
| Number of instances with at least one value missing. | 799 |
| Number of numeric attributes. | 10 |
| Number of nominal attributes. | 0 |
| Number of attributes divided by the number of instances. | 0.04 |
| Percentage of numeric attributes. | 27.78 |
| Percentage of instances belonging to the most frequent class. | |
| Percentage of nominal attributes. | 0 |
| Number of instances belonging to the most frequent class. | |
| Percentage of instances belonging to the least frequent class. | |
| Number of instances belonging to the least frequent class. | |
| Number of binary attributes. | 0 |
| Percentage of binary attributes. | 0 |
| Percentage of instances having missing values. | 93.12 |
| Average class difference between consecutive instances. | |
| Percentage of missing values. | 11.73 |
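The ratio and percentage properties follow from the raw counts. As a quick sanity check, the sketch below recomputes them; it assumes that the percentage of missing values is taken over all cells (rows × columns), which is an inference from the numbers matching rather than something the page states.

```python
# Raw counts as listed in the properties.
n_instances = 858
n_attributes = 36
n_missing_values = 3622
n_rows_with_missing = 799
n_numeric = 10

# Number of attributes divided by the number of instances.
dimensionality = n_attributes / n_instances

# Percentage of numeric attributes.
pct_numeric = 100 * n_numeric / n_attributes

# Percentage of instances having missing values.
pct_rows_with_missing = 100 * n_rows_with_missing / n_instances

# Percentage of missing values, assuming the denominator is every cell.
pct_missing_values = 100 * n_missing_values / (n_instances * n_attributes)

print(f"dimensionality:        {dimensionality:.2f}")
print(f"pct numeric:           {pct_numeric:.2f}")
print(f"pct rows with missing: {pct_rows_with_missing:.2f}")
print(f"pct missing values:    {pct_missing_values:.2f}")
```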

0 tasks
