Data
AIDS_Virus_Infection_Prediction

AIDS_Virus_Infection_Prediction

active ARFF Public Domain (CC0) Visibility: public Uploaded 31-05-2024 by Iwo Godzwon
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Description: The AIDS_Classification_50000.csv dataset is a comprehensive resource specifically compiled for researchers and healthcare professionals focusing on the statistical analysis of AIDS (Acquired Immunodeficiency Syndrome). Composed of 50,000 instances, this dataset encapsulates a broad spectrum of clinical and demographic variables related to AIDS patients. Each record in the dataset holds data across 23 columns, indicating various patient attributes including treatment details, demographic information, clinical test results, and disease progression indicators. Attribute Description: - `time`: Time since the baseline measurement, in days. - `trt`: Treatment code (0, 1, 2), where each number signifies a different treatment regimen. - `age`: Age of the patient in years. - `wtkg`: Weight of the patient in kilograms. - `hemo`: Presence of Hemophilia (0 = No, 1 = Yes). - `homo`: Homosexual behavior (0 = No, 1 = Yes). - `drugs`: Drug use (0 = No, 1 = Yes). - `karnof`: Karnofsky score indicating patient's functional impairment (scores range from 0 to 100). - `oprior`: Number of opportunistic infections prior to study. - `z30`: Presence of Z30 gene (0 = No, 1 = Yes). - `preanti`: Months before receiving antiretroviral therapy. - `race`: Race (0 = Non-white, 1 = White). - `gender`: Gender (0 = Female, 1 = Male). - `str2`: Stratification variable 2. - `strat`: Overall stratification. - `symptom`: Presence of specific AIDS-related symptoms (0 = No, 1 = Yes). - `treat`: Treatment response (0 = No, 1 = Yes). - `offtrt`: Off treatment (0 = No, 1 = Yes). - `cd40`: CD4 count at the baseline. - `cd420`: CD4 count at 20 weeks. - `cd80`: CD4 count at 8 weeks. - `cd820`: CD4 count at 20 weeks post the 8-week measurement. - `infected`: HIV infection status (0 = Negative, 1 = Positive). Use Case: This dataset is designed to facilitate a range of scientific inquiries, including the evaluation of treatment efficacy, the identification of prognostic factors for disease progression, and the development of predictive models for patient outcomes. By encompassing a rich variety of data points, the AIDS_Classification_50000.csv supports detailed statistical analyses and machine learning efforts aimed at enhancing our understanding of AIDS. It can particularly serve in the optimization of treatment protocols and in the advancement of targeted therapies, ultimately contributing to improved patient care and management strategies in the field of AIDS research.

23 features

timenumeric1094 unique values
0 missing
trtnumeric4 unique values
0 missing
agenumeric57 unique values
0 missing
wtkgnumeric49623 unique values
0 missing
hemonumeric2 unique values
0 missing
homonumeric2 unique values
0 missing
drugsnumeric2 unique values
0 missing
karnofnumeric13 unique values
0 missing
opriornumeric2 unique values
0 missing
z30numeric2 unique values
0 missing
preantinumeric1775 unique values
0 missing
racenumeric2 unique values
0 missing
gendernumeric2 unique values
0 missing
str2numeric2 unique values
0 missing
stratnumeric3 unique values
0 missing
symptomnumeric2 unique values
0 missing
treatnumeric2 unique values
0 missing
offtrtnumeric2 unique values
0 missing
cd40numeric639 unique values
0 missing
cd420numeric902 unique values
0 missing
cd80numeric2710 unique values
0 missing
cd820numeric2020 unique values
0 missing
infectednumeric2 unique values
0 missing

19 properties

50000
Number of instances (rows) of the dataset.
23
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
23
Number of numeric attributes.
0
Number of nominal attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
Average class difference between consecutive instances.
0
Percentage of missing values.
0
Number of attributes divided by the number of instances.
100
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.

0 tasks

Define a new task