Data
drug-directory

drug-directory

active ARFF Publicly available Visibility: public Uploaded 18-06-2021 by Marcos de Paula Bueno
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Product listing data submitted to the U.S. FDA for all unfinished, unapproved drugs.

20 features

PRODUCTTYPENAME (target)nominal7 unique values
0 missing
ROW_ID (row identifier)numeric120215 unique values
0 missing
PRODUCTIDstring120215 unique values
0 missing
PRODUCTNDCstring117896 unique values
0 missing
PROPRIETARYNAMEstring45019 unique values
7 missing
PROPRIETARYNAMESUFFIXstring4569 unique values
108397 missing
NONPROPRIETARYNAMEstring19307 unique values
7 missing
DOSAGEFORMNAMEstring139 unique values
0 missing
ROUTENAMEstring192 unique values
2152 missing
STARTMARKETINGDATEnumeric7474 unique values
0 missing
ENDMARKETINGDATEnumeric676 unique values
115624 missing
MARKETINGCATEGORYNAMEstring10 unique values
0 missing
APPLICATIONNUMBERstring11256 unique values
14921 missing
LABELERNAMEstring13388 unique values
0 missing
SUBSTANCENAMEstring9729 unique values
2616 missing
ACTIVE_NUMERATOR_STRENGTHstring10204 unique values
2616 missing
ACTIVE_INGRED_UNITstring2927 unique values
2616 missing
PHARM_CLASSESstring1319 unique values
74252 missing
DEASCHEDULEstring4 unique values
115504 missing
NDC_EXCLUDE_FLAGstring1 unique values
0 missing
LISTING_RECORD_CERTIFIED_THROUGHnumeric2 unique values
4593 missing

19 properties

120215
Number of instances (rows) of the dataset.
20
Number of attributes (columns) of the dataset.
7
Number of distinct values of the target attribute (if it is nominal).
443305
Number of missing values in the dataset.
120215
Number of instances with at least one value missing.
3
Number of numeric attributes.
1
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
15
Percentage of numeric attributes.
57.4
Percentage of instances belonging to the most frequent class.
5
Percentage of nominal attributes.
69001
Number of instances belonging to the most frequent class.
0.01
Percentage of instances belonging to the least frequent class.
7
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
100
Percentage of instances having missing values.
0.95
Average class difference between consecutive instances.
18.44
Percentage of missing values.

0 tasks

Define a new task