WebMD-Drug-Reviews-Dataset


Status: active. Format: ARFF. License: CC0: Public Domain. Visibility: public. Uploaded 24-03-2022 by Dustin Carrion.


Context
The dataset provides user reviews of specific drugs, along with the related conditions, side effects, age, sex, and ratings reflecting overall patient satisfaction.

Content
The data was acquired by scraping the WebMD site. It contains about 0.36 million rows of unique reviews and is current through March 2020.

Inspiration
The dataset is intended to help answer the following questions:
I. Can a patient's condition be identified from drug reviews?
II. How can a drug's rating be predicted from patient reviews?
III. How can drug ratings, kinds of drugs, types of conditions a patient can have, and review sentiment be visualized?

11 features

Feature          Type     Unique values  Missing
Age              string   12             0
Condition        string   1806           0
Date             string   4524           0
Drug             string   7093           0
DrugId (ignore)  numeric  6572           0
EaseofUse        numeric  7              0
Effectiveness    numeric  7              0
Reviews          string   250164         42
Satisfaction     numeric  7              0
Sex              string   3              0
Sides            string   1651           0
UsefulCount      numeric  148            0
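The feature list marks DrugId as ignored and shows missing values only in Reviews, so a typical first step before any text modeling is to drop that field and skip rows without review text. The following is a minimal sketch of that step using plain Python dictionaries. The sample rows, their values, the date format, and the names `SAMPLE_ROWS`, `IGNORED_FIELDS`, and `prepare_reviews` are all hypothetical; only the field names come from the feature list above. Treating blank review strings as missing is also an assumption.

```python
# Hypothetical sample rows: only the field names come from the feature list.
SAMPLE_ROWS = [
    {"Age": "25-34", "Condition": "Pain", "Date": "2020-01-01",
     "Drug": "drug a", "DrugId": 1, "EaseofUse": 5, "Effectiveness": 4,
     "Reviews": "Worked well.", "Satisfaction": 4, "Sex": "Female",
     "Sides": "Nausea", "UsefulCount": 2},
    {"Age": "45-54", "Condition": "High Blood Pressure", "Date": "2020-02-01",
     "Drug": "drug b", "DrugId": 2, "EaseofUse": 4, "Effectiveness": 3,
     "Reviews": None, "Satisfaction": 3, "Sex": "Male",
     "Sides": "Dizziness", "UsefulCount": 0},
    {"Age": "65-74", "Condition": "Pain", "Date": "2020-03-01",
     "Drug": "drug a", "DrugId": 1, "EaseofUse": 3, "Effectiveness": 2,
     "Reviews": "Mild headache.", "Satisfaction": 2, "Sex": "Female",
     "Sides": "Nausea", "UsefulCount": 7},
]

# Fields flagged "(ignore)" in the feature list.
IGNORED_FIELDS = {"DrugId"}


def prepare_reviews(rows):
    """Drop ignored fields and skip rows whose review text is missing or blank."""
    prepared = []
    for row in rows:
        review = row.get("Reviews")
        if review is None or not review.strip():
            continue  # assumption: a blank review is treated like a missing one
        prepared.append(
            {name: value for name, value in row.items()
             if name not in IGNORED_FIELDS}
        )
    return prepared


prepared_rows = prepare_reviews(SAMPLE_ROWS)
print(len(prepared_rows), "rows kept,", len(prepared_rows[0]), "fields each")
```

Dropping rows is only one option; with 42 missing reviews out of 362806 rows, replacing the missing text with an empty string would be an equally reasonable choice if every rating must be retained.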

19 properties

Number of instances (rows) of the dataset: 362806
Number of attributes (columns) of the dataset: 11
Number of distinct values of the target attribute (if it is nominal): not reported
Number of missing values in the dataset: 42
Number of instances with at least one value missing: 42
Number of numeric attributes: 4
Number of nominal attributes: 0
Number of attributes divided by the number of instances: 0
Percentage of numeric attributes: 36.36
Percentage of instances belonging to the most frequent class: not reported
Percentage of nominal attributes: 0
Number of instances belonging to the most frequent class: not reported
Percentage of instances belonging to the least frequent class: not reported
Number of instances belonging to the least frequent class: not reported
Number of binary attributes: 0
Percentage of binary attributes: 0
Percentage of instances having missing values: 0.01
Average class difference between consecutive instances: not reported
Percentage of missing values: 0
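The ratio and percentage properties above can be cross-checked against the raw counts. The sketch below recomputes them from the reported number of instances, attributes, numeric attributes, and missing values. The denominators and the rounding to two decimals are assumptions about how the page derives its figures, and the names `pct` and `derived` are hypothetical.

```python
# Counts as reported in the properties list.
N_INSTANCES = 362806
N_ATTRIBUTES = 11
N_NUMERIC = 4
N_MISSING_VALUES = 42
N_ROWS_WITH_MISSING = 42


def pct(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100.0 * part / whole, 2)


# Assumption: each property uses the obvious denominator and two-decimal rounding.
derived = {
    "attributes_per_instance": round(N_ATTRIBUTES / N_INSTANCES, 2),
    "pct_numeric_attributes": pct(N_NUMERIC, N_ATTRIBUTES),
    "pct_instances_with_missing": pct(N_ROWS_WITH_MISSING, N_INSTANCES),
    "pct_missing_values": pct(N_MISSING_VALUES, N_INSTANCES * N_ATTRIBUTES),
}

for name, value in derived.items():
    print(f"{name}: {value}")
```

Under these assumptions the recomputed values agree with the reported 0, 36.36, 0.01, and 0. The agreement also suggests that the attribute count of 11 excludes the ignored DrugId field, since 4 numeric attributes out of 12 would not give 36.36.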

0 tasks
