OpenML
Updated-Wine-Enthusiast-Reviews

Updated-Wine-Enthusiast-Reviews

active ARFF CC0: Public Domain Visibility: public Uploaded 24-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context I built it on https://www.kaggle.com/zynicide/wine-reviews updating the scraper fetching fresh data. Content All wine reviews from winemag.com for 2017-2020 years. Duplicates are cleared, vintage parsed from title. NV means non-vintage. Acknowledgments Thanks Zack Thoutt for the base scraper and original data

15 features

countrystring43 unique values
5 missing
descriptionstring81090 unique values
0 missing
designationstring29478 unique values
21321 missing
pointsnumeric21 unique values
0 missing
pricenumeric336 unique values
4647 missing
provincestring366 unique values
5 missing
region_1string1049 unique values
12913 missing
region_2string19 unique values
49894 missing
taster_namestring19 unique values
150 missing
taster_photostring19 unique values
150 missing
taster_twitter_handlestring16 unique values
1076 missing
titlestring80460 unique values
0 missing
varietystring708 unique values
0 missing
vintagestring56 unique values
0 missing
winerystring12832 unique values
0 missing

19 properties

81115
Number of instances (rows) of the dataset.
15
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
90161
Number of missing values in the dataset.
59110
Number of instances with at least one value missing.
2
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
13.33
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
72.87
Percentage of instances having missing values.
Average class difference between consecutive instances.
7.41
Percentage of missing values.

0 tasks

Define a new task