Data
Edmunds-car-review

Edmunds-car-review

active ARFF CC BY-SA 4.0 Visibility: public Uploaded 23-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context Started this for my final year project improved on it during quarantine. Content A mix of used and new car reviews from the year 2000 to 2019 of various brands from edmunds.com. I did not scrape all brands, just the popular ones in the US. Added a notebook for walkthrough Inspiration Did not have a proper car review dataset for my final year project, so I made one

8 features

Companystring46 unique values
0 missing
Modelstring897 unique values
0 missing
Yearnumeric21 unique values
0 missing
Reviewerstring212266 unique values
108 missing
Datestring7289 unique values
17 missing
Titlestring222588 unique values
42 missing
Ratingnumeric6 unique values
0 missing
Reviewstring296488 unique values
4 missing

19 properties

299045
Number of instances (rows) of the dataset.
8
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
171
Number of missing values in the dataset.
171
Number of instances with at least one value missing.
2
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
25
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
0.06
Percentage of instances having missing values.
Average class difference between consecutive instances.
0.01
Percentage of missing values.

0 tasks

Define a new task