Data
Goodreads-Books---31-Features

Goodreads-Books---31-Features

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Onur Yildirim
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context The official Goodread's API limits retrievable data, so I decided to scrape the actual HTTP pages and grab additional details on each book. Content Books are scraped from a list titles the "Best Books Ever" which can be found here https://www.goodreads.com/list/show/1.Best_Books_Ever Acknowledgements Thanks to Goodreads for housing the data.

31 features

idnumeric52199 unique values
0 missing
titlestring48145 unique values
224 missing
linkstring52199 unique values
0 missing
seriesstring22477 unique values
28880 missing
cover_linkstring51592 unique values
607 missing
authorstring27233 unique values
6 missing
author_linkstring23009 unique values
0 missing
rating_countnumeric19267 unique values
0 missing
review_countnumeric5627 unique values
0 missing
average_ratingnumeric262 unique values
0 missing
five_star_ratingsnumeric12808 unique values
0 missing
four_star_ratingsnumeric12399 unique values
0 missing
three_star_ratingsnumeric9597 unique values
0 missing
two_star_ratingsnumeric4904 unique values
0 missing
one_star_ratingsnumeric3118 unique values
0 missing
number_of_pagesnumeric1367 unique values
2330 missing
date_publishedstring9120 unique values
860 missing
publisherstring10369 unique values
3935 missing
original_titlestring36190 unique values
13266 missing
genre_and_votesstring47301 unique values
2840 missing
isbnstring40316 unique values
11883 missing
isbn13string39507 unique values
12692 missing
asinstring5236 unique values
46963 missing
settingsstring5335 unique values
40687 missing
charactersstring12260 unique values
38509 missing
awardsstring9148 unique values
41574 missing
amazon_redirect_linkstring52199 unique values
0 missing
worldcat_redirect_linkstring48214 unique values
3985 missing
recommended_booksstring48047 unique values
3994 missing
books_in_seriesstring20318 unique values
30121 missing
descriptionstring49164 unique values
2595 missing

19 properties

52199
Number of instances (rows) of the dataset.
31
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
285951
Number of missing values in the dataset.
52199
Number of instances with at least one value missing.
10
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
32.26
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
100
Percentage of instances having missing values.
Average class difference between consecutive instances.
17.67
Percentage of missing values.

0 tasks

Define a new task