Data
Goodreads-Computer-Books

Goodreads-Computer-Books

active ARFF CC0: Public Domain Visibility: public Uploaded 24-03-2022 by Elif Ceren Gok
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context The reason for creating this dataset is the requirement of a good clean dataset of computer books. I had searched for datasets on books in Kaggle and I found out that while most of the datasets had a good amount of books listed, there were either major columns missing or grossly unclean data. I mean, you can't determine how good a book is just from a few text reviews. So I collected this data from the Goodreads website from the "Computer" category to help people who are like this type of book. Acknowledgements This data was entirely scraped via the Webdriver Inspiration The reason behind creating this dataset is pretty straightforward, I'm listing the books for all who need computer books, irrespective of the language and publication and all of that. So go ahead and use it to your liking, find out what book you should be reading next, all possible approaches to exploring this dataset are welcome. I started creating this dataset on Jan 18, 2021, and intend to update it frequently. P.S. If you like this, please don't forget to give an upvote! Notes The missing values are imputed in this data by the creator.

9 features

Avg_Rating (target)numeric186 unique values
0 missing
Book_Id (ignore)numeric1234 unique values
0 missing
book_Titlestring1115 unique values
0 missing
Author_Namestring987 unique values
0 missing
ratings_countnumeric279 unique values
0 missing
Publish_yearnumeric58 unique values
0 missing
Editionnumeric41 unique values
0 missing
Pages_nonumeric500 unique values
0 missing
Book_languagestring4 unique values
0 missing
Reviewsnumeric96 unique values
0 missing

19 properties

1234
Number of instances (rows) of the dataset.
9
Number of attributes (columns) of the dataset.
0
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
6
Number of numeric attributes.
0
Number of nominal attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
0
Percentage of missing values.
0.51
Average class difference between consecutive instances.
66.67
Percentage of numeric attributes.
0.01
Number of attributes divided by the number of instances.
0
Percentage of nominal attributes.
Percentage of instances belonging to the most frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.

0 tasks

Define a new task