OpenML

JavaScript is required to properly view the contents of this page!

Explore
- Data
- Task
- Flow
- Run
- Study
- Task type
- Measure
- People
Help
Blog
Contact
Please cite us

Top-10000-Movies-Based-On-Ratings

active ARFF CC0: Public Domain Visibility: public Uploaded 24-03-2022 by Elif Ceren Gok
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes

Issue	#Downvotes for this reason	By

Loading wiki

Help us complete this description Edit

Context People love movies because: It takes you on a journey. Its an escape from reality. Being a vivid movie watcher I always get amazed how sites like Netflix and Hotstar always exactly suggest the next movie I planned to watch on the back of mind. I researched a lot and decide to come up with something similar to that, so I decided to start with extracting a huge dataset of movies people love to watch and apply analysis on it. Content The dataset contains the following information: Popularity: How popular the movie is. Vote Count: Number of people voted. Title: Name of the movie. Vote Average: Average number of people voted to watch this movie. Overview: Brief overview of what movie is (storyline). Release Date: Date when the movie was released. Inspiration I would love to get the following answer: Relationship between popularity and average vote count? Which machine algorithm would be effective to find relationship between movies?

6 features

Popularity	numeric	7004 unique values 0 missing
Vote_Count	numeric	2768 unique values 0 missing
Titile	string	9672 unique values 0 missing
Vote_Average	numeric	70 unique values 0 missing
Overview	string	9969 unique values 25 missing
Release_Date	string	6139 unique values 19 missing