Popular-Movies-of-IMDb

Status: active. Format: ARFF. License: CC0: Public Domain. Visibility: public. Uploaded 23-03-2022 by Onur Yildirim.
  • Computer Systems Machine Learning


Introduction

TMDB.org is a crowd-sourced movie information database used by many film-related consoles, sites and apps, such as XBMC, MythTV and Plex. Dozens of media managers, mobile apps and social sites make use of its API. TMDb lists some 80,000 films at the time of writing, which is considerably fewer than IMDb. While not as complete as IMDb, it holds extensive information for most popular/Hollywood films.

This dataset of the 10,000 most popular movies across the world was fetched through TMDb's read API. The free API lets developers and their teams programmatically fetch and use TMDb's data. The API is free to use as long as you attribute TMDb as the source of the data and/or images. TMDb also updates the API from time to time.

The data was fetched with exception handling, so it contains some null values where fields are missing in the TMDb database. Still, dealing with missing values is good practice for a young analyst. Hey analysts, are you all excited?
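
Since the description notes that some fields are null, a first step is usually to count and handle them. The following is a minimal sketch using only the Python standard library. The sample rows, the helper names (`is_missing`, `count_missing`, `fill_missing`) and the placeholder text are assumptions made for illustration; they are not part of the dataset or of any TMDb tooling.

```python
from typing import Any, Dict, List

# Hypothetical sample rows that mimic the dataset's columns.
# The values are illustrative only and are not taken from the real data.
rows: List[Dict[str, Any]] = [
    {"title": "Movie A", "overview": "A heist goes wrong.",
     "original_language": "en", "vote_count": 1200, "vote_average": 7.1},
    {"title": "Movie B", "overview": None,
     "original_language": "fr", "vote_count": 85, "vote_average": 6.4},
    {"title": "Movie C", "overview": "   ",
     "original_language": "ja", "vote_count": 310, "vote_average": 7.8},
]


def is_missing(value: Any) -> bool:
    """Treat None and blank strings as missing; numbers are never missing."""
    return value is None or (isinstance(value, str) and value.strip() == "")


def count_missing(data: List[Dict[str, Any]], column: str) -> int:
    """Count the rows whose value in `column` is missing."""
    return sum(1 for row in data if is_missing(row.get(column)))


def fill_missing(data: List[Dict[str, Any]], column: str,
                 placeholder: str = "(no overview)") -> List[Dict[str, Any]]:
    """Return new rows with missing values in `column` replaced by `placeholder`."""
    return [
        {**row, column: placeholder if is_missing(row.get(column)) else row[column]}
        for row in data
    ]


cleaned = fill_missing(rows, "overview")
print(count_missing(rows, "overview"), count_missing(cleaned, "overview"))
```

The sketch returns new rows rather than modifying the input, so the original records stay available for comparison. Whether to fill, drop, or keep missing overviews depends on the analysis.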

6 features

Unnamed:_0 (numeric): 10000 unique values, 0 missing
title (string): 9686 unique values, 0 missing
overview (string): 9964 unique values, 30 missing
original_language (string): 52 unique values, 0 missing
vote_count (numeric): 2773 unique values, 0 missing
vote_average (numeric): 72 unique values, 0 missing

19 properties

Number of instances (rows) of the dataset: 10000
Number of attributes (columns) of the dataset: 6
Number of distinct values of the target attribute (if it is nominal): (no value listed)
Number of missing values in the dataset: 30
Number of instances with at least one value missing: 30
Number of numeric attributes: 3
Number of nominal attributes: 0
Number of attributes divided by the number of instances: 0
Percentage of numeric attributes: 50
Percentage of instances belonging to the most frequent class: (no value listed)
Percentage of nominal attributes: 0
Number of instances belonging to the most frequent class: (no value listed)
Percentage of instances belonging to the least frequent class: (no value listed)
Number of instances belonging to the least frequent class: (no value listed)
Number of binary attributes: 0
Percentage of binary attributes: 0
Percentage of instances having missing values: 0.3
Average class difference between consecutive instances: (no value listed)
Percentage of missing values: 0.05
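
The percentage properties above follow from the raw counts. The short sketch below reproduces that arithmetic; the variable names are illustrative, and the assumption that the ratio of attributes to instances is displayed rounded (hence the listed 0) is an inference from the numbers, not something the page states.

```python
# Counts taken from the properties listed above.
n_instances = 10_000
n_attributes = 6
n_numeric_attributes = 3
n_missing_values = 30
n_instances_with_missing = 30

# Share of rows with at least one missing value.
pct_instances_with_missing = 100 * n_instances_with_missing / n_instances

# Share of all cells (rows x columns) that are missing.
pct_missing_values = 100 * n_missing_values / (n_instances * n_attributes)

# Share of columns that are numeric.
pct_numeric_attributes = 100 * n_numeric_attributes / n_attributes

# Attributes per instance; assumed to be rounded for display on the page.
attributes_per_instance = n_attributes / n_instances

print(round(pct_instances_with_missing, 2))   # 0.3
print(round(pct_missing_values, 2))           # 0.05
print(round(pct_numeric_attributes, 2))       # 50.0
print(attributes_per_instance)                # 0.0006
```

All 30 missing values sit in the `overview` column, which is why the number of missing values equals the number of instances with a missing value.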
