IMDB_movie_1972-2019

Status: active. Format: ARFF. License: CC0: Public Domain. Visibility: public. Uploaded 24-03-2022 by Dustin Carrion.
  • Computer Systems Machine Learning


Context
The IMDB Movies Dataset contains information about 5834 movies. The information was scraped from IMDb for the purpose of creating a movie recommendation model. The data was preprocessed and cleaned to be ready for machine learning applications.

Content
Title, Year, Rating, Metascore, Votes, Description, Genre, Runtime (Minutes), Revenue (Millions), Actors, Director

12 features

Feature             Type     Unique values  Missing
Unnamed:_0          numeric  5285           0
Title               string   5707           0
Year                numeric  48             0
Rating              numeric  74             0
Metascore           numeric  97             29
Votes               numeric  5666           0
Description         string   5826           0
Genre               string   489            0
Runtime_(Minutes)   numeric  129            0
Revenue_(Millions)  numeric  3552           92
Actors              string   5789           0
Director            string   2733           0
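
The per-feature counts listed above (unique values and missing values) could be reproduced along the following lines. This is a minimal sketch, not the method the hosting site uses: the function name `summarize`, the toy rows, and the treatment of empty strings as missing are assumptions made for illustration. The `?` marker is the standard ARFF notation for a missing value.

```python
from collections import defaultdict

# "?" is the ARFF missing-value marker; treating "" as missing is an assumption.
MISSING_MARKERS = {"", "?"}

def summarize(rows):
    """Return {column: (unique_count, missing_count)} for a list of dict rows."""
    unique = defaultdict(set)
    missing = defaultdict(int)
    for row in rows:
        for column, value in row.items():
            if value is None or str(value).strip() in MISSING_MARKERS:
                missing[column] += 1
            else:
                unique[column].add(value)
    columns = list(rows[0].keys()) if rows else []
    return {c: (len(unique[c]), missing[c]) for c in columns}

# Hypothetical toy rows, NOT taken from the dataset.
toy_rows = [
    {"Title": "Film A", "Year": 1999, "Metascore": "73"},
    {"Title": "Film B", "Year": 1999, "Metascore": "?"},
    {"Title": "Film C", "Year": 2005, "Metascore": None},
]
summary = summarize(toy_rows)
```

Applied to the real file, the same function would be fed one dict per movie, for example from `csv.DictReader` on a CSV export of the data.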

19 properties

Number of instances (rows) of the dataset: 5834
Number of attributes (columns) of the dataset: 12
Number of distinct values of the target attribute (if it is nominal): (no value shown)
Number of missing values in the dataset: 121
Number of instances with at least one value missing: 99
Number of numeric attributes: 7
Number of nominal attributes: 0
Number of attributes divided by the number of instances: 0
Percentage of numeric attributes: 58.33
Percentage of instances belonging to the most frequent class: (no value shown)
Percentage of nominal attributes: 0
Number of instances belonging to the most frequent class: (no value shown)
Percentage of instances belonging to the least frequent class: (no value shown)
Number of instances belonging to the least frequent class: (no value shown)
Number of binary attributes: 0
Percentage of binary attributes: 0
Percentage of instances having missing values: 1.7
Average class difference between consecutive instances: (no value shown)
Percentage of missing values: 0.17
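
The percentage properties above appear to follow directly from the raw counts. The sketch below recomputes them as a consistency check. The formulas are inferred from the property descriptions rather than taken from the site's implementation, and the variable names are illustrative.

```python
# Raw counts as listed in the properties above.
n_instances = 5834
n_attributes = 12
n_numeric = 7
n_missing_values = 121
n_instances_with_missing = 99

# Assumed formulas, inferred from the property descriptions.
pct_numeric = round(100 * n_numeric / n_attributes, 2)
pct_instances_missing = round(100 * n_instances_with_missing / n_instances, 2)
# Missing cells as a share of all cells (rows x columns).
pct_missing_values = round(100 * n_missing_values / (n_instances * n_attributes), 2)
attrs_per_instance = round(n_attributes / n_instances, 2)

print(pct_numeric)            # 58.33
print(pct_instances_missing)  # 1.7
print(pct_missing_values)     # 0.17
print(attrs_per_instance)     # 0.0
```

The 121 missing values also match the per-feature counts: 29 for Metascore plus 92 for Revenue_(Millions).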

0 tasks
