CSM

Status: active. Format: ARFF. License: Public Domain (CC0). Visibility: public. Uploaded 19-04-2020 by Rafael Gomes Mantovani.
Conventional and Social Media Movies (CSM) dataset, covering movies from 2014 and 2015. It contains 12 features, categorized as conventional features, collected from movie databases on the Web, and social media features (YouTube, Twitter).

13 features

Feature                  Type     Unique values  Missing
Movie (row identifier)   nominal  231            0
Year                     numeric  2              0
Ratings                  numeric  45             0
Genre                    numeric  11             0
Gross                    numeric  215            0
Budget                   numeric  104            1
Screens                  numeric  200            10
Sequel                   numeric  7              0
Sentiment                numeric  36             0
Views                    numeric  231            0
Likes                    numeric  227            0
Dislikes                 numeric  203            0
Comments                 numeric  213            0
Aggregate.Followers      numeric  190            35

19 properties

Number of instances (rows) of the dataset: 231
Number of attributes (columns) of the dataset: 13
Number of distinct values of the target attribute (if it is nominal): no value listed
Number of missing values in the dataset: 46
Number of instances with at least one value missing: 44
Number of numeric attributes: 13
Number of nominal attributes: 0
Number of attributes divided by the number of instances: 0.06
Percentage of numeric attributes: 100
Percentage of instances belonging to the most frequent class: no value listed
Percentage of nominal attributes: 0
Number of instances belonging to the most frequent class: no value listed
Percentage of instances belonging to the least frequent class: no value listed
Number of instances belonging to the least frequent class: no value listed
Number of binary attributes: 0
Percentage of binary attributes: 0
Percentage of instances having missing values: 19.05
Average class difference between consecutive instances: no value listed
Percentage of missing values: 1.53
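
The derived properties above can be cross-checked against the per-feature counts on this page. The following is a minimal sketch in Python; the variable names are illustrative, and the count of instances with at least one missing value (44) is taken from the properties list, since it cannot be derived from the per-feature counts alone.

```python
# Per-feature missing-value counts as listed in the feature list on this page.
# Features not included here are listed with 0 missing values.
missing_per_feature = {"Budget": 1, "Screens": 10, "Aggregate.Followers": 35}

n_instances = 231         # rows
n_features = 13           # columns, including the Movie row identifier
n_rows_with_missing = 44  # taken from the properties list, not derived here

# Total number of missing cells in the dataset.
total_missing = sum(missing_per_feature.values())

# Missing cells as a percentage of all cells (rows x columns).
pct_missing_values = round(100 * total_missing / (n_instances * n_features), 2)

# Rows with at least one missing value as a percentage of all rows.
pct_rows_with_missing = round(100 * n_rows_with_missing / n_instances, 2)

# Attributes divided by instances.
features_per_instance = round(n_features / n_instances, 2)

print(total_missing, pct_missing_values, pct_rows_with_missing, features_per_instance)
```

The results agree with the listed values: 46 missing values, 1.53 percent of values missing, 19.05 percent of instances with missing values, and a ratio of 0.06.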

8 tasks

0 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: mean_absolute_error - target_feature: Likes
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
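
The first task above pairs 10-fold cross-validation with mean absolute error on the Likes target. As a rough illustration of what that procedure measures, here is a standard-library Python sketch. It uses a trivial baseline that predicts the training-fold mean, and the input values are synthetic placeholders, not CSM data. The function names are hypothetical, and the way folds are split and averaged here is an assumption that may differ from how OpenML does it.

```python
import random
from statistics import mean


def k_fold_indices(n, k=10, seed=0):
    """Shuffle range(n) and split it into k folds of nearly equal size."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between targets and predictions."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))


def cross_validated_mae(y, k=10, seed=0):
    """Mean of per-fold MAE for a baseline that predicts the training mean."""
    fold_errors = []
    for test_idx in k_fold_indices(len(y), k, seed):
        held_out = set(test_idx)
        train = [y[i] for i in range(len(y)) if i not in held_out]
        prediction = mean(train)  # the baseline ignores all input features
        test = [y[i] for i in test_idx]
        fold_errors.append(mean_absolute_error(test, [prediction] * len(test)))
    return mean(fold_errors)


# Synthetic placeholder targets (NOT values from the CSM dataset); 231 matches
# the number of instances listed on this page.
toy_likes = [float(v) for v in range(231)]
print(cross_validated_mae(toy_likes))
```

A real run would replace the baseline with a regression model trained on the other features and would need to handle the missing values in Budget, Screens and Aggregate.Followers first.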