Data

JavaScript is required to properly view the contents of this page!

Explore
- Data
- Task
- Flow
- Run
- Study
- Task type
- Measure
- People
Help
Blog
Contact
Please cite us

CSM

CSM

active ARFF Public Domain (CC0) Visibility: public Uploaded 19-04-2020 by Rafael Gomes Mantovani
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes

Issue	#Downvotes for this reason	By

Loading wiki

Help us complete this description Edit

Conventional and Social Media Movies (CSM) - Dataset 2014 and 2015 Data Set 12 features categorized as conventional and social media features. Both conventional features, collected from movies databases on Web as well as social media features(YouTube,Twitter).

13 features

Movie (row identifier)	nominal	231 unique values 0 missing
Year	numeric	2 unique values 0 missing
Ratings	numeric	45 unique values 0 missing
Genre	numeric	11 unique values 0 missing
Gross	numeric	215 unique values 0 missing
Budget	numeric	104 unique values 1 missing
Screens	numeric	200 unique values 10 missing
Sequel	numeric	7 unique values 0 missing
Sentiment	numeric	36 unique values 0 missing
Views	numeric	231 unique values 0 missing
Likes	numeric	227 unique values 0 missing
Dislikes	numeric	203 unique values 0 missing
Comments	numeric	213 unique values 0 missing
Aggregate.Followers	numeric	190 unique values 35 missing

Show all 13 features

19 properties

NumberOfInstances

231

Number of instances (rows) of the dataset.

NumberOfFeatures

13

Number of attributes (columns) of the dataset.

NumberOfClasses

Number of distinct values of the target attribute (if it is nominal).

NumberOfMissingValues

46

Number of missing values in the dataset.

NumberOfInstancesWithMissingValues

44

Number of instances with at least one value missing.

NumberOfNumericFeatures

13

Number of numeric attributes.

NumberOfSymbolicFeatures

0

Number of nominal attributes.

PercentageOfBinaryFeatures

0

Percentage of binary attributes.

PercentageOfInstancesWithMissingValues

19.05

Percentage of instances having missing values.

PercentageOfMissingValues

1.53

Percentage of missing values.

AutoCorrelation

Average class difference between consecutive instances.

PercentageOfNumericFeatures

100

Percentage of numeric attributes.

0.06

Number of attributes divided by the number of instances.

PercentageOfSymbolicFeatures

0

Percentage of nominal attributes.

MajorityClassPercentage

Percentage of instances belonging to the most frequent class.

MajorityClassSize

Number of instances belonging to the most frequent class.

MinorityClassPercentage

Percentage of instances belonging to the least frequent class.

MinorityClassSize

Number of instances belonging to the least frequent class.

NumberOfBinaryFeatures

0

Number of binary attributes.

Show all 19 properties

8 tasks

Supervised Regression on CSM

0 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: mean_absolute_error - target_feature: Likes

Clustering on CSM

0 runs - estimation_procedure: 50 times Clustering

Clustering on CSM

0 runs - estimation_procedure: 50 times Clustering

Clustering on CSM

0 runs - estimation_procedure: 50 times Clustering

Clustering on CSM

0 runs - estimation_procedure: 50 times Clustering

Clustering on CSM

0 runs - estimation_procedure: 50 times Clustering

Clustering on CSM

0 runs - estimation_procedure: 50 times Clustering

Clustering on CSM

0 runs - estimation_procedure: 50 times Clustering

Define a new task