geographical_origin_of_music

active ARFF CC BY 4.0 Visibility: public Uploaded 22-12-2022 by Sebastian Fischer
Data Description
Instances in this dataset contain audio features extracted from 1059 wave files. The task associated with the data is to predict the geographical origin of music.

The dataset was built from a personal collection of 1059 tracks covering 33 countries/areas. The music used is traditional, ethnic or 'world' only, as classified by the publishers of the product on which it appears. Western music is not included because its influence is global; what we seek are the aspects of music that most influence location. Thus, being able to specify a location with strong influence on the music is central.

The geographical location of origin was collected manually from the CD sleeve notes, and when this information was inadequate we searched other information sources. The location data is limited in precision to the country of origin. The country of origin was determined by the artist's or artists' main country/area of residence. Any track with an ambiguous origin is not included. We have taken the position of each country's capital city (or the province of the area), given by latitude and longitude, as the absolute point of origin.

The program MARSYAS was used to extract audio features from the wave files. We used the default MARSYAS settings in single vector format (68 features) to estimate the performance with basic timbral information covering the entire length of each track. No feature weighting or pre-filtering was applied. All features were transformed to have a mean of 0 and a standard deviation of 1.

We also investigated the utility of adding chromatic attributes. These describe the notes of the scale being used, which is especially important as a distinguishing feature in geographical ethnomusicology. The chromatic features provided by MARSYAS are 12 per octave (Western tuning), but it may be possible to tell something from how similar to or different from Western tuning the music is.
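The transformation to mean 0 and standard deviation 1 mentioned above can be sketched as follows. This is a minimal illustration, not the authors' actual preprocessing: the function name `standardize` and the sample values are made up, and the use of the population standard deviation is an assumption, since the description does not say which form was used.

```python
import statistics


def standardize(column):
    """Rescale a list of numbers to mean 0 and standard deviation 1 (z-scores)."""
    mean = statistics.fmean(column)
    # Assumption: population standard deviation. The dataset description
    # does not state whether the population or sample form was used.
    std = statistics.pstdev(column)
    if std == 0:
        raise ValueError("a constant column cannot be standardized")
    return [(x - mean) / std for x in column]


# Made-up values for illustration; these are not taken from the dataset.
raw = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
z = standardize(raw)
print(z)
```

Because the published features are already standardized, a step like this would only be needed for new tracks, and in that case the mean and standard deviation should come from the training data rather than from the new tracks themselves.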
Attribute Description
The first 116 columns are audio features of the track, and the last two columns (*latitude*, *longitude*) give the origin of the music.
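The column layout described above can be separated into features and origin coordinates by name. The sketch below assumes each row is available as a dictionary keyed by the column names shown in the feature listing (`V1` to `V116`, `latitude`, `longitude`); the helper `split_row` and the example row are hypothetical.

```python
# Column names as they appear in the feature listing of this page.
FEATURE_NAMES = [f"V{i}" for i in range(1, 117)]  # V1 .. V116


def split_row(row):
    """Split one record into its 116 audio features and its (latitude, longitude)."""
    features = [row[name] for name in FEATURE_NAMES]
    origin = (row["latitude"], row["longitude"])
    return features, origin


# Hypothetical record with placeholder values, not a real row of the dataset.
example_row = {name: 0.0 for name in FEATURE_NAMES}
example_row["latitude"] = 10.0
example_row["longitude"] = 20.0

features, origin = split_row(example_row)
print(len(features), origin)
```

Selecting by name rather than by position avoids depending on column order: the attribute description places latitude and longitude last, while the feature listing on this page shows latitude first.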

117 features

latitude (target): numeric, 31 unique values, 0 missing
V1: numeric, 1054 unique values, 0 missing
V2: numeric, 1045 unique values, 0 missing
V3: numeric, 1055 unique values, 0 missing
V4: numeric, 1037 unique values, 0 missing
V5: numeric, 1058 unique values, 0 missing
V6: numeric, 1059 unique values, 0 missing
V7: numeric, 1059 unique values, 0 missing
V8: numeric, 1059 unique values, 0 missing
V9: numeric, 1059 unique values, 0 missing
V10: numeric, 1059 unique values, 0 missing
V11: numeric, 1059 unique values, 0 missing
V12: numeric, 1059 unique values, 0 missing
V13: numeric, 1059 unique values, 0 missing
V14: numeric, 1059 unique values, 0 missing
V15: numeric, 1059 unique values, 0 missing
V16: numeric, 1059 unique values, 0 missing
V17: numeric, 1058 unique values, 0 missing
V18: numeric, 746 unique values, 0 missing
V19: numeric, 746 unique values, 0 missing
V20: numeric, 746 unique values, 0 missing
V21: numeric, 746 unique values, 0 missing
V22: numeric, 746 unique values, 0 missing
V23: numeric, 746 unique values, 0 missing
V24: numeric, 746 unique values, 0 missing
V25: numeric, 746 unique values, 0 missing
V26: numeric, 746 unique values, 0 missing
V27: numeric, 746 unique values, 0 missing
V28: numeric, 746 unique values, 0 missing
V29: numeric, 746 unique values, 0 missing
V30: numeric, 1042 unique values, 0 missing
V31: numeric, 1037 unique values, 0 missing
V32: numeric, 1058 unique values, 0 missing
V33: numeric, 1029 unique values, 0 missing
V34: numeric, 1059 unique values, 0 missing
V35: numeric, 1058 unique values, 0 missing
V36: numeric, 1059 unique values, 0 missing
V37: numeric, 1057 unique values, 0 missing
V38: numeric, 1053 unique values, 0 missing
V39: numeric, 1058 unique values, 0 missing
V40: numeric, 1055 unique values, 0 missing
V41: numeric, 1059 unique values, 0 missing
V42: numeric, 1058 unique values, 0 missing
V43: numeric, 1058 unique values, 0 missing
V44: numeric, 1058 unique values, 0 missing
V45: numeric, 1058 unique values, 0 missing
V46: numeric, 1055 unique values, 0 missing
V47: numeric, 681 unique values, 0 missing
V48: numeric, 681 unique values, 0 missing
V49: numeric, 681 unique values, 0 missing
V50: numeric, 681 unique values, 0 missing
V51: numeric, 681 unique values, 0 missing
V52: numeric, 681 unique values, 0 missing
V53: numeric, 681 unique values, 0 missing
V54: numeric, 681 unique values, 0 missing
V55: numeric, 681 unique values, 0 missing
V56: numeric, 681 unique values, 0 missing
V57: numeric, 681 unique values, 0 missing
V58: numeric, 681 unique values, 0 missing
V59: numeric, 1041 unique values, 0 missing
V60: numeric, 1054 unique values, 0 missing
V61: numeric, 1056 unique values, 0 missing
V62: numeric, 997 unique values, 0 missing
V63: numeric, 1059 unique values, 0 missing
V64: numeric, 1059 unique values, 0 missing
V65: numeric, 1058 unique values, 0 missing
V66: numeric, 1059 unique values, 0 missing
V67: numeric, 1059 unique values, 0 missing
V68: numeric, 1058 unique values, 0 missing
V69: numeric, 1057 unique values, 0 missing
V70: numeric, 1057 unique values, 0 missing
V71: numeric, 1059 unique values, 0 missing
V72: numeric, 1059 unique values, 0 missing
V73: numeric, 1058 unique values, 0 missing
V74: numeric, 1059 unique values, 0 missing
V75: numeric, 1058 unique values, 0 missing
V76: numeric, 680 unique values, 0 missing
V77: numeric, 680 unique values, 0 missing
V78: numeric, 680 unique values, 0 missing
V79: numeric, 680 unique values, 0 missing
V80: numeric, 680 unique values, 0 missing
V81: numeric, 680 unique values, 0 missing
V82: numeric, 680 unique values, 0 missing
V83: numeric, 680 unique values, 0 missing
V84: numeric, 680 unique values, 0 missing
V85: numeric, 680 unique values, 0 missing
V86: numeric, 680 unique values, 0 missing
V87: numeric, 680 unique values, 0 missing
V88: numeric, 1037 unique values, 0 missing
V89: numeric, 1046 unique values, 0 missing
V90: numeric, 1052 unique values, 0 missing
V91: numeric, 932 unique values, 0 missing
V92: numeric, 1059 unique values, 0 missing
V93: numeric, 1058 unique values, 0 missing
V94: numeric, 1058 unique values, 0 missing
V95: numeric, 1055 unique values, 0 missing
V96: numeric, 1057 unique values, 0 missing
V97: numeric, 1056 unique values, 0 missing
V98: numeric, 1058 unique values, 0 missing
V99: numeric, 1055 unique values, 0 missing
V100: numeric, 1057 unique values, 0 missing
V101: numeric, 1056 unique values, 0 missing
V102: numeric, 1058 unique values, 0 missing
V103: numeric, 1054 unique values, 0 missing
V104: numeric, 1052 unique values, 0 missing
V105: numeric, 621 unique values, 0 missing
V106: numeric, 621 unique values, 0 missing
V107: numeric, 621 unique values, 0 missing
V108: numeric, 621 unique values, 0 missing
V109: numeric, 621 unique values, 0 missing
V110: numeric, 621 unique values, 0 missing
V111: numeric, 621 unique values, 0 missing
V112: numeric, 621 unique values, 0 missing
V113: numeric, 621 unique values, 0 missing
V114: numeric, 621 unique values, 0 missing
V115: numeric, 621 unique values, 0 missing
V116: numeric, 621 unique values, 0 missing
longitude (ignore): numeric, 33 unique values, 0 missing

19 properties

Number of instances (rows) of the dataset: 1059
Number of attributes (columns) of the dataset: 117
Number of distinct values of the target attribute (if it is nominal): 0
Number of missing values in the dataset: 0
Number of instances with at least one value missing: 0
Number of numeric attributes: 117
Number of nominal attributes: 0
Number of instances belonging to the most frequent class: not reported
Percentage of instances belonging to the least frequent class: not reported
Number of instances belonging to the least frequent class: not reported
Number of binary attributes: 0
Percentage of binary attributes: 0
Percentage of instances having missing values: 0
Average class difference between consecutive instances: -8.52
Percentage of missing values: 0
Number of attributes divided by the number of instances: 0.11
Percentage of numeric attributes: 100
Percentage of instances belonging to the most frequent class: not reported
Percentage of nominal attributes: 0
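Two of the derived properties above follow directly from the reported counts. The short check below recomputes them; the variable names are illustrative only, and the rounding to two decimals is an assumption about how the page displays the ratio.

```python
# Counts as reported in the properties list.
n_instances = 1059
n_attributes = 117
n_numeric = 117

# Number of attributes divided by the number of instances.
ratio = n_attributes / n_instances
# Percentage of numeric attributes.
pct_numeric = 100 * n_numeric / n_attributes

print(round(ratio, 2))  # 0.11
print(pct_numeric)      # 100.0
```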

1 task

0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: latitude
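To make the estimation procedure concrete, the sketch below splits the 1059 instance indices into 10 folds. It only illustrates the general idea of 10-fold cross-validation: the function `k_fold_indices`, the shuffling, and the seed are assumptions, and the actual train/test splits used by the task are defined by OpenML, not by this code.

```python
import random


def k_fold_indices(n_instances, k=10, seed=0):
    """Yield (train, test) index lists for k-fold cross-validation."""
    indices = list(range(n_instances))
    # Shuffle with a fixed seed so the split is reproducible (illustrative choice).
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into k folds of near-equal size.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


splits = list(k_fold_indices(1059, k=10))
fold_sizes = [len(test) for _, test in splits]
print(fold_sizes)
```

With 1059 instances, nine folds hold 106 instances and one holds 105. Each instance appears in exactly one test fold, so a model trained on the remaining nine folds is always evaluated on tracks it has not seen, with *latitude* as the value to predict.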