Data
BTS-Lyrics

BTS-Lyrics

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Onur Yildirim
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context The dataset was collated as a casual project using data from Genius and Big Hit. Currently contains 18 albums (check section "Albums in dataset" below for more details). Columns id (int) : sequential numerical id to uniquely identify track album_title (string) : title of the album eng_ album_title (string) : title of album without non-english characters album_rd (string) : date that album was released in isoformat (YYYY-MM-DD) album_seq (int) : sequence of track in album track_title (string) : title of the track kor_ track_ title (string) : title of the track with korean characters eng_ track_ title (string) : title of the track without non-english characters lyrics (string) : english translated lyrics of the track (from genius) will be empty for instrumentals hidden_track (boolean) : indicates whether the track is a hidden track remix (boolean) : indicates whether the track is a remix featured (string) : indicates who is featured in the track performed_by (string) : indicates who performed the track multiple individuals seperated with ";" (e.g. "RM;SUGA;J-HOPE" would mean that the track is performed by RM, SUGA and J-HOPE) featured artists will not be reflected in this column repackaged (boolean) : indicates if the track was already released in an earlier album language (string) : language of track KOR: Korean ENG: English NA: Not Applicable (e.g. instrumentals) Other Acknowledgements Banner and Profile Image Inspiration Taylor Swift Song Lyrics from all the albums Planned Improvements adding Japanese albums

16 features

idnumeric231 unique values
0 missing
album_titlestring19 unique values
0 missing
eng_album_titlestring18 unique values
5 missing
album_rdstring19 unique values
0 missing
album_seqnumeric27 unique values
0 missing
track_titlestring177 unique values
0 missing
kor_track_titlestring18 unique values
198 missing
eng_track_titlestring177 unique values
0 missing
lyricsstring167 unique values
7 missing
hidden_tracknominal2 unique values
0 missing
remixnominal2 unique values
0 missing
featuredstring6 unique values
222 missing
performed_bystring14 unique values
3 missing
repackagednominal2 unique values
0 missing
langstring2 unique values
4 missing
has_full_vernominal2 unique values
0 missing

19 properties

231
Number of instances (rows) of the dataset.
16
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
439
Number of missing values in the dataset.
227
Number of instances with at least one value missing.
2
Number of numeric attributes.
4
Number of nominal attributes.
0.07
Number of attributes divided by the number of instances.
12.5
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
25
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
4
Number of binary attributes.
25
Percentage of binary attributes.
98.27
Percentage of instances having missing values.
Average class difference between consecutive instances.
11.88
Percentage of missing values.

0 tasks

Define a new task