Al_Hilal-Archive-Scores

Status: active. Format: ARFF. License: CC0: Public Domain. Visibility: public. Uploaded 23-03-2022 by Elif Ceren Gok.
  • Machine Learning Manufacturing
Context
Al Hilal is one of the most successful football clubs in Saudi Arabia, the Middle East and Asia. This dataset was a web scraping exercise in which I collected the archive of the club's results. It includes all matches from 1996 to 2019 and many important matches from 1961 to 1995. The data was scraped from the Koora website, one of the oldest Arabic websites specializing in football news and match scores.

Content
The dataset includes a home team column, which contains the team hosting the match; an away team column, which contains the team that came to play against the home team; and, between them, the score of the match. Next come the date on which the game was played and the winning team of the match. The second-to-last column contains the competition the game was part of, and the last column is the stage of that competition. In the home and away columns, when the game is played in a local competition the team name is written without the country. When the game is played in an international competition the team name is followed by the team's country, because teams from different countries can share the same name. All columns are of object type except the date column, which is of datetime type. The dataset contains no null values except in the stage column, where a null value means either that the match had no stage (as in friendly matches) or that the stage was not provided by the website.

Acknowledgements
Thanks to General Assembly Misk Academy for the immersive data science course, which taught many skills and methods, one of which is the web scraping used in this project. Thanks and appreciation go to my instructors, who are always there to help and dedicate a lot of time to ensuring that all students master all skills.

Inspiration
Which years were the most successful? In which leagues did the club score the most points? Which clubs has Al Hilal beaten most often, and which teams have defeated Al Hilal over the years? Could a model predict the results of Al Hilal's future matches? In which months does Al Hilal win most often, and in which does it lose most often? Many other explorations can be done.
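Some of the questions above can be approached with simple counting over the date and wining_team columns. The following is a minimal sketch only: the sample rows, the spelling "Al Hilal", the use of parsed date objects, and the helper names are all hypothetical, and the real values in the dataset may be written differently.

```python
from collections import Counter
from datetime import date

CLUB = "Al Hilal"  # assumed spelling; check the actual values in the data

# Hypothetical rows for illustration only, not taken from the dataset.
matches = [
    {"home_team": "Al Hilal", "away_team": "Team A",
     "date": date(2018, 3, 10), "wining_team": "Al Hilal"},
    {"home_team": "Team B", "away_team": "Al Hilal",
     "date": date(2018, 9, 2), "wining_team": "Al Hilal"},
    {"home_team": "Al Hilal", "away_team": "Team A",
     "date": date(2019, 3, 21), "wining_team": "Team A"},
    {"home_team": "Team A (Country X)", "away_team": "Al Hilal (Saudi Arabia)",
     "date": date(2019, 5, 7), "wining_team": "Al Hilal (Saudi Arabia)"},
]

def is_club(name, club=CLUB):
    # Prefix match, so a country appended in international matches does not
    # break the comparison. Caveat: this would also match any other club
    # whose name starts with the same words.
    return name.startswith(club)

def opponent(row, club=CLUB):
    # The opponent is whichever side of the fixture is not the club.
    return row["away_team"] if is_club(row["home_team"], club) else row["home_team"]

def wins_per_year(rows, club=CLUB):
    # Count matches per calendar year where the club is the winning team.
    return Counter(r["date"].year for r in rows if is_club(r["wining_team"], club))

def wins_by_month(rows, club=CLUB):
    # Count the club's wins per month number (1 = January).
    return Counter(r["date"].month for r in rows if is_club(r["wining_team"], club))

def defeats_by_opponent(rows, club=CLUB):
    # Count losses per opponent: rows where the winner is the opponent.
    return Counter(opponent(r, club) for r in rows
                   if r["wining_team"] == opponent(r, club))
```

With the real data, wins_per_year(rows).most_common(3) would list the three years with the most wins, provided the date column has first been parsed into date objects.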

8 features

Name          Type      Unique values   Missing
Unnamed:_0    numeric   1363            0
home_team     string    162             0
scores        string    68              0
away_team     string    199             0
date          string    1346            0
wining_team   string    102             0
competiton    string    69              0
stage         string    116             219

19 properties

Value   Property
1363    Number of instances (rows) of the dataset.
8       Number of attributes (columns) of the dataset.
-       Number of distinct values of the target attribute (if it is nominal).
219     Number of missing values in the dataset.
219     Number of instances with at least one value missing.
1       Number of numeric attributes.
0       Number of nominal attributes.
0.01    Number of attributes divided by the number of instances.
12.5    Percentage of numeric attributes.
-       Percentage of instances belonging to the most frequent class.
0       Percentage of nominal attributes.
-       Number of instances belonging to the most frequent class.
-       Percentage of instances belonging to the least frequent class.
-       Number of instances belonging to the least frequent class.
0       Number of binary attributes.
0       Percentage of binary attributes.
16.07   Percentage of instances having missing values.
-       Average class difference between consecutive instances.
2.01    Percentage of missing values.
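The derived percentages above can be reproduced from the raw counts. The sketch below assumes that the percentage of missing values is taken over all cells (instances times attributes) and that values are rounded to two decimals. Both assumptions are inferred from the reported numbers, not stated by the page.

```python
# Raw counts as listed in the properties above.
instances = 1363
attributes = 8
missing_values = 219
instances_with_missing = 219
numeric_attributes = 1

# Attributes divided by instances.
attrs_per_instance = round(attributes / instances, 2)

# Share of attributes that are numeric, in percent.
pct_numeric = round(100 * numeric_attributes / attributes, 2)

# Share of rows with at least one missing value, in percent.
pct_instances_missing = round(100 * instances_with_missing / instances, 2)

# Share of all cells that are missing, in percent
# (assumed denominator: instances * attributes).
pct_missing_values = round(100 * missing_values / (instances * attributes), 2)

print(attrs_per_instance, pct_numeric, pct_instances_missing, pct_missing_values)
```

Because the missing-value count equals the count of rows with a missing value, each affected row is missing exactly one value, which matches the feature list where only stage has missing entries.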

0 tasks
