Data
Ipl_predictions2020

Ipl_predictions2020

active ARFF Database: Open Database, Contents: Original Authors Visibility: public Uploaded 24-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context Indian Premier League (IPL) is a Twenty20 cricket format league in India. It is usually played in April and May every year. As of 2019, the title sponsor of the game is Vivo. The league was founded by Board of Control for Cricket India (BCCI) in 2008. Content Data till Season 11 (2008 - 2019) matches.csv - Match by match data Acknowledgements Data source from 2008-2017 - CricSheet.org and Manas - Kaggle Data source for 2018-2019 - IPL T20 - Official website Inspiration Draw analysis, player/team performance, apply and learn statistical methods on real data Kernels -Statistics Summarizing quantitative data (mean, median, std. deviation, percentile, box plots etc.) Distributions - Cumulative relative frequency, Normal distribution, PDF, Z-score, empirical rule, binomial distribution, Bernoulli distribution Bivariate data - Scatter plot, Correlation, Covariance, Least square regression, R-Squared, Root mean square error

18 features

idnumeric756 unique values
0 missing
seasonnumeric12 unique values
0 missing
citystring32 unique values
7 missing
datestring546 unique values
0 missing
team1string15 unique values
0 missing
team2string15 unique values
0 missing
toss_winnerstring15 unique values
0 missing
toss_decisionstring2 unique values
0 missing
resultstring3 unique values
0 missing
dl_appliednumeric2 unique values
0 missing
winnerstring15 unique values
4 missing
win_by_runsnumeric89 unique values
0 missing
win_by_wicketsnumeric11 unique values
0 missing
player_of_matchstring226 unique values
4 missing
venuestring41 unique values
0 missing
umpire1string61 unique values
2 missing
umpire2string65 unique values
2 missing
umpire3string25 unique values
637 missing

19 properties

756
Number of instances (rows) of the dataset.
18
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
656
Number of missing values in the dataset.
638
Number of instances with at least one value missing.
5
Number of numeric attributes.
0
Number of nominal attributes.
0.02
Number of attributes divided by the number of instances.
27.78
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
84.39
Percentage of instances having missing values.
Average class difference between consecutive instances.
4.82
Percentage of missing values.

0 tasks

Define a new task