Google-Play-Store-Apps

Status: active
Format: ARFF
License: Database: Open Database, Contents: Database Contents
Visibility: public
Uploaded: 24-03-2022 by Elif Ceren Gok


Context
Google PlayStore App analytics. (1.1 Million + App Data)
Source: https://github.com/gauthamp10/Google-Playstore-Dataset

Content
I collected the data with the help of Python and Scrapy running on a Google Cloud VM instance. The data was collected in December 2020.

Acknowledgements
I couldn't have built this dataset without the help of Google's Cloud engine.

Inspiration
Took inspiration from https://www.kaggle.com/lava18/google-play-store-apps to build a big database for students and researchers.

24 features

Feature             Type      Unique values   Missing
App_Name            string    2055998         30479
App_Id              string    2312944         0
Category            string    48              0
Rating              numeric   42              22883
Rating_Count        numeric   38482           22883
Installs            string    22              107
Minimum_Installs    numeric   22              107
Maximum_Installs    numeric   251563          0
Free                nominal   2               0
Price               numeric   1063            0
Currency            string    15              135
Size                string    1657            196
Minimum_Android     string    154             6530
Developer_Id        string    744128          19314
Developer_Website   string    810196          760835
Developer_Email     string    950436          31
Released            string    4158            71053
Last_Updated        string    3918            0
Content_Rating      string    6               0
Privacy_Policy      string    977519          420953
Ad_Supported        nominal   2               0
In_App_Purchases    nominal   2               0
Editors_Choice      nominal   2               0
Scraped_Time        string    67374           0
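
As a quick sanity check on the feature list above, the per-feature missing counts can be summed and compared with the dataset-wide total of 1355506 missing values reported in the properties. The sketch below is only illustrative: the counts are copied by hand from this page, and the names `ROWS`, `MISSING`, and `missing_share` are hypothetical helpers, not part of any OpenML API.

```python
# Row count and per-feature missing counts, copied from this page.
ROWS = 2_312_944

MISSING = {
    "App_Name": 30479, "App_Id": 0, "Category": 0,
    "Rating": 22883, "Rating_Count": 22883,
    "Installs": 107, "Minimum_Installs": 107, "Maximum_Installs": 0,
    "Free": 0, "Price": 0, "Currency": 135, "Size": 196,
    "Minimum_Android": 6530, "Developer_Id": 19314,
    "Developer_Website": 760835, "Developer_Email": 31,
    "Released": 71053, "Last_Updated": 0, "Content_Rating": 0,
    "Privacy_Policy": 420953, "Ad_Supported": 0,
    "In_App_Purchases": 0, "Editors_Choice": 0, "Scraped_Time": 0,
}


def missing_share(count, rows=ROWS):
    """Return the percentage of rows missing a value, rounded to 2 places."""
    return round(100 * count / rows, 2)


# Total number of missing cells across all 24 features.
total_missing = sum(MISSING.values())

# Feature with the most missing values.
worst = max(MISSING, key=MISSING.get)

print(total_missing)                       # 1355506
print(worst, missing_share(MISSING[worst]))
```

Most of the missing cells come from two optional developer fields, Developer_Website and Privacy_Policy, so dropping or imputing those two columns first removes the bulk of the gaps.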

19 properties

Number of instances (rows) of the dataset: 2312944
Number of attributes (columns) of the dataset: 24
Number of distinct values of the target attribute (if it is nominal): not reported
Number of missing values in the dataset: 1355506
Number of instances with at least one value missing: 1049842
Number of numeric attributes: 5
Number of nominal attributes: 4
Number of attributes divided by the number of instances: 0
Percentage of numeric attributes: 20.83
Percentage of instances belonging to the most frequent class: not reported
Percentage of nominal attributes: 16.67
Number of instances belonging to the most frequent class: not reported
Percentage of instances belonging to the least frequent class: not reported
Number of instances belonging to the least frequent class: not reported
Number of binary attributes: 4
Percentage of binary attributes: 16.67
Percentage of instances having missing values: 45.39
Average class difference between consecutive instances: not reported
Percentage of missing values: 2.44
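
The percentage properties above follow directly from the raw counts on this page. The following is a minimal sketch of that arithmetic, assuming each percentage is simply a ratio rounded to two decimals; the variable names and the `pct` helper are hypothetical and chosen only for illustration.

```python
# Raw counts copied from the properties listed on this page.
ROWS = 2_312_944
COLUMNS = 24
MISSING_VALUES = 1_355_506
ROWS_WITH_MISSING = 1_049_842
NUMERIC = 5
NOMINAL = 4
BINARY = 4


def pct(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100 * part / whole, 2)


derived = {
    "numeric_pct": pct(NUMERIC, COLUMNS),
    "nominal_pct": pct(NOMINAL, COLUMNS),
    "binary_pct": pct(BINARY, COLUMNS),
    # Missing cells are counted against every cell in the table.
    "missing_values_pct": pct(MISSING_VALUES, ROWS * COLUMNS),
    "rows_with_missing_pct": pct(ROWS_WITH_MISSING, ROWS),
    # Attributes per instance; tiny for a dataset this tall.
    "dimensionality": round(COLUMNS / ROWS, 2),
}

for name, value in derived.items():
    print(f"{name}: {value}")
```

Note that the percentage of missing values is taken over all cells (rows times columns), which is why it is far smaller than the share of rows that have at least one missing value.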

0 tasks
