Data
online-shoppers-intention

online-shoppers-intention

active ARFF Publicly available Visibility: public Uploaded 05-06-2023 by Matthias Feurer
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
## Source: 1. C. Okan Sakar Department of Computer Engineering, Faculty of Engineering and Natural Sciences, Bahcesehir University, 34349 Besiktas, Istanbul, Turkey 2. Yomi Kastro Inveon Information Technologies Consultancy and Trade, 34335 Istanbul, Turkey ## Data Set Information: The dataset consists of feature vectors belonging to 12,330 sessions. The dataset was formed so that each session would belong to a different user in a 1-year period to avoid any tendency to a specific campaign, special day, user profile, or period. ## Attribute Information: The dataset consists of 10 numerical and 8 categorical attributes. The 'Revenue' attribute can be used as the class label. "Administrative", "Administrative Duration", "Informational", "Informational Duration", "Product Related" and "Product Related Duration" represent the number of different types of pages visited by the visitor in that session and total time spent in each of these page categories. The values of these features are derived from the URL information of the pages visited by the user and updated in real time when a user takes an action, e.g. moving from one page to another. The "Bounce Rate", "Exit Rate" and "Page Value" features represent the metrics measured by "Google Analytics" for each page in the e-commerce site. The value of "Bounce Rate" feature for a web page refers to the percentage of visitors who enter the site from that page and then leave ("bounce") without triggering any other requests to the analytics server during that session. The value of "Exit Rate" feature for a specific web page is calculated as for all pageviews to the page, the percentage that were the last in the session. The "Page Value" feature represents the average value for a web page that a user visited before completing an e-commerce transaction. The "Special Day" feature indicates the closeness of the site visiting time to a specific special day (e.g. Mother's Day, Valentine's Day) in which the sessions are more likely to be finalized with transaction. The value of this attribute is determined by considering the dynamics of e-commerce such as the duration between the order date and delivery date. For example, for Valentina's day, this value takes a nonzero value between February 2 and February 12, zero before and after this date unless it is close to another special day, and its maximum value of 1 on February 8. The dataset also includes operating system, browser, region, traffic type, visitor type as returning or new visitor, a Boolean value indicating whether the date of the visit is weekend, and month of the year. ## Relevant Papers: Sakar, C.O., Polat, S.O., Katircioglu, M. et al. Neural Comput & Applic (2018). [Web Link] ## Note * Compared to v1 this one contains correct variable coding.

18 features

Revenue (target)nominal2 unique values
0 missing
Administrativenumeric27 unique values
0 missing
Administrative_Durationnumeric3335 unique values
0 missing
Informationalnumeric17 unique values
0 missing
Informational_Durationnumeric1258 unique values
0 missing
ProductRelatednumeric311 unique values
0 missing
ProductRelated_Durationnumeric9551 unique values
0 missing
BounceRatesnumeric1872 unique values
0 missing
ExitRatesnumeric4777 unique values
0 missing
PageValuesnumeric2704 unique values
0 missing
SpecialDaynumeric6 unique values
0 missing
Monthnominal10 unique values
0 missing
OperatingSystemsnominal8 unique values
0 missing
Browsernominal13 unique values
0 missing
Regionnominal9 unique values
0 missing
TrafficTypenominal20 unique values
0 missing
VisitorTypenominal3 unique values
0 missing
Weekendnominal2 unique values
0 missing

19 properties

12330
Number of instances (rows) of the dataset.
18
Number of attributes (columns) of the dataset.
2
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
10
Number of numeric attributes.
8
Number of nominal attributes.
11.11
Percentage of binary attributes.
0
Percentage of instances having missing values.
0.74
Average class difference between consecutive instances.
0
Percentage of missing values.
0
Number of attributes divided by the number of instances.
55.56
Percentage of numeric attributes.
84.53
Percentage of instances belonging to the most frequent class.
44.44
Percentage of nominal attributes.
10422
Number of instances belonging to the most frequent class.
15.47
Percentage of instances belonging to the least frequent class.
1908
Number of instances belonging to the least frequent class.
2
Number of binary attributes.

2 tasks

0 runs - estimation_procedure: 4-fold Crossvalidation - target_feature: Revenue
0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: Revenue
Define a new task