Data
Oilst_Order_Items

Oilst_Order_Items

active ARFF Attribution-NonCommercial-ShareAlike (CC BY-NC-SA) Visibility: public Uploaded 31-05-2024 by Iwo Godzwon
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Description: The olist_order_items_dataset.csv is a comprehensive dataset that features transactional data from the Olist e-commerce platform. The dataset documents items purchased within each order made on the platform, making it an essential asset for analysis on shopping behaviors, seller performance, and logistical operations. Each entry in the dataset is uniquely identified by an order_id, alongside the order_item_id which enables the tracking of multiple items within a single order. The product_id and seller_id fields offer a direct link to specific products and sellers, respectively. Dates by which orders should be shipped are captured under shipping_limit_date, ensuring insights into the logistical timelines. The dataset also quantifies each purchase in terms of item price and the freight value, providing a detailed perspective on pricing and shipping costs associated with each transaction. Attribute Description: - order_id: A unique identifier for each order (e.g., '43bc3119f6a029c0b7293bf06ac67f38'). - order_item_id: Identifies the sequence of items within an order (e.g., 1). - product_id: A unique code for each product (e.g., '00e32638060f6356e6f00749dc466b5c'). - seller_id: A unique identifier for the seller of the product (e.g., '7040e82f899a04d1b434b795a43b4617'). - shipping_limit_date: The latest date and time by when the order should be shipped (e.g., '2017-03-08 04:10:21'). - price: The price of the item (e.g., 120.0). - freight_value: The shipping cost of the item (e.g., 7.39). Use Case: This dataset is invaluable for analyzing e-commerce operations within the Olist platform. It facilitates the understanding of customer purchasing patterns, evaluates seller performance, and streamlines logistic operations by examining shipping deadlines. Furthermore, it can be used for pricing strategy analyses by comparing item prices and freight values. Academics and data scientists can employ this dataset for market research, supply chain analysis, and for developing predictive models on sales trends, shipping delays, and customer satisfaction metrics.

7 features

order_idstring98666 unique values
0 missing
order_item_idnominal21 unique values
0 missing
product_idstring32951 unique values
0 missing
seller_idstring3095 unique values
0 missing
shipping_limit_datestring93318 unique values
0 missing
pricenumeric5968 unique values
0 missing
freight_valuenumeric6999 unique values
0 missing

19 properties

112650
Number of instances (rows) of the dataset.
7
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
2
Number of numeric attributes.
1
Number of nominal attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
0
Percentage of missing values.
Average class difference between consecutive instances.
28.57
Percentage of numeric attributes.
0
Number of attributes divided by the number of instances.
14.29
Percentage of nominal attributes.
Percentage of instances belonging to the most frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.

0 tasks

Define a new task