Data
OpenML
Help
Sign in
×
Sign in
No account? Join OpenML
Forgot password
×
JavaScript is required to properly view the contents of this page!
OpenML
Explore
Data
Task
Flow
Run
Study
Task type
Measure
People
Help
Blog
Contact
Please cite us
Risk_Level_Classification
ARFF
CSV
JSON
XML
RDF
Risk_Level_Classification
active
ARFF
CC BY 4.0
Visibility: public
Uploaded 19-11-2024 by
Anna Wiewer
0 likes
downloaded by 0 people , 0 total downloads
0 issues
0 downvotes
Add tag
Issue
#Downvotes for this reason
By
Loading wiki
Help us complete this description
Edit
An updated version of the dataset for classifying risk levels in transactions. The target variable 'anomaly' is now treated as nominal with categories: low risk, moderate risk, and high risk.
19 features
anomaly
(target)
nominal
3 unique values
0 missing
hour_of_day
numeric
24 unique values
0 missing
amount
numeric
76771 unique values
0 missing
ip_prefix
numeric
5 unique values
0 missing
login_frequency
numeric
8 unique values
0 missing
session_duration
numeric
140 unique values
0 missing
risk_score
numeric
31 unique values
0 missing
transaction_type_purchase
numeric
2 unique values
0 missing
transaction_type_sale
numeric
2 unique values
0 missing
transaction_type_scam
numeric
2 unique values
0 missing
transaction_type_transfer
numeric
2 unique values
0 missing
location_region_asia
numeric
2 unique values
0 missing
location_region_europe
numeric
2 unique values
0 missing
location_region_north america
numeric
2 unique values
0 missing
location_region_south america
numeric
2 unique values
0 missing
purchase_pattern_high_value
numeric
2 unique values
0 missing
purchase_pattern_random
numeric
2 unique values
0 missing
age_group_new
numeric
2 unique values
0 missing
age_group_veteran
numeric
2 unique values
0 missing
Show all 19 features
19 properties
NumberOfInstances
78600
Number of instances (rows) of the dataset.
NumberOfFeatures
19
Number of attributes (columns) of the dataset.
NumberOfClasses
3
Number of distinct values of the target attribute (if it is nominal).
NumberOfMissingValues
0
Number of missing values in the dataset.
NumberOfInstancesWithMissingValues
0
Number of instances with at least one value missing.
NumberOfNumericFeatures
18
Number of numeric attributes.
NumberOfSymbolicFeatures
1
Number of nominal attributes.
PercentageOfBinaryFeatures
0
Percentage of binary attributes.
PercentageOfInstancesWithMissingValues
0
Percentage of instances having missing values.
AutoCorrelation
0.67
Average class difference between consecutive instances.
PercentageOfMissingValues
0
Percentage of missing values.
Dimensionality
0
Number of attributes divided by the number of instances.
PercentageOfNumericFeatures
94.74
Percentage of numeric attributes.
MajorityClassPercentage
80.78
Percentage of instances belonging to the most frequent class.
PercentageOfSymbolicFeatures
5.26
Percentage of nominal attributes.
MajorityClassSize
63494
Number of instances belonging to the most frequent class.
MinorityClassPercentage
8.26
Percentage of instances belonging to the least frequent class.
MinorityClassSize
6495
Number of instances belonging to the least frequent class.
NumberOfBinaryFeatures
0
Number of binary attributes.
Show all 19 properties
1 tasks
Supervised Classification on Risk_Level_Classification
0 runs
- estimation_procedure: 10-fold Crossvalidation - target_feature: anomaly
Define a new task