Data
Swiss-banknote-conterfeit-detection

Swiss-banknote-conterfeit-detection

active ARFF CC BY-NC-SA 4.0 Visibility: public Uploaded 23-03-2022 by Onur Yildirim
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context Will you be able to identify genuine and conterfeit banknotes, even if half of the data is conterfeit? Perfect for testing different outlier detection algorithms. Content The dataset includes information about the shape of the bill, as well as the label. It is made up of 200 banknotes in total, 100 for genuine/conterfeit each. Attributes: -conterfeit: Wether a banknote is conterfeit (1) or genuine (0) -Length: Length of bill (mm) -Left: Width of left edge (mm) -Right: Width of right edge (mm) -Bottom: Bottom margin width (mm) -Top: Top margin width (mm) -Diagonal: Length of diagonal (mm) Original Data Source Flury, B. and Riedwyl, H. (1988). Multivariate Statistics: A practical approach. London: Chapman Hall, Tables 1.1 and 1.2, pp. 5-8. Applications While it might be pretty easy for a classifier to decide wether the banknotes are conterfeit or not, what about methods using outlier detection? Classical methods of outlier detection won't work, since half of the data consist of outliers (conterfeit bills), so more robust methods will be needed.

7 features

conterfeit (target)numeric2 unique values
0 missing
Lengthnumeric21 unique values
0 missing
Leftnumeric17 unique values
0 missing
Rightnumeric19 unique values
0 missing
Bottomnumeric50 unique values
0 missing
Topnumeric35 unique values
0 missing
Diagonalnumeric42 unique values
0 missing

19 properties

200
Number of instances (rows) of the dataset.
7
Number of attributes (columns) of the dataset.
0
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
7
Number of numeric attributes.
0
Number of nominal attributes.
0.04
Number of attributes divided by the number of instances.
100
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
0.99
Average class difference between consecutive instances.
0
Percentage of missing values.

0 tasks

Define a new task