yeast
Status: active
Format: ARFF
License: Public Domain (CC0)
Visibility: public
Uploaded 09-02-2024 by Juan Alfaro
The task for this dataset is to predict the cellular localization sites of proteins.
18 features
L1 (target): nominal, 8 unique values, 0 missing
L2 (target): nominal, 5 unique values, 0 missing
L3 (target): nominal, 10 unique values, 0 missing
L4 (target): nominal, 9 unique values, 0 missing
L5 (target): nominal, 7 unique values, 0 missing
L6 (target): nominal, 9 unique values, 0 missing
L7 (target): nominal, 7 unique values, 0 missing
L8 (target): nominal, 9 unique values, 0 missing
L9 (target): nominal, 7 unique values, 0 missing
L10 (target): nominal, 7 unique values, 0 missing
A1: numeric, 81 unique values, 0 missing
A2: numeric, 79 unique values, 0 missing
A3: numeric, 53 unique values, 0 missing
A4: numeric, 78 unique values, 0 missing
A5: numeric, 2 unique values, 0 missing
A6: numeric, 3 unique values, 0 missing
A7: numeric, 48 unique values, 0 missing
A8: numeric, 68 unique values, 0 missing
19 properties
NumberOfInstances: 1484. Number of instances (rows) of the dataset.
NumberOfFeatures: 18. Number of attributes (columns) of the dataset.
NumberOfClasses: (no value listed). Number of distinct values of the target attribute (if it is nominal).
NumberOfMissingValues: 0. Number of missing values in the dataset.
NumberOfInstancesWithMissingValues: 0. Number of instances with at least one value missing.
NumberOfNumericFeatures: 8. Number of numeric attributes.
NumberOfSymbolicFeatures: 10. Number of nominal attributes.
MajorityClassPercentage: (no value listed). Percentage of instances belonging to the most frequent class.
PercentageOfSymbolicFeatures: 55.56. Percentage of nominal attributes.
MajorityClassSize: (no value listed). Number of instances belonging to the most frequent class.
MinorityClassPercentage: (no value listed). Percentage of instances belonging to the least frequent class.
MinorityClassSize: (no value listed). Number of instances belonging to the least frequent class.
NumberOfBinaryFeatures: 0. Number of binary attributes.
PercentageOfBinaryFeatures: 0. Percentage of binary attributes.
PercentageOfInstancesWithMissingValues: 0. Percentage of instances having missing values.
AutoCorrelation: (no value listed). Average class difference between consecutive instances.
PercentageOfMissingValues: 0. Percentage of missing values.
Dimensionality: 0.01. Number of attributes divided by the number of instances.
PercentageOfNumericFeatures: 44.44. Percentage of numeric attributes.
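The derived properties in this list follow directly from the raw counts. As a rough check, the sketch below recomputes them in Python from the counts reported on this page. The variable names and the `percentage` helper are illustrative assumptions, not part of any OpenML API, and rounding to two decimals is assumed from how the values are displayed.

```python
# Raw counts as reported in the property list on this page.
n_instances = 1484
n_features = 18
n_numeric = 8
n_symbolic = 10


def percentage(part, whole):
    """Hypothetical helper: share of `part` in `whole`, as a percentage
    rounded to two decimals (rounding assumed from the displayed values)."""
    return round(100.0 * part / whole, 2)


# Percentage of nominal and of numeric attributes among all attributes.
pct_symbolic = percentage(n_symbolic, n_features)
pct_numeric = percentage(n_numeric, n_features)

# Dimensionality: number of attributes divided by number of instances.
dimensionality = round(n_features / n_instances, 2)

print(f"PercentageOfSymbolicFeatures: {pct_symbolic}")
print(f"PercentageOfNumericFeatures: {pct_numeric}")
print(f"Dimensionality: {dimensionality}")
```

Under these assumptions the recomputed values agree with the listed 55.56, 44.44, and 0.01. The unrounded dimensionality is about 0.0121, so the displayed 0.01 hides some precision.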