Data
OpenML
Help
Sign in
×
Sign in
No account? Join OpenML
Forgot password
×
JavaScript is required to properly view the contents of this page!
OpenML
Explore
Data
Task
Flow
Run
Study
Task type
Measure
People
Help
Blog
Contact
Please cite us
sick
ARFF
CSV
JSON
XML
RDF
sick
active
ARFF
Publicly available
Visibility: public
Uploaded 27-01-2023 by
Young Lee
0 likes
downloaded by 0 people , 0 total downloads
0 issues
0 downvotes
Computational Universe
Computer Systems
Add tag
Issue
#Downvotes for this reason
By
Loading wiki
Help us complete this description
Edit
Thyroid disease records supplied by the Garavan Institute and J. Ross Quinlan, New South Wales Institute, Syndney, Australia. 1987.
23 features
class
(target)
string
2 unique values
0 missing
age
numeric
93 unique values
0 missing
TSH
numeric
280 unique values
0 missing
TT4
numeric
235 unique values
0 missing
T4U
numeric
144 unique values
0 missing
FTI
numeric
225 unique values
0 missing
sex
nominal
2 unique values
0 missing
on_thyroxine
nominal
2 unique values
0 missing
query_on_thyroxine
nominal
2 unique values
0 missing
on_antithyroid_medication
nominal
2 unique values
0 missing
sick
nominal
2 unique values
0 missing
pregnant
nominal
2 unique values
0 missing
thyroid_surgery
nominal
2 unique values
0 missing
I131_treatment
nominal
2 unique values
0 missing
query_hypothyroid
nominal
2 unique values
0 missing
query_hyperthyroid
nominal
2 unique values
0 missing
lithium
nominal
2 unique values
0 missing
goitre
nominal
2 unique values
0 missing
tumor
nominal
2 unique values
0 missing
hypopituitary
nominal
2 unique values
0 missing
psych
nominal
2 unique values
0 missing
T3_measured
nominal
2 unique values
0 missing
referral_source
nominal
5 unique values
0 missing
Show all 23 features
19 properties
NumberOfInstances
3103
Number of instances (rows) of the dataset.
NumberOfFeatures
23
Number of attributes (columns) of the dataset.
NumberOfClasses
2
Number of distinct values of the target attribute (if it is nominal).
NumberOfMissingValues
0
Number of missing values in the dataset.
NumberOfInstancesWithMissingValues
0
Number of instances with at least one value missing.
NumberOfNumericFeatures
5
Number of numeric attributes.
NumberOfSymbolicFeatures
17
Number of nominal attributes.
MajorityClassSize
2888
Number of instances belonging to the most frequent class.
MinorityClassPercentage
6.93
Percentage of instances belonging to the least frequent class.
MinorityClassSize
215
Number of instances belonging to the least frequent class.
NumberOfBinaryFeatures
16
Number of binary attributes.
PercentageOfBinaryFeatures
69.57
Percentage of binary attributes.
PercentageOfInstancesWithMissingValues
0
Percentage of instances having missing values.
AutoCorrelation
1
Average class difference between consecutive instances.
PercentageOfMissingValues
0
Percentage of missing values.
Dimensionality
0.01
Number of attributes divided by the number of instances.
PercentageOfNumericFeatures
21.74
Percentage of numeric attributes.
MajorityClassPercentage
93.07
Percentage of instances belonging to the most frequent class.
PercentageOfSymbolicFeatures
73.91
Percentage of nominal attributes.
Show all 19 properties
1 tasks
Supervised Classification on sick
0 runs
- estimation_procedure: 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: class
Define a new task