Data
Autism-Screening

Autism-Screening

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Data Set Name: Autistic Spectrum Disorder Screening Data for Adult Autistic Spectrum Disorder (ASD) is a neurodevelopment condition associated with significant healthcare costs, and early diagnosis can significantly reduce these. Unfortunately, waiting times for an ASD diagnosis are lengthy and procedures are not cost effective. The economic impact of autism and the increase in the number of ASD cases across the world reveals an urgent need for the development of easily implemented and effective screening methods. Therefore, a time-efficient and accessible ASD screening is imminent to help health professionals and inform individuals whether they should pursue formal clinical diagnosis. The rapid growth in the number of ASD cases worldwide necessitates datasets related to behaviour traits. However, such datasets are rare making it difficult to perform thorough analyses to improve the efficiency, sensitivity, specificity and predictive accuracy of the ASD screening process. Presently, very limited autism datasets associated with clinical or screening are available and most of them are genetic in nature. Hence, we propose a new dataset related to autism screening of adults that contained 20 features to be utilised for further analysis especially in determining influential autistic traits and improving the classification of ASD cases. In this dataset, we record ten behavioural features (AQ-10-Adult) plus ten individuals characteristics that have proved to be effective in detecting the ASD cases from controls in behaviour science. Source: Fadi Fayez Thabtah Department of Digital Technology Manukau Institute of Technology, Auckland, New Zealand fadi.fayezmanukau.ac.nz Data Type: Multivariate OR Univariate OR Sequential OR Time-Series OR Text OR Domain-Theory Nominal / categorical, binary and continuous Task: Classification Attribute Type: Categorical, continuous and binary Area: Medical, health and social science Format Type: Non-Matrix Number of Instances (records in your data set): 704 Number of Attributes (fields within each record): 21 Relevant Information: For Further information about the attributes/feature see below table. Attribute Information: Attribute Type Description Age Number Age in years Gender String Male or Female Ethnicity String List of common ethnicities in text format Born with jaundice Boolean (yes or no) Whether the case was born with jaundice Family member with PDD Boolean (yes or no) Whether any immediate family member has a PDD Who is completing the test String Parent, self, caregiver, medical staff, clinician ,etc. Country of residence String List of countries in text format Used the screening app before Boolean (yes or no) Whether the user has used a screening app Screening Method Type Integer (0,1,2,3) The type of screening methods chosen based on age category (0=toddler, 1=child, 2= adolescent, 3= adult) Question 1 Answer Binary (0, 1) The answer code of the question based on the screening method used Question 2 Answer Binary (0, 1) The answer code of the question based on the screening method used Question 3 Answer Binary (0, 1) The answer code of the question based on the screening method used Question 4 Answer Binary (0, 1) The answer code of the question based on the screening method used Question 5 Answer Binary (0, 1) The answer code of the question based on the screening method used Question 6 Answer Binary (0, 1) The answer code of the question based on the screening method used Question 7 Answer Binary (0, 1) The answer code of the question based on the screening method used Question 8 Answer Binary (0, 1) The answer code of the question based on the screening method used Question 9 Answer Binary (0, 1) The answer code of the question based on the screening method used Question 10 Answer Binary (0, 1) The answer code of the question based on the screening method used Screening Score Integer The final score obtained based on the scoring algorithm of the screening method used. This was computed in an automated manner. Relevant Papers: 1) Tabtah, F. (2017). Autism Spectrum Disorder Screening: Machine Learning Adaptation and DSM-5 Fulfillment. Proceedings of the 1st International Conference on Medical and Health Informatics 2017, pp.1-6. Taichung City, Taiwan, ACM. 2) Thabtah, F. (2017). ASDTests. A mobile app for ASD screening. www.asdtests.com [accessed December 20th, 2017]. 3) Thabtah, F. (2017). Machine Learning in Autistic Spectrum Disorder Behavioural Research: A Review. To Appear in Informatics for Health and Social Care Journal. December, 2017 (in press) Citation Request: Please refer to the Machine Learning Repository's citation policy

21 features

A1_Scorenumeric2 unique values
0 missing
A2_Scorenumeric2 unique values
0 missing
A3_Scorenumeric2 unique values
0 missing
A4_Scorenumeric2 unique values
0 missing
A5_Scorenumeric2 unique values
0 missing
A6_Scorenumeric2 unique values
0 missing
A7_Scorenumeric2 unique values
0 missing
A8_Scorenumeric2 unique values
0 missing
A9_Scorenumeric2 unique values
0 missing
A10_Scorenumeric2 unique values
0 missing
agestring46 unique values
2 missing
genderstring2 unique values
0 missing
ethnicitystring11 unique values
95 missing
jundicestring2 unique values
0 missing
austimstring2 unique values
0 missing
contry_of_resstring67 unique values
0 missing
used_app_beforestring2 unique values
0 missing
resultnumeric11 unique values
0 missing
age_descstring1 unique values
0 missing
relationstring5 unique values
95 missing
Class/ASDstring2 unique values
0 missing

19 properties

704
Number of instances (rows) of the dataset.
21
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
192
Number of missing values in the dataset.
95
Number of instances with at least one value missing.
11
Number of numeric attributes.
0
Number of nominal attributes.
0.03
Number of attributes divided by the number of instances.
52.38
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
13.49
Percentage of instances having missing values.
Average class difference between consecutive instances.
1.3
Percentage of missing values.

0 tasks

Define a new task