Airlines_DepDelay_1M

Status: active | Format: ARFF | License: CC0: Public Domain | Visibility: public | Uploaded 02-01-2023 by Sebastian Fischer
Data Description

The dataset contains information about airline departure delays for the years 1987-2013. It is a subset of the 10M version, which is itself a subset of the original dataset. Unique carrier is the airline; carrier codes and names are listed at https://aspm.faa.gov/aspmhelp/index/ASQP__Carrier_Codes_and_Names.html. Preprocessing has been performed on the time features of the data.

Attribute Description

1. *DepDelay* - target feature, how long the delay was
2. *Month*
3. *DayOfMonth* - 1-31
4. *DayOfWeek* - 1 (Monday) - 7 (Sunday)
5. *UniqueCarrier* - unique carrier code
6. *Origin* - origin airport code
7. *Dest* - destination airport code
8. *Distance* - distance between the airports in miles
9. *CRSDepTime_hour* - scheduled departure time, hour
10. *CRSDepTime_minute* - scheduled departure time, minutes
11. *CRSArrTime_hour* - scheduled arrival time, hour
12. *CRSArrTime_minute* - scheduled arrival time, minutes
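The scheduled departure and arrival times are stored as separate hour and minute features. As a minimal sketch of how such a pair might be recombined, the Python below converts an hour/minute pair into minutes since midnight and computes the clock gap between scheduled departure and arrival. The function names and the example values are hypothetical, not part of the dataset, and the gap ignores time zone differences between airports, so it is not a true flight duration.

```python
MINUTES_PER_DAY = 24 * 60


def minutes_since_midnight(hour: int, minute: int) -> int:
    """Combine an *_hour / *_minute feature pair into a single value."""
    return hour * 60 + minute


def scheduled_clock_gap(dep_hour: int, dep_minute: int,
                        arr_hour: int, arr_minute: int) -> int:
    """Clock gap in minutes between scheduled departure and arrival.

    Assumption: an arrival earlier on the clock than the departure means
    the flight arrives after midnight, so the difference wraps around.
    Time zones are ignored.
    """
    dep = minutes_since_midnight(dep_hour, dep_minute)
    arr = minutes_since_midnight(arr_hour, arr_minute)
    return (arr - dep) % MINUTES_PER_DAY


# Hypothetical row: scheduled departure 23:30, scheduled arrival 01:15.
gap = scheduled_clock_gap(23, 30, 1, 15)
print(gap)
```

Whether a single combined feature works better than the separate hour and minute columns depends on the model; the sketch only shows that the pair carries enough information to rebuild the clock time.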

12 features

DepDelay (target) - numeric, 561 unique values, 0 missing
Month - nominal, 12 unique values, 0 missing
DayOfWeek - nominal, 7 unique values, 0 missing
UniqueCarrier - nominal, 14 unique values, 0 missing
Origin - nominal, 245 unique values, 0 missing
Dest - nominal, 244 unique values, 0 missing
Distance - numeric, 1145 unique values, 0 missing
DayOfMonth - nominal, 31 unique values, 0 missing
CRSDepTime_hour - numeric, 24 unique values, 0 missing
CRSDepTime_minute - numeric, 60 unique values, 0 missing
CRSArrTime_hour - numeric, 25 unique values, 0 missing
CRSArrTime_minute - numeric, 60 unique values, 0 missing

19 properties

Number of instances (rows) of the dataset: 1000000
Number of attributes (columns) of the dataset: 12
Number of distinct values of the target attribute (if it is nominal): 0
Number of missing values in the dataset: 0
Number of instances with at least one value missing: 0
Number of numeric attributes: 6
Number of nominal attributes: 6
Percentage of instances belonging to the most frequent class: (no value shown)
Percentage of nominal attributes: 50
Number of instances belonging to the most frequent class: (no value shown)
Percentage of instances belonging to the least frequent class: (no value shown)
Number of instances belonging to the least frequent class: (no value shown)
Number of binary attributes: 0
Percentage of binary attributes: 0
Percentage of instances having missing values: 0
Average class difference between consecutive instances: -11.6
Percentage of missing values: 0
Number of attributes divided by the number of instances: 0
Percentage of numeric attributes: 50
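
Several of the properties above follow directly from the feature list. The Python sketch below recomputes them from the feature types and the instance count given on this page; the variable names are illustrative. It also suggests, as an assumption about how the page displays values, that the attributes-to-instances ratio appears as 0 because of rounding.

```python
# Feature types as listed in the "12 features" section.
feature_types = {
    "DepDelay": "numeric",
    "Month": "nominal",
    "DayOfWeek": "nominal",
    "UniqueCarrier": "nominal",
    "Origin": "nominal",
    "Dest": "nominal",
    "Distance": "numeric",
    "DayOfMonth": "nominal",
    "CRSDepTime_hour": "numeric",
    "CRSDepTime_minute": "numeric",
    "CRSArrTime_hour": "numeric",
    "CRSArrTime_minute": "numeric",
}
n_instances = 1_000_000

n_attributes = len(feature_types)
n_numeric = sum(t == "numeric" for t in feature_types.values())
n_nominal = sum(t == "nominal" for t in feature_types.values())

pct_numeric = 100 * n_numeric / n_attributes
pct_nominal = 100 * n_nominal / n_attributes

# Attributes divided by instances; very small, so a rounded display shows 0.
dimensionality = n_attributes / n_instances

print(n_attributes, n_numeric, n_nominal)
print(pct_numeric, pct_nominal)
print(dimensionality)
```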

1 task

0 runs - estimation_procedure: 33% Holdout set - target_feature: DepDelay
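
The task uses a 33% holdout set as its estimation procedure. As a rough illustration of what such a split looks like, the Python sketch below shuffles row indices and reserves 33% of them for testing. The function name, the seed, and the shuffling step are assumptions made for the example; the actual split for the task is whatever the task definition specifies and may differ from this.

```python
import random


def holdout_split(n_rows: int, test_fraction: float = 0.33, seed: int = 0):
    """Return (train_indices, test_indices) for a single holdout split.

    Hypothetical helper: shuffles all row indices with a fixed seed and
    reserves `test_fraction` of them as the holdout set.
    """
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    n_test = round(n_rows * test_fraction)
    return indices[n_test:], indices[:n_test]


# Small example; the dataset itself has 1000000 rows.
train_idx, test_idx = holdout_split(1000)
print(len(train_idx), len(test_idx))
```

With the full 1000000 rows and the same fraction, this kind of split would hold out 330000 rows for evaluation with DepDelay as the target.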