USA-Airport-Dataset

Status: active
Format: ARFF
License: CC0: Public Domain
Visibility: public
Uploaded 23-03-2022 by Dustin Carrion
Tags: Computer Systems, Machine Learning
What is it?

This dataset is a record of more than 3.5 million US domestic flights from 1990 to 2009. It was taken from the OpenFlights website, which has a large database of different modes of travel across the globe. I came across this dataset while preparing for a hackathon and thought it should be on Kaggle's dataset list.

What's in it?

Here is some information about the attributes present in the dataset:

  • Origin_airport: Three-letter airport code of the origin airport
  • Destination_airport: Three-letter airport code of the destination airport
  • Origin_city: Origin city name
  • Destination_city: Destination city name
  • Passengers: Number of passengers transported from origin to destination
  • Seats: Number of seats available on flights from origin to destination
  • Flights: Number of flights between origin and destination (there can be multiple records for one month, many with a Flights value of 1)
  • Distance: Distance (to the nearest mile) flown between origin and destination
  • Fly_date: The date (yyyymm) of the flight
  • Origin_population: Origin city's population as reported by the US Census
  • Destination_population: Destination city's population as reported by the US Census

Where did you get it?

I would like to thank the original author of this dataset, Jacob Perkins, for putting in such good effort to collect this data. I will be updating this dataset with attributes that might reveal more information about it. Thanks for stopping by. Peace.
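As a rough illustration of how these attributes might be used together, the sketch below parses a Fly_date value and derives a seat load factor from Passengers and Seats. It assumes that Fly_date values literally follow the yyyymm pattern described above; the helper names `parse_fly_date` and `load_factor` and the sample record are hypothetical and not part of the dataset.

```python
from datetime import date


def parse_fly_date(value: str) -> date:
    """Convert a yyyymm string such as '199001' to the first day of that month."""
    if len(value) != 6 or not value.isdigit():
        raise ValueError(f"expected yyyymm, got {value!r}")
    return date(int(value[:4]), int(value[4:]), 1)


def load_factor(passengers: float, seats: float):
    """Share of available seats that were filled; None when no seats were offered."""
    if seats <= 0:
        return None
    return passengers / seats


# Hypothetical record, invented for illustration (not taken from the dataset).
record = {
    "Origin_airport": "AAA",
    "Destination_airport": "BBB",
    "Passengers": 60,
    "Seats": 80,
    "Flights": 1,
    "Fly_date": "199001",
}

month = parse_fly_date(record["Fly_date"])
factor = load_factor(record["Passengers"], record["Seats"])
print(month.isoformat(), factor)
```

Returning None for records with zero seats is one possible choice; dropping such records before the calculation would work equally well.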

15 features

Origin_airport (string): 683 unique values, 0 missing
Destination_airport (string): 708 unique values, 0 missing
Origin_city (string): 535 unique values, 0 missing
Destination_city (string): 548 unique values, 0 missing
Passengers (numeric): 37484 unique values, 0 missing
Seats (numeric): 41098 unique values, 0 missing
Flights (numeric): 920 unique values, 0 missing
Distance (numeric): 2776 unique values, 0 missing
Fly_date (string): 240 unique values, 0 missing
Origin_population (numeric): 6679 unique values, 0 missing
Destination_population (numeric): 6696 unique values, 0 missing
Org_airport_lat (numeric): 477 unique values, 6954 missing
Org_airport_long (numeric): 477 unique values, 6954 missing
Dest_airport_lat (numeric): 484 unique values, 6807 missing
Dest_airport_long (numeric): 484 unique values, 6807 missing
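The per-feature missing counts above can be cross-checked against the dataset-level figures of 27522 missing values and 12351 instances with at least one missing value. The sketch below does that arithmetic. The final step rests on an assumption that the page does not confirm: that latitude and longitude are always missing together for a given airport, which the matching counts suggest but do not prove.

```python
# Missing-value counts copied from the feature list; all other features have 0.
missing_per_feature = {
    "Org_airport_lat": 6954,
    "Org_airport_long": 6954,
    "Dest_airport_lat": 6807,
    "Dest_airport_long": 6807,
}

# Total missing cells across the dataset.
total_missing = sum(missing_per_feature.values())
print(total_missing)  # 27522, matching the reported total

# Assumption: lat and long are missing together, so each airport side
# contributes one set of affected rows.
rows_missing_origin = missing_per_feature["Org_airport_lat"]
rows_missing_dest = missing_per_feature["Dest_airport_lat"]
rows_with_any_missing = 12351  # reported on the page

# Inclusion-exclusion: |A and B| = |A| + |B| - |A or B|
rows_missing_both = rows_missing_origin + rows_missing_dest - rows_with_any_missing
print(rows_missing_both)  # 1410 under the stated assumption
```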

19 properties

Number of instances (rows) of the dataset: 3606803
Number of attributes (columns) of the dataset: 15
Number of distinct values of the target attribute (if it is nominal): (no value listed)
Number of missing values in the dataset: 27522
Number of instances with at least one value missing: 12351
Number of numeric attributes: 10
Number of nominal attributes: 0
Number of attributes divided by the number of instances: 0
Percentage of numeric attributes: 66.67
Percentage of instances belonging to the most frequent class: (no value listed)
Percentage of nominal attributes: 0
Number of instances belonging to the most frequent class: (no value listed)
Percentage of instances belonging to the least frequent class: (no value listed)
Number of instances belonging to the least frequent class: (no value listed)
Number of binary attributes: 0
Percentage of binary attributes: 0
Percentage of instances having missing values: 0.34
Average class difference between consecutive instances: (no value listed)
Percentage of missing values: 0.05
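The derived properties follow from the raw counts by simple division. As a worked check, the sketch below recomputes them, assuming the page rounds to two decimal places (an assumption based on how the values are displayed).

```python
# Raw counts reported on the page.
instances = 3606803
attributes = 15
numeric_attributes = 10
missing_values = 27522
instances_with_missing = 12351

# Percentage of numeric attributes.
pct_numeric = round(numeric_attributes / attributes * 100, 2)

# Percentage of instances having at least one missing value.
pct_instances_missing = round(instances_with_missing / instances * 100, 2)

# Percentage of missing values among all cells (rows x columns).
pct_missing = round(missing_values / (instances * attributes) * 100, 2)

# Attributes divided by instances; tiny enough to display as 0.
dimensionality = round(attributes / instances, 2)

print(pct_numeric, pct_instances_missing, pct_missing, dimensionality)
```

Each result agrees with the value listed for the corresponding property.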
