Data
U.S.-Airbnb-Open-Data

U.S.-Airbnb-Open-Data

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Onur Yildirim
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Author: Kritik Seth ### Context Since its inception in 2008, Airbnb has disrupted the traditional hospitality industry as more travellers decide to use Airbnb as their primary means of accommodation. Airbnb offers travellers a more unique and personalized way of accommodation and experience. ### Content This dataset has columns describing features such as host id, hostname, listing id, listing name, latitude and longitude of listing, the neighbourhood, price, room type, minimum number of nights, number of reviews, last review date, reviews per month, availability, host listings and city. ### Acknowledgements This dataset is a compilation of multiple datasets found on Inside Airbnb. ### Inspiration * Can we predict the price of each house in different regions? * Can we describe a region using the names of listings in that region? * What can we learn about different regions from the data? * Based on different factors is it possible to recommend a title to the host for his/her listing? * Can we estimate the popularity of a listing based on given features?

17 features

idnumeric226029 unique values
0 missing
namestring217702 unique values
481 missing
host_idnumeric130425 unique values
0 missing
host_namestring30305 unique values
350 missing
neighbourhood_groupstring34 unique values
115845 missing
neighbourhoodstring1450 unique values
0 missing
latitudenumeric145091 unique values
0 missing
longitudenumeric145243 unique values
0 missing
room_typestring4 unique values
0 missing
pricenumeric1975 unique values
0 missing
minimum_nightsnumeric169 unique values
0 missing
number_of_reviewsnumeric660 unique values
0 missing
last_reviewstring2377 unique values
48602 missing
reviews_per_monthnumeric1242 unique values
48602 missing
calculated_host_listings_countnumeric142 unique values
0 missing
availability_365numeric366 unique values
0 missing
citystring28 unique values
0 missing

19 properties

226030
Number of instances (rows) of the dataset.
17
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
213880
Number of missing values in the dataset.
141316
Number of instances with at least one value missing.
10
Number of numeric attributes.
0
Number of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
62.52
Percentage of instances having missing values.
Average class difference between consecutive instances.
5.57
Percentage of missing values.
0
Number of attributes divided by the number of instances.
58.82
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.

0 tasks

Define a new task