Data
USA-Housing-Listings

USA-Housing-Listings

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Onur Yildirim
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context Craigslist is the world's largest collection of privately sold housing options, yet it's very difficult to collect all of them in the same place. I built this dataset as a means in by which to perform experimental analysis on the United States states as a whole instead of isolated urban housing markets. Content This data is scraped every few months, it contains most all relevant information that Craigslist provides on retail sales

22 features

idnumeric384977 unique values
0 missing
urlstring384977 unique values
0 missing
regionstring404 unique values
0 missing
region_urlstring413 unique values
0 missing
pricenumeric3961 unique values
0 missing
typestring12 unique values
0 missing
sqfeetnumeric3277 unique values
0 missing
bedsnumeric11 unique values
0 missing
bathsnumeric20 unique values
0 missing
cats_allowednumeric2 unique values
0 missing
dogs_allowednumeric2 unique values
0 missing
smoking_allowednumeric2 unique values
0 missing
wheelchair_accessnumeric2 unique values
0 missing
electric_vehicle_chargenumeric2 unique values
0 missing
comes_furnishednumeric2 unique values
0 missing
laundry_optionsstring5 unique values
79026 missing
parking_optionsstring7 unique values
140687 missing
image_urlstring181068 unique values
0 missing
descriptionstring280644 unique values
2 missing
latnumeric56772 unique values
1918 missing
longnumeric54035 unique values
1918 missing
statestring51 unique values
0 missing

19 properties

384977
Number of instances (rows) of the dataset.
22
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
223551
Number of missing values in the dataset.
149008
Number of instances with at least one value missing.
13
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
59.09
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
38.71
Percentage of instances having missing values.
Average class difference between consecutive instances.
2.64
Percentage of missing values.

0 tasks

Define a new task