OpenML
Melbourne-Housing-Snapshot

Melbourne-Housing-Snapshot

active ARFF CC BY-NC-SA 4.0 Visibility: public Uploaded 23-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context Melbourne real estate is BOOMING. Can you find the insight or predict the next big trend to become a real estate mogul or even harder, to snap up a reasonably priced 2-bedroom unit? Content This is a snapshot of a dataset created by Tony Pino. It was scraped from publicly available results posted every week from Domain.com.au. He cleaned it well, and now it's up to you to make data analysis magic. The dataset includes Address, Type of Real estate, Suburb, Method of Selling, Rooms, Price, Real Estate Agent, Date of Sale and distance from C.B.D. Notes on Specific Variables Rooms: Number of rooms Price: Price in dollars Method: S - property sold; SP - property sold prior; PI - property passed in; PN - sold prior not disclosed; SN - sold not disclosed; NB - no bid; VB - vendor bid; W - withdrawn prior to auction; SA - sold after auction; SS - sold after auction price not disclosed. N/A - price or highest bid not available. Type: br - bedroom(s); h - house,cottage,villa, semi,terrace; u - unit, duplex; t - townhouse; dev site - development site; o res - other residential. SellerG: Real Estate Agent Date: Date sold Distance: Distance from CBD Regionname: General Region (West, North West, North, North east etc) Propertycount: Number of properties that exist in the suburb. Bedroom2 : Scraped of Bedrooms (from different source) Bathroom: Number of Bathrooms Car: Number of carspots Landsize: Land Size BuildingArea: Building Size CouncilArea: Governing council for the area Acknowledgements This is intended as a static (unchanging) snapshot of https://www.kaggle.com/anthonypino/melbourne-housing-market. It was created in September 2017. Additionally, homes with no Price have been removed.

21 features

Suburbstring314 unique values
0 missing
Addressstring13378 unique values
0 missing
Roomsnumeric9 unique values
0 missing
Typestring3 unique values
0 missing
Pricenumeric2204 unique values
0 missing
Methodstring5 unique values
0 missing
SellerGstring268 unique values
0 missing
Datestring58 unique values
0 missing
Distancenumeric202 unique values
0 missing
Postcodenumeric198 unique values
0 missing
Bedroom2numeric12 unique values
0 missing
Bathroomnumeric9 unique values
0 missing
Carnumeric11 unique values
62 missing
Landsizenumeric1448 unique values
0 missing
BuildingAreanumeric602 unique values
6450 missing
YearBuiltnumeric144 unique values
5375 missing
CouncilAreastring33 unique values
1369 missing
Lattitudenumeric6503 unique values
0 missing
Longtitudenumeric7063 unique values
0 missing
Regionnamestring8 unique values
0 missing
Propertycountnumeric311 unique values
0 missing

19 properties

13580
Number of instances (rows) of the dataset.
21
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
13256
Number of missing values in the dataset.
7384
Number of instances with at least one value missing.
13
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
61.9
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
54.37
Percentage of instances having missing values.
Average class difference between consecutive instances.
4.65
Percentage of missing values.

0 tasks

Define a new task