{ "data_id": "43737", "name": "Aircraft-Pricing-Dataset", "exact_name": "Aircraft-Pricing-Dataset", "version": 1, "version_label": "v1.0", "description": "For a more comprehensive dataset with many more features check out the \"Yacht\/Motorboat Pricing Data (10,000+ listings)\" dataset.\nLink below:\nhttps:\/\/www.kaggle.com\/artemkorottchenko\/large-boatyacht-pricing-dataset\nContext\nWhat are the most important features in determining the price of a new or used aircraft? Is it the aircraft type? Year? Manufacturer? Other characteristics? \nThis is one of many questions regarding the used\/new aircraft markets I hope to answer with this dataset. \nThe dataset contains over 2000 aircraft that are for sale around the world. The data was scraped during July of 2020.\nContent\nThe data was scraped from various websites using the Scrapy framework for Python. \nScrapy script:\nhttps:\/\/github.com\/akorott\/Aircraft-Scrapy-Script.git\nContent scraped:\n\nNew\/Used\nPrice \nCurrency (USD, EUR, GBP)\nCategory\nYear\nMake\nModel\nLocation\nSerial number\nRegistration number\nTotal hours\nEngine 1 hours \nEngine 2 hours \nProp 1 hours\nProp 2 hours\nTotal Seats\nFlight Rules\nNational Origin\n\nKeep in mind that the data was scraped from 2 different sources. Some of the data (New\/Used, Engine 1 hours, Engine 2 hours, Prop 1 hours, Prop 2 hours, Total Seats, Flight Rules) was only easily accessible on one source, thus is missing for part of the dataset. \nFAQ\nFlight Rules: Visual Flight Rules (VFR) VS Instrument Flight Rules (IFR). In a nutshell, an aircraft equipped with IFR is one where a pilot can fully navigate an aircraft using instruments in the cockpit. Any aircraft flying over 18,000 feet, by law, has to be equipped with IFR equipment.\nBTH - Beyond the Horizon - according to my research, BTH means that an aircraft is equipped with a radar, but doesn't fully meet IFR criteria.\nVFR - (https:\/\/en.wikipedia.org\/wiki\/Visual_flight_rules)\nIFR - (https:\/\/en.wikipedia.org\/wiki\/Instrument_flight_rules)\nSome of the acronyms used within total hours, engine 1, engine 2, prop 1, prop 2 columns:\nSMOH - Since major overhaul\nSNEW - Since new\nSPOH - Since prop overhaul\nSFOH - Since factory overhaul (more reliable)\nSOH - Since overhaul\nSTOH - Since top overhaul\nSFRM - Since factory re-manufactured\nThank you for checking out this dataset and happy kaggling!", "format": "arff", "uploader": "Dustin Carrion", "uploader_id": 30123, "visibility": "public", "creator": null, "contributor": null, "date": "2022-03-24 07:44:15", "update_comment": null, "last_update": "2022-03-24 07:44:15", "licence": "CC0: Public Domain", "status": "active", "error_message": null, "url": "https:\/\/www.openml.org\/data\/download\/22102562\/dataset", "kaggle_url": null, "default_target_attribute": null, "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "Aircraft-Pricing-Dataset", "For a more comprehensive dataset with many more features check out the \"Yacht\/Motorboat Pricing Data (10,000+ listings)\" dataset. Link below: https:\/\/www.kaggle.com\/artemkorottchenko\/large-boatyacht-pricing-dataset Context What are the most important features in determining the price of a new or used aircraft? Is it the aircraft type? Year? Manufacturer? Other characteristics? This is one of many questions regarding the used\/new aircraft markets I hope to answer with this dataset. The dataset co " ], "weight": 5 }, "qualities": { "NumberOfInstances": 2530, "NumberOfFeatures": 18, "NumberOfClasses": null, "NumberOfMissingValues": 11212, "NumberOfInstancesWithMissingValues": 2397, "NumberOfNumericFeatures": 1, "NumberOfSymbolicFeatures": 0, "PercentageOfBinaryFeatures": 0, "PercentageOfInstancesWithMissingValues": 94.74308300395256, "PercentageOfMissingValues": 24.620114185331577, "AutoCorrelation": null, "PercentageOfNumericFeatures": 5.555555555555555, "Dimensionality": 0.0071146245059288534, "PercentageOfSymbolicFeatures": 0, "MajorityClassPercentage": null, "MajorityClassSize": null, "MinorityClassPercentage": null, "MinorityClassSize": null, "NumberOfBinaryFeatures": 0 }, "tags": [], "features": [ { "name": "Condition", "index": "0", "type": "string", "distinct": "3", "missing": "769" }, { "name": "Price", "index": "1", "type": "string", "distinct": "1148", "missing": "0" }, { "name": "Currency", "index": "2", "type": "string", "distinct": "5", "missing": "552" }, { "name": "Category", "index": "3", "type": "string", "distinct": "13", "missing": "0" }, { "name": "Year", "index": "4", "type": "string", "distinct": "94", "missing": "0" }, { "name": "Make", "index": "5", "type": "string", "distinct": "187", "missing": "0" }, { "name": "Model", "index": "6", "type": "string", "distinct": "1020", "missing": "0" }, { "name": "Location", "index": "7", "type": "string", "distinct": "1007", "missing": "12" }, { "name": "S\/N", "index": "8", "type": "string", "distinct": "1824", "missing": "3" }, { "name": "REG", "index": "9", "type": "string", "distinct": "2029", "missing": "2" }, { "name": "Total_Hours", "index": "10", "type": "string", "distinct": "1832", "missing": "97" }, { "name": "Engine_1_Hours", "index": "11", "type": "string", "distinct": "1252", "missing": "948" }, { "name": "Engine_2_Hours", "index": "12", "type": "string", "distinct": "352", "missing": "2149" }, { "name": "Prop_1_Hours", "index": "13", "type": "string", "distinct": "913", "missing": "1365" }, { "name": "Prop_2_Hours", "index": "14", "type": "string", "distinct": "243", "missing": "2267" }, { "name": "Total_Seats", "index": "15", "type": "numeric", "distinct": "15", "missing": "1378", "min": "1", "max": "56", "mean": "4", "stdev": "3" }, { "name": "Flight_Rules", "index": "16", "type": "string", "distinct": "3", "missing": "1662" }, { "name": "National_Origin", "index": "17", "type": "string", "distinct": "26", "missing": "8" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 0, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 0 }