{ "data_id": "43963", "name": "CPS1988", "exact_name": "CPS1988", "version": 1, "version_label": null, "description": "**Source**\n\nThis dataset was obtained from the **AER** R package (see citation request below).\n\n**Data description**\n\nCross-section data originating from the March 1988 Current Population Survey by the US Census Bureau.\nThe data is a sample of men aged 18 to 70 with positive annual income greater than USD 50 in 1992, who\nare not self-employed nor working without pay. Wages are deflated by the deflator of Personal\nConsumption Expenditure for 1992.\nA problem with CPS data is that it does not provide actual work experience. It is therefore customary\nto compute experience as age - education - 6 (as was done by Bierens and Ginther, 2001), this\nmay be considered potential experience. As a result, some respondents have negative experience\n\n\n**Attribute Information**\n\n* wage Wage (in dollars per week).\n* education Number of years of education.\n* experience Number of years of potential work experience.\n* ethnicity Factor with levels \"cauc\" and \"afam\" (African-American).\n* smsa Factor. Does the individual reside in a Standard Metropolitan Statistical Area (SMSA)?\n* region Factor with levels \"northeast\", \"midwest\", \"south\", \"west\".\n* parttime Factor. Does the individual work part-time?\n\n\n**Citation Request**\n\nChristian Kleiber and Achim Zeileis (2008). Applied Econometrics with\nR. New York: Springer-Verlag. ISBN 978-0-387-77316-2. URL\nhttps:\/\/CRAN.R-project.org\/package=AER\n\n*Bibtex*\n\n@book{kleiber2008applied,\n title={Applied econometrics with R},\n author={Kleiber, Christian and Zeileis, Achim},\n year={2008},\n publisher={Springer Science \\& Business Media}\n}", "format": "arff", "uploader": "Sebastian Fischer", "uploader_id": 30127, "visibility": "public", "creator": null, "contributor": null, "date": "2022-06-16 12:51:55", "update_comment": null, "last_update": "2022-06-16 12:51:55", "licence": "Public", "status": "active", "error_message": null, "url": "https:\/\/old.openml.org\/data\/download\/22103051\/data.arff", "default_target_attribute": "wage", "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "CPS1988", "This dataset was obtained from the **AER** R package (see citation request below). Cross-section data originating from the March 1988 Current Population Survey by the US Census Bureau. The data is a sample of men aged 18 to 70 with positive annual income greater than USD 50 in 1992, who are not self-employed nor working without pay. Wages are deflated by the deflator of Personal Consumption Expenditure for 1992. A problem with CPS data is that it does not provide actual work experience. It is th " ], "weight": 5 }, "qualities": { "NumberOfInstances": 28155, "NumberOfFeatures": 7, "NumberOfClasses": 0, "NumberOfMissingValues": 0, "NumberOfInstancesWithMissingValues": 0, "NumberOfNumericFeatures": 3, "NumberOfSymbolicFeatures": 4, "PercentageOfBinaryFeatures": 42.857142857142854, "PercentageOfInstancesWithMissingValues": 0, "AutoCorrelation": -389.4422746323893, "PercentageOfMissingValues": 0, "Dimensionality": 0.00024862369028591725, "PercentageOfNumericFeatures": 42.857142857142854, "MajorityClassPercentage": null, "PercentageOfSymbolicFeatures": 57.14285714285714, "MajorityClassSize": null, "MinorityClassPercentage": null, "MinorityClassSize": null, "NumberOfBinaryFeatures": 3 }, "tags": [ { "uploader": "38960", "tag": "Machine Learning" }, { "uploader": "38960", "tag": "Statistics" } ], "features": [ { "name": "wage", "index": "0", "type": "numeric", "distinct": "5970", "missing": "0", "target": "1", "min": "50", "max": "18777", "mean": "604", "stdev": "454" }, { "name": "education", "index": "1", "type": "numeric", "distinct": "19", "missing": "0", "min": "0", "max": "18", "mean": "13", "stdev": "3" }, { "name": "experience", "index": "2", "type": "numeric", "distinct": "67", "missing": "0", "min": "-4", "max": "63", "mean": "18", "stdev": "13" }, { "name": "ethnicity", "index": "3", "type": "nominal", "distinct": "2", "missing": "0", "distr": [] }, { "name": "smsa", "index": "4", "type": "nominal", "distinct": "2", "missing": "0", "distr": [] }, { "name": "region", "index": "5", "type": "nominal", "distinct": "4", "missing": "0", "distr": [] }, { "name": "parttime", "index": "6", "type": "nominal", "distinct": "2", "missing": "0", "distr": [] } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 0, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 0 }