{ "data_id": "1057", "name": "usp05-ft", "exact_name": "usp05-ft", "version": 1, "version_label": null, "description": "**Author**: \n**Source**: Unknown - Date unknown \n**Please cite**: \n\n%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%\nThis is a PROMISE Software Engineering Repository data set made publicly\navailable in order to encourage repeatable, verifiable, refutable, and\/or\nimprovable predictive models of software engineering.\n\nIf you publish material based on PROMISE data sets then, please\nfollow the acknowledgment guidelines posted on the PROMISE repository\nweb page http:\/\/promisedata.org\/repository .\n%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%\n\n%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%\n(c) 2007 Jingzhou Li\n(jingli@ucalgary.ca)\nThis data set is distributed under the\nCreative Commons Attribution-Share Alike 3.0 License\nhttp:\/\/creativecommons.org\/licenses\/by-sa\/3.0\/\n\nYou are free:\n\n* to Share -- copy, distribute and transmit the work\n* to Remix -- to adapt the work\n\nUnder the following conditions:\n\nAttribution. You must attribute the work in the manner specified by\nthe author or licensor (but not in any way that suggests that they endorse\nyou or your use of the work).\n\nShare Alike. If you alter, transform, or build upon this work, you\nmay distribute the resulting work only under the same, similar or a\ncompatible license.\n\n* For any reuse or distribution, you must make clear to others the\nlicense terms of this work.\n* Any of the above conditions can be waived if you get permission from\nthe copyright holder.\n* Apart from the remix rights granted under this license, nothing in\nthis license impairs or restricts the author's moral rights.\n\n1. Title: USP05-FT: Software effort estimation at feature level\n\n2. Source Information\n-- Donor: Jingzhou Li (jingli@ucalgary.ca), Guenther Ruhe (ruhe@ucalgary.ca)\ncomputer science department\nUniversity of Calgary, Canada\n(403) 210-5440\n-- Date: December 2005\n\n3. Past Usage:\n[1]. J.Z. Li, G. Ruhe, A. Al-Emran, M. M. Ritcher, \"A Flexible Method for Effort Estimation by Analogy\", Empirical Software Engineering, Vol. 12, No. 1, 2007, pp 65-106.\n[2]. J.Z. Li, G. Ruhe, \"A Comparative Study of Attribute Weighting Heuristics for Effort Estimation by Analogy\", Proceedings of the ACM-IEEE International Symposium on Empirical Software Engineering (ISESE'06), September 2006, Brazil.\n\n4. Relevant Information:\n-- This data set was part of USP05 that was collected from university student projects about Web and client\/server applications\n-- The detailed description of the whole data set can be found in reference [1].\n\n5. Number of Instances: 76 (features)\n\n6. Number of Attributes: 15 (including ID, Effort is the actual effort)\n\n7. Attribute Information:\n1. ID: Three digit Object ID,\n2. Effort: Actual effort in hours expended on tasks related to implementing the object by all participating persons.\n3. IntComplx: Complexity of Internal Calculation (1-VeryLow, 2-Low, 3-Medium, 4-High, 5-VeryHigh )\n4. DataFile: Number of Data Files\/Database Tables Accessed (Positive integer)\n5. DataEn: Number of Data Entry Items (Positive integer)\n6. DataOut: Number of Data Output Items (Positive integer)\n7. UFP: Unadjusted Function Point Count (Positive integer)\n8. Lang: Language Used (C++, Java, VB, Java Script, VB Script, SQL, Php, Perl, Asp, Html, XML, Others)\n9. Tools: Development Tools and Platforms (VJ++, VB, Delphi, VisualCafe, JUnit, PowerBuilder, BorlandC++, Others)\n10. ToolExpr: Language and Tool Experience Level (Range of number of months of experience, e.g. [2, 5] for 2 to 5 months, as the minimum experience level is 2 and 5 the maximum in the team)\n11. AppExpr: Applications Experience Level (1-VeryLow, 2-Low, 3-Medium, 4-High, 5-VeryHigh)\n12. TeamSize: Team size for implementing the object (Range: [a, b], min-max number of persons, e.g. [2, 5])\n13. DBMS: Database Systems (Oracle, Access, SQLServer, MySQL, Others)\n14. Method: Methodology (OO, SA, SD, RAD, JAD, MVC, Others)\n15. AppType: Type of System\/Application Architecture (B\/S, C\/S, BC\/S, Centered, Other)\n\n\n8. Missing Attribute Values: 37\n\n9. Data", "format": "ARFF", "uploader": "Joaquin Vanschoren", "uploader_id": 2, "visibility": "public", "creator": "Jingzhou Li, Guenther Ruhe", "contributor": null, "date": "2014-10-06 23:57:26", "update_comment": null, "last_update": "2014-10-06 23:57:26", "licence": "Public", "status": "active", "error_message": null, "url": "https:\/\/www.openml.org\/data\/download\/53940\/usp05-ft.arff", "default_target_attribute": "AppType", "row_id_attribute": null, "ignore_attribute": null, "runs": 485, "suggest": { "input": [ "usp05-ft", "%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable, verifiable, refutable, and\/or improvable predictive models of software engineering. If you publish material based on PROMISE data sets then, please follow the acknowledgment guidelines posted on the PROMISE repository web page http:\/\/promisedata.org\/repository . %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% " ], "weight": 5 }, "qualities": { "NumberOfInstances": 76, "NumberOfFeatures": 15, "NumberOfClasses": 7, "NumberOfMissingValues": 37, "NumberOfInstancesWithMissingValues": 18, "NumberOfNumericFeatures": 8, "NumberOfSymbolicFeatures": 7, "REPTreeDepth3Kappa": 0.3641304347826087, "DecisionStumpKappa": 0.7545056015586946, "MaxMeansOfNumericAtts": 344.27631578947387, "MinMutualInformation": 0.31169273672267, "Quartile2SkewnessOfNumericAtts": 2.2663796688595843, "RandomTreeDepth1AUC": 0.8684426113739494, "Dimensionality": 0.19736842105263158, "MaxMutualInformation": 0.98088513369066, "MinNominalAttDistinctValues": 4, "PercentageOfBinaryFeatures": 0, "Quartile2StdDevOfNumericAtts": 7.476650038293158, "RandomTreeDepth1ErrRate": 0.25, "EquivalentNumberOfAtts": 1.568333440911103, "MaxNominalAttDistinctValues": 16, "MinSkewnessOfNumericAtts": 0.44696291238496094, "PercentageOfInstancesWithMissingValues": 23.684210526315788, "Quartile3AttributeEntropy": 2.7317077768350715, "RandomTreeDepth1Kappa": 0.2918032786885247, "J48.00001.AUC": 0.5521338115248637, "MaxSkewnessOfNumericAtts": 6.759226792982435, "MinStdDevOfNumericAtts": 1.2532064138798018, "PercentageOfMissingValues": 3.245614035087719, "Quartile3KurtosisOfNumericAtts": 23.587073888364475, "AutoCorrelation": 0.7866666666666666, "RandomTreeDepth2AUC": 0.8684426113739494, "J48.00001.ErrRate": 0.20833333333333334, "MaxStdDevOfNumericAtts": 215.85214684650606, "MinorityClassPercentage": 1.3157894736842104, "PercentageOfNumericFeatures": 53.333333333333336, "Quartile3MeansOfNumericAtts": 16.099573257467988, "CfsSubsetEval_DecisionStumpAUC": 0.5521338115248637, "RandomTreeDepth2ErrRate": 0.25, "J48.00001.Kappa": 0.1842900302114801, "MeanAttributeEntropy": 2.3449282850231135, "MinorityClassSize": 1, "PercentageOfSymbolicFeatures": 46.666666666666664, "Quartile3MutualInformation": 0.97056886831944, "CfsSubsetEval_DecisionStumpErrRate": 0.20833333333333334, "RandomTreeDepth2Kappa": 0.2918032786885247, "J48.0001.AUC": 0.5521338115248637, "MeanKurtosisOfNumericAtts": 13.264152133573145, "NaiveBayesAUC": 0.7499574673797955, "Quartile1AttributeEntropy": 1.8737334423779577, "Quartile3SkewnessOfNumericAtts": 4.5127936639029835, "CfsSubsetEval_DecisionStumpKappa": 0.1842900302114801, "RandomTreeDepth3AUC": 0.8684426113739494, "J48.0001.ErrRate": 0.20833333333333334, "MeanMeansOfNumericAtts": 48.646870554765314, "NaiveBayesErrRate": 0.18055555555555555, "Quartile1KurtosisOfNumericAtts": 0.25730699246766375, "Quartile3StdDevOfNumericAtts": 42.88561986021658, "CfsSubsetEval_NaiveBayesAUC": 0.5521338115248637, "RandomTreeDepth3ErrRate": 0.25, "J48.0001.Kappa": 0.1842900302114801, "MeanMutualInformation": 0.7563921733659517, "NaiveBayesKappa": 0.45038167938931284, "Quartile1MeansOfNumericAtts": 2.131845661450924, "REPTreeDepth1AUC": 0.8568123862707542, "CfsSubsetEval_NaiveBayesErrRate": 0.20833333333333334, "RandomTreeDepth3Kappa": 0.2918032786885247, "J48.001.AUC": 0.5521338115248637, "MeanNoiseToSignalRatio": 2.1001487952845443, "NumberOfBinaryFeatures": 0, "Quartile1MutualInformation": 0.5540476678910751, "REPTreeDepth1ErrRate": 0.18055555555555555, "CfsSubsetEval_NaiveBayesKappa": 0.1842900302114801, "CfsSubsetEval_kNN1NAUC": 0.5521338115248637, "StdvNominalAttDistinctValues": 4.790864322059325, "J48.001.ErrRate": 0.20833333333333334, "MeanNominalAttDistinctValues": 10.428571428571429, "Quartile1SkewnessOfNumericAtts": 1.1938304283179035, "REPTreeDepth1Kappa": 0.3641304347826087, "CfsSubsetEval_kNN1NErrRate": 0.20833333333333334, "kNN1NAUC": 0.886263775463237, "J48.001.Kappa": 0.1842900302114801, "MeanSkewnessOfNumericAtts": 2.858195078505295, "Quartile1StdDevOfNumericAtts": 1.883323619668397, "REPTreeDepth2AUC": 0.8568123862707542, "CfsSubsetEval_kNN1NKappa": 0.1842900302114801, "kNN1NErrRate": 0.08333333333333333, "MajorityClassPercentage": 72.36842105263158, "MeanStdDevOfNumericAtts": 38.94290292467051, "Quartile2AttributeEntropy": 2.5113886167162316, "REPTreeDepth2ErrRate": 0.18055555555555555, "ClassEntropy": 1.1862751399332505, "kNN1NKappa": 0.7677419354838708, "MajorityClassSize": 55, "MinAttributeEntropy": 1.6107283753454333, "Quartile2KurtosisOfNumericAtts": 5.429875788499264, "REPTreeDepth2Kappa": 0.3641304347826087, "REPTreeDepth3AUC": 0.8568123862707542, "DecisionStumpAUC": 0.9032150630628263, "MaxAttributeEntropy": 2.748577901287835, "MinKurtosisOfNumericAtts": -1.2536699972789094, "Quartile2MeansOfNumericAtts": 4.447368421052632, "REPTreeDepth3ErrRate": 0.18055555555555555, "DecisionStumpErrRate": 0.09722222222222222, "MaxKurtosisOfNumericAtts": 49.69637050795677, "MinMeansOfNumericAtts": 1.9473684210526314, "Quartile2MutualInformation": 0.8219062058197351 }, "tags": [ { "uploader": "38960", "tag": "Chemistry" }, { "uploader": "38960", "tag": "Life Science" }, { "uploader": "24659", "tag": "PROMISE" }, { "uploader": "2", "tag": "study_1" }, { "uploader": "1", "tag": "study_41" } ], "features": [ { "name": "AppType", "index": "14", "type": "nominal", "distinct": "6", "missing": "4", "target": "1", "distr": [ [ "BC\/S", "B\/S", "C\/S", "C", "S", "B" ], [ [ "55", "0", "0", "0", "0", "0" ], [ "0", "8", "0", "0", "0", "0" ], [ "0", "0", "5", "0", "0", "0" ], [ "0", "0", "0", "1", "0", "0" ], [ "0", "0", "0", "0", "2", "0" ], [ "0", "0", "0", "0", "0", "1" ] ] ] }, { "name": "ID", "index": "0", "type": "numeric", "distinct": "76", "missing": "0", "min": "101", "max": "920", "mean": "344", "stdev": "216" }, { "name": "Effort", "index": "1", "type": "numeric", "distinct": "18", "missing": "0", "min": "1", "max": "40", "mean": "6", "stdev": "9" }, { "name": "IntComplx", "index": "2", "type": "numeric", "distinct": "5", "missing": "0", "min": "1", "max": "5", "mean": "2", "stdev": "1" }, { "name": "DataFile", "index": "3", "type": "numeric", "distinct": "11", "missing": "0", "min": "0", "max": "18", "mean": "3", "stdev": "3" }, { "name": "DataEn", "index": "4", "type": "numeric", "distinct": "19", "missing": "0", "min": "0", "max": "314", "mean": "17", "stdev": "48" }, { "name": "DataOut", "index": "5", "type": "numeric", "distinct": "8", "missing": "2", "min": "0", "max": "50", "mean": "2", "stdev": "6" }, { "name": "UFP", "index": "6", "type": "numeric", "distinct": "24", "missing": "2", "min": "0", "max": "180", "mean": "12", "stdev": "26" }, { "name": "Lang", "index": "7", "type": "nominal", "distinct": "14", "missing": "2", "distr": [ [ "sql", "html,_php,_sql,_proprietary", "php,_sql", "Html,_JavaScript", "Php,_Html,_Sql,_JavaScript", "C#,_ASP.Net_SQL", "PHP,_SQL,_SH", "PHP,_SQL", "HTML,_PHP,_SQL", "PHP,_HTML", "PHP,_MySql,_HTML", "PHP", "HTML", "SQL" ], [ [ "2", "0", "0", "0", "0", "0" ], [ "6", "0", "0", "0", "0", "0" ], [ "1", "0", "0", "0", "0", "0" ], [ "0", "1", "0", "0", "0", "0" ], [ "38", "0", "0", "0", "0", "0" ], [ "3", "0", "2", "1", "0", "0" ], [ "0", "1", "0", "0", "0", "0" ], [ "0", "2", "0", "0", "0", "0" ], [ "0", "0", "3", "0", "0", "0" ], [ "2", "0", "0", "0", "0", "0" ], [ "1", "0", "0", "0", "0", "0" ], [ "5", "4", "0", "0", "0", "0" ], [ "0", "0", "0", "0", "0", "1" ], [ "1", "0", "0", "0", "0", "0" ] ] ] }, { "name": "Tools", "index": "8", "type": "nominal", "distinct": "16", "missing": "2", "distr": [ [ "mySQLweb", "notepad,_webforms", "notepad", "Vim", "Vim,_Emacs,_Microsoft_visual_studio", "Visual_Studio.Net_2003,_Microsoft_SQL_Server_Enterprise_Manager\/Query_Analyzer", "Pico,_MySQLDump", "Dreamweaver,_Rapid_PHP,_Eclipse,_PHPMyAdmin", "ConText,_Jedit", "ConTEXT,_Jedit,_myAdmin", "ConTEXT", "Microsoft_Visual,_Emacs,_VIM,_Notepad", "Dreamweaver", "Notepad", "Emacs", "SQL" ], [ [ "2", "0", "0", "0", "0", "0" ], [ "6", "0", "0", "0", "0", "0" ], [ "1", "0", "0", "0", "0", "0" ], [ "0", "1", "0", "0", "0", "0" ], [ "38", "0", "0", "0", "0", "0" ], [ "3", "0", "2", "1", "0", "0" ], [ "0", "1", "0", "0", "0", "0" ], [ "0", "2", "0", "0", "0", "0" ], [ "0", "0", "1", "0", "0", "0" ], [ "0", "0", "1", "0", "0", "0" ], [ "0", "0", "1", "0", "0", "0" ], [ "1", "0", "0", "0", "0", "0" ], [ "2", "0", "0", "0", "0", "0" ], [ "0", "4", "0", "0", "0", "0" ], [ "5", "0", "0", "0", "0", "1" ], [ "1", "0", "0", "0", "0", "0" ] ] ] }, { "name": "ToolExpr", "index": "9", "type": "nominal", "distinct": "15", "missing": "2", "distr": [ [ "[2,60]", "[2,_60]", "[5,10]", "[1,10]", "[0]", "[0,_48]", "[5,_100]", "[4,24]", "[2,5]", "[2,12]", "[0,12]", "[40,60]", "[0,_60]", "[0,_12]", "[0,24]" ], [ [ "3", "0", "0", "0", "0", "0" ], [ "6", "0", "0", "0", "0", "0" ], [ "0", "1", "0", "0", "0", "0" ], [ "38", "0", "0", "0", "0", "0" ], [ "3", "0", "0", "1", "0", "0" ], [ "0", "0", "2", "0", "0", "0" ], [ "0", "1", "0", "0", "0", "0" ], [ "0", "1", "0", "0", "0", "0" ], [ "0", "1", "0", "0", "0", "0" ], [ "0", "0", "2", "0", "0", "0" ], [ "0", "4", "1", "0", "0", "0" ], [ "1", "0", "0", "0", "0", "0" ], [ "2", "0", "0", "0", "0", "0" ], [ "5", "0", "0", "0", "0", "1" ], [ "1", "0", "0", "0", "0", "0" ] ] ] }, { "name": "AppExpr", "index": "10", "type": "numeric", "distinct": "5", "missing": "0", "min": "1", "max": "5", "mean": "2", "stdev": "1" }, { "name": "TeamSize", "index": "11", "type": "nominal", "distinct": "11", "missing": "0", "distr": [ [ "[1]", "[1,2]", "[2,3]", "[3,5]", "[2,_4]", "[1,1]", "[2,2]", "[3,4]", "[2,_3]", "[1,_3]", "[1,_2]" ], [ [ "12", "0", "2", "0", "0", "0" ], [ "29", "5", "2", "0", "0", "0" ], [ "11", "0", "0", "0", "0", "0" ], [ "2", "0", "0", "0", "0", "0" ], [ "0", "0", "0", "1", "0", "1" ], [ "0", "3", "0", "0", "0", "0" ], [ "0", "0", "0", "0", "2", "0" ], [ "0", "0", "1", "0", "0", "0" ], [ "2", "0", "0", "0", "0", "0" ], [ "1", "0", "0", "0", "0", "0" ], [ "2", "0", "0", "0", "0", "0" ] ] ] }, { "name": "DBMS", "index": "12", "type": "nominal", "distinct": "4", "missing": "9", "distr": [ [ "mysql", "Oracle", "SQLServer", "MySQL" ], [ [ "9", "0", "0", "0", "0", "0" ], [ "39", "4", "0", "0", "0", "0" ], [ "3", "0", "2", "1", "0", "0" ], [ "6", "3", "0", "0", "0", "0" ] ] ] }, { "name": "Method", "index": "13", "type": "nominal", "distinct": "7", "missing": "14", "distr": [ [ "SA,SD", "None", "SA", "3_Tier_Architecture", "OO", "Imperative", "oo" ], [ [ "6", "0", "0", "0", "0", "0" ], [ "1", "0", "0", "0", "0", "0" ], [ "0", "1", "0", "0", "0", "0" ], [ "38", "0", "0", "0", "0", "0" ], [ "0", "7", "0", "0", "0", "0" ], [ "3", "0", "0", "0", "0", "0" ], [ "5", "0", "0", "0", "0", "1" ] ] ] } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 0, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 0 }