{ "data_id": "40966", "name": "MiceProtein", "exact_name": "MiceProtein", "version": 4, "version_label": null, "description": "**Author**: Clara Higuera, Katheleen J. Gardiner, Krzysztof J. Cios \r\n**Source**: [UCI](https:\/\/archive.ics.uci.edu\/ml\/datasets\/Mice+Protein+Expression) - 2015 \r\n**Please cite**: Higuera C, Gardiner KJ, Cios KJ (2015) Self-Organizing Feature Maps Identify Proteins Critical to Learning in a Mouse Model of Down Syndrome. PLoS ONE 10(6): e0129126.\r\n\r\nExpression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative learning.\r\n\r\nThe data set consists of the expression levels of 77 proteins\/protein modifications that produced detectable signals in the nuclear fraction of cortex. There are 38 control mice and 34 trisomic mice (Down syndrome), for a total of 72 mice. In the experiments, 15 measurements were registered of each protein per sample\/mouse. Therefore, for control mice, there are 38x15, or 570 measurements, and for trisomic mice, there are 34x15, or 510 measurements. The dataset contains a total of 1080 measurements per protein. Each measurement can be considered as an independent sample\/mouse. \r\n\r\nThe eight classes of mice are described based on features such as genotype, behavior and treatment. According to genotype, mice can be control or trisomic. According to behavior, some mice have been stimulated to learn (context-shock) and others have not (shock-context) and in order to assess the effect of the drug memantine in recovering the ability to learn in trisomic mice, some mice have been injected with the drug and others have not. \r\n\r\nClasses: \r\n```\r\n* c-CS-s: control mice, stimulated to learn, injected with saline (9 mice) \r\n* c-CS-m: control mice, stimulated to learn, injected with memantine (10 mice) \r\n* c-SC-s: control mice, not stimulated to learn, injected with saline (9 mice) \r\n* c-SC-m: control mice, not stimulated to learn, injected with memantine (10 mice) \r\n* t-CS-s: trisomy mice, stimulated to learn, injected with saline (7 mice) \r\n* t-CS-m: trisomy mice, stimulated to learn, injected with memantine (9 mice) \r\n* t-SC-s: trisomy mice, not stimulated to learn, injected with saline (9 mice) \r\n* t-SC-m: trisomy mice, not stimulated to learn, injected with memantine (9 mice) \r\n```\r\n\r\nThe aim is to identify subsets of proteins that are discriminant between the classes. \r\n\r\n### Attribute Information:\r\n\r\n```\r\n1 Mouse ID \r\n2..78 Values of expression levels of 77 proteins; the names of proteins are followed by “_nâ€\u009d indicating that they were measured in the nuclear fraction. For example: DYRK1A_n \r\n79 Genotype: control (c) or trisomy (t) \r\n80 Treatment type: memantine (m) or saline (s) \r\n81 Behavior: context-shock (CS) or shock-context (SC) \r\n82 Class: c-CS-s, c-CS-m, c-SC-s, c-SC-m, t-CS-s, t-CS-m, t-SC-s, t-SC-m \r\n```\r\n\r\n### Relevant Papers:\r\n\r\nHiguera C, Gardiner KJ, Cios KJ (2015) Self-Organizing Feature Maps Identify Proteins Critical to Learning in a Mouse Model of Down Syndrome. PLoS ONE 10(6): e0129126. [Web Link] journal.pone.0129126 \r\n\r\nAhmed MM, Dhanasekaran AR, Block A, Tong S, Costa ACS, Stasko M, et al. (2015) Protein Dynamics Associated with Failed and Rescued Learning in the Ts65Dn Mouse Model of Down Syndrome. PLoS ONE 10(3): e0119491.\r\n\r\n", "format": "ARFF", "uploader": "Joaquin Vanschoren", "uploader_id": 2, "visibility": "public", "creator": null, "contributor": null, "date": "2017-11-08 16:00:15", "update_comment": null, "last_update": "2017-11-08 16:00:15", "licence": "Public", "status": "active", "error_message": null, "url": "https:\/\/www.openml.org\/data\/download\/17928620\/phpchCuL5", "default_target_attribute": "class", "row_id_attribute": "MouseID", "ignore_attribute": "Genotype,Treatment,Behavior", "runs": 9545, "suggest": { "input": [ "MiceProtein", "Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative learning. The data set consists of the expression levels of 77 proteins\/protein modifications that produced detectable signals in the nuclear fraction of cortex. There are 38 control mice and 34 trisomic mice (Down syndrome), for a total of 72 mice. In the experiments, 15 measurements were registered of each protei " ], "weight": 5 }, "qualities": { "NumberOfInstances": 1080, "NumberOfFeatures": 82, "NumberOfClasses": 8, "NumberOfMissingValues": 1396, "NumberOfInstancesWithMissingValues": 528, "NumberOfNumericFeatures": 77, "NumberOfSymbolicFeatures": 5, "EquivalentNumberOfAtts": null, "MeanSkewnessOfNumericAtts": 0.7856041308218359, "Quartile2MeansOfNumericAtts": 0.37851854251532036, "MajorityClassPercentage": 13.88888888888889, "MeanStdDevOfNumericAtts": 0.15821417818483133, "Quartile2MutualInformation": null, "MajorityClassSize": 150, "MinAttributeEntropy": null, "Quartile2SkewnessOfNumericAtts": 0.4589498552493325, "MaxAttributeEntropy": null, "MinKurtosisOfNumericAtts": -0.737462966449089, "PercentageOfBinaryFeatures": 3.6585365853658534, "Quartile2StdDevOfNumericAtts": 0.0664321589555671, "MaxKurtosisOfNumericAtts": 62.54850588815514, "MinMeansOfNumericAtts": 0.12088914805462964, "PercentageOfInstancesWithMissingValues": 48.888888888888886, "Quartile3AttributeEntropy": null, "MaxMeansOfNumericAtts": 3.843933946299907, "MinMutualInformation": null, "PercentageOfMissingValues": 1.5763324299909665, "Quartile3KurtosisOfNumericAtts": 2.288843620529632, "MaxMutualInformation": null, "MinNominalAttDistinctValues": 8, "PercentageOfNumericFeatures": 93.90243902439023, "Quartile3MeansOfNumericAtts": 0.792429770604921, "MaxNominalAttDistinctValues": 8, "MinSkewnessOfNumericAtts": -0.8812167378685444, "PercentageOfSymbolicFeatures": 6.097560975609756, "Quartile3MutualInformation": null, "MaxSkewnessOfNumericAtts": 4.738900324275432, "MinStdDevOfNumericAtts": 0.013233318836526299, "Quartile1AttributeEntropy": null, "Quartile3SkewnessOfNumericAtts": 0.9742697242064902, "MaxStdDevOfNumericAtts": 1.2951694678998307, "MinorityClassPercentage": 9.722222222222223, "Quartile1KurtosisOfNumericAtts": 0.21007824919108353, "Quartile3StdDevOfNumericAtts": 0.23056986700195908, "MeanAttributeEntropy": null, "MinorityClassSize": 105, "Quartile1MeansOfNumericAtts": 0.194287056601043, "StdvNominalAttDistinctValues": 0, "MeanKurtosisOfNumericAtts": 4.539576265311827, "NumberOfBinaryFeatures": 3, "Quartile1MutualInformation": null, "MeanMeansOfNumericAtts": 0.6718868120257284, "Quartile1SkewnessOfNumericAtts": 0.03001495604201651, "AutoCorrelation": 0.9935125115848007, "MeanMutualInformation": null, "Quartile1StdDevOfNumericAtts": 0.03224130544967179, "ClassEntropy": 2.993026787316555, "MeanNoiseToSignalRatio": null, "Quartile2AttributeEntropy": null, "Dimensionality": 0.07592592592592592, "MeanNominalAttDistinctValues": 8, "Quartile2KurtosisOfNumericAtts": 0.892962900950149 }, "tags": [ { "uploader": "38960", "tag": "Economics" }, { "uploader": "1", "tag": "OpenML-CC18" }, { "uploader": "5824", "tag": "study_135" }, { "uploader": "1935", "tag": "study_98" }, { "uploader": "1", "tag": "study_99" } ], "features": [ { "name": "class", "index": "81", "type": "nominal", "distinct": "8", "missing": "0", "target": "1", "distr": [ [ "c-CS-m", "c-CS-s", "c-SC-m", "c-SC-s", "t-CS-m", "t-CS-s", "t-SC-m", "t-SC-s" ], [ [ "150", "0", "0", "0", "0", "0", "0", "0" ], [ "0", "135", "0", "0", "0", "0", "0", "0" ], [ "0", "0", "150", "0", "0", "0", "0", "0" ], [ "0", "0", "0", "135", "0", "0", "0", "0" ], [ "0", "0", "0", "0", "135", "0", "0", "0" ], [ "0", "0", "0", "0", "0", "105", "0", "0" ], [ "0", "0", "0", "0", "0", "0", "135", "0" ], [ "0", "0", "0", "0", "0", "0", "0", "135" ] ] ] }, { "name": "MouseID", "index": "0", "type": "nominal", "distinct": "1080", "missing": "0", "identifier": "1", "distr": [] }, { "name": "DYRK1A_N", "index": "1", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "3", "mean": "0", "stdev": "0" }, { "name": "ITSN1_N", "index": "2", "type": "numeric", "distinct": "1076", "missing": "3", "min": "0", "max": "3", "mean": "1", "stdev": "0" }, { "name": "BDNF_N", "index": "3", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "NR1_N", "index": "4", "type": "numeric", "distinct": "1077", "missing": "3", "min": "1", "max": "4", "mean": "2", "stdev": "0" }, { "name": "NR2A_N", "index": "5", "type": "numeric", "distinct": "1077", "missing": "3", "min": "2", "max": "8", "mean": "4", "stdev": "1" }, { "name": "pAKT_N", "index": "6", "type": "numeric", "distinct": "1076", "missing": "3", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "pBRAF_N", "index": "7", "type": "numeric", "distinct": "1075", "missing": "3", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "pCAMKII_N", "index": "8", "type": "numeric", "distinct": "1077", "missing": "3", "min": "1", "max": "7", "mean": "4", "stdev": "1" }, { "name": "pCREB_N", "index": "9", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "pELK_N", "index": "10", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "6", "mean": "1", "stdev": "0" }, { "name": "pERK_N", "index": "11", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "4", "mean": "1", "stdev": "0" }, { "name": "pJNK_N", "index": "12", "type": "numeric", "distinct": "1076", "missing": "3", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "PKCA_N", "index": "13", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "pMEK_N", "index": "14", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "pNR1_N", "index": "15", "type": "numeric", "distinct": "1077", "missing": "3", "min": "1", "max": "1", "mean": "1", "stdev": "0" }, { "name": "pNR2A_N", "index": "16", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "1", "mean": "1", "stdev": "0" }, { "name": "pNR2B_N", "index": "17", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "3", "mean": "2", "stdev": "0" }, { "name": "pPKCAB_N", "index": "18", "type": "numeric", "distinct": "1077", "missing": "3", "min": "1", "max": "3", "mean": "2", "stdev": "0" }, { "name": "pRSK_N", "index": "19", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "AKT_N", "index": "20", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "1", "mean": "1", "stdev": "0" }, { "name": "BRAF_N", "index": "21", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "2", "mean": "0", "stdev": "0" }, { "name": "CAMKII_N", "index": "22", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "CREB_N", "index": "23", "type": "numeric", "distinct": "1073", "missing": "3", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "ELK_N", "index": "24", "type": "numeric", "distinct": "1062", "missing": "18", "min": "0", "max": "3", "mean": "1", "stdev": "0" }, { "name": "ERK_N", "index": "25", "type": "numeric", "distinct": "1077", "missing": "3", "min": "1", "max": "5", "mean": "2", "stdev": "1" }, { "name": "GSK3B_N", "index": "26", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "2", "mean": "1", "stdev": "0" }, { "name": "JNK_N", "index": "27", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "MEK_N", "index": "28", "type": "numeric", "distinct": "1072", "missing": "7", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "TRKA_N", "index": "29", "type": "numeric", "distinct": "1075", "missing": "3", "min": "0", "max": "1", "mean": "1", "stdev": "0" }, { "name": "RSK_N", "index": "30", "type": "numeric", "distinct": "1074", "missing": "3", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "APP_N", "index": "31", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "Bcatenin_N", "index": "32", "type": "numeric", "distinct": "1062", "missing": "18", "min": "1", "max": "4", "mean": "2", "stdev": "0" }, { "name": "SOD1_N", "index": "33", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "2", "mean": "1", "stdev": "0" }, { "name": "MTOR_N", "index": "34", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "P38_N", "index": "35", "type": "numeric", "distinct": "1075", "missing": "3", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "pMTOR_N", "index": "36", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "1", "mean": "1", "stdev": "0" }, { "name": "DSCR1_N", "index": "37", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "1", "mean": "1", "stdev": "0" }, { "name": "AMPKA_N", "index": "38", "type": "numeric", "distinct": "1075", "missing": "3", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "NR2B_N", "index": "39", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "1", "mean": "1", "stdev": "0" }, { "name": "pNUMB_N", "index": "40", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "RAPTOR_N", "index": "41", "type": "numeric", "distinct": "1077", "missing": "3", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "TIAM1_N", "index": "42", "type": "numeric", "distinct": "1075", "missing": "3", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "pP70S6_N", "index": "43", "type": "numeric", "distinct": "1076", "missing": "3", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "NUMB_N", "index": "44", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "P70S6_N", "index": "45", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "2", "mean": "1", "stdev": "0" }, { "name": "pGSK3B_N", "index": "46", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "pPKCG_N", "index": "47", "type": "numeric", "distinct": "1080", "missing": "0", "min": "1", "max": "3", "mean": "2", "stdev": "1" }, { "name": "CDK5_N", "index": "48", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "S6_N", "index": "49", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "ADARB1_N", "index": "50", "type": "numeric", "distinct": "1080", "missing": "0", "min": "1", "max": "3", "mean": "1", "stdev": "0" }, { "name": "AcetylH3K9_N", "index": "51", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "RRP1_N", "index": "52", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "BAX_N", "index": "53", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "ARC_N", "index": "54", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "ERBB4_N", "index": "55", "type": "numeric", "distinct": "1079", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "nNOS_N", "index": "56", "type": "numeric", "distinct": "1079", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "Tau_N", "index": "57", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "GFAP_N", "index": "58", "type": "numeric", "distinct": "1079", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "GluR3_N", "index": "59", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "GluR4_N", "index": "60", "type": "numeric", "distinct": "1079", "missing": "0", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "IL1B_N", "index": "61", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "1", "mean": "1", "stdev": "0" }, { "name": "P3525_N", "index": "62", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "pCASP9_N", "index": "63", "type": "numeric", "distinct": "1080", "missing": "0", "min": "1", "max": "3", "mean": "2", "stdev": "0" }, { "name": "PSD95_N", "index": "64", "type": "numeric", "distinct": "1080", "missing": "0", "min": "1", "max": "3", "mean": "2", "stdev": "0" }, { "name": "SNCA_N", "index": "65", "type": "numeric", "distinct": "1079", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "Ubiquitin_N", "index": "66", "type": "numeric", "distinct": "1080", "missing": "0", "min": "1", "max": "2", "mean": "1", "stdev": "0" }, { "name": "pGSK3B_Tyr216_N", "index": "67", "type": "numeric", "distinct": "1080", "missing": "0", "min": "1", "max": "1", "mean": "1", "stdev": "0" }, { "name": "SHH_N", "index": "68", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "BAD_N", "index": "69", "type": "numeric", "distinct": "866", "missing": "213", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "BCL2_N", "index": "70", "type": "numeric", "distinct": "795", "missing": "285", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "pS6_N", "index": "71", "type": "numeric", "distinct": "1080", "missing": "0", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "pCFOS_N", "index": "72", "type": "numeric", "distinct": "1005", "missing": "75", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "SYP_N", "index": "73", "type": "numeric", "distinct": "1079", "missing": "0", "min": "0", "max": "1", "mean": "0", "stdev": "0" }, { "name": "H3AcK18_N", "index": "74", "type": "numeric", "distinct": "900", "missing": "180", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "EGR1_N", "index": "75", "type": "numeric", "distinct": "870", "missing": "210", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "H3MeK4_N", "index": "76", "type": "numeric", "distinct": "810", "missing": "270", "min": "0", "max": "0", "mean": "0", "stdev": "0" }, { "name": "CaNA_N", "index": "77", "type": "numeric", "distinct": "1080", "missing": "0", "min": "1", "max": "2", "mean": "1", "stdev": "0" }, { "name": "Genotype", "index": "78", "type": "nominal", "distinct": "2", "missing": "0", "ignore": "1", "distr": [ [ "Control", "Ts65Dn" ], [ [ "150", "135", "150", "135", "0", "0", "0", "0" ], [ "0", "0", "0", "0", "135", "105", "135", "135" ] ] ] }, { "name": "Treatment", "index": "79", "type": "nominal", "distinct": "2", "missing": "0", "ignore": "1", "distr": [ [ "Memantine", "Saline" ], [ [ "150", "0", "150", "0", "135", "0", "135", "0" ], [ "0", "135", "0", "135", "0", "105", "0", "135" ] ] ] }, { "name": "Behavior", "index": "80", "type": "nominal", "distinct": "2", "missing": "0", "ignore": "1", "distr": [ [ "C\/S", "S\/C" ], [ [ "150", "135", "0", "0", "135", "105", "0", "0" ], [ "0", "0", "150", "135", "0", "0", "135", "135" ] ] ] } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 21, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 21 }