Allstate_Claims_Severity

Status: active. Format: ARFF. Visibility: public (publicly available). Uploaded 29-06-2020 by Marcos de Paula Bueno.


When you've been devastated by a serious car accident, your focus is on the things that matter the most: family, friends, and other loved ones. Pushing paper with your insurance agent is the last place you want your time or mental energy spent. This is why Allstate, a personal insurer in the United States, is continually seeking fresh ideas to improve their claims service for the over 16 million households they protect.

Allstate is currently developing automated methods of predicting the cost, and hence severity, of claims. In this recruitment challenge, Kagglers are invited to show off their creativity and flex their technical chops by creating an algorithm which accurately predicts claims severity. Aspiring competitors will demonstrate insight into better ways to predict claims severity for the chance to be part of Allstate's efforts to ensure a worry-free customer experience.

Each row in this dataset represents an insurance claim. You must predict the value for the 'loss' column. Variables prefaced with 'cat' are categorical, while those prefaced with 'cont' are continuous.
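The naming convention in the description can be turned into a small column-role lookup. The following is a minimal sketch in plain Python, not part of the dataset page: the function name `split_columns` and the role labels are hypothetical, and the column list is rebuilt from the feature names shown on this page (id, cat1 to cat116, cont1 to cont14, loss) rather than read from the ARFF file.

```python
def split_columns(columns, target="loss", row_id="id"):
    """Group column names by role, using the prefixes named in the description.

    `split_columns` and the role labels are illustrative names, not an OpenML API.
    """
    roles = {"categorical": [], "continuous": [], "target": [], "row_id": [], "other": []}
    for name in columns:
        if name == target:
            roles["target"].append(name)
        elif name == row_id:
            roles["row_id"].append(name)
        elif name.startswith("cat"):
            roles["categorical"].append(name)   # 'cat' prefix: categorical
        elif name.startswith("cont"):
            roles["continuous"].append(name)    # 'cont' prefix: continuous
        else:
            roles["other"].append(name)
    return roles


# Column names as listed on this page; a real run would take them from the loaded file.
columns = (
    ["id"]
    + [f"cat{i}" for i in range(1, 117)]
    + [f"cont{i}" for i in range(1, 15)]
    + ["loss"]
)
roles = split_columns(columns)
print(len(roles["categorical"]), len(roles["continuous"]), roles["target"])
```

Keeping the target and the row identifier in their own groups makes it harder to feed 'id' or 'loss' into a model as an input feature by accident.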

131 features

loss (target): numeric, 158223 unique values, 0 missing
id (row identifier): numeric, 188318 unique values, 0 missing
cat1: nominal, 2 unique values, 0 missing
cat2: nominal, 2 unique values, 0 missing
cat3: nominal, 2 unique values, 0 missing
cat4: nominal, 2 unique values, 0 missing
cat5: nominal, 2 unique values, 0 missing
cat6: nominal, 2 unique values, 0 missing
cat7: nominal, 2 unique values, 0 missing
cat8: nominal, 2 unique values, 0 missing
cat9: nominal, 2 unique values, 0 missing
cat10: nominal, 2 unique values, 0 missing
cat11: nominal, 2 unique values, 0 missing
cat12: nominal, 2 unique values, 0 missing
cat13: nominal, 2 unique values, 0 missing
cat14: nominal, 2 unique values, 0 missing
cat15: nominal, 2 unique values, 0 missing
cat16: nominal, 2 unique values, 0 missing
cat17: nominal, 2 unique values, 0 missing
cat18: nominal, 2 unique values, 0 missing
cat19: nominal, 2 unique values, 0 missing
cat20: nominal, 2 unique values, 0 missing
cat21: nominal, 2 unique values, 0 missing
cat22: nominal, 2 unique values, 0 missing
cat23: nominal, 2 unique values, 0 missing
cat24: nominal, 2 unique values, 0 missing
cat25: nominal, 2 unique values, 0 missing
cat26: nominal, 2 unique values, 0 missing
cat27: nominal, 2 unique values, 0 missing
cat28: nominal, 2 unique values, 0 missing
cat29: nominal, 2 unique values, 0 missing
cat30: nominal, 2 unique values, 0 missing
cat31: nominal, 2 unique values, 0 missing
cat32: nominal, 2 unique values, 0 missing
cat33: nominal, 2 unique values, 0 missing
cat34: nominal, 2 unique values, 0 missing
cat35: nominal, 2 unique values, 0 missing
cat36: nominal, 2 unique values, 0 missing
cat37: nominal, 2 unique values, 0 missing
cat38: nominal, 2 unique values, 0 missing
cat39: nominal, 2 unique values, 0 missing
cat40: nominal, 2 unique values, 0 missing
cat41: nominal, 2 unique values, 0 missing
cat42: nominal, 2 unique values, 0 missing
cat43: nominal, 2 unique values, 0 missing
cat44: nominal, 2 unique values, 0 missing
cat45: nominal, 2 unique values, 0 missing
cat46: nominal, 2 unique values, 0 missing
cat47: nominal, 2 unique values, 0 missing
cat48: nominal, 2 unique values, 0 missing
cat49: nominal, 2 unique values, 0 missing
cat50: nominal, 2 unique values, 0 missing
cat51: nominal, 2 unique values, 0 missing
cat52: nominal, 2 unique values, 0 missing
cat53: nominal, 2 unique values, 0 missing
cat54: nominal, 2 unique values, 0 missing
cat55: nominal, 2 unique values, 0 missing
cat56: nominal, 2 unique values, 0 missing
cat57: nominal, 2 unique values, 0 missing
cat58: nominal, 2 unique values, 0 missing
cat59: nominal, 2 unique values, 0 missing
cat60: nominal, 2 unique values, 0 missing
cat61: nominal, 2 unique values, 0 missing
cat62: nominal, 2 unique values, 0 missing
cat63: nominal, 2 unique values, 0 missing
cat64: nominal, 2 unique values, 0 missing
cat65: nominal, 2 unique values, 0 missing
cat66: nominal, 2 unique values, 0 missing
cat67: nominal, 2 unique values, 0 missing
cat68: nominal, 2 unique values, 0 missing
cat69: nominal, 2 unique values, 0 missing
cat70: nominal, 2 unique values, 0 missing
cat71: nominal, 2 unique values, 0 missing
cat72: nominal, 2 unique values, 0 missing
cat73: nominal, 3 unique values, 0 missing
cat74: nominal, 3 unique values, 0 missing
cat75: nominal, 3 unique values, 0 missing
cat76: nominal, 3 unique values, 0 missing
cat77: nominal, 4 unique values, 0 missing
cat78: nominal, 4 unique values, 0 missing
cat79: nominal, 4 unique values, 0 missing
cat80: nominal, 4 unique values, 0 missing
cat81: nominal, 4 unique values, 0 missing
cat82: nominal, 4 unique values, 0 missing
cat83: nominal, 4 unique values, 0 missing
cat84: nominal, 4 unique values, 0 missing
cat85: nominal, 4 unique values, 0 missing
cat86: nominal, 4 unique values, 0 missing
cat87: nominal, 4 unique values, 0 missing
cat88: nominal, 4 unique values, 0 missing
cat89: nominal, 8 unique values, 0 missing
cat90: nominal, 7 unique values, 0 missing
cat91: nominal, 8 unique values, 0 missing
cat92: nominal, 7 unique values, 0 missing
cat93: nominal, 5 unique values, 0 missing
cat94: nominal, 7 unique values, 0 missing
cat95: nominal, 5 unique values, 0 missing
cat96: nominal, 8 unique values, 0 missing
cat97: nominal, 7 unique values, 0 missing
cat98: nominal, 5 unique values, 0 missing
cat99: nominal, 16 unique values, 0 missing
cat100: nominal, 15 unique values, 0 missing
cat101: nominal, 19 unique values, 0 missing
cat102: nominal, 9 unique values, 0 missing
cat103: nominal, 13 unique values, 0 missing
cat104: nominal, 17 unique values, 0 missing
cat105: nominal, 20 unique values, 0 missing
cat106: nominal, 17 unique values, 0 missing
cat107: nominal, 20 unique values, 0 missing
cat108: nominal, 11 unique values, 0 missing
cat109: nominal, 84 unique values, 0 missing
cat110: nominal, 131 unique values, 0 missing
cat111: nominal, 16 unique values, 0 missing
cat112: nominal, 51 unique values, 0 missing
cat113: nominal, 61 unique values, 0 missing
cat114: nominal, 19 unique values, 0 missing
cat115: nominal, 23 unique values, 0 missing
cat116: nominal, 326 unique values, 0 missing
cont1: numeric, 647 unique values, 0 missing
cont2: numeric, 33 unique values, 0 missing
cont3: numeric, 76 unique values, 0 missing
cont4: numeric, 112 unique values, 0 missing
cont5: numeric, 141 unique values, 0 missing
cont6: numeric, 2573 unique values, 0 missing
cont7: numeric, 5632 unique values, 0 missing
cont8: numeric, 201 unique values, 0 missing
cont9: numeric, 347 unique values, 0 missing
cont10: numeric, 174 unique values, 0 missing
cont11: numeric, 326 unique values, 0 missing
cont12: numeric, 328 unique values, 0 missing
cont13: numeric, 353 unique values, 0 missing
cont14: numeric, 18740 unique values, 0 missing

19 properties

Number of instances (rows) of the dataset: 188318
Number of attributes (columns) of the dataset: 131
Number of distinct values of the target attribute (if it is nominal): 0
Number of missing values in the dataset: 0
Number of instances with at least one value missing: 0
Number of numeric attributes: 15
Number of nominal attributes: 116
Number of attributes divided by the number of instances: 0
Percentage of numeric attributes: 11.45
Percentage of instances belonging to the most frequent class: (no value listed)
Percentage of nominal attributes: 88.55
Number of instances belonging to the most frequent class: (no value listed)
Percentage of instances belonging to the least frequent class: (no value listed)
Number of instances belonging to the least frequent class: (no value listed)
Number of binary attributes: 72
Percentage of binary attributes: 54.96
Percentage of instances having missing values: 0
Average class difference between consecutive instances: -2670.91
Percentage of missing values: 0
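The percentage properties follow from the counts listed in the same section. The sketch below reproduces that arithmetic, assuming each percentage is the count divided by the 131 attributes and rounded to two decimals. The rounding rule and the helper name `pct` are assumptions, not something the page states.

```python
# Counts copied from the properties listed on this page.
n_instances = 188318
n_attributes = 131
n_numeric = 15
n_nominal = 116
n_binary = 72


def pct(part, whole):
    """Percentage of `part` in `whole`, rounded to two decimals (assumed rounding)."""
    return round(100.0 * part / whole, 2)


pct_numeric = pct(n_numeric, n_attributes)   # 15 of 131 attributes
pct_nominal = pct(n_nominal, n_attributes)   # 116 of 131 attributes
pct_binary = pct(n_binary, n_attributes)     # 72 of 131 attributes

# Attributes per instance; far below 0.01, so two-decimal rounding shows it as 0.
dimensionality = n_attributes / n_instances

print(pct_numeric, pct_nominal, pct_binary, round(dimensionality, 2))
```

Under these assumptions the results match the listed 11.45, 88.55 and 54.96, and the ratio of attributes to instances rounds to the 0 shown on the page. The count of 15 numeric attributes is consistent with 'loss' plus cont1 to cont14, which suggests the 'id' row identifier is not counted among the 131 attributes.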

8 tasks

3 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: root_mean_squared_error - target_feature: loss
0 runs - estimation_procedure: 33% Holdout set - target_feature: loss
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
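The first task pairs 10-fold cross-validation with root mean squared error on 'loss'. As a rough illustration of that evaluation setup, here is a minimal sketch in plain Python. The fold assignment, the mean-of-training-fold baseline, and the toy loss values are all made up for the example; OpenML tasks define their own fixed splits, which this does not reproduce.

```python
import math
import random


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def k_fold_indices(n_rows, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs for a shuffled k-fold split (illustrative)."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k nearly equal-sized folds
    for i in range(k):
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, folds[i]


def cross_validated_rmse(y, k=10, seed=0):
    """Average RMSE of a baseline that predicts the training-fold mean."""
    scores = []
    for train_idx, test_idx in k_fold_indices(len(y), k, seed):
        mean_loss = sum(y[j] for j in train_idx) / len(train_idx)
        predictions = [mean_loss] * len(test_idx)
        scores.append(rmse([y[j] for j in test_idx], predictions))
    return sum(scores) / len(scores)


# Toy 'loss' values, invented for the example; not taken from the dataset.
toy_loss = [float(v) for v in range(100, 2100, 20)]
print(cross_validated_rmse(toy_loss))
```

A constant-mean baseline like this gives a reference score that any real model on the task should beat.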