Data
bfi_dataset

bfi_dataset

active ARFF Publicly available Visibility: public Uploaded 21-01-2022 by Oleksandr Zadorozhnyi
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Dataset description 25 personality self report items taken from the International Personality Item Pool (ipip.ori.org) were included as part of the Synthetic Aperture Personality Assessment (SAPA) web based personality assessment project. The data from 2800 subjects are included here as a demonstration set for scale construction, factor analysis, and Item Response Theory analysis. Three additional demographic variables (sex, education, and age) are also included.

28 features

A1numeric6 unique values
16 missing
A2numeric6 unique values
27 missing
A3numeric6 unique values
26 missing
A4numeric6 unique values
19 missing
A5numeric6 unique values
16 missing
C1numeric6 unique values
21 missing
C2numeric6 unique values
24 missing
C3numeric6 unique values
20 missing
C4numeric6 unique values
26 missing
C5numeric6 unique values
16 missing
E1numeric6 unique values
23 missing
E2numeric6 unique values
16 missing
E3numeric6 unique values
25 missing
E4numeric6 unique values
9 missing
E5numeric6 unique values
21 missing
N1numeric6 unique values
22 missing
N2numeric6 unique values
21 missing
N3numeric6 unique values
11 missing
N4numeric6 unique values
36 missing
N5numeric6 unique values
29 missing
O1numeric6 unique values
22 missing
O2numeric6 unique values
0 missing
O3numeric6 unique values
28 missing
O4numeric6 unique values
14 missing
O5numeric6 unique values
20 missing
gendernumeric2 unique values
0 missing
educationnumeric5 unique values
223 missing
agenumeric64 unique values
0 missing

19 properties

2800
Number of instances (rows) of the dataset.
28
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
731
Number of missing values in the dataset.
564
Number of instances with at least one value missing.
28
Number of numeric attributes.
0
Number of nominal attributes.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
20.14
Percentage of instances having missing values.
Average class difference between consecutive instances.
0.93
Percentage of missing values.
0.01
Number of attributes divided by the number of instances.
100
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.

0 tasks

Define a new task