QSAR-DATASET-FOR-DRUG-TARGET-CHEMBL1293286

Status: deactivated. Format: ARFF. Visibility: public. Uploaded 14-07-2016.


This dataset contains QSAR data from ChEMBL version 17: activity values (unit: pseudo-pCI50) of compounds measured against the drug target with ChEMBL ID CHEMBL1293286 (TID: 103719). It has 1408 rows and 43 descriptor features; the row identifier molecule_id and the target pXC50 are not counted among those 43. The features are basic molecular descriptors generated from SMILES strings. Missing values were imputed using the median.
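As a minimal sketch of the preparation described above, the following shows median imputation followed by separating the descriptors from molecule_id and pXC50. The helper names `impute_median` and `split_features_target` are hypothetical, the toy values are made up rather than taken from the dataset, and the per-column median is an assumption, since the description does not say how the median was computed.

```python
from statistics import median

def impute_median(rows, columns):
    """Replace None in each named column with the median of its observed values."""
    for col in columns:
        observed = [r[col] for r in rows if r[col] is not None]
        fill = median(observed)
        for r in rows:
            if r[col] is None:
                r[col] = fill
    return rows

def split_features_target(rows, id_col="molecule_id", target_col="pXC50"):
    """Drop the row identifier and return (descriptor dicts, target values)."""
    X = [{k: v for k, v in r.items() if k not in (id_col, target_col)} for r in rows]
    y = [r[target_col] for r in rows]
    return X, y

# Toy rows with made-up values; only the column names come from the page.
rows = [
    {"molecule_id": "m1", "pXC50": 5.0, "MW": 300.0, "nC": 20},
    {"molecule_id": "m2", "pXC50": 6.5, "MW": None, "nC": 22},
    {"molecule_id": "m3", "pXC50": 4.2, "MW": 410.0, "nC": 25},
]
rows = impute_median(rows, ["MW", "nC"])
X, y = split_features_target(rows)
```

Keeping molecule_id out of the feature matrix matters here because it is unique per row and would otherwise let a model memorise rows.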

45 features

pXC50 (target): numeric, 269 unique values, 0 missing
nF: numeric, 8 unique values, 0 missing
nDB: numeric, 10 unique values, 0 missing
nH: numeric, 37 unique values, 0 missing
nHet: numeric, 17 unique values, 0 missing
nHM: numeric, 8 unique values, 0 missing
nI: numeric, 1 unique value, 0 missing
nN: numeric, 10 unique values, 0 missing
nO: numeric, 9 unique values, 0 missing
nP: numeric, 3 unique values, 0 missing
nS: numeric, 6 unique values, 0 missing
nSK: numeric, 35 unique values, 0 missing
nTB: numeric, 4 unique values, 0 missing
nX: numeric, 9 unique values, 0 missing
O.: numeric, 132 unique values, 0 missing
RBF: numeric, 163 unique values, 0 missing
RBN: numeric, 14 unique values, 0 missing
SCBO: numeric, 89 unique values, 0 missing
Se: numeric, 1235 unique values, 0 missing
Si: numeric, 1232 unique values, 0 missing
Sp: numeric, 1152 unique values, 0 missing
Sv: numeric, 1242 unique values, 0 missing
X.: numeric, 73 unique values, 0 missing
nAT: numeric, 61 unique values, 0 missing
AMW: numeric, 1051 unique values, 0 missing
C.: numeric, 186 unique values, 0 missing
H.: numeric, 225 unique values, 0 missing
Me: numeric, 102 unique values, 0 missing
Mi: numeric, 82 unique values, 0 missing
Mp: numeric, 194 unique values, 0 missing
Mv: numeric, 207 unique values, 0 missing
MW: numeric, 1217 unique values, 0 missing
N.: numeric, 144 unique values, 0 missing
nAB: numeric, 20 unique values, 0 missing
molecule_id (row identifier): nominal, 1408 unique values, 0 missing
nB: numeric, 1 unique value, 0 missing
nBM: numeric, 31 unique values, 0 missing
nBO: numeric, 40 unique values, 0 missing
nBR: numeric, 3 unique values, 0 missing
nBT: numeric, 64 unique values, 0 missing
nC: numeric, 30 unique values, 0 missing
nCL: numeric, 7 unique values, 0 missing
nCsp: numeric, 5 unique values, 0 missing
nCsp2: numeric, 26 unique values, 0 missing
nCsp3: numeric, 21 unique values, 0 missing
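
Two of the descriptors listed above, nI and nB, have a single unique value, so they carry no information for a regression model. A minimal sketch of how such constant columns could be detected before modelling follows; the helper name `constant_columns` is hypothetical and the toy values are made up.

```python
def constant_columns(rows, columns):
    """Return the columns whose values are identical across all rows."""
    return [c for c in columns if len({r[c] for r in rows}) <= 1]

# Toy rows with made-up values; only the column names come from the feature list.
toy = [
    {"nI": 0, "nB": 0, "nC": 20},
    {"nI": 0, "nB": 0, "nC": 22},
]
dropped = constant_columns(toy, ["nI", "nB", "nC"])
kept = [c for c in ["nI", "nB", "nC"] if c not in dropped]
```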

62 properties

12 tasks

0 runs - estimation_procedure: Custom 10-fold Crossvalidation - target_feature: pXC50
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
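
For the 10-fold cross-validation task on pXC50, the sketch below shows one way to partition the 1408 rows into ten folds. The page only says the procedure is "Custom", so the shuffled round-robin split, the seed, and the helper name `kfold_indices` are assumptions and not the task's actual splits.

```python
import random

def kfold_indices(n_rows, k=10, seed=0):
    """Shuffle row indices and deal them round-robin into k disjoint folds."""
    idx = list(range(n_rows))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

folds = kfold_indices(1408, k=10)
sizes = sorted(len(f) for f in folds)

# Each fold serves once as the test set; the remaining nine form the training set.
test_fold = folds[0]
train_rows = [i for f in folds[1:] for i in f]
```

With 1408 rows, eight folds hold 141 rows and two hold 140.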