Data
Quora_Insincere_Questions_2018

Quora_Insincere_Questions_2018

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context It's the preprocessed train data from Quora Insincere Questions competition 2018 The original train data is preprocessed to remove stop words, numbers, punctuations, common words and converted to lower case. The resultant data set is lemmatised and stemmed with scikit-learn/NLTK library. Content It contains approximately 1.3 million rows of quora questions with target =0 for sincere questions and target=1 for insincere questions. Acknowledgements Thanks for Co-learning lounge mentors to help me to work on this problem Inspiration It's very handy to build the ML models in NLP.

4 features

Unnamed:_0numeric1306122 unique values
0 missing
qidstring1306122 unique values
0 missing
question_textstring1304660 unique values
1 missing
targetnumeric2 unique values
0 missing

19 properties

1306122
Number of instances (rows) of the dataset.
4
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
1
Number of missing values in the dataset.
1
Number of instances with at least one value missing.
2
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
50
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
Average class difference between consecutive instances.
0
Percentage of missing values.

0 tasks

Define a new task