Data
HotpotQA_distractor

HotpotQA_distractor

active ARFF Publicly available Visibility: public Uploaded 15-06-2023 by Debayan Banerjee
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
HotpotQA is a new dataset with 113k Wikipedia-based question-answer pairs with four key features: (1) the questions require finding and reasoning over multiple supporting documents to answer; (2) the questions are diverse and not constrained to any pre-existing knowledge bases or knowledge schemas; (3) we provide sentence-level supporting facts required for reasoning, allowingQA systems to reason with strong supervision and explain the predictions; (4) we offer a new type of factoid comparison questions to test QA systems' ability to extract relevant facts and perform necessary comparison. The dataset is taken from https://huggingface.co/datasets/hotpot_qa and this upload is the 'distractor' subset.

7 features

idstring97852 unique values
0 missing
questionstring97845 unique values
0 missing
answerstring57258 unique values
0 missing
typestring2 unique values
0 missing
levelstring3 unique values
0 missing
supporting_factsstring94135 unique values
0 missing
contextstring97852 unique values
0 missing

19 properties

97852
Number of instances (rows) of the dataset.
7
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
0
Number of numeric attributes.
0
Number of nominal attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
Average class difference between consecutive instances.
0
Percentage of missing values.
0
Number of attributes divided by the number of instances.
0
Percentage of numeric attributes.

0 tasks

Define a new task