SQuAD-explorer | Visually Explore the Stanford Question Answering Dataset | Dataset library
kandi X-RAY | SQuAD-explorer Summary
kandi X-RAY | SQuAD-explorer Summary
Visually Explore the Stanford Question Answering Dataset
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
- remove function arguments
SQuAD-explorer Key Features
SQuAD-explorer Examples and Code Snippets
Community Discussions
Trending Discussions on SQuAD-explorer
QUESTION
I'm running a fine-tuned model of BERT and ALBERT for Questing Answering. And, I'm evaluating the performance of these models on a subset of questions from SQuAD v2.0. I use SQuAD's official evaluation script for evaluation.
I use Huggingface transformers
and in the following you can find an actual code and example I'm running (might be also helpful for some folks who are trying to run fine-tuned model of ALBERT on SQuAD v2.0):
ANSWER
Answered 2020-Feb-13 at 21:41You should simply treat them as invalid because you try to predict a proper answer span from the variable text
. Everything else should be invalid. This is also the way how huggingface treats this predictions:
We could hypothetically create invalid predictions, e.g., predict that the start of the span is in the question. We throw out all invalid predictions.
You should also note that they use a more sopisticated method to get the predictions for each question (don't ask me why they show torch.argmax in their example). Please have a look at the example below:
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install SQuAD-explorer
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page