G. Thomas Hudson
Ask me in your own words: paraphrasing for multitask question answering
Hudson, G. Thomas; Al Moubayed, Noura
Abstract
Multitask learning has led to significant advances in Natural Language Processing, including the decaNLP benchmark where question answering is used to frame 10 natural language understanding tasks in a single model. In this work we show how models trained to solve decaNLP fail with simple paraphrasing of the question. We contribute a crowd-sourced corpus of paraphrased questions (PQ-decaNLP), annotated with paraphrase phenomena. This enables analysis of how transformations such as swapping the class labels and changing the sentence modality lead to a large performance degradation. Training both MQAN and the newer T5 model using PQ-decaNLP improves their robustness and for some tasks improves the performance on the original questions, demonstrating the benefits of a model which is more robust to paraphrasing. Additionally, we explore how paraphrasing knowledge is transferred between tasks, with the aim of exploiting the multitask property to improve the robustness of the models. We explore the addition of paraphrase detection and paraphrase generation tasks, and find that while both models are able to learn these new tasks, knowledge about paraphrasing does not transfer to other decaNLP tasks.
Citation
Hudson, G. T., & Al Moubayed, N. (2021). Ask me in your own words: paraphrasing for multitask question answering. PeerJ Computer Science, 7, Article e759. https://doi.org/10.7717/peerj-cs.759
Journal Article Type | Article |
---|---|
Acceptance Date | Oct 3, 2021 |
Online Publication Date | Oct 27, 2021 |
Publication Date | 2021 |
Deposit Date | Oct 27, 2021 |
Publicly Available Date | Dec 15, 2021 |
Journal | PeerJ Computer Science |
Electronic ISSN | 2376-5992 |
Publisher | PeerJ |
Peer Reviewed | Peer Reviewed |
Volume | 7 |
Article Number | e759 |
DOI | https://doi.org/10.7717/peerj-cs.759 |
Public URL | https://durham-repository.worktribe.com/output/1228194 |
Files
Published Journal Article
(1.3 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
Copyright Statement
2021 Hudson and Al Moubayed
Distributed under
Creative Commons CC-BY 4.0
You might also like
Explainable text-tabular models for predicting mortality risk in companion animals
(2024)
Journal Article
Downloadable Citations
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search