MuLD: The Multitask Long Document Benchmark

Hudson, G Thomas; Al Moubayed, Noura

All Outputs (2)

Length is a Curse and a Blessing for Document-level Semantics (2023)
Presentation / Conference Contribution
Xiao, C., Li, Y., Hudson, G. T., Lin, C., & Al Moubayed, N. (2023, December). Length is a Curse and a Blessing for Document-level Semantics. Presented at The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), Singapore

In recent years, contrastive learning (CL) has been extensively utilized to recover sentence and document-level encoding capability from pre-trained language models. In this work, we question the length generalizability of CL-based models, i.e., thei... Read More about Length is a Curse and a Blessing for Document-level Semantics.

MuLD: The Multitask Long Document Benchmark (2022)
Presentation / Conference Contribution
Hudson, G. T., & Al Moubayed, N. (2022, June). MuLD: The Multitask Long Document Benchmark. Presented at 13th Conference on Language Resources and Evaluation (LREC 2022), Marseille, France

The impressive progress in NLP techniques has been driven by the development of multi-task benchmarks such as GLUE and SuperGLUE. While these benchmarks focus on tasks for one or two input sentences, there has been exciting work in designing efficien... Read More about MuLD: The Multitask Long Document Benchmark.