Improving the Domain Adaptation of Retrieval Augmented Generation (RAG) Models for Open Domain Question Answering.

Shamane Siriwardhana; Rivindu Weerasekera; Tharindu Kaluarachchi; Elliott Wen; Rajib Rana; Suranga Nanayakkara

Vol. 11 (2023)

TACL approved

Improving the Domain Adaptation of Retrieval Augmented Generation (RAG) Models for Open Domain Question Answering.

Published 2023-01-12

Shamane Siriwardhana
Rivindu Weerasekera
Tharindu Kaluarachchi
Elliott Wen
Rajib Rana
Suranga Nanayakkara

Shamane Siriwardhana
The University of Auckland

Rivindu Weerasekera
The University of Auckland

Tharindu Kaluarachchi
The University of Auckland

Elliott Wen
The University of Auckland

Rajib Rana
University of Southern Queensland

Suranga Nanayakkara
The University of Auckland

Abstract

Retrieval Augment Generation (RAG) is a recent advancement in Open-Domain Question Answering (ODQA). RAG has only been trained and explored with a Wikipedia-based external knowledge base and is not optimized for use in other specialized domains such as healthcare and news. In this paper, we evaluate the impact of joint training of the retriever and generator components of RAG for the task of domain adaptation in ODQA. We propose \textit{RAG-end2end}, an extension to RAG, that can adapt to a domain-specific knowledge base by updating all components of the external knowledge base during training. In addition, we introduce an auxiliary training signal to inject more domain-specific knowledge. This auxiliary signal forces \textit{RAG-end2end} to reconstruct a given sentence by accessing the relevant information from the external knowledge base. Our novel contribution is unlike RAG, RAG-end2end does joint training of the retriever and generator for the end QA task and domain adaptation. We evaluate our approach with datasets from three domains: COVID-19, News, and Conversations, and achieve significant performance improvements compared to the original RAG model. Our work has been open-sourced through the Huggingface Transformers library, attesting to our work's credibility and technical consistency.

Presented at EMNLP 2022 Article at MIT Press

Author Biography

Shamane Siriwardhana

PhD candidate