Conditional Generation with a Question-Answering Blueprint

Shashi Narayan; Joshua Maynez; Reinald Kim Amplayo; Kuzman Ganchev; Annie Louis; Fantine Huot; Anders Sandholm; Dipanjan Das; Mirella Lapata

Vol. 11 (2023)

TACL approved

Conditional Generation with a Question-Answering Blueprint

Published 2023-08-17

Shashi Narayan
Joshua Maynez
Reinald Kim Amplayo
Kuzman Ganchev
Annie Louis
Fantine Huot
Anders Sandholm
Dipanjan Das
Mirella Lapata

Shashi Narayan
Google Research

Joshua Maynez
Google Research

Reinald Kim Amplayo
Google Research

Kuzman Ganchev
Google Research

Annie Louis
Google Research

Fantine Huot
Google Research

Anders Sandholm
Google Research

Dipanjan Das
Google Research

Mirella Lapata
Google Research

Abstract

The ability to convey relevant and faithful information is critical for many tasks in conditional generation and yet remains elusive for neural seq-to-seq models whose outputs often reveal hallucinations and fail to correctly cover important details. In this work, we advocate planning as a useful intermediate representation for rendering conditional generation less opaque and more grounded. We propose a new conceptualization of text plans as a sequence of question-answer (QA) pairs and enhance existing datasets (e.g., for summarization) with a QA blueprint operating as a proxy for content selection (i.e., what to say) and planning (i.e., in what order). We obtain blueprints automatically by exploiting state-of-the-art question generation technology and convert input-output pairs into input-blueprint-output tuples. We develop Transformer-based models, each varying in how they incorporate the blueprint in the generated output (e.g., as a global plan or iteratively). Evaluation across metrics and datasets demonstrates that blueprint models are more factual than alternatives which do not resort to planning and allow tighter control of the generation output.

Article at MIT Press Presented at ACL 2023

Author Biography

Shashi Narayan

Research Scientist, Google London