Transformers for Tabular Data Representation: A Survey of Models and Applications

Gilbert Badaro; Paolo Papotti; Mohammed Saeed

Vol. 11 (2023)

TACL approved

Transformers for Tabular Data Representation: A Survey of Models and Applications

Published 2023-03-14

Gilbert Badaro
Paolo Papotti
Mohammed Saeed

Gilbert Badaro
EURECOM

Paolo Papotti
EURECOM

Mohammed Saeed
EURECOM

Abstract

In the last few years, the natural language processing community has witnessed advances in neural representations of free texts with transformer-based language models (LMs). Given the importance of knowledge available in tabular data, recent research efforts extend LMs by developing neural representations for structured data. In this work, we present a survey that analyzes these efforts. We first abstract the different systems according to a traditional machine learning pipeline in terms of training data, input representation, model training, and supported downstream tasks. For each aspect, we characterize and compare the proposed solutions. Finally, we discuss future work directions.

Article at MIT Press