Federated Learning for Exploiting Annotators’ Disagreements in Natural Language Processing

Nuria Rodríguez-Barroso; Eugenio Martínez-Cámara; M. Victoria Luzón; Jose Camacho-Collados; Francisco Herrera

Vol. 12 (2024)

TACL approved

Federated Learning for Exploiting Annotators’ Disagreements in Natural Language Processing

Published 2024-05-25

Nuria Rodríguez-Barroso
Eugenio Martínez-Cámara
M. Victoria Luzón
Jose Camacho-Collados
Francisco Herrera

Nuria Rodríguez-Barroso
Department of Computer Science and Artificial Intelligence, Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI), University of Granada

Eugenio Martínez-Cámara
Department of Computer Science, Advanced Studies Center in Information and Communication Technologies (CEATIC), Universidad de Jaen

M. Victoria Luzón
Department of Software Engineering, Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI), University of Granada

Jose Camacho-Collados
Cardiff NLP, School of Computer Science and Informatics, Cardiff University

Francisco Herrera
Department of Computer Science and Artificial Intelligence, Andalusian Research Institute in Data Science and Computational Intelligence (DaSCI), University of Granada

Abstract

The annotation of ambiguous or subjective NLP tasks is usually addressed by various annotators. In most datasets, these annotations are aggregated into a single ground truth. However, this omits divergent opinions of annotators, hence missing individual perspectives. We propose FLEAD, a methodology built upon federated learning to independently learn from the opinions of all the annotators, thereby leveraging all their underlying information without relying on a single ground truth. We conduct an extensive experimental study and analysis in diverse text classification tasks to show the contribution of our approach with respect to mainstream approaches based on majority voting and other recent methodologies that also learn from annotator disagreements.

Presented at NAACL 2024 Article at MIT Press