Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design

Valentina Pyatkin; Frances Yung; Merel C.J. Scholman; Ido Dagan; Reut Tsarfaty; Vera Demberg

Vol. 11 (2023)

TACL approved

Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design

Published 2023-08-15

Valentina Pyatkin
Frances Yung
Merel C.J. Scholman
Ido Dagan
Reut Tsarfaty
Vera Demberg

Valentina Pyatkin
Bar-Ilan University

Frances Yung
Saarland University

Merel C.J. Scholman
Saarland University

Ido Dagan
Bar-Ilan University

Reut Tsarfaty
Bar-Ilan University

Vera Demberg
Saarland University

Abstract

Disagreement in natural language annotation has mostly been studied from a perspective of biases introduced by the annotators and the annotation frameworks. Here, we propose to analyze another source of bias: task design bias, which has a particularly strong impact on crowdsourced linguistic annotations where natural language is used to elicit the interpretation of laymen annotators.
For this purpose we look at implicit discourse relation annotation, a task that has repeatedly been shown to be difficult due to the relations' ambiguity. We compare the annotations of 1,200 discourse relations obtained using two distinct annotation tasks and quantify the biases of both methods across four different domains. Both methods are natural language annotation tasks designed for crowdsourcing.

Article at MIT Press Presented at ACL 2023