Logo des Repositoriums
  • English
  • Deutsch
Anmelden
Keine TU-ID? Klicken Sie hier für mehr Informationen.
  1. Startseite
  2. Publikationen
  3. Publikationen der Technischen Universität Darmstadt
  4. Zweitveröffentlichungen (aus DeepGreen)
  5. Stance Detection Benchmark: How Robust is Your Stance Detection?
 
  • Details
2021
Zweitveröffentlichung
Artikel
Verlagsversion

Stance Detection Benchmark: How Robust is Your Stance Detection?

File(s)
Download
Hauptpublikation
s13218-021-00714-w.pdf
CC BY 4.0 International
Format: Adobe PDF
Size: 1.25 MB
TUDa URI
tuda/10223
URN
urn:nbn:de:tuda-tuprints-235062
DOI
10.26083/tuprints-00023506
Autor:innen
Schiller, Benjamin ORCID 0000-0002-1175-4727
Daxenberger, Johannes ORCID 0000-0002-7385-5654
Gurevych, Iryna ORCID 0000-0003-2187-7621
Kurzbeschreibung (Abstract)

Stance detection (StD) aims to detect an author’s stance towards a certain topic and has become a key component in applications like fake news detection, claim validation, or argument search. However, while stance is easily detected by humans, machine learning (ML) models are clearly falling short of this task. Given the major differences in dataset sizes and framing of StD (e.g. number of classes and inputs), ML models trained on a single dataset usually generalize poorly to other domains. Hence, we introduce a StD benchmark that allows to compare ML models against a wide variety of heterogeneous StD datasets to evaluate them for generalizability and robustness. Moreover, the framework is designed for easy integration of new datasets and probing methods for robustness. Amongst several baseline models, we define a model that learns from all ten StD datasets of various domains in a multi-dataset learning (MDL) setting and present new state-of-the-art results on five of the datasets. Yet, the models still perform well below human capabilities and even simple perturbations of the original test samples (adversarial attacks) severely hurt the performance of MDL models. Deeper investigation suggests overfitting on dataset biases as the main reason for the decreased robustness. Our analysis emphasizes the need of focus on robustness and de-biasing strategies in multi-task learning approaches. To foster research on this important topic, we release the dataset splits, code, and fine-tuned weights.

Freie Schlagworte

Stance detection

Robustness

Multi-dataset learnin...

Sprache
Englisch
Fachbereich/-gebiet
20 Fachbereich Informatik > Ubiquitäre Wissensverarbeitung
DDC
000 Allgemeines, Informatik, Informationswissenschaft > 004 Informatik
Institution
Universitäts- und Landesbibliothek Darmstadt
Ort
Darmstadt
Titel der Zeitschrift / Schriftenreihe
KI - Künstliche Intelligenz : German Journal of Artificial Intelligence
Startseite
329
Endseite
341
Jahrgang der Zeitschrift
35
Heftnummer der Zeitschrift
3-4
ISSN
1610-1987
Verlag
Springer
Ort der Erstveröffentlichung
Berlin
Publikationsjahr der Erstveröffentlichung
2021
Verlags-DOI
10.1007/s13218-021-00714-w
PPN
516764616
Zusätzliche Infomationen
NLP and Semantics

  • TUprints Leitlinien
  • Cookie-Einstellungen
  • Impressum
  • Datenschutzbestimmungen
  • Webseitenanalyse
Diese Webseite wird von der Universitäts- und Landesbibliothek Darmstadt (ULB) betrieben.