ActiveGLAE: A Benchmark for Deep Active Learning with Transformers

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Rauch, Lukas; Aßenmacher, Matthias ORCID: https://orcid.org/0000-0003-2154-5774; Huseljic, Denis; Wirth, Moritz; Bischl, Bernd ORCID: https://orcid.org/0000-0001-6002-6980 und Sick, Bernhard (2023): ActiveGLAE: A Benchmark for Deep Active Learning with Transformers. European Conference, ECML PKDD 2023, Turin, Italy, September 18–22, 2023. Koutra, Danai ORCID: https://orcid.org/0000-0002-3206-8179; Plant, Claudia ORCID: https://orcid.org/0000-0001-5274-8123; Gomez Rodriguez, Manuel ORCID: https://orcid.org/0000-0003-3930-1161; Baralis, Elena ORCID: https://orcid.org/0000-0001-9231-467X und Bonchi, Francesco ORCID: https://orcid.org/0000-0001-9464-8315 (Hrsg.): In: Machine Learning and Knowledge Discovery in Databases: Research Track : European Conference, ECML PKDD 2023, Turin, Italy, September 18–22, 2023, Proceedings, Part I, Bd. 14169 Cham: Springer. S. 55-74

Volltext auf 'Open Access LMU' nicht verfügbar.

DOI: 10.1007/978-3-031-43412-9_4

Abstract

Deep active learning (DAL) seeks to reduce annotation costs by enabling the model to actively query instance annotations from which it expects to learn the most. Despite extensive research, there is currently no standardized evaluation protocol for transformer-based language models in the field of DAL. Diverse experimental settings lead to difficulties in comparing research and deriving recommendations for practitioners. To tackle this challenge, we propose the ActiveGLAE benchmark, a comprehensive collection of data sets and evaluation guidelines for assessing DAL. Our benchmark aims to facilitate and streamline the evaluation process of novel DAL strategies. Additionally, we provide an extensive overview of current practice in DAL with transformer-based language models. We identify three key challenges - data set selection, model training, and DAL settings - that pose difficulties in comparing query strategies. We establish baseline results through an extensive set of experiments as a reference point for evaluating future work. Based on our findings, we provide guidelines for researchers and practitioners.

Dokumententyp:	Konferenzbeitrag (Paper)
Fakultät:	Mathematik, Informatik und Statistik > Statistik
Themengebiete:	300 Sozialwissenschaften > 310 Statistiken 500 Naturwissenschaften und Mathematik > 510 Mathematik
ISBN:	978-3-031-43411-2 ; 978-3-031-43412-9
Ort:	Cham
Bemerkung:	Teil von: Lecture Notes in Artificial Intelligence (LNAI) ; (LNCS, volume 14169)
Sprache:	Englisch
Dokumenten ID:	121958
Datum der Veröffentlichung auf Open Access LMU:	04. Nov. 2024 14:02
Letzte Änderungen:	04. Nov. 2024 14:02

Dokument bearbeiten