LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Zhao, Mengjie; Mi, Fei; Wang, Yasheng; Li, Minglei; Jiang, Xin; Liu, Qun und Schütze, Hinrich (Juli 2022): LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework. NAACL 2022, Seattle, United States, July 2022. Carpuat,, Marine; de Marneffe, Marie-Catherine und Meza Ruiz, Ivan Vladimir (Hrsg.): In: Findings of the Association for Computational Linguistics: NAACL 2022, Stroudsburg, PA: Association for Computational Linguistics (ACL). S. 675-692 [PDF, 2MB]

[thumbnail of 2022.findings-naacl.51.pdf]

Vorschau

Creative Commons: Namensnennung 4.0 (CC-BY)

Veröffentlichte Version

DOI: 10.18653/v1/2022.findings-naacl.51

Abstract

Vast efforts have been devoted to creating high-performance few-shot learners, i.e., large-scale pretrained language models (PLMs) that perform well with little downstream task training data. Training PLMs has incurred significant cost, but utilizing the few-shot learners is still challenging due to their enormous size. This work focuses on a crucial question: How to make effective use of these few-shot learners? We propose LMTurk, a novel approach that treats few-shotlearners as crowdsourcing workers. The rationale is that crowdsourcing workers are in fact few-shot learners: They are shown a few illustrative examples to learn about a task and then start annotating. LMTurk employs few-shot learners built upon PLMs as workers. We show that the resulting annotations can be utilized to train models that solve the task well and are small enough to be deployable in practical scenarios. Active learning is integrated into LMTurk to reduce the amount of queries made to PLMs, minimizing the computational cost of running PLM inference passes. Altogether, LMTurk is an important step towards making effective use of current PLMs.

Dokumententyp:	Konferenzbeitrag (Paper)
EU Funded Grant Agreement Number:	740516
EU-Projekte:	Horizon 2020 > ERC Grants > ERC Advanced Grant > ERC Grant 740516: NonSequeToR - Non-sequence models for tokenization replacement
Fakultätsübergreifende Einrichtungen:	Centrum für Informations- und Sprachverarbeitung (CIS)
Themengebiete:	000 Informatik, Informationswissenschaft, allgemeine Werke > 000 Informatik, Wissen, Systeme 400 Sprache > 400 Sprache 400 Sprache > 410 Linguistik
URN:	urn:nbn:de:bvb:19-epub-107422-3
Ort:	Stroudsburg, PA
Bemerkung:	ISBN 978-1-955917-76-6
Sprache:	Englisch
Dokumenten ID:	107422
Datum der Veröffentlichung auf Open Access LMU:	20. Okt. 2023 06:01
Letzte Änderungen:	20. Okt. 2023 06:05

Dokument bearbeiten