
Abstract
Few-shot crosslingual transfer has been shown to outperform its zero-shot counterpart with pretrained encoders such as multilingual BERT. Despite its growing popularity, little attention has been paid to standardizing and analyzing the design of few-shot experiments. In this work, we highlight a fundamental risk posed by this shortcoming: model performance is highly sensitive to the selection of the few shots. We conduct a large-scale experimental study on 40 sets of sampled few shots for six diverse NLP tasks across up to 40 languages. We analyze the success and failure cases of few-shot transfer, highlighting the role of lexical features. Additionally, we show that straightforward full-model finetuning is quite effective for few-shot transfer, outperforming several state-of-the-art few-shot approaches. As a step toward standardizing few-shot crosslingual experimental designs, we make our sampled few shots publicly available.
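
The "straightforward full-model finetuning" baseline the abstract refers to can be illustrated with a short sketch. This is a minimal example assuming Hugging Face `transformers` and a binary sentence-classification task; the model name (multilingual BERT) is real, but the example texts, labels, and hyperparameters are hypothetical placeholders, not the paper's actual configuration.

```python
# Minimal sketch of few-shot crosslingual transfer via full-model finetuning.
# Requires `torch` and `transformers`; all data and hyperparameters below are
# illustrative placeholders, not the paper's experimental setup.
import torch
from torch.optim import AdamW
from transformers import AutoTokenizer, AutoModelForSequenceClassification

model_name = "bert-base-multilingual-cased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=2)

# Stage 1 (not shown): finetune on the full source-language training set,
# typically English. Stage 2: continue finetuning the entire model on a
# handful of target-language examples ("shots").
few_shots = [
    ("Der Film war großartig.", 1),   # hypothetical German shots
    ("Eine völlige Enttäuschung.", 0),
]

model.train()
optimizer = AdamW(model.parameters(), lr=2e-5)
for epoch in range(10):                # small budget; few shots overfit quickly
    for text, label in few_shots:
        batch = tokenizer(text, return_tensors="pt", truncation=True)
        loss = model(**batch, labels=torch.tensor([label])).loss
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()
```

In the study's setting, this target-language step would be repeated over many independently sampled few-shot sets (40 in the paper) to measure how strongly performance varies with the choice of shots.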
| Item Type: | Conference or Workshop Item (Paper) |
|---|---|
| EU Funded Grant Agreement Number: | 740516 |
| EU Projects: | Horizon 2020 > ERC Grants > ERC Advanced Grant > ERC Grant 740516: NonSequeToR - Non-sequence models for tokenization replacement |
| Research Centers: | Center for Information and Language Processing (CIS) |
| Subjects: | 000 Computer science, information and general works > 000 Computer science, knowledge, and systems; 400 Language > 400 Language; 400 Language > 410 Linguistics |
| URN: | urn:nbn:de:bvb:19-epub-92188-7 |
| Language: | English |
| Item ID: | 92188 |
| Date Deposited: | 27 May 2022, 08:40 |
| Last Modified: | 27 May 2022, 08:51 |