Position Information in Transformers: An Overview

Dufter, Philipp; Schmitt, Martin and Schütze, Hinrich (September 2022): Position Information in Transformers: An Overview. In: Computational Linguistics, Vol. 48, No. 3: pp. 733-763

Creative Commons: Attribution 4.0 (CC-BY)

DOI: 10.1162/coli_a_00445

External full text: https://aclanthology.org/2022.cl-3.7

Abstract

Transformers are arguably the main workhorse in recent natural language processing research. By definition, a Transformer is invariant with respect to reordering of the input. However, language is inherently sequential and word order is essential to the semantics and syntax of an utterance. In this article, we provide an overview and theoretical comparison of existing methods to incorporate position information into Transformer models. The objectives of this survey are to (1) showcase that position information in Transformer is a vibrant and extensive research area; (2) enable the reader to compare existing methods by providing a unified notation and systematization of different approaches along important model dimensions; (3) indicate what characteristics of an application should be taken into account when selecting a position encoding; and (4) provide stimuli for future research.
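
The abstract's claim about reordering can be checked on a toy example. The following is a minimal sketch, not code from the article: it assumes a single attention head with random weights, no position encoding, and hypothetical helper names (`matmul`, `softmax`, `self_attention`, `mean_pool`). Under these assumptions, permuting the input rows only permutes the output rows, so any order-independent summary such as a mean over positions is unchanged.

```python
import math
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention without position information."""
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    scale = math.sqrt(len(q[0]))
    scores = [[sum(a * b for a, b in zip(qi, kj)) / scale for kj in k] for qi in q]
    weights = [softmax(row) for row in scores]
    return matmul(weights, v)

def mean_pool(rows):
    """Average the output vectors over all positions."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

# Toy setup (all values are illustrative assumptions): 5 tokens, dimension 4.
rng = random.Random(0)
d = 4
x = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(5)]
w_q, w_k, w_v = ([[rng.gauss(0, 1) for _ in range(d)] for _ in range(d)] for _ in range(3))

# Reorder the input tokens: new position j holds old token perm[j].
perm = [3, 0, 4, 1, 2]
x_perm = [x[i] for i in perm]

out = self_attention(x, w_q, w_k, w_v)
out_perm = self_attention(x_perm, w_q, w_k, w_v)

# Each output row follows its token to the new position (up to rounding error).
rows_match = all(
    abs(a - b) < 1e-9
    for j, i in enumerate(perm)
    for a, b in zip(out_perm[j], out[i])
)
print("outputs reordered exactly like inputs:", rows_match)
```

Adding any position-dependent term to `x` before calling `self_attention` would break this property, which is the role the position encodings surveyed in the article play.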

Document type:	Journal article
EU Funded Grant Agreement Number:	740516
EU projects:	Horizon 2020 > ERC Grants > ERC Advanced Grant > ERC Grant 740516: NonSequeToR - Non-sequence models for tokenization replacement
Form of publication:	Publisher's Version
Cross-faculty institutions:	Centrum für Informations- und Sprachverarbeitung (CIS)
Subjects:	400 Language > 400 Language; 400 Language > 410 Linguistics
URN:	urn:nbn:de:bvb:19-epub-107439-1
Language:	English
Document ID:	107439
Date published on Open Access LMU:	20 Oct. 2023 07:52
Last modified:	20 Oct. 2023 07:52
