ORCID: https://orcid.org/0000-0002-3549-2012; Lentini, Giulia und Tushingham, Poppy
(2024):
Towards a word similarity gold standard for Akkadian: creation and model optimization.
In: it - Information Technology, Bd. 66, Nr. 1: S. 4-14
Abstract
We present a word similarity gold standard for Akkadian, a language documented in ancient Mesopotamian sources from the 24th century BCE until the first century CE. The gold standard comprises 300 word pairs ranked by their paradigmatic similarity by five independently working Assyriologists. We use the gold standard to tune PMI + SVD and fastText models to improve their performance. We also present a hyper-parametrized PMI + SVD model for building count-based word embeddings, that aims to deal with the data sparsity and repetition issues encountered in Akkadian texts. Our model combines Dirichlet smoothing with context distribution smoothing, and uses context similarity weighting to down-sample distortion caused by formulaic litanies and partially or fully duplicated passages.
Dokumententyp: | Zeitschriftenartikel |
---|---|
Keywords: | Ancient Near East; digital humanities; computational Assyriology; word embeddings; distributional semantics; machine learning; Akkadian |
Fakultät: | Geschichts- und Kunstwissenschaften > Historisches Seminar > Alte Geschichte |
Themengebiete: | 400 Sprache > 490 Andere Sprachen
900 Geschichte und Geografie > 930 Geschichte des Altertums (bis ca. 499), Archäologie |
ISSN: | 2196-7032 ; 1611-2776 |
Sprache: | Englisch |
Dokumenten ID: | 118360 |
Datum der Veröffentlichung auf Open Access LMU: | 10. Jul. 2025 06:52 |
Letzte Änderungen: | 10. Jul. 2025 06:52 |