Logo Logo
Help
Contact
Switch Language to German

Senel, Lutfi Kerem; Sahinuc, Furkan; Yuecesoy, Veysel; Schuetze, Hinrich; Cukur, Tolga and Koc, Aykut (2022): Learning interpretable word embeddings via bidirectional alignment of dimensions with semantic concepts. In: Information Processing & Management, Vol. 59, No. 3, 102925

Full text not available from 'Open Access LMU'.

Abstract

We proposebidirectional impartingorBiImp, a generalized method for aligning embeddingdimensions with concepts during the embedding learning phase. While preserving the semanticstructure of the embedding space, BiImp makes dimensions interpretable, which has a criticalrole in deciphering the black-box behavior of word embeddings. BiImp separately utilizes bothdirections of a vector space dimension: each direction can be assigned to a different concept.This increases the number of concepts that can be represented in the embedding space. Ourexperimental results demonstrate the interpretability of BiImp embeddings without makingcompromises on the semantic task performance. We also use BiImp to reduce gender biasin word embeddings by encoding gender-opposite concepts (e.g., male-female) in a singleembedding dimension. These results highlight the potential of BiImp in reducing biases andstereotypes present in word embeddings. Furthermore, task or domain-specific interpretableword embeddings can be obtained by adjusting the corresponding word groups in embeddingdimensions according to task or domain. As a result, BiImp offers wide liberty in studying wordembeddings without any further effort

Actions (login required)

View Item View Item