Logo Logo
Hilfe
Hilfe
Switch Language to English

Frei, Johann ORCID logoORCID: https://orcid.org/0000-0003-0323-0904; Frei-Stuber, Ludwig und Kramer, Frank ORCID logoORCID: https://orcid.org/0000-0002-2857-7122 (2023): GERNERMED++: Semantic annotation in German medical NLP through transfer-learning, translation and word alignment. In: Journal of Biomedical Informatics, Bd. 147, 104513 [PDF, 951kB]

Abstract

We present a statistical model, GERNERMED++, for German medical natural language processing trained for named entity recognition (NER) as an open, publicly available model. We demonstrate the effectiveness of combining multiple techniques in order to achieve strong results in entity recognition performance by the means of transfer-learning on pre-trained deep language models (LM), word-alignment and neural machine translation, outperforming a pre-existing baseline model on several datasets. Due to the sparse situation of open, public medical entity recognition models for German texts, this work offers benefits to the German research community on medical NLP as a baseline model. The work serves as a refined successor to our first GERNERMED model. Similar to our previous work, our trained model is publicly available to other researchers.

Dokument bearbeiten Dokument bearbeiten