Logo Logo
Hilfe
Hilfe
Switch Language to English

Reichel, Uwe D. (2013): From segmentation bootstrapping to transcription-to-word conversion. Interspeech, Lyon, 25. - 29. August 2013. Proc. Interspeech. S. 1443-1447 [PDF, 119kB]

[thumbnail of ReichelIS2013b.pdf]
Vorschau
Download (119kB)

Abstract

The mapping of a raw phonetic transcription to an orthographic word sequence is carried out in three steps: First, a syllable segmentation of the transcription is bootstrapped, based on unsupervised subtractive learning. Then, the syllables are grouped to word entities guided by non-linguistic distributional properties. Finally, the phonetic word segmentations are mapped onto entries of a canonic pronunciation dictionary by means of a co-occurrence based aligner. For syllable segmentation accuracies between 89 and 96% are obtained, and for word segmentation accuracies between 92 and 98%. The transcription to word conversion performance amounts 77%.

Dokument bearbeiten Dokument bearbeiten