Schiel, Florian; Stevens, Mary; Reichel, Uwe D.; Cutugno, Francesco (2013): Machine Learning of Probabilistic Phonological Pronunciation Rules from the Italian CLIPS Corpus. Interspeech, 14th Annual Conference of the International Speech Communication Association, 25. - 29. August 2013, Lyon.




A blending of phonological concepts and technical analysis is proposed to yield a better modeling and understanding of phonological processes. Based on the manual segmentation and labeling of the Italian CLIPS corpus we automatically derive a probabilistic set of phonological pronunciation rules: a new alignment technique is used to map the phonological form of spontaneous sentences onto the phonetic surface form. A machine-learning algorithm then calculates a set of phonologi- cal replacement rules together with their conditional probabilities. A critical analysis of the resulting probabilistic rule set is presented and discussed with regard to regional Italian accents. The rule set presented here is also applied in the newly published web-service WebMAUS that allows a user to segment and phonetically label Italian speech via a simple web-interface.