
Pérez-Toro, Paula Andrea (ORCID: https://orcid.org/0000-0002-2727-2116); Klumpp, Philipp (ORCID: https://orcid.org/0000-0002-7531-1693); Vasquez-Correa, Juan Camilo; Schuster, Maria; Nöth, Elmar (ORCID: https://orcid.org/0000-0002-3396-555X); Orozco-Arroyave, Juan Rafael (ORCID: https://orcid.org/0000-0002-8507-0782) and Arias-Vergara, Tomás (ORCID: https://orcid.org/0000-0001-9405-4154) (2022): 50 Shades of Gray: Effect of the Color Scale for the Assessment of Speech Disorders. 25th International Conference Text, Speech, and Dialogue, TSD 2022, Brno, Czech Republic, September 6–9, 2022. In: Text, Speech, and Dialogue. 25th International Conference, TSD 2022, Brno, Czech Republic, September 6–9, 2022, Proceedings, Lecture Notes in Computer Science vol. 13502. Cham, Switzerland: Springer. pp. 352-363

Full text not available on 'Open Access LMU'.

Abstract

Spectrograms provide a visual representation of the time-frequency variations of a speech signal. Furthermore, the color scales can be used as a pre-processing normalization step. In this study, we investigated the suitability of using different color scales for the reconstruction of spectrograms together with bottleneck features extracted from Convolutional AutoEncoders (CAEs). We trained several CAEs considering different parameters such as the number of channels, wideband/narrowband spectrograms, and different color scales. Additionally, we tested the suitability of the proposed CAE architecture for the prediction of the severity of Parkinson’s Disease (PD) and for the nasality level in children with Cleft Lip and Palate (CLP). The results showed that it is possible to estimate the neurological state for PD with Spearman’s correlations of up to 0.71 using the Grayscale, and the nasality level in CLP with F-scores of up to 0.58 using the raw spectrogram. Although the color scales improved performance in some cases, it is not clear which color scale is the most suitable for the selected application, as we did not find significant differences in the results for each color scale.
