Detailed info
Masking Speech Contents by Random Splicing: is Emotional Expression Preserved?
Authors | Burkhardt Felix, Derington Anna, Kahlau Matthias, Scherer Klaus, Eyben Florian, Schuller Björn |
Title | Masking Speech Contents by Random Splicing: is Emotional Expression Preserved? |
Abstract | We discuss the influence of random splicing on the perception of emotional expression in speech signals. Random splicing is the randomized reconstruction of short audio snippets with the aim to obfuscate the speech contents. A part of the German parliament recordings has been random spliced and both versions – the original and the scrambled ones – manually labeled with respect to the arousal, valence and dominance dimensions. Additionally, we run a state-of-the-art transformer-based pre-trained emotional model on the data. We find sufficiently high correlation for the annotations and predictions of emotional dimensions between both sample versions to be confident that machine learners can be trained with random spliced data. |
Conference | ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Date | 04-10 June 2023 |
Location | Rhodes Island, Greece |
Year of Publication | 2023 |
Publisher | IEEE |
Url | https://doi.org/10.5281/zenodo.10664711 |
DOI | 10.1109/ICASSP49357.2023.10097094 |
Funding
This project has received funding from the European Union’s Horizon 2020 Research and Innovation program under grant agreement No 957337. The website reflects only the view of the author(s) and the Commission is not responsible for any use that may be made of the information it contains.