Ensembles of multi-scale VGG acoustic models
Michael Heck, Masayuki Suzuki, et al.
INTERSPEECH 2017
Many paralinguistic tasks are closely related and thus representations learned in one domain can be leveraged for another. In this paper, we investigate how knowledge can be transferred between three paralinguistic tasks: speaker, emotion, and gender recognition. Further, we extend this problem to cross-dataset tasks, asking how knowledge captured in one emotion dataset can be transferred to another. We focus on progressive neural networks and compare these networks to the conventional deep learning method of pre-Training and fine-Tuning. Progressive neural networks provide a way to transfer knowledge and avoid the forgetting effect present when pre-Training neural networks on different tasks. Our experiments demonstrate that: (1) emotion recognition can benefit from using representations originally learned for different paralinguistic tasks and (2) transfer learning can effectively leverage additional datasets to improve the performance of emotion recognition systems.
Michael Heck, Masayuki Suzuki, et al.
INTERSPEECH 2017
George Saon, Gakuto Kurata, et al.
INTERSPEECH 2017
Asaf Rendel, Raul Fernandez, et al.
INTERSPEECH 2017
Zakaria Aldeneh, Mimansa Jaiswal, et al.
INTERSPEECH 2019