Robust speech recognition using dynamic noise adaptation
Steven Rennie, Pierre Dognin, et al.
ICASSP 2011
We present supervised approaches for detecting speaker roles and agreement/disagreement between speakers in broadcast conversation shows in three languages: English, Arabic, and Mandarin. We develop annotation approaches for a variety of linguistic phenomena. We explore a range of lexical, structural, and social-network-analysis-based features, and analyze feature importance across the three languages. We also compare performance when features are extracted from automatically generated annotations with performance when they are extracted from human annotations. The algorithms achieve speaker role labeling accuracy of more than 86% for all three languages. Across the three languages, the algorithms achieve precision of 63% to 92% for agreement detection and 55% to 85% for disagreement detection. © 2011 IEEE.