Hagen Soltau, George Saon, et al.
IEEE Transactions on Audio, Speech and Language Processing
The universal background model (UBM) is an effective framework widely used in speaker recognition. But so far it has received little attention from the speech recognition field. In this work, we make a first attempt to apply the UBM to acoustic modeling in ASR. We propose a tree-based parameter estimation technique for UBMs, and describe a set of smoothing and pruning methods to facilitate learning. The proposed UBM approach is benchmarked on a state-of-the-art large-vocabulary continuous speech recognition platform on a broadcast transcription task. Preliminary experiments reported in this paper already show very exciting results. ©2008 IEEE.
Hagen Soltau, George Saon, et al.
IEEE Transactions on Audio, Speech and Language Processing
Junchi Yan, Chao Zhang, et al.
CVPR 2015
Chao Xue, Junchi Yan, et al.
CVPR 2019
Stephen M. Chu, Thomas S. Huang
CVPR 2007