Paul J. Steinhardt, P. Chaudhari
Journal of Computational Physics
An automatic transcription of Mandarin broadcast speech system was developed at IBM under the DARPA GALE program. In particular, this system applies a discriminative acoustic model training method and a new topic-adaptive language modeling technique to achieve the best recognition performance using multiple pass decoding. Results are given for three Gale test sets designed to cover both the broadcast news and the broadcast conversation domains. The transcription system achieves satisfactory performance on these datasets. The recognition errors are highly dependent on the speaking style, speech overlap and accent, which helps steer future research.
Paul J. Steinhardt, P. Chaudhari
Journal of Computational Physics
Hannaneh Hajishirzi, Julia Hockenmaier, et al.
UAI 2011
A. Skumanich
SPIE OE/LASE 1992
Heng Cao, Haifeng Xi, et al.
WSC 2003