Discriminative training of decoding graphs for large vocabulary continuous speech recognition

Hong-Kwang Jeff Kuo; Brian Kingsbury; Geoffrey Zweig

doi:10.1109/ICASSP.2007.367159

ICASSP 2007

Conference paper

06 Aug 2007



Abstract

Finite-state decoding graphs integrate the decision trees, pronunciation model and language model for speech recognition into a unified representation of the search space. We explore discriminative training of the transition weights in the decoding graph in the context of large vocabulary speech recognition. In preliminary experiments on the RT-03 English Broadcast News evaluation set, the word error rate was reduced by about 5.7% relative, from 23.0% to 21.7%. We discuss how this method is particularly applicable to low-latency and low-resource applications such as real-time closed captioning of broadcast news and interactive speech-to-speech translation. © 2007 IEEE.
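The relative improvement quoted in the abstract can be checked from the two reported word error rates. The following is a minimal sketch of that arithmetic only; the function name `relative_reduction` is a hypothetical helper introduced here for illustration and is not part of the paper.

```python
def relative_reduction(baseline_wer: float, new_wer: float) -> float:
    """Return the relative reduction in word error rate, as a percentage."""
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return 100.0 * (baseline_wer - new_wer) / baseline_wer


# Word error rates reported in the abstract (RT-03 English Broadcast News).
baseline, trained = 23.0, 21.7

absolute = baseline - trained
relative = relative_reduction(baseline, trained)
print(f"absolute: {absolute:.1f} points, relative: {relative:.1f}%")
# prints "absolute: 1.3 points, relative: 5.7%"
```

The absolute gain is 1.3 percentage points; dividing by the 23.0% baseline gives the roughly 5.7% relative reduction stated above.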
