Stereo-based stochastic mapping for robust speech recognition

Mohamed Afify; Xiaodong Cui; Yuqing Gao

doi:10.1109/ICASSP.2007.366928

ICASSP 2007

Conference paper

06 Aug 2007

Stereo-based stochastic mapping for robust speech recognition

View publication

Abstract

We present a stochastic mapping technique for robust speech recognition that uses stereo data. The idea is based on building a GMM for the joint distribution of the clean and noisy channels during training and using an iterative compensation algorithm during testing. The proposed mapping was also interpreted as a mixture of linear transforms that are estimated in a special way using stereo data. The proposed method results in 28% relative improvement in string error rate (SER) for digit recognition in the car, and in about 10% relative improvement in word error rate (WER), when applied in conjunction with multi-style training (MST), for large vocabulary English speech recognition. © 2007 IEEE.

Conference paper