Articulatory feature detection with support vector machines for integration into ASR and phone recognition

Upendra V. Chaudhari; Michael Picheny

doi:10.1109/ASRU.2009.5373326

ASRU 2009

Conference paper

01 Dec 2009

Articulatory feature detection with support vector machines for integration into ASR and phone recognition

View publication

Abstract

We study the use of Support Vector Machines (SVM) for detecting the occurrence of articulatory features in speech audio data and using the information contained in the detector outputs to improve phone and speech recognition. Our expectation is that an SVM should be able to appropriately model the separation of the classes which may have complex distributions in feature space. We show that performance improves markedly when using discriminatively trained speaker dependent parameters for the SVM inputs, and compares quite well to results in the literature using other classifiers, namely Artificial Neural Networks (ANN). Further, we show that the resulting detector outputs can be successfully integrated into a state of the art speech recognition system, with consequent performance gains. Notably, we test our system on English broadcast news data from dev04f. © 2009 IEEE.

Paper