Conference paper
Improvements to deep convolutional neural networks for LVCSR
Tara N. Sainath, Brian Kingsbury, et al.
ASRU 2013
This paper exploits the fact that when GMM and SVM classifiers with roughly the same level of performance exhibit uncorrelated errors they can be combined to produce a better classifier. The gain accrues from combining the descriptive strength of GMM models with the discriminative power of SVM classifiers. This idea, first exploited in the context of speaker recognition [1, 2], is applied to speech recognition - specifically to a digit recognition task in a noisy environment - with significant gains in performance.
Tara N. Sainath, Brian Kingsbury, et al.
ASRU 2013
George Saon, Michael Picheny
ASRU 2007
Michael A. Picheny, David Nahamoo, et al.
IBM J. Res. Dev
Ramesh A. Gopinath
IEEE TSP