Mixtures of probability experts for audio retrieval and indexing

Malcolm Slaney

doi:10.1109/ICME.2002.1035789

ICME 2002

Conference paper

26 Aug 2002

Mixtures of probability experts for audio retrieval and indexing

View publication

Abstract

This paper describes a system for connecting nonspeech sounds and words using linked multidimensional vector spaces. An approach based on a mixture of experts learns the mapping between one space and the other. This paper describes the conversion of audio and semantic data into their respective vector spaces. Two different mixture-of-probability-expert models are trained to learn the association between acoustic queries and the corresponding semantic explanation, and vice versa. Test results are presented based on commercial sound effects CDs.

Conference paper