Exploring the Limits of Conformer CTC-Encoder for Speech Emotion Recognition using Large Language Models. Edmilson Da Silva Morais, Hagai Aronowitz, et al. 2025. INTERSPEECH 2025. Conference paper.
SKIP-SALSA: Skip Synchronous Fusion of ASR LLM Decoders. Ashish Mittal, Darshan Prabhu, et al. 2025. INTERSPEECH 2025. Conference paper.
Voice Activity-based Text Segmentation for ASR Text Denormalization. Sashi Novitasari, Takashi Fukuda, et al. 2025. INTERSPEECH 2025. Conference paper.
Improving End-to-end Mixed-case ASR with Knowledge Distillation and Integration of Voice Activity Cues. Sashi Novitasari, Takashi Fukuda, et al. 2025. INTERSPEECH 2025. Conference paper.
Spoken question answering for visual queries. Nimrod Shabtay, Zvi Kons, et al. 2025. INTERSPEECH 2025. Conference paper.