Publications

6 results at INTERSPEECH 2024

Exploring the Benefits of Tokenization of Discrete Acoustic Units
- - Avihu Dekel
  - Raul Fernandez
- 2024
- INTERSPEECH 2024
Conference paper
Exploring the limits of decoder-only models trained on public speech recognition corpora
- - Ankit Gupta
  - George Saon
  - et al.
- 2024
- INTERSPEECH 2024
Conference paper
M2 ASR: Multilingual Multi-task Automatic Speech Recognition via Multi-objective Optimization
- - A Saif
  - Lisha Chen
  - et al.
- 2024
- INTERSPEECH 2024
Conference paper
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation
- - Andrew Rouditchenko
  - Yuan Gong
  - et al.
- 2024
- INTERSPEECH 2024
Conference paper
Low Bitrate High-Quality RVQGAN-based Discrete Speech Tokenizer
- - Slava Shechtman
  - Avihu Dekel
- 2024
- INTERSPEECH 2024
Conference paper
SALSA: Speedy ASR-LLM Synchronous Aggregation
- - Ashish Mittal
  - Darshan Prabhu
  - et al.
- 2024
- INTERSPEECH 2024
Conference paper