Teaching VLMs to Localize Specific Objects from In-context ExamplesSivan DovehNimrod Shabtayet al.2025ICCV 2025
On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech SynthesisCheng-i Jeff LaiErica Cooperet al.2022ICASSP 2022