AVLnet: Learning audio-visual language representations from instructional videosAndrew RouditchenkoAngie Boggustet al.2021INTERSPEECH 2021
Self-supervised segmentation and source separation on videosAndrew RouditchenkoHang Zhaoet al.2019CVPRW 2019