XLAVS-R: Cross-Lingual Audio-Visual Speech Representation from Efficient Modality Injection
Mohamed Anwar, Hyojung Han, Bowen Shi, Wei-Ning Hsu, Changhan Wang
Meta AI

Abstract

Coming soon (January 2024).