M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval


Nov 02, 2022
Layne Berry, Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Hung-yi Lee, David Harwath

* Submitted to ICASSP 2023 

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model


Oct 03, 2022
Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Layne Berry, Hung-yi Lee, David Harwath

* Accepted to IEEE SLT 2022 

Theme Transformer: Symbolic Music Generation with Theme-Conditioned Transformer


Nov 07, 2021
Yi-Jen Shih, Shih-Lun Wu, Frank Zalkow, Meinard Müller, Yi-Hsuan Yang
