MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation


Dec 19, 2022
Ludan Ruan, Yiyang Ma, Huan Yang, Huiguo He, Bei Liu, Jianlong Fu, Nicholas Jing Yuan, Qin Jin, Baining Guo

Build a SRE Challenge System: Lessons from VoxSRC 2022 and CNSRC 2022


Nov 02, 2022
Zhengyang Chen, Bing Han, Xu Xiang, Houjun Huang, Bei Liu, Yanmin Qian

Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning


Oct 12, 2022
Yuchong Sun, Hongwei Xue, Ruihua Song, Bei Liu, Huan Yang, Jianlong Fu

* Accepted by NeurIPS 2022 

CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment


Sep 23, 2022
Hongwei Xue, Yuchong Sun, Bei Liu, Jianlong Fu, Ruihua Song, Houqiang Li, Jiebo Luo

SJTU-AISPEECH System for VoxCeleb Speaker Recognition Challenge 2022


Sep 20, 2022
Zhengyang Chen, Bing Han, Xu Xiang, Houjun Huang, Bei Liu, Yanmin Qian

* System description of VoxSRC 2022 

AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation


Sep 08, 2022
Yiyang Ma, Huan Yang, Bei Liu, Jianlong Fu, Jiaying Liu

Language-Guided Face Animation by Recurrent StyleGAN-based Generator


Aug 11, 2022
Tiankai Hang, Huan Yang, Bei Liu, Jianlong Fu, Xin Geng, Baining Guo

Exploring Anchor-based Detection for Ego4D Natural Language Query


Aug 10, 2022
Sipeng Zheng, Qi Zhang, Bei Liu, Qin Jin, Jianlong Fu

* CVPR 2022 Ego4D Workshop NLQ Challenge 

The SJTU X-LANCE Lab System for CNSRC 2022


Jun 23, 2022
Zhengyang Chen, Bei Liu, Bing Han, Leying Zhang, Yanmin Qian
