Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis


Dec 09, 2022
Weixi Feng, Xuehai He, Tsu-Jui Fu, Varun Jampani, Arjun Akula, Pradyumna Narayana, Sugato Basu, Xin Eric Wang, William Yang Wang

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation


Nov 23, 2022
Tsu-Jui Fu, Licheng Yu, Ning Zhang, Cheng-Yang Fu, Jong-Chyi Su, William Yang Wang, Sean Bell

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

CPL: Counterfactual Prompt Learning for Vision and Language Models


Oct 19, 2022
Xuehai He, Diji Yang, Weixi Feng, Tsu-Jui Fu, Arjun Akula, Varun Jampani, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Eric Wang

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

ULN: Towards Underspecified Vision-and-Language Navigation


Oct 18, 2022
Weixi Feng, Tsu-Jui Fu, Yujie Lu, William Yang Wang

Add code

* EMNLP 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling


Sep 04, 2022
Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu

Add code

* The first two authors contributed equally. arXiv admin note: text overlap with arXiv:2111.12681 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

VIOLET : End-to-End Video-Language Transformers with Masked Visual-token Modeling


Nov 24, 2021
Tsu-Jui Fu, Linjie Li, Zhe Gan, Kevin Lin, William Yang Wang, Lijuan Wang, Zicheng Liu

Add code

* Code is available at https://github.com/tsujuifu/pytorch_violet 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Language-Driven Image Style Transfer


Jun 01, 2021
Tsu-Jui Fu, Xin Eric Wang, William Yang Wang

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Language-based Video Editing via Multi-Modal Multi-Level Transformer


Apr 02, 2021
Tsu-Jui Fu, Xin Eric Wang, Scott T. Grafton, Miguel P. Eckstein, William Yang Wang

Add code

* 11 pages, 6 figures 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents


Feb 14, 2021
Tsu-Jui Fu, William Yang Wang, Daniel McDuff, Yale Song

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

L2C: Describing Visual Differences Needs Semantic Understanding of Individuals


Feb 03, 2021
An Yan, Xin Eric Wang, Tsu-Jui Fu, William Yang Wang

Add code

* EACL-2021 short 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
>>