Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation


Jun 22, 2022
Jiahui Yu , Yuanzhong Xu , Jing Yu Koh , Thang Luong , Gunjan Baid , Zirui Wang , Vijay Vasudevan , Alexander Ku , Yinfei Yang , Burcu Karagol Ayan , Ben Hutchinson , Wei Han , Zarana Parekh , Xin Li , Han Zhang , Jason Baldridge , Yonghui Wu

* Preprint 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

CoCa: Contrastive Captioners are Image-Text Foundation Models


May 04, 2022
Jiahui Yu , Zirui Wang , Vijay Vasudevan , Legg Yeung , Mojtaba Seyedhosseini , Yonghui Wu

* Preprint 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Self-supervised Learning with Random-projection Quantizer for Speech Recognition


Feb 03, 2022
Chung-Cheng Chiu , James Qin , Yu Zhang , Jiahui Yu , Yonghui Wu


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Co-training Transformer with Videos and Images Improves Action Recognition


Dec 14, 2021
Bowen Zhang , Jiahui Yu , Christopher Fifty , Wei Han , Andrew M. Dai , Ruoming Pang , Fei Sha


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Vector-quantized Image Modeling with Improved VQGAN


Oct 09, 2021
Jiahui Yu , Xin Li , Jing Yu Koh , Han Zhang , Ruoming Pang , James Qin , Alexander Ku , Yuanzhong Xu , Jason Baldridge , Yonghui Wu

* Preprint 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition


Oct 01, 2021
Yu Zhang , Daniel S. Park , Wei Han , James Qin , Anmol Gulati , Joel Shor , Aren Jansen , Yuanzhong Xu , Yanping Huang , Shibo Wang , Zongwei Zhou , Bo Li , Min Ma , William Chan , Jiahui Yu , Yongqiang Wang , Liangliang Cao , Khe Chai Sim , Bhuvana Ramabhadran , Tara N. Sainath , Françoise Beaufays , Zhifeng Chen , Quoc V. Le , Chung-Cheng Chiu , Ruoming Pang , Yonghui Wu

* 14 pages, 7 figures, 13 tables; v2: minor corrections, reference baselines and bibliography updated 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

SimVLM: Simple Visual Language Model Pretraining with Weak Supervision


Aug 24, 2021
Zirui Wang , Jiahui Yu , Adams Wei Yu , Zihang Dai , Yulia Tsvetkov , Yuan Cao


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Normalization effects on shallow neural networks and related asymptotic expansions


Nov 20, 2020
Jiahui Yu , Konstantinos Spiliopoulos


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Cascaded encoders for unifying streaming and non-streaming ASR


Oct 27, 2020
Arun Narayanan , Tara N. Sainath , Ruoming Pang , Jiahui Yu , Chung-Cheng Chiu , Rohit Prabhavalkar , Ehsan Variani , Trevor Strohman


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization


Oct 21, 2020
Jiahui Yu , Chung-Cheng Chiu , Bo Li , Shuo-yiin Chang , Tara N. Sainath , Yanzhang He , Arun Narayanan , Wei Han , Anmol Gulati , Yonghui Wu , Ruoming Pang

* tech report 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
4
>>