Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

VQ3D: Learning a 3D-Aware Generative Model on ImageNet


Feb 14, 2023
Kyle Sargent, Jing Yu Koh, Han Zhang, Huiwen Chang, Charles Herrmann, Pratul Srinivasan, Jiajun Wu, Deqing Sun

Add code

* 15 pages. For visual results, please visit the project webpage at http://kylesargent.github.io/vq3d 

   Access Paper or Ask Questions

Grounding Language Models to Images for Multimodal Generation


Jan 31, 2023
Jing Yu Koh, Ruslan Salakhutdinov, Daniel Fried

Add code

* Project page: https://jykoh.com/fromage 

   Access Paper or Ask Questions

A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning


Oct 06, 2022
Aishwarya Kamath, Peter Anderson, Su Wang, Jing Yu Koh, Alexander Ku, Austin Waters, Yinfei Yang, Jason Baldridge, Zarana Parekh

Add code


   Access Paper or Ask Questions

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation


Jun 22, 2022
Jiahui Yu, Yuanzhong Xu, Jing Yu Koh, Thang Luong, Gunjan Baid, Zirui Wang, Vijay Vasudevan, Alexander Ku, Yinfei Yang, Burcu Karagol Ayan, Ben Hutchinson, Wei Han, Zarana Parekh, Xin Li, Han Zhang, Jason Baldridge, Yonghui Wu

Add code

* Preprint 

   Access Paper or Ask Questions

Simple and Effective Synthesis of Indoor 3D Scenes


Apr 06, 2022
Jing Yu Koh, Harsh Agrawal, Dhruv Batra, Richard Tucker, Austin Waters, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson

Add code


   Access Paper or Ask Questions

Vector-quantized Image Modeling with Improved VQGAN


Oct 09, 2021
Jiahui Yu, Xin Li, Jing Yu Koh, Han Zhang, Ruoming Pang, James Qin, Alexander Ku, Yuanzhong Xu, Jason Baldridge, Yonghui Wu

Add code

* Preprint 

   Access Paper or Ask Questions

Pathdreamer: A World Model for Indoor Navigation


May 18, 2021
Jing Yu Koh, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson

Add code


   Access Paper or Ask Questions

Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction


Apr 14, 2021
Wonkwang Lee, Whie Jung, Han Zhang, Ting Chen, Jing Yu Koh, Thomas Huang, Hyungsuk Yoon, Honglak Lee, Seunghoon Hong

Add code

* Accepted as a conference paper at ICLR 2021 

   Access Paper or Ask Questions

Cross-Modal Contrastive Learning for Text-to-Image Generation


Jan 15, 2021
Han Zhang, Jing Yu Koh, Jason Baldridge, Honglak Lee, Yinfei Yang

Add code


   Access Paper or Ask Questions

1
2
>>