Picture for Jing Yu Koh

Jing Yu Koh

Generating Images with Multimodal Language Models

Add code
May 26, 2023
Figure 1 for Generating Images with Multimodal Language Models
Figure 2 for Generating Images with Multimodal Language Models
Figure 3 for Generating Images with Multimodal Language Models
Figure 4 for Generating Images with Multimodal Language Models
Viaarxiv icon

VQ3D: Learning a 3D-Aware Generative Model on ImageNet

Add code
Feb 14, 2023
Figure 1 for VQ3D: Learning a 3D-Aware Generative Model on ImageNet
Figure 2 for VQ3D: Learning a 3D-Aware Generative Model on ImageNet
Figure 3 for VQ3D: Learning a 3D-Aware Generative Model on ImageNet
Figure 4 for VQ3D: Learning a 3D-Aware Generative Model on ImageNet
Viaarxiv icon

Grounding Language Models to Images for Multimodal Generation

Add code
Jan 31, 2023
Figure 1 for Grounding Language Models to Images for Multimodal Generation
Figure 2 for Grounding Language Models to Images for Multimodal Generation
Figure 3 for Grounding Language Models to Images for Multimodal Generation
Figure 4 for Grounding Language Models to Images for Multimodal Generation
Viaarxiv icon

A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning

Add code
Oct 06, 2022
Figure 1 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Figure 2 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Figure 3 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Figure 4 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Viaarxiv icon

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

Add code
Jun 22, 2022
Figure 1 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 2 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 3 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 4 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Viaarxiv icon

Simple and Effective Synthesis of Indoor 3D Scenes

Add code
Apr 06, 2022
Figure 1 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 2 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 3 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 4 for Simple and Effective Synthesis of Indoor 3D Scenes
Viaarxiv icon

Vector-quantized Image Modeling with Improved VQGAN

Add code
Oct 09, 2021
Figure 1 for Vector-quantized Image Modeling with Improved VQGAN
Figure 2 for Vector-quantized Image Modeling with Improved VQGAN
Figure 3 for Vector-quantized Image Modeling with Improved VQGAN
Figure 4 for Vector-quantized Image Modeling with Improved VQGAN
Viaarxiv icon

Pathdreamer: A World Model for Indoor Navigation

Add code
May 18, 2021
Figure 1 for Pathdreamer: A World Model for Indoor Navigation
Figure 2 for Pathdreamer: A World Model for Indoor Navigation
Figure 3 for Pathdreamer: A World Model for Indoor Navigation
Figure 4 for Pathdreamer: A World Model for Indoor Navigation
Viaarxiv icon

Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction

Add code
Apr 14, 2021
Figure 1 for Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction
Figure 2 for Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction
Figure 3 for Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction
Figure 4 for Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction
Viaarxiv icon

Cross-Modal Contrastive Learning for Text-to-Image Generation

Add code
Jan 15, 2021
Figure 1 for Cross-Modal Contrastive Learning for Text-to-Image Generation
Figure 2 for Cross-Modal Contrastive Learning for Text-to-Image Generation
Figure 3 for Cross-Modal Contrastive Learning for Text-to-Image Generation
Figure 4 for Cross-Modal Contrastive Learning for Text-to-Image Generation
Viaarxiv icon