Alert button
Picture for Jaemin Cho

Jaemin Cho

Alert button

Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Add code
Bookmark button
Alert button
Apr 15, 2024
Han Lin, Jaemin Cho, Abhay Zala, Mohit Bansal

Viaarxiv icon

Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts

Add code
Bookmark button
Alert button
Mar 31, 2024
Qin Liu, Jaemin Cho, Mohit Bansal, Marc Niethammer

Viaarxiv icon

EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents

Add code
Bookmark button
Alert button
Mar 18, 2024
Abhay Zala, Jaemin Cho, Han Lin, Jaehong Yoon, Mohit Bansal

Figure 1 for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Figure 2 for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Figure 3 for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Figure 4 for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Viaarxiv icon

SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data

Add code
Bookmark button
Alert button
Mar 11, 2024
Jialu Li, Jaemin Cho, Yi-Lin Sung, Jaehong Yoon, Mohit Bansal

Figure 1 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 2 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 3 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 4 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Viaarxiv icon

Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training

Add code
Bookmark button
Alert button
Mar 04, 2024
David Wan, Jaemin Cho, Elias Stengel-Eskin, Mohit Bansal

Figure 1 for Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
Figure 2 for Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
Figure 3 for Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
Figure 4 for Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
Viaarxiv icon

Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation

Add code
Bookmark button
Alert button
Oct 30, 2023
Jaemin Cho, Yushi Hu, Roopal Garg, Peter Anderson, Ranjay Krishna, Jason Baldridge, Mohit Bansal, Jordi Pont-Tuset, Su Wang

Viaarxiv icon

DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning

Add code
Bookmark button
Alert button
Oct 18, 2023
Abhay Zala, Han Lin, Jaemin Cho, Mohit Bansal

Viaarxiv icon

VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

Add code
Bookmark button
Alert button
Sep 26, 2023
Han Lin, Abhay Zala, Jaemin Cho, Mohit Bansal

Figure 1 for VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
Figure 2 for VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
Figure 3 for VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
Figure 4 for VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
Viaarxiv icon

Paxion: Patching Action Knowledge in Video-Language Foundation Models

Add code
Bookmark button
Alert button
May 26, 2023
Zhenhailong Wang, Ansel Blume, Sha Li, Genglin Liu, Jaemin Cho, Zineng Tang, Mohit Bansal, Heng Ji

Figure 1 for Paxion: Patching Action Knowledge in Video-Language Foundation Models
Figure 2 for Paxion: Patching Action Knowledge in Video-Language Foundation Models
Figure 3 for Paxion: Patching Action Knowledge in Video-Language Foundation Models
Figure 4 for Paxion: Patching Action Knowledge in Video-Language Foundation Models
Viaarxiv icon

Visual Programming for Text-to-Image Generation and Evaluation

Add code
Bookmark button
Alert button
May 24, 2023
Jaemin Cho, Abhay Zala, Mohit Bansal

Figure 1 for Visual Programming for Text-to-Image Generation and Evaluation
Figure 2 for Visual Programming for Text-to-Image Generation and Evaluation
Figure 3 for Visual Programming for Text-to-Image Generation and Evaluation
Figure 4 for Visual Programming for Text-to-Image Generation and Evaluation
Viaarxiv icon