Yong Jae Lee

Making Large Multimodal Models Understand Arbitrary Visual Prompts

Dec 01, 2023
Mu Cai, Haotian Liu, Siva Karthik Mustikovela, Gregory P. Meyer, Yuning Chai, Dennis Park, Yong Jae Lee

Testing learning-enabled cyber-physical systems with Large-Language Models: A Formal Approach

Nov 13, 2023
Xi Zheng, Aloysius K. Mok, Ruzica Piskac, Yong Jae Lee, Bhaskar Krishnamachari, Dakai Zhu, Oleg Sokolsky, Insup Lee

Improved Baselines with Visual Instruction Tuning

Oct 05, 2023
Haotian Liu, Chunyuan Li, Yuheng Li, Yong Jae Lee

Investigating the Catastrophic Forgetting in Multimodal Large Language Models

Sep 26, 2023
Yuexiang Zhai, Shengbang Tong, Xiao Li, Mu Cai, Qing Qu, Yong Jae Lee, Yi Ma

A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance

Sep 21, 2023
Zeyi Huang, Andy Zhou, Zijian Lin, Mu Cai, Haohan Wang, Yong Jae Lee

Visual Instruction Inversion: Image Editing via Visual Prompting

Jul 26, 2023
Thao Nguyen, Yuheng Li, Utkarsh Ojha, Yong Jae Lee

Benchmarking and Analyzing Generative Data for Visual Recognition

Jul 25, 2023
Bo Li, Haotian Liu, Liangyu Chen, Yong Jae Lee, Chunyuan Li, Ziwei Liu

Generate Anything Anywhere in Any Scene

Jun 29, 2023
Yuheng Li, Haotian Liu, Yangming Wen, Yong Jae Lee

Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding

Jun 09, 2023
Mu Cai, Zeyi Huang, Yuheng Li, Haohan Wang, Yong Jae Lee
