Alert button
Picture for Mu Cai

Mu Cai

Alert button

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Add code
Bookmark button
Alert button
Apr 01, 2024
Yuzhang Shang, Mu Cai, Bingxin Xu, Yong Jae Lee, Yan Yan

Viaarxiv icon

CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples

Add code
Bookmark button
Alert button
Feb 20, 2024
Jianrui Zhang, Mu Cai, Tengyang Xie, Yong Jae Lee

Viaarxiv icon

Making Large Multimodal Models Understand Arbitrary Visual Prompts

Add code
Bookmark button
Alert button
Dec 01, 2023
Mu Cai, Haotian Liu, Siva Karthik Mustikovela, Gregory P. Meyer, Yuning Chai, Dennis Park, Yong Jae Lee

Viaarxiv icon

Investigating the Catastrophic Forgetting in Multimodal Large Language Models

Add code
Bookmark button
Alert button
Sep 26, 2023
Yuexiang Zhai, Shengbang Tong, Xiao Li, Mu Cai, Qing Qu, Yong Jae Lee, Yi Ma

Figure 1 for Investigating the Catastrophic Forgetting in Multimodal Large Language Models
Figure 2 for Investigating the Catastrophic Forgetting in Multimodal Large Language Models
Figure 3 for Investigating the Catastrophic Forgetting in Multimodal Large Language Models
Figure 4 for Investigating the Catastrophic Forgetting in Multimodal Large Language Models
Viaarxiv icon

A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance

Add code
Bookmark button
Alert button
Sep 21, 2023
Zeyi Huang, Andy Zhou, Zijian Lin, Mu Cai, Haohan Wang, Yong Jae Lee

Figure 1 for A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance
Figure 2 for A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance
Figure 3 for A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance
Figure 4 for A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance
Viaarxiv icon

Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding

Add code
Bookmark button
Alert button
Jun 09, 2023
Mu Cai, Zeyi Huang, Yuheng Li, Haohan Wang, Yong Jae Lee

Figure 1 for Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding
Figure 2 for Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding
Figure 3 for Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding
Figure 4 for Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding
Viaarxiv icon

Out-of-distribution Detection via Frequency-regularized Generative Models

Add code
Bookmark button
Alert button
Aug 18, 2022
Mu Cai, Yixuan Li

Figure 1 for Out-of-distribution Detection via Frequency-regularized Generative Models
Figure 2 for Out-of-distribution Detection via Frequency-regularized Generative Models
Figure 3 for Out-of-distribution Detection via Frequency-regularized Generative Models
Figure 4 for Out-of-distribution Detection via Frequency-regularized Generative Models
Viaarxiv icon

Masked Discrimination for Self-Supervised Learning on Point Clouds

Add code
Bookmark button
Alert button
Mar 21, 2022
Haotian Liu, Mu Cai, Yong Jae Lee

Figure 1 for Masked Discrimination for Self-Supervised Learning on Point Clouds
Figure 2 for Masked Discrimination for Self-Supervised Learning on Point Clouds
Figure 3 for Masked Discrimination for Self-Supervised Learning on Point Clouds
Figure 4 for Masked Discrimination for Self-Supervised Learning on Point Clouds
Viaarxiv icon