Yong Jae Lee

Matryoshka Multimodal Models

May 27, 2024

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Apr 01, 2024

LLM Inference Unveiled: Survey and Roofline Model Insights

Mar 11, 2024

Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving

Feb 23, 2024

CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples

Feb 20, 2024

Edit One for All: Interactive Batch Image Editing

Jan 18, 2024

Interfacing Foundation Models' Embeddings

Dec 12, 2023

Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images

Dec 04, 2023

Making Large Multimodal Models Understand Arbitrary Visual Prompts

Dec 01, 2023

Testing learning-enabled cyber-physical systems with Large-Language Models: A Formal Approach

Nov 13, 2023