Yong Jae Lee

X-Fusion: Introducing New Modality to Frozen Large Language Models

Apr 29, 2025
Via arXiv

YoChameleon: Personalized Vision and Language Generation

Apr 29, 2025
Via arXiv

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

Apr 01, 2025
Via arXiv

Do Vision Models Develop Human-Like Progressive Difficulty Understanding?

Mar 17, 2025
Via arXiv

Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection

Feb 11, 2025
Via arXiv

LASER: Lip Landmark Assisted Speaker Detection for Robustness

Jan 21, 2025
Via arXiv

Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs

Jan 08, 2025
Via arXiv

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Oct 15, 2024
Via arXiv

On the Effectiveness of Dataset Alignment for Fake Image Detection

Oct 15, 2024
Via arXiv

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Oct 03, 2024
Via arXiv