Picture for Jewon Lee

Jewon Lee

ERGO: Efficient High-Resolution Visual Understanding for Vision-Language Models

Add code
Sep 26, 2025
Viaarxiv icon

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

Add code
Apr 01, 2025
Figure 1 for Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features
Figure 2 for Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features
Figure 3 for Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features
Figure 4 for Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features
Viaarxiv icon