Picture for Luoxin Ye

Luoxin Ye

SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models

Add code
May 01, 2025
Viaarxiv icon

GenEx: Generating an Explorable World

Add code
Dec 12, 2024
Viaarxiv icon

LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression

Add code
Jun 28, 2024
Figure 1 for LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression
Figure 2 for LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression
Figure 3 for LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression
Figure 4 for LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression
Viaarxiv icon