Picture for Shichaang Meng

Shichaang Meng

LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning

Add code
Jan 15, 2026
Viaarxiv icon