Picture for Woody Haosheng Gan

Woody Haosheng Gan

Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens

Add code
Feb 18, 2026
Viaarxiv icon

Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models

Add code
May 20, 2025
Figure 1 for Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models
Figure 2 for Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models
Figure 3 for Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models
Figure 4 for Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models
Viaarxiv icon