Yaguang Song

A Step Toward Federated Pretraining of Multimodal Large Language Models

Mar 25, 2026
Via arXiv

Explicit Uncertainty Modeling for Active CLIP Adaptation with Dual Prompt Tuning

Feb 04, 2026
Via arXiv

Fine-tuning Pre-trained Vision-Language Models in a Human-Annotation-Free Manner

Feb 04, 2026
Via arXiv

Harmony: A Unified Framework for Modality Incremental Learning

Apr 17, 2025
Via arXiv

Libra: Building Decoupled Vision System on Large Language Models

May 16, 2024
Via arXiv