Dacheng Tao

and Other Contributors

Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging

May 26, 2025

SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data

May 25, 2025

MLLMs are Deeply Affected by Modality Bias

May 24, 2025

VORTA: Efficient Video Diffusion via Routing Sparse Attention

May 24, 2025

Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments

May 23, 2025

R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO

May 22, 2025

R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search

May 22, 2025

KaFT: Knowledge-aware Fine-tuning for Boosting LLMs' Domain-specific Question-Answering Performance

May 21, 2025

Cross-Domain Diffusion with Progressive Alignment for Efficient Adaptive Retrieval

May 20, 2025

Reasoning-OCR: Can Large Multimodal Models Solve Complex Logical Reasoning Problems from OCR Cues?

May 19, 2025