Picture for Depeng Wang

Depeng Wang

EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs

Add code
Dec 11, 2025
Figure 1 for EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
Figure 2 for EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
Figure 3 for EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
Figure 4 for EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
Viaarxiv icon

Keep the General, Inject the Specific: Structured Dialogue Fine-Tuning for Knowledge Injection without Catastrophic Forgetting

Add code
Apr 27, 2025
Figure 1 for Keep the General, Inject the Specific: Structured Dialogue Fine-Tuning for Knowledge Injection without Catastrophic Forgetting
Figure 2 for Keep the General, Inject the Specific: Structured Dialogue Fine-Tuning for Knowledge Injection without Catastrophic Forgetting
Figure 3 for Keep the General, Inject the Specific: Structured Dialogue Fine-Tuning for Knowledge Injection without Catastrophic Forgetting
Figure 4 for Keep the General, Inject the Specific: Structured Dialogue Fine-Tuning for Knowledge Injection without Catastrophic Forgetting
Viaarxiv icon