Picture for Tao Jin

Tao Jin

University of Science and Technology of China

Unleashing the Power of Natural Audio Featuring Multiple Sound Sources

Add code
Apr 24, 2025
Viaarxiv icon

ConceptGuard: Continual Personalized Text-to-Image Generation with Forgetting and Confusion Mitigation

Add code
Mar 13, 2025
Viaarxiv icon

Smoothing the Shift: Towards Stable Test-Time Adaptation under Complex Multimodal Noises

Add code
Mar 04, 2025
Viaarxiv icon

Low-rank Prompt Interaction for Continual Vision-Language Retrieval

Add code
Jan 24, 2025
Figure 1 for Low-rank Prompt Interaction for Continual Vision-Language Retrieval
Figure 2 for Low-rank Prompt Interaction for Continual Vision-Language Retrieval
Figure 3 for Low-rank Prompt Interaction for Continual Vision-Language Retrieval
Figure 4 for Low-rank Prompt Interaction for Continual Vision-Language Retrieval
Viaarxiv icon

OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios

Add code
Jan 02, 2025
Figure 1 for OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios
Figure 2 for OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios
Figure 3 for OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios
Figure 4 for OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios
Viaarxiv icon

Speech Watermarking with Discrete Intermediate Representations

Add code
Dec 18, 2024
Viaarxiv icon

A Wander Through the Multimodal Landscape: Efficient Transfer Learning via Low-rank Sequence Multimodal Adapter

Add code
Dec 12, 2024
Viaarxiv icon

Bridging the Gap for Test-Time Multimodal Sentiment Analysis

Add code
Dec 10, 2024
Figure 1 for Bridging the Gap for Test-Time Multimodal Sentiment Analysis
Figure 2 for Bridging the Gap for Test-Time Multimodal Sentiment Analysis
Figure 3 for Bridging the Gap for Test-Time Multimodal Sentiment Analysis
Figure 4 for Bridging the Gap for Test-Time Multimodal Sentiment Analysis
Viaarxiv icon

Classifier-guided Gradient Modulation for Enhanced Multimodal Learning

Add code
Nov 03, 2024
Viaarxiv icon

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

Add code
Oct 28, 2024
Figure 1 for OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
Figure 2 for OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
Figure 3 for OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
Figure 4 for OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
Viaarxiv icon