Xueliang Zhang

ARiSE: Auto-Regressive Multi-Channel Speech Enhancement

May 28, 2025

Two-stage Audio-Visual Target Speaker Extraction System for Real-Time Processing On Edge Device

May 28, 2025

Room Impulse Response as a Prompt for Acoustic Echo Cancellation

May 26, 2025

Multi-Channel Acoustic Echo Cancellation Based on Direction-of-Arrival Estimation

May 26, 2025

Temporal-Spectral-Spatial Unified Remote Sensing Dense Prediction

May 18, 2025

Listen to Extract: Onset-Prompted Target Speaker Extraction

May 08, 2025

Transforming Weather Data from Pixel to Latent Space

Mar 09, 2025

Robust Target Speaker Direction of Arrival Estimation

Dec 25, 2024

LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation

Nov 14, 2024

StreamAdapter: Efficient Test Time Adaptation from Contextual Streams

Nov 14, 2024