Picture for Xinyuan Qian

Xinyuan Qian

Exploring Length Generalization For Transformer-based Speech Enhancement

Add code
Jun 07, 2025
Viaarxiv icon

FIGhost: Fluorescent Ink-based Stealthy and Flexible Backdoor Attacks on Physical Traffic Sign Recognition

Add code
May 17, 2025
Viaarxiv icon

Audio-Visual Class-Incremental Learning for Fish Feeding intensity Assessment in Aquaculture

Add code
Apr 21, 2025
Viaarxiv icon

FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles

Add code
Jan 02, 2025
Figure 1 for FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles
Figure 2 for FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles
Figure 3 for FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles
Figure 4 for FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles
Viaarxiv icon

Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition

Add code
Jan 01, 2025
Figure 1 for Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition
Figure 2 for Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition
Figure 3 for Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition
Figure 4 for Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition
Viaarxiv icon

I2TTS: Image-indicated Immersive Text-to-speech Synthesis with Spatial Perception

Add code
Nov 20, 2024
Viaarxiv icon

SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model

Add code
Nov 12, 2024
Figure 1 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Figure 2 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Figure 3 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Figure 4 for SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model
Viaarxiv icon

Analytic Class Incremental Learning for Sound Source Localization with Privacy Protection

Add code
Sep 11, 2024
Figure 1 for Analytic Class Incremental Learning for Sound Source Localization with Privacy Protection
Figure 2 for Analytic Class Incremental Learning for Sound Source Localization with Privacy Protection
Figure 3 for Analytic Class Incremental Learning for Sound Source Localization with Privacy Protection
Figure 4 for Analytic Class Incremental Learning for Sound Source Localization with Privacy Protection
Viaarxiv icon

Text-Queried Target Sound Event Localization

Add code
Jun 23, 2024
Figure 1 for Text-Queried Target Sound Event Localization
Figure 2 for Text-Queried Target Sound Event Localization
Figure 3 for Text-Queried Target Sound Event Localization
Figure 4 for Text-Queried Target Sound Event Localization
Viaarxiv icon

An Exploration of Length Generalization in Transformer-Based Speech Enhancement

Add code
Jun 17, 2024
Figure 1 for An Exploration of Length Generalization in Transformer-Based Speech Enhancement
Figure 2 for An Exploration of Length Generalization in Transformer-Based Speech Enhancement
Figure 3 for An Exploration of Length Generalization in Transformer-Based Speech Enhancement
Figure 4 for An Exploration of Length Generalization in Transformer-Based Speech Enhancement
Viaarxiv icon