Yu Xi

Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning

Jun 06, 2025 (via arXiv)

Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction

May 30, 2025 (via arXiv)

Masked Self-distilled Transducer-based Keyword Spotting with Semi-autoregressive Decoding

May 30, 2025 (via arXiv)

MFA-KWS: Effective Keyword Spotting with Multi-head Frame-asynchronous Decoding

May 26, 2025 (via arXiv)

UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook

Feb 27, 2025 (via arXiv)

Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario

Dec 24, 2024 (via arXiv)

NTC-KWS: Noise-aware CTC for Robust Keyword Spotting

Dec 17, 2024 (via arXiv)

Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency

Dec 17, 2024 (via arXiv)

A Survey on Speech Large Language Models

Oct 24, 2024 (via arXiv)

Semi-supervised Learning for Code-Switching ASR with Large Language Model Filter

Jul 05, 2024 (via arXiv)