Picture for Yiteng Huang

Yiteng Huang

Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition

Add code
Jun 17, 2025
Viaarxiv icon

MASV: Speaker Verification with Global and Local Context Mamba

Add code
Dec 14, 2024
Figure 1 for MASV: Speaker Verification with Global and Local Context Mamba
Figure 2 for MASV: Speaker Verification with Global and Local Context Mamba
Figure 3 for MASV: Speaker Verification with Global and Local Context Mamba
Figure 4 for MASV: Speaker Verification with Global and Local Context Mamba
Viaarxiv icon

M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses

Add code
Sep 17, 2024
Figure 1 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Figure 2 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Figure 3 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Figure 4 for M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Viaarxiv icon

Effective Integration of KAN for Keyword Spotting

Add code
Sep 13, 2024
Viaarxiv icon

Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning

Add code
Aug 27, 2024
Figure 1 for Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning
Figure 2 for Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning
Figure 3 for Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning
Figure 4 for Query-by-Example Keyword Spotting Using Spectral-Temporal Graph Attentive Pooling and Multi-Task Learning
Viaarxiv icon

Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting

Add code
Aug 23, 2024
Figure 1 for Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting
Figure 2 for Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting
Figure 3 for Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting
Figure 4 for Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting
Viaarxiv icon

AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition

Add code
Jan 18, 2024
Figure 1 for AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition
Figure 2 for AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition
Figure 3 for AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition
Figure 4 for AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition
Viaarxiv icon

FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation

Add code
Jan 08, 2024
Viaarxiv icon

Directional Source Separation for Robust Speech Recognition on Smart Glasses

Add code
Sep 20, 2023
Figure 1 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Figure 2 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Figure 3 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Figure 4 for Directional Source Separation for Robust Speech Recognition on Smart Glasses
Viaarxiv icon

Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches

Add code
Feb 17, 2023
Figure 1 for Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches
Figure 2 for Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches
Figure 3 for Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches
Figure 4 for Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches
Viaarxiv icon