Picture for Mingsian R. Bai

Mingsian R. Bai

A tunable binaural audio telepresence system capable of balancing immersive and enhanced modes

Add code
May 14, 2024
Viaarxiv icon

Spatial-Temporal Activity-Informed Diarization and Separation

Add code
Jan 30, 2024
Viaarxiv icon

Learning-based Array Configuration-Independent Binaural Audio Telepresence with Scalable Signal Enhancement and Ambience Preservation

Add code
Nov 21, 2023
Viaarxiv icon

Deep Beamforming for Speech Enhancement and Speaker Localization with an Array Response-Aware Loss Function

Add code
Oct 22, 2023
Viaarxiv icon

Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence

Add code
Apr 18, 2023
Viaarxiv icon

Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence

Add code
Nov 16, 2022
Viaarxiv icon

Multi-channel target speech enhancement based on ERB-scaled spatial coherence features

Add code
Jul 17, 2022
Figure 1 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Figure 2 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Figure 3 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Figure 4 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Viaarxiv icon

Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection

Add code
Jun 20, 2022
Figure 1 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Figure 2 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Figure 3 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Figure 4 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Viaarxiv icon

Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features

Add code
Dec 16, 2021
Figure 1 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Figure 2 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Figure 3 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Figure 4 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Viaarxiv icon