Alert button
Picture for Mingsian R. Bai

Mingsian R. Bai

Alert button

Spatial-Temporal Activity-Informed Diarization and Separation

Add code
Bookmark button
Alert button
Jan 30, 2024
Yicheng Hsu, Ssuhan Chen, Mingsian R. Bai

Viaarxiv icon

Learning-based Array Configuration-Independent Binaural Audio Telepresence with Scalable Signal Enhancement and Ambience Preservation

Add code
Bookmark button
Alert button
Nov 21, 2023
Yicheng Hsu, Mingsian R. Bai

Viaarxiv icon

Deep Beamforming for Speech Enhancement and Speaker Localization with an Array Response-Aware Loss Function

Add code
Bookmark button
Alert button
Oct 22, 2023
Hsinyu Chang, Yicheng Hsu, Mingsian R. Bai

Figure 1 for Deep Beamforming for Speech Enhancement and Speaker Localization with an Array Response-Aware Loss Function
Figure 2 for Deep Beamforming for Speech Enhancement and Speaker Localization with an Array Response-Aware Loss Function
Figure 3 for Deep Beamforming for Speech Enhancement and Speaker Localization with an Array Response-Aware Loss Function
Figure 4 for Deep Beamforming for Speech Enhancement and Speaker Localization with an Array Response-Aware Loss Function
Viaarxiv icon

Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence

Add code
Bookmark button
Alert button
Apr 18, 2023
Yicheng Hsu, Mingsian R. Bai

Figure 1 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Figure 2 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Figure 3 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Figure 4 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Viaarxiv icon

Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence

Add code
Bookmark button
Alert button
Nov 16, 2022
Yicheng Hsu, Yonghan Lee, Mingsian R. Bai

Figure 1 for Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence
Figure 2 for Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence
Figure 3 for Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence
Figure 4 for Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence
Viaarxiv icon

Multi-channel target speech enhancement based on ERB-scaled spatial coherence features

Add code
Bookmark button
Alert button
Jul 17, 2022
Yicheng Hsu, Yonghan Lee, Mingsian R. Bai

Figure 1 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Figure 2 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Figure 3 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Figure 4 for Multi-channel target speech enhancement based on ERB-scaled spatial coherence features
Viaarxiv icon

Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection

Add code
Bookmark button
Alert button
Jun 20, 2022
Yuan Chen, Yicheng Hsu, Mingsian R. Bai

Figure 1 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Figure 2 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Figure 3 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Figure 4 for Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection
Viaarxiv icon

Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features

Add code
Bookmark button
Alert button
Dec 16, 2021
Yicheng Hsu, Yonghan Lee, Mingsian R. Bai

Figure 1 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Figure 2 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Figure 3 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Figure 4 for Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features
Viaarxiv icon