Yicheng Hsu

Spatial-Temporal Activity-Informed Diarization and Separation

Jan 30, 2024
Yicheng Hsu, Ssuhan Chen, Mingsian R. Bai

Learning-based Array Configuration-Independent Binaural Audio Telepresence with Scalable Signal Enhancement and Ambience Preservation

Nov 21, 2023
Yicheng Hsu, Mingsian R. Bai

Deep Beamforming for Speech Enhancement and Speaker Localization with an Array Response-Aware Loss Function

Oct 22, 2023
Hsinyu Chang, Yicheng Hsu, Mingsian R. Bai

Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence

Apr 18, 2023
Yicheng Hsu, Mingsian R. Bai

Learning-based Robust Speaker Counting and Separation with the Aid of Spatial Coherence

Mar 13, 2023
Yicheng Hsu, Mingsian Bai

Array Configuration-Agnostic Personalized Speech Enhancement using Long-Short-Term Spatial Coherence

Nov 16, 2022
Yicheng Hsu, Yonghan Lee, Mingsian R. Bai

Multi-channel target speech enhancement based on ERB-scaled spatial coherence features

Jul 17, 2022
Yicheng Hsu, Yonghan Lee, Mingsian R. Bai

Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection

Jun 20, 2022
Yuan Chen, Yicheng Hsu, Mingsian R. Bai

Acoustic echo suppression using a learning-based multi-frame minimum variance distortionless response filter

May 07, 2022
Yuefeng Tsai, Yicheng Hsu, Mingsian Bai
