Alert button
Picture for Xuan Shi

Xuan Shi

Alert button

Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content

Add code
Bookmark button
Alert button
Jun 13, 2023
Tiantian Feng, Digbalay Bose, Xuan Shi, Shrikanth Narayanan

Figure 1 for Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content
Figure 2 for Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content
Figure 3 for Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content
Figure 4 for Unlocking Foundation Models for Privacy-Enhancing Speech Understanding: An Early Study on Low Resource Speech Training Leveraging Label-guided Synthetic Speech Content
Viaarxiv icon

A Review of Speech-centric Trustworthy Machine Learning: Privacy, Safety, and Fairness

Add code
Bookmark button
Alert button
Dec 18, 2022
Tiantian Feng, Rajat Hebbar, Nicholas Mehlman, Xuan Shi, Aditya Kommineni, and Shrikanth Narayanan

Figure 1 for A Review of Speech-centric Trustworthy Machine Learning: Privacy, Safety, and Fairness
Figure 2 for A Review of Speech-centric Trustworthy Machine Learning: Privacy, Safety, and Fairness
Figure 3 for A Review of Speech-centric Trustworthy Machine Learning: Privacy, Safety, and Fairness
Figure 4 for A Review of Speech-centric Trustworthy Machine Learning: Privacy, Safety, and Fairness
Viaarxiv icon

Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?

Add code
Bookmark button
Alert button
Nov 25, 2022
Xuan Shi, Erica Cooper, Xin Wang, Junichi Yamagishi, Shrikanth Narayanan

Figure 1 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 2 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 3 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 4 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Viaarxiv icon

Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms

Add code
Bookmark button
Alert button
Jul 24, 2021
Xuan Shi, Erica Cooper, Junichi Yamagishi

Figure 1 for Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms
Figure 2 for Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms
Figure 3 for Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms
Figure 4 for Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms
Viaarxiv icon

RepGN:Object Detection with Relational Proposal Graph Network

Add code
Bookmark button
Alert button
Apr 18, 2019
Xingjian Du, Xuan Shi, Risheng Huang

Figure 1 for RepGN:Object Detection with Relational Proposal Graph Network
Figure 2 for RepGN:Object Detection with Relational Proposal Graph Network
Figure 3 for RepGN:Object Detection with Relational Proposal Graph Network
Figure 4 for RepGN:Object Detection with Relational Proposal Graph Network
Viaarxiv icon

End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking

Add code
Bookmark button
Alert button
Jan 02, 2019
Xingjian Du, Mengyao Zhu, Xuan Shi, Xinpeng Zhang, Wen Zhang, Jingdong Chen

Figure 1 for End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking
Figure 2 for End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking
Figure 3 for End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking
Figure 4 for End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking
Viaarxiv icon