Zhisheng Zheng

BAT: Learning to Reason about Spatial Sounds with Large Language Models

Feb 02, 2024
Zhisheng Zheng, Puyuan Peng, Ziyang Ma, Xie Chen, Eunsol Choi, David Harwath

EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Jan 07, 2024
Wenxi Chen, Yuzhe Liang, Ziyang Ma, Zhisheng Zheng, Xie Chen

emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Dec 23, 2023
Ziyang Ma, Zhisheng Zheng, Jiaxin Ye, Jinchao Li, Zhifu Gao, Shiliang Zhang, Xie Chen

Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning

Sep 29, 2023
Guanrou Yang, Ziyang Ma, Zhisheng Zheng, Yakun Song, Zhikang Niu, Xie Chen

Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition

Sep 19, 2023
Ziyang Ma, Wen Wu, Zhisheng Zheng, Yiwei Guo, Qian Chen, Shiliang Zhang, Xie Chen

Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition

Aug 28, 2023
Zhisheng Zheng, Ziyang Ma, Yu Wang, Xie Chen

Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation

Jun 15, 2023
Ziyang Ma, Zhisheng Zheng, Guanrou Yang, Yu Wang, Chao Zhang, Xie Chen

Front-End Adapter: Adapting Front-End Input of Speech based Self-Supervised Learning for Speech Recognition

Feb 18, 2023
Xie Chen, Ziyang Ma, Changli Tang, Yujin Wang, Zhisheng Zheng

Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition

Oct 27, 2022
Yujin Wang, Changli Tang, Ziyang Ma, Zhisheng Zheng, Xie Chen, Wei-Qiang Zhang
