Alert button
Picture for Shiliang Zhang

Shiliang Zhang

Alert button

3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization

Add code
Bookmark button
Alert button
Mar 29, 2024
Yafeng Chen, Siqi Zheng, Hui Wang, Luyao Cheng, Tinglong Zhu, Changhe Song, Rongjie Huang, Ziyang Ma, Qian Chen, Shiliang Zhang, Xihao Li

Figure 1 for 3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization
Figure 2 for 3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization
Figure 3 for 3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization
Figure 4 for 3D-Speaker-Toolkit: An Open Source Toolkit for Multi-modal Speaker Verification and Diarization
Viaarxiv icon

Decoupled Contrastive Learning for Long-Tailed Recognition

Add code
Bookmark button
Alert button
Mar 10, 2024
Shiyu Xuan, Shiliang Zhang

Figure 1 for Decoupled Contrastive Learning for Long-Tailed Recognition
Figure 2 for Decoupled Contrastive Learning for Long-Tailed Recognition
Figure 3 for Decoupled Contrastive Learning for Long-Tailed Recognition
Figure 4 for Decoupled Contrastive Learning for Long-Tailed Recognition
Viaarxiv icon

An Embarrassingly Simple Approach for LLM with Strong ASR Capacity

Add code
Bookmark button
Alert button
Feb 13, 2024
Ziyang Ma, Guanrou Yang, Yifan Yang, Zhifu Gao, Jiaming Wang, Zhihao Du, Fan Yu, Qian Chen, Siqi Zheng, Shiliang Zhang, Xie Chen

Viaarxiv icon

LCB-net: Long-Context Biasing for Audio-Visual Speech Recognition

Add code
Bookmark button
Alert button
Jan 12, 2024
Fan Yu, Haoxu Wang, Xian Shi, Shiliang Zhang

Viaarxiv icon

E-chat: Emotion-sensitive Spoken Dialogue System with Large Language Models

Add code
Bookmark button
Alert button
Jan 06, 2024
Hongfei Xue, Yuhao Liang, Bingshen Mu, Shiliang Zhang, Mengzhe Chen, Qian Chen, Lei Xie

Viaarxiv icon

emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Add code
Bookmark button
Alert button
Dec 23, 2023
Ziyang Ma, Zhisheng Zheng, Jiaxin Ye, Jinchao Li, Zhifu Gao, Shiliang Zhang, Xie Chen

Viaarxiv icon

Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures

Add code
Bookmark button
Alert button
Dec 19, 2023
Lingyun Zuo, Keyu An, Shiliang Zhang, Zhijie Yan

Viaarxiv icon

Hourglass-AVSR: Down-Up Sampling-based Computational Efficiency Model for Audio-Visual Speech Recognition

Add code
Bookmark button
Alert button
Dec 14, 2023
Fan Yu, Haoxu Wang, Ziyang Ma, Shiliang Zhang

Viaarxiv icon

Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models

Add code
Bookmark button
Alert button
Nov 14, 2023
Yunfei Chu, Jin Xu, Xiaohuan Zhou, Qian Yang, Shiliang Zhang, Zhijie Yan, Chang Zhou, Jingren Zhou

Viaarxiv icon