Alert button
Picture for Yingming Gao

Yingming Gao

Alert button

Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation

Add code
Bookmark button
Alert button
Jan 02, 2024
Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li

Viaarxiv icon

Frame-level emotional state alignment method for speech emotion recognition

Add code
Bookmark button
Alert button
Dec 27, 2023
Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, Jinlong Xue, Yichen Han, Ya Li

Viaarxiv icon

CONCSS: Contrastive-based Context Comprehension for Dialogue-appropriate Prosody in Conversational Speech Synthesis

Add code
Bookmark button
Alert button
Dec 16, 2023
Yayue Deng, Jinlong Xue, Yukang Jia, Qifei Li, Yichen Han, Fengping Wang, Yingming Gao, Dengfeng Ke, Ya Li

Viaarxiv icon

Spoken Language Intelligence of Large Language Models for Language Learning

Add code
Bookmark button
Alert button
Aug 28, 2023
Linkai Peng, Baorian Nuchged, Yingming Gao

Figure 1 for Spoken Language Intelligence of Large Language Models for Language Learning
Figure 2 for Spoken Language Intelligence of Large Language Models for Language Learning
Figure 3 for Spoken Language Intelligence of Large Language Models for Language Learning
Figure 4 for Spoken Language Intelligence of Large Language Models for Language Learning
Viaarxiv icon

M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis

Add code
Bookmark button
Alert button
May 03, 2023
Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang

Figure 1 for M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis
Figure 2 for M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis
Figure 3 for M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis
Viaarxiv icon

A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis

Add code
Bookmark button
Alert button
Oct 07, 2022
Yichen Han, Ya Li, Yingming Gao, Jinlong Xue, Songpo Wang, Lei Yang

Figure 1 for A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis
Figure 2 for A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis
Figure 3 for A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis
Figure 4 for A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis
Viaarxiv icon

Text-Aware End-to-end Mispronunciation Detection and Diagnosis

Add code
Bookmark button
Alert button
Jun 15, 2022
Linkai Peng, Yingming Gao, Binghuai Lin, Dengfeng Ke, Yanlu Xie, Jinsong Zhang

Figure 1 for Text-Aware End-to-end Mispronunciation Detection and Diagnosis
Figure 2 for Text-Aware End-to-end Mispronunciation Detection and Diagnosis
Figure 3 for Text-Aware End-to-end Mispronunciation Detection and Diagnosis
Figure 4 for Text-Aware End-to-end Mispronunciation Detection and Diagnosis
Viaarxiv icon