Alert button
Picture for Yuguang Yang

Yuguang Yang

Alert button

PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System

Add code
Bookmark button
Alert button
Sep 28, 2023
Xiang Lyu, Yuhang Cao, Qing Wang, Jingjing Yin, Yuguang Yang, Pengpeng Zou, Yanni Hu, Heng Lu

Figure 1 for PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System
Figure 2 for PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System
Figure 3 for PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System
Figure 4 for PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System
Viaarxiv icon

PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts

Add code
Bookmark button
Alert button
Sep 17, 2023
Jixun Yao, Yuguang Yang, Yi Lei, Ziqian Ning, Yanni Hu, Yu Pan, Jingjing Yin, Hongbin Zhou, Heng Lu, Lei Xie

Figure 1 for PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts
Figure 2 for PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts
Figure 3 for PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts
Figure 4 for PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts
Viaarxiv icon

GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition

Add code
Bookmark button
Alert button
Jun 16, 2023
Yu Pan, Yanni Hu, Yuguang Yang, Jixun Yao, Wen Fei, Lei Ma, Heng Lu

Figure 1 for GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition
Figure 2 for GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition
Figure 3 for GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition
Viaarxiv icon

Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models

Add code
Bookmark button
Alert button
Jun 11, 2023
Yuguang Yang, Yiming Wang, Shupeng Geng, Runqi Wang, Yimi Wang, Sheng Wu, Baochang Zhang

Figure 1 for Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models
Figure 2 for Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models
Figure 3 for Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models
Figure 4 for Self-Enhancement Improves Text-Image Retrieval in Foundation Visual-Language Models
Viaarxiv icon

Decom--CAM: Tell Me What You See, In Details! Feature-Level Interpretation via Decomposition Class Activation Map

Add code
Bookmark button
Alert button
May 27, 2023
Yuguang Yang, Runtang Guo, Sheng Wu, Yimi Wang, Juan Zhang, Xuan Gong, Baochang Zhang

Figure 1 for Decom--CAM: Tell Me What You See, In Details! Feature-Level Interpretation via Decomposition Class Activation Map
Figure 2 for Decom--CAM: Tell Me What You See, In Details! Feature-Level Interpretation via Decomposition Class Activation Map
Figure 3 for Decom--CAM: Tell Me What You See, In Details! Feature-Level Interpretation via Decomposition Class Activation Map
Figure 4 for Decom--CAM: Tell Me What You See, In Details! Feature-Level Interpretation via Decomposition Class Activation Map
Viaarxiv icon

HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism

Add code
Bookmark button
Alert button
Mar 15, 2023
Yuguang Yang, Yu Pan, Jingjing Yin, Jiangyu Han, Lei Ma, Heng Lu

Figure 1 for HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
Figure 2 for HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
Figure 3 for HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
Figure 4 for HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism
Viaarxiv icon

LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition

Add code
Bookmark button
Alert button
Dec 05, 2022
Yuguang Yang, Yu Pan, Jingjing Yin, Heng Lu

Figure 1 for LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition
Figure 2 for LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition
Figure 3 for LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition
Figure 4 for LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition
Viaarxiv icon

Improving fairness in speaker verification via Group-adapted Fusion Network

Add code
Bookmark button
Alert button
Feb 23, 2022
Hua Shen, Yuguang Yang, Guoli Sun, Ryan Langman, Eunjung Han, Jasha Droppo, Andreas Stolcke

Figure 1 for Improving fairness in speaker verification via Group-adapted Fusion Network
Figure 2 for Improving fairness in speaker verification via Group-adapted Fusion Network
Figure 3 for Improving fairness in speaker verification via Group-adapted Fusion Network
Figure 4 for Improving fairness in speaker verification via Group-adapted Fusion Network
Viaarxiv icon

Self-supervised Speaker Recognition Training Using Human-Machine Dialogues

Add code
Bookmark button
Alert button
Feb 07, 2022
Metehan Cekic, Ruirui Li, Zeya Chen, Yuguang Yang, Andreas Stolcke, Upamanyu Madhow

Figure 1 for Self-supervised Speaker Recognition Training Using Human-Machine Dialogues
Figure 2 for Self-supervised Speaker Recognition Training Using Human-Machine Dialogues
Figure 3 for Self-supervised Speaker Recognition Training Using Human-Machine Dialogues
Figure 4 for Self-supervised Speaker Recognition Training Using Human-Machine Dialogues
Viaarxiv icon