
Yongqi Wang

Text-to-Song: Towards Controllable Music Generation Incorporating Vocals and Accompaniment

Apr 16, 2024
Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, Ruiqi Li, Fuming You, Zhou Zhao, Zhimeng Zhang

AUD-TGN: Advancing Action Unit Detection with Temporal Convolution and GPT-2 in Wild Audiovisual Contexts

Mar 20, 2024
Jun Yu, Zerui Zhang, Zhihong Wei, Gongpeng Zhao, Zhongpeng Cai, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu

Multimodal Fusion Method with Spatiotemporal Sequences and Relationship Learning for Valence-Arousal Estimation

Mar 20, 2024
Jun Yu, Gongpeng Zhao, Yongqi Wang, Zhihong Wei, Yang Zheng, Zerui Zhang, Zhongpeng Cai, Guochen Xie, Jichao Zhu, Wangyuan Zhu

Exploring Facial Expression Recognition through Semi-Supervised Pretraining and Temporal Modeling

Mar 19, 2024
Jun Yu, Zhihong Wei, Zhongpeng Cai, Gongpeng Zhao, Zerui Zhang, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu

Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt

Mar 18, 2024
Yongqi Wang, Ruofan Hu, Rongjie Huang, Zhiqing Hong, Ruiqi Li, Wenrui Liu, Fuming You, Tao Jin, Zhou Zhao

Data-free Multi-label Image Recognition via LLM-powered Prompt Tuning

Mar 02, 2024
Shuo Yang, Zirui Shang, Yongqi Wang, Derong Deng, Hongwei Chen, Qiyuan Cheng, Xinxiao Wu

Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer

Sep 14, 2023
Yongqi Wang, Jionghao Bai, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao

Make-A-Voice: Unified Voice Synthesis With Discrete Representation

May 30, 2023
Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Luping Liu, Zhenhui Ye, Ziyue Jiang, Chao Weng, Zhou Zhao, Dong Yu

Connecting Multi-modal Contrastive Representations

May 22, 2023
Zehan Wang, Yang Zhao, Xize Cheng, Haifeng Huang, Jiageng Liu, Li Tang, Linjun Li, Yongqi Wang, Aoxiong Yin, Ziang Zhang, Zhou Zhao

FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis

Jul 13, 2022
Yongqi Wang, Zhou Zhao
