Alert button
Picture for Kai Yu

Kai Yu

Alert button

Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning

Add code
Bookmark button
Alert button
Aug 17, 2023
Chun-Mei Feng, Kai Yu, Yong Liu, Salman Khan, Wangmeng Zuo

Figure 1 for Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning
Figure 2 for Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning
Figure 3 for Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning
Figure 4 for Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning
Viaarxiv icon

Towards Instance-adaptive Inference for Federated Learning

Add code
Bookmark button
Alert button
Aug 17, 2023
Chun-Mei Feng, Kai Yu, Nian Liu, Xinxing Xu, Salman Khan, Wangmeng Zuo

Figure 1 for Towards Instance-adaptive Inference for Federated Learning
Figure 2 for Towards Instance-adaptive Inference for Federated Learning
Figure 3 for Towards Instance-adaptive Inference for Federated Learning
Figure 4 for Towards Instance-adaptive Inference for Federated Learning
Viaarxiv icon

DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech

Add code
Bookmark button
Alert button
Jun 25, 2023
Sen Liu, Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu

Figure 1 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Figure 2 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Figure 3 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Figure 4 for DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
Viaarxiv icon

UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding

Add code
Bookmark button
Alert button
Jun 18, 2023
Chenpeng Du, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu

Figure 1 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Figure 2 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Figure 3 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Figure 4 for UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
Viaarxiv icon

Improving Audio Caption Fluency with Automatic Error Correction

Add code
Bookmark button
Alert button
Jun 16, 2023
Hanxue Zhang, Zeyu Xie, Xuenan Xu, Mengyue Wu, Kai Yu

Figure 1 for Improving Audio Caption Fluency with Automatic Error Correction
Figure 2 for Improving Audio Caption Fluency with Automatic Error Correction
Figure 3 for Improving Audio Caption Fluency with Automatic Error Correction
Figure 4 for Improving Audio Caption Fluency with Automatic Error Correction
Viaarxiv icon

Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation

Add code
Bookmark button
Alert button
Jun 14, 2023
Zheng Liang, Zheshu Song, Ziyang Ma, Chenpeng Du, Kai Yu, Xie Chen

Figure 1 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Figure 2 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Figure 3 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Figure 4 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Viaarxiv icon

Large Language Model Is Semi-Parametric Reinforcement Learning Agent

Add code
Bookmark button
Alert button
Jun 09, 2023
Danyang Zhang, Lu Chen, Situo Zhang, Hongshen Xu, Zihan Zhao, Kai Yu

Figure 1 for Large Language Model Is Semi-Parametric Reinforcement Learning Agent
Figure 2 for Large Language Model Is Semi-Parametric Reinforcement Learning Agent
Figure 3 for Large Language Model Is Semi-Parametric Reinforcement Learning Agent
Figure 4 for Large Language Model Is Semi-Parametric Reinforcement Learning Agent
Viaarxiv icon