Kai Yu

Boosting3D: High-Fidelity Image-to-3D by Boosting 2D Diffusion Prior to 3D Prior with Progressive Learning

Nov 22, 2023
Kai Yu, Jinlin Liu, Mengyang Feng, Miaomiao Cui, Xuansong Xie

In-Context Learning for MIMO Equalization Using Transformer-Based Sequence Models

Nov 10, 2023
Matteo Zecchin, Kai Yu, Osvaldo Simeone

DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder

Nov 03, 2023
Tao Liu, Chenpeng Du, Shuai Fan, Feilong Chen, Kai Yu

Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations

Nov 02, 2023
Hanglei Zhang, Yiwei Guo, Sen Liu, Xie Chen, Kai Yu

ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL

Oct 28, 2023
Ruisheng Cao, Hanchong Zhang, Hongshen Xu, Jieyu Li, Da Ma, Lu Chen, Kai Yu

ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought

Oct 26, 2023
Hanchong Zhang, Ruisheng Cao, Lu Chen, Hongshen Xu, Kai Yu

Acoustic BPE for Speech Generation with Discrete Tokens

Oct 23, 2023
Feiyu Shen, Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu

Towards Universal Speech Discrete Tokens: A Case Study for ASR and TTS

Sep 14, 2023
Yifan Yang, Feiyu Shen, Chenpeng Du, Ziyang Ma, Kai Yu, Daniel Povey, Xie Chen

VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching

Sep 10, 2023
Yiwei Guo, Chenpeng Du, Ziyang Ma, Xie Chen, Kai Yu

SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research

Aug 25, 2023
Liangtai Sun, Yang Han, Zihan Zhao, Da Ma, Zhennan Shen, Baocai Chen, Lu Chen, Kai Yu
