Alert button
Picture for Dacheng Yin

Dacheng Yin

Alert button

ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models

Add code
Bookmark button
Alert button
Nov 30, 2023
Wenming Weng, Ruoyu Feng, Yanhui Wang, Qi Dai, Chunyu Wang, Dacheng Yin, Zhiyuan Zhao, Kai Qiu, Jianmin Bao, Yuhui Yuan, Chong Luo, Yueyi Zhang, Zhiwei Xiong

Viaarxiv icon

MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation

Add code
Bookmark button
Alert button
Nov 30, 2023
Yanhui Wang, Jianmin Bao, Wenming Weng, Ruoyu Feng, Dacheng Yin, Tao Yang, Jingxu Zhang, Qi Dai Zhiyuan Zhao, Chunyu Wang, Kai Qiu, Yuhui Yuan, Xiaoyan Sun, Chong Luo, Baining Guo

Viaarxiv icon

Learning Trajectories are Generalization Indicators

Add code
Bookmark button
Alert button
May 04, 2023
Jingwen Fu, Zhizheng Zhang, Dacheng Yin, Yan Lu, Nanning Zheng

Figure 1 for Learning Trajectories are Generalization Indicators
Figure 2 for Learning Trajectories are Generalization Indicators
Figure 3 for Learning Trajectories are Generalization Indicators
Figure 4 for Learning Trajectories are Generalization Indicators
Viaarxiv icon

Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss

Add code
Bookmark button
Alert button
Apr 12, 2023
Zhiyuan Zhao, Lijun Wu, Chuanxin Tang, Dacheng Yin, Yucheng Zhao, Chong Luo

Figure 1 for Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss
Figure 2 for Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss
Figure 3 for Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss
Figure 4 for Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss
Viaarxiv icon

TridentSE: Guiding Speech Enhancement with 32 Global Tokens

Add code
Bookmark button
Alert button
Oct 24, 2022
Dacheng Yin, Zhiyuan Zhao, Chuanxin Tang, Zhiwei Xiong, Chong Luo

Figure 1 for TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Figure 2 for TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Figure 3 for TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Figure 4 for TridentSE: Guiding Speech Enhancement with 32 Global Tokens
Viaarxiv icon

RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion

Add code
Bookmark button
Alert button
Jun 28, 2022
Dacheng Yin, Chuanxin Tang, Yanqing Liu, Xiaoqiang Wang, Zhiyuan Zhao, Yucheng Zhao, Zhiwei Xiong, Sheng Zhao, Chong Luo

Figure 1 for RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion
Figure 2 for RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion
Figure 3 for RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion
Figure 4 for RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion
Viaarxiv icon

Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph

Add code
Bookmark button
Alert button
Feb 24, 2022
Dacheng Yin, Xuanchi Ren, Chong Luo, Yuwang Wang, Zhiwei Xiong, Wenjun Zeng

Figure 1 for Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
Figure 2 for Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
Figure 3 for Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
Figure 4 for Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
Viaarxiv icon

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Add code
Bookmark button
Alert button
Sep 12, 2021
Chuanxin Tang, Chong Luo, Zhiyuan Zhao, Dacheng Yin, Yucheng Zhao, Wenjun Zeng

Figure 1 for Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Figure 2 for Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Figure 3 for Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Figure 4 for Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Viaarxiv icon

General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework

Add code
Bookmark button
Alert button
Feb 03, 2021
Yucheng Zhao, Dacheng Yin, Chong Luo, Zhiyuan Zhao, Chuanxin Tang, Wenjun Zeng, Zheng-Jun Zha

Figure 1 for General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework
Figure 2 for General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework
Figure 3 for General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework
Figure 4 for General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework
Viaarxiv icon