Chuanxin Tang

Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss

Apr 12, 2023
Zhiyuan Zhao, Lijun Wu, Chuanxin Tang, Dacheng Yin, Yucheng Zhao, Chong Luo

Streaming Video Model

Mar 30, 2023
Yucheng Zhao, Chong Luo, Chuanxin Tang, Dongdong Chen, Noel Codella, Zheng-Jun Zha

Look Before You Match: Instance Understanding Matters in Video Object Segmentation

Dec 13, 2022
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang

TridentSE: Guiding Speech Enhancement with 32 Global Tokens

Oct 24, 2022
Dacheng Yin, Zhiyuan Zhao, Chuanxin Tang, Zhiwei Xiong, Chong Luo

An Anchor-Free Detector for Continuous Speech Keyword Spotting

Aug 09, 2022
Zhiyuan Zhao, Chuanxin Tang, Chengdong Yao, Chong Luo

RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion

Jun 28, 2022
Dacheng Yin, Chuanxin Tang, Yanqing Liu, Xiaoqiang Wang, Zhiyuan Zhao, Yucheng Zhao, Zhiwei Xiong, Sheng Zhao, Chong Luo

When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism

Jan 26, 2022
Guangting Wang, Yucheng Zhao, Chuanxin Tang, Chong Luo, Wenjun Zeng

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Sep 12, 2021
Chuanxin Tang, Chong Luo, Zhiyuan Zhao, Dacheng Yin, Yucheng Zhao, Wenjun Zeng

Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?

Sep 12, 2021
Chuanxin Tang, Yucheng Zhao, Guangting Wang, Chong Luo, Wenxuan Xie, Wenjun Zeng

A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP

Aug 30, 2021
Yucheng Zhao, Guangting Wang, Chuanxin Tang, Chong Luo, Wenjun Zeng, Zheng-Jun Zha
