Alert button
Picture for Feilong Chen

Feilong Chen

Alert button

DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder

Add code
Bookmark button
Alert button
Nov 03, 2023
Tao Liu, Chenpeng Du, Shuai Fan, Feilong Chen, Kai Yu

Viaarxiv icon

ViLaS: Integrating Vision and Language into Automatic Speech Recognition

Add code
Bookmark button
Alert button
May 31, 2023
Minglun Han, Feilong Chen, Ziyi Ni, Linghui Meng, Jing Shi, Shuang Xu, Bo Xu

Figure 1 for ViLaS: Integrating Vision and Language into Automatic Speech Recognition
Figure 2 for ViLaS: Integrating Vision and Language into Automatic Speech Recognition
Figure 3 for ViLaS: Integrating Vision and Language into Automatic Speech Recognition
Figure 4 for ViLaS: Integrating Vision and Language into Automatic Speech Recognition
Viaarxiv icon

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages

Add code
Bookmark button
Alert button
May 10, 2023
Feilong Chen, Minglun Han, Haozhi Zhao, Qingyang Zhang, Jing Shi, Shuang Xu, Bo Xu

Figure 1 for X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
Figure 2 for X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
Figure 3 for X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
Figure 4 for X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
Viaarxiv icon

Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation

Add code
Bookmark button
Alert button
Jan 30, 2023
Minglun Han, Feilong Chen, Jing Shi, Shuang Xu, Bo Xu

Figure 1 for Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation
Figure 2 for Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation
Figure 3 for Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation
Figure 4 for Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation
Viaarxiv icon

An Online Sparse Streaming Feature Selection Algorithm

Add code
Bookmark button
Alert button
Aug 03, 2022
Feilong Chen, Di Wu, Jie Yang, Yi He

Figure 1 for An Online Sparse Streaming Feature Selection Algorithm
Figure 2 for An Online Sparse Streaming Feature Selection Algorithm
Figure 3 for An Online Sparse Streaming Feature Selection Algorithm
Figure 4 for An Online Sparse Streaming Feature Selection Algorithm
Viaarxiv icon

HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval

Add code
Bookmark button
Alert button
May 31, 2022
Feilong Chen, Xiuyi Chen, Jiaxin Shi, Duzhen Zhang, Jianlong Chang, Qi Tian

Figure 1 for HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval
Figure 2 for HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval
Figure 3 for HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval
Figure 4 for HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval
Viaarxiv icon

Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning

Add code
Bookmark button
Alert button
Apr 15, 2022
Feilong Chen, Xiuyi Chen, Shuang Xu, Bo Xu

Figure 1 for Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning
Figure 2 for Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning
Figure 3 for Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning
Figure 4 for Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning
Viaarxiv icon

VLP: A Survey on Vision-Language Pre-training

Add code
Bookmark button
Alert button
Feb 21, 2022
Feilong Chen, Duzhen Zhang, Minglun Han, Xiuyi Chen, Jing Shi, Shuang Xu, Bo Xu

Figure 1 for VLP: A Survey on Vision-Language Pre-training
Figure 2 for VLP: A Survey on Vision-Language Pre-training
Figure 3 for VLP: A Survey on Vision-Language Pre-training
Viaarxiv icon

Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation

Add code
Bookmark button
Alert button
Sep 17, 2021
Feilong Chen, Fandong Meng, Xiuyi Chen, Peng Li, Jie Zhou

Figure 1 for Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
Figure 2 for Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
Figure 3 for Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
Figure 4 for Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
Viaarxiv icon