Alert button
Picture for Jing Shi

Jing Shi

Alert button

VIXEN: Visual Text Comparison Network for Image Difference Captioning

Add code
Bookmark button
Alert button
Mar 14, 2024
Alexander Black, Jing Shi, Yifei Fan, Tu Bui, John Collomosse

Viaarxiv icon

Text-to-Audio Generation Synchronized with Videos

Add code
Bookmark button
Alert button
Mar 08, 2024
Shentong Mo, Jing Shi, Yapeng Tian

Figure 1 for Text-to-Audio Generation Synchronized with Videos
Figure 2 for Text-to-Audio Generation Synchronized with Videos
Figure 3 for Text-to-Audio Generation Synchronized with Videos
Figure 4 for Text-to-Audio Generation Synchronized with Videos
Viaarxiv icon

Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models

Add code
Bookmark button
Alert button
Feb 22, 2024
Yixuan Ren, Yang Zhou, Jimei Yang, Jing Shi, Difan Liu, Feng Liu, Mingi Kwon, Abhinav Shrivastava

Viaarxiv icon

A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction

Add code
Bookmark button
Alert button
Jul 30, 2023
Zefa Hu, Ziyi Ni, Jing Shi, Shuang Xu, Bo Xu

Figure 1 for A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction
Figure 2 for A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction
Figure 3 for A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction
Figure 4 for A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction
Viaarxiv icon

ViLaS: Integrating Vision and Language into Automatic Speech Recognition

Add code
Bookmark button
Alert button
May 31, 2023
Minglun Han, Feilong Chen, Ziyi Ni, Linghui Meng, Jing Shi, Shuang Xu, Bo Xu

Figure 1 for ViLaS: Integrating Vision and Language into Automatic Speech Recognition
Figure 2 for ViLaS: Integrating Vision and Language into Automatic Speech Recognition
Figure 3 for ViLaS: Integrating Vision and Language into Automatic Speech Recognition
Figure 4 for ViLaS: Integrating Vision and Language into Automatic Speech Recognition
Viaarxiv icon

DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment

Add code
Bookmark button
Alert button
May 22, 2023
Shentong Mo, Jing Shi, Yapeng Tian

Figure 1 for DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment
Figure 2 for DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment
Figure 3 for DiffAVA: Personalized Text-to-Audio Generation with Visual Alignment
Viaarxiv icon

Mixture of personality improved Spiking actor network for efficient multi-agent cooperation

Add code
Bookmark button
Alert button
May 10, 2023
Xiyun Li, Ziyi Ni, Jingqing Ruan, Linghui Meng, Jing Shi, Tielin Zhang, Bo Xu

Figure 1 for Mixture of personality improved Spiking actor network for efficient multi-agent cooperation
Figure 2 for Mixture of personality improved Spiking actor network for efficient multi-agent cooperation
Figure 3 for Mixture of personality improved Spiking actor network for efficient multi-agent cooperation
Figure 4 for Mixture of personality improved Spiking actor network for efficient multi-agent cooperation
Viaarxiv icon

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages

Add code
Bookmark button
Alert button
May 10, 2023
Feilong Chen, Minglun Han, Haozhi Zhao, Qingyang Zhang, Jing Shi, Shuang Xu, Bo Xu

Figure 1 for X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
Figure 2 for X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
Figure 3 for X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
Figure 4 for X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
Viaarxiv icon

InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning

Add code
Bookmark button
Alert button
Apr 06, 2023
Jing Shi, Wei Xiong, Zhe Lin, Hyun Joon Jung

Figure 1 for InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Figure 2 for InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Figure 3 for InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Figure 4 for InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
Viaarxiv icon