Alert button
Picture for Yuan Gong

Yuan Gong

Alert button

Generic Knowledge Boosted Pre-training For Remote Sensing Images

Add code
Bookmark button
Alert button
Jan 21, 2024
Ziyue Huang, Mingming Zhang, Yuan Gong, Qingjie Liu, Yunhong Wang

Viaarxiv icon

Joint Audio and Speech Understanding

Add code
Bookmark button
Alert button
Oct 02, 2023
Yuan Gong, Alexander H. Liu, Hongyin Luo, Leonid Karlinsky, James Glass

Viaarxiv icon

Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning

Add code
Bookmark button
Alert button
Sep 19, 2023
Tianhua Zhang, Jiaxin Ge, Hongyin Luo, Yung-Sung Chuang, Mingye Gao, Yuan Gong, Xixin Wu, Yoon Kim, Helen Meng, James Glass

Figure 1 for Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
Figure 2 for Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
Figure 3 for Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
Figure 4 for Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
Viaarxiv icon

ToonTalker: Cross-Domain Face Reenactment

Add code
Bookmark button
Alert button
Aug 24, 2023
Yuan Gong, Yong Zhang, Xiaodong Cun, Fei Yin, Yanbo Fan, Xuan Wang, Baoyuan Wu, Yujiu Yang

Figure 1 for ToonTalker: Cross-Domain Face Reenactment
Figure 2 for ToonTalker: Cross-Domain Face Reenactment
Figure 3 for ToonTalker: Cross-Domain Face Reenactment
Figure 4 for ToonTalker: Cross-Domain Face Reenactment
Viaarxiv icon

Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation

Add code
Bookmark button
Alert button
Jul 13, 2023
Yingqing He, Menghan Xia, Haoxin Chen, Xiaodong Cun, Yuan Gong, Jinbo Xing, Yong Zhang, Xintao Wang, Chao Weng, Ying Shan, Qifeng Chen

Figure 1 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Figure 2 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Figure 3 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Figure 4 for Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation
Viaarxiv icon

Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers

Add code
Bookmark button
Alert button
Jul 06, 2023
Yuan Gong, Sameer Khurana, Leonid Karlinsky, James Glass

Figure 1 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Figure 2 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Figure 3 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Figure 4 for Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers
Viaarxiv icon

TaleCrafter: Interactive Story Visualization with Multiple Characters

Add code
Bookmark button
Alert button
May 30, 2023
Yuan Gong, Youxin Pang, Xiaodong Cun, Menghan Xia, Yingqing He, Haoxin Chen, Longyue Wang, Yong Zhang, Xintao Wang, Ying Shan, Yujiu Yang

Figure 1 for TaleCrafter: Interactive Story Visualization with Multiple Characters
Figure 2 for TaleCrafter: Interactive Story Visualization with Multiple Characters
Figure 3 for TaleCrafter: Interactive Story Visualization with Multiple Characters
Figure 4 for TaleCrafter: Interactive Story Visualization with Multiple Characters
Viaarxiv icon

SAIL: Search-Augmented Instruction Learning

Add code
Bookmark button
Alert button
May 24, 2023
Hongyin Luo, Yung-Sung Chuang, Yuan Gong, Tianhua Zhang, Yoon Kim, Xixin Wu, Danny Fox, Helen Meng, James Glass

Figure 1 for SAIL: Search-Augmented Instruction Learning
Figure 2 for SAIL: Search-Augmented Instruction Learning
Figure 3 for SAIL: Search-Augmented Instruction Learning
Figure 4 for SAIL: Search-Augmented Instruction Learning
Viaarxiv icon

Listen, Think, and Understand

Add code
Bookmark button
Alert button
May 18, 2023
Yuan Gong, Hongyin Luo, Alexander H. Liu, Leonid Karlinsky, James Glass

Figure 1 for Listen, Think, and Understand
Figure 2 for Listen, Think, and Understand
Figure 3 for Listen, Think, and Understand
Figure 4 for Listen, Think, and Understand
Viaarxiv icon