Yuan Gong

Generic Knowledge Boosted Pre-training For Remote Sensing Images

Jan 21, 2024
Via arXiv

Joint Audio and Speech Understanding

Oct 02, 2023
Via arXiv

Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning

Sep 19, 2023
Via arXiv

ToonTalker: Cross-Domain Face Reenactment

Aug 24, 2023
Via arXiv

Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation

Jul 13, 2023
Via arXiv

Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers

Jul 06, 2023
Via arXiv

TaleCrafter: Interactive Story Visualization with Multiple Characters

May 30, 2023
Via arXiv

SAIL: Search-Augmented Instruction Learning

May 24, 2023
Via arXiv

Listen, Think, and Understand

May 18, 2023
Via arXiv

3D GAN Inversion with Facial Symmetry Prior

Nov 30, 2022
Via arXiv