Picture for Yuan Gong

Yuan Gong

TaleCrafter: Interactive Story Visualization with Multiple Characters

Add code
May 30, 2023
Figure 1 for TaleCrafter: Interactive Story Visualization with Multiple Characters
Figure 2 for TaleCrafter: Interactive Story Visualization with Multiple Characters
Figure 3 for TaleCrafter: Interactive Story Visualization with Multiple Characters
Figure 4 for TaleCrafter: Interactive Story Visualization with Multiple Characters
Viaarxiv icon

SAIL: Search-Augmented Instruction Learning

Add code
May 24, 2023
Figure 1 for SAIL: Search-Augmented Instruction Learning
Figure 2 for SAIL: Search-Augmented Instruction Learning
Figure 3 for SAIL: Search-Augmented Instruction Learning
Figure 4 for SAIL: Search-Augmented Instruction Learning
Viaarxiv icon

Listen, Think, and Understand

Add code
May 18, 2023
Figure 1 for Listen, Think, and Understand
Figure 2 for Listen, Think, and Understand
Figure 3 for Listen, Think, and Understand
Figure 4 for Listen, Think, and Understand
Viaarxiv icon

3D GAN Inversion with Facial Symmetry Prior

Add code
Nov 30, 2022
Figure 1 for 3D GAN Inversion with Facial Symmetry Prior
Figure 2 for 3D GAN Inversion with Facial Symmetry Prior
Figure 3 for 3D GAN Inversion with Facial Symmetry Prior
Figure 4 for 3D GAN Inversion with Facial Symmetry Prior
Viaarxiv icon

MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model

Add code
Oct 11, 2022
Figure 1 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 2 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 3 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 4 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Viaarxiv icon

Rethinking Knowledge Distillation via Cross-Entropy

Add code
Aug 22, 2022
Figure 1 for Rethinking Knowledge Distillation via Cross-Entropy
Figure 2 for Rethinking Knowledge Distillation via Cross-Entropy
Figure 3 for Rethinking Knowledge Distillation via Cross-Entropy
Figure 4 for Rethinking Knowledge Distillation via Cross-Entropy
Viaarxiv icon

UAVM: A Unified Model for Audio-Visual Learning

Add code
Jul 29, 2022
Figure 1 for UAVM: A Unified Model for Audio-Visual Learning
Figure 2 for UAVM: A Unified Model for Audio-Visual Learning
Figure 3 for UAVM: A Unified Model for Audio-Visual Learning
Figure 4 for UAVM: A Unified Model for Audio-Visual Learning
Viaarxiv icon

Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition

Add code
May 06, 2022
Figure 1 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition
Figure 2 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition
Figure 3 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition
Figure 4 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition
Viaarxiv icon

Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment

Add code
May 06, 2022
Figure 1 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 2 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 3 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 4 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Viaarxiv icon

Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network

Add code
Apr 22, 2022
Figure 1 for Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network
Figure 2 for Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network
Figure 3 for Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network
Figure 4 for Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network
Viaarxiv icon