Alert button
Picture for Yuan Gong

Yuan Gong

Alert button

3D GAN Inversion with Facial Symmetry Prior

Add code
Bookmark button
Alert button
Nov 30, 2022
Fei Yin, Yong Zhang, Xuan Wang, Tengfei Wang, Xiaoyu Li, Yuan Gong, Yanbo Fan, Xiaodong Cun, Ying Shan, Cengiz Oztireli, Yujiu Yang

Figure 1 for 3D GAN Inversion with Facial Symmetry Prior
Figure 2 for 3D GAN Inversion with Facial Symmetry Prior
Figure 3 for 3D GAN Inversion with Facial Symmetry Prior
Figure 4 for 3D GAN Inversion with Facial Symmetry Prior
Viaarxiv icon

MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model

Add code
Bookmark button
Alert button
Oct 11, 2022
Yatai Ji, Junjie Wang, Yuan Gong, Lin Zhang, Yanru Zhu, Hongfa Wang, Jiaxing Zhang, Tetsuya Sakai, Yujiu Yang

Figure 1 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 2 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 3 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Figure 4 for MAP: Modality-Agnostic Uncertainty-Aware Vision-Language Pre-training Model
Viaarxiv icon

Rethinking Knowledge Distillation via Cross-Entropy

Add code
Bookmark button
Alert button
Aug 22, 2022
Zhendong Yang, Zhe Li, Yuan Gong, Tianke Zhang, Shanshan Lao, Chun Yuan, Yu Li

Figure 1 for Rethinking Knowledge Distillation via Cross-Entropy
Figure 2 for Rethinking Knowledge Distillation via Cross-Entropy
Figure 3 for Rethinking Knowledge Distillation via Cross-Entropy
Figure 4 for Rethinking Knowledge Distillation via Cross-Entropy
Viaarxiv icon

UAVM: A Unified Model for Audio-Visual Learning

Add code
Bookmark button
Alert button
Jul 29, 2022
Yuan Gong, Alexander H. Liu, Andrew Rouditchenko, James Glass

Figure 1 for UAVM: A Unified Model for Audio-Visual Learning
Figure 2 for UAVM: A Unified Model for Audio-Visual Learning
Figure 3 for UAVM: A Unified Model for Audio-Visual Learning
Figure 4 for UAVM: A Unified Model for Audio-Visual Learning
Viaarxiv icon

Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition

Add code
Bookmark button
Alert button
May 06, 2022
Yuan Gong, Jin Yu, James Glass

Figure 1 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition
Figure 2 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition
Figure 3 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition
Figure 4 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition
Viaarxiv icon

Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment

Add code
Bookmark button
Alert button
May 06, 2022
Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James Glass

Figure 1 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 2 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 3 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 4 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Viaarxiv icon

Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network

Add code
Bookmark button
Alert button
Apr 22, 2022
Shanshan Lao, Yuan Gong, Shuwei Shi, Sidi Yang, Tianhe Wu, Jiahao Wang, Weihao Xia, Yujiu Yang

Figure 1 for Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network
Figure 2 for Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network
Figure 3 for Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network
Figure 4 for Attentions Help CNNs See Better: Attention-based Hybrid Image Quality Assessment Network
Viaarxiv icon

MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment

Add code
Bookmark button
Alert button
Apr 21, 2022
Sidi Yang, Tianhe Wu, Shuwei Shi, Shanshan Lao, Yuan Gong, Mingdeng Cao, Jiahao Wang, Yujiu Yang

Figure 1 for MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
Figure 2 for MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
Figure 3 for MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
Figure 4 for MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
Viaarxiv icon

CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification

Add code
Bookmark button
Alert button
Mar 13, 2022
Yuan Gong, Sameer Khurana, Andrew Rouditchenko, James Glass

Figure 1 for CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification
Figure 2 for CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification
Figure 3 for CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification
Figure 4 for CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification
Viaarxiv icon

Focal and Global Knowledge Distillation for Detectors

Add code
Bookmark button
Alert button
Nov 23, 2021
Zhendong Yang, Zhe Li, Xiaohu Jiang, Yuan Gong, Zehuan Yuan, Danpei Zhao, Chun Yuan

Figure 1 for Focal and Global Knowledge Distillation for Detectors
Figure 2 for Focal and Global Knowledge Distillation for Detectors
Figure 3 for Focal and Global Knowledge Distillation for Detectors
Figure 4 for Focal and Global Knowledge Distillation for Detectors
Viaarxiv icon