Alert button
Picture for Di Hu

Di Hu

Alert button

SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model

Add code
Bookmark button
Alert button
Mar 15, 2024
Tao Wu, Xuewei Li, Zhongang Qi, Di Hu, Xintao Wang, Ying Shan, Xi Li

Figure 1 for SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model
Figure 2 for SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model
Figure 3 for SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model
Figure 4 for SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model
Viaarxiv icon

Quantifying and Enhancing Multi-modal Robustness with Modality Preference

Add code
Bookmark button
Alert button
Feb 09, 2024
Zequn Yang, Yake Wei, Ce Liang, Di Hu

Viaarxiv icon

Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs

Add code
Bookmark button
Alert button
Nov 08, 2023
Wenke Xia, Dong Wang, Xincheng Pang, Zhigang Wang, Bin Zhao, Di Hu

Viaarxiv icon

Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer

Add code
Bookmark button
Alert button
Sep 18, 2023
Yaoting Wang, Weisong Liu, Guangyao Li, Jian Ding, Di Hu, Xi Li

Figure 1 for Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer
Figure 2 for Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer
Figure 3 for Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer
Figure 4 for Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer
Viaarxiv icon

Enhancing Multi-modal Cooperation via Fine-grained Modality Valuation

Add code
Bookmark button
Alert button
Sep 12, 2023
Yake Wei, Ruoxuan Feng, Zihe Wang, Di Hu

Viaarxiv icon

Progressive Spatio-temporal Perception for Audio-Visual Question Answering

Add code
Bookmark button
Alert button
Aug 10, 2023
Guangyao Li, Wenxuan Hou, Di Hu

Figure 1 for Progressive Spatio-temporal Perception for Audio-Visual Question Answering
Figure 2 for Progressive Spatio-temporal Perception for Audio-Visual Question Answering
Figure 3 for Progressive Spatio-temporal Perception for Audio-Visual Question Answering
Figure 4 for Progressive Spatio-temporal Perception for Audio-Visual Question Answering
Viaarxiv icon

Supervised Knowledge May Hurt Novel Class Discovery Performance

Add code
Bookmark button
Alert button
Jun 06, 2023
Ziyun Li, Jona Otholt, Ben Dai, Di Hu, Christoph Meinel, Haojin Yang

Figure 1 for Supervised Knowledge May Hurt Novel Class Discovery Performance
Figure 2 for Supervised Knowledge May Hurt Novel Class Discovery Performance
Figure 3 for Supervised Knowledge May Hurt Novel Class Discovery Performance
Figure 4 for Supervised Knowledge May Hurt Novel Class Discovery Performance
Viaarxiv icon

Multi-Scale Attention for Audio Question Answering

Add code
Bookmark button
Alert button
May 29, 2023
Guangyao Li, Yixin Xu, Di Hu

Figure 1 for Multi-Scale Attention for Audio Question Answering
Figure 2 for Multi-Scale Attention for Audio Question Answering
Figure 3 for Multi-Scale Attention for Audio Question Answering
Figure 4 for Multi-Scale Attention for Audio Question Answering
Viaarxiv icon

Robust Cross-Modal Knowledge Distillation for Unconstrained Videos

Add code
Bookmark button
Alert button
Apr 27, 2023
Wenke Xia, Xingjian Li, Andong Deng, Haoyi Xiong, Dejing Dou, Di Hu

Figure 1 for Robust Cross-Modal Knowledge Distillation for Unconstrained Videos
Figure 2 for Robust Cross-Modal Knowledge Distillation for Unconstrained Videos
Figure 3 for Robust Cross-Modal Knowledge Distillation for Unconstrained Videos
Figure 4 for Robust Cross-Modal Knowledge Distillation for Unconstrained Videos
Viaarxiv icon