Alert button
Picture for Xiatian Zhu

Xiatian Zhu

Alert button

Source-Free Domain Adaptation with Frozen Multimodal Foundation Model

Nov 27, 2023
Song Tang, Wenxin Su, Mao Ye, Xiatian Zhu

Viaarxiv icon

DreamCreature: Crafting Photorealistic Virtual Creatures from Imagination

Nov 27, 2023
Kam Woh Ng, Xiatian Zhu, Yi-Zhe Song, Tao Xiang

Figure 1 for DreamCreature: Crafting Photorealistic Virtual Creatures from Imagination
Figure 2 for DreamCreature: Crafting Photorealistic Virtual Creatures from Imagination
Figure 3 for DreamCreature: Crafting Photorealistic Virtual Creatures from Imagination
Figure 4 for DreamCreature: Crafting Photorealistic Virtual Creatures from Imagination
Viaarxiv icon

Adaptive-Labeling for Enhancing Remote Sensing Cloud Understanding

Nov 09, 2023
Jay Gala, Sauradip Nag, Huichou Huang, Ruirui Liu, Xiatian Zhu

Viaarxiv icon

Recognize Any Regions

Nov 02, 2023
Haosen Yang, Chuofan Ma, Bin Wen, Yi Jiang, Zehuan Yuan, Xiatian Zhu

Viaarxiv icon

Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping

Oct 19, 2023
Zijie Pan, Jiachen Lu, Xiatian Zhu, Li Zhang

Viaarxiv icon

Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting

Oct 16, 2023
Zeyu Yang, Hongye Yang, Zijie Pan, Xiatian Zhu, Li Zhang

Viaarxiv icon

Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection

Sep 29, 2023
Swapnil Bhosale, Abhra Chaudhuri, Alex Lee Robert Williams, Divyank Tiwari, Anjan Dutta, Xiatian Zhu, Pushpak Bhattacharyya, Diptesh Kanojia

Figure 1 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 2 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 3 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Figure 4 for Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection
Viaarxiv icon

Leveraging Foundation models for Unsupervised Audio-Visual Segmentation

Sep 13, 2023
Swapnil Bhosale, Haosen Yang, Diptesh Kanojia, Xiatian Zhu

Figure 1 for Leveraging Foundation models for Unsupervised Audio-Visual Segmentation
Figure 2 for Leveraging Foundation models for Unsupervised Audio-Visual Segmentation
Figure 3 for Leveraging Foundation models for Unsupervised Audio-Visual Segmentation
Figure 4 for Leveraging Foundation models for Unsupervised Audio-Visual Segmentation
Viaarxiv icon

DiffSED: Sound Event Detection with Denoising Diffusion

Aug 16, 2023
Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia, Jiankang Deng, Xiatian Zhu

Figure 1 for DiffSED: Sound Event Detection with Denoising Diffusion
Figure 2 for DiffSED: Sound Event Detection with Denoising Diffusion
Figure 3 for DiffSED: Sound Event Detection with Denoising Diffusion
Figure 4 for DiffSED: Sound Event Detection with Denoising Diffusion
Viaarxiv icon