Xiatian Zhu

DiffSED: Sound Event Detection with Denoising Diffusion

Aug 14, 2023
Swapnil Bhosale, Sauradip Nag, Diptesh Kanojia, Jiankang Deng, Xiatian Zhu

Via arXiv

Actor-agnostic Multi-label Action Recognition with Multi-modal Query

Aug 08, 2023
Anindya Mondal, Sauradip Nag, Joaquin M Prada, Xiatian Zhu, Anjan Dutta

Via arXiv

MSQNet: Actor-agnostic Action Recognition with Multi-modal Query

Jul 20, 2023
Anindya Mondal, Sauradip Nag, Joaquin M Prada, Xiatian Zhu, Anjan Dutta

Via arXiv

Retrieving-to-Answer: Zero-Shot Video Question Answering with Frozen Large Language Models

Jun 15, 2023
Junting Pan, Ziyi Lin, Yuying Ge, Xiatian Zhu, Renrui Zhang, Yi Wang, Yu Qiao, Hongsheng Li

Via arXiv

HeadSculpt: Crafting 3D Head Avatars with Text

Jun 05, 2023
Xiao Han, Yukang Cao, Kai Han, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang, Kwan-Yee K. Wong

Via arXiv

DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion

Mar 27, 2023
Sauradip Nag, Xiatian Zhu, Jiankang Deng, Yi-Zhe Song, Tao Xiang

Via arXiv

Generative Semantic Segmentation

Mar 20, 2023
Jiaqi Chen, Jiachen Lu, Xiatian Zhu, Li Zhang

Via arXiv

PersonalTailor: Personalizing 2D Pattern Design from 3D Garment Point Clouds

Mar 17, 2023
Anran Qi, Sauradip Nag, Xiatian Zhu, Ariel Shamir

Via arXiv

FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks

Mar 04, 2023
Xiao Han, Xiatian Zhu, Licheng Yu, Li Zhang, Yi-Zhe Song, Tao Xiang

Via arXiv

Unsupervised Hashing via Similarity Distribution Calibration

Feb 15, 2023
Kam Woh Ng, Xiatian Zhu, Jiun Tian Hoe, Chee Seng Chan, Tianyu Zhang, Yi-Zhe Song, Tao Xiang

Via arXiv