Picture for Xiaobin Hu

Xiaobin Hu

RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network

Add code
Jun 26, 2024
Figure 1 for RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
Figure 2 for RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
Figure 3 for RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
Figure 4 for RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
Viaarxiv icon

AnyMaker: Zero-shot General Object Customization via Decoupled Dual-Level ID Injection

Add code
Jun 17, 2024
Viaarxiv icon

Open-Vocabulary SAM3D: Understand Any 3D Scene

Add code
May 24, 2024
Viaarxiv icon

Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection

Add code
Mar 18, 2024
Figure 1 for Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection
Figure 2 for Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection
Figure 3 for Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection
Figure 4 for Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection
Viaarxiv icon

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Add code
Mar 17, 2024
Figure 1 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Figure 2 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Figure 3 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Figure 4 for ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Viaarxiv icon

PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models

Add code
Mar 11, 2024
Figure 1 for PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models
Figure 2 for PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models
Figure 3 for PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models
Figure 4 for PointSeg: A Training-Free Paradigm for 3D Scene Segmentation via Foundation Models
Viaarxiv icon

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

Add code
Mar 10, 2024
Figure 1 for DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
Figure 2 for DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
Figure 3 for DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
Figure 4 for DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
Viaarxiv icon

Joint-Individual Fusion Structure with Fusion Attention Module for Multi-Modal Skin Cancer Classification

Add code
Dec 07, 2023
Figure 1 for Joint-Individual Fusion Structure with Fusion Attention Module for Multi-Modal Skin Cancer Classification
Figure 2 for Joint-Individual Fusion Structure with Fusion Attention Module for Multi-Modal Skin Cancer Classification
Figure 3 for Joint-Individual Fusion Structure with Fusion Attention Module for Multi-Modal Skin Cancer Classification
Figure 4 for Joint-Individual Fusion Structure with Fusion Attention Module for Multi-Modal Skin Cancer Classification
Viaarxiv icon

SR-R$^2$KAC: Improving Single Image Defocus Deblurring

Add code
Jul 30, 2023
Figure 1 for SR-R$^2$KAC: Improving Single Image Defocus Deblurring
Figure 2 for SR-R$^2$KAC: Improving Single Image Defocus Deblurring
Figure 3 for SR-R$^2$KAC: Improving Single Image Defocus Deblurring
Figure 4 for SR-R$^2$KAC: Improving Single Image Defocus Deblurring
Viaarxiv icon

Application of the nnU-Net for automatic segmentation of lung lesion on CT images, and implication on radiomic models

Add code
Sep 24, 2022
Figure 1 for Application of the nnU-Net for automatic segmentation of lung lesion on CT images, and implication on radiomic models
Figure 2 for Application of the nnU-Net for automatic segmentation of lung lesion on CT images, and implication on radiomic models
Figure 3 for Application of the nnU-Net for automatic segmentation of lung lesion on CT images, and implication on radiomic models
Figure 4 for Application of the nnU-Net for automatic segmentation of lung lesion on CT images, and implication on radiomic models
Viaarxiv icon