Picture for Shiguang Shan

Shiguang Shan

HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding

Add code
Mar 17, 2025
Viaarxiv icon

Decoupled Doubly Contrastive Learning for Cross Domain Facial Action Unit Detection

Add code
Mar 12, 2025
Viaarxiv icon

Dynamically evolving segment anything model with continuous learning for medical image segmentation

Add code
Mar 08, 2025
Figure 1 for Dynamically evolving segment anything model with continuous learning for medical image segmentation
Figure 2 for Dynamically evolving segment anything model with continuous learning for medical image segmentation
Figure 3 for Dynamically evolving segment anything model with continuous learning for medical image segmentation
Figure 4 for Dynamically evolving segment anything model with continuous learning for medical image segmentation
Viaarxiv icon

MATS: An Audio Language Model under Text-only Supervision

Add code
Feb 20, 2025
Figure 1 for MATS: An Audio Language Model under Text-only Supervision
Figure 2 for MATS: An Audio Language Model under Text-only Supervision
Figure 3 for MATS: An Audio Language Model under Text-only Supervision
Figure 4 for MATS: An Audio Language Model under Text-only Supervision
Viaarxiv icon

G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models

Add code
Feb 07, 2025
Figure 1 for G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models
Figure 2 for G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models
Figure 3 for G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models
Figure 4 for G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models
Viaarxiv icon

M$^3$oralBench: A MultiModal Moral Benchmark for LVLMs

Add code
Dec 30, 2024
Viaarxiv icon

Multi-P$^2$A: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models

Add code
Dec 27, 2024
Viaarxiv icon

RefHCM: A Unified Model for Referring Perceptions in Human-Centric Scenarios

Add code
Dec 19, 2024
Figure 1 for RefHCM: A Unified Model for Referring Perceptions in Human-Centric Scenarios
Figure 2 for RefHCM: A Unified Model for Referring Perceptions in Human-Centric Scenarios
Figure 3 for RefHCM: A Unified Model for Referring Perceptions in Human-Centric Scenarios
Figure 4 for RefHCM: A Unified Model for Referring Perceptions in Human-Centric Scenarios
Viaarxiv icon

Autoregressive Video Generation without Vector Quantization

Add code
Dec 18, 2024
Figure 1 for Autoregressive Video Generation without Vector Quantization
Figure 2 for Autoregressive Video Generation without Vector Quantization
Figure 3 for Autoregressive Video Generation without Vector Quantization
Figure 4 for Autoregressive Video Generation without Vector Quantization
Viaarxiv icon

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

Add code
Nov 25, 2024
Figure 1 for UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
Figure 2 for UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
Figure 3 for UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
Figure 4 for UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing
Viaarxiv icon