Shiguang Shan

Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection

Nov 18, 2024
Via arXiv

UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language Models

Nov 11, 2024
Via arXiv

Confidence Aware Learning for Reliable Face Anti-spoofing

Nov 02, 2024
Via arXiv

Face-MLLM: A Large Face Perception Model

Oct 28, 2024
Via arXiv

CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation

Oct 12, 2024
Via arXiv

HERM: Benchmarking and Enhancing Multimodal LLMs for Human-Centric Understanding

Oct 09, 2024
Via arXiv

Face Forgery Detection with Elaborate Backbone

Sep 25, 2024
Via arXiv

UniLearn: Enhancing Dynamic Facial Expression Recognition through Unified Pre-Training and Fine-Tuning on Images and Videos

Sep 10, 2024
Via arXiv

T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models

Jul 05, 2024
Via arXiv

Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs

Jun 27, 2024
Via arXiv