Picture for Zhen Lei

Zhen Lei

Multimodal Causal-Driven Representation Learning for Generalizable Medical Image Segmentation

Add code
Aug 07, 2025
Viaarxiv icon

MM2CT: MR-to-CT translation for multi-modal image fusion with mamba

Add code
Aug 07, 2025
Viaarxiv icon

Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation

Add code
Jun 06, 2025
Figure 1 for Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Figure 2 for Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Figure 3 for Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Figure 4 for Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Viaarxiv icon

SA-Person: Text-Based Person Retrieval with Scene-aware Re-ranking

Add code
May 30, 2025
Viaarxiv icon

From Data to Modeling: Fully Open-vocabulary Scene Graph Generation

Add code
May 26, 2025
Viaarxiv icon

Benchmarking Unified Face Attack Detection via Hierarchical Prompt Tuning

Add code
May 19, 2025
Viaarxiv icon

MLLM-Enhanced Face Forgery Detection: A Vision-Language Fusion Solution

Add code
May 04, 2025
Viaarxiv icon

Compile Scene Graphs with Reinforcement Learning

Add code
Apr 18, 2025
Viaarxiv icon

Mixture-of-Attack-Experts with Class Regularization for Unified Physical-Digital Face Attack Detection

Add code
Apr 01, 2025
Viaarxiv icon

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Add code
Apr 01, 2025
Viaarxiv icon