Zhen Lei

One Ring to Rule Them All: Unifying Group-Based RL via Dynamic Power-Mean Geometry

Jan 30, 2026
Via arXiv

UPA: Unsupervised Prompt Agent via Tree-Based Search and Selection

Jan 30, 2026
Via arXiv

Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning

Aug 28, 2025
Via arXiv

Pose-RFT: Enhancing MLLMs for 3D Pose Generation via Hybrid Action Reinforcement Fine-Tuning

Aug 11, 2025
Via arXiv

MM2CT: MR-to-CT translation for multi-modal image fusion with mamba

Aug 07, 2025
Via arXiv

F2PASeg: Feature Fusion for Pituitary Anatomy Segmentation in Endoscopic Surgery

Aug 07, 2025
Via arXiv

Multimodal Causal-Driven Representation Learning for Generalizable Medical Image Segmentation

Aug 07, 2025
Via arXiv

Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation

Jun 06, 2025
Via arXiv

SA-Person: Text-Based Person Retrieval with Scene-aware Re-ranking

May 30, 2025
Via arXiv

From Data to Modeling: Fully Open-vocabulary Scene Graph Generation

May 26, 2025
Via arXiv