Picture for Lefei Zhang

Lefei Zhang

CtrlAttack: A Unified Attack on World-Model Control in Diffusion Models

Add code
Mar 13, 2026
Viaarxiv icon

DSA-SRGS: Super-Resolution Gaussian Splatting for Dynamic Sparse-View DSA Reconstruction

Add code
Mar 05, 2026
Viaarxiv icon

SongSong: A Time Phonograph for Chinese SongCi Music from Thousand of Years Away

Add code
Feb 27, 2026
Viaarxiv icon

UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation

Add code
Jan 16, 2026
Viaarxiv icon

ClearAIR: A Human-Visual-Perception-Inspired All-in-One Image Restoration

Add code
Jan 06, 2026
Viaarxiv icon

OFL-SAM2: Prompt SAM2 with Online Few-shot Learner for Efficient Medical Image Segmentation

Add code
Dec 31, 2025
Viaarxiv icon

TGC-Net: A Structure-Aware and Semantically-Aligned Framework for Text-Guided Medical Image Segmentation

Add code
Dec 24, 2025
Viaarxiv icon

ClusIR: Towards Cluster-Guided All-in-One Image Restoration

Add code
Dec 11, 2025
Viaarxiv icon

DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving

Add code
Dec 08, 2025
Figure 1 for DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving
Figure 2 for DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving
Figure 3 for DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving
Figure 4 for DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving
Viaarxiv icon

CoViPAL: Layer-wise Contextualized Visual Token Pruning for Large Vision-Language Models

Add code
Aug 24, 2025
Viaarxiv icon