Lei Zhu

More than Segmentation: Benchmarking SAM 3 for Segmentation, 3D Perception, and Reconstruction in Robotic Surgery

Dec 10, 2025
Via arXiv

AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs

Nov 18, 2025
Via arXiv

MTR-DuplexBench: Towards a Comprehensive Evaluation of Multi-Round Conversations for Full-Duplex Speech Language Models

Nov 13, 2025
Via arXiv

K-Stain: Keypoint-Driven Correspondence for H&E-to-IHC Virtual Staining

Nov 10, 2025
Via arXiv

LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer

Sep 26, 2025
Via arXiv

Toward Medical Deepfake Detection: A Comprehensive Dataset and Novel Method

Sep 19, 2025
Via arXiv

HybridMamba: A Dual-domain Mamba for 3D Medical Image Segmentation

Sep 18, 2025
Via arXiv

Think Before You Talk: Enhancing Meaningful Dialogue Generation in Full-Duplex Speech Language Models with Planning-Inspired Text Guidance

Aug 10, 2025
Via arXiv

EventRR: Event Referential Reasoning for Referring Video Object Segmentation

Aug 10, 2025
Via arXiv

S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything without Supervision

Aug 09, 2025
Via arXiv