Picture for Zhanyu Ma

Zhanyu Ma

Recolour What Matters: Region-Aware Colour Editing via Token-Level Diffusion

Add code
Mar 19, 2026
Viaarxiv icon

PolGS++: Physically-Guided Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction

Add code
Mar 11, 2026
Viaarxiv icon

Seeing as Experts Do: A Knowledge-Augmented Agent for Open-Set Fine-Grained Visual Understanding

Add code
Mar 04, 2026
Viaarxiv icon

EvalMVX: A Unified Benchmarking for Neural 3D Reconstruction under Diverse Multiview Setups

Add code
Mar 04, 2026
Viaarxiv icon

Benchmarking Semantic Segmentation Models via Appearance and Geometry Attribute Editing

Add code
Mar 02, 2026
Viaarxiv icon

Generative Visual Chain-of-Thought for Image Editing

Add code
Mar 02, 2026
Viaarxiv icon

Hepato-LLaVA: An Expert MLLM with Sparse Topo-Pack Attention for Hepatocellular Pathology Analysis on Whole Slide Images

Add code
Feb 26, 2026
Viaarxiv icon

Geometric Image Editing via Effects-Sensitive In-Context Inpainting with Diffusion Transformers

Add code
Feb 09, 2026
Viaarxiv icon

State Rank Dynamics in Linear Attention LLMs

Add code
Feb 02, 2026
Viaarxiv icon

LTS-VoiceAgent: A Listen-Think-Speak Framework for Efficient Streaming Voice Interaction via Semantic Triggering and Incremental Reasoning

Add code
Jan 26, 2026
Viaarxiv icon