Picture for Zhanyu Ma

Zhanyu Ma

SpecGen: Neural Spectral BRDF Generation via Spectral-Spatial Tri-plane Aggregation

Add code
Aug 24, 2025
Viaarxiv icon

MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision

Add code
Aug 11, 2025
Viaarxiv icon

PolarAnything: Diffusion-based Polarimetric Image Synthesis

Add code
Jul 24, 2025
Viaarxiv icon

FairHuman: Boosting Hand and Face Quality in Human Image Generation with Minimum Potential Delay Fairness in Diffusion Models

Add code
Jul 03, 2025
Viaarxiv icon

Towards Privacy-Preserving Fine-Grained Visual Classification via Hierarchical Learning from Label Proportions

Add code
May 29, 2025
Viaarxiv icon

DriveRX: A Vision-Language Reasoning Model for Cross-Task Autonomous Driving

Add code
May 27, 2025
Viaarxiv icon

Multimodal Conditional Information Bottleneck for Generalizable AI-Generated Image Detection

Add code
May 21, 2025
Viaarxiv icon

CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation

Add code
May 21, 2025
Viaarxiv icon

Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation

Add code
May 21, 2025
Viaarxiv icon

NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

Add code
Apr 19, 2025
Viaarxiv icon