Hongyuan Zhang

ViewMask-1-to-3: Multi-View Consistent Image Generation via Multimodal Diffusion Models

Dec 16, 2025
Via arXiv

GRPO-RM: Fine-Tuning Representation Models via GRPO-Driven Reinforcement Learning

Nov 19, 2025
Via arXiv

Is Your VLM for Autonomous Driving Safety-Ready? A Comprehensive Benchmark for Evaluating External and In-Cabin Risks

Nov 19, 2025
Via arXiv

Explore How to Inject Beneficial Noise in MLLMs

Nov 17, 2025
Via arXiv

Rectified Noise: A Generative Model Using Positive-incentive Noise

Nov 12, 2025
Via arXiv

Laytrol: Preserving Pretrained Knowledge in Layout Control for Multimodal Diffusion Transformers

Nov 11, 2025
Via arXiv

CoLM: Collaborative Large Models via A Client-Server Paradigm

Nov 10, 2025
Via arXiv

Class-Aware Prototype Learning with Negative Contrast for Test-Time Adaptation of Vision-Language Models

Oct 22, 2025
Via arXiv

AI Flow: Perspectives, Scenarios, and Approaches

Jun 14, 2025
Via arXiv

Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning

Jun 06, 2025
Via arXiv