Picture for Weinan Zhang

Weinan Zhang

Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis

Add code
Mar 11, 2026
Viaarxiv icon

Revealing Behavioral Plasticity in Large Language Models: A Token-Conditional Perspective

Add code
Mar 09, 2026
Viaarxiv icon

PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval

Add code
Mar 02, 2026
Viaarxiv icon

MuonRec: Shifting the Optimizer Paradigm Beyond Adam in Scalable Generative Recommendation

Add code
Feb 28, 2026
Viaarxiv icon

Beyond Imitation: Reinforcement Learning-Based Sim-Real Co-Training for VLA Models

Add code
Feb 16, 2026
Viaarxiv icon

LogitsCoder: Towards Efficient Chain-of-Thought Path Search via Logits Preference Decoding for Code Generation

Add code
Feb 15, 2026
Viaarxiv icon

Plan-MCTS: Plan Exploration for Action Exploitation in Web Navigation

Add code
Feb 15, 2026
Viaarxiv icon

Adaptive Milestone Reward for GUI Agents

Add code
Feb 12, 2026
Viaarxiv icon

OSCAR: Optimization-Steered Agentic Planning for Composed Image Retrieval

Add code
Feb 09, 2026
Viaarxiv icon

MARTI-MARS$^2$: Scaling Multi-Agent Self-Search via Reinforcement Learning for Code Generation

Add code
Feb 08, 2026
Viaarxiv icon