Yifan Zhu

Stanford University, Department of Electrical Engineering

Prior Diffusiveness and Regret in the Linear-Gaussian Bandit

Jan 05, 2026
Via arXiv

CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product

Nov 17, 2025
Via arXiv

More Than Irrational: Modeling Belief-Biased Agents

Nov 15, 2025
Via arXiv

TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making

Sep 10, 2025
Via arXiv

A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning

Sep 04, 2025
Via arXiv

C-Flat++: Towards a More Efficient and Powerful Framework for Continual Learning

Aug 26, 2025
Via arXiv

Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning

Jul 29, 2025
Via arXiv

Branch, or Layer? Zeroth-Order Optimization for Continual Learning of Vision-Language Models

Jun 14, 2025
Via arXiv

MASTER: Multi-Agent Security Through Exploration of Roles and Topological Structures -- A Comprehensive Framework

May 24, 2025
Via arXiv

DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization

May 22, 2025
Via arXiv