Yuxuan Gu

SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution

Apr 21, 2026
Via arXiv

Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play

Apr 20, 2026
Via arXiv

Can Large Language Models Simulate Human Cognition Beyond Behavioral Imitation?

Mar 29, 2026
Via arXiv

Learning Pore-scale Multiphase Flow from 4D Velocimetry

Mar 12, 2026
Via arXiv

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Mar 04, 2026
Via arXiv

PERSONA: Dynamic and Compositional Inference-Time Personality Control via Activation Vector Algebra

Feb 17, 2026
Via arXiv

KromHC: Manifold-Constrained Hyper-Connections with Kronecker-Product Residual Matrices

Jan 29, 2026
Via arXiv

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Jan 07, 2026
Via arXiv

Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning

Nov 19, 2025
Via arXiv

Adaptive Backtracking for Privacy Protection in Large Language Models

Aug 08, 2025
Via arXiv