Xiaocheng Feng

SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution

Apr 21, 2026

Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play

Apr 20, 2026

ImplicitMemBench: Measuring Unconscious Behavioral Adaptation in Large Language Models

Apr 09, 2026

Not All Tokens See Equally: Perception-Grounded Policy Optimization for Large Vision-Language Models

Apr 02, 2026

Can Large Language Models Simulate Human Cognition Beyond Behavioral Imitation?

Mar 29, 2026

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Mar 04, 2026

PERSONA: Dynamic and Compositional Inference-Time Personality Control via Activation Vector Algebra

Feb 17, 2026

Fine-Mem: Fine-Grained Feedback Alignment for Long-Horizon Memory Management

Jan 13, 2026

WebAnchor: Anchoring Agent Planning to Stabilize Long-Horizon Web Reasoning

Jan 07, 2026

Learning Depth from Past Selves: Self-Evolution Contrast for Robust Depth Estimation

Nov 19, 2025