Zhaoran Wang

Phys4D: Fine-Grained Physics-Consistent 4D Modeling from Video Diffusion

Mar 03, 2026

Farther the Shift, Sparser the Representation: Analyzing OOD Mechanisms in LLMs

Mar 03, 2026

HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents

Feb 18, 2026

Training-Free Adaptation of Diffusion Models via Doob's $h$-Transform

Feb 18, 2026

Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents

Jan 26, 2026

Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regression

Oct 01, 2025

The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability

Jun 11, 2025

Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning

May 26, 2025

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning

Jan 31, 2025

Are Transformers Able to Reason by Connecting Separated Knowledge in Training Data?

Jan 27, 2025