Jianye Hao

Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Temporal Grounding

Aug 08, 2025
Via arXiv

Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model

Jul 09, 2025
Via arXiv

Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs

Jul 02, 2025
Via arXiv

AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search

Jun 06, 2025
Via arXiv

STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

Jun 04, 2025
Via arXiv

One Demo Is All It Takes: Planning Domain Derivation with LLMs from A Single Demonstration

May 23, 2025
Via arXiv

Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization

May 20, 2025
Via arXiv

Conditioning Matters: Training Diffusion Policies is Faster Than You Think

May 16, 2025
Via arXiv

EmbodiedMAE: A Unified 3D Multi-Modal Representation for Robot Manipulation

May 15, 2025
Via arXiv

From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation

May 13, 2025
Via arXiv