Picture for Hua Chen

Hua Chen

ARM: Advantage Reward Modeling for Long-Horizon Manipulation

Add code
Apr 03, 2026
Viaarxiv icon

Where-to-Learn: Analytical Policy Gradient Directed Exploration for On-Policy Robotic Reinforcement Learning

Add code
Apr 01, 2026
Viaarxiv icon

Near-Field NLOS Localization via Position-Unknown HRIS:From Self-Localization to Target Positioning

Add code
Mar 18, 2026
Viaarxiv icon

Near-Field Multiuser Beam Training for XL-MIMO: An End-to-End Interference-Aware Approach with Pilot Limitations

Add code
Mar 12, 2026
Viaarxiv icon

Diffusion Policy through Conditional Proximal Policy Optimization

Add code
Mar 05, 2026
Viaarxiv icon

BFA++: Hierarchical Best-Feature-Aware Token Prune for Multi-View Vision Language Action Model

Add code
Feb 24, 2026
Viaarxiv icon

Compute Only Once: UG-Separation for Efficient Large Recommendation Models

Add code
Feb 11, 2026
Viaarxiv icon

Beyond $λ/2$: Can Arbitrary EMVS Arrays Achieve Unambiguous NLOS Localization?

Add code
Feb 07, 2026
Viaarxiv icon

PolicyFlow: Policy Optimization with Continuous Normalizing Flow in Reinforcement Learning

Add code
Feb 01, 2026
Viaarxiv icon

FastStair: Learning to Run Up Stairs with Humanoid Robots

Add code
Jan 15, 2026
Viaarxiv icon