Picture for Kuan-Chen Pan

Kuan-Chen Pan

Cross-Domain Policy Optimization via Bellman Consistency and Hybrid Critics

Add code
Mar 12, 2026
Viaarxiv icon