Picture for Zichuan Lin

Zichuan Lin

CORA: Analyzing and bridging thinking-answer gap in Multimodal RLVR via Consistency-Oriented Reasoning Alignment

Add code
Jun 12, 2026
Viaarxiv icon

Faithful-MR1: Faithful Multimodal Reasoning via Anchoring and Reinforcing Visual Attention

Add code
May 21, 2026
Viaarxiv icon

Debiased Model-based Representations for Sample-efficient Continuous Control

Add code
May 12, 2026
Viaarxiv icon

HiRO-Nav: Hybrid ReasOning Enables Efficient Embodied Navigation

Add code
Apr 09, 2026
Viaarxiv icon

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Add code
Mar 25, 2026
Viaarxiv icon

HISR: Hindsight Information Modulated Segmental Process Rewards For Multi-turn Agentic Reinforcement Learning

Add code
Mar 19, 2026
Viaarxiv icon

ProAct: Agentic Lookahead in Interactive Environments

Add code
Feb 05, 2026
Viaarxiv icon

Cross-Domain Offline Policy Adaptation via Selective Transition Correction

Add code
Feb 05, 2026
Viaarxiv icon

PIPCFR: Pseudo-outcome Imputation with Post-treatment Variables for Individual Treatment Effect Estimation

Add code
Dec 21, 2025
Viaarxiv icon

EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control

Add code
Nov 19, 2025
Viaarxiv icon