Hongyu Lin

Learning from Failures: Correction-Oriented Policy Optimization with Verifiable Rewards

May 14, 2026

Compositional Sparsity as an Inductive Bias for Neural Architecture Design

May 14, 2026

All Languages Matter: Understanding and Mitigating Language Bias in Multilingual RAG

Apr 22, 2026

Towards Real-world Human Behavior Simulation: Benchmarking Large Language Models on Long-horizon, Cross-scenario, Heterogeneous Behavior Traces

Apr 09, 2026

P^2O: Joint Policy and Prompt Optimization

Mar 23, 2026

Tackling Length Inflation Without Trade-offs: Group Relative Reward Rescaling for Reinforcement Learning

Mar 11, 2026

Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards

Mar 10, 2026

DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation

Feb 26, 2026

Imandra CodeLogician: Neuro-Symbolic Reasoning for Precise Analysis of Software Logic

Jan 17, 2026

Coupled Variational Reinforcement Learning for Language Model General Reasoning

Dec 14, 2025