Sirui Han

NanoResearch: Co-Evolving Skills, Memory, and Policy for Personalized Research Automation

May 11, 2026
Via arXiv

ProDrive: Proactive Planning for Autonomous Driving via Ego-Environment Co-Evolution

Apr 28, 2026
Via arXiv

ContextLens: Modeling Imperfect Privacy and Safety Context for Legal Compliance

Apr 14, 2026
Via arXiv

QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training under Training–Inference Mismatch

Apr 09, 2026
Via arXiv

Bit-by-Bit: Progressive QAT Strategy with Outlier Channel Splitting for Stable Low-Bit LLMs

Apr 09, 2026
Via arXiv

Not Just the Destination, But the Journey: Reasoning Traces Causally Shape Generalization Behaviors

Mar 12, 2026
Via arXiv

LABSHIELD: A Multimodal Benchmark for Safety-Critical Reasoning and Planning in Scientific Laboratories

Mar 12, 2026
Via arXiv

DC-W2S: Dual-Consensus Weak-to-Strong Training for Reliable Process Reward Modeling in Biological Reasoning

Mar 09, 2026
Via arXiv

TwinRL-VLA: Digital Twin-Driven Reinforcement Learning for Real-World Robotic Manipulation

Feb 09, 2026
Via arXiv

What, Whether and How? Unveiling Process Reward Models for Thinking with Images Reasoning

Feb 09, 2026
Via arXiv