Picture for Han Zhao

Han Zhao

Not All Correct Answers Are Equal: Why Your Distillation Source Matters

Add code
May 20, 2025
Viaarxiv icon

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

Add code
May 18, 2025
Viaarxiv icon

Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions

Add code
May 16, 2025
Viaarxiv icon

MergeBench: A Benchmark for Merging Domain-Specialized LLMs

Add code
May 16, 2025
Viaarxiv icon

AM-Thinking-v1: Advancing the Frontier of Reasoning at 32B Scale

Add code
May 13, 2025
Viaarxiv icon

ReinboT: Amplifying Robot Visual-Language Manipulation with Reinforcement Learning

Add code
May 12, 2025
Viaarxiv icon

OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation

Add code
May 06, 2025
Viaarxiv icon

Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study

Add code
May 04, 2025
Viaarxiv icon

DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training

Add code
Apr 24, 2025
Viaarxiv icon

Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability

Add code
Apr 13, 2025
Viaarxiv icon