Picture for Han Zhao

Han Zhao

DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training

Add code
Apr 24, 2025
Viaarxiv icon

Leveraging Reasoning Model Answers to Enhance Non-Reasoning Model Capability

Add code
Apr 13, 2025
Viaarxiv icon

How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs' Reasoning Capabilities: A Preliminary Experimental Study

Add code
Apr 01, 2025
Viaarxiv icon

1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training

Add code
Mar 25, 2025
Viaarxiv icon

Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking

Add code
Mar 25, 2025
Viaarxiv icon

Predicting Potential Customer Support Needs and Optimizing Search Ranking in a Two-Sided Marketplace

Add code
Mar 21, 2025
Viaarxiv icon

Towards Scalable Foundation Model for Multi-modal and Hyperspectral Geospatial Data

Add code
Mar 17, 2025
Viaarxiv icon

MoRE: Unlocking Scalability in Reinforcement Learning for Quadruped Vision-Language-Action Models

Add code
Mar 11, 2025
Viaarxiv icon

Accelerating Vision-Language-Action Model Integrated with Action Chunking via Parallel Decoding

Add code
Mar 04, 2025
Viaarxiv icon

Structural Alignment Improves Graph Test-Time Adaptation

Add code
Feb 25, 2025
Viaarxiv icon