Picture for Canbin Huang

Canbin Huang

Beyond Trajectory Imitation: Strategy-Guided Policy Optimization for LLM Reasoning

Add code
Jun 23, 2026
Viaarxiv icon

When Model Merging Breaks Routing: Training-Free Calibration for MoE

Add code
Jun 02, 2026
Viaarxiv icon

FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion

Add code
Mar 06, 2025
Viaarxiv icon

ProFuser: Progressive Fusion of Large Language Models

Add code
Aug 09, 2024
Figure 1 for ProFuser: Progressive Fusion of Large Language Models
Figure 2 for ProFuser: Progressive Fusion of Large Language Models
Figure 3 for ProFuser: Progressive Fusion of Large Language Models
Figure 4 for ProFuser: Progressive Fusion of Large Language Models
Viaarxiv icon

Retrieval-Generation Alignment for End-to-End Task-Oriented Dialogue System

Add code
Oct 20, 2023
Viaarxiv icon