Picture for Jiangcheng Song

Jiangcheng Song

MIND: From Passive Mimicry to Active Reasoning through Capability-Aware Multi-Perspective CoT Distillation

Add code
Jan 07, 2026
Viaarxiv icon

DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer

Add code
May 21, 2025
Figure 1 for DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer
Figure 2 for DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer
Figure 3 for DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer
Figure 4 for DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer
Viaarxiv icon