Dachuan Shi

PACT: Privileged Trace Co-Training for Multi-Turn Tool-Use Agents

Jun 15, 2026
Via arXiv

CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning

May 19, 2026
Via arXiv

Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding

Mar 09, 2026
Via arXiv

Behavior Knowledge Merge in Reinforced Agentic Models

Jan 20, 2026
Via arXiv

SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

Oct 06, 2025
Via arXiv

Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners

Oct 06, 2025
Via arXiv

Superficial Self-Improved Reasoners Benefit from Model Merging

Mar 03, 2025
Via arXiv

AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment

Nov 15, 2024
Via arXiv

CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers

May 27, 2023
Via arXiv

UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers

Jan 31, 2023
Via arXiv