Picture for Xiangchi Yuan

Xiangchi Yuan

Behavior Knowledge Merge in Reinforced Agentic Models

Add code
Jan 20, 2026
Viaarxiv icon

MTMCS-Bench: Evaluating Contextual Safety of Multimodal Large Language Models in Multi-Turn Dialogues

Add code
Jan 11, 2026
Viaarxiv icon

Sparsity-Controllable Dynamic Top-p MoE for Large Foundation Model Pre-training

Add code
Dec 16, 2025
Viaarxiv icon

What Makes a Good Curriculum? Disentangling the Effects of Data Ordering on LLM Mathematical Reasoning

Add code
Oct 21, 2025
Viaarxiv icon

SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

Add code
Oct 06, 2025
Viaarxiv icon

Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners

Add code
Oct 06, 2025
Figure 1 for Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners
Figure 2 for Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners
Figure 3 for Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners
Figure 4 for Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners
Viaarxiv icon

LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement

Add code
Apr 22, 2025
Viaarxiv icon

Superficial Self-Improved Reasoners Benefit from Model Merging

Add code
Mar 03, 2025
Viaarxiv icon

Modality-Aware Neuron Pruning for Unlearning in Multimodal Large Language Models

Add code
Feb 21, 2025
Viaarxiv icon

AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment

Add code
Nov 15, 2024
Viaarxiv icon