Picture for Li Shen

Li Shen

Unveiling the Power of Multiple Gossip Steps: A Stability-Based Generalization Analysis in Decentralized Training

Add code
Oct 09, 2025
Viaarxiv icon

Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives

Add code
Sep 26, 2025
Viaarxiv icon

Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning

Add code
Sep 08, 2025
Viaarxiv icon

Diffusion Language Models Know the Answer Before Decoding

Add code
Aug 27, 2025
Figure 1 for Diffusion Language Models Know the Answer Before Decoding
Figure 2 for Diffusion Language Models Know the Answer Before Decoding
Figure 3 for Diffusion Language Models Know the Answer Before Decoding
Figure 4 for Diffusion Language Models Know the Answer Before Decoding
Viaarxiv icon

DynamiCare: A Dynamic Multi-Agent Framework for Interactive and Open-Ended Medical Decision-Making

Add code
Jul 03, 2025
Viaarxiv icon

AlphaDecay:Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs

Add code
Jun 17, 2025
Viaarxiv icon

MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on Large Language Models

Add code
Jun 15, 2025
Viaarxiv icon

TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models

Add code
Jun 15, 2025
Viaarxiv icon

Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning

Add code
Jun 04, 2025
Viaarxiv icon

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

Add code
May 30, 2025
Viaarxiv icon