Picture for Weijie Shi

Weijie Shi

TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents

Add code
Apr 28, 2026
Viaarxiv icon

LoopGuard: Breaking Self-Reinforcing Attention Loops via Dynamic KV Cache Intervention

Add code
Apr 11, 2026
Viaarxiv icon

Head-wise Modality Specialization within MLLMs for Robust Fake News Detection under Missing Modality

Add code
Apr 08, 2026
Viaarxiv icon

Data-Driven Function Calling Improvements in Large Language Model for Online Financial QA

Add code
Apr 07, 2026
Viaarxiv icon

R$^3$L: Reflect-then-Retry Reinforcement Learning with Language-Guided Exploration, Pivotal Credit, and Positive Amplification

Add code
Jan 07, 2026
Viaarxiv icon

DynaGen: Unifying Temporal Knowledge Graph Reasoning with Dynamic Subgraphs and Generative Regularization

Add code
Dec 14, 2025
Viaarxiv icon

Automatic Failure Attribution and Critical Step Prediction Method for Multi-Agent Systems Based on Causal Inference

Add code
Sep 10, 2025
Viaarxiv icon

E3-Rewrite: Learning to Rewrite SQL for Executability, Equivalence,and Efficiency

Add code
Aug 12, 2025
Viaarxiv icon

LegalReasoner: Step-wised Verification-Correction for Legal Judgment Reasoning

Add code
Jun 09, 2025
Viaarxiv icon

InfoSAM: Fine-Tuning the Segment Anything Model from An Information-Theoretic Perspective

Add code
May 28, 2025
Viaarxiv icon