Picture for Rongxiang Weng

Rongxiang Weng

LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance

Add code
May 21, 2026
Viaarxiv icon

Prefix Teach, Suffix Fade: Local Teachability Collapse in Strong-to-Weak On-Policy Distillation

Add code
May 13, 2026
Viaarxiv icon

Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization

Add code
May 13, 2026
Viaarxiv icon

LongCat-Flash-Thinking-2601 Technical Report

Add code
Jan 23, 2026
Viaarxiv icon

Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text

Add code
Jan 15, 2026
Viaarxiv icon

Efficient Context Scaling with LongCat ZigZag Attention

Add code
Dec 30, 2025
Viaarxiv icon

A Survey on LLM Mid-training

Add code
Oct 27, 2025
Viaarxiv icon

Making Mathematical Reasoning Adaptive

Add code
Oct 06, 2025
Figure 1 for Making Mathematical Reasoning Adaptive
Figure 2 for Making Mathematical Reasoning Adaptive
Figure 3 for Making Mathematical Reasoning Adaptive
Figure 4 for Making Mathematical Reasoning Adaptive
Viaarxiv icon

OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation

Add code
Sep 03, 2025
Figure 1 for OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation
Figure 2 for OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation
Figure 3 for OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation
Figure 4 for OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation
Viaarxiv icon

XBOUND: Exploring the Capability Boundaries of Device-Control Agents through Trajectory Tree Exploration

Add code
May 27, 2025
Viaarxiv icon