Tong Xiao

Jack

MSRL: Scaling Generative Multimodal Reward Modeling via Multi-Stage Reinforcement Learning

Mar 26, 2026, via arXiv

PoC: Performance-oriented Context Compression for Large Language Models via Performance Prediction

Mar 20, 2026, via arXiv

DaPT: A Dual-Path Framework for Multilingual Multi-hop Question Answering

Mar 19, 2026, via arXiv

On the Emotion Understanding of Synthesized Speech

Mar 17, 2026, via arXiv

Offline Exploration-Aware Fine-Tuning for Long-Chain Mathematical Reasoning

Mar 17, 2026, via arXiv

StyleBench: Evaluating Speech Language Models on Conversational Speaking Style Control

Mar 08, 2026, via arXiv

When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware Reasoning

Feb 28, 2026, via arXiv

CoMeT: Collaborative Memory Transformer for Efficient Long Context Modeling

Feb 02, 2026, via arXiv

APR: Penalizing Structural Redundancy in Large Reasoning Models via Anchor-based Process Rewards

Jan 31, 2026, via arXiv

SpanNorm: Reconciling Training Stability and Performance in Deep Transformers

Jan 30, 2026, via arXiv