Picture for Xunliang Cai

Xunliang Cai

MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models

Add code
Sep 18, 2025
Viaarxiv icon

Instance-level Randomization: Toward More Stable LLM Evaluations

Add code
Sep 16, 2025
Viaarxiv icon

OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation

Add code
Sep 03, 2025
Viaarxiv icon

MUA-RL: Multi-turn User-interacting Agent Reinforcement Learning for agentic tool use

Add code
Aug 26, 2025
Viaarxiv icon

Stop Spinning Wheels: Mitigating LLM Overthinking via Mining Patterns for Early Reasoning Exit

Add code
Aug 25, 2025
Viaarxiv icon

InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing

Add code
Aug 19, 2025
Viaarxiv icon

Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy

Add code
Jul 02, 2025
Viaarxiv icon

MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models

Add code
Jun 17, 2025
Viaarxiv icon

OIBench: Benchmarking Strong Reasoning Models with Olympiad in Informatics

Add code
Jun 12, 2025
Viaarxiv icon

AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models

Add code
Jun 08, 2025
Viaarxiv icon