Picture for Xu Tan

Xu Tan

Pause or Fabricate? Training Language Models for Grounded Reasoning

Add code
Apr 21, 2026
Viaarxiv icon

VoxSafeBench: Not Just What Is Said, but Who, How, and Where

Add code
Apr 16, 2026
Viaarxiv icon

UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization

Add code
Apr 15, 2026
Viaarxiv icon

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Add code
Apr 15, 2026
Viaarxiv icon

KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation

Add code
Apr 09, 2026
Viaarxiv icon

Towards Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training

Add code
Jan 06, 2026
Viaarxiv icon

Chain-of-Model Learning for Language Model

Add code
May 17, 2025
Viaarxiv icon

Kimi-Audio Technical Report

Add code
Apr 25, 2025
Figure 1 for Kimi-Audio Technical Report
Figure 2 for Kimi-Audio Technical Report
Figure 3 for Kimi-Audio Technical Report
Figure 4 for Kimi-Audio Technical Report
Viaarxiv icon

MoonCast: High-Quality Zero-Shot Podcast Generation

Add code
Mar 19, 2025
Figure 1 for MoonCast: High-Quality Zero-Shot Podcast Generation
Figure 2 for MoonCast: High-Quality Zero-Shot Podcast Generation
Figure 3 for MoonCast: High-Quality Zero-Shot Podcast Generation
Figure 4 for MoonCast: High-Quality Zero-Shot Podcast Generation
Viaarxiv icon

AudioX: Diffusion Transformer for Anything-to-Audio Generation

Add code
Mar 13, 2025
Viaarxiv icon