Jian Luan

Attention Basin: Why Contextual Position Matters in Large Language Models

Aug 07, 2025
Via arXiv

Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle

Aug 07, 2025
Via arXiv

MiDashengLM: Efficient Audio Understanding with General Audio Captions

Aug 06, 2025
Via arXiv

Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders

Jun 13, 2025
Via arXiv

GLAP: General contrastive audio-text pretraining across domains and languages

Jun 12, 2025
Via arXiv

BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism

May 27, 2025
Via arXiv

TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization

May 26, 2025
Via arXiv

X-ARES: A Comprehensive Framework for Assessing Audio Encoder Performance

May 22, 2025
Via arXiv

Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains

May 22, 2025
Via arXiv

Enhance Mobile Agents Thinking Process Via Iterative Preference Learning

May 18, 2025
Via arXiv