Jian Luan

Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders

Jun 13, 2025
Via arXiv

GLAP: General contrastive audio-text pretraining across domains and languages

Jun 12, 2025
Via arXiv

BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism

May 27, 2025
Via arXiv

TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization

May 26, 2025
Via arXiv

Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains

May 22, 2025
Via arXiv

X-ARES: A Comprehensive Framework for Assessing Audio Encoder Performance

May 22, 2025
Via arXiv

Enhance Mobile Agents Thinking Process Via Iterative Preference Learning

May 18, 2025
Via arXiv

Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents

May 17, 2025
Via arXiv

Weaving Context Across Images: Improving Vision-Language Models through Focus-Centric Visual Chains

Apr 28, 2025
Via arXiv

Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering

Mar 17, 2025
Via arXiv