Picture for Jian Luan

Jian Luan

BacktrackAgent: Enhancing GUI Agent with Error Detection and Backtracking Mechanism

Add code
May 27, 2025
Viaarxiv icon

TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization

Add code
May 26, 2025
Viaarxiv icon

X-ARES: A Comprehensive Framework for Assessing Audio Encoder Performance

Add code
May 22, 2025
Viaarxiv icon

Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains

Add code
May 22, 2025
Viaarxiv icon

Enhance Mobile Agents Thinking Process Via Iterative Preference Learning

Add code
May 18, 2025
Viaarxiv icon

Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents

Add code
May 17, 2025
Viaarxiv icon

Weaving Context Across Images: Improving Vision-Language Models through Focus-Centric Visual Chains

Add code
Apr 28, 2025
Viaarxiv icon

Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering

Add code
Mar 17, 2025
Viaarxiv icon

Joint Training And Decoding for Multilingual End-to-End Simultaneous Speech Translation

Add code
Mar 14, 2025
Viaarxiv icon

Direction-Aware Diagonal Autoregressive Image Generation

Add code
Mar 14, 2025
Viaarxiv icon