Picture for Lu Wang

Lu Wang

CSSE, Shenzhen University

Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation

Add code
Jan 21, 2026
Viaarxiv icon

Skill-Aware Data Selection and Fine-Tuning for Data-Efficient Reasoning Distillation

Add code
Jan 15, 2026
Viaarxiv icon

IndexTTS 2.5 Technical Report

Add code
Jan 08, 2026
Viaarxiv icon

Index-ASR Technical Report

Add code
Dec 31, 2025
Viaarxiv icon

Blur-Robust Detection via Feature Restoration: An End-to-End Framework for Prior-Guided Infrared UAV Target Detection

Add code
Nov 18, 2025
Viaarxiv icon

Heterogeneous Complementary Distillation

Add code
Nov 14, 2025
Viaarxiv icon

Seeing the Forest and the Trees: Query-Aware Tokenizer for Long-Video Multimodal Language Models

Add code
Nov 14, 2025
Viaarxiv icon

GUI-360$^\circ$: A Comprehensive Dataset and Benchmark for Computer-Using Agents

Add code
Nov 10, 2025
Viaarxiv icon

GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents

Add code
Nov 06, 2025
Viaarxiv icon

LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?

Add code
Oct 10, 2025
Figure 1 for LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?
Figure 2 for LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?
Figure 3 for LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?
Figure 4 for LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?
Viaarxiv icon