Picture for Jian Luan

Jian Luan

ELVA: Exploring Ranking-Driven Universal Multimodal Retrieval

Add code
Jun 18, 2026
Viaarxiv icon

STAR: SpatioTemporal Adaptive Reward Allocation for Text-to-Image RL Post-Training

Add code
Jun 18, 2026
Viaarxiv icon

HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry

Add code
Jun 12, 2026
Viaarxiv icon

Teaching the Way, Not the Answer: Privileged Tutoring Distillation for Multimodal Policy Optimization

Add code
Jun 05, 2026
Viaarxiv icon

SpeakerCard-1M: An Evidence-Grounded Speaker Card Corpus for In-the-Wild Speaker Verification

Add code
Jun 03, 2026
Viaarxiv icon

Restoring Initial Noise Sensitivity in Text-to-Image Distillation via Geometric Alignment

Add code
Jun 01, 2026
Viaarxiv icon

Scaling, Benchmarking, and Reasoning of Vision-Language Agents for Mobile GUI Navigation

Add code
May 26, 2026
Viaarxiv icon

PixelWizard: Towards Efficient High-Fidelity Video Generation at Ultra-Large Spatial Resolution

Add code
May 25, 2026
Viaarxiv icon

SimuWoB: Simulating Real-World Mobile Apps for Fast and Faithful GUI Agent Benchmarking

Add code
May 24, 2026
Viaarxiv icon

Beyond Binary: Reframing GUI Critique as Continuous Semantic Alignment

Add code
May 14, 2026
Viaarxiv icon