Yan Lu

Perfecting Depth: Uncertainty-Aware Enhancement of Metric Depth

Jun 05, 2025
Via arXiv

LTM3D: Bridging Token Spaces for Conditional 3D Generation with Auto-Regressive Diffusion Framework

May 30, 2025
Via arXiv

UI-Evol: Automatic Knowledge Evolving for Computer Use Agents

May 28, 2025
Via arXiv

Text-Queried Audio Source Separation via Hierarchical Modeling

May 27, 2025
Via arXiv

Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling

May 26, 2025
Via arXiv

Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding

May 23, 2025
Via arXiv

Generative Latent Coding for Ultra-Low Bitrate Image and Video Compression

May 22, 2025
Via arXiv

One-Step Diffusion-Based Image Compression with Semantic Distillation

May 22, 2025
Via arXiv

PICD: Versatile Perceptual Image Compression with Diffusion Rendering

May 09, 2025
Via arXiv

UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis

Apr 16, 2025
Via arXiv