Jing Lyu

CapVector: Learning Transferable Capability Vectors in Parametric Space for Vision-Language-Action Models

May 11, 2026
Via arXiv

Beyond Chain-of-Thought: Rewrite as a Universal Interface for Generative Multimodal Embeddings

Apr 24, 2026
Via arXiv

Beyond Few-Step Inference: Accelerating Video Diffusion Transformer Model Serving with Inter-Request Caching Reuse

Apr 06, 2026
Via arXiv

From Understanding to Erasing: Towards Complete and Stable Video Object Removal

Apr 02, 2026
Via arXiv

Identity as Presence: Towards Appearance and Voice Personalized Joint Audio-Video Generation

Mar 18, 2026
Via arXiv

AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents

Mar 17, 2026
Via arXiv

NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing

Mar 03, 2026
Via arXiv

Learning Cross-View Object Correspondence via Cycle-Consistent Mask Prediction

Feb 22, 2026
Via arXiv

Improving Reconstruction of Representation Autoencoder

Feb 09, 2026
Via arXiv

D-ORCA: Dialogue-Centric Optimization for Robust Audio-Visual Captioning

Feb 08, 2026
Via arXiv