Jing Lyu

Identity as Presence: Towards Appearance and Voice Personalized Joint Audio-Video Generation

Mar 18, 2026
Via arXiv

AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents

Mar 17, 2026
Via arXiv

NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing

Mar 03, 2026
Via arXiv

Learning Cross-View Object Correspondence via Cycle-Consistent Mask Prediction

Feb 22, 2026
Via arXiv

Improving Reconstruction of Representation Autoencoder

Feb 09, 2026
Via arXiv

D-ORCA: Dialogue-Centric Optimization for Robust Audio-Visual Captioning

Feb 08, 2026
Via arXiv

Reshaping Action Error Distributions for Reliable Vision-Language-Action Models

Feb 04, 2026
Via arXiv

ObjEmbed: Towards Universal Multimodal Object Embeddings

Feb 03, 2026
Via arXiv

Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models

Feb 01, 2026
Via arXiv

WeDetect: Fast Open-Vocabulary Object Detection as Retrieval

Dec 13, 2025
Via arXiv