Picture for Lie Lu

Lie Lu

Video-Robin: Autoregressive Diffusion Planning for Intent-Grounded Video-to-Music Generation

Add code
Apr 19, 2026
Viaarxiv icon

Variable-Length Audio Fingerprinting

Add code
Mar 25, 2026
Viaarxiv icon

PhyPrompt: RL-based Prompt Refinement for Physically Plausible Text-to-Video Generation

Add code
Mar 03, 2026
Viaarxiv icon

Phys4D: Fine-Grained Physics-Consistent 4D Modeling from Video Diffusion

Add code
Mar 03, 2026
Viaarxiv icon

Towards Sparse Video Understanding and Reasoning

Add code
Feb 14, 2026
Viaarxiv icon

SPUR: A Plug-and-Play Framework for Integrating Spatial Audio Understanding and Reasoning into Large Audio-Language Models

Add code
Nov 13, 2025
Figure 1 for SPUR: A Plug-and-Play Framework for Integrating Spatial Audio Understanding and Reasoning into Large Audio-Language Models
Figure 2 for SPUR: A Plug-and-Play Framework for Integrating Spatial Audio Understanding and Reasoning into Large Audio-Language Models
Figure 3 for SPUR: A Plug-and-Play Framework for Integrating Spatial Audio Understanding and Reasoning into Large Audio-Language Models
Figure 4 for SPUR: A Plug-and-Play Framework for Integrating Spatial Audio Understanding and Reasoning into Large Audio-Language Models
Viaarxiv icon

Transformation of audio embeddings into interpretable, concept-based representations

Add code
Apr 18, 2025
Viaarxiv icon

XAttnMark: Learning Robust Audio Watermarking with Cross-Attention

Add code
Feb 07, 2025
Figure 1 for XAttnMark: Learning Robust Audio Watermarking with Cross-Attention
Figure 2 for XAttnMark: Learning Robust Audio Watermarking with Cross-Attention
Figure 3 for XAttnMark: Learning Robust Audio Watermarking with Cross-Attention
Figure 4 for XAttnMark: Learning Robust Audio Watermarking with Cross-Attention
Viaarxiv icon

CiTrus: Squeezing Extra Performance out of Low-data Bio-signal Transfer Learning

Add code
Dec 16, 2024
Figure 1 for CiTrus: Squeezing Extra Performance out of Low-data Bio-signal Transfer Learning
Figure 2 for CiTrus: Squeezing Extra Performance out of Low-data Bio-signal Transfer Learning
Figure 3 for CiTrus: Squeezing Extra Performance out of Low-data Bio-signal Transfer Learning
Figure 4 for CiTrus: Squeezing Extra Performance out of Low-data Bio-signal Transfer Learning
Viaarxiv icon

Audio Match Cutting: Finding and Creating Matching Audio Transitions in Movies and Videos

Add code
Aug 20, 2024
Viaarxiv icon