Wen Li

Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model

Jun 10, 2025
Via arXiv

Integrating Intermediate Layer Optimization and Projected Gradient Descent for Solving Inverse Problems with Diffusion Models

May 28, 2025
Via arXiv

Learned Image Compression with Dictionary-based Entropy Model

Apr 01, 2025
Via arXiv

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Feb 18, 2025
Via arXiv

Achieving Hiding and Smart Anti-Jamming Communication: A Parallel DRL Approach against Moving Reactive Jammer

Feb 04, 2025
Via arXiv

The Devil is in the Spurious Correlation: Boosting Moment Retrieval via Temporal Dynamic Learning

Jan 13, 2025
Via arXiv

Towards Unsupervised Model Selection for Domain Adaptive Object Detection

Dec 23, 2024
Via arXiv

S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field

Dec 23, 2024
Via arXiv

ResCLIP: Residual Attention for Training-free Dense Vision-language Inference

Nov 24, 2024
Via arXiv

Generalized Eigenvalue Problems with Generative Priors

Nov 02, 2024
Via arXiv