Picture for Yong Zhang

Yong Zhang

Beijing University of Technology

InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing

Add code
Aug 19, 2025
Viaarxiv icon

SMARTAPS: Tool-augmented LLMs for Operations Management

Add code
Jul 23, 2025
Viaarxiv icon

DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution

Add code
Jul 01, 2025
Viaarxiv icon

Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective

Add code
May 29, 2025
Viaarxiv icon

Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Add code
May 28, 2025
Viaarxiv icon

Using Large Language Models to Tackle Fundamental Challenges in Graph Learning: A Comprehensive Survey

Add code
May 24, 2025
Viaarxiv icon

Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs

Add code
May 20, 2025
Viaarxiv icon

Avoid Recommending Out-of-Domain Items: Constrained Generative Recommendation with LLMs

Add code
May 06, 2025
Viaarxiv icon

CASP: Compression of Large Multimodal Models Based on Attention Sparsity

Add code
Mar 07, 2025
Viaarxiv icon

DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models

Add code
Mar 04, 2025
Viaarxiv icon