Picture for Chen Zhang

Chen Zhang

SenseTime Research

Few-shot Unknown Class Discovery of Hyperspectral Images with Prototype Learning and Clustering

Add code
Aug 25, 2025
Viaarxiv icon

ZigzagAttention: Efficient Long-Context Inference with Exclusive Retrieval and Streaming Heads

Add code
Aug 17, 2025
Viaarxiv icon

AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation

Add code
Aug 01, 2025
Viaarxiv icon

Hypernetworks for Model-Heterogeneous Personalized Federated Learning

Add code
Jul 30, 2025
Viaarxiv icon

AC-Refiner: Efficient Arithmetic Circuit Optimization Using Conditional Diffusion Models

Add code
Jul 03, 2025
Viaarxiv icon

Commonsense Generation and Evaluation for Dialogue Systems using Large Language Models

Add code
Jun 24, 2025
Viaarxiv icon

Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation

Add code
Jun 24, 2025
Viaarxiv icon

Bipedal Balance Control with Whole-body Musculoskeletal Standing and Falling Simulations

Add code
Jun 11, 2025
Viaarxiv icon

Quickest Causal Change Point Detection by Adaptive Intervention

Add code
Jun 09, 2025
Viaarxiv icon

Accelerating 3D Gaussian Splatting with Neural Sorting and Axis-Oriented Rasterization

Add code
Jun 08, 2025
Viaarxiv icon