Picture for Hui Li

Hui Li

Jiangnan University, Wuxi, China

Steering Vision-Language Models with Joint Sparse Autoencoders

Add code
Jun 24, 2026
Viaarxiv icon

Metis: Bridging Text and Code Memory for Self-Evolving Agents

Add code
Jun 23, 2026
Viaarxiv icon

Text-Driven Fusion for Infrared and Visible Images: Achieving Image Scene Adaptation on Hyperbolic Space

Add code
Jun 13, 2026
Viaarxiv icon

Nemotron 3 Ultra: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Add code
Jun 12, 2026
Viaarxiv icon

EntangleCodec: A Unified Discrete Audio Tokenizer via Semantic-Acoustic Entanglement

Add code
Jun 01, 2026
Viaarxiv icon

SlotMemory: Object-Centric KV Memory for Streaming Long-Video Generation

Add code
May 29, 2026
Viaarxiv icon

Semantic and Visual Evidence for Efficient Long-Video Reasoning: A Solution for the HD-EPIC VQA Challenge

Add code
May 28, 2026
Viaarxiv icon

SIREN: Unified Multi-Granularity Semantic Interaction for Multi-Modal Lifelong User Interest Modeling

Add code
May 25, 2026
Viaarxiv icon

RQ-MoE: Residual Quantization via Mixture of Experts for Efficient Input-Dependent Vector Compression

Add code
May 14, 2026
Viaarxiv icon

ScribbleDose: Scribble-Guided Dose Prediction in Radiotherapy

Add code
May 12, 2026
Viaarxiv icon