Liang Wang

Institute of Automation, CAS

MultiBind: A Benchmark for Attribute Misbinding in Multi-Subject Generation

Mar 23, 2026
Via arXiv

FloorPlan-VLN: A New Paradigm for Floor Plan Guided Vision-Language Navigation

Mar 18, 2026
Via arXiv

SCAN: Sparse Circuit Anchor Interpretable Neuron for Lifelong Knowledge Editing

Mar 16, 2026
Via arXiv

Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation

Mar 13, 2026
Via arXiv

FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models

Mar 09, 2026
Via arXiv

Learning to Draft: Adaptive Speculative Decoding with Reinforcement Learning

Mar 02, 2026
Via arXiv

Multimodal Adaptive Retrieval Augmented Generation through Internal Representation Learning

Feb 28, 2026
Via arXiv

Chatting with Images for Introspective Visual Thinking

Feb 12, 2026
Via arXiv

Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment Localization

Feb 10, 2026
Via arXiv

Human Identification at a Distance: Challenges, Methods and Results on the Competition HID 2025

Feb 07, 2026
Via arXiv