Picture for Liang Wang

Liang Wang

Institute of Automation, CAS

Multi-View Video Diffusion Policy: A 3D Spatio-Temporal-Aware Video Action Model

Add code
Apr 03, 2026
Viaarxiv icon

MotionRFT: Unified Reinforcement Fine-Tuning for Text-to-Motion Generation

Add code
Mar 28, 2026
Viaarxiv icon

RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

Add code
Mar 26, 2026
Viaarxiv icon

MultiBind: A Benchmark for Attribute Misbinding in Multi-Subject Generation

Add code
Mar 23, 2026
Viaarxiv icon

FloorPlan-VLN: A New Paradigm for Floor Plan Guided Vision-Language Navigation

Add code
Mar 18, 2026
Viaarxiv icon

SCAN: Sparse Circuit Anchor Interpretable Neuron for Lifelong Knowledge Editing

Add code
Mar 16, 2026
Viaarxiv icon

Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation

Add code
Mar 13, 2026
Viaarxiv icon

FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models

Add code
Mar 09, 2026
Viaarxiv icon

Learning to Draft: Adaptive Speculative Decoding with Reinforcement Learning

Add code
Mar 02, 2026
Viaarxiv icon

Multimodal Adaptive Retrieval Augmented Generation through Internal Representation Learning

Add code
Feb 28, 2026
Viaarxiv icon