Picture for Shuo Wang

Shuo Wang

State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, China

ETA-VLA: Efficient Token Adaptation via Temporal Fusion and Intra-LLM Sparsification for Vision-Language-Action Models

Add code
Mar 26, 2026
Viaarxiv icon

CurvZO: Adaptive Curvature-Guided Sparse Zeroth-Order Optimization for Efficient LLM Fine-Tuning

Add code
Mar 23, 2026
Viaarxiv icon

Adapting Point Cloud Analysis via Multimodal Bayesian Distribution Learning

Add code
Mar 23, 2026
Viaarxiv icon

Principled Steering via Null-space Projection for Jailbreak Defense in Vision-Language Models

Add code
Mar 23, 2026
Viaarxiv icon

Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation

Add code
Mar 13, 2026
Viaarxiv icon

FG-CLTP: Fine-Grained Contrastive Language Tactile Pretraining for Robotic Manipulation

Add code
Mar 11, 2026
Viaarxiv icon

UniUncer: Unified Dynamic Static Uncertainty for End to End Driving

Add code
Mar 08, 2026
Viaarxiv icon

GuardAlign: Test-time Safety Alignment in Multimodal Large Language Models

Add code
Feb 27, 2026
Viaarxiv icon

Look Carefully: Adaptive Visual Reinforcements in Multimodal Large Language Models for Hallucination Mitigation

Add code
Feb 27, 2026
Viaarxiv icon

SpikingTac: A Miniaturized Neuromorphic Visuotactile Sensor for High-Precision Dynamic Tactile Imprint Tracking

Add code
Feb 27, 2026
Viaarxiv icon