Picture for Liang Lin

Liang Lin

TAG: Target-Agnostic Guidance for Stable Object-Centric Inference in Vision-Language-Action Models

Add code
Mar 25, 2026
Viaarxiv icon

Cognitive Mismatch in Multimodal Large Language Models for Discrete Symbol Understanding

Add code
Mar 19, 2026
Viaarxiv icon

AtomicVLA: Unlocking the Potential of Atomic Skill Learning in Robots

Add code
Mar 08, 2026
Viaarxiv icon

PointCoT: A Multi-modal Benchmark for Explicit 3D Geometric Reasoning

Add code
Feb 27, 2026
Viaarxiv icon

RADAR: Benchmarking Vision-Language-Action Generalization via Real-World Dynamics, Spatial-Physical Intelligence, and Autonomous Evaluation

Add code
Feb 11, 2026
Viaarxiv icon

Spectral Gating Networks

Add code
Feb 07, 2026
Viaarxiv icon

AR-MAP: Are Autoregressive Large Language Models Implicit Teachers for Diffusion Large Language Models?

Add code
Feb 03, 2026
Viaarxiv icon

DDP-WM: Disentangled Dynamics Prediction for Efficient World Models

Add code
Feb 03, 2026
Viaarxiv icon

Unveiling Perceptual Artifacts: A Fine-Grained Benchmark for Interpretable AI-Generated Image Detection

Add code
Jan 27, 2026
Viaarxiv icon

TangramPuzzle: Evaluating Multimodal Large Language Models with Compositional Spatial Reasoning

Add code
Jan 23, 2026
Viaarxiv icon