Picture for Zhe Liu

Zhe Liu

ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models

Add code
Mar 22, 2026
Viaarxiv icon

FASTER: Rethinking Real-Time Flow VLAs

Add code
Mar 19, 2026
Viaarxiv icon

Towards the Vision-Sound-Language-Action Paradigm: The HEAR Framework for Sound-Centric Manipulation

Add code
Mar 17, 2026
Viaarxiv icon

RegFormer++: An Efficient Large-Scale 3D LiDAR Point Registration Network with Projection-Aware 2D Transformer

Add code
Mar 15, 2026
Viaarxiv icon

RESBev: Making BEV Perception More Robust

Add code
Mar 10, 2026
Viaarxiv icon

ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments

Add code
Mar 03, 2026
Viaarxiv icon

Fair in Mind, Fair in Action? A Synchronous Benchmark for Understanding and Generation in UMLLMs

Add code
Feb 28, 2026
Viaarxiv icon

WeatherCity: Urban Scene Reconstruction with Controllable Multi-Weather Transformation

Add code
Feb 25, 2026
Viaarxiv icon

PILOT: A Perceptive Integrated Low-level Controller for Loco-manipulation over Unstructured Scenes

Add code
Jan 24, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon