Picture for Yu Liu

Yu Liu

Peking University

DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI

Add code
Feb 16, 2026
Viaarxiv icon

GSM-GS: Geometry-Constrained Single and Multi-view Gaussian Splatting for Surface Reconstruction

Add code
Feb 13, 2026
Viaarxiv icon

SpotAgent: Grounding Visual Geo-localization in Large Vision-Language Models through Agentic Reasoning

Add code
Feb 11, 2026
Viaarxiv icon

Dialogue Model Optimization via Agent Game and Adaptive Tree-based GRPO

Add code
Feb 09, 2026
Viaarxiv icon

Open-Text Aerial Detection: A Unified Framework For Aerial Visual Grounding And Detection

Add code
Feb 08, 2026
Viaarxiv icon

Difficulty-Estimated Policy Optimization

Add code
Feb 06, 2026
Viaarxiv icon

Unified ROI-based Image Compression Paradigm with Generalized Gaussian Model

Add code
Feb 01, 2026
Viaarxiv icon

CRAFT: Calibrated Reasoning with Answer-Faithful Traces via Reinforcement Learning for Multi-Hop Question Answering

Add code
Feb 01, 2026
Viaarxiv icon

V2X-DSC: Multi-Agent Collaborative Perception with Distributed Source Coding Guided Communication

Add code
Jan 31, 2026
Viaarxiv icon

DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment

Add code
Jan 28, 2026
Viaarxiv icon