Liu Ren

EmbodiedMidtrain: Bridging the Gap between Vision-Language Models and Vision-Language-Action Models via Mid-training

Apr 21, 2026
Via arXiv

ExploreVLA: Dense World Modeling and Exploration for End-to-End Autonomous Driving

Apr 03, 2026
Via arXiv

UniDAC: Universal Metric Depth Estimation for Any Camera

Mar 28, 2026
Via arXiv

Reconstruction Matters: Learning Geometry-Aligned BEV Representation through 3D Gaussian Splatting

Mar 19, 2026
Via arXiv

No Calibration, No Depth, No Problem: Cross-Sensor View Synthesis with 3D Consistency

Feb 27, 2026
Via arXiv

UniDrive-WM: Unified Understanding, Planning and Generation World Model For Autonomous Driving

Jan 07, 2026
Via arXiv

Open Ad-hoc Categorization with Contextualized Feature Learning

Dec 18, 2025
Via arXiv

LAVQA: A Latency-Aware Visual Question Answering Framework for Shared Autonomy in Self-Driving Vehicles

Nov 14, 2025
Via arXiv

3DGEER: Exact and Efficient Volumetric Rendering with 3D Gaussians

May 29, 2025
Via arXiv

DINO-R1: Incentivizing Reasoning Capability in Vision Foundation Models

May 29, 2025
Via arXiv