Picture for Shijie Li

Shijie Li

Ego2World: Compiling Egocentric Cooking Videos into Executable Worlds for Belief-State Planning

Add code
May 13, 2026
Viaarxiv icon

Grounding by Remembering: Cross-Scene and In-Scene Memory for 3D Functional Affordances

Add code
May 12, 2026
Viaarxiv icon

PRISM: : Planning and Reasoning with Intent in Simulated Embodied Environments

Add code
May 12, 2026
Viaarxiv icon

From Priors to Perception: Grounding Video-LLMs in Physical Reality

Add code
May 06, 2026
Viaarxiv icon

PanDA: Unsupervised Domain Adaptation for Multimodal 3D Panoptic Segmentation in Autonomous Driving

Add code
Apr 21, 2026
Viaarxiv icon

Tell2Adapt: A Unified Framework for Source Free Unsupervised Domain Adaptation via Vision Foundation Model

Add code
Mar 05, 2026
Viaarxiv icon

DenoiseFlow: Uncertainty-Aware Denoising for Reliable LLM Agentic Workflows

Add code
Feb 28, 2026
Viaarxiv icon

One Agent to Guide Them All: Empowering MLLMs for Vision-and-Language Navigation via Explicit World Representation

Add code
Feb 17, 2026
Viaarxiv icon

GLM-5: from Vibe Coding to Agentic Engineering

Add code
Feb 17, 2026
Viaarxiv icon

DV-VLN: Dual Verification for Reliable LLM-Based Vision-and-Language Navigation

Add code
Jan 26, 2026
Viaarxiv icon