Peng Li

DJI Innovations Inc

Know3D: Prompting 3D Generation with Knowledge from Vision-Language Models

Mar 24, 2026

GigaWorld-Policy: An Efficient Action-Centered World-Action Model

Mar 18, 2026

Evaluating Time Awareness and Cross-modal Active Perception of Large Models via 4D Escape Room Task

Mar 16, 2026

TacMamba: A Tactile History Compression Adapter Bridging Fast Reflexes and Slow VLA Reasoning

Mar 02, 2026

Beyond Words: Evaluating and Bridging Epistemic Divergence in User-Agent Interaction via Theory of Mind

Feb 14, 2026

FlexAM: Flexible Appearance-Motion Decomposition for Versatile Video Generation Control

Feb 13, 2026

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Feb 12, 2026

Reasoning and Tool-use Compete in Agentic RL: From Quantifying Interference to Disentangled Tuning

Feb 01, 2026

InstructDiff: Domain-Adaptive Data Selection via Differential Entropy for Efficient LLM Fine-Tuning

Jan 30, 2026

FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions

Jan 19, 2026