Lingdong Kong

Masked Generative Transformer Is What You Need for Image Editing

May 11, 2026
Via arXiv

Is Your Driving World Model an All-Around Player?

May 11, 2026
Via arXiv

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Apr 24, 2026
Via arXiv

The First Challenge on Remote Sensing Infrared Image Super-Resolution at NTIRE 2026: Benchmark Results and Method Overview

Apr 23, 2026
Via arXiv

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Apr 20, 2026
Via arXiv

AdaSFormer: Adaptive Serialized Transformers for Monocular Semantic Scene Completion from Indoor Environments

Mar 26, 2026
Via arXiv

NavThinker: Action-Conditioned World Models for Coupled Prediction and Planning in Social Navigation

Mar 16, 2026
Via arXiv

FLUX: Accelerating Cross-Embodiment Generative Navigation Policies via Rectified Flow and Static-to-Dynamic Learning

Mar 13, 2026
Via arXiv

The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms

Jan 08, 2026
Via arXiv

Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

Dec 30, 2025
Via arXiv