Picture for Bing Wang

Bing Wang

ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking

Add code
Jan 04, 2026
Viaarxiv icon

DriveLaW:Unifying Planning and Video Generation in a Latent Driving World

Add code
Dec 31, 2025
Viaarxiv icon

Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes

Add code
Dec 30, 2025
Viaarxiv icon

Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation

Add code
Dec 24, 2025
Viaarxiv icon

MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning

Add code
Dec 16, 2025
Figure 1 for MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
Figure 2 for MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
Figure 3 for MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
Figure 4 for MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning
Viaarxiv icon

Enhancing Multimodal Misinformation Detection by Replaying the Whole Story from Image Modality Perspective

Add code
Nov 09, 2025
Figure 1 for Enhancing Multimodal Misinformation Detection by Replaying the Whole Story from Image Modality Perspective
Figure 2 for Enhancing Multimodal Misinformation Detection by Replaying the Whole Story from Image Modality Perspective
Figure 3 for Enhancing Multimodal Misinformation Detection by Replaying the Whole Story from Image Modality Perspective
Figure 4 for Enhancing Multimodal Misinformation Detection by Replaying the Whole Story from Image Modality Perspective
Viaarxiv icon

LiveStar: Live Streaming Assistant for Real-World Online Video Understanding

Add code
Nov 07, 2025
Viaarxiv icon

Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks

Add code
Oct 22, 2025
Viaarxiv icon

Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers

Add code
Oct 08, 2025
Figure 1 for Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
Figure 2 for Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
Figure 3 for Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
Figure 4 for Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
Viaarxiv icon

DiGS: Accurate and Complete Surface Reconstruction from 3D Gaussians via Direct SDF Learning

Add code
Sep 09, 2025
Viaarxiv icon