Bing Wang

Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes

Dec 30, 2025
Via arXiv

Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation

Dec 24, 2025
Via arXiv

MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning

Dec 16, 2025
Via arXiv

Enhancing Multimodal Misinformation Detection by Replaying the Whole Story from Image Modality Perspective

Nov 09, 2025
Via arXiv

LiveStar: Live Streaming Assistant for Real-World Online Video Understanding

Nov 07, 2025
Via arXiv

Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks

Oct 22, 2025
Via arXiv

Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers

Oct 08, 2025
Via arXiv

DiGS: Accurate and Complete Surface Reconstruction from 3D Gaussians via Direct SDF Learning

Sep 09, 2025
Via arXiv

Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency

Jun 09, 2025
Via arXiv

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Jun 09, 2025
Via arXiv