Picture for Bing Wang

Bing Wang

Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation

Add code
Dec 24, 2025
Viaarxiv icon

MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning

Add code
Dec 16, 2025
Viaarxiv icon

Enhancing Multimodal Misinformation Detection by Replaying the Whole Story from Image Modality Perspective

Add code
Nov 09, 2025
Viaarxiv icon

LiveStar: Live Streaming Assistant for Real-World Online Video Understanding

Add code
Nov 07, 2025
Viaarxiv icon

Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks

Add code
Oct 22, 2025
Viaarxiv icon

Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers

Add code
Oct 08, 2025
Figure 1 for Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
Figure 2 for Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
Figure 3 for Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
Figure 4 for Pixel-Perfect Depth with Semantics-Prompted Diffusion Transformers
Viaarxiv icon

DiGS: Accurate and Complete Surface Reconstruction from 3D Gaussians via Direct SDF Learning

Add code
Sep 09, 2025
Viaarxiv icon

ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving

Add code
Jun 09, 2025
Figure 1 for ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
Figure 2 for ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
Figure 3 for ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
Figure 4 for ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
Viaarxiv icon

Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency

Add code
Jun 09, 2025
Viaarxiv icon

Dual-view Spatio-Temporal Feature Fusion with CNN-Transformer Hybrid Network for Chinese Isolated Sign Language Recognition

Add code
Jun 08, 2025
Viaarxiv icon