Picture for Bing Wang

Bing Wang

Dual-view Spatio-Temporal Feature Fusion with CNN-Transformer Hybrid Network for Chinese Isolated Sign Language Recognition

Add code
Jun 08, 2025
Viaarxiv icon

Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving

Add code
May 13, 2025
Figure 1 for Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Figure 2 for Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Figure 3 for Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Figure 4 for Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Viaarxiv icon

Robust Misinformation Detection by Visiting Potential Commonsense Conflict

Add code
Apr 30, 2025
Figure 1 for Robust Misinformation Detection by Visiting Potential Commonsense Conflict
Figure 2 for Robust Misinformation Detection by Visiting Potential Commonsense Conflict
Figure 3 for Robust Misinformation Detection by Visiting Potential Commonsense Conflict
Figure 4 for Robust Misinformation Detection by Visiting Potential Commonsense Conflict
Viaarxiv icon

NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors

Add code
Apr 15, 2025
Figure 1 for NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
Figure 2 for NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
Figure 3 for NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
Figure 4 for NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
Viaarxiv icon

Collaboration and Controversy Among Experts: Rumor Early Detection by Tuning a Comment Generator

Add code
Apr 05, 2025
Viaarxiv icon

CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving

Add code
Mar 28, 2025
Viaarxiv icon

ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation

Add code
Mar 25, 2025
Viaarxiv icon

MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving

Add code
Mar 20, 2025
Viaarxiv icon

Uni-Gaussians: Unifying Camera and Lidar Simulation with Gaussians for Dynamic Driving Scenarios

Add code
Mar 11, 2025
Viaarxiv icon

Cross-platform Prediction of Depression Treatment Outcome Using Location Sensory Data on Smartphones

Add code
Mar 10, 2025
Viaarxiv icon