Jianbing Shen

OccDirector: Language-Guided Behavior and Interaction Generation in 4D Occupancy Space

Apr 24, 2026

Multimodal Large Language Models for Multi-Subject In-Context Image Generation

Apr 08, 2026

Accelerating Training of Autoregressive Video Generation Models via Local Optimization with Representation Continuity

Apr 08, 2026

Clinical Cognition Alignment for Gastrointestinal Diagnosis with Multimodal LLMs

Mar 21, 2026

Bridging Scene Generation and Planning: Driving with World Model via Unifying Vision and Motion Representation

Mar 16, 2026

HanMoVLM: Large Vision-Language Models for Professional Artistic Painting Evaluation

Mar 11, 2026

Condition Errors Refinement in Autoregressive Image Generation with Diffusion Loss

Feb 02, 2026

Towards Geometry-Aware and Motion-Guided Video Human Mesh Recovery

Jan 29, 2026

From Human Intention to Action Prediction: A Comprehensive Benchmark for Intention-driven End-to-End Autonomous Driving

Dec 13, 2025

TransBridge: Boost 3D Object Detection by Scene-Level Completion with Transformer Decoder

Dec 12, 2025