Picture for Huan Zheng

Huan Zheng

LoViF 2026 The First Challenge on Holistic Quality Assessment for 4D World Model (PhyScore)

Add code
May 06, 2026
Viaarxiv icon

OccDirector: Language-Guided Behavior and Interaction Generation in 4D Occupancy Space

Add code
Apr 24, 2026
Viaarxiv icon

Multimodal Large Language Models for Multi-Subject In-Context Image Generation

Add code
Apr 08, 2026
Viaarxiv icon

Clinical Cognition Alignment for Gastrointestinal Diagnosis with Multimodal LLMs

Add code
Mar 21, 2026
Viaarxiv icon

Towards Geometry-Aware and Motion-Guided Video Human Mesh Recovery

Add code
Jan 29, 2026
Viaarxiv icon

From Human Intention to Action Prediction: A Comprehensive Benchmark for Intention-driven End-to-End Autonomous Driving

Add code
Dec 13, 2025
Viaarxiv icon

Semantic Causality-Aware Vision-Based 3D Occupancy Prediction

Add code
Sep 10, 2025
Figure 1 for Semantic Causality-Aware Vision-Based 3D Occupancy Prediction
Figure 2 for Semantic Causality-Aware Vision-Based 3D Occupancy Prediction
Figure 3 for Semantic Causality-Aware Vision-Based 3D Occupancy Prediction
Figure 4 for Semantic Causality-Aware Vision-Based 3D Occupancy Prediction
Viaarxiv icon

Geometry-aware Temporal Aggregation Network for Monocular 3D Lane Detection

Add code
Apr 29, 2025
Viaarxiv icon

Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction

Add code
Apr 18, 2025
Viaarxiv icon

The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report

Add code
Apr 14, 2025
Viaarxiv icon