Pai Peng

School of Mathematics and Computer Science, Jianghan University

The DAWN of World-Action Interactive Models

May 12, 2026

Clinical Cognition Alignment for Gastrointestinal Diagnosis with Multimodal LLMs

Mar 21, 2026

VisionNVS: Self-Supervised Inpainting for Novel View Synthesis under the Virtual-Shift Paradigm

Mar 18, 2026

WildSVG: Towards Reliable SVG Generation Under Real-Word Conditions

Feb 24, 2026

Evaluating Low-Light Image Enhancement Across Multiple Intensity Levels

Nov 19, 2025

Semantic Causality-Aware Vision-Based 3D Occupancy Prediction

Sep 10, 2025

Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction

Apr 18, 2025

You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving

Feb 28, 2025

Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving

Jan 15, 2025

Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification

Dec 28, 2024