Pai Peng

School of Mathematics and Computer Science, Jianghan University

Clinical Cognition Alignment for Gastrointestinal Diagnosis with Multimodal LLMs

Mar 21, 2026

VisionNVS: Self-Supervised Inpainting for Novel View Synthesis under the Virtual-Shift Paradigm

Mar 18, 2026

WildSVG: Towards Reliable SVG Generation Under Real-Word Conditions

Feb 24, 2026

Evaluating Low-Light Image Enhancement Across Multiple Intensity Levels

Nov 19, 2025

Semantic Causality-Aware Vision-Based 3D Occupancy Prediction

Sep 10, 2025

Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction

Apr 18, 2025

You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving

Feb 28, 2025

Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving

Jan 15, 2025

Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification

Dec 28, 2024

Light-weight End-to-End Graph Interest Network for CTR Prediction in E-commerce Search

Jun 25, 2024