Zhimin Chen

Pangu-ACE: Adaptive Cascaded Experts for Educational Response Generation on EduBench

Apr 16, 2026

Bridging the Micro–Macro Gap: Frequency-Aware Semantic Alignment for Image Manipulation Localization

Apr 14, 2026

Geometry-Aware Localized Watermarking for Copyright Protection in Embedding-as-a-Service

Apr 13, 2026

OmniSonic: Towards Universal and Holistic Audio Generation from Video and Text

Apr 06, 2026

REL-SF4PASS: Panoramic Semantic Segmentation with REL Depth Representation and Spherical Fusion

Jan 23, 2026

What Happens Next? Next Scene Prediction with a Unified Video Model

Dec 15, 2025

Limits To (Machine) Learning

Dec 14, 2025

VIDEOP2R: Video Understanding from Perception to Reasoning

Nov 14, 2025

DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning

Nov 13, 2024

SAM-Guided Masked Token Prediction for 3D Scene Understanding

Oct 17, 2024