Pheng-Ann Heng

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Apr 13, 2026
Via arXiv

Adapting 2D Multi-Modal Large Language Model for 3D CT Image Analysis

Apr 11, 2026
Via arXiv

Unlocking Positive Transfer in Incrementally Learning Surgical Instruments: A Self-reflection Hierarchical Prompt Framework

Apr 03, 2026
Via arXiv

MME-CoF-Pro: Evaluating Reasoning Coherence in Video Generative Models with Text and Visual Hints

Mar 20, 2026
Via arXiv

An SO(3)-equivariant reciprocal-space neural potential for long-range interactions

Mar 19, 2026
Via arXiv

Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models

Mar 17, 2026
Via arXiv

HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images

Mar 03, 2026
Via arXiv

Preoperative-to-intraoperative Liver Registration for Laparoscopic Surgery via Latent-Grounded Correspondence Constraints

Mar 02, 2026
Via arXiv

EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning

Jan 27, 2026
Via arXiv

IdentityStory: Taming Your Identity-Preserving Generator for Human-Centric Story Generation

Dec 29, 2025
Via arXiv