Hong Chang

AnyMo: Scaling Any-Modality Conditional Motion Generation with Masked Modeling

May 28, 2026
Via arXiv

Component-Based Out-of-Distribution Detection

Apr 23, 2026
Via arXiv

EgoMotion: Hierarchical Reasoning and Diffusion for Egocentric Vision-Language Motion Generation

Apr 21, 2026
Via arXiv

LensWalk: Agentic Video Understanding by Planning How You See in Videos

Mar 25, 2026
Via arXiv

DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning

Jan 29, 2026
Via arXiv

CLIP-Guided Adaptable Self-Supervised Learning for Human-Centric Visual Tasks

Jan 19, 2026
Via arXiv

Revisiting Multimodal Positional Encoding in Vision-Language Models

Oct 27, 2025
Via arXiv

un$^2$CLIP: Improving CLIP's Visual Detail Capturing Ability via Inverting unCLIP

May 30, 2025
Via arXiv

DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks

Apr 24, 2025
Via arXiv

HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding

Mar 17, 2025
Via arXiv