Picture for Hangjie Yuan

Hangjie Yuan

Knowledge is Power: Advancing Few-shot Action Recognition with Multimodal Semantics from MLLMs

Add code
Mar 27, 2026
Viaarxiv icon

LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation

Add code
Mar 20, 2026
Viaarxiv icon

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning

Add code
Mar 12, 2026
Viaarxiv icon

Why Does RL Generalize Better Than SFT? A Data-Centric Perspective on VLM Post-Training

Add code
Feb 11, 2026
Viaarxiv icon

MCIE: Multimodal LLM-Driven Complex Instruction Image Editing with Spatial Guidance

Add code
Feb 08, 2026
Viaarxiv icon

Continual GUI Agents

Add code
Jan 29, 2026
Viaarxiv icon

CogFlow: Bridging Perception and Reasoning through Knowledge Internalization for Visual Mathematical Problem Solving

Add code
Jan 05, 2026
Viaarxiv icon

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

Add code
Dec 11, 2025
Figure 1 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Figure 2 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Figure 3 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Figure 4 for ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
Viaarxiv icon

SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images

Add code
Nov 09, 2025
Figure 1 for SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images
Figure 2 for SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images
Figure 3 for SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images
Figure 4 for SAMora: Enhancing SAM through Hierarchical Self-Supervised Pre-Training for Medical Images
Viaarxiv icon

OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization

Add code
Aug 29, 2025
Figure 1 for OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
Figure 2 for OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
Figure 3 for OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
Figure 4 for OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
Viaarxiv icon