Picture for Jun Xiao

Jun Xiao

Stephanie2: Thinking, Waiting, and Making Decisions Like Humans in Step-by-Step AI Social Chat

Add code
Jan 09, 2026
Viaarxiv icon

CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents

Add code
Jan 05, 2026
Viaarxiv icon

AnyMS: Bottom-up Attention Decoupling for Layout-guided and Training-free Multi-subject Customization

Add code
Dec 29, 2025
Viaarxiv icon

$\text{H}^2$em: Learning Hierarchical Hyperbolic Embeddings for Compositional Zero-Shot Learning

Add code
Dec 23, 2025
Viaarxiv icon

OmniMoGen: Unifying Human Motion Generation via Learning from Interleaved Text-Motion Instructions

Add code
Dec 22, 2025
Viaarxiv icon

SegGraph: Leveraging Graphs of SAM Segments for Few-Shot 3D Part Segmentation

Add code
Dec 18, 2025
Viaarxiv icon

FlowDC: Flow-Based Decoupling-Decay for Complex Image Editing

Add code
Dec 12, 2025
Figure 1 for FlowDC: Flow-Based Decoupling-Decay for Complex Image Editing
Figure 2 for FlowDC: Flow-Based Decoupling-Decay for Complex Image Editing
Figure 3 for FlowDC: Flow-Based Decoupling-Decay for Complex Image Editing
Figure 4 for FlowDC: Flow-Based Decoupling-Decay for Complex Image Editing
Viaarxiv icon

Adaptive Begin-of-Video Tokens for Autoregressive Video Diffusion Models

Add code
Nov 15, 2025
Figure 1 for Adaptive Begin-of-Video Tokens for Autoregressive Video Diffusion Models
Figure 2 for Adaptive Begin-of-Video Tokens for Autoregressive Video Diffusion Models
Figure 3 for Adaptive Begin-of-Video Tokens for Autoregressive Video Diffusion Models
Figure 4 for Adaptive Begin-of-Video Tokens for Autoregressive Video Diffusion Models
Viaarxiv icon

CoMo: Compositional Motion Customization for Text-to-Video Generation

Add code
Oct 27, 2025
Viaarxiv icon

Asynchronous Denoising Diffusion Models for Aligning Text-to-Image Generation

Add code
Oct 06, 2025
Viaarxiv icon