Picture for Huan Yang

Huan Yang

Depatment of Gastroenterology, Second Affiliated Hospital, Army Medical University

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Add code
Dec 12, 2025
Viaarxiv icon

ContextDrag: Precise Drag-Based Image Editing via Context-Preserving Token Injection and Position-Consistent Attention

Add code
Dec 09, 2025
Viaarxiv icon

DialogGraph-LLM: Graph-Informed LLMs for End-to-End Audio Dialogue Intent Recognition

Add code
Nov 17, 2025
Viaarxiv icon

Align-then-Slide: A complete evaluation framework for Ultra-Long Document-Level Machine Translation

Add code
Sep 04, 2025
Viaarxiv icon

Mod-Adapter: Tuning-Free and Versatile Multi-concept Personalization via Modulation Adapter

Add code
May 24, 2025
Viaarxiv icon

MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing

Add code
May 22, 2025
Viaarxiv icon

KVShare: Semantic-Aware Key-Value Cache Sharing for Efficient Large Language Model Inference

Add code
Mar 17, 2025
Figure 1 for KVShare: Semantic-Aware Key-Value Cache Sharing for Efficient Large Language Model Inference
Figure 2 for KVShare: Semantic-Aware Key-Value Cache Sharing for Efficient Large Language Model Inference
Figure 3 for KVShare: Semantic-Aware Key-Value Cache Sharing for Efficient Large Language Model Inference
Figure 4 for KVShare: Semantic-Aware Key-Value Cache Sharing for Efficient Large Language Model Inference
Viaarxiv icon

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Add code
Mar 14, 2025
Figure 1 for Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
Figure 2 for Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
Figure 3 for Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
Figure 4 for Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
Viaarxiv icon

Accelerating Video Diffusion Models via Distribution Matching

Add code
Dec 08, 2024
Figure 1 for Accelerating Video Diffusion Models via Distribution Matching
Figure 2 for Accelerating Video Diffusion Models via Distribution Matching
Figure 3 for Accelerating Video Diffusion Models via Distribution Matching
Figure 4 for Accelerating Video Diffusion Models via Distribution Matching
Viaarxiv icon

Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation

Add code
Dec 02, 2024
Figure 1 for Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
Figure 2 for Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
Figure 3 for Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
Figure 4 for Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
Viaarxiv icon