Yujin Han

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation

May 19, 2026
Via arXiv

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

Dec 11, 2025
Via arXiv

Self-Contradiction as Self-Improvement: Mitigating the Generation-Understanding Gap in MLLMs

Jul 22, 2025
Via arXiv

Corruption-Aware Training of Latent Video Diffusion Models for Robust Text-to-Video Generation

May 24, 2025
Via arXiv

Capturing Conditional Dependence via Auto-regressive Diffusion Models

Apr 30, 2025
Via arXiv

Can Diffusion Models Learn Hidden Inter-Feature Rules Behind Images?

Feb 07, 2025
Via arXiv

Masked Autoencoders Are Effective Tokenizers for Diffusion Models

Feb 05, 2025
Via arXiv

Parallelized Autoregressive Visual Generation

Dec 19, 2024
Via arXiv

Beyond Surface Structure: A Causal Assessment of LLMs' Comprehension Ability

Nov 29, 2024
Via arXiv

Slight Corruption in Pre-training Data Makes Better Diffusion Models

May 30, 2024
Via arXiv