Picture for Min-Hung Chen

Min-Hung Chen

TC-LoRA: Temporally Modulated Conditional LoRA for Adaptive Diffusion Control

Add code
Oct 10, 2025
Figure 1 for TC-LoRA: Temporally Modulated Conditional LoRA for Adaptive Diffusion Control
Figure 2 for TC-LoRA: Temporally Modulated Conditional LoRA for Adaptive Diffusion Control
Figure 3 for TC-LoRA: Temporally Modulated Conditional LoRA for Adaptive Diffusion Control
Figure 4 for TC-LoRA: Temporally Modulated Conditional LoRA for Adaptive Diffusion Control
Viaarxiv icon

Temporal Prompting Matters: Rethinking Referring Video Object Segmentation

Add code
Oct 08, 2025
Viaarxiv icon

MovieCORE: COgnitive REasoning in Movies

Add code
Aug 26, 2025
Viaarxiv icon

Autoregressive Universal Video Segmentation Model

Add code
Aug 26, 2025
Viaarxiv icon

LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos

Add code
Aug 19, 2025
Viaarxiv icon

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Add code
Jul 22, 2025
Viaarxiv icon

AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting

Add code
Feb 07, 2025
Viaarxiv icon

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Add code
Jan 14, 2025
Viaarxiv icon

CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models

Add code
Jan 04, 2025
Viaarxiv icon

ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection

Add code
Dec 17, 2024
Figure 1 for ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection
Figure 2 for ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection
Figure 3 for ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection
Figure 4 for ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection
Viaarxiv icon