Picture for Min-Hung Chen

Min-Hung Chen

MovieCORE: COgnitive REasoning in Movies

Add code
Aug 26, 2025
Viaarxiv icon

Autoregressive Universal Video Segmentation Model

Add code
Aug 26, 2025
Viaarxiv icon

LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos

Add code
Aug 19, 2025
Viaarxiv icon

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning

Add code
Jul 22, 2025
Viaarxiv icon

AuraFusion360: Augmented Unseen Region Alignment for Reference-based 360° Unbounded Scene Inpainting

Add code
Feb 07, 2025
Viaarxiv icon

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Add code
Jan 14, 2025
Viaarxiv icon

CorrFill: Enhancing Faithfulness in Reference-based Inpainting with Correspondence Guidance in Diffusion Models

Add code
Jan 04, 2025
Viaarxiv icon

ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection

Add code
Dec 17, 2024
Figure 1 for ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection
Figure 2 for ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection
Figure 3 for ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection
Figure 4 for ORFormer: Occlusion-Robust Transformer for Accurate Facial Landmark Detection
Viaarxiv icon

Hymba: A Hybrid-head Architecture for Small Language Models

Add code
Nov 20, 2024
Figure 1 for Hymba: A Hybrid-head Architecture for Small Language Models
Figure 2 for Hymba: A Hybrid-head Architecture for Small Language Models
Figure 3 for Hymba: A Hybrid-head Architecture for Small Language Models
Figure 4 for Hymba: A Hybrid-head Architecture for Small Language Models
Viaarxiv icon

EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation

Add code
Oct 28, 2024
Viaarxiv icon