Picture for Ming-Hsuan Yang

Ming-Hsuan Yang

HoliGS: Holistic Gaussian Splatting for Embodied View Synthesis

Add code
Jun 24, 2025
Viaarxiv icon

CoCo4D: Comprehensive and Complex 4D Scene Generation

Add code
Jun 24, 2025
Viaarxiv icon

Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence

Add code
Jun 16, 2025
Figure 1 for Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence
Figure 2 for Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence
Figure 3 for Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence
Figure 4 for Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence
Viaarxiv icon

InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model

Add code
Jun 12, 2025
Figure 1 for InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model
Figure 2 for InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model
Figure 3 for InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model
Figure 4 for InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model
Viaarxiv icon

DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos

Add code
Jun 11, 2025
Figure 1 for DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos
Figure 2 for DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos
Figure 3 for DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos
Figure 4 for DGS-LRM: Real-Time Deformable 3D Gaussian Reconstruction From Monocular Videos
Viaarxiv icon

RAPID Hand: A Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platform for Generalist Robot Autonomy

Add code
Jun 09, 2025
Figure 1 for RAPID Hand: A Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platform for Generalist Robot Autonomy
Figure 2 for RAPID Hand: A Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platform for Generalist Robot Autonomy
Figure 3 for RAPID Hand: A Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platform for Generalist Robot Autonomy
Figure 4 for RAPID Hand: A Robust, Affordable, Perception-Integrated, Dexterous Manipulation Platform for Generalist Robot Autonomy
Viaarxiv icon

VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos

Add code
Jun 05, 2025
Figure 1 for VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos
Figure 2 for VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos
Figure 3 for VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos
Figure 4 for VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos
Viaarxiv icon

Manifold-aware Representation Learning for Degradation-agnostic Image Restoration

Add code
May 24, 2025
Viaarxiv icon

Self-Supervised and Generalizable Tokenization for CLIP-Based 3D Understanding

Add code
May 24, 2025
Figure 1 for Self-Supervised and Generalizable Tokenization for CLIP-Based 3D Understanding
Figure 2 for Self-Supervised and Generalizable Tokenization for CLIP-Based 3D Understanding
Figure 3 for Self-Supervised and Generalizable Tokenization for CLIP-Based 3D Understanding
Figure 4 for Self-Supervised and Generalizable Tokenization for CLIP-Based 3D Understanding
Viaarxiv icon

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

Add code
May 22, 2025
Viaarxiv icon