Picture for Ming-Hsuan Yang

Ming-Hsuan Yang

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

Add code
Oct 09, 2023
Viaarxiv icon

Module-wise Adaptive Distillation for Multimodality Foundation Models

Add code
Oct 06, 2023
Figure 1 for Module-wise Adaptive Distillation for Multimodality Foundation Models
Figure 2 for Module-wise Adaptive Distillation for Multimodality Foundation Models
Figure 3 for Module-wise Adaptive Distillation for Multimodality Foundation Models
Figure 4 for Module-wise Adaptive Distillation for Multimodality Foundation Models
Viaarxiv icon

Video Timeline Modeling For News Story Understanding

Add code
Sep 23, 2023
Figure 1 for Video Timeline Modeling For News Story Understanding
Figure 2 for Video Timeline Modeling For News Story Understanding
Figure 3 for Video Timeline Modeling For News Story Understanding
Figure 4 for Video Timeline Modeling For News Story Understanding
Viaarxiv icon

SAMPLING: Scene-adaptive Hierarchical Multiplane Images Representation for Novel View Synthesis from a Single Image

Add code
Sep 13, 2023
Viaarxiv icon

Text-driven Editing of 3D Scenes without Retraining

Add code
Sep 10, 2023
Figure 1 for Text-driven Editing of 3D Scenes without Retraining
Figure 2 for Text-driven Editing of 3D Scenes without Retraining
Figure 3 for Text-driven Editing of 3D Scenes without Retraining
Figure 4 for Text-driven Editing of 3D Scenes without Retraining
Viaarxiv icon

CiteTracker: Correlating Image and Text for Visual Tracking

Add code
Aug 22, 2023
Figure 1 for CiteTracker: Correlating Image and Text for Visual Tracking
Figure 2 for CiteTracker: Correlating Image and Text for Visual Tracking
Figure 3 for CiteTracker: Correlating Image and Text for Visual Tracking
Figure 4 for CiteTracker: Correlating Image and Text for Visual Tracking
Viaarxiv icon

Delving into Motion-Aware Matching for Monocular 3D Object Tracking

Add code
Aug 22, 2023
Viaarxiv icon

Dual Associated Encoder for Face Restoration

Add code
Aug 14, 2023
Figure 1 for Dual Associated Encoder for Face Restoration
Figure 2 for Dual Associated Encoder for Face Restoration
Figure 3 for Dual Associated Encoder for Face Restoration
Figure 4 for Dual Associated Encoder for Face Restoration
Viaarxiv icon

Foundational Models Defining a New Era in Vision: A Survey and Outlook

Add code
Jul 25, 2023
Figure 1 for Foundational Models Defining a New Era in Vision: A Survey and Outlook
Figure 2 for Foundational Models Defining a New Era in Vision: A Survey and Outlook
Figure 3 for Foundational Models Defining a New Era in Vision: A Survey and Outlook
Figure 4 for Foundational Models Defining a New Era in Vision: A Survey and Outlook
Viaarxiv icon

CLR: Channel-wise Lightweight Reprogramming for Continual Learning

Add code
Jul 21, 2023
Figure 1 for CLR: Channel-wise Lightweight Reprogramming for Continual Learning
Figure 2 for CLR: Channel-wise Lightweight Reprogramming for Continual Learning
Figure 3 for CLR: Channel-wise Lightweight Reprogramming for Continual Learning
Figure 4 for CLR: Channel-wise Lightweight Reprogramming for Continual Learning
Viaarxiv icon