Picture for Ming-Hsuan Yang

Ming-Hsuan Yang

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation

Add code
Sep 09, 2024
Viaarxiv icon

Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS

Add code
Aug 29, 2024
Figure 1 for Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS
Figure 2 for Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS
Figure 3 for Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS
Viaarxiv icon

Re-boosting Self-Collaboration Parallel Prompt GAN for Unsupervised Image Restoration

Add code
Aug 17, 2024
Viaarxiv icon

Cropper: Vision-Language Model for Image Cropping through In-Context Learning

Add code
Aug 14, 2024
Viaarxiv icon

LLAVADI: What Matters For Multimodal Large Language Models Distillation

Add code
Jul 28, 2024
Figure 1 for LLAVADI: What Matters For Multimodal Large Language Models Distillation
Figure 2 for LLAVADI: What Matters For Multimodal Large Language Models Distillation
Figure 3 for LLAVADI: What Matters For Multimodal Large Language Models Distillation
Figure 4 for LLAVADI: What Matters For Multimodal Large Language Models Distillation
Viaarxiv icon

Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Add code
Jul 10, 2024
Figure 1 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 2 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 3 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 4 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Viaarxiv icon

Learning Spatial-Semantic Features for Robust Video Object Segmentation

Add code
Jul 10, 2024
Figure 1 for Learning Spatial-Semantic Features for Robust Video Object Segmentation
Figure 2 for Learning Spatial-Semantic Features for Robust Video Object Segmentation
Figure 3 for Learning Spatial-Semantic Features for Robust Video Object Segmentation
Figure 4 for Learning Spatial-Semantic Features for Robust Video Object Segmentation
Viaarxiv icon

Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model

Add code
Jun 27, 2024
Figure 1 for Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
Figure 2 for Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
Figure 3 for Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
Figure 4 for Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model
Viaarxiv icon

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

Add code
Jun 24, 2024
Viaarxiv icon

1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation

Add code
Jun 07, 2024
Figure 1 for 1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation
Figure 2 for 1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation
Figure 3 for 1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation
Figure 4 for 1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation
Viaarxiv icon