Picture for Yunhai Tong

Yunhai Tong

RAP-SAM: Towards Real-Time All-Purpose Segment Anything

Add code
Jan 18, 2024
Viaarxiv icon

Towards Language-Driven Video Inpainting via Multimodal Large Language Models

Add code
Jan 18, 2024
Figure 1 for Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Figure 2 for Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Figure 3 for Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Figure 4 for Towards Language-Driven Video Inpainting via Multimodal Large Language Models
Viaarxiv icon

DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection

Add code
Oct 02, 2023
Figure 1 for DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection
Figure 2 for DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection
Figure 3 for DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection
Figure 4 for DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection
Viaarxiv icon

Mitigating Semantic Confusion from Hostile Neighborhood for Graph Active Learning

Add code
Aug 17, 2023
Figure 1 for Mitigating Semantic Confusion from Hostile Neighborhood for Graph Active Learning
Figure 2 for Mitigating Semantic Confusion from Hostile Neighborhood for Graph Active Learning
Figure 3 for Mitigating Semantic Confusion from Hostile Neighborhood for Graph Active Learning
Figure 4 for Mitigating Semantic Confusion from Hostile Neighborhood for Graph Active Learning
Viaarxiv icon

Towards Open Vocabulary Learning: A Survey

Add code
Jul 06, 2023
Figure 1 for Towards Open Vocabulary Learning: A Survey
Figure 2 for Towards Open Vocabulary Learning: A Survey
Figure 3 for Towards Open Vocabulary Learning: A Survey
Figure 4 for Towards Open Vocabulary Learning: A Survey
Viaarxiv icon

PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation

Add code
Jan 03, 2023
Figure 1 for PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Figure 2 for PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Figure 3 for PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Figure 4 for PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation
Viaarxiv icon

Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation

Add code
Jan 02, 2023
Figure 1 for Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Figure 2 for Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Figure 3 for Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Figure 4 for Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
Viaarxiv icon

Label-Efficient Interactive Time-Series Anomaly Detection

Add code
Dec 30, 2022
Viaarxiv icon

Convolution-enhanced Evolving Attention Networks

Add code
Dec 16, 2022
Figure 1 for Convolution-enhanced Evolving Attention Networks
Figure 2 for Convolution-enhanced Evolving Attention Networks
Figure 3 for Convolution-enhanced Evolving Attention Networks
Figure 4 for Convolution-enhanced Evolving Attention Networks
Viaarxiv icon

Towards Robust Referring Image Segmentation

Add code
Sep 20, 2022
Figure 1 for Towards Robust Referring Image Segmentation
Figure 2 for Towards Robust Referring Image Segmentation
Figure 3 for Towards Robust Referring Image Segmentation
Figure 4 for Towards Robust Referring Image Segmentation
Viaarxiv icon