Picture for Yiwei Ma

Yiwei Ma

Multi-branch Collaborative Learning Network for 3D Visual Grounding

Add code
Jul 10, 2024
Viaarxiv icon

Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model

Add code
Jul 07, 2024
Viaarxiv icon

AnyTrans: Translate AnyText in the Image with Large Scale Models

Add code
Jun 17, 2024
Figure 1 for AnyTrans: Translate AnyText in the Image with Large Scale Models
Figure 2 for AnyTrans: Translate AnyText in the Image with Large Scale Models
Figure 3 for AnyTrans: Translate AnyText in the Image with Large Scale Models
Figure 4 for AnyTrans: Translate AnyText in the Image with Large Scale Models
Viaarxiv icon

Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval

Add code
Jun 09, 2024
Viaarxiv icon

SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation

Add code
Jun 03, 2024
Viaarxiv icon

Image Captioning via Dynamic Path Customization

Add code
Jun 01, 2024
Viaarxiv icon

X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation

Add code
May 02, 2024
Figure 1 for X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
Figure 2 for X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
Figure 3 for X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
Figure 4 for X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
Viaarxiv icon

Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation

Add code
Dec 19, 2023
Figure 1 for Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
Figure 2 for Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
Figure 3 for Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
Figure 4 for Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
Viaarxiv icon

X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation

Add code
Nov 30, 2023
Figure 1 for X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation
Figure 2 for X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation
Figure 3 for X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation
Figure 4 for X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation
Viaarxiv icon

Semi-Supervised Panoptic Narrative Grounding

Add code
Oct 27, 2023
Figure 1 for Semi-Supervised Panoptic Narrative Grounding
Figure 2 for Semi-Supervised Panoptic Narrative Grounding
Figure 3 for Semi-Supervised Panoptic Narrative Grounding
Figure 4 for Semi-Supervised Panoptic Narrative Grounding
Viaarxiv icon