Alert button
Picture for Yiwei Ma

Yiwei Ma

Alert button

Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation

Add code
Bookmark button
Alert button
Dec 19, 2023
Sihan Liu, Yiwei Ma, Xiaoqing Zhang, Haowei Wang, Jiayi Ji, Xiaoshuai Sun, Rongrong Ji

Viaarxiv icon

X-Dreamer: Creating High-quality 3D Content by Bridging the Domain Gap Between Text-to-2D and Text-to-3D Generation

Add code
Bookmark button
Alert button
Nov 30, 2023
Yiwei Ma, Yijun Fan, Jiayi Ji, Haowei Wang, Xiaoshuai Sun, Guannan Jiang, Annan Shu, Rongrong Ji

Viaarxiv icon

Semi-Supervised Panoptic Narrative Grounding

Add code
Bookmark button
Alert button
Oct 27, 2023
Danni Yang, Jiayi Ji, Xiaoshuai Sun, Haowei Wang, Yinan Li, Yiwei Ma, Rongrong Ji

Viaarxiv icon

JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues

Add code
Bookmark button
Alert button
Oct 20, 2023
Jiayi Ji, Haowei Wang, Changli Wu, Yiwei Ma, Xiaoshuai Sun, Rongrong Ji

Viaarxiv icon

3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation

Add code
Bookmark button
Alert button
Aug 31, 2023
Changli Wu, Yiwei Ma, Qi Chen, Haowei Wang, Gen Luo, Jiayi Ji, Xiaoshuai Sun

Figure 1 for 3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation
Figure 2 for 3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation
Figure 3 for 3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation
Figure 4 for 3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation
Viaarxiv icon

Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation

Add code
Bookmark button
Alert button
Aug 06, 2023
Haowei Wang, Jiji Tang, Jiayi Ji, Xiaoshuai Sun, Rongsheng Zhang, Yiwei Ma, Minda Zhao, Lincheng Li, zeng zhao, Tangjie Lv, Rongrong Ji

Figure 1 for Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
Figure 2 for Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
Figure 3 for Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
Figure 4 for Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation
Viaarxiv icon

X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance

Add code
Bookmark button
Alert button
Mar 28, 2023
Yiwei Ma, Xiaioqing Zhang, Xiaoshuai Sun, Jiayi Ji, Haowei Wang, Guannan Jiang, Weilin Zhuang, Rongrong Ji

Figure 1 for X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance
Figure 2 for X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance
Figure 3 for X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance
Figure 4 for X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance
Viaarxiv icon

Towards Local Visual Modeling for Image Captioning

Add code
Bookmark button
Alert button
Feb 13, 2023
Yiwei Ma, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Rongrong Ji

Figure 1 for Towards Local Visual Modeling for Image Captioning
Figure 2 for Towards Local Visual Modeling for Image Captioning
Figure 3 for Towards Local Visual Modeling for Image Captioning
Figure 4 for Towards Local Visual Modeling for Image Captioning
Viaarxiv icon

X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval

Add code
Bookmark button
Alert button
Jul 15, 2022
Yiwei Ma, Guohai Xu, Xiaoshuai Sun, Ming Yan, Ji Zhang, Rongrong Ji

Figure 1 for X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval
Figure 2 for X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval
Figure 3 for X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval
Figure 4 for X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval
Viaarxiv icon