Alert button
Picture for Zehuan Yuan

Zehuan Yuan

Alert button

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models

Add code
Bookmark button
Alert button
Apr 19, 2024
Chuofan Ma, Yi Jiang, Jiannan Wu, Zehuan Yuan, Xiaojuan Qi

Viaarxiv icon

Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction

Add code
Bookmark button
Alert button
Apr 03, 2024
Keyu Tian, Yi Jiang, Zehuan Yuan, Bingyue Peng, Liwei Wang

Viaarxiv icon

Generative Region-Language Pretraining for Open-Ended Object Detection

Add code
Bookmark button
Alert button
Mar 15, 2024
Chuang Lin, Yi Jiang, Lizhen Qu, Zehuan Yuan, Jianfei Cai

Figure 1 for Generative Region-Language Pretraining for Open-Ended Object Detection
Figure 2 for Generative Region-Language Pretraining for Open-Ended Object Detection
Figure 3 for Generative Region-Language Pretraining for Open-Ended Object Detection
Figure 4 for Generative Region-Language Pretraining for Open-Ended Object Detection
Viaarxiv icon

UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces

Add code
Bookmark button
Alert button
Dec 25, 2023
Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo

Viaarxiv icon

General Object Foundation Model for Images and Videos at Scale

Add code
Bookmark button
Alert button
Dec 14, 2023
Junfeng Wu, Yi Jiang, Qihao Liu, Zehuan Yuan, Xiang Bai, Song Bai

Viaarxiv icon

Recognize Any Regions

Add code
Bookmark button
Alert button
Nov 02, 2023
Haosen Yang, Chuofan Ma, Bin Wen, Yi Jiang, Zehuan Yuan, Xiatian Zhu

Viaarxiv icon

CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection

Add code
Bookmark button
Alert button
Oct 25, 2023
Chuofan Ma, Yi Jiang, Xin Wen, Zehuan Yuan, Xiaojuan Qi

Figure 1 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 2 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 3 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Figure 4 for CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
Viaarxiv icon

EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE

Add code
Bookmark button
Alert button
Aug 23, 2023
Junyi Chen, Longteng Guo, Jia Sun, Shuai Shao, Zehuan Yuan, Liang Lin, Dongyu Zhang

Figure 1 for EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
Figure 2 for EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
Figure 3 for EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
Figure 4 for EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE
Viaarxiv icon

Exploring Transformers for Open-world Instance Segmentation

Add code
Bookmark button
Alert button
Aug 08, 2023
Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo

Figure 1 for Exploring Transformers for Open-world Instance Segmentation
Figure 2 for Exploring Transformers for Open-world Instance Segmentation
Figure 3 for Exploring Transformers for Open-world Instance Segmentation
Figure 4 for Exploring Transformers for Open-world Instance Segmentation
Viaarxiv icon

ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst

Add code
Bookmark button
Alert button
May 25, 2023
Zijia Zhao, Longteng Guo, Tongtian Yue, Sihan Chen, Shuai Shao, Xinxin Zhu, Zehuan Yuan, Jing Liu

Figure 1 for ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst
Figure 2 for ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst
Figure 3 for ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst
Figure 4 for ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst
Viaarxiv icon