Alert button
Picture for Ziyu Guo

Ziyu Guo

Alert button

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Mar 21, 2024
Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li

Viaarxiv icon

LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery

Feb 26, 2024
Kexin Chen, Yuyang Du, Tao You, Mobarakol Islam, Ziyu Guo, Yueming Jin, Guangyong Chen, Pheng-Ann Heng

Viaarxiv icon

SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning

Jan 22, 2024
Hao Chen, Jiaze Wang, Ziyu Guo, Jinpeng Li, Donghao Zhou, Bian Wu, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng

Viaarxiv icon

HIGT: Hierarchical Interaction Graph-Transformer for Whole Slide Image Analysis

Sep 14, 2023
Ziyu Guo, Weiqin Zhao, Shujun Wang, Lequan Yu

Viaarxiv icon

ImageBind-LLM: Multi-modality Instruction Tuning

Sep 11, 2023
Jiaming Han, Renrui Zhang, Wenqi Shao, Peng Gao, Peng Xu, Han Xiao, Kaipeng Zhang, Chris Liu, Song Wen, Ziyu Guo, Xudong Lu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Xiangyu Yue, Hongsheng Li, Yu Qiao

Figure 1 for ImageBind-LLM: Multi-modality Instruction Tuning
Figure 2 for ImageBind-LLM: Multi-modality Instruction Tuning
Figure 3 for ImageBind-LLM: Multi-modality Instruction Tuning
Figure 4 for ImageBind-LLM: Multi-modality Instruction Tuning
Viaarxiv icon

Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following

Sep 01, 2023
Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng

Figure 1 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Figure 2 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Figure 3 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Figure 4 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Viaarxiv icon

Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks

Aug 24, 2023
Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Hao Dong, Peng Gao

Viaarxiv icon

Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation

May 25, 2023
Shilin Yan, Renrui Zhang, Ziyu Guo, Wenchao Chen, Wei Zhang, Hongyang Li, Yu Qiao, Zhongjiang He, Peng Gao

Figure 1 for Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Figure 2 for Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Figure 3 for Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Figure 4 for Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation
Viaarxiv icon

Personalize Segment Anything Model with One Shot

May 04, 2023
Renrui Zhang, Zhengkai Jiang, Ziyu Guo, Shilin Yan, Junting Pan, Hao Dong, Peng Gao, Hongsheng Li

Figure 1 for Personalize Segment Anything Model with One Shot
Figure 2 for Personalize Segment Anything Model with One Shot
Figure 3 for Personalize Segment Anything Model with One Shot
Figure 4 for Personalize Segment Anything Model with One Shot
Viaarxiv icon