Picture for Ziyu Guo

Ziyu Guo

MAVIS: Mathematical Visual Instruction Tuning

Add code
Jul 11, 2024
Viaarxiv icon

TripletMix: Triplet Data Augmentation for 3D Understanding

Add code
May 28, 2024
Viaarxiv icon

No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation

Add code
Apr 05, 2024
Viaarxiv icon

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

Add code
Mar 21, 2024
Figure 1 for MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Figure 2 for MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Figure 3 for MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Figure 4 for MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Viaarxiv icon

LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery

Add code
Feb 26, 2024
Figure 1 for LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery
Figure 2 for LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery
Figure 3 for LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery
Figure 4 for LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery
Viaarxiv icon

SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning

Add code
Jan 22, 2024
Figure 1 for SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning
Figure 2 for SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning
Figure 3 for SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning
Figure 4 for SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning
Viaarxiv icon

HIGT: Hierarchical Interaction Graph-Transformer for Whole Slide Image Analysis

Add code
Sep 14, 2023
Viaarxiv icon

ImageBind-LLM: Multi-modality Instruction Tuning

Add code
Sep 11, 2023
Figure 1 for ImageBind-LLM: Multi-modality Instruction Tuning
Figure 2 for ImageBind-LLM: Multi-modality Instruction Tuning
Figure 3 for ImageBind-LLM: Multi-modality Instruction Tuning
Figure 4 for ImageBind-LLM: Multi-modality Instruction Tuning
Viaarxiv icon

Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following

Add code
Sep 01, 2023
Figure 1 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Figure 2 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Figure 3 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Figure 4 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Viaarxiv icon

Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks

Add code
Aug 24, 2023
Viaarxiv icon