Picture for Tuo Zhang

Tuo Zhang

A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks

Add code
Aug 02, 2024
Figure 1 for A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks
Viaarxiv icon

Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports

Add code
Jul 08, 2024
Figure 1 for Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports
Figure 2 for Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports
Figure 3 for Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports
Figure 4 for Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports
Viaarxiv icon

Embracing Federated Learning: Enabling Weak Client Participation via Partial Model Training

Add code
Jun 21, 2024
Viaarxiv icon

Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding

Add code
Jun 14, 2024
Viaarxiv icon

Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers

Add code
May 16, 2024
Viaarxiv icon

Eye-gaze Guided Multi-modal Alignment Framework for Radiology

Add code
Mar 19, 2024
Figure 1 for Eye-gaze Guided Multi-modal Alignment Framework for Radiology
Figure 2 for Eye-gaze Guided Multi-modal Alignment Framework for Radiology
Figure 3 for Eye-gaze Guided Multi-modal Alignment Framework for Radiology
Figure 4 for Eye-gaze Guided Multi-modal Alignment Framework for Radiology
Viaarxiv icon

Understanding LLMs: A Comprehensive Overview from Training to Inference

Add code
Jan 06, 2024
Figure 1 for Understanding LLMs: A Comprehensive Overview from Training to Inference
Figure 2 for Understanding LLMs: A Comprehensive Overview from Training to Inference
Figure 3 for Understanding LLMs: A Comprehensive Overview from Training to Inference
Figure 4 for Understanding LLMs: A Comprehensive Overview from Training to Inference
Viaarxiv icon

High Efficiency Inference Accelerating Algorithm for NOMA-based Mobile Edge Computing

Add code
Dec 26, 2023
Viaarxiv icon

The DURel Annotation Tool: Human and Computational Measurement of Semantic Proximity, Sense Clusters and Semantic Change

Add code
Nov 21, 2023
Viaarxiv icon

ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data

Add code
Oct 10, 2023
Viaarxiv icon