Tuo Zhang

A Survey of Foundation Models for Music Understanding

Sep 15, 2024
Via arXiv

ModalityMirror: Improving Audio Classification in Modality Heterogeneity Federated Learning with Multimodal Distillation

Aug 28, 2024
Via arXiv

A Comprehensive Review of Multimodal Large Language Models: Performance and Challenges Across Different Tasks

Aug 02, 2024
Via arXiv

Potential of Multimodal Large Language Models for Data Mining of Medical Images and Free-text Reports

Jul 08, 2024
Via arXiv

Embracing Federated Learning: Enabling Weak Client Participation via Partial Model Training

Jun 21, 2024
Via arXiv

Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding

Jun 14, 2024
Via arXiv

Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers

May 16, 2024
Via arXiv

Eye-gaze Guided Multi-modal Alignment Framework for Radiology

Mar 19, 2024
Via arXiv

Understanding LLMs: A Comprehensive Overview from Training to Inference

Jan 06, 2024
Via arXiv

High Efficiency Inference Accelerating Algorithm for NOMA-based Mobile Edge Computing

Dec 26, 2023
Via arXiv