Picture for Jiang Gui

Jiang Gui

Music's Multimodal Complexity in AVQA: Why We Need More than General Multimodal LLMs

Add code
May 27, 2025
Viaarxiv icon

Assessing and Mitigating Medical Knowledge Drift and Conflicts in Large Language Models

Add code
May 12, 2025
Viaarxiv icon

Learning Musical Representations for Music Performance Question Answering

Add code
Feb 10, 2025
Viaarxiv icon

Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding

Add code
Feb 09, 2025
Figure 1 for Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding
Figure 2 for Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding
Figure 3 for Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding
Figure 4 for Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding
Viaarxiv icon

Residual Kolmogorov-Arnold Network for Enhanced Deep Learning

Add code
Oct 07, 2024
Viaarxiv icon

Memory-Efficient Sparse Pyramid Attention Networks for Whole Slide Image Analysis

Add code
Jun 13, 2024
Viaarxiv icon