Picture for Can Ma

Can Ma

An Empirical Study on Configuring In-Context Learning Demonstrations for Unleashing MLLMs' Sentimental Perception Capability

Add code
May 22, 2025
Viaarxiv icon

Multi-Modal Molecular Representation Learning via Structure Awareness

Add code
May 09, 2025
Viaarxiv icon

Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition

Add code
Mar 24, 2025
Viaarxiv icon

AS-GCL: Asymmetric Spectral Augmentation on Graph Contrastive Learning

Add code
Feb 19, 2025
Viaarxiv icon

Communication-Efficient Personalized Federal Graph Learning via Low-Rank Decomposition

Add code
Dec 18, 2024
Viaarxiv icon

Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues

Add code
Dec 17, 2024
Figure 1 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 2 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 3 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 4 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Viaarxiv icon

Falcon-UI: Understanding GUI Before Following User Instructions

Add code
Dec 12, 2024
Figure 1 for Falcon-UI: Understanding GUI Before Following User Instructions
Figure 2 for Falcon-UI: Understanding GUI Before Following User Instructions
Figure 3 for Falcon-UI: Understanding GUI Before Following User Instructions
Figure 4 for Falcon-UI: Understanding GUI Before Following User Instructions
Viaarxiv icon

Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation

Add code
Nov 22, 2024
Viaarxiv icon

Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model

Add code
Jul 14, 2024
Figure 1 for Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Figure 2 for Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Figure 3 for Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Figure 4 for Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Viaarxiv icon

Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition

Add code
Jul 09, 2024
Figure 1 for Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition
Figure 2 for Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition
Figure 3 for Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition
Figure 4 for Resolving Sentiment Discrepancy for Multimodal Sentiment Detection via Semantics Completion and Decomposition
Viaarxiv icon