Picture for Can Ma

Can Ma

Gather and Trace: Rethinking Video TextVQA from an Instance-oriented Perspective

Add code
Aug 06, 2025
Viaarxiv icon

An Empirical Study on Configuring In-Context Learning Demonstrations for Unleashing MLLMs' Sentimental Perception Capability

Add code
May 22, 2025
Viaarxiv icon

Multi-Modal Molecular Representation Learning via Structure Awareness

Add code
May 09, 2025
Viaarxiv icon

Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition

Add code
Mar 24, 2025
Viaarxiv icon

AS-GCL: Asymmetric Spectral Augmentation on Graph Contrastive Learning

Add code
Feb 19, 2025
Viaarxiv icon

Communication-Efficient Personalized Federal Graph Learning via Low-Rank Decomposition

Add code
Dec 18, 2024
Viaarxiv icon

Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues

Add code
Dec 17, 2024
Figure 1 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 2 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 3 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Figure 4 for Track the Answer: Extending TextVQA from Image to Video with Spatio-Temporal Clues
Viaarxiv icon

Falcon-UI: Understanding GUI Before Following User Instructions

Add code
Dec 12, 2024
Figure 1 for Falcon-UI: Understanding GUI Before Following User Instructions
Figure 2 for Falcon-UI: Understanding GUI Before Following User Instructions
Figure 3 for Falcon-UI: Understanding GUI Before Following User Instructions
Figure 4 for Falcon-UI: Understanding GUI Before Following User Instructions
Viaarxiv icon

Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation

Add code
Nov 22, 2024
Figure 1 for Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation
Figure 2 for Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation
Figure 3 for Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation
Figure 4 for Improving Mathematical Reasoning Capabilities of Small Language Models via Feedback-Driven Distillation
Viaarxiv icon

Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model

Add code
Jul 14, 2024
Figure 1 for Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Figure 2 for Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Figure 3 for Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Figure 4 for Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Viaarxiv icon