Picture for Lin Ma

Lin Ma

UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection

Add code
Apr 07, 2024
Viaarxiv icon

Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models

Add code
Mar 12, 2024
Viaarxiv icon

Misalignment-Robust Frequency Distribution Loss for Image Transformation

Add code
Feb 28, 2024
Viaarxiv icon

Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data

Add code
Feb 23, 2024
Figure 1 for Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data
Figure 2 for Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data
Figure 3 for Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data
Figure 4 for Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data
Viaarxiv icon

A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation

Add code
Feb 21, 2024
Viaarxiv icon

InstaGen: Enhancing Object Detection by Training on Synthetic Dataset

Add code
Feb 20, 2024
Viaarxiv icon

LLaVA-MoLE: Sparse Mixture of LoRA Experts for Mitigating Data Conflicts in Instruction Finetuning MLLMs

Add code
Jan 30, 2024
Viaarxiv icon

MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning

Add code
Jan 24, 2024
Figure 1 for MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning
Figure 2 for MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning
Figure 3 for MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning
Figure 4 for MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning
Viaarxiv icon

Computing in the Era of Large Generative Models: From Cloud-Native to AI-Native

Add code
Jan 17, 2024
Figure 1 for Computing in the Era of Large Generative Models: From Cloud-Native to AI-Native
Figure 2 for Computing in the Era of Large Generative Models: From Cloud-Native to AI-Native
Viaarxiv icon

ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field

Add code
Dec 15, 2023
Figure 1 for ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field
Figure 2 for ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field
Figure 3 for ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field
Figure 4 for ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field
Viaarxiv icon