Alert button
Picture for Lin Ma

Lin Ma

Alert button

LaSagnA: Language-based Segmentation Assistant for Complex Queries

Add code
Bookmark button
Alert button
Apr 12, 2024
Cong Wei, Haoxian Tan, Yujie Zhong, Yujiu Yang, Lin Ma

Viaarxiv icon

UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection

Add code
Bookmark button
Alert button
Apr 07, 2024
Yingsen Zeng, Yujie Zhong, Chengjian Feng, Lin Ma

Viaarxiv icon

Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models

Add code
Bookmark button
Alert button
Mar 12, 2024
Yang Jiao, Shaoxiang Chen, Zequn Jie, Jingjing Chen, Lin Ma, Yu-Gang Jiang

Figure 1 for Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Figure 2 for Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Figure 3 for Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Figure 4 for Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Viaarxiv icon

Misalignment-Robust Frequency Distribution Loss for Image Transformation

Add code
Bookmark button
Alert button
Feb 28, 2024
Zhangkai Ni, Juncheng Wu, Zian Wang, Wenhan Yang, Hanli Wang, Lin Ma

Viaarxiv icon

Tables as Images? Exploring the Strengths and Limitations of LLMs on Multimodal Representations of Tabular Data

Add code
Bookmark button
Alert button
Feb 23, 2024
Naihao Deng, Zhenjie Sun, Ruiqi He, Aman Sikka, Yulong Chen, Lin Ma, Yue Zhang, Rada Mihalcea

Viaarxiv icon

A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation

Add code
Bookmark button
Alert button
Feb 21, 2024
Yunxin Li, Baotian Hu, Wenhan Luo, Lin Ma, Yuxin Ding, Min Zhang

Viaarxiv icon

InstaGen: Enhancing Object Detection by Training on Synthetic Dataset

Add code
Bookmark button
Alert button
Feb 20, 2024
Chengjian Feng, Yujie Zhong, Zequn Jie, Weidi Xie, Lin Ma

Viaarxiv icon

LLaVA-MoLE: Sparse Mixture of LoRA Experts for Mitigating Data Conflicts in Instruction Finetuning MLLMs

Add code
Bookmark button
Alert button
Jan 30, 2024
Shaoxiang Chen, Zequn Jie, Lin Ma

Viaarxiv icon

MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning

Add code
Bookmark button
Alert button
Jan 24, 2024
Chenyu Wang, Weixin Luo, Qianyu Chen, Haonan Mai, Jindi Guo, Sixun Dong, Xiaohua, Xuan, Zhengxin Li, Lin Ma, Shenghua Gao

Viaarxiv icon