Mengchen Liu

Stephen

InfoChartQA: A Benchmark for Multimodal Question Answering on Infographic Charts

May 25, 2025

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Mar 03, 2025

Benchmarking Large and Small MLLMs

Jan 04, 2025

ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities

Oct 08, 2024

SynChart: Synthesizing Charts from Language Models

Sep 25, 2024

On Pre-training of Multimodal Language Models Customized for Chart Understanding

Jul 19, 2024

Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search

Mar 15, 2024

An Evaluation of GPT-4V and Gemini in Online VQA

Dec 17, 2023

Fully Authentic Visual Question Answering Dataset from Online Communities

Nov 27, 2023

On the Hidden Waves of Image

Oct 19, 2023