Picture for Yaqiang Wu

Yaqiang Wu

SketchVL: Policy Optimization via Fine-Grained Credit Assignment for Chart Understanding and More

Add code
Jan 09, 2026
Viaarxiv icon

ChartSketcher: Reasoning with Multimodal Feedback and Reflection for Chart Understanding

Add code
May 25, 2025
Viaarxiv icon

Unleashing the Potential of Model Bias for Generalized Category Discovery

Add code
Dec 17, 2024
Figure 1 for Unleashing the Potential of Model Bias for Generalized Category Discovery
Figure 2 for Unleashing the Potential of Model Bias for Generalized Category Discovery
Figure 3 for Unleashing the Potential of Model Bias for Generalized Category Discovery
Figure 4 for Unleashing the Potential of Model Bias for Generalized Category Discovery
Viaarxiv icon

Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models

Add code
Jul 22, 2024
Figure 1 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 2 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 3 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Figure 4 for Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Viaarxiv icon

Transfer and Alignment Network for Generalized Category Discovery

Add code
Dec 27, 2023
Figure 1 for Transfer and Alignment Network for Generalized Category Discovery
Figure 2 for Transfer and Alignment Network for Generalized Category Discovery
Figure 3 for Transfer and Alignment Network for Generalized Category Discovery
Figure 4 for Transfer and Alignment Network for Generalized Category Discovery
Viaarxiv icon

Generalized Category Discovery with Large Language Models in the Loop

Add code
Dec 18, 2023
Figure 1 for Generalized Category Discovery with Large Language Models in the Loop
Figure 2 for Generalized Category Discovery with Large Language Models in the Loop
Figure 3 for Generalized Category Discovery with Large Language Models in the Loop
Figure 4 for Generalized Category Discovery with Large Language Models in the Loop
Viaarxiv icon

GPTR: Gestalt-Perception Transformer for Diagram Object Detection

Add code
Dec 29, 2022
Figure 1 for GPTR: Gestalt-Perception Transformer for Diagram Object Detection
Figure 2 for GPTR: Gestalt-Perception Transformer for Diagram Object Detection
Figure 3 for GPTR: Gestalt-Perception Transformer for Diagram Object Detection
Figure 4 for GPTR: Gestalt-Perception Transformer for Diagram Object Detection
Viaarxiv icon

MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction

Add code
Jun 24, 2021
Figure 1 for MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
Figure 2 for MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
Figure 3 for MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
Figure 4 for MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
Viaarxiv icon

Towards an efficient framework for Data Extraction from Chart Images

Add code
May 05, 2021
Figure 1 for Towards an efficient framework for Data Extraction from Chart Images
Figure 2 for Towards an efficient framework for Data Extraction from Chart Images
Figure 3 for Towards an efficient framework for Data Extraction from Chart Images
Figure 4 for Towards an efficient framework for Data Extraction from Chart Images
Viaarxiv icon

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

Add code
Jan 24, 2021
Figure 1 for Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Figure 2 for Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Figure 3 for Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Figure 4 for Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution
Viaarxiv icon