Picture for Liangcai Gao

Liangcai Gao

Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition

Add code
May 29, 2025
Viaarxiv icon

Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?

Add code
May 22, 2025
Viaarxiv icon

DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering

Add code
Mar 20, 2025
Viaarxiv icon

CJEval: A Benchmark for Assessing Large Language Models Using Chinese Junior High School Exam Data

Add code
Sep 25, 2024
Figure 1 for CJEval: A Benchmark for Assessing Large Language Models Using Chinese Junior High School Exam Data
Figure 2 for CJEval: A Benchmark for Assessing Large Language Models Using Chinese Junior High School Exam Data
Figure 3 for CJEval: A Benchmark for Assessing Large Language Models Using Chinese Junior High School Exam Data
Figure 4 for CJEval: A Benchmark for Assessing Large Language Models Using Chinese Junior High School Exam Data
Viaarxiv icon

Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer

Add code
Aug 30, 2024
Figure 1 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 2 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 3 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 4 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Viaarxiv icon

MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models

Add code
Aug 30, 2024
Figure 1 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Figure 2 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Figure 3 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Figure 4 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Viaarxiv icon

DocTabQA: Answering Questions from Long Documents Using Tables

Add code
Aug 21, 2024
Viaarxiv icon

TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition

Add code
Aug 16, 2024
Figure 1 for TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition
Figure 2 for TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition
Figure 3 for TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition
Figure 4 for TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition
Viaarxiv icon

SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis

Add code
Aug 16, 2024
Figure 1 for SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis
Figure 2 for SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis
Figure 3 for SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis
Figure 4 for SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis
Viaarxiv icon

SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection

Add code
Jun 25, 2024
Figure 1 for SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection
Figure 2 for SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection
Figure 3 for SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection
Figure 4 for SegHist: A General Segmentation-based Framework for Chinese Historical Document Text Line Detection
Viaarxiv icon