Picture for Minghui Liao

Minghui Liao

TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models

Add code
Apr 14, 2024
Figure 1 for TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
Figure 2 for TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
Figure 3 for TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
Figure 4 for TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models
Viaarxiv icon

Android in the Zoo: Chain-of-Action-Thought for GUI Agents

Add code
Mar 05, 2024
Figure 1 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Figure 2 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Figure 3 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Figure 4 for Android in the Zoo: Chain-of-Action-Thought for GUI Agents
Viaarxiv icon

Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition

Add code
Feb 24, 2024
Figure 1 for Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Figure 2 for Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Figure 3 for Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Figure 4 for Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition
Viaarxiv icon

Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition

Add code
Feb 21, 2024
Figure 1 for Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Figure 2 for Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Figure 3 for Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Figure 4 for Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition
Viaarxiv icon

Joint Learning Neuronal Skeleton and Brain Circuit Topology with Permutation Invariant Encoders for Neuron Classification

Add code
Dec 22, 2023
Viaarxiv icon

Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach

Add code
Aug 21, 2023
Figure 1 for Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Figure 2 for Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Figure 3 for Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Figure 4 for Self-distillation Regularized Connectionist Temporal Classification Loss for Text Recognition: A Simple Yet Effective Approach
Viaarxiv icon

Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition

Add code
Jul 01, 2022
Figure 1 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Figure 2 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Figure 3 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Figure 4 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Viaarxiv icon

Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition

Add code
Mar 23, 2022
Figure 1 for Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition
Viaarxiv icon

Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion

Add code
Feb 21, 2022
Figure 1 for Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion
Figure 2 for Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion
Figure 3 for Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion
Figure 4 for Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion
Viaarxiv icon

SGEN: Single-cell Sequencing Graph Self-supervised Embedding Network

Add code
Oct 15, 2021
Figure 1 for SGEN: Single-cell Sequencing Graph Self-supervised Embedding Network
Figure 2 for SGEN: Single-cell Sequencing Graph Self-supervised Embedding Network
Figure 3 for SGEN: Single-cell Sequencing Graph Self-supervised Embedding Network
Figure 4 for SGEN: Single-cell Sequencing Graph Self-supervised Embedding Network
Viaarxiv icon