Picture for Zhibo Yang

Zhibo Yang

Platypus: A Generalized Specialist Model for Reading Text in Various Forms

Add code
Aug 27, 2024
Figure 1 for Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Figure 2 for Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Figure 3 for Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Figure 4 for Platypus: A Generalized Specialist Model for Reading Text in Various Forms
Viaarxiv icon

Look Hear: Gaze Prediction for Speech-directed Human Attention

Add code
Jul 28, 2024
Figure 1 for Look Hear: Gaze Prediction for Speech-directed Human Attention
Figure 2 for Look Hear: Gaze Prediction for Speech-directed Human Attention
Figure 3 for Look Hear: Gaze Prediction for Speech-directed Human Attention
Figure 4 for Look Hear: Gaze Prediction for Speech-directed Human Attention
Viaarxiv icon

Visual Text Generation in the Wild

Add code
Jul 19, 2024
Viaarxiv icon

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

Add code
Mar 28, 2024
Viaarxiv icon

HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

Add code
Mar 20, 2024
Viaarxiv icon

LORE++: Logical Location Regression Network for Table Structure Recognition with Pre-training

Add code
Jan 03, 2024
Viaarxiv icon

Efficient Monaural Speech Enhancement using Spectrum Attention Fusion

Add code
Aug 04, 2023
Viaarxiv icon

Predicting Human Attention using Computational Attention

Add code
Apr 04, 2023
Figure 1 for Predicting Human Attention using Computational Attention
Figure 2 for Predicting Human Attention using Computational Attention
Figure 3 for Predicting Human Attention using Computational Attention
Figure 4 for Predicting Human Attention using Computational Attention
Viaarxiv icon

Modeling Entities as Semantic Points for Visual Information Extraction in the Wild

Add code
Mar 29, 2023
Figure 1 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Figure 2 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Figure 3 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Figure 4 for Modeling Entities as Semantic Points for Visual Information Extraction in the Wild
Viaarxiv icon

Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention

Add code
Mar 27, 2023
Figure 1 for Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention
Figure 2 for Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention
Figure 3 for Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention
Figure 4 for Gazeformer: Scalable, Effective and Fast Prediction of Goal-Directed Human Attention
Viaarxiv icon