Alert button
Picture for Yasuhisa Fujii

Yasuhisa Fujii

Alert button

Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

Jan 19, 2024
Zilong Wang, Hao Zhang, Chun-Liang Li, Julian Martin Eisenschlos, Vincent Perot, Zifeng Wang, Lesly Miculicich, Yasuhisa Fujii, Jingbo Shang, Chen-Yu Lee, Tomas Pfister

Viaarxiv icon

Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis

Oct 25, 2023
Shangbang Long, Siyang Qin, Yasuhisa Fujii, Alessandro Bissacco, Michalis Raptis

Viaarxiv icon

OCR Language Models with Custom Vocabularies

Aug 18, 2023
Peter Garst, Reeve Ingle, Yasuhisa Fujii

Viaarxiv icon

Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models

Aug 01, 2023
Cheng-Yu Hsieh, Si-An Chen, Chun-Liang Li, Yasuhisa Fujii, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister

Figure 1 for Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Figure 2 for Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Figure 3 for Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Figure 4 for Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Viaarxiv icon

ICDAR 2023 Competition on Hierarchical Text Detection and Recognition

May 16, 2023
Shangbang Long, Siyang Qin, Dmitry Panteleev, Alessandro Bissacco, Yasuhisa Fujii, Michalis Raptis

Figure 1 for ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
Figure 2 for ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
Figure 3 for ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
Figure 4 for ICDAR 2023 Competition on Hierarchical Text Detection and Recognition
Viaarxiv icon

Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation

May 04, 2023
Renshen Wang, Yasuhisa Fujii, Alessandro Bissacco

Figure 1 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation
Figure 2 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation
Figure 3 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation
Figure 4 for Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation
Viaarxiv icon

FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction

May 04, 2023
Chen-Yu Lee, Chun-Liang Li, Hao Zhang, Timothy Dozat, Vincent Perot, Guolong Su, Xiang Zhang, Kihyuk Sohn, Nikolai Glushnev, Renshen Wang, Joshua Ainslie, Shangbang Long, Siyang Qin, Yasuhisa Fujii, Nan Hua, Tomas Pfister

Figure 1 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Figure 2 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Figure 3 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Figure 4 for FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction
Viaarxiv icon

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

May 03, 2023
Cheng-Yu Hsieh, Chun-Liang Li, Chih-Kuan Yeh, Hootan Nakhost, Yasuhisa Fujii, Alexander Ratner, Ranjay Krishna, Chen-Yu Lee, Tomas Pfister

Figure 1 for Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Figure 2 for Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Figure 3 for Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Figure 4 for Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Viaarxiv icon

Towards End-to-End Unified Scene Text Detection and Layout Analysis

Mar 28, 2022
Shangbang Long, Siyang Qin, Dmitry Panteleev, Alessandro Bissacco, Yasuhisa Fujii, Michalis Raptis

Figure 1 for Towards End-to-End Unified Scene Text Detection and Layout Analysis
Figure 2 for Towards End-to-End Unified Scene Text Detection and Layout Analysis
Figure 3 for Towards End-to-End Unified Scene Text Detection and Layout Analysis
Figure 4 for Towards End-to-End Unified Scene Text Detection and Layout Analysis
Viaarxiv icon