Yoonsik Kim

TableVQA-Bench: A Visual Question Answering Benchmark on Multiple Table Domains

Apr 30, 2024

HyperCLOVA X Technical Report

Apr 13, 2024

SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap

Sep 21, 2023

Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models

May 24, 2023

Towards Unified Scene Text Spotting based on Sequence Generation

Apr 07, 2023

Technical Report on Web-based Visual Corpus Construction for Visual Document Understanding

Nov 07, 2022

DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting

Mar 10, 2022

Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features

Nov 30, 2021

RewriteNet: Realistic Scene Text Image Generation via Editing Text in Real-world Image

Jul 23, 2021

SynthTIGER: Synthetic Text Image GEneratoR Towards Better Text Recognition Models

Jul 20, 2021