Alert button
Picture for Geewook Kim

Geewook Kim

Alert button

Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation

Jan 12, 2024
Seongyun Lee, Seungone Kim, Sue Hyun Park, Geewook Kim, Minjoon Seo

Viaarxiv icon

SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap

Sep 21, 2023
Daehee Kim, Yoonsik Kim, DongHyun Kim, Yumin Lim, Geewook Kim, Taeho Kil

Figure 1 for SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap
Figure 2 for SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap
Figure 3 for SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap
Figure 4 for SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap
Viaarxiv icon

Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models

May 24, 2023
Geewook Kim, Hodong Lee, Daehee Kim, Haeji Jung, Sanghee Park, Yoonsik Kim, Sangdoo Yun, Taeho Kil, Bado Lee, Seunghyun Park

Figure 1 for Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
Figure 2 for Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
Figure 3 for Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
Figure 4 for Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
Viaarxiv icon

Technical Report on Web-based Visual Corpus Construction for Visual Document Understanding

Nov 07, 2022
Donghyun Kim, Teakgyu Hong, Moonbin Yim, Yoonsik Kim, Geewook Kim

Figure 1 for Technical Report on Web-based Visual Corpus Construction for Visual Document Understanding
Figure 2 for Technical Report on Web-based Visual Corpus Construction for Visual Document Understanding
Figure 3 for Technical Report on Web-based Visual Corpus Construction for Visual Document Understanding
Figure 4 for Technical Report on Web-based Visual Corpus Construction for Visual Document Understanding
Viaarxiv icon

Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching

Feb 23, 2022
Geewook Kim, Wonseok Hwang, Minjoon Seo, Seunghyun Park

Figure 1 for Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching
Figure 2 for Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching
Figure 3 for Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching
Figure 4 for Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching
Viaarxiv icon

Donut: Document Understanding Transformer without OCR

Nov 30, 2021
Geewook Kim, Teakgyu Hong, Moonbin Yim, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park

Figure 1 for Donut: Document Understanding Transformer without OCR
Figure 2 for Donut: Document Understanding Transformer without OCR
Figure 3 for Donut: Document Understanding Transformer without OCR
Figure 4 for Donut: Document Understanding Transformer without OCR
Viaarxiv icon

Revisiting Additive Compositionality: AND, OR and NOT Operations with Word Embeddings

May 18, 2021
Masahiro Naito, Sho Yokoi, Geewook Kim, Hidetoshi Shimodaira

Figure 1 for Revisiting Additive Compositionality: AND, OR and NOT Operations with Word Embeddings
Figure 2 for Revisiting Additive Compositionality: AND, OR and NOT Operations with Word Embeddings
Figure 3 for Revisiting Additive Compositionality: AND, OR and NOT Operations with Word Embeddings
Figure 4 for Revisiting Additive Compositionality: AND, OR and NOT Operations with Word Embeddings
Viaarxiv icon

Cost-effective End-to-end Information Extraction for Semi-structured Document Images

Apr 16, 2021
Wonseok Hwang, Hyunji Lee, Jinyeong Yim, Geewook Kim, Minjoon Seo

Figure 1 for Cost-effective End-to-end Information Extraction for Semi-structured Document Images
Figure 2 for Cost-effective End-to-end Information Extraction for Semi-structured Document Images
Figure 3 for Cost-effective End-to-end Information Extraction for Semi-structured Document Images
Figure 4 for Cost-effective End-to-end Information Extraction for Semi-structured Document Images
Viaarxiv icon

Stochastic Neighbor Embedding of Multimodal Relational Data for Image-Text Simultaneous Visualization

May 02, 2020
Morihiro Mizutani, Akifumi Okuno, Geewook Kim, Hidetoshi Shimodaira

Figure 1 for Stochastic Neighbor Embedding of Multimodal Relational Data for Image-Text Simultaneous Visualization
Figure 2 for Stochastic Neighbor Embedding of Multimodal Relational Data for Image-Text Simultaneous Visualization
Figure 3 for Stochastic Neighbor Embedding of Multimodal Relational Data for Image-Text Simultaneous Visualization
Figure 4 for Stochastic Neighbor Embedding of Multimodal Relational Data for Image-Text Simultaneous Visualization
Viaarxiv icon

What is wrong with scene text recognition model comparisons? dataset and model analysis

Apr 03, 2019
Jeonghun Baek, Geewook Kim, Junyeop Lee, Sungrae Park, Dongyoon Han, Sangdoo Yun, Seong Joon Oh, Hwalsuk Lee

Figure 1 for What is wrong with scene text recognition model comparisons? dataset and model analysis
Figure 2 for What is wrong with scene text recognition model comparisons? dataset and model analysis
Figure 3 for What is wrong with scene text recognition model comparisons? dataset and model analysis
Figure 4 for What is wrong with scene text recognition model comparisons? dataset and model analysis
Viaarxiv icon