Jihyung Kil

II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering

Feb 16, 2024
Jihyung Kil, Farideh Tavazoee, Dongyeop Kang, Joo-Kyung Kim

Via arXiv

Dual-View Visual Contextualization for Web Navigation

Feb 06, 2024
Jihyung Kil, Chan Hee Song, Boyuan Zheng, Xiang Deng, Yu Su, Wei-Lun Chao

Via arXiv

GPT-4V(ision) is a Generalist Web Agent, if Grounded

Jan 03, 2024
Boyuan Zheng, Boyu Gou, Jihyung Kil, Huan Sun, Yu Su

Via arXiv

PreSTU: Pre-Training for Scene-Text Understanding

Sep 12, 2022
Jihyung Kil, Soravit Changpinyo, Xi Chen, Hexiang Hu, Sebastian Goodman, Wei-Lun Chao, Radu Soricut

Via arXiv

One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones

Feb 14, 2022
Chan Hee Song, Jihyung Kil, Tai-Yu Pan, Brian M. Sadler, Wei-Lun Chao, Yu Su

Via arXiv

Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering

Sep 13, 2021
Jihyung Kil, Cheng Zhang, Dong Xuan, Wei-Lun Chao

Via arXiv

Revisiting Document Representations for Large-Scale Zero-Shot Learning

Apr 21, 2021
Jihyung Kil, Wei-Lun Chao

Via arXiv