Alert button
Picture for Yuankai Qi

Yuankai Qi

Alert button

Hierarchical Modular Network for Video Captioning

Add code
Bookmark button
Alert button
Nov 24, 2021
Hanhua Ye, Guorong Li, Yuankai Qi, Shuhui Wang, Qingming Huang, Ming-Hsuan Yang

Figure 1 for Hierarchical Modular Network for Video Captioning
Figure 2 for Hierarchical Modular Network for Video Captioning
Figure 3 for Hierarchical Modular Network for Video Captioning
Figure 4 for Hierarchical Modular Network for Video Captioning
Viaarxiv icon

Neighbor-view Enhanced Model for Vision and Language Navigation

Add code
Bookmark button
Alert button
Jul 24, 2021
Dong An, Yuankai Qi, Yan Huang, Qi Wu, Liang Wang, Tieniu Tan

Figure 1 for Neighbor-view Enhanced Model for Vision and Language Navigation
Figure 2 for Neighbor-view Enhanced Model for Vision and Language Navigation
Figure 3 for Neighbor-view Enhanced Model for Vision and Language Navigation
Figure 4 for Neighbor-view Enhanced Model for Vision and Language Navigation
Viaarxiv icon

Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation

Add code
Bookmark button
Alert button
Apr 09, 2021
Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu

Figure 1 for Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Figure 2 for Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Figure 3 for Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Figure 4 for Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Viaarxiv icon

Diagnosing Vision-and-Language Navigation: What Really Matters

Add code
Bookmark button
Alert button
Mar 30, 2021
Wanrong Zhu, Yuankai Qi, Pradyumna Narayana, Kazoo Sone, Sugato Basu, Xin Eric Wang, Qi Wu, Miguel Eckstein, William Yang Wang

Figure 1 for Diagnosing Vision-and-Language Navigation: What Really Matters
Figure 2 for Diagnosing Vision-and-Language Navigation: What Really Matters
Figure 3 for Diagnosing Vision-and-Language Navigation: What Really Matters
Figure 4 for Diagnosing Vision-and-Language Navigation: What Really Matters
Viaarxiv icon

A Recurrent Vision-and-Language BERT for Navigation

Add code
Bookmark button
Alert button
Nov 26, 2020
Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez-Opazo, Stephen Gould

Figure 1 for A Recurrent Vision-and-Language BERT for Navigation
Figure 2 for A Recurrent Vision-and-Language BERT for Navigation
Figure 3 for A Recurrent Vision-and-Language BERT for Navigation
Figure 4 for A Recurrent Vision-and-Language BERT for Navigation
Viaarxiv icon

Language and Visual Entity Relationship Graph for Agent Navigation

Add code
Bookmark button
Alert button
Oct 19, 2020
Yicong Hong, Cristian Rodriguez-Opazo, Yuankai Qi, Qi Wu, Stephen Gould

Figure 1 for Language and Visual Entity Relationship Graph for Agent Navigation
Figure 2 for Language and Visual Entity Relationship Graph for Agent Navigation
Figure 3 for Language and Visual Entity Relationship Graph for Agent Navigation
Figure 4 for Language and Visual Entity Relationship Graph for Agent Navigation
Viaarxiv icon

Object-and-Action Aware Model for Visual Language Navigation

Add code
Bookmark button
Alert button
Jul 29, 2020
Yuankai Qi, Zizheng Pan, Shengping Zhang, Anton van den Hengel, Qi Wu

Figure 1 for Object-and-Action Aware Model for Visual Language Navigation
Figure 2 for Object-and-Action Aware Model for Visual Language Navigation
Figure 3 for Object-and-Action Aware Model for Visual Language Navigation
Figure 4 for Object-and-Action Aware Model for Visual Language Navigation
Viaarxiv icon

Scene Text Recognition via Transformer

Add code
Bookmark button
Alert button
Apr 10, 2020
Xinjie Feng, Hongxun Yao, Yuankai Qi, Jun Zhang, Shengping Zhang

Figure 1 for Scene Text Recognition via Transformer
Figure 2 for Scene Text Recognition via Transformer
Figure 3 for Scene Text Recognition via Transformer
Figure 4 for Scene Text Recognition via Transformer
Viaarxiv icon