Shizhe Chen

INRIA

Learning from Unlabeled 3D Environments for Vision-and-Language Navigation

Aug 24, 2022
Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev

Via arXiv

Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation

Feb 23, 2022
Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev

Via arXiv

History Aware Multimodal Transformer for Vision-and-Language Navigation

Oct 25, 2021
Shizhe Chen, Pierre-Louis Guhur, Cordelia Schmid, Ivan Laptev

Via arXiv

Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training

Aug 25, 2021
Yuqing Song, Shizhe Chen, Qin Jin, Wei Luo, Jun Xie, Fei Huang

Via arXiv

Airbert: In-domain Pretraining for Vision-and-Language Navigation

Aug 20, 2021
Pierre-Louis Guhur, Makarand Tapaswi, Shizhe Chen, Ivan Laptev, Cordelia Schmid

Via arXiv

Elaborative Rehearsal for Zero-shot Action Recognition

Aug 18, 2021
Shizhe Chen, Dong Huang

Via arXiv

Question-controlled Text-aware Image Captioning

Aug 04, 2021
Anwen Hu, Shizhe Chen, Qin Jin

Via arXiv

ICECAP: Information Concentrated Entity-aware Image Captioning

Aug 04, 2021
Anwen Hu, Shizhe Chen, Qin Jin

Via arXiv

Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object Localization

Jun 11, 2021
Ludan Ruan, Jieting Chen, Yuqing Song, Shizhe Chen, Qin Jin

Via arXiv