Alert button
Picture for Jialin Wu

Jialin Wu

Alert button

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

Add code
Bookmark button
Alert button
Jul 28, 2023
Anthony Brohan, Noah Brown, Justice Carbajal, Yevgen Chebotar, Xi Chen, Krzysztof Choromanski, Tianli Ding, Danny Driess, Avinava Dubey, Chelsea Finn, Pete Florence, Chuyuan Fu, Montse Gonzalez Arenas, Keerthana Gopalakrishnan, Kehang Han, Karol Hausman, Alexander Herzog, Jasmine Hsu, Brian Ichter, Alex Irpan, Nikhil Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Isabel Leal, Lisa Lee, Tsang-Wei Edward Lee, Sergey Levine, Yao Lu, Henryk Michalewski, Igor Mordatch, Karl Pertsch, Kanishka Rao, Krista Reymann, Michael Ryoo, Grecia Salazar, Pannag Sanketi, Pierre Sermanet, Jaspiar Singh, Anikait Singh, Radu Soricut, Huong Tran, Vincent Vanhoucke, Quan Vuong, Ayzaan Wahid, Stefan Welker, Paul Wohlhart, Jialin Wu, Fei Xia, Ted Xiao, Peng Xu, Sichun Xu, Tianhe Yu, Brianna Zitkovich

Figure 1 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Figure 2 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Figure 3 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Figure 4 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Viaarxiv icon

PaLI-X: On Scaling up a Multilingual Vision and Language Model

Add code
Bookmark button
Alert button
May 29, 2023
Xi Chen, Josip Djolonga, Piotr Padlewski, Basil Mustafa, Soravit Changpinyo, Jialin Wu, Carlos Riquelme Ruiz, Sebastian Goodman, Xiao Wang, Yi Tay, Siamak Shakeri, Mostafa Dehghani, Daniel Salz, Mario Lucic, Michael Tschannen, Arsha Nagrani, Hexiang Hu, Mandar Joshi, Bo Pang, Ceslee Montgomery, Paulina Pietrzyk, Marvin Ritter, AJ Piergiovanni, Matthias Minderer, Filip Pavetic, Austin Waters, Gang Li, Ibrahim Alabdulmohsin, Lucas Beyer, Julien Amelot, Kenton Lee, Andreas Peter Steiner, Yang Li, Daniel Keysers, Anurag Arnab, Yuanzhong Xu, Keran Rong, Alexander Kolesnikov, Mojtaba Seyedhosseini, Anelia Angelova, Xiaohua Zhai, Neil Houlsby, Radu Soricut

Figure 1 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 2 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 3 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Figure 4 for PaLI-X: On Scaling up a Multilingual Vision and Language Model
Viaarxiv icon

Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering

Add code
Bookmark button
Alert button
Oct 18, 2022
Jialin Wu, Raymond J. Mooney

Figure 1 for Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering
Figure 2 for Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering
Figure 3 for Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering
Figure 4 for Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering
Viaarxiv icon

Multi-Modal Answer Validation for Knowledge-Based VQA

Add code
Bookmark button
Alert button
Mar 23, 2021
Jialin Wu, Jiasen Lu, Ashish Sabharwal, Roozbeh Mottaghi

Figure 1 for Multi-Modal Answer Validation for Knowledge-Based VQA
Figure 2 for Multi-Modal Answer Validation for Knowledge-Based VQA
Figure 3 for Multi-Modal Answer Validation for Knowledge-Based VQA
Figure 4 for Multi-Modal Answer Validation for Knowledge-Based VQA
Viaarxiv icon

Visual Question Answering based on Local-Scene-Aware Referring Expression Generation

Add code
Bookmark button
Alert button
Jan 22, 2021
Jung-Jun Kim, Dong-Gyu Lee, Jialin Wu, Hong-Gyu Jung, Seong-Whan Lee

Figure 1 for Visual Question Answering based on Local-Scene-Aware Referring Expression Generation
Figure 2 for Visual Question Answering based on Local-Scene-Aware Referring Expression Generation
Figure 3 for Visual Question Answering based on Local-Scene-Aware Referring Expression Generation
Figure 4 for Visual Question Answering based on Local-Scene-Aware Referring Expression Generation
Viaarxiv icon

Improving VQA and its Explanations \\ by Comparing Competing Explanations

Add code
Bookmark button
Alert button
Jun 28, 2020
Jialin Wu, Liyan Chen, Raymond J. Mooney

Figure 1 for Improving VQA and its Explanations \\ by Comparing Competing Explanations
Figure 2 for Improving VQA and its Explanations \\ by Comparing Competing Explanations
Figure 3 for Improving VQA and its Explanations \\ by Comparing Competing Explanations
Figure 4 for Improving VQA and its Explanations \\ by Comparing Competing Explanations
Viaarxiv icon

Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder

Add code
Bookmark button
Alert button
Oct 31, 2019
Jialin Wu, Raymond J. Mooney

Figure 1 for Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder
Figure 2 for Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder
Figure 3 for Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder
Figure 4 for Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder
Viaarxiv icon

Generating Question Relevant Captions to Aid Visual Question Answering

Add code
Bookmark button
Alert button
Jun 03, 2019
Jialin Wu, Zeyuan Hu, Raymond J. Mooney

Figure 1 for Generating Question Relevant Captions to Aid Visual Question Answering
Figure 2 for Generating Question Relevant Captions to Aid Visual Question Answering
Figure 3 for Generating Question Relevant Captions to Aid Visual Question Answering
Figure 4 for Generating Question Relevant Captions to Aid Visual Question Answering
Viaarxiv icon

Self-Critical Reasoning for Robust Visual Question Answering

Add code
Bookmark button
Alert button
May 24, 2019
Jialin Wu, Raymond J. Mooney

Figure 1 for Self-Critical Reasoning for Robust Visual Question Answering
Figure 2 for Self-Critical Reasoning for Robust Visual Question Answering
Figure 3 for Self-Critical Reasoning for Robust Visual Question Answering
Figure 4 for Self-Critical Reasoning for Robust Visual Question Answering
Viaarxiv icon

Image Score: How to Select Useful Samples

Add code
Bookmark button
Alert button
Dec 02, 2018
Simiao Zuo, Jialin Wu

Figure 1 for Image Score: How to Select Useful Samples
Figure 2 for Image Score: How to Select Useful Samples
Figure 3 for Image Score: How to Select Useful Samples
Figure 4 for Image Score: How to Select Useful Samples
Viaarxiv icon