Picture for Noriyuki Kojima

Noriyuki Kojima

A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models

Add code
Sep 06, 2023
Figure 1 for A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models
Figure 2 for A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models
Figure 3 for A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models
Figure 4 for A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models
Viaarxiv icon

Abstract Visual Reasoning with Tangram Shapes

Nov 29, 2022
Figure 1 for Abstract Visual Reasoning with Tangram Shapes
Figure 2 for Abstract Visual Reasoning with Tangram Shapes
Figure 3 for Abstract Visual Reasoning with Tangram Shapes
Figure 4 for Abstract Visual Reasoning with Tangram Shapes
Viaarxiv icon

lilGym: Natural Language Visual Reasoning with Reinforcement Learning

Nov 03, 2022
Figure 1 for lilGym: Natural Language Visual Reasoning with Reinforcement Learning
Figure 2 for lilGym: Natural Language Visual Reasoning with Reinforcement Learning
Figure 3 for lilGym: Natural Language Visual Reasoning with Reinforcement Learning
Figure 4 for lilGym: Natural Language Visual Reasoning with Reinforcement Learning
Viaarxiv icon

Markup-to-Image Diffusion Models with Scheduled Sampling

Add code
Oct 11, 2022
Figure 1 for Markup-to-Image Diffusion Models with Scheduled Sampling
Figure 2 for Markup-to-Image Diffusion Models with Scheduled Sampling
Figure 3 for Markup-to-Image Diffusion Models with Scheduled Sampling
Figure 4 for Markup-to-Image Diffusion Models with Scheduled Sampling
Viaarxiv icon

Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior

Aug 10, 2021
Figure 1 for Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior
Figure 2 for Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior
Figure 3 for Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior
Figure 4 for Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior
Viaarxiv icon

OASIS: A Large-Scale Dataset for Single Image 3D in the Wild

Jul 26, 2020
Figure 1 for OASIS: A Large-Scale Dataset for Single Image 3D in the Wild
Figure 2 for OASIS: A Large-Scale Dataset for Single Image 3D in the Wild
Figure 3 for OASIS: A Large-Scale Dataset for Single Image 3D in the Wild
Figure 4 for OASIS: A Large-Scale Dataset for Single Image 3D in the Wild
Viaarxiv icon

What is Learned in Visually Grounded Neural Syntax Acquisition

Add code
May 18, 2020
Figure 1 for What is Learned in Visually Grounded Neural Syntax Acquisition
Figure 2 for What is Learned in Visually Grounded Neural Syntax Acquisition
Figure 3 for What is Learned in Visually Grounded Neural Syntax Acquisition
Figure 4 for What is Learned in Visually Grounded Neural Syntax Acquisition
Viaarxiv icon

To Learn or Not to Learn: Analyzing the Role of Learning for Navigation in Virtual Environments

Add code
Jul 26, 2019
Figure 1 for To Learn or Not to Learn: Analyzing the Role of Learning for Navigation in Virtual Environments
Figure 2 for To Learn or Not to Learn: Analyzing the Role of Learning for Navigation in Virtual Environments
Figure 3 for To Learn or Not to Learn: Analyzing the Role of Learning for Navigation in Virtual Environments
Figure 4 for To Learn or Not to Learn: Analyzing the Role of Learning for Navigation in Virtual Environments
Viaarxiv icon

Speaker Naming in Movies

Add code
Sep 24, 2018
Figure 1 for Speaker Naming in Movies
Figure 2 for Speaker Naming in Movies
Figure 3 for Speaker Naming in Movies
Figure 4 for Speaker Naming in Movies
Viaarxiv icon