Picture for Yinfei Yang

Yinfei Yang

A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning

Add code
Oct 06, 2022
Figure 1 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Figure 2 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Figure 3 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Figure 4 for A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
Viaarxiv icon

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

Add code
Jun 22, 2022
Figure 1 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 2 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 3 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 4 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Viaarxiv icon

Simple and Effective Synthesis of Indoor 3D Scenes

Add code
Apr 06, 2022
Figure 1 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 2 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 3 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 4 for Simple and Effective Synthesis of Indoor 3D Scenes
Viaarxiv icon

LongT5: Efficient Text-To-Text Transformer for Long Sequences

Add code
Dec 15, 2021
Figure 1 for LongT5: Efficient Text-To-Text Transformer for Long Sequences
Figure 2 for LongT5: Efficient Text-To-Text Transformer for Long Sequences
Figure 3 for LongT5: Efficient Text-To-Text Transformer for Long Sequences
Figure 4 for LongT5: Efficient Text-To-Text Transformer for Long Sequences
Viaarxiv icon

Large Dual Encoders Are Generalizable Retrievers

Add code
Dec 15, 2021
Figure 1 for Large Dual Encoders Are Generalizable Retrievers
Figure 2 for Large Dual Encoders Are Generalizable Retrievers
Figure 3 for Large Dual Encoders Are Generalizable Retrievers
Figure 4 for Large Dual Encoders Are Generalizable Retrievers
Viaarxiv icon

MURAL: Multimodal, Multitask Retrieval Across Languages

Add code
Sep 10, 2021
Figure 1 for MURAL: Multimodal, Multitask Retrieval Across Languages
Figure 2 for MURAL: Multimodal, Multitask Retrieval Across Languages
Figure 3 for MURAL: Multimodal, Multitask Retrieval Across Languages
Figure 4 for MURAL: Multimodal, Multitask Retrieval Across Languages
Viaarxiv icon

A Simple and Effective Method To Eliminate the Self Language Bias in Multilingual Representations

Add code
Sep 10, 2021
Figure 1 for A Simple and Effective Method To Eliminate the Self Language Bias in Multilingual Representations
Figure 2 for A Simple and Effective Method To Eliminate the Self Language Bias in Multilingual Representations
Figure 3 for A Simple and Effective Method To Eliminate the Self Language Bias in Multilingual Representations
Figure 4 for A Simple and Effective Method To Eliminate the Self Language Bias in Multilingual Representations
Viaarxiv icon

Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models

Add code
Aug 26, 2021
Figure 1 for Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models
Figure 2 for Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models
Figure 3 for Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models
Figure 4 for Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models
Viaarxiv icon

Pathdreamer: A World Model for Indoor Navigation

Add code
May 18, 2021
Figure 1 for Pathdreamer: A World Model for Indoor Navigation
Figure 2 for Pathdreamer: A World Model for Indoor Navigation
Figure 3 for Pathdreamer: A World Model for Indoor Navigation
Figure 4 for Pathdreamer: A World Model for Indoor Navigation
Viaarxiv icon

Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision

Add code
Feb 11, 2021
Figure 1 for Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Figure 2 for Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Figure 3 for Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Figure 4 for Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Viaarxiv icon