Lala Li

Guiding Image Captioning Models Toward More Specific Captions

Jul 31, 2023
Via arXiv

FIT: Far-reaching Interleaved Transformers

May 25, 2023
Via arXiv

A Generalist Framework for Panoptic Segmentation of Images and Videos

Oct 12, 2022
Via arXiv

A Unified Sequence Interface for Vision Tasks

Jun 15, 2022
Via arXiv

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

May 23, 2022
Via arXiv

Pix2seq: A Language Modeling Framework for Object Detection

Sep 22, 2021
Via arXiv

Intriguing Properties of Contrastive Losses

Nov 05, 2020
Via arXiv

Big Bidirectional Insertion Representations for Documents

Oct 29, 2019
Via arXiv

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model

Jul 09, 2019
Via arXiv