Picture for Lala Li

Lala Li

Controlling Space and Time with Diffusion Models

Add code
Jul 10, 2024
Viaarxiv icon

Guiding Image Captioning Models Toward More Specific Captions

Add code
Jul 31, 2023
Figure 1 for Guiding Image Captioning Models Toward More Specific Captions
Figure 2 for Guiding Image Captioning Models Toward More Specific Captions
Figure 3 for Guiding Image Captioning Models Toward More Specific Captions
Figure 4 for Guiding Image Captioning Models Toward More Specific Captions
Viaarxiv icon

FIT: Far-reaching Interleaved Transformers

Add code
May 25, 2023
Figure 1 for FIT: Far-reaching Interleaved Transformers
Figure 2 for FIT: Far-reaching Interleaved Transformers
Figure 3 for FIT: Far-reaching Interleaved Transformers
Figure 4 for FIT: Far-reaching Interleaved Transformers
Viaarxiv icon

A Generalist Framework for Panoptic Segmentation of Images and Videos

Add code
Oct 12, 2022
Figure 1 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Figure 2 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Figure 3 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Figure 4 for A Generalist Framework for Panoptic Segmentation of Images and Videos
Viaarxiv icon

A Unified Sequence Interface for Vision Tasks

Add code
Jun 15, 2022
Figure 1 for A Unified Sequence Interface for Vision Tasks
Figure 2 for A Unified Sequence Interface for Vision Tasks
Figure 3 for A Unified Sequence Interface for Vision Tasks
Figure 4 for A Unified Sequence Interface for Vision Tasks
Viaarxiv icon

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Add code
May 23, 2022
Figure 1 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Figure 2 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Figure 3 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Figure 4 for Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Viaarxiv icon

Pix2seq: A Language Modeling Framework for Object Detection

Add code
Sep 22, 2021
Figure 1 for Pix2seq: A Language Modeling Framework for Object Detection
Figure 2 for Pix2seq: A Language Modeling Framework for Object Detection
Figure 3 for Pix2seq: A Language Modeling Framework for Object Detection
Figure 4 for Pix2seq: A Language Modeling Framework for Object Detection
Viaarxiv icon

Intriguing Properties of Contrastive Losses

Add code
Nov 05, 2020
Figure 1 for Intriguing Properties of Contrastive Losses
Figure 2 for Intriguing Properties of Contrastive Losses
Figure 3 for Intriguing Properties of Contrastive Losses
Figure 4 for Intriguing Properties of Contrastive Losses
Viaarxiv icon

Big Bidirectional Insertion Representations for Documents

Add code
Oct 29, 2019
Figure 1 for Big Bidirectional Insertion Representations for Documents
Figure 2 for Big Bidirectional Insertion Representations for Documents
Figure 3 for Big Bidirectional Insertion Representations for Documents
Viaarxiv icon

Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model

Add code
Jul 09, 2019
Figure 1 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 2 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 3 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Figure 4 for Which Algorithmic Choices Matter at Which Batch Sizes? Insights From a Noisy Quadratic Model
Viaarxiv icon