Picture for Yeqing Li

Yeqing Li

Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis

Add code
Jul 08, 2024
Viaarxiv icon

Improving Multi-Agent Debate with Sparse Communication Topology

Add code
Jun 17, 2024
Viaarxiv icon

Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models

Add code
May 27, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation

Add code
Dec 11, 2023
Figure 1 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Figure 2 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Figure 3 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Figure 4 for MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation
Viaarxiv icon

Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models

Add code
Jul 15, 2022
Figure 1 for Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Figure 2 for Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Figure 3 for Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Figure 4 for Multimodal Open-Vocabulary Video Classification via Pre-Trained Vision and Language Models
Viaarxiv icon

Exploring Temporal Granularity in Self-Supervised Video Representation Learning

Add code
Dec 08, 2021
Figure 1 for Exploring Temporal Granularity in Self-Supervised Video Representation Learning
Figure 2 for Exploring Temporal Granularity in Self-Supervised Video Representation Learning
Figure 3 for Exploring Temporal Granularity in Self-Supervised Video Representation Learning
Figure 4 for Exploring Temporal Granularity in Self-Supervised Video Representation Learning
Viaarxiv icon

Revisiting 3D ResNets for Video Recognition

Add code
Sep 03, 2021
Figure 1 for Revisiting 3D ResNets for Video Recognition
Figure 2 for Revisiting 3D ResNets for Video Recognition
Figure 3 for Revisiting 3D ResNets for Video Recognition
Figure 4 for Revisiting 3D ResNets for Video Recognition
Viaarxiv icon

High Resolution Medical Image Analysis with Spatial Partitioning

Add code
Sep 12, 2019
Figure 1 for High Resolution Medical Image Analysis with Spatial Partitioning
Figure 2 for High Resolution Medical Image Analysis with Spatial Partitioning
Figure 3 for High Resolution Medical Image Analysis with Spatial Partitioning
Viaarxiv icon

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

Add code
Apr 30, 2018
Figure 1 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Figure 2 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Figure 3 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Figure 4 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Viaarxiv icon