Agrim Gupta

A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation

May 22, 2024

Densify & Conquer: Densified, smaller base-stations can conquer the increasing carbon footprint problem in nextG wireless

Mar 20, 2024

VideoPoet: A Large Language Model for Zero-Shot Video Generation

Dec 21, 2023

Photorealistic Video Generation with Diffusion Models

Dec 11, 2023

Holistic Evaluation of Text-To-Image Models

Nov 07, 2023

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

Oct 09, 2023

RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation

Jun 20, 2023

Siamese Masked Autoencoders

May 23, 2023

GreenMO: Virtualized User-proportionate MIMO

Nov 29, 2022

VIMA: General Robot Manipulation with Multimodal Prompts

Oct 06, 2022