Animesh Sinha

Gen2Det: Generate to Detect

Dec 07, 2023
Saksham Suri, Fanyi Xiao, Animesh Sinha, Sean Chang Culatana, Raghuraman Krishnamoorthi, Chenchen Zhu, Abhinav Shrivastava

Via arXiv

GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation

Dec 07, 2023
Shoufa Chen, Mengmeng Xu, Jiawei Ren, Yuren Cong, Sen He, Yanping Xie, Animesh Sinha, Ping Luo, Tao Xiang, Juan-Manuel Perez-Rua

Via arXiv

Context Diffusion: In-Context Aware Image Generation

Dec 06, 2023
Ivona Najdenkoska, Animesh Sinha, Abhimanyu Dubey, Dhruv Mahajan, Vignesh Ramanathan, Filip Radenovic

Via arXiv

Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression

Nov 17, 2023
Animesh Sinha, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy Bearman, Dhruv Mahajan

Via arXiv

FaD-VLP: Fashion Vision-and-Language Pre-training towards Unified Retrieval and Captioning

Oct 26, 2022
Suvir Mirchandani, Licheng Yu, Mengjiao Wang, Animesh Sinha, Wenwen Jiang, Tao Xiang, Ning Zhang

Via arXiv

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval

Feb 15, 2022
Licheng Yu, Jun Chen, Animesh Sinha, Mengjiao MJ Wang, Hugo Chen, Tamara L. Berg, Ning Zhang

Via arXiv

Large-Scale Attribute-Object Compositions

May 24, 2021
Filip Radenovic, Animesh Sinha, Albert Gordo, Tamara Berg, Dhruv Mahajan

Via arXiv

Qubit Routing using Graph Neural Network aided Monte Carlo Tree Search

Apr 01, 2021
Animesh Sinha, Utkarsh Azad, Harjinder Singh

Via arXiv