Alert button
Picture for Ishan Misra

Ishan Misra

Alert button

InstanceDiffusion: Instance-level Control for Image Generation

Feb 05, 2024
Xudong Wang, Trevor Darrell, Sai Saketh Rambhatla, Rohit Girdhar, Ishan Misra

Viaarxiv icon

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis

Dec 29, 2023
Feng Liang, Bichen Wu, Jialiang Wang, Licheng Yu, Kunpeng Li, Yinan Zhao, Ishan Misra, Jia-Bin Huang, Peizhao Zhang, Peter Vajda, Diana Marculescu

Viaarxiv icon

Generating Illustrated Instructions

Dec 07, 2023
Sachit Menon, Ishan Misra, Rohit Girdhar

Viaarxiv icon

On Bringing Robots Home

Nov 27, 2023
Nur Muhammad Mahi Shafiullah, Anant Rai, Haritheja Etukuru, Yiqian Liu, Ishan Misra, Soumith Chintala, Lerrel Pinto

Viaarxiv icon

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

Nov 17, 2023
Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra

Viaarxiv icon

SelfEval: Leveraging the discriminative nature of generative models for evaluation

Nov 17, 2023
Sai Saketh Rambhatla, Ishan Misra

Viaarxiv icon

VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation

Aug 28, 2023
Xudong Wang, Ishan Misra, Ziyun Zeng, Rohit Girdhar, Trevor Darrell

Figure 1 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Figure 2 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Figure 3 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Figure 4 for VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation
Viaarxiv icon

GeneCIS: A Benchmark for General Conditional Image Similarity

Jun 13, 2023
Sagar Vaze, Nicolas Carion, Ishan Misra

Figure 1 for GeneCIS: A Benchmark for General Conditional Image Similarity
Figure 2 for GeneCIS: A Benchmark for General Conditional Image Similarity
Figure 3 for GeneCIS: A Benchmark for General Conditional Image Similarity
Figure 4 for GeneCIS: A Benchmark for General Conditional Image Similarity
Viaarxiv icon

ImageBind: One Embedding Space To Bind Them All

May 09, 2023
Rohit Girdhar, Alaaeldin El-Nouby, Zhuang Liu, Mannat Singh, Kalyan Vasudev Alwala, Armand Joulin, Ishan Misra

Figure 1 for ImageBind: One Embedding Space To Bind Them All
Figure 2 for ImageBind: One Embedding Space To Bind Them All
Figure 3 for ImageBind: One Embedding Space To Bind Them All
Figure 4 for ImageBind: One Embedding Space To Bind Them All
Viaarxiv icon

DINOv2: Learning Robust Visual Features without Supervision

Apr 14, 2023
Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mahmoud Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jegou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski

Figure 1 for DINOv2: Learning Robust Visual Features without Supervision
Figure 2 for DINOv2: Learning Robust Visual Features without Supervision
Figure 3 for DINOv2: Learning Robust Visual Features without Supervision
Figure 4 for DINOv2: Learning Robust Visual Features without Supervision
Viaarxiv icon