Ishan Misra

InstanceDiffusion: Instance-level Control for Image Generation

Feb 05, 2024

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis

Dec 29, 2023

Generating Illustrated Instructions

Dec 07, 2023

On Bringing Robots Home

Nov 27, 2023

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

Nov 17, 2023

SelfEval: Leveraging the discriminative nature of generative models for evaluation

Nov 17, 2023

VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation

Aug 28, 2023

GeneCIS: A Benchmark for General Conditional Image Similarity

Jun 13, 2023

ImageBind: One Embedding Space To Bind Them All

May 09, 2023

DINOv2: Learning Robust Visual Features without Supervision

Apr 14, 2023