Picture for Ishan Misra

Ishan Misra

Jack

Human detectors are surprisingly powerful reward models

Add code
Jan 21, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Generating Multi-Image Synthetic Data for Text-to-Image Customization

Add code
Feb 03, 2025
Viaarxiv icon

Diffusion Autoencoders are Scalable Image Tokenizers

Add code
Jan 30, 2025
Figure 1 for Diffusion Autoencoders are Scalable Image Tokenizers
Figure 2 for Diffusion Autoencoders are Scalable Image Tokenizers
Figure 3 for Diffusion Autoencoders are Scalable Image Tokenizers
Figure 4 for Diffusion Autoencoders are Scalable Image Tokenizers
Viaarxiv icon

LLMs can see and hear without any training

Add code
Jan 30, 2025
Figure 1 for LLMs can see and hear without any training
Figure 2 for LLMs can see and hear without any training
Figure 3 for LLMs can see and hear without any training
Figure 4 for LLMs can see and hear without any training
Viaarxiv icon

CAT: Content-Adaptive Image Tokenization

Add code
Jan 06, 2025
Figure 1 for CAT: Content-Adaptive Image Tokenization
Figure 2 for CAT: Content-Adaptive Image Tokenization
Figure 3 for CAT: Content-Adaptive Image Tokenization
Figure 4 for CAT: Content-Adaptive Image Tokenization
Viaarxiv icon

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

InstanceDiffusion: Instance-level Control for Image Generation

Add code
Feb 05, 2024
Viaarxiv icon

FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis

Add code
Dec 29, 2023
Figure 1 for FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Figure 2 for FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Figure 3 for FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Figure 4 for FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis
Viaarxiv icon