Ranjay Krishna

BLINK: Multimodal Large Language Models Can See but Not Perceive

Apr 18, 2024
Xingyu Fu, Yushi Hu, Bangzheng Li, Yu Feng, Haoyu Wang, Xudong Lin, Dan Roth, Noah A. Smith, Wei-Chiu Ma, Ranjay Krishna

Iterated Learning Improves Compositionality in Large Vision-Language Models

Apr 17, 2024
Chenhao Zheng, Jieyu Zhang, Aniruddha Kembhavi, Ranjay Krishna

EVE: Enabling Anyone to Train Robot using Augmented Reality

Apr 09, 2024
Jun Wang, Chun-Cheng Chang, Jiafei Duan, Dieter Fox, Ranjay Krishna

Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion

Mar 22, 2024
Xiang Fan, Anand Bhattad, Ranjay Krishna

m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks

Mar 21, 2024
Zixian Ma, Weikai Huang, Jieyu Zhang, Tanmay Gupta, Ranjay Krishna

Modeling Collaborator: Enabling Subjective Vision Classification With Minimal Human Effort via LLM Tool-Use

Mar 05, 2024
Imad Eddine Toubal, Aditya Avinash, Neil Gordon Alldrin, Jan Dlabal, Wenlei Zhou, Enming Luo, Otilia Stretcu, Hao Xiong, Chun-Ta Lu, Howard Zhou, Ranjay Krishna, Ariel Fuxman, Tom Duerig

Training Language Model Agents without Modifying Language Models

Feb 17, 2024
Shaokun Zhang, Jieyu Zhang, Jiale Liu, Linxin Song, Chi Wang, Ranjay Krishna, Qingyun Wu

THE COLOSSEUM: A Benchmark for Evaluating Generalization for Robotic Manipulation

Feb 13, 2024
Wilbert Pumacay, Ishika Singh, Jiafei Duan, Ranjay Krishna, Jesse Thomason, Dieter Fox
