Alert button
Picture for Kaizhi Zheng

Kaizhi Zheng

Alert button

MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens

Add code
Bookmark button
Alert button
Oct 05, 2023
Kaizhi Zheng, Xuehai He, Xin Eric Wang

Viaarxiv icon

R2H: Building Multimodal Navigation Helpers that Respond to Help

Add code
Bookmark button
Alert button
May 23, 2023
Yue Fan, Kaizhi Zheng, Jing Gu, Xin Eric Wang

Figure 1 for R2H: Building Multimodal Navigation Helpers that Respond to Help
Figure 2 for R2H: Building Multimodal Navigation Helpers that Respond to Help
Figure 3 for R2H: Building Multimodal Navigation Helpers that Respond to Help
Figure 4 for R2H: Building Multimodal Navigation Helpers that Respond to Help
Viaarxiv icon

ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation

Add code
Bookmark button
Alert button
Jan 30, 2023
Kaiwen Zhou, Kaizhi Zheng, Connor Pryor, Yilin Shen, Hongxia Jin, Lise Getoor, Xin Eric Wang

Figure 1 for ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
Figure 2 for ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
Figure 3 for ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
Figure 4 for ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation
Viaarxiv icon

JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents

Add code
Bookmark button
Alert button
Aug 30, 2022
Kaizhi Zheng, Kaiwen Zhou, Jing Gu, Yue Fan, Jialu Wang, Zonglin Di, Xuehai He, Xin Eric Wang

Figure 1 for JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents
Figure 2 for JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents
Figure 3 for JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents
Figure 4 for JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents
Viaarxiv icon

VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation

Add code
Bookmark button
Alert button
Jun 17, 2022
Kaizhi Zheng, Xiaotong Chen, Odest Chadwicke Jenkins, Xin Eric Wang

Figure 1 for VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation
Figure 2 for VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation
Figure 3 for VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation
Figure 4 for VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation
Viaarxiv icon

Manipulation-Oriented Object Perception in Clutter through Affordance Coordinate Frames

Add code
Bookmark button
Alert button
Oct 16, 2020
Xiaotong Chen, Kaizhi Zheng, Zhen Zeng, Shreshtha Basu, James Cooney, Jana Pavlasek, Odest Chadwicke Jenkins

Figure 1 for Manipulation-Oriented Object Perception in Clutter through Affordance Coordinate Frames
Figure 2 for Manipulation-Oriented Object Perception in Clutter through Affordance Coordinate Frames
Figure 3 for Manipulation-Oriented Object Perception in Clutter through Affordance Coordinate Frames
Figure 4 for Manipulation-Oriented Object Perception in Clutter through Affordance Coordinate Frames
Viaarxiv icon