Zhiwei Jia

KAFA: Rethinking Image Ad Understanding with Knowledge-Augmented Feature Adaptation of Vision-Language Models

May 28, 2023
Zhiwei Jia, Pradyumna Narayana, Arjun R. Akula, Garima Pruthi, Hao Su, Sugato Basu, Varun Jampani

Via arXiv

Chain-of-Thought Predictive Control

Apr 03, 2023
Zhiwei Jia, Fangchen Liu, Vineet Thumuluri, Linghao Chen, Zhiao Huang, Hao Su

Via arXiv

MetaCLUE: Towards Comprehensive Visual Metaphors Research

Dec 19, 2022
Arjun R. Akula, Brendan Driscoll, Pradyumna Narayana, Soravit Changpinyo, Zhiwei Jia, Suyash Damle, Garima Pruthi, Sugato Basu, Leonidas Guibas, William T. Freeman, Yuanzhen Li, Varun Jampani

Via arXiv

Improving Policy Optimization with Generalist-Specialist Learning

Jun 26, 2022
Zhiwei Jia, Xuanlin Li, Zhan Ling, Shuang Liu, Yiran Wu, Hao Su

Via arXiv

Learning to Act with Affordance-Aware Multimodal Neural SLAM

Feb 04, 2022
Zhiwei Jia, Kaixiang Lin, Yizhou Zhao, Qiaozi Gao, Govind Thattai, Gaurav Sukhatme

Via arXiv

TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance

Nov 16, 2021
Yue Tao, Zhiwei Jia, Runze Ma, Shugong Xu

Via arXiv

LUMINOUS: Indoor Scene Generation for Embodied AI Challenges

Nov 10, 2021
Yizhou Zhao, Kaixiang Lin, Zhiwei Jia, Qiaozi Gao, Govind Thattai, Jesse Thomason, Gaurav S. Sukhatme

Via arXiv

IFR: Iterative Fusion Based Recognizer For Low Quality Scene Text Recognition

Aug 13, 2021
Zhiwei Jia, Shugong Xu, Shiyi Mu, Yue Tao, Shan Cao, Zhiyong Chen

Via arXiv

ManiSkill: Learning-from-Demonstrations Benchmark for Generalizable Manipulation Skills

Aug 09, 2021
Tongzhou Mu, Zhan Ling, Fanbo Xiang, Derek Yang, Xuanlin Li, Stone Tao, Zhiao Huang, Zhiwei Jia, Hao Su

Via arXiv