Alert button
Picture for Rui Zhao

Rui Zhao

Alert button

Link-Context Learning for Multimodal LLMs

Aug 15, 2023
Yan Tai, Weichen Fan, Zhao Zhang, Feng Zhu, Rui Zhao, Ziwei Liu

Figure 1 for Link-Context Learning for Multimodal LLMs
Figure 2 for Link-Context Learning for Multimodal LLMs
Figure 3 for Link-Context Learning for Multimodal LLMs
Figure 4 for Link-Context Learning for Multimodal LLMs
Viaarxiv icon

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models

Aug 11, 2023
Weijia Wu, Yuzhong Zhao, Hao Chen, Yuchao Gu, Rui Zhao, Yefei He, Hong Zhou, Mike Zheng Shou, Chunhua Shen

Figure 1 for DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
Figure 2 for DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
Figure 3 for DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
Figure 4 for DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
Viaarxiv icon

Zero-shot Text-driven Physically Interpretable Face Editing

Aug 11, 2023
Yapeng Meng, Songru Yang, Xu Hu, Rui Zhao, Lincheng Li, Zhenwei Shi, Zhengxia Zou

Figure 1 for Zero-shot Text-driven Physically Interpretable Face Editing
Figure 2 for Zero-shot Text-driven Physically Interpretable Face Editing
Figure 3 for Zero-shot Text-driven Physically Interpretable Face Editing
Figure 4 for Zero-shot Text-driven Physically Interpretable Face Editing
Viaarxiv icon

TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents

Aug 07, 2023
Jingqing Ruan, Yihong Chen, Bin Zhang, Zhiwei Xu, Tianpeng Bao, Guoqing Du, Shiwei Shi, Hangyu Mao, Xingyu Zeng, Rui Zhao

Figure 1 for TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents
Figure 2 for TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents
Figure 3 for TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents
Figure 4 for TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents
Viaarxiv icon

Relation-Aware Distribution Representation Network for Person Clustering with Multiple Modalities

Aug 01, 2023
Kaijian Liu, Shixiang Tang, Ziyue Li, Zhishuai Li, Lei Bai, Feng Zhu, Rui Zhao

Figure 1 for Relation-Aware Distribution Representation Network for Person Clustering with Multiple Modalities
Figure 2 for Relation-Aware Distribution Representation Network for Person Clustering with Multiple Modalities
Figure 3 for Relation-Aware Distribution Representation Network for Person Clustering with Multiple Modalities
Figure 4 for Relation-Aware Distribution Representation Network for Person Clustering with Multiple Modalities
Viaarxiv icon

Exposing the Troublemakers in Described Object Detection

Jul 24, 2023
Chi Xie, Zhao Zhang, Yixuan Wu, Feng Zhu, Rui Zhao, Shuang Liang

Figure 1 for Exposing the Troublemakers in Described Object Detection
Figure 2 for Exposing the Troublemakers in Described Object Detection
Figure 3 for Exposing the Troublemakers in Described Object Detection
Figure 4 for Exposing the Troublemakers in Described Object Detection
Viaarxiv icon

Unsupervised Optical Flow Estimation with Dynamic Timing Representation for Spike Camera

Jul 12, 2023
Lujie Xia, Ziluo Ding, Rui Zhao, Jiyuan Zhang, Lei Ma, Zhaofei Yu, Tiejun Huang, Ruiqin Xiong

Figure 1 for Unsupervised Optical Flow Estimation with Dynamic Timing Representation for Spike Camera
Figure 2 for Unsupervised Optical Flow Estimation with Dynamic Timing Representation for Spike Camera
Figure 3 for Unsupervised Optical Flow Estimation with Dynamic Timing Representation for Spike Camera
Figure 4 for Unsupervised Optical Flow Estimation with Dynamic Timing Representation for Spike Camera
Viaarxiv icon

Retrieve Anyone: A General-purpose Person Re-identification Task with Instructions

Jul 04, 2023
Weizhen He, Shixiang Tang, Yiheng Deng, Qihao Chen, Qingsong Xie, Yizhou Wang, Lei Bai, Feng Zhu, Rui Zhao, Wanli Ouyang, Donglian Qi, Yunfeng Yan

Figure 1 for Retrieve Anyone: A General-purpose Person Re-identification Task with Instructions
Figure 2 for Retrieve Anyone: A General-purpose Person Re-identification Task with Instructions
Figure 3 for Retrieve Anyone: A General-purpose Person Re-identification Task with Instructions
Figure 4 for Retrieve Anyone: A General-purpose Person Re-identification Task with Instructions
Viaarxiv icon

Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic

Jul 03, 2023
Keqin Chen, Zhao Zhang, Weili Zeng, Richong Zhang, Feng Zhu, Rui Zhao

Figure 1 for Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic
Figure 2 for Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic
Figure 3 for Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic
Figure 4 for Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic
Viaarxiv icon