Jifeng Dai

A Survey of Reasoning with Foundation Models

Dec 26, 2023
Jiankai Sun, Chuanyang Zheng, Enze Xie, Zhengying Liu, Ruihang Chu, Jianing Qiu, Jiaqi Xu, Mingyu Ding, Hongyang Li, Mengzhe Geng, Yue Wu, Wenhai Wang, Junsong Chen, Zhangyue Yin, Xiaozhe Ren, Jie Fu, Junxian He, Wu Yuan, Qi Liu, Xihui Liu, Yu Li, Hao Dong, Yu Cheng, Ming Zhang, Pheng Ann Heng, Jifeng Dai, Ping Luo, Jingdong Wang, Ji-Rong Wen, Xipeng Qiu, Yike Guo, Hui Xiong, Qun Liu, Zhenguo Li

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

Dec 25, 2023
Wenhai Wang, Jiangwei Xie, ChuanYang Hu, Haoming Zou, Jianan Fan, Wenwen Tong, Yang Wen, Silei Wu, Hanming Deng, Zhiqi Li, Hao Tian, Lewei Lu, Xizhou Zhu, Xiaogang Wang, Yu Qiao, Jifeng Dai

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Dec 21, 2023
Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Zhong Muyan, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai

Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft

Dec 14, 2023
Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie Zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai

InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation

Nov 30, 2023
Rongyao Fang, Shilin Yan, Zhaoyang Huang, Jingqiu Zhou, Hao Tian, Jifeng Dai, Hongsheng Li

Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision

Nov 23, 2023
Yu Yi, Xue Yang, Qingyun Li, Feipeng Da, Junchi Yan, Jifeng Dai, Yu Qiao

ControlLLM: Augment Language Models with Tools by Searching on Graphs

Oct 30, 2023
Zhaoyang Liu, Zeqiang Lai, Zhangwei Gao, Erfei Cui, Zhiheng Li, Xizhou Zhu, Lewei Lu, Qifeng Chen, Yu Qiao, Jifeng Dai, Wenhai Wang

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

Oct 12, 2023
Zeqiang Lai, Xizhou Zhu, Jifeng Dai, Yu Qiao, Wenhai Wang

The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World

Aug 03, 2023
Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao

JourneyDB: A Benchmark for Generative Image Understanding

Jul 03, 2023
Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Hongsheng Li
