Xiaogang Wang

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

Dec 25, 2023
Wenhai Wang, Jiangwei Xie, ChuanYang Hu, Haoming Zou, Jianan Fan, Wenwen Tong, Yang Wen, Silei Wu, Hanming Deng, Zhiqi Li, Hao Tian, Lewei Lu, Xizhou Zhu, Xiaogang Wang, Yu Qiao, Jifeng Dai

Via arXiv

Cached Transformers: Improving Transformers with Differentiable Memory Cache

Dec 20, 2023
Zhaoyang Zhang, Wenqi Shao, Yixiao Ge, Xiaogang Wang, Jinwei Gu, Ping Luo

Via arXiv

Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft

Dec 14, 2023
Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie Zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai

Via arXiv

Digital Life Project: Autonomous 3D Characters with Social Intelligence

Dec 07, 2023
Zhongang Cai, Jianping Jiang, Zhongfei Qing, Xinying Guo, Mingyuan Zhang, Zhengyu Lin, Haiyi Mei, Chen Wei, Ruisi Wang, Wanqi Yin, Xiangyu Fan, Han Du, Liang Pan, Peng Gao, Zhitao Yang, Yang Gao, Jiaqi Li, Tianxiang Ren, Yukun Wei, Xiaogang Wang, Chen Change Loy, Lei Yang, Ziwei Liu

Via arXiv

CoNe: Contrast Your Neighbours for Supervised Image Classification

Aug 21, 2023
Mingkai Zheng, Shan You, Lang Huang, Xiu Su, Fei Wang, Chen Qian, Xiaogang Wang, Chang Xu

Via arXiv

PPI-NET: End-to-End Parametric Primitive Inference

Aug 03, 2023
Liang Wang, Xiaogang Wang

Via arXiv

ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process

Jun 08, 2023
Changyao Tian, Chenxin Tao, Jifeng Dai, Hao Li, Ziheng Li, Lewei Lu, Xiaogang Wang, Hongsheng Li, Gao Huang, Xizhou Zhu

Via arXiv

FlowFormer: A Transformer Architecture and Its Masked Cost Volume Autoencoding for Optical Flow

Jun 08, 2023
Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Yijin Li, Hongwei Qin, Jifeng Dai, Xiaogang Wang, Hongsheng Li

Via arXiv

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory

Jun 01, 2023
Xizhou Zhu, Yuntao Chen, Hao Tian, Chenxin Tao, Weijie Su, Chenyu Yang, Gao Huang, Bin Li, Lewei Lu, Xiaogang Wang, Yu Qiao, Zhaoxiang Zhang, Jifeng Dai

Via arXiv

A Unified Conditional Framework for Diffusion-based Image Restoration

May 31, 2023
Yi Zhang, Xiaoyu Shi, Dasong Li, Xiaogang Wang, Jian Wang, Hongsheng Li

Via arXiv