Yu Qiao

DreamDA: Generative Data Augmentation with Diffusion Models

Mar 19, 2024
Yunxiang Fu, Chaoqi Chen, Yu Qiao, Yizhou Yu

Via arXiv

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Mar 19, 2024
Enshen Zhou, Yiran Qin, Zhenfei Yin, Yuzhou Huang, Ruimao Zhang, Lu Sheng, Yu Qiao, Jing Shao

Via arXiv

Generalized Predictive Model for Autonomous Driving

Mar 14, 2024
Jiazhi Yang, Shenyuan Gao, Yihang Qiu, Li Chen, Tianyu Li, Bo Dai, Kashyap Chitta, Penghao Wu, Jia Zeng, Ping Luo, Jun Zhang, Andreas Geiger, Yu Qiao, Hongyang Li

Via arXiv

Exploring Safety Generalization Challenges of Large Language Models via Code

Mar 14, 2024
Qibing Ren, Chang Gao, Jing Shao, Junchi Yan, Xin Tan, Yu Qiao, Wai Lam, Lizhuang Ma

Via arXiv

AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Adversarial Visual-Instructions

Mar 14, 2024
Hao Zhang, Wenqi Shao, Hong Liu, Yongqiang Ma, Ping Luo, Yu Qiao, Kaipeng Zhang

Via arXiv

Desigen: A Pipeline for Controllable Design Template Generation

Mar 14, 2024
Haohan Weng, Danqing Huang, Yu Qiao, Zheng Hu, Chin-Yew Lin, Tong Zhang, C. L. Philip Chen

Via arXiv

VideoMamba: State Space Model for Efficient Video Understanding

Mar 12, 2024
Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao

Via arXiv

Towards Implicit Prompt For Text-To-Image Models

Mar 08, 2024
Yue Yang, Yuqi Lin, Hong Liu, Wenqi Shao, Runjian Chen, Hailong Shang, Yu Wang, Yu Qiao, Kaipeng Zhang, Ping Luo

Via arXiv

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Mar 07, 2024
Yuchen Duan, Weiyun Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Hongsheng Li, Jifeng Dai, Wenhai Wang

Via arXiv

Embodied Understanding of Driving Scenarios

Mar 07, 2024
Yunsong Zhou, Linyan Huang, Qingwen Bu, Jia Zeng, Tianyu Li, Hang Qiu, Hongzi Zhu, Minyi Guo, Yu Qiao, Hongyang Li

Via arXiv