Yu Qiao

DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models

Oct 12, 2023
Licheng Wen, Daocheng Fu, Xin Li, Xinyu Cai, Tao Ma, Pinlong Cai, Min Dou, Botian Shi, Liang He, Yu Qiao

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

Oct 12, 2023
Zeqiang Lai, Xizhou Zhu, Jifeng Dai, Yu Qiao, Wenhai Wang

ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation

Oct 11, 2023
Bo Peng, Xinyuan Chen, Yaohui Wang, Chaochao Lu, Yu Qiao

REVO-LION: Evaluating and Refining Vision-Language Instruction Tuning Datasets

Oct 10, 2023
Ning Liao, Shaofeng Zhang, Renqiu Xia, Bo Zhang, Min Cao, Yu Qiao, Junchi Yan

Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face

Oct 10, 2023
Hao Zhang, Kaipeng Zhang, Lumin Xu, Shenqi Lai, Wenqi Shao, Nanning Zheng, Ping Luo, Yu Qiao

Beyond One-Preference-for-All: Multi-Objective Direct Preference Optimization

Oct 05, 2023
Zhanhui Zhou, Jie Liu, Chao Yang, Jing Shao, Yu Liu, Xiangyu Yue, Wanli Ouyang, Yu Qiao

Exploring Counterfactual Alignment Loss towards Human-centered AI

Oct 03, 2023
Mingzhou Liu, Xinwei Sun, Ching-Wen Lee, Yu Qiao, Yizhou Wang

InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition

Sep 29, 2023
Pan Zhang, Xiaoyi Dong, Bin Wang, Yuhang Cao, Chao Xu, Linke Ouyang, Zhiyuan Zhao, Shuangrui Ding, Songyang Zhang, Haodong Duan, Hang Yan, Xinyue Zhang, Wei Li, Jingwen Li, Kai Chen, Conghui He, Xingcheng Zhang, Yu Qiao, Dahua Lin, Jiaqi Wang

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models

Sep 27, 2023
Yaohui Wang, Xinyuan Chen, Xin Ma, Shangchen Zhou, Ziqi Huang, Yi Wang, Ceyuan Yang, Yinan He, Jiashuo Yu, Peiqing Yang, Yuwei Guo, Tianxing Wu, Chenyang Si, Yuming Jiang, Cunjian Chen, Chen Change Loy, Bo Dai, Dahua Lin, Yu Qiao, Ziwei Liu
