Zhiwu Lu

CoTBal: Comprehensive Task Balancing for Multi-Task Visual Instruction Tuning

Mar 07, 2024
Yanqi Dai, Dong Jing, Nanyi Fei, Zhiwu Lu

Improvable Gap Balancing for Multi-Task Learning

Jul 28, 2023
Yanqi Dai, Nanyi Fei, Zhiwu Lu

VDT: An Empirical Study on Video Diffusion with Transformers

May 22, 2023
Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding

UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling

Feb 13, 2023
Haoyu Lu, Mingyu Ding, Yuqi Huo, Guoxing Yang, Zhiwu Lu, Masayoshi Tomizuka, Wei Zhan

TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat

Jan 14, 2023
Hongpeng Lin, Ludan Ruan, Wenke Xia, Peiyu Liu, Jingyuan Wen, Yixin Xu, Di Hu, Ruihua Song, Wayne Xin Zhao, Qin Jin, Zhiwu Lu

Text2Poster: Laying out Stylized Texts on Retrieved Images

Jan 06, 2023
Chuhao Jin, Hongteng Xu, Ruihua Song, Zhiwu Lu

LGDN: Language-Guided Denoising Network for Video-Language Modeling

Oct 03, 2022
Haoyu Lu, Mingyu Ding, Nanyi Fei, Yuqi Huo, Zhiwu Lu

A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language

Sep 12, 2022
Bing Su, Dazhao Du, Zhao Yang, Yujie Zhou, Jiangmeng Li, Anyi Rao, Hao Sun, Zhiwu Lu, Ji-Rong Wen

Multimodal foundation models are better simulators of the human brain

Aug 17, 2022
Haoyu Lu, Qiongyi Zhou, Nanyi Fei, Zhiwu Lu, Mingyu Ding, Jingyuan Wen, Changde Du, Xin Zhao, Hao Sun, Huiguang He, Ji-Rong Wen
