Alert button
Picture for Dahua Lin

Dahua Lin

Alert button

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases

Dec 22, 2023
Zhangyang Qi, Ye Fang, Mengchen Zhang, Zeyi Sun, Tong Wu, Ziwei Liu, Dahua Lin, Jiaqi Wang, Hengshuang Zhao

Viaarxiv icon

T-Eval: Evaluating the Tool Utilization Capability Step by Step

Dec 21, 2023
Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, Miao Zheng, Jingming Zhuo, Songyang Zhang, Dahua Lin, Kai Chen, Feng Zhao

Viaarxiv icon

SceneWiz3D: Towards Text-guided 3D Scene Composition

Dec 13, 2023
Qihang Zhang, Chaoyang Wang, Aliaksandr Siarohin, Peiye Zhuang, Yinghao Xu, Ceyuan Yang, Dahua Lin, Bolei Zhou, Sergey Tulyakov, Hsin-Ying Lee

Viaarxiv icon

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Dec 13, 2023
Zeyi Sun, Ye Fang, Tong Wu, Pan Zhang, Yuhang Zang, Shu Kong, Yuanjun Xiong, Dahua Lin, Jiaqi Wang

Viaarxiv icon

HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image

Dec 07, 2023
Tong Wu, Zhibing Li, Shuai Yang, Pan Zhang, Xinggang Pan, Jiaqi Wang, Dahua Lin, Ziwei Liu

Viaarxiv icon

OneLLM: One Framework to Align All Modalities with Language

Dec 06, 2023
Jiaming Han, Kaixiong Gong, Yiyuan Zhang, Jiaqi Wang, Kaipeng Zhang, Dahua Lin, Yu Qiao, Peng Gao, Xiangyu Yue

Figure 1 for OneLLM: One Framework to Align All Modalities with Language
Figure 2 for OneLLM: One Framework to Align All Modalities with Language
Figure 3 for OneLLM: One Framework to Align All Modalities with Language
Figure 4 for OneLLM: One Framework to Align All Modalities with Language
Viaarxiv icon

Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future

Dec 06, 2023
Hongyang Li, Yang Li, Huijie Wang, Jia Zeng, Pinlong Cai, Huilin Xu, Dahua Lin, Junchi Yan, Feng Xu, Lu Xiong, Jingdong Wang, Futang Zhu, Kai Yan, Chunjing Xu, Tiancai Wang, Beipeng Mu, Shaoqing Ren, Zhihui Peng, Yu Qiao

Viaarxiv icon

GPT4Point: A Unified Framework for Point-Language Understanding and Generation

Dec 05, 2023
Zhangyang Qi, Ye Fang, Zeyi Sun, Xiaoyang Wu, Tong Wu, Jiaqi Wang, Dahua Lin, Hengshuang Zhao

Figure 1 for GPT4Point: A Unified Framework for Point-Language Understanding and Generation
Figure 2 for GPT4Point: A Unified Framework for Point-Language Understanding and Generation
Figure 3 for GPT4Point: A Unified Framework for Point-Language Understanding and Generation
Figure 4 for GPT4Point: A Unified Framework for Point-Language Understanding and Generation
Viaarxiv icon

VideoBooth: Diffusion-based Video Generation with Image Prompts

Dec 01, 2023
Yuming Jiang, Tianxing Wu, Shuai Yang, Chenyang Si, Dahua Lin, Yu Qiao, Chen Change Loy, Ziwei Liu

Figure 1 for VideoBooth: Diffusion-based Video Generation with Image Prompts
Figure 2 for VideoBooth: Diffusion-based Video Generation with Image Prompts
Figure 3 for VideoBooth: Diffusion-based Video Generation with Image Prompts
Figure 4 for VideoBooth: Diffusion-based Video Generation with Image Prompts
Viaarxiv icon