Alert button
Picture for Hongsheng Li

Hongsheng Li

Alert button

NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates Space

Add code
Bookmark button
Alert button
Sep 27, 2023
Jiawei Yao, Chuming Li, Keqiang Sun, Yingjie Cai, Hao Li, Wanli Ouyang, Hongsheng Li

Figure 1 for NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates Space
Figure 2 for NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates Space
Figure 3 for NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates Space
Figure 4 for NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates Space
Viaarxiv icon

ImageBind-LLM: Multi-modality Instruction Tuning

Add code
Bookmark button
Alert button
Sep 11, 2023
Jiaming Han, Renrui Zhang, Wenqi Shao, Peng Gao, Peng Xu, Han Xiao, Kaipeng Zhang, Chris Liu, Song Wen, Ziyu Guo, Xudong Lu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Xiangyu Yue, Hongsheng Li, Yu Qiao

Figure 1 for ImageBind-LLM: Multi-modality Instruction Tuning
Figure 2 for ImageBind-LLM: Multi-modality Instruction Tuning
Figure 3 for ImageBind-LLM: Multi-modality Instruction Tuning
Figure 4 for ImageBind-LLM: Multi-modality Instruction Tuning
Viaarxiv icon

Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following

Add code
Bookmark button
Alert button
Sep 01, 2023
Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng

Figure 1 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Figure 2 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Figure 3 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Figure 4 for Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following
Viaarxiv icon

Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation

Add code
Bookmark button
Alert button
Aug 20, 2023
Jinyu Chen, Wenguan Wang, Si Liu, Hongsheng Li, Yi Yang

Figure 1 for Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation
Figure 2 for Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation
Figure 3 for Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation
Figure 4 for Omnidirectional Information Gathering for Knowledge Transfer-based Audio-Visual Navigation
Viaarxiv icon

Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification

Add code
Bookmark button
Alert button
Aug 15, 2023
Aojun Zhou, Ke Wang, Zimu Lu, Weikang Shi, Sichun Luo, Zipeng Qin, Shaoqing Lu, Anya Jia, Linqi Song, Mingjie Zhan, Hongsheng Li

Figure 1 for Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification
Figure 2 for Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification
Figure 3 for Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification
Figure 4 for Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification
Viaarxiv icon

Tiny LVLM-eHub: Early Multimodal Experiments with Bard

Add code
Bookmark button
Alert button
Aug 07, 2023
Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo

Figure 1 for Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Figure 2 for Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Figure 3 for Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Figure 4 for Tiny LVLM-eHub: Early Multimodal Experiments with Bard
Viaarxiv icon

Meta-Transformer: A Unified Framework for Multimodal Learning

Add code
Bookmark button
Alert button
Jul 20, 2023
Yiyuan Zhang, Kaixiong Gong, Kaipeng Zhang, Hongsheng Li, Yu Qiao, Wanli Ouyang, Xiangyu Yue

Figure 1 for Meta-Transformer: A Unified Framework for Multimodal Learning
Figure 2 for Meta-Transformer: A Unified Framework for Multimodal Learning
Figure 3 for Meta-Transformer: A Unified Framework for Multimodal Learning
Figure 4 for Meta-Transformer: A Unified Framework for Multimodal Learning
Viaarxiv icon

Urban Radiance Field Representation with Deformable Neural Mesh Primitives

Add code
Bookmark button
Alert button
Jul 20, 2023
Fan Lu, Yan Xu, Guang Chen, Hongsheng Li, Kwan-Yee Lin, Changjun Jiang

Figure 1 for Urban Radiance Field Representation with Deformable Neural Mesh Primitives
Figure 2 for Urban Radiance Field Representation with Deformable Neural Mesh Primitives
Figure 3 for Urban Radiance Field Representation with Deformable Neural Mesh Primitives
Figure 4 for Urban Radiance Field Representation with Deformable Neural Mesh Primitives
Viaarxiv icon

JourneyDB: A Benchmark for Generative Image Understanding

Add code
Bookmark button
Alert button
Jul 03, 2023
Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Hongsheng Li

Figure 1 for JourneyDB: A Benchmark for Generative Image Understanding
Figure 2 for JourneyDB: A Benchmark for Generative Image Understanding
Figure 3 for JourneyDB: A Benchmark for Generative Image Understanding
Figure 4 for JourneyDB: A Benchmark for Generative Image Understanding
Viaarxiv icon