Xiaoqi Li

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Mar 17, 2024
Siyuan Huang, Iaroslav Ponomarenko, Zhengkai Jiang, Xiaoqi Li, Xiaobin Hu, Peng Gao, Hongsheng Li, Hao Dong

NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation

Mar 13, 2024
Ran Xu, Yan Shen, Xiaoqi Li, Ruihai Wu, Hao Dong

ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation

Dec 24, 2023
Xiaoqi Li, Mingxu Zhang, Yiran Geng, Haoran Geng, Yuxing Long, Yan Shen, Renrui Zhang, Jiaming Liu, Hao Dong

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding

Dec 21, 2023
Senqiao Yang, Jiaming Liu, Ray Zhang, Mingjie Pan, Zoey Guo, Xiaoqi Li, Zehui Chen, Peng Gao, Yandong Guo, Shanghang Zhang

ImageManip: Image-based Robotic Manipulation with Affordance-guided Next View Selection

Oct 13, 2023
Xiaoqi Li, Yanzi Wang, Yan Shen, Ponomarenko Iaroslav, Haoran Lu, Qianxu Wang, Boshi An, Jiaming Liu, Hao Dong

RGBManip: Monocular Image-based Robotic Manipulation through Active Object Pose Estimation

Oct 05, 2023
Boshi An, Yiran Geng, Kai Chen, Xiaoqi Li, Qi Dou, Hao Dong

Distribution-Aware Continual Test Time Adaptation for Semantic Segmentation

Sep 24, 2023
Jiayi Ni, Senqiao Yang, Jiaming Liu, Xiaoqi Li, Wenyu Jiao, Ran Xu, Zehui Chen, Yi Liu, Shanghang Zhang

Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions

Sep 20, 2023
Yuxing Long, Xiaoqi Li, Wenzhe Cai, Hao Dong

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision

Sep 18, 2023
Mingjie Pan, Jiaming Liu, Renrui Zhang, Peixiang Huang, Xiaoqi Li, Li Liu, Shanghang Zhang

Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation

Sep 15, 2023
Hongcheng Wang, Andy Guan Hong Chen, Xiaoqi Li, Mingdong Wu, Hao Dong
