Alert button
Picture for Ruimao Zhang

Ruimao Zhang

Alert button

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Mar 19, 2024
Enshen Zhou, Yiran Qin, Zhenfei Yin, Yuzhou Huang, Ruimao Zhang, Lu Sheng, Yu Qiao, Jing Shao

Viaarxiv icon

Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration

Feb 07, 2024
Chaoqun Wang, Yiran Qin, Zijian Kang, Ningning Ma, Ruimao Zhang

Viaarxiv icon

MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception

Dec 13, 2023
Yiran Qin, Enshen Zhou, Qichang Liu, Zhenfei Yin, Lu Sheng, Ruimao Zhang, Yu Qiao, Jing Shao

Viaarxiv icon

X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer

Dec 12, 2023
Linglin Jing, Ying Xue, Xu Yan, Chaoda Zheng, Dong Wang, Ruimao Zhang, Zhigang Wang, Hui Fang, Bin Zhao, Zhen Li

Viaarxiv icon

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

Dec 11, 2023
Yuzhou Huang, Liangbin Xie, Xintao Wang, Ziyang Yuan, Xiaodong Cun, Yixiao Ge, Jiantao Zhou, Chao Dong, Rui Huang, Ruimao Zhang, Ying Shan

Viaarxiv icon

SEED-Bench-2: Benchmarking Multimodal Large Language Models

Nov 28, 2023
Bohao Li, Yuying Ge, Yixiao Ge, Guangzhi Wang, Rui Wang, Ruimao Zhang, Ying Shan

Viaarxiv icon

HumanTOMATO: Text-aligned Whole-body Motion Generation

Oct 19, 2023
Shunlin Lu, Ling-Hao Chen, Ailing Zeng, Jing Lin, Ruimao Zhang, Lei Zhang, Heung-Yeung Shum

Figure 1 for HumanTOMATO: Text-aligned Whole-body Motion Generation
Figure 2 for HumanTOMATO: Text-aligned Whole-body Motion Generation
Figure 3 for HumanTOMATO: Text-aligned Whole-body Motion Generation
Figure 4 for HumanTOMATO: Text-aligned Whole-body Motion Generation
Viaarxiv icon

UniPose: Detecting Any Keypoints

Oct 12, 2023
Jie Yang, Ailing Zeng, Ruimao Zhang, Lei Zhang

Figure 1 for UniPose: Detecting Any Keypoints
Figure 2 for UniPose: Detecting Any Keypoints
Figure 3 for UniPose: Detecting Any Keypoints
Figure 4 for UniPose: Detecting Any Keypoints
Viaarxiv icon

SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection

Sep 13, 2023
Yiran Qin, Chaoqun Wang, Zijian Kang, Ningning Ma, Zhen Li, Ruimao Zhang

Figure 1 for SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection
Figure 2 for SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection
Figure 3 for SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection
Figure 4 for SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection
Viaarxiv icon

FreeMan: Towards Benchmarking 3D Human Pose Estimation in the Wild

Sep 12, 2023
Jiong Wang, Fengyu Yang, Wenbo Gou, Bingliang Li, Danqi Yan, Ailing Zeng, Yijun Gao, Junle Wang, Ruimao Zhang

Figure 1 for FreeMan: Towards Benchmarking 3D Human Pose Estimation in the Wild
Figure 2 for FreeMan: Towards Benchmarking 3D Human Pose Estimation in the Wild
Figure 3 for FreeMan: Towards Benchmarking 3D Human Pose Estimation in the Wild
Figure 4 for FreeMan: Towards Benchmarking 3D Human Pose Estimation in the Wild
Viaarxiv icon