Picture for Zheng Zhu

Zheng Zhu

Tencent, WeChat Pay

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Add code
Jul 15, 2024
Viaarxiv icon

The SkatingVerse Workshop & Challenge: Methods and Results

Add code
May 27, 2024
Figure 1 for The SkatingVerse Workshop & Challenge: Methods and Results
Figure 2 for The SkatingVerse Workshop & Challenge: Methods and Results
Viaarxiv icon

MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving

Add code
May 13, 2024
Viaarxiv icon

DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving

Add code
May 07, 2024
Viaarxiv icon

Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

Add code
May 06, 2024
Viaarxiv icon

Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments

Add code
Apr 21, 2024
Figure 1 for Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments
Figure 2 for Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments
Figure 3 for Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments
Figure 4 for Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments
Viaarxiv icon

TAIL: A Terrain-Aware Multi-Modal SLAM Dataset for Robot Locomotion in Deformable Granular Environments

Add code
Mar 25, 2024
Figure 1 for TAIL: A Terrain-Aware Multi-Modal SLAM Dataset for Robot Locomotion in Deformable Granular Environments
Figure 2 for TAIL: A Terrain-Aware Multi-Modal SLAM Dataset for Robot Locomotion in Deformable Granular Environments
Figure 3 for TAIL: A Terrain-Aware Multi-Modal SLAM Dataset for Robot Locomotion in Deformable Granular Environments
Figure 4 for TAIL: A Terrain-Aware Multi-Modal SLAM Dataset for Robot Locomotion in Deformable Granular Environments
Viaarxiv icon

DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation

Add code
Mar 11, 2024
Figure 1 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Figure 2 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Figure 3 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Figure 4 for DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
Viaarxiv icon

WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens

Add code
Jan 18, 2024
Viaarxiv icon

On the Identifiability from Modulo Measurements under DFT Sensing Matrix

Add code
Dec 30, 2023
Viaarxiv icon