Zheng Zhu

Tencent, WeChat Pay

MaskFuser: Masked Fusion of Joint Multi-Modal Tokenization for End-to-End Autonomous Driving

May 13, 2024
Via arXiv

DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving

May 07, 2024
Via arXiv

Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

May 06, 2024
Via arXiv

Are We Ready for Planetary Exploration Robots? The TAIL-Plus Dataset for SLAM in Granular Environments

Apr 21, 2024
Via arXiv

TAIL: A Terrain-Aware Multi-Modal SLAM Dataset for Robot Locomotion in Deformable Granular Environments

Mar 25, 2024
Via arXiv

DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation

Mar 11, 2024
Via arXiv

WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens

Jan 18, 2024
Via arXiv

On the Identifiability from Modulo Measurements under DFT Sensing Matrix

Dec 30, 2023
Via arXiv

Generative Pretraining at Scale: Transformer-Based Encoding of Transactional Behavior for Fraud Detection

Dec 22, 2023
Via arXiv

OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong Baseline

Dec 01, 2023
Via arXiv