Picture for Yin Zhou

Yin Zhou

MoST: Multi-modality Scene Tokenization for Motion Prediction

Add code
Apr 30, 2024
Viaarxiv icon

STT: Stateful Tracking with Transformers for Autonomous Driving

Add code
Apr 30, 2024
Figure 1 for STT: Stateful Tracking with Transformers for Autonomous Driving
Figure 2 for STT: Stateful Tracking with Transformers for Autonomous Driving
Figure 3 for STT: Stateful Tracking with Transformers for Autonomous Driving
Figure 4 for STT: Stateful Tracking with Transformers for Autonomous Driving
Viaarxiv icon

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

Add code
Jan 04, 2024
Figure 1 for 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation
Figure 2 for 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation
Figure 3 for 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation
Figure 4 for 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation
Viaarxiv icon

Unsupervised 3D Perception with 2D Vision-Language Distillation for Autonomous Driving

Add code
Sep 25, 2023
Figure 1 for Unsupervised 3D Perception with 2D Vision-Language Distillation for Autonomous Driving
Figure 2 for Unsupervised 3D Perception with 2D Vision-Language Distillation for Autonomous Driving
Figure 3 for Unsupervised 3D Perception with 2D Vision-Language Distillation for Autonomous Driving
Figure 4 for Unsupervised 3D Perception with 2D Vision-Language Distillation for Autonomous Driving
Viaarxiv icon

3D Human Keypoints Estimation From Point Clouds in the Wild Without Human Labels

Add code
Jun 07, 2023
Figure 1 for 3D Human Keypoints Estimation From Point Clouds in the Wild Without Human Labels
Viaarxiv icon

MoDAR: Using Motion Forecasting for 3D Object Detection in Point Cloud Sequences

Add code
Jun 05, 2023
Figure 1 for MoDAR: Using Motion Forecasting for 3D Object Detection in Point Cloud Sequences
Figure 2 for MoDAR: Using Motion Forecasting for 3D Object Detection in Point Cloud Sequences
Figure 3 for MoDAR: Using Motion Forecasting for 3D Object Detection in Point Cloud Sequences
Figure 4 for MoDAR: Using Motion Forecasting for 3D Object Detection in Point Cloud Sequences
Viaarxiv icon

MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion

Add code
Jun 05, 2023
Figure 1 for MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion
Figure 2 for MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion
Figure 3 for MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion
Figure 4 for MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion
Viaarxiv icon

GINA-3D: Learning to Generate Implicit Neural Assets in the Wild

Add code
Apr 04, 2023
Figure 1 for GINA-3D: Learning to Generate Implicit Neural Assets in the Wild
Figure 2 for GINA-3D: Learning to Generate Implicit Neural Assets in the Wild
Figure 3 for GINA-3D: Learning to Generate Implicit Neural Assets in the Wild
Figure 4 for GINA-3D: Learning to Generate Implicit Neural Assets in the Wild
Viaarxiv icon

HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving

Add code
Dec 15, 2022
Figure 1 for HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving
Figure 2 for HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving
Figure 3 for HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving
Figure 4 for HUM3DIL: Semi-supervised Multi-modal 3D Human Pose Estimation for Autonomous Driving
Viaarxiv icon

NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors

Add code
Dec 06, 2022
Figure 1 for NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors
Figure 2 for NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors
Figure 3 for NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors
Figure 4 for NeRDi: Single-View NeRF Synthesis with Language-Guided Diffusion as General Image Priors
Viaarxiv icon