Picture for Yuntao Chen

Yuntao Chen

Monocular Occupancy Prediction for Scalable Indoor Scenes

Add code
Jul 16, 2024
Viaarxiv icon

Enhancing End-to-End Autonomous Driving with Latent World Model

Add code
Jun 12, 2024
Viaarxiv icon

Continual Forgetting for Pre-trained Vision Models

Add code
Mar 18, 2024
Figure 1 for Continual Forgetting for Pre-trained Vision Models
Figure 2 for Continual Forgetting for Pre-trained Vision Models
Figure 3 for Continual Forgetting for Pre-trained Vision Models
Figure 4 for Continual Forgetting for Pre-trained Vision Models
Viaarxiv icon

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

Add code
Jan 18, 2024
Viaarxiv icon

Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

Add code
Jan 11, 2024
Viaarxiv icon

Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving

Add code
Nov 29, 2023
Figure 1 for Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving
Figure 2 for Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving
Figure 3 for Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving
Figure 4 for Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving
Viaarxiv icon

PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation

Add code
Jun 16, 2023
Figure 1 for PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
Figure 2 for PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
Figure 3 for PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
Figure 4 for PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
Viaarxiv icon

Tracking Objects with 3D Representation from Videos

Add code
Jun 08, 2023
Figure 1 for Tracking Objects with 3D Representation from Videos
Figure 2 for Tracking Objects with 3D Representation from Videos
Figure 3 for Tracking Objects with 3D Representation from Videos
Figure 4 for Tracking Objects with 3D Representation from Videos
Viaarxiv icon

2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction

Add code
Jun 08, 2023
Figure 1 for 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction
Figure 2 for 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction
Figure 3 for 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction
Figure 4 for 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction
Viaarxiv icon

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory

Add code
Jun 01, 2023
Figure 1 for Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
Figure 2 for Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
Figure 3 for Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
Figure 4 for Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory
Viaarxiv icon