Jiwen Lu

GPD-1: Generative Pre-training for Driving

Dec 11, 2024

Bridging the Divide: Reconsidering Softmax and Linear Attention

Dec 09, 2024

Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving

Dec 09, 2024

GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction

Dec 06, 2024

Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model

Dec 06, 2024

Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction

Dec 05, 2024

EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding

Dec 05, 2024

XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation

Nov 20, 2024

PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary Views

Oct 24, 2024

V2M: Visual 2-Dimensional Mamba for Image Representation Learning

Oct 14, 2024