Picture for Enze Xie

Enze Xie

Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction

Add code
Mar 19, 2023
Figure 1 for Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction
Figure 2 for Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction
Figure 3 for Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction
Figure 4 for Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction
Viaarxiv icon

Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline

Add code
Jan 29, 2023
Figure 1 for Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline
Figure 2 for Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline
Figure 3 for Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline
Figure 4 for Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline
Viaarxiv icon

Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Add code
Jan 19, 2023
Figure 1 for Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception
Figure 2 for Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception
Figure 3 for Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception
Figure 4 for Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception
Viaarxiv icon

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe

Add code
Sep 12, 2022
Figure 1 for Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
Figure 2 for Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
Figure 3 for Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
Figure 4 for Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe
Viaarxiv icon

UNITS: Unsupervised Intermediate Training Stage for Scene Text Detection

Add code
May 10, 2022
Figure 1 for UNITS: Unsupervised Intermediate Training Stage for Scene Text Detection
Figure 2 for UNITS: Unsupervised Intermediate Training Stage for Scene Text Detection
Figure 3 for UNITS: Unsupervised Intermediate Training Stage for Scene Text Detection
Figure 4 for UNITS: Unsupervised Intermediate Training Stage for Scene Text Detection
Viaarxiv icon

Understanding The Robustness in Vision Transformers

Add code
Apr 27, 2022
Figure 1 for Understanding The Robustness in Vision Transformers
Figure 2 for Understanding The Robustness in Vision Transformers
Figure 3 for Understanding The Robustness in Vision Transformers
Figure 4 for Understanding The Robustness in Vision Transformers
Viaarxiv icon

M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation

Add code
Apr 19, 2022
Figure 1 for M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation
Figure 2 for M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation
Figure 3 for M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation
Figure 4 for M$^2$BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation
Viaarxiv icon

Improving Monocular Visual Odometry Using Learned Depth

Add code
Apr 04, 2022
Figure 1 for Improving Monocular Visual Odometry Using Learned Depth
Figure 2 for Improving Monocular Visual Odometry Using Learned Depth
Figure 3 for Improving Monocular Visual Odometry Using Learned Depth
Figure 4 for Improving Monocular Visual Odometry Using Learned Depth
Viaarxiv icon

BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers

Add code
Mar 31, 2022
Figure 1 for BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Figure 2 for BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Figure 3 for BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Figure 4 for BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Viaarxiv icon

WegFormer: Transformers for Weakly Supervised Semantic Segmentation

Add code
Mar 16, 2022
Figure 1 for WegFormer: Transformers for Weakly Supervised Semantic Segmentation
Figure 2 for WegFormer: Transformers for Weakly Supervised Semantic Segmentation
Figure 3 for WegFormer: Transformers for Weakly Supervised Semantic Segmentation
Figure 4 for WegFormer: Transformers for Weakly Supervised Semantic Segmentation
Viaarxiv icon