Picture for Shizhong Han

Shizhong Han

ODG: Occupancy Prediction Using Dual Gaussians

Add code
Jun 12, 2025
Viaarxiv icon

DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos

Add code
Jun 11, 2025
Viaarxiv icon

RoCA: Robust Cross-Domain End-to-End Autonomous Driving

Add code
Jun 11, 2025
Viaarxiv icon

BePo: Leveraging Birds Eye View and Sparse Points for Efficient and Accurate 3D Occupancy Prediction

Add code
Jun 08, 2025
Viaarxiv icon

FALO: Fast and Accurate LiDAR 3D Object Detection on Resource-Constrained Devices

Add code
Jun 04, 2025
Viaarxiv icon

Distilling Multi-modal Large Language Models for Autonomous Driving

Add code
Jan 16, 2025
Viaarxiv icon

Ranking and Combining Latent Structured Predictive Scores without Labeled Data

Add code
Aug 14, 2024
Figure 1 for Ranking and Combining Latent Structured Predictive Scores without Labeled Data
Figure 2 for Ranking and Combining Latent Structured Predictive Scores without Labeled Data
Figure 3 for Ranking and Combining Latent Structured Predictive Scores without Labeled Data
Figure 4 for Ranking and Combining Latent Structured Predictive Scores without Labeled Data
Viaarxiv icon

PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer

Add code
Jul 16, 2024
Figure 1 for PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer
Figure 2 for PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer
Figure 3 for PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer
Figure 4 for PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer
Viaarxiv icon

FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

Add code
Mar 19, 2024
Figure 1 for FutureDepth: Learning to Predict the Future Improves Video Depth Estimation
Figure 2 for FutureDepth: Learning to Predict the Future Improves Video Depth Estimation
Figure 3 for FutureDepth: Learning to Predict the Future Improves Video Depth Estimation
Figure 4 for FutureDepth: Learning to Predict the Future Improves Video Depth Estimation
Viaarxiv icon

OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding

Add code
May 18, 2023
Viaarxiv icon