Picture for Yifeng Shi

Yifeng Shi

DoReMi: A Domain-Representation Mixture Framework for Generalizable 3D Understanding

Add code
Nov 14, 2025
Figure 1 for DoReMi: A Domain-Representation Mixture Framework for Generalizable 3D Understanding
Figure 2 for DoReMi: A Domain-Representation Mixture Framework for Generalizable 3D Understanding
Figure 3 for DoReMi: A Domain-Representation Mixture Framework for Generalizable 3D Understanding
Figure 4 for DoReMi: A Domain-Representation Mixture Framework for Generalizable 3D Understanding
Viaarxiv icon

Sat2RealCity: Geometry-Aware and Appearance-Controllable 3D Urban Generation from Satellite Imagery

Add code
Nov 14, 2025
Viaarxiv icon

RopeBEV: A Multi-Camera Roadside Perception Network in Bird's-Eye-View

Add code
Sep 18, 2024
Figure 1 for RopeBEV: A Multi-Camera Roadside Perception Network in Bird's-Eye-View
Figure 2 for RopeBEV: A Multi-Camera Roadside Perception Network in Bird's-Eye-View
Figure 3 for RopeBEV: A Multi-Camera Roadside Perception Network in Bird's-Eye-View
Figure 4 for RopeBEV: A Multi-Camera Roadside Perception Network in Bird's-Eye-View
Viaarxiv icon

RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision

Add code
Sep 13, 2024
Figure 1 for RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision
Figure 2 for RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision
Figure 3 for RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision
Figure 4 for RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision
Viaarxiv icon

Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection

Add code
Jul 22, 2024
Viaarxiv icon

ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions

Add code
Mar 15, 2024
Viaarxiv icon

MonoLSS: Learnable Sample Selection For Monocular 3D Detection

Add code
Dec 22, 2023
Figure 1 for MonoLSS: Learnable Sample Selection For Monocular 3D Detection
Figure 2 for MonoLSS: Learnable Sample Selection For Monocular 3D Detection
Figure 3 for MonoLSS: Learnable Sample Selection For Monocular 3D Detection
Figure 4 for MonoLSS: Learnable Sample Selection For Monocular 3D Detection
Viaarxiv icon

DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative Perception

Add code
Oct 12, 2023
Viaarxiv icon

V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting

Add code
May 10, 2023
Figure 1 for V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting
Figure 2 for V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting
Figure 3 for V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting
Figure 4 for V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting
Viaarxiv icon

Multimodal Understanding Through Correlation Maximization and Minimization

Add code
May 04, 2023
Viaarxiv icon