We introduce ForeSight, a novel joint detection and forecasting framework for vision-based 3D perception in autonomous vehicles. Traditional approaches treat detection and forecasting as separate sequential tasks, limiting their ability to leverage temporal cues. ForeSight addresses this limitation with a multi-task streaming and bidirectional learning approach, allowing detection and forecasting to share query memory and propagate information seamlessly. The forecast-aware detection transformer enhances spatial reasoning by integrating trajectory predictions from a multiple-hypothesis forecast memory queue, while the streaming forecast transformer improves temporal consistency using past forecasts and refined detections. Unlike tracking-based methods, ForeSight eliminates the need for explicit object association, reducing error propagation with a tracking-free model that scales efficiently across multi-frame sequences. Experiments on the nuScenes dataset show that ForeSight achieves state-of-the-art performance with an EPA of 54.9%, surpassing previous methods by 9.3%, while also attaining the best mAP and minADE among multi-view detection and forecasting models.
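
The core mechanism, a queue of past multi-hypothesis forecasts whose trajectories feed back into the detection queries, can be illustrated with a minimal sketch. All names, shapes, and dimensions below (`ForecastMemoryQueue`, `ForecastAwareDetector`, the 2D trajectory format) are hypothetical assumptions for illustration; the paper's actual transformer architecture is not reproduced here, only the data flow between the forecast memory and the detection queries.

```python
import torch
import torch.nn as nn


class ForecastMemoryQueue:
    """FIFO queue of past multi-hypothesis forecasts, one entry per frame.

    Hypothetical structure: each entry has shape (num_queries, K, horizon, 2),
    i.e. K trajectory hypotheses of `horizon` 2D waypoints per object query.
    """

    def __init__(self, max_frames: int = 4):
        self.max_frames = max_frames
        self.frames: list[torch.Tensor] = []

    def push(self, forecasts: torch.Tensor) -> None:
        # Append the newest frame's forecasts; evict the oldest beyond capacity.
        self.frames.append(forecasts)
        if len(self.frames) > self.max_frames:
            self.frames.pop(0)

    def as_tokens(self) -> torch.Tensor:
        # Flatten all stored hypotheses into one token set for cross-attention.
        return torch.cat([f.flatten(0, 1) for f in self.frames], dim=0)


class ForecastAwareDetector(nn.Module):
    """Detection queries cross-attend to embedded forecast tokens (a sketch)."""

    def __init__(self, d_model: int = 256, horizon: int = 6):
        super().__init__()
        # Embed each (horizon, 2) trajectory into the query feature space.
        self.traj_embed = nn.Linear(horizon * 2, d_model)
        self.cross_attn = nn.MultiheadAttention(d_model, num_heads=8, batch_first=True)

    def forward(self, det_queries: torch.Tensor, memory: ForecastMemoryQueue) -> torch.Tensor:
        if not memory.frames:
            return det_queries
        tokens = memory.as_tokens()               # (N_tokens, horizon, 2)
        kv = self.traj_embed(tokens.flatten(1))   # (N_tokens, d_model)
        kv = kv.unsqueeze(0)                      # add batch dimension
        refined, _ = self.cross_attn(det_queries, kv, kv)
        return det_queries + refined              # residual update of detection queries


# Example usage with assumed sizes: 50 queries, 6 hypotheses, 6-step horizon.
mem = ForecastMemoryQueue()
mem.push(torch.randn(50, 6, 6, 2))
detector = ForecastAwareDetector()
queries = torch.randn(1, 50, 256)
refined = detector(queries, mem)                 # (1, 50, 256)
```

The residual cross-attention update reflects the abstract's claim that forecasts inform detection without explicit object association: queries attend to the whole pool of past trajectory hypotheses rather than being matched one-to-one with tracks.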