Marek Hanzl

ACCIDENT: A Benchmark Dataset for Vehicle Accident Detection from Traffic Surveillance Videos

Apr 10, 2026

Lukas Picek, Michal Čermák, Marek Hanzl, Vojtěch Čermák

Abstract: We introduce ACCIDENT, a benchmark dataset for traffic accident detection in CCTV footage, designed to evaluate models in supervised (IID and OOD) and zero-shot settings, reflecting both data-rich and data-scarce scenarios. The benchmark consists of a curated set of 2,027 real and 2,211 synthetic clips annotated with the accident time, spatial location, and high-level collision type. We define three core tasks: (i) temporal localization of the accident, (ii) its spatial localization, and (iii) collision type classification. Each task is evaluated using custom metrics that account for the uncertainty and ambiguity inherent in CCTV footage. In addition to the benchmark, we provide a diverse set of baselines, including heuristic, motion-aware, and vision-language approaches, and show that ACCIDENT is challenging. ACCIDENT is available at https://accidentbench.github.io

Zero-shot Hazard Identification in Autonomous Driving: A Case Study on the COOOL Benchmark

Dec 27, 2024

Lukas Picek, Vojtěch Čermák, Marek Hanzl

Abstract: This paper presents our submission to the COOOL competition, a novel benchmark for detecting and classifying out-of-label hazards in autonomous driving. Our approach integrates diverse methods across three core tasks: (i) driver reaction detection, (ii) hazard object identification, and (iii) hazard captioning. For driver reaction detection, we propose kernel-based change point detection on bounding boxes and optical flow dynamics to analyze motion patterns. For hazard identification, we combine a naive proximity-based strategy with object classification using a pre-trained ViT model. Finally, for hazard captioning, we use the MOLMO vision-language model with tailored prompts to generate precise and context-aware descriptions of rare and low-resolution hazards. The proposed pipeline outperformed the baseline methods by a large margin, reducing the relative error by 33%, and placed 2nd on the final leaderboard of 32 teams.
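As a rough illustration of the kernel-based change point detection mentioned in the abstract, the sketch below finds a single change point in a per-frame signal by minimizing a kernel within-segment cost. It is not the authors' implementation: the RBF kernel, the `gamma` and `min_size` parameters, the function names, and the restriction to one change point are all assumptions made for this example, and the input signal (for instance, per-frame bounding-box area or mean optical-flow magnitude) is hypothetical.

```python
import numpy as np


def rbf_gram(x, gamma):
    """Gram matrix of an RBF kernel over frames (kernel choice is an assumption)."""
    x = np.asarray(x, dtype=float)
    if x.ndim == 1:
        x = x[:, None]  # treat a 1-D signal as n frames with one feature each
    sq_dist = np.sum((x[:, None, :] - x[None, :, :]) ** 2, axis=-1)
    return np.exp(-gamma * sq_dist)


def single_change_point(signal, gamma=1.0, min_size=2):
    """Return the split index t that minimizes the summed kernel cost of
    the segments signal[:t] and signal[t:].

    The cost of a segment is trace(K_seg) - sum(K_seg) / len(seg), i.e. the
    within-segment scatter measured in the kernel feature space.
    """
    K = rbf_gram(signal, gamma)
    n = K.shape[0]
    best_t, best_cost = None, np.inf
    for t in range(min_size, n - min_size + 1):
        left, right = K[:t, :t], K[t:, t:]
        cost = (np.trace(left) - left.sum() / t) + (
            np.trace(right) - right.sum() / (n - t)
        )
        if cost < best_cost:
            best_t, best_cost = t, cost
    return best_t


# Hypothetical per-frame motion signal with an abrupt shift at frame 20.
motion = [0.0] * 20 + [5.0] * 20
change_frame = single_change_point(motion)
```

In a driver-reaction setting, the returned index would be read as the frame where the motion pattern changes. A practical system would likely need several change points and a penalty term for choosing their number, which this sketch leaves out.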
