Svetlana Pavlitska

Real-World On-Vehicle Evaluation of Embedding-Based Anomaly Detection

May 19, 2026

Albert Schotschneider, Daniel Bogdoll, Svetlana Pavlitska, Ahmed Abouelazm, Johann Marius Zoellner

Abstract:Detecting anomalies in traffic scenes is crucial for ensuring safety in autonomous driving, yet collecting representative anomalous data remains challenging. Existing anomaly detection methods are highly specialized and rely on normality as defined by the abstract semantic Cityscapes classes, making it difficult to adapt to diverse real-world scenarios. We propose an adaptable real-time anomaly detection method that leverages foundation models in the form of pretrained vision transformer embeddings to detect deviations via nearest-neighbor similarity in the latent semantic feature space. Based on patch-wise processing, the algorithm produces dense anomaly masks, allowing for the localization of detected anomalies. The method robustly models normality through a single reference image. This formulation avoids explicit supervision and dataset-specific training, making it suitable for real-world deployment. We evaluate the method on standard benchmarks and on an automated vehicle in real-world scenarios. Despite its simplicity, the method achieves good performance on the Road Anomaly benchmark and demonstrates consistent qualitative behavior in practice, successfully highlighting semantically unusual objects in diverse scenes. These results suggest that simple, reference-based methods can provide useful anomaly signals under realistic operating conditions.

* Accepted at CVPR 2026 Workshop AUTOPILOT-NA
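
The patch-wise nearest-neighbor scoring described in the abstract above can be illustrated with a small sketch. It is not the authors' implementation: the cosine similarity measure, the `1 - similarity` score, the threshold value, and the toy three-dimensional vectors standing in for vision transformer patch embeddings are all assumptions made for illustration.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def anomaly_scores(query_patches, reference_patches):
    """Score each query patch as 1 minus its best similarity to any reference patch."""
    scores = []
    for q in query_patches:
        best = max(cosine_similarity(q, r) for r in reference_patches)
        scores.append(1.0 - best)
    return scores

def to_mask(scores, grid_width, threshold):
    """Reshape the flat per-patch scores into a 2D boolean anomaly mask."""
    flags = [s > threshold for s in scores]
    return [flags[i:i + grid_width] for i in range(0, len(flags), grid_width)]

# Toy stand-ins for patch embeddings (hypothetical values, not model outputs).
reference = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]   # patches of the single reference image
query = [[0.9, 0.1, 0.0], [0.0, 1.0, 0.0],       # 2x2 grid of query patches
         [0.0, 0.0, 1.0], [0.1, 0.9, 0.0]]

scores = anomaly_scores(query, reference)
mask = to_mask(scores, grid_width=2, threshold=0.5)  # threshold is an assumed value
```

In this toy grid only the third patch has no close neighbor among the reference patches, so it is the only one flagged in `mask`.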

Towards a Systematic Risk Assessment of Deep Neural Network Limitations in Autonomous Driving Perception

Apr 21, 2026

Svetlana Pavlitska, Christopher Gerking, J. Marius Zöllner

Abstract:Safety and security are essential for the admission and acceptance of automated and autonomous vehicles. Deep neural networks (DNNs) are widely used for perception and further components of the autonomous driving (AD) stack. However, they possess several limitations, including lack of generalization, efficiency, explainability, plausibility, and robustness. These insufficiencies can pose significant risks to autonomous driving systems, yet the hazards, threats, and risks associated with DNN limitations in this domain have not been systematically studied so far. In this work, we propose a joint workflow for risk assessment combining hazard analysis and risk assessment (HARA) following ISO 26262 and threat analysis and risk assessment (TARA) following ISO/SAE 21434 to identify and analyze risks arising from inherent DNN limitations in AD perception.

* Accepted for publication at the SECAI workshop at ESORICS 2025

Domain-Specialized Object Detection via Model-Level Mixtures of Experts

Apr 20, 2026

Svetlana Pavlitska, Malte Stüven, Beyza Keskin, J. Marius Zöllner

Abstract:Mixture-of-Experts (MoE) models provide a structured approach to combining specialized neural networks and offer greater interpretability than conventional ensembles. While MoEs have been successfully applied to image classification and semantic segmentation, their use in object detection remains limited due to challenges in merging dense and structured predictions. In this work, we investigate model-level mixtures of object detectors and analyze their suitability for improving performance and interpretability in object detection. We propose an MoE architecture that combines YOLO-based detectors trained on semantically disjoint data subsets, with a learned gating network that dynamically weights expert contributions. We study different strategies for fusing detection outputs and for training the gating mechanism, including balancing losses to prevent expert collapse. Experiments on the BDD100K dataset demonstrate that the proposed MoE consistently outperforms standard ensemble approaches and provides insights into expert specialization across domains, highlighting model-level MoEs as a viable alternative to traditional ensembling for object detection. Our code is available at https://github.com/KASTEL-MobilityLab/mixtures-of-experts/.

* Accepted for publication at IJCNN 2026

Design and Behavior of Sparse Mixture-of-Experts Layers in CNN-based Semantic Segmentation

Apr 15, 2026

Svetlana Pavlitska, Haixi Fan, Konstantin Ditschuneit, J. Marius Zöllner

Abstract:Sparse mixture-of-experts (MoE) layers have been shown to substantially increase model capacity without a proportional increase in computational cost and are widely used in transformer architectures, where they typically replace feed-forward network blocks. In contrast, integrating sparse MoE layers into convolutional neural networks (CNNs) remains inconsistent, with most prior work focusing on fine-grained MoEs operating at the filter or channel levels. In this work, we investigate a coarser, patch-wise formulation of sparse MoE layers for semantic segmentation, where local regions are routed to a small subset of convolutional experts. Through experiments on the Cityscapes and BDD100K datasets using encoder-decoder and backbone-based CNNs, we conduct a design analysis to assess how architectural choices affect routing dynamics and expert specialization. Our results demonstrate consistent, architecture-dependent improvements (up to +3.9 mIoU) with little computational overhead, while revealing strong design sensitivity. Our work provides empirical insights into the design and internal dynamics of sparse MoE layers in CNN-based dense prediction. Our code is available at https://github.com/KASTEL-MobilityLab/moe-layers/.

* Accepted for publication at the SAIAD workshop at CVPR 2026
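
The idea of routing each local region to a small subset of experts, as described in the abstract above, can be sketched as follows. This is a generic top-k routing example under stated assumptions, not the paper's layer: the softmax gate, the renormalization over the selected experts, and the scalar "experts" (plain functions standing in for convolutional blocks) are hypothetical choices made for illustration.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of gate logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def top_k_route(gate_logits, k):
    """Keep the k experts with the highest gate probability and renormalize their weights."""
    probs = softmax(gate_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return {i: probs[i] / norm for i in top}

def sparse_moe(patch, gate_logits, experts, k):
    """Evaluate only the selected experts on one patch and sum their weighted outputs."""
    routing = top_k_route(gate_logits, k)
    out = [0.0] * len(patch)
    for idx, weight in routing.items():
        expert_out = experts[idx](patch)
        out = [o + weight * v for o, v in zip(out, expert_out)]
    return out, routing

# Hypothetical experts: each simply scales the patch features by a constant.
experts = [lambda p, s=s: [s * v for v in p] for s in (1.0, 2.0, 3.0, 4.0)]

patch = [1.0, 2.0]                      # toy feature vector for one local region
gate_logits = [2.0, 0.0, 2.0, -1.0]     # assumed gate output for this patch
output, routing = sparse_moe(patch, gate_logits, experts, k=2)
```

Because only `k` of the experts are evaluated per patch, the number of experts can grow without a proportional increase in per-patch computation, which is the property the abstract refers to.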

Runtime Safety Monitoring of Deep Neural Networks for Perception: A Survey

Nov 08, 2025

Albert Schotschneider, Svetlana Pavlitska, J. Marius Zöllner

Abstract:Deep neural networks (DNNs) are widely used in perception systems for safety-critical applications, such as autonomous driving and robotics. However, DNNs remain vulnerable to various safety concerns, including generalization errors, out-of-distribution (OOD) inputs, and adversarial attacks, which can lead to hazardous failures. This survey provides a comprehensive overview of runtime safety monitoring approaches, which operate in parallel to DNNs during inference to detect these safety concerns without modifying the DNN itself. We categorize existing methods into three main groups: Monitoring inputs, internal representations, and outputs. We analyze the state-of-the-art for each category, identify strengths and limitations, and map methods to the safety concerns they address. In addition, we highlight open challenges and future research directions.

* 6 pages, 1 figure, 2 tables, accepted at IEEE SMC 2025 in Vienna, presented on 8th October 2025

DEAP DIVE: Dataset Investigation with Vision transformers for EEG evaluation

Oct 01, 2025

Annemarie Hoffsommer, Helen Schneider, Svetlana Pavlitska, J. Marius Zöllner

Abstract:Accurately predicting emotions from brain signals has the potential to achieve goals such as improving mental health, human-computer interaction, and affective computing. Emotion prediction through neural signals offers a promising alternative to traditional methods, such as self-assessment and facial expression analysis, which can be subjective or ambiguous. Measurements of brain activity via electroencephalogram (EEG) provide a more direct and unbiased data source. However, conducting a full EEG is a complex, resource-intensive process, leading to the rise of low-cost EEG devices with simplified measurement capabilities. This work examines how subsets of EEG channels from the DEAP dataset can be used for sufficiently accurate emotion prediction with low-cost EEG devices, rather than fully equipped EEG measurements. Using Continuous Wavelet Transformation to convert EEG data into scaleograms, we trained a vision transformer (ViT) model for emotion classification. The model achieved over 91.57% accuracy in predicting 4 quadrants (high/low per arousal and valence) with only 12 measuring points (also referred to as channels). Our work clearly shows that a significant reduction of input channels still yields high accuracy compared to state-of-the-art results of 96.9% with 32 channels. Training scripts to reproduce our results can be found here: https://gitlab.kit.edu/kit/aifb/ATKS/public/AutoSMiLeS/DEAP-DIVE.

* Accepted for publication at ABAW Workshop at ICCV 2025

Extracting Uncertainty Estimates from Mixtures of Experts for Semantic Segmentation

Sep 05, 2025

Svetlana Pavlitska, Beyza Keskin, Alwin Faßbender, Christian Hubschneider, J. Marius Zöllner

Abstract:Estimating accurate and well-calibrated predictive uncertainty is important for enhancing the reliability of computer vision models, especially in safety-critical applications like traffic scene perception. While ensemble methods are commonly used to quantify uncertainty by combining multiple models, a mixture of experts (MoE) offers an efficient alternative by leveraging a gating network to dynamically weight expert predictions based on the input. Building on the promising use of MoEs for semantic segmentation in our previous works, we show that well-calibrated predictive uncertainty estimates can be extracted from MoEs without architectural modifications. We investigate three methods to extract predictive uncertainty estimates: predictive entropy, mutual information, and expert variance. We evaluate these methods for an MoE with two experts trained on a semantic split of the A2D2 dataset. Our results show that MoEs yield more reliable uncertainty estimates than ensembles in terms of conditional correctness metrics under out-of-distribution (OOD) data. Additionally, we evaluate routing uncertainty computed via gate entropy and find that simple gating mechanisms lead to better calibration of routing uncertainty estimates than more complex classwise gates. Finally, our experiments on the Cityscapes dataset suggest that increasing the number of experts can further enhance uncertainty calibration. Our code is available at https://github.com/KASTEL-MobilityLab/mixtures-of-experts/.

* Accepted for publication at the STREAM workshop at ICCV 2025
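
The three uncertainty measures named in the abstract above (predictive entropy, mutual information, and expert variance) can be sketched for a single pixel as follows. The formulas are the commonly used textbook definitions applied to a gate-weighted mixture; whether the paper uses exactly these forms, in particular the gate weighting and the averaging of the variance over classes, is an assumption. All function names and the toy probabilities are hypothetical.

```python
import math

def entropy(probs):
    """Shannon entropy in nats, skipping zero-probability classes."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def mixture(expert_probs, gate_weights):
    """Gate-weighted average of the experts' class distributions."""
    n_classes = len(expert_probs[0])
    return [sum(g * p[c] for g, p in zip(gate_weights, expert_probs))
            for c in range(n_classes)]

def predictive_entropy(expert_probs, gate_weights):
    """Entropy of the mixed prediction (total uncertainty)."""
    return entropy(mixture(expert_probs, gate_weights))

def mutual_information(expert_probs, gate_weights):
    """Total entropy minus the gate-weighted mean entropy of the individual experts."""
    expected = sum(g * entropy(p) for g, p in zip(gate_weights, expert_probs))
    return predictive_entropy(expert_probs, gate_weights) - expected

def expert_variance(expert_probs, gate_weights):
    """Gate-weighted variance of class probabilities across experts, averaged over classes."""
    mean = mixture(expert_probs, gate_weights)
    n_classes = len(mean)
    total = sum(g * (p[c] - mean[c]) ** 2
                for g, p in zip(gate_weights, expert_probs)
                for c in range(n_classes))
    return total / n_classes

gates = [0.5, 0.5]                         # assumed gate output for one pixel
agree = [[0.9, 0.1], [0.9, 0.1]]           # both experts predict the same distribution
disagree = [[1.0, 0.0], [0.0, 1.0]]        # experts are confident but contradict each other

mi_agree = mutual_information(agree, gates)
mi_disagree = mutual_information(disagree, gates)
var_agree = expert_variance(agree, gates)
var_disagree = expert_variance(disagree, gates)
```

With these definitions, mutual information and expert variance are near zero when the experts agree and grow when they contradict each other, while predictive entropy also reflects the uncertainty shared by all experts.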

Robust Experts: the Effect of Adversarial Training on CNNs with Sparse Mixture-of-Experts Layers

Sep 05, 2025

Svetlana Pavlitska, Haixi Fan, Konstantin Ditschuneit, J. Marius Zöllner

Abstract:Robustifying convolutional neural networks (CNNs) against adversarial attacks remains challenging and often requires resource-intensive countermeasures. We explore the use of sparse mixture-of-experts (MoE) layers to improve robustness by replacing selected residual blocks or convolutional layers, thereby increasing model capacity without additional inference cost. On ResNet architectures trained on CIFAR-100, we find that inserting a single MoE layer in the deeper stages leads to consistent improvements in robustness under PGD and AutoPGD attacks when combined with adversarial training. Furthermore, we discover that when switch loss is used for balancing, it causes routing to collapse onto a small set of overused experts, thereby concentrating adversarial training on these paths and inadvertently making them more robust. As a result, some individual experts outperform the gated MoE model in robustness, suggesting that robust subpaths emerge through specialization. Our code is available at https://github.com/KASTEL-MobilityLab/robust-sparse-moes.

* Accepted for publication at the STREAM workshop at ICCV 2025

Fool the Stoplight: Realistic Adversarial Patch Attacks on Traffic Light Detectors

Jun 05, 2025

Svetlana Pavlitska, Jamie Robb, Nikolai Polley, Melih Yazgan, J. Marius Zöllner

Abstract:Realistic adversarial attacks on various camera-based perception tasks of autonomous vehicles have been successfully demonstrated so far. However, only a few works have considered attacks on traffic light detectors. This work shows how CNNs for traffic light detection can be attacked with printed patches. We propose a threat model, where each instance of a traffic light is attacked with a patch placed under it, and describe a training strategy. We demonstrate successful adversarial patch attacks in universal settings. Our experiments show realistic targeted red-to-green label-flipping attacks and attacks on pictogram classification. Finally, we perform a real-world evaluation with printed patches and demonstrate attacks in a lab setting with a mobile traffic light for construction sites and in a test area with stationary traffic lights. Our code is available at https://github.com/KASTEL-MobilityLab/attacks-on-traffic-light-detection.

* Accepted for publication at IV 2025

Towards Adversarial Robustness of Model-Level Mixture-of-Experts Architectures for Semantic Segmentation

Dec 16, 2024

Svetlana Pavlitska, Enrico Eisen, J. Marius Zöllner

Abstract:Vulnerability to adversarial attacks is a well-known deficiency of deep neural networks. Larger networks are generally more robust, and ensembling is one method to increase adversarial robustness: each model's weaknesses are compensated by the strengths of others. While an ensemble uses a deterministic rule to combine model outputs, a mixture of experts (MoE) includes an additional learnable gating component that predicts weights for the outputs of the expert models, thus determining their contributions to the final prediction. MoEs have been shown to outperform ensembles on specific tasks, yet their susceptibility to adversarial attacks has not been studied so far. In this work, we evaluate the adversarial vulnerability of MoEs for semantic segmentation of urban and highway traffic scenes. We show that MoEs are, in most cases, more robust to per-instance and universal white-box adversarial attacks and can better withstand transfer attacks. Our code is available at https://github.com/KASTEL-MobilityLab/mixtures-of-experts/.

* Accepted for publication at ICMLA 2024
