Avishai Weizman

YOLO Meets Mixture-of-Experts: Adaptive Expert Routing for Robust Object Detection

Nov 18, 2025

Ori Meiraz, Sharon Shalev, Avishai Weizman

Abstract: This paper presents a novel Mixture-of-Experts framework for object detection, incorporating adaptive routing among multiple YOLOv9-T experts to enable dynamic feature specialization and achieve higher mean Average Precision (mAP) and Average Recall (AR) than a single YOLOv9-T model.

* 1 figure, 1 table
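
The abstract does not say how the router combines the experts, so the following is only a minimal sketch of one common form of adaptive routing: softmax gate weights, optionally restricted to the top-k experts, blending per-expert feature vectors. The function names, the `top_k` parameter, and the toy vectors are assumptions made for illustration and are not taken from the paper.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route(router_logits, expert_outputs, top_k=None):
    """Blend per-expert feature vectors using softmax gate weights.

    router_logits: one score per expert (hypothetical router output).
    expert_outputs: one equal-length feature vector per expert.
    top_k: if set, keep only the k highest-weighted experts and renormalize.
    """
    weights = softmax(router_logits)
    if top_k is not None:
        keep = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)[:top_k]
        kept_mass = sum(weights[i] for i in keep)
        weights = [w / kept_mass if i in keep else 0.0 for i, w in enumerate(weights)]
    dim = len(expert_outputs[0])
    blended = [sum(w * out[d] for w, out in zip(weights, expert_outputs)) for d in range(dim)]
    return weights, blended

# Three toy experts, each emitting a 2-dimensional feature vector.
weights, blended = route([2.0, 0.5, -1.0], [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]], top_k=2)
```

With `top_k=2` the lowest-scoring expert receives zero weight and the remaining two weights are renormalized to sum to one.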

Recognition of Abnormal Events in Surveillance Videos using Weakly Supervised Dual-Encoder Models

Nov 17, 2025

Noam Tsfaty, Avishai Weizman, Liav Cohen, Moshe Tshuva, Yehudit Aperstein

Abstract: We address the challenge of detecting rare and diverse anomalies in surveillance videos using only video-level supervision. Our dual-backbone framework combines convolutional and transformer representations through top-k pooling, achieving 90.7% area under the curve (AUC) on the UCF-Crime dataset.

* 1 figure, 1 table
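
Top-k pooling is a standard way to turn per-segment anomaly scores into one video-level score when only video-level labels are available. The sketch below shows the generic idea, assuming each segment already has a score in [0, 1]. The function name, the choice of `k`, and the sample scores are illustrative, and the paper may apply the pooling to features rather than to scores.

```python
def top_k_pool(segment_scores, k):
    """Video-level score = mean of the k highest segment scores."""
    if not segment_scores:
        raise ValueError("segment_scores must not be empty")
    k = max(1, min(k, len(segment_scores)))  # clamp k to a valid range
    top = sorted(segment_scores, reverse=True)[:k]
    return sum(top) / k

# Made-up per-segment scores: only two segments of the second video look abnormal.
normal_video = [0.05, 0.10, 0.08, 0.12, 0.07, 0.09]
abnormal_video = [0.05, 0.10, 0.92, 0.88, 0.07, 0.09]

normal_score = top_k_pool(normal_video, k=2)
abnormal_score = top_k_pool(abnormal_video, k=2)
```

Averaging only the top-k segments keeps a short anomaly from being diluted by the many normal segments around it, which a plain mean over all segments would do.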

Find the Leak, Fix the Split: Cluster-Based Method to Prevent Leakage in Video-Derived Datasets

Nov 17, 2025

Noam Glazner, Noam Tsfaty, Sharon Shalev, Avishai Weizman

Abstract: We propose a cluster-based frame selection strategy to mitigate information leakage in video-derived frame datasets. By grouping visually similar frames before splitting into training, validation, and test sets, the method produces more representative, balanced, and reliable dataset partitions.

* 1 figure, 1 table
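
The core idea of grouping before splitting can be sketched as follows, assuming cluster labels for the frames have already been computed by some visual-similarity clustering step that is not shown here. The function name, the greedy assignment rule, and the 70/15/15 ratios are assumptions for illustration, not the authors' procedure.

```python
import random
from collections import defaultdict

def split_by_cluster(frame_clusters, ratios=(0.7, 0.15, 0.15), seed=0):
    """Assign whole clusters to train/val/test so similar frames stay together.

    frame_clusters: dict mapping frame id -> cluster id (assumed precomputed).
    """
    clusters = defaultdict(list)
    for frame, cluster in frame_clusters.items():
        clusters[cluster].append(frame)

    # Shuffle cluster order reproducibly, then place the largest clusters first.
    order = sorted(clusters)
    random.Random(seed).shuffle(order)
    order.sort(key=lambda c: len(clusters[c]), reverse=True)

    names = ("train", "val", "test")
    total = len(frame_clusters)
    targets = {n: r * total for n, r in zip(names, ratios)}
    splits = {n: [] for n in names}
    for c in order:
        # Put the cluster into the split that is furthest below its target size.
        name = max(names, key=lambda n: targets[n] - len(splits[n]))
        splits[name].extend(clusters[c])
    return splits

# Toy data: 20 frames in 10 clusters of 2 visually similar frames each.
frame_clusters = {f"f{i}": i // 2 for i in range(20)}
splits = split_by_cluster(frame_clusters)
```

Because every cluster is assigned as a unit, no two frames from the same cluster can end up on opposite sides of a train/test boundary, which is the leakage a random per-frame split would allow.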

ASVspoof2019 vs. ASVspoof5: Assessment and Comparison

May 21, 2025

Avishai Weizman, Yehuda Ben-Shimol, Itshak Lapidot

Abstract: ASVspoof challenges are designed to advance the understanding of spoofing speech attacks and encourage the development of robust countermeasure systems. These challenges provide a standardized database for assessing and comparing spoofing-robust automatic speaker verification solutions. The ASVspoof5 challenge introduces a shift in database conditions compared to ASVspoof2019. While ASVspoof2019 has mismatched conditions only in spoofing attacks in the evaluation set, ASVspoof5 incorporates mismatches in both bona fide and spoofed speech statistics. This paper examines the impact of these mismatches, presenting qualitative and quantitative comparisons within and between the two databases. We show the increased difficulty for genuine and spoofed speech and demonstrate that in ASVspoof5, not only are the attacks more challenging, but the genuine speech also shifts toward spoofed speech compared to ASVspoof2019.

* 5 pages, 3 figures. Accepted to Interspeech 2025 Conference

Tandem spoofing-robust automatic speaker verification based on time-domain embeddings

Dec 22, 2024

Avishai Weizman, Yehuda Ben-Shimol, Itshak Lapidot

Abstract: Spoofing-robust automatic speaker verification (SASV) systems are a crucial technology for the protection against spoofed speech. In this study, we focus on logical access attacks and introduce a novel approach to SASV tasks. A novel representation of genuine and spoofed speech is employed, based on the probability mass function (PMF) of waveform amplitudes in the time domain. This methodology generates novel time embeddings derived from the PMF of selected groups within the training set. This paper highlights the role of gender segregation and its positive impact on performance. We propose a countermeasure (CM) system that employs time-domain embeddings derived from the PMF of spoofed and genuine speech, as well as gender recognition based on male and female time-based embeddings. The method exhibits notable gender recognition capabilities, with mismatch rates of 0.94% and 1.79% for males and females, respectively. The male and female CM systems achieve an equal error rate (EER) of 8.67% and 10.12%, respectively. By integrating this approach with traditional speaker verification systems, we demonstrate improved generalization ability and tandem detection cost function evaluation using the ASVspoof2019 challenge database. Furthermore, we investigate the impact of fusing the time embedding approach with traditional CM and illustrate how this fusion enhances generalization in SASV architectures.

* 11 pages, 8 figures
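
The basic building block named in the abstract, a probability mass function over waveform amplitudes, can be estimated with a normalized histogram. The sketch below assumes samples scaled to [-1.0, 1.0] and an arbitrary bin count. The function name and both of those choices are assumptions, and the abstract does not specify how the paper derives its embeddings from the PMFs, so that step is not shown.

```python
import math

def amplitude_pmf(samples, n_bins=16):
    """Estimate the PMF of waveform amplitudes assumed to lie in [-1.0, 1.0]."""
    if not samples:
        raise ValueError("samples must not be empty")
    counts = [0] * n_bins
    for x in samples:
        x = max(-1.0, min(1.0, x))  # clip out-of-range samples
        # Map [-1, 1] onto bin indices 0..n_bins-1; x == 1.0 goes in the last bin.
        idx = min(int((x + 1.0) / 2.0 * n_bins), n_bins - 1)
        counts[idx] += 1
    return [c / len(samples) for c in counts]

# Synthetic half-amplitude sine wave standing in for a speech waveform.
waveform = [0.5 * math.sin(2 * math.pi * 5 * t / 200) for t in range(200)]
pmf = amplitude_pmf(waveform, n_bins=8)
```

The result is a fixed-length vector that sums to one regardless of utterance duration, which is what makes an amplitude PMF usable as a time-domain descriptor.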
