Marcel Zalmanovici

PACIFIC: a framework for generating benchmarks to check Precise Automatically Checked Instruction Following In Code

Dec 22, 2025

Exploring Straightforward Conversational Red-Teaming

Sep 07, 2024

Generating Unseen Code Tests In Infinitum

Jul 29, 2024

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

Mar 09, 2024

Unveiling Safety Vulnerabilities of Large Language Models

Nov 07, 2023

Classifier Data Quality: A Geometric Complexity Based Method for Automated Baseline And Insights Generation

Dec 22, 2021

Automatically detecting data drift in machine learning classifiers

Nov 10, 2021

Density-based interpretable hypercube region partitioning for mixed numeric and categorical data

Nov 08, 2021

FreaAI: Automated extraction of data slices to test machine learning models

Aug 12, 2021

Machine Learning Model Drift Detection Via Weak Data Slices

Aug 11, 2021