Picture for Omer Nevo

Omer Nevo

A Content-Based Framework for Cybersecurity Refusal Decisions in Large Language Models

Add code
Feb 17, 2026
Viaarxiv icon

Leveraging LLM Inconsistency to Boost Pass@k Performance

Add code
May 19, 2025
Viaarxiv icon

What Makes an Evaluation Useful? Common Pitfalls and Best Practices

Add code
Mar 30, 2025
Figure 1 for What Makes an Evaluation Useful? Common Pitfalls and Best Practices
Figure 2 for What Makes an Evaluation Useful? Common Pitfalls and Best Practices
Viaarxiv icon