Picture for Meet Udeshi

Meet Udeshi

CyberExplorer: Benchmarking LLM Offensive Security Capabilities in a Real-World Attacking Simulation Environment

Add code
Feb 08, 2026
Viaarxiv icon

CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution

Add code
May 21, 2025
Figure 1 for CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution
Figure 2 for CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution
Figure 3 for CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution
Figure 4 for CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution
Viaarxiv icon

EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges

Add code
Sep 24, 2024
Figure 1 for EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
Figure 2 for EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
Figure 3 for EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
Figure 4 for EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges
Viaarxiv icon

NYU CTF Dataset: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security

Add code
Jun 08, 2024
Figure 1 for NYU CTF Dataset: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security
Figure 2 for NYU CTF Dataset: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security
Figure 3 for NYU CTF Dataset: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security
Figure 4 for NYU CTF Dataset: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security
Viaarxiv icon