Picture for Norbert Tihanyi

Norbert Tihanyi

DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response

Add code
May 26, 2025
Viaarxiv icon

From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review

Add code
Apr 28, 2025
Viaarxiv icon

Vulnerability Detection: From Formal Verification to Large Language Models and Hybrid Approaches: A Comprehensive Overview

Add code
Mar 13, 2025
Viaarxiv icon

CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection

Add code
Mar 12, 2025
Viaarxiv icon

Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence

Add code
Oct 20, 2024
Figure 1 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 2 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 3 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 4 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Viaarxiv icon

Generative AI and Large Language Models for Cyber Security: All Insights You Need

Add code
May 21, 2024
Figure 1 for Generative AI and Large Language Models for Cyber Security: All Insights You Need
Figure 2 for Generative AI and Large Language Models for Cyber Security: All Insights You Need
Figure 3 for Generative AI and Large Language Models for Cyber Security: All Insights You Need
Figure 4 for Generative AI and Large Language Models for Cyber Security: All Insights You Need
Viaarxiv icon

Do Neutral Prompts Produce Insecure Code? FormAI-v2 Dataset: Labelling Vulnerabilities in Code Generated by Large Language Models

Add code
Apr 29, 2024
Viaarxiv icon

CyberMetric: A Benchmark Dataset for Evaluating Large Language Models Knowledge in Cybersecurity

Add code
Feb 12, 2024
Viaarxiv icon

SecureFalcon: The Next Cyber Reasoning System for Cyber Security

Add code
Jul 13, 2023
Viaarxiv icon

The FormAI Dataset: Generative AI in Software Security Through the Lens of Formal Verification

Add code
Jul 05, 2023
Viaarxiv icon