Picture for Martin Vechev

Martin Vechev

MathArena: Evaluating LLMs on Uncontaminated Math Competitions

Add code
May 29, 2025
Viaarxiv icon

Mind the Gap: A Practical Attack on GGUF Quantization

Add code
May 24, 2025
Viaarxiv icon

Finetuning-Activated Backdoors in LLMs

Add code
May 22, 2025
Viaarxiv icon

MixAT: Combining Continuous and Discrete Adversarial Training for LLMs

Add code
May 22, 2025
Viaarxiv icon

Robust LLM Fingerprinting via Domain-Specific Watermarks

Add code
May 22, 2025
Viaarxiv icon

Type-Constrained Code Generation with Language Models

Add code
Apr 12, 2025
Viaarxiv icon

Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad

Add code
Mar 27, 2025
Viaarxiv icon

Automated Benchmark Generation for Repository-Level Coding Tasks

Add code
Mar 10, 2025
Viaarxiv icon

ToolFuzz -- Automated Agent Tool Testing

Add code
Mar 06, 2025
Viaarxiv icon

GRAIN: Exact Graph Reconstruction from Gradients

Add code
Mar 03, 2025
Viaarxiv icon