Picture for Yarin Gal

Yarin Gal

Selective Safety Steering via Value-Filtered Decoding

Add code
May 14, 2026
Viaarxiv icon

Training Transformers for KV Cache Compressibility

Add code
May 07, 2026
Viaarxiv icon

Uncertainty Quantification for LLM Function-Calling

Add code
Apr 24, 2026
Viaarxiv icon

Simple Baselines are Competitive with Code Evolution

Add code
Feb 18, 2026
Viaarxiv icon

Boundary Point Jailbreaking of Black-Box LLMs

Add code
Feb 16, 2026
Viaarxiv icon

OMNI-LEAK: Orchestrator Multi-Agent Network Induced Data Leakage

Add code
Feb 13, 2026
Viaarxiv icon

Richer Bayesian Last Layers with Subsampled NTK Features

Add code
Feb 01, 2026
Viaarxiv icon

MADE: Benchmark Environments for Closed-Loop Materials Discovery

Add code
Jan 28, 2026
Viaarxiv icon

Iterative Deployment Improves Planning Skills in LLMs

Add code
Dec 31, 2025
Viaarxiv icon

Evaluating & Reducing Deceptive Dialogue From Language Models with Multi-turn RL

Add code
Oct 16, 2025
Viaarxiv icon