Picture for Yarin Gal

Yarin Gal

Simple Baselines are Competitive with Code Evolution

Add code
Feb 18, 2026
Viaarxiv icon

Boundary Point Jailbreaking of Black-Box LLMs

Add code
Feb 16, 2026
Viaarxiv icon

OMNI-LEAK: Orchestrator Multi-Agent Network Induced Data Leakage

Add code
Feb 13, 2026
Viaarxiv icon

Richer Bayesian Last Layers with Subsampled NTK Features

Add code
Feb 01, 2026
Viaarxiv icon

MADE: Benchmark Environments for Closed-Loop Materials Discovery

Add code
Jan 28, 2026
Viaarxiv icon

Iterative Deployment Improves Planning Skills in LLMs

Add code
Dec 31, 2025
Viaarxiv icon

Evaluating & Reducing Deceptive Dialogue From Language Models with Multi-turn RL

Add code
Oct 16, 2025
Viaarxiv icon

Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples

Add code
Oct 08, 2025
Viaarxiv icon

Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning

Add code
Oct 01, 2025
Viaarxiv icon

Scaling Up Active Testing to Large Language Models

Add code
Aug 12, 2025
Viaarxiv icon