Picture for Jonas Geiping

Jonas Geiping

Models That Know How Evaluations Are Designed Score Safer

Add code
May 27, 2026
Viaarxiv icon

FutureSim: Replaying World Events to Evaluate Adaptive Agents

Add code
May 14, 2026
Viaarxiv icon

Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs

Add code
May 12, 2026
Viaarxiv icon

Efficient Test-Time Inference via Deterministic Exploration of Truncated Decoding Trees

Add code
Apr 22, 2026
Viaarxiv icon

Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs

Add code
Mar 25, 2026
Viaarxiv icon

Scaling Open-Ended Reasoning to Predict the Future

Add code
Dec 31, 2025
Viaarxiv icon

Training AI Co-Scientists Using Rubric Rewards

Add code
Dec 29, 2025
Viaarxiv icon

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Add code
Nov 10, 2025
Viaarxiv icon

Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models

Add code
Oct 16, 2025
Figure 1 for Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models
Figure 2 for Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models
Figure 3 for Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models
Figure 4 for Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models
Viaarxiv icon

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Add code
Oct 10, 2025
Figure 1 for Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
Figure 2 for Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
Figure 3 for Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
Figure 4 for Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
Viaarxiv icon