Picture for Jan Hendrik Kirchner

Jan Hendrik Kirchner

Prover-Verifier Games improve legibility of LLM outputs

Add code
Jul 18, 2024
Viaarxiv icon

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Add code
Dec 14, 2023
Viaarxiv icon