Picture for Lennart Stöpler

Lennart Stöpler

Towards Developmentally Plausible Rewards: Communicative Success as a Learning Signal for Interactive Language Models

Add code
May 09, 2025
Viaarxiv icon