Rufat Asadli

Towards Developmentally Plausible Rewards: Communicative Success as a Learning Signal for Interactive Language Models

May 09, 2025
Via arXiv