Picture for Tyler Marques

Tyler Marques

Data-Centric Interpretability for LLM-based Multi-Agent Reinforcement Learning

Add code
Feb 05, 2026
Viaarxiv icon

Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy

Add code
Aug 10, 2025
Figure 1 for Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy
Figure 2 for Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy
Figure 3 for Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy
Figure 4 for Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy
Viaarxiv icon