Picture for Matija Franklin

Matija Franklin

Architecting Trust in Artificial Epistemic Agents

Add code
Mar 03, 2026
Viaarxiv icon

Intelligent AI Delegation

Add code
Feb 12, 2026
Viaarxiv icon

Distributional AGI Safety

Add code
Dec 18, 2025
Viaarxiv icon

Multi-Agent Risks from Advanced AI

Add code
Feb 19, 2025
Figure 1 for Multi-Agent Risks from Advanced AI
Figure 2 for Multi-Agent Risks from Advanced AI
Figure 3 for Multi-Agent Risks from Advanced AI
Figure 4 for Multi-Agent Risks from Advanced AI
Viaarxiv icon

Model-Free RL Agents Demonstrate System 1-Like Intentionality

Add code
Jan 30, 2025
Viaarxiv icon

AI Governance through Markets

Add code
Jan 29, 2025
Viaarxiv icon

LMUnit: Fine-grained Evaluation with Natural Language Unit Tests

Add code
Dec 17, 2024
Viaarxiv icon

Beyond Preferences in AI Alignment

Add code
Aug 30, 2024
Figure 1 for Beyond Preferences in AI Alignment
Figure 2 for Beyond Preferences in AI Alignment
Figure 3 for Beyond Preferences in AI Alignment
Figure 4 for Beyond Preferences in AI Alignment
Viaarxiv icon

A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI

Add code
Apr 23, 2024
Figure 1 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Figure 2 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Figure 3 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Figure 4 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Viaarxiv icon

An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI

Add code
Nov 06, 2023
Figure 1 for An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI
Figure 2 for An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI
Figure 3 for An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI
Figure 4 for An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI
Viaarxiv icon