Picture for Matija Franklin

Matija Franklin

Positive Alignment: Artificial Intelligence for Human Flourishing

Add code
May 11, 2026
Viaarxiv icon

Architecting Trust in Artificial Epistemic Agents

Add code
Mar 03, 2026
Viaarxiv icon

Intelligent AI Delegation

Add code
Feb 12, 2026
Viaarxiv icon

Distributional AGI Safety

Add code
Dec 18, 2025
Viaarxiv icon

Multi-Agent Risks from Advanced AI

Add code
Feb 19, 2025
Figure 1 for Multi-Agent Risks from Advanced AI
Figure 2 for Multi-Agent Risks from Advanced AI
Figure 3 for Multi-Agent Risks from Advanced AI
Figure 4 for Multi-Agent Risks from Advanced AI
Viaarxiv icon

Model-Free RL Agents Demonstrate System 1-Like Intentionality

Add code
Jan 30, 2025
Viaarxiv icon

AI Governance through Markets

Add code
Jan 29, 2025
Viaarxiv icon

LMUnit: Fine-grained Evaluation with Natural Language Unit Tests

Add code
Dec 17, 2024
Viaarxiv icon

Beyond Preferences in AI Alignment

Add code
Aug 30, 2024
Figure 1 for Beyond Preferences in AI Alignment
Figure 2 for Beyond Preferences in AI Alignment
Figure 3 for Beyond Preferences in AI Alignment
Figure 4 for Beyond Preferences in AI Alignment
Viaarxiv icon

A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI

Add code
Apr 23, 2024
Figure 1 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Figure 2 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Figure 3 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Figure 4 for A Mechanism-Based Approach to Mitigating Harms from Persuasive Generative AI
Viaarxiv icon