Picture for Justin Svegliato

Justin Svegliato

University of Massachusetts Amherst

MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools

Add code
Apr 28, 2025
Viaarxiv icon

AssistanceZero: Scalably Solving Assistance Games

Add code
Apr 09, 2025
Viaarxiv icon

A StrongREJECT for Empty Jailbreaks

Add code
Feb 15, 2024
Viaarxiv icon

Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game

Add code
Nov 02, 2023
Viaarxiv icon

Active teacher selection for reinforcement learning from human feedback

Add code
Oct 23, 2023
Figure 1 for Active teacher selection for reinforcement learning from human feedback
Figure 2 for Active teacher selection for reinforcement learning from human feedback
Figure 3 for Active teacher selection for reinforcement learning from human feedback
Figure 4 for Active teacher selection for reinforcement learning from human feedback
Viaarxiv icon

Active Reward Learning from Multiple Teachers

Add code
Mar 02, 2023
Viaarxiv icon

Fairness and Sequential Decision Making: Limits, Lessons, and Opportunities

Add code
Jan 13, 2023
Viaarxiv icon

Agent-aware State Estimation in Autonomous Vehicles

Add code
Aug 01, 2021
Figure 1 for Agent-aware State Estimation in Autonomous Vehicles
Figure 2 for Agent-aware State Estimation in Autonomous Vehicles
Figure 3 for Agent-aware State Estimation in Autonomous Vehicles
Figure 4 for Agent-aware State Estimation in Autonomous Vehicles
Viaarxiv icon

Improving Competence for Reliable Autonomy

Add code
Jul 23, 2020
Figure 1 for Improving Competence for Reliable Autonomy
Figure 2 for Improving Competence for Reliable Autonomy
Figure 3 for Improving Competence for Reliable Autonomy
Viaarxiv icon

Learning to Optimize Autonomy in Competence-Aware Systems

Add code
Mar 17, 2020
Figure 1 for Learning to Optimize Autonomy in Competence-Aware Systems
Figure 2 for Learning to Optimize Autonomy in Competence-Aware Systems
Figure 3 for Learning to Optimize Autonomy in Competence-Aware Systems
Figure 4 for Learning to Optimize Autonomy in Competence-Aware Systems
Viaarxiv icon