Picture for Brian Hu

Brian Hu

Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping

Add code
Nov 17, 2025
Figure 1 for Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping
Figure 2 for Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping
Figure 3 for Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping
Figure 4 for Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping
Viaarxiv icon

Steerable Pluralism: Pluralistic Alignment via Few-Shot Comparative Regression

Add code
Aug 11, 2025
Viaarxiv icon

Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain

Add code
Jun 10, 2024
Figure 1 for Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain
Figure 2 for Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain
Figure 3 for Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain
Figure 4 for Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain
Viaarxiv icon

Convolutional neural networks with extra-classical receptive fields

Add code
Oct 27, 2018
Figure 1 for Convolutional neural networks with extra-classical receptive fields
Figure 2 for Convolutional neural networks with extra-classical receptive fields
Figure 3 for Convolutional neural networks with extra-classical receptive fields
Figure 4 for Convolutional neural networks with extra-classical receptive fields
Viaarxiv icon