Picture for Shun Shao

Shun Shao

Iterative Multilingual Spectral Attribute Erasure

Add code
Jun 12, 2025
Viaarxiv icon

Sparse Activation Editing for Reliable Instruction Following in Narratives

Add code
May 22, 2025
Viaarxiv icon

MIB: A Mechanistic Interpretability Benchmark

Add code
Apr 17, 2025
Viaarxiv icon

Beyond Voice Assistants: Exploring Advantages and Risks of an In-Car Social Robot in Real Driving Scenarios

Add code
Feb 20, 2024
Figure 1 for Beyond Voice Assistants: Exploring Advantages and Risks of an In-Car Social Robot in Real Driving Scenarios
Figure 2 for Beyond Voice Assistants: Exploring Advantages and Risks of an In-Car Social Robot in Real Driving Scenarios
Figure 3 for Beyond Voice Assistants: Exploring Advantages and Risks of an In-Car Social Robot in Real Driving Scenarios
Figure 4 for Beyond Voice Assistants: Exploring Advantages and Risks of an In-Car Social Robot in Real Driving Scenarios
Viaarxiv icon

Erasure of Unaligned Attributes from Neural Representations

Add code
Feb 06, 2023
Viaarxiv icon

Gold Doesn't Always Glitter: Spectral Removal of Linear and Nonlinear Guarded Attribute Information

Add code
Mar 15, 2022
Figure 1 for Gold Doesn't Always Glitter: Spectral Removal of Linear and Nonlinear Guarded Attribute Information
Figure 2 for Gold Doesn't Always Glitter: Spectral Removal of Linear and Nonlinear Guarded Attribute Information
Figure 3 for Gold Doesn't Always Glitter: Spectral Removal of Linear and Nonlinear Guarded Attribute Information
Figure 4 for Gold Doesn't Always Glitter: Spectral Removal of Linear and Nonlinear Guarded Attribute Information
Viaarxiv icon