
Aryaman Arora

ReFT: Representation Finetuning for Language Models

Apr 08, 2024
Zhengxuan Wu, Aryaman Arora, Zheng Wang, Atticus Geiger, Dan Jurafsky, Christopher D. Manning, Christopher Potts

Via arXiv

pyvene: A Library for Understanding and Improving PyTorch Models via Interventions

Mar 12, 2024
Zhengxuan Wu, Atticus Geiger, Aryaman Arora, Jing Huang, Zheng Wang, Noah D. Goodman, Christopher D. Manning, Christopher Potts

Via arXiv

CausalGym: Benchmarking causal interpretability methods on linguistic tasks

Feb 19, 2024
Aryaman Arora, Dan Jurafsky, Christopher Potts

Via arXiv

Predicting positive transfer for improved low-resource speech recognition using acoustic pseudo-tokens

Feb 03, 2024
Nay San, Georgios Paraskevopoulos, Aryaman Arora, Xiluo He, Prabhjot Kaur, Oliver Adams, Dan Jurafsky

Via arXiv

A Reply to Makelov et al. (2023)'s "Interpretability Illusion" Arguments

Jan 23, 2024
Zhengxuan Wu, Atticus Geiger, Jing Huang, Aryaman Arora, Thomas Icard, Christopher Potts, Noah D. Goodman

Via arXiv

IruMozhi: Automatically classifying diglossia in Tamil

Nov 13, 2023
Kabilan Prasanna, Aryaman Arora

Via arXiv

Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP

Aug 27, 2023
Vedant Palit, Rohan Pandey, Aryaman Arora, Paul Pu Liang

Via arXiv

Jambu: A historical linguistic database for South Asian languages

Jun 05, 2023
Aryaman Arora, Adam Farris, Samopriya Basu, Suresh Kolichala

Via arXiv

CGELBank Annotation Manual v1.0

May 27, 2023
Brett Reynolds, Nathan Schneider, Aryaman Arora

Via arXiv

Localizing Model Behavior with Path Patching

Apr 12, 2023
Nicholas Goldowsky-Dill, Chris MacLeod, Lucas Sato, Aryaman Arora

Via arXiv