Hassan Sajjad

Data-centric Prediction Explanation via Kernelized Stein Discrepancy

Mar 22, 2024
Mahtab Sarvmaili, Hassan Sajjad, Ga Wu

Via arXiv

Immunization against harmful fine-tuning attacks

Feb 26, 2024
Domenic Rosati, Jan Wehner, Kai Williams, Łukasz Bartoszcze, Jan Batzner, Hassan Sajjad, Frank Rudzicz

Via arXiv

Long-form evaluation of model editing

Feb 14, 2024
Domenic Rosati, Robie Gonzales, Jinkun Chen, Xuemin Yu, Melis Erkan, Yahya Kayani, Satya Deepika Chavatapalli, Frank Rudzicz, Hassan Sajjad

Via arXiv

Multilingual Nonce Dependency Treebanks: Understanding how LLMs represent and process syntactic structure

Nov 13, 2023
David Arps, Laura Kallmeyer, Younes Samih, Hassan Sajjad

Via arXiv

NeuroX Library for Neuron Analysis of Deep NLP Models

May 26, 2023
Fahim Dalvi, Hassan Sajjad, Nadir Durrani

Via arXiv

NxPlain: Web-based Tool for Discovery of Latent Concepts

Mar 06, 2023
Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Tamim Jaban, Musab Husaini, Ummar Abbas

Via arXiv

Evaluating Neuron Interpretation Methods of NLP Models

Jan 30, 2023
Yimin Fan, Fahim Dalvi, Nadir Durrani, Hassan Sajjad

Via arXiv

ConceptX: A Framework for Latent Concept Analysis

Nov 12, 2022
Firoj Alam, Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Abdul Rafae Khan, Jia Xu

Via arXiv

Impact of Adversarial Training on Robustness and Generalizability of Language Models

Nov 10, 2022
Enes Altinisik, Hassan Sajjad, Husrev Taha Sencar, Safa Messaoud, Sanjay Chawla

Via arXiv

On the Transformation of Latent Space in Fine-Tuned NLP Models

Oct 23, 2022
Nadir Durrani, Hassan Sajjad, Fahim Dalvi, Firoj Alam

Via arXiv