Picture for Lijie Hu

Lijie Hu

Private Language Models via Truncated Laplacian Mechanism

Add code
Oct 10, 2024
Figure 1 for Private Language Models via Truncated Laplacian Mechanism
Figure 2 for Private Language Models via Truncated Laplacian Mechanism
Figure 3 for Private Language Models via Truncated Laplacian Mechanism
Figure 4 for Private Language Models via Truncated Laplacian Mechanism
Viaarxiv icon

Faithful Interpretation for Graph Neural Networks

Add code
Oct 09, 2024
Viaarxiv icon

Dissecting Fine-Tuning Unlearning in Large Language Models

Add code
Oct 09, 2024
Viaarxiv icon

Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing

Add code
Oct 08, 2024
Viaarxiv icon

What makes your model a low-empathy or warmth person: Exploring the Origins of Personality in LLMs

Add code
Oct 07, 2024
Viaarxiv icon

CAT: Concept-level backdoor ATtacks for Concept Bottleneck Models

Add code
Oct 07, 2024
Viaarxiv icon

Understanding Reasoning in Chain-of-Thought from the Hopfieldian View

Add code
Oct 04, 2024
Viaarxiv icon

Backdooring Vision-Language Models with Out-Of-Distribution Data

Add code
Oct 02, 2024
Viaarxiv icon

DRIVE: Dependable Robust Interpretable Visionary Ensemble Framework in Autonomous Driving

Add code
Sep 16, 2024
Figure 1 for DRIVE: Dependable Robust Interpretable Visionary Ensemble Framework in Autonomous Driving
Figure 2 for DRIVE: Dependable Robust Interpretable Visionary Ensemble Framework in Autonomous Driving
Figure 3 for DRIVE: Dependable Robust Interpretable Visionary Ensemble Framework in Autonomous Driving
Figure 4 for DRIVE: Dependable Robust Interpretable Visionary Ensemble Framework in Autonomous Driving
Viaarxiv icon

Semi-supervised Concept Bottleneck Models

Add code
Jun 27, 2024
Viaarxiv icon