Picture for Dong Shu

Dong Shu

Improving LLM Reasoning through Interpretable Role-Playing Steering

Add code
Jun 09, 2025
Viaarxiv icon

Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders

Add code
May 12, 2025
Viaarxiv icon

A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models

Add code
Mar 07, 2025
Viaarxiv icon

Large Vision-Language Model Alignment and Misalignment: A Survey Through the Lens of Explainability

Add code
Jan 02, 2025
Viaarxiv icon

Target-driven Attack for Large Language Models

Add code
Nov 13, 2024
Viaarxiv icon

Exploring the Alignment Landscape: LLMs and Geometric Deep Models in Protein Representation

Add code
Nov 08, 2024
Figure 1 for Exploring the Alignment Landscape: LLMs and Geometric Deep Models in Protein Representation
Figure 2 for Exploring the Alignment Landscape: LLMs and Geometric Deep Models in Protein Representation
Figure 3 for Exploring the Alignment Landscape: LLMs and Geometric Deep Models in Protein Representation
Figure 4 for Exploring the Alignment Landscape: LLMs and Geometric Deep Models in Protein Representation
Viaarxiv icon

Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning

Add code
Oct 30, 2024
Viaarxiv icon

LawLLM: Law Large Language Model for the US Legal System

Add code
Jul 27, 2024
Viaarxiv icon

Knowledge Graph Large Language Model for Link Prediction

Add code
Mar 19, 2024
Viaarxiv icon

Generative Models and Connected and Automated Vehicles: A Survey in Exploring the Intersection of Transportation and AI

Add code
Mar 14, 2024
Viaarxiv icon