
Xuansheng Wu

Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering

May 21, 2025
Via arXiv

Artificial Intelligence Bias on English Language Learners in Automatic Scoring

May 15, 2025
Via arXiv

Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders

May 12, 2025
Via arXiv

A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models

Mar 07, 2025
Via arXiv

Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders

Feb 21, 2025
Via arXiv

Self-Regularization with Latent Space Explanations for Controllable LLM-based Classification

Feb 19, 2025
Via arXiv

LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models

Oct 02, 2024
Via arXiv

Retrieval-Enhanced Knowledge Editing for Multi-Hop Question Answering in Language Models

Mar 28, 2024
Via arXiv

Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era

Mar 13, 2024
Via arXiv

InFoBench: Evaluating Instruction Following Ability in Large Language Models

Jan 07, 2024
Via arXiv