Picture for Ninghao Liu

Ninghao Liu

Automating Expert-Level Medical Reasoning Evaluation of Large Language Models

Add code
Jul 10, 2025
Viaarxiv icon

RadFabric: Agentic AI System with Reasoning Capability for Radiology

Add code
Jun 17, 2025
Viaarxiv icon

Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering

Add code
May 21, 2025
Viaarxiv icon

Artificial Intelligence Bias on English Language Learners in Automatic Scoring

Add code
May 15, 2025
Viaarxiv icon

Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders

Add code
May 12, 2025
Viaarxiv icon

Towards Trustworthy GUI Agents: A Survey

Add code
Mar 30, 2025
Viaarxiv icon

A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models

Add code
Mar 07, 2025
Viaarxiv icon

Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders

Add code
Feb 21, 2025
Figure 1 for Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders
Figure 2 for Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders
Figure 3 for Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders
Figure 4 for Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders
Viaarxiv icon

Self-Regularization with Latent Space Explanations for Controllable LLM-based Classification

Add code
Feb 19, 2025
Viaarxiv icon

Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data

Add code
Feb 19, 2025
Viaarxiv icon