Ge Yan

Lily

Distance Marching for Generative Modeling

Feb 03, 2026
Via arXiv

Faithful and Stable Neuron Explanations for Trustworthy Mechanistic Interpretability

Dec 19, 2025
Via arXiv

ReflCtrl: Controlling LLM Reflection via Representation Engineering

Dec 16, 2025
Via arXiv

Sample-efficient quantum error mitigation via classical learning surrogates

Nov 10, 2025
Via arXiv

SciDA: Scientific Dynamic Assessor of LLMs

Jun 15, 2025
Via arXiv

Rethinking Crowd-Sourced Evaluation of Neuron Explanations

Jun 09, 2025
Via arXiv

Evaluating Neuron Explanations: A Unified Framework with Sanity Checks

Jun 06, 2025
Via arXiv

ThinkEdit: Interpretable Weight Editing to Mitigate Overly Short Thinking in Reasoning Models

Mar 27, 2025
Via arXiv

Interpretable Generative Models through Post-hoc Concept Bottlenecks

Mar 25, 2025
Via arXiv

RAT: Boosting Misclassification Detection Ability without Extra Data

Mar 18, 2025
Via arXiv