Picture for Yifan Luo

Yifan Luo

From Atoms to Trees: Building a Structured Feature Forest with Hierarchical Sparse Autoencoders

Add code
Feb 12, 2026
Viaarxiv icon

InverseScope: Scalable Activation Inversion for Interpreting Large Language Models

Add code
Jun 09, 2025
Figure 1 for InverseScope: Scalable Activation Inversion for Interpreting Large Language Models
Figure 2 for InverseScope: Scalable Activation Inversion for Interpreting Large Language Models
Figure 3 for InverseScope: Scalable Activation Inversion for Interpreting Large Language Models
Figure 4 for InverseScope: Scalable Activation Inversion for Interpreting Large Language Models
Viaarxiv icon

Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting

Add code
Oct 14, 2024
Figure 1 for Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting
Figure 2 for Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting
Figure 3 for Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting
Figure 4 for Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting
Viaarxiv icon

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework

Add code
Aug 02, 2024
Figure 1 for RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
Figure 2 for RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
Figure 3 for RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
Figure 4 for RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
Viaarxiv icon

AutoMathText: Autonomous Data Selection with Language Models for Mathematical Texts

Add code
Feb 12, 2024
Viaarxiv icon

Augmenting Math Word Problems via Iterative Question Composing

Add code
Jan 30, 2024
Figure 1 for Augmenting Math Word Problems via Iterative Question Composing
Figure 2 for Augmenting Math Word Problems via Iterative Question Composing
Figure 3 for Augmenting Math Word Problems via Iterative Question Composing
Figure 4 for Augmenting Math Word Problems via Iterative Question Composing
Viaarxiv icon

Prompt Engineering Through the Lens of Optimal Control

Add code
Nov 03, 2023
Figure 1 for Prompt Engineering Through the Lens of Optimal Control
Figure 2 for Prompt Engineering Through the Lens of Optimal Control
Figure 3 for Prompt Engineering Through the Lens of Optimal Control
Viaarxiv icon

GSLB: The Graph Structure Learning Benchmark

Add code
Oct 08, 2023
Viaarxiv icon

Won't Get Fooled Again: Answering Questions with False Premises

Add code
Jul 05, 2023
Figure 1 for Won't Get Fooled Again: Answering Questions with False Premises
Figure 2 for Won't Get Fooled Again: Answering Questions with False Premises
Figure 3 for Won't Get Fooled Again: Answering Questions with False Premises
Figure 4 for Won't Get Fooled Again: Answering Questions with False Premises
Viaarxiv icon

Double Descent of Discrepancy: A Task-, Data-, and Model-Agnostic Phenomenon

Add code
May 25, 2023
Viaarxiv icon