Picture for Yifan Luo

Yifan Luo

InverseScope: Scalable Activation Inversion for Interpreting Large Language Models

Add code
Jun 09, 2025
Viaarxiv icon

Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting

Add code
Oct 14, 2024
Figure 1 for Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting
Figure 2 for Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting
Figure 3 for Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting
Figure 4 for Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting
Viaarxiv icon

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework

Add code
Aug 02, 2024
Figure 1 for RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
Figure 2 for RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
Figure 3 for RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
Figure 4 for RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework
Viaarxiv icon

AutoMathText: Autonomous Data Selection with Language Models for Mathematical Texts

Add code
Feb 12, 2024
Viaarxiv icon

Augmenting Math Word Problems via Iterative Question Composing

Add code
Jan 30, 2024
Figure 1 for Augmenting Math Word Problems via Iterative Question Composing
Figure 2 for Augmenting Math Word Problems via Iterative Question Composing
Figure 3 for Augmenting Math Word Problems via Iterative Question Composing
Figure 4 for Augmenting Math Word Problems via Iterative Question Composing
Viaarxiv icon

Prompt Engineering Through the Lens of Optimal Control

Add code
Nov 03, 2023
Figure 1 for Prompt Engineering Through the Lens of Optimal Control
Figure 2 for Prompt Engineering Through the Lens of Optimal Control
Figure 3 for Prompt Engineering Through the Lens of Optimal Control
Viaarxiv icon

GSLB: The Graph Structure Learning Benchmark

Add code
Oct 08, 2023
Viaarxiv icon

Won't Get Fooled Again: Answering Questions with False Premises

Add code
Jul 05, 2023
Figure 1 for Won't Get Fooled Again: Answering Questions with False Premises
Figure 2 for Won't Get Fooled Again: Answering Questions with False Premises
Figure 3 for Won't Get Fooled Again: Answering Questions with False Premises
Figure 4 for Won't Get Fooled Again: Answering Questions with False Premises
Viaarxiv icon

Double Descent of Discrepancy: A Task-, Data-, and Model-Agnostic Phenomenon

Add code
May 25, 2023
Viaarxiv icon

A Person Re-identification Data Augmentation Method with Adversarial Defense Effect

Add code
Feb 10, 2021
Figure 1 for A Person Re-identification Data Augmentation Method with Adversarial Defense Effect
Figure 2 for A Person Re-identification Data Augmentation Method with Adversarial Defense Effect
Figure 3 for A Person Re-identification Data Augmentation Method with Adversarial Defense Effect
Figure 4 for A Person Re-identification Data Augmentation Method with Adversarial Defense Effect
Viaarxiv icon