Picture for Pengfei He

Pengfei He

Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis

Add code
Jun 16, 2024
Figure 1 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Figure 2 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Figure 3 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Figure 4 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Viaarxiv icon

Learning-guided iterated local search for the minmax multiple traveling salesman problem

Add code
Mar 19, 2024
Figure 1 for Learning-guided iterated local search for the minmax multiple traveling salesman problem
Figure 2 for Learning-guided iterated local search for the minmax multiple traveling salesman problem
Figure 3 for Learning-guided iterated local search for the minmax multiple traveling salesman problem
Figure 4 for Learning-guided iterated local search for the minmax multiple traveling salesman problem
Viaarxiv icon

A Multi-population Integrated Approach for Capacitated Location Routing

Add code
Mar 14, 2024
Figure 1 for A Multi-population Integrated Approach for Capacitated Location Routing
Figure 2 for A Multi-population Integrated Approach for Capacitated Location Routing
Figure 3 for A Multi-population Integrated Approach for Capacitated Location Routing
Figure 4 for A Multi-population Integrated Approach for Capacitated Location Routing
Viaarxiv icon

The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation

Add code
Feb 23, 2024
Figure 1 for The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation
Figure 2 for The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation
Figure 3 for The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation
Figure 4 for The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation
Viaarxiv icon

Copyright Protection in Generative AI: A Technical Perspective

Add code
Feb 04, 2024
Figure 1 for Copyright Protection in Generative AI: A Technical Perspective
Figure 2 for Copyright Protection in Generative AI: A Technical Perspective
Figure 3 for Copyright Protection in Generative AI: A Technical Perspective
Figure 4 for Copyright Protection in Generative AI: A Technical Perspective
Viaarxiv icon

Superiority of Multi-Head Attention in In-Context Linear Regression

Add code
Jan 30, 2024
Viaarxiv icon

Exploring Memorization in Fine-tuned Language Models

Add code
Oct 10, 2023
Figure 1 for Exploring Memorization in Fine-tuned Language Models
Figure 2 for Exploring Memorization in Fine-tuned Language Models
Figure 3 for Exploring Memorization in Fine-tuned Language Models
Figure 4 for Exploring Memorization in Fine-tuned Language Models
Viaarxiv icon

On the Generalization of Training-based ChatGPT Detection Methods

Add code
Oct 03, 2023
Figure 1 for On the Generalization of Training-based ChatGPT Detection Methods
Figure 2 for On the Generalization of Training-based ChatGPT Detection Methods
Figure 3 for On the Generalization of Training-based ChatGPT Detection Methods
Figure 4 for On the Generalization of Training-based ChatGPT Detection Methods
Viaarxiv icon

FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models

Add code
Oct 03, 2023
Figure 1 for FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models
Figure 2 for FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models
Figure 3 for FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models
Figure 4 for FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models
Viaarxiv icon

Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering

Add code
Oct 06, 2022
Figure 1 for Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering
Figure 2 for Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering
Figure 3 for Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering
Figure 4 for Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering
Viaarxiv icon