Picture for Han Xu

Han Xu

Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis

Add code
Jun 16, 2024
Figure 1 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Figure 2 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Figure 3 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Figure 4 for Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis
Viaarxiv icon

Encoding Hierarchical Schema via Concept Flow for Multifaceted Ideology Detection

Add code
May 29, 2024
Viaarxiv icon

Self-playing Adversarial Language Game Enhances LLM Reasoning

Add code
Apr 16, 2024
Figure 1 for Self-playing Adversarial Language Game Enhances LLM Reasoning
Figure 2 for Self-playing Adversarial Language Game Enhances LLM Reasoning
Figure 3 for Self-playing Adversarial Language Game Enhances LLM Reasoning
Figure 4 for Self-playing Adversarial Language Game Enhances LLM Reasoning
Viaarxiv icon

Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image Fusion

Add code
Mar 25, 2024
Viaarxiv icon

Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention

Add code
Mar 17, 2024
Figure 1 for Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention
Figure 2 for Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention
Figure 3 for Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention
Figure 4 for Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention
Viaarxiv icon

The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation

Add code
Feb 23, 2024
Figure 1 for The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation
Figure 2 for The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation
Figure 3 for The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation
Figure 4 for The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation
Viaarxiv icon

Copyright Protection in Generative AI: A Technical Perspective

Add code
Feb 04, 2024
Figure 1 for Copyright Protection in Generative AI: A Technical Perspective
Figure 2 for Copyright Protection in Generative AI: A Technical Perspective
Figure 3 for Copyright Protection in Generative AI: A Technical Perspective
Figure 4 for Copyright Protection in Generative AI: A Technical Perspective
Viaarxiv icon

A Scalable Network-Aware Multi-Agent Reinforcement Learning Framework for Decentralized Inverter-based Voltage Control

Add code
Dec 07, 2023
Viaarxiv icon

Exploring Memorization in Fine-tuned Language Models

Add code
Oct 10, 2023
Figure 1 for Exploring Memorization in Fine-tuned Language Models
Figure 2 for Exploring Memorization in Fine-tuned Language Models
Figure 3 for Exploring Memorization in Fine-tuned Language Models
Figure 4 for Exploring Memorization in Fine-tuned Language Models
Viaarxiv icon

FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models

Add code
Oct 03, 2023
Figure 1 for FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models
Figure 2 for FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models
Figure 3 for FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models
Figure 4 for FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models
Viaarxiv icon