Alert button
Picture for Xiaomeng Hu

Xiaomeng Hu

Alert button

Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes

Add code
Bookmark button
Alert button
Mar 05, 2024
Xiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho

Figure 1 for Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes
Figure 2 for Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes
Figure 3 for Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes
Figure 4 for Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes
Viaarxiv icon

RADAR: Robust AI-Text Detection via Adversarial Learning

Add code
Bookmark button
Alert button
Jul 07, 2023
Xiaomeng Hu, Pin-Yu Chen, Tsung-Yi Ho

Figure 1 for RADAR: Robust AI-Text Detection via Adversarial Learning
Figure 2 for RADAR: Robust AI-Text Detection via Adversarial Learning
Figure 3 for RADAR: Robust AI-Text Detection via Adversarial Learning
Figure 4 for RADAR: Robust AI-Text Detection via Adversarial Learning
Viaarxiv icon

Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning

Add code
Bookmark button
Alert button
May 16, 2023
Hao Chen, Yiming Zhang, Qi Zhang, Hantao Yang, Xiaomeng Hu, Xuetao Ma, Yifan Yanggong, Junbo Zhao

Figure 1 for Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning
Figure 2 for Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning
Figure 3 for Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning
Figure 4 for Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning
Viaarxiv icon

P^3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning

Add code
Bookmark button
Alert button
May 05, 2022
Xiaomeng Hu, Shi Yu, Chenyan Xiong, Zhenghao Liu, Zhiyuan Liu, Ge Yu

Figure 1 for P^3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning
Figure 2 for P^3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning
Figure 3 for P^3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning
Figure 4 for P^3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based Learning and Pre-finetuning
Viaarxiv icon