Abhinav Rao

Jailbreak Paradox: The Achilles' Heel of LLMs

Jun 18, 2024
Via arXiv

NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models

Apr 18, 2024
Via arXiv

Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs

Oct 11, 2023
Via arXiv

Tricking LLMs into Disobedience: Understanding, Analyzing, and Preventing Jailbreaks

May 24, 2023
Via arXiv

Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin

Dec 10, 2022
Via arXiv