Abel Salinas

Risk and Response in Large Language Models: Evaluating Key Threat Categories

Mar 22, 2024
Bahareh Harandizadeh, Abel Salinas, Fred Morstatter

The Butterfly Effect of Altering Prompts: How Small Changes and Jailbreaks Affect Large Language Model Performance

Jan 09, 2024
Abel Salinas, Fred Morstatter

"Im not Racist but...": Discovering Bias in the Internal Knowledge of Large Language Models

Oct 13, 2023
Abel Salinas, Louis Penafiel, Robert McCormack, Fred Morstatter

The Unequal Opportunities of Large Language Models: Revealing Demographic Bias through Job Recommendations

Aug 03, 2023
Abel Salinas, Parth Vipul Shah, Yuzhong Huang, Robert McCormack, Fred Morstatter

Boosting Punctuation Restoration with Data Generation and Reinforcement Learning

Jul 24, 2023
Viet Dac Lai, Abel Salinas, Hao Tan, Trung Bui, Quan Tran, Seunghyun Yoon, Hanieh Deilamsalehy, Franck Dernoncourt, Thien Huu Nguyen
