Jen-tse Huang

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Apr 22, 2025

CODECRASH: Stress Testing LLM Reasoning under Structural and Semantic Perturbations

Apr 19, 2025

SOTOPIA-S4: a user-friendly system for flexible, customizable, and large-scale social simulation

Apr 19, 2025

BIASINSPECTOR: Detecting Bias in Structured Data through LLM Agents

Apr 07, 2025

Can LLMs Grasp Implicit Cultural Values? Benchmarking LLMs' Metacognitive Cultural Intelligence with CQ-Bench

Apr 01, 2025

VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models

Mar 10, 2025

CoSER: Coordinating LLM-Based Persona Simulation of Established Roles

Feb 13, 2025

Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries

Feb 09, 2025

FairCode: Evaluating Social Bias of LLMs in Code Generation

Jan 09, 2025

On the Shortcut Learning in Multilingual Neural Machine Translation

Nov 15, 2024