Shuyue Stella Li

Precise Information Control in Long-Form Text Generation

Jun 06, 2025

BehaviorSFT: Behavioral Token Conditioning for Clinical Agents Across the Proactivity Spectrum

May 27, 2025

BLAB: Brutally Long Audio Bench

May 05, 2025

A False Sense of Privacy: Evaluating Textual Data Sanitization Beyond Surface-level Privacy Leakage

Apr 28, 2025

Aligning LLMs to Ask Good Questions: A Case Study in Clinical Reasoning

Feb 20, 2025

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Oct 03, 2024

ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions

Jul 02, 2024

Teaching LLMs to Abstain across Languages via Multilingual Feedback

Jun 22, 2024

MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning

Jun 04, 2024

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

Apr 10, 2024