Picture for Yuxin Xiao

Yuxin Xiao

When Style Breaks Safety: Defending Language Models Against Superficial Style Alignment

Add code
Jun 09, 2025
Viaarxiv icon

KScope: A Framework for Characterizing the Knowledge Status of Language Models

Add code
Jun 09, 2025
Viaarxiv icon

MedBrowseComp: Benchmarking Medical Deep Research and Computer Use

Add code
May 20, 2025
Viaarxiv icon

Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions

Add code
Feb 06, 2025
Figure 1 for Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
Figure 2 for Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
Figure 3 for Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
Figure 4 for Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
Viaarxiv icon

Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Control

Add code
Nov 04, 2024
Figure 1 for Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Control
Figure 2 for Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Control
Figure 3 for Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Control
Figure 4 for Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Control
Viaarxiv icon

SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe

Add code
Oct 07, 2024
Figure 1 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 2 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 3 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 4 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Viaarxiv icon

In the Name of Fairness: Assessing the Bias in Clinical Record De-identification

Add code
May 18, 2023
Viaarxiv icon

Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis

Add code
Oct 10, 2022
Figure 1 for Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
Figure 2 for Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
Figure 3 for Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
Figure 4 for Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
Viaarxiv icon

SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction

Add code
Sep 24, 2021
Figure 1 for SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction
Figure 2 for SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction
Figure 3 for SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction
Figure 4 for SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction
Viaarxiv icon

Amortized Auto-Tuning: Cost-Efficient Transfer Optimization for Hyperparameter Recommendation

Add code
Jun 17, 2021
Figure 1 for Amortized Auto-Tuning: Cost-Efficient Transfer Optimization for Hyperparameter Recommendation
Figure 2 for Amortized Auto-Tuning: Cost-Efficient Transfer Optimization for Hyperparameter Recommendation
Figure 3 for Amortized Auto-Tuning: Cost-Efficient Transfer Optimization for Hyperparameter Recommendation
Figure 4 for Amortized Auto-Tuning: Cost-Efficient Transfer Optimization for Hyperparameter Recommendation
Viaarxiv icon