Picture for Xiaofei Wen

Xiaofei Wen

Triaging Threats to Specialized Guardrails

Add code
May 29, 2026
Viaarxiv icon

Robust and Efficient Guardrails with Latent Reasoning

Add code
May 27, 2026
Viaarxiv icon

DebugLM: Learning Traceable Training Data Provenance for LLMs

Add code
Mar 18, 2026
Viaarxiv icon

Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models

Add code
May 26, 2025
Figure 1 for Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models
Figure 2 for Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models
Figure 3 for Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models
Figure 4 for Diagnosing and Mitigating Modality Interference in Multimodal Large Language Models
Viaarxiv icon

ThinkGuard: Deliberative Slow Thinking Leads to Cautious Guardrails

Add code
Feb 19, 2025
Viaarxiv icon

MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design

Add code
Dec 20, 2024
Figure 1 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 2 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 3 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 4 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Viaarxiv icon

Personalized Topic Selection Model for Topic-Grounded Dialogue

Add code
Jun 04, 2024
Viaarxiv icon

Red Teaming Language Models for Contradictory Dialogues

Add code
May 17, 2024
Figure 1 for Red Teaming Language Models for Contradictory Dialogues
Figure 2 for Red Teaming Language Models for Contradictory Dialogues
Figure 3 for Red Teaming Language Models for Contradictory Dialogues
Figure 4 for Red Teaming Language Models for Contradictory Dialogues
Viaarxiv icon

Sequential Topic Selection Model with Latent Variable for Topic-Grounded Dialogue

Add code
Oct 17, 2022
Figure 1 for Sequential Topic Selection Model with Latent Variable for Topic-Grounded Dialogue
Figure 2 for Sequential Topic Selection Model with Latent Variable for Topic-Grounded Dialogue
Figure 3 for Sequential Topic Selection Model with Latent Variable for Topic-Grounded Dialogue
Figure 4 for Sequential Topic Selection Model with Latent Variable for Topic-Grounded Dialogue
Viaarxiv icon