Kehan Guo

ProbeLLM: Automating Principled Diagnosis of LLM Failures

Feb 13, 2026
Via arXiv

Capability-Oriented Training Induced Alignment Risk

Feb 12, 2026
Via arXiv

Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study

Jun 05, 2025
Via arXiv

AdaReasoner: Adaptive Reasoning Enables More Flexible Thinking

May 22, 2025
Via arXiv

Beyond Single-Value Metrics: Evaluating and Enhancing LLM Unlearning with Cognitive Diagnosis

Feb 19, 2025
Via arXiv

UOE: Unlearning One Expert Is Enough For Mixture-of-experts LLMs

Nov 27, 2024
Via arXiv

Social Science Meets LLMs: How Reliable Are Large Language Models in Social Simulations?

Oct 30, 2024
Via arXiv

LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs

Oct 18, 2024
Via arXiv

ScholarChemQA: Unveiling the Power of Language Models in Chemical Research Question Answering

Jul 24, 2024
Via arXiv

Defending Jailbreak Prompts via In-Context Adversarial Game

Feb 20, 2024
Via arXiv