Picture for Peixuan Han

Peixuan Han

ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind

Add code
May 29, 2025
Viaarxiv icon

SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents

Add code
May 29, 2025
Viaarxiv icon

DecisionFlow: Advancing Large Language Model as Principled Decision Maker

Add code
May 27, 2025
Viaarxiv icon

Internal Activation as the Polar Star for Steering Unsafe LLM Behavior

Add code
Feb 04, 2025
Viaarxiv icon

EscapeBench: Pushing Language Models to Think Outside the Box

Add code
Dec 18, 2024
Viaarxiv icon

Distributionally Robust Unsupervised Dense Retrieval Training on Web Graphs

Add code
Oct 26, 2023
Viaarxiv icon