Picture for Yang Deng

Yang Deng

AdaSteer: Your Aligned LLM is Inherently an Adaptive Jailbreak Defender

Add code
Apr 13, 2025
Viaarxiv icon

Steady-State Drifting Equilibrium Analysis of Single-Track Two-Wheeled Robots for Controller Design

Add code
Apr 12, 2025
Viaarxiv icon

Query Understanding in LLM-based Conversational Information Seeking

Add code
Apr 08, 2025
Viaarxiv icon

LightPROF: A Lightweight Reasoning Framework for Large Language Model on Knowledge Graph

Add code
Apr 04, 2025
Viaarxiv icon

Why Stop at One Error? Benchmarking LLMs as Data Science Code Debuggers for Multi-Hop and Multi-Bug Errors

Add code
Mar 28, 2025
Viaarxiv icon

Mapless Collision-Free Flight via MPC using Dual KD-Trees in Cluttered Environments

Add code
Mar 13, 2025
Viaarxiv icon

Chain of Strategy Optimization Makes Large Language Models Better Emotional Supporter

Add code
Mar 07, 2025
Viaarxiv icon

Beware of Your Po! Measuring and Mitigating AI Safety Risks in Role-Play Fine-Tuning of LLMs

Add code
Feb 28, 2025
Viaarxiv icon

FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models

Add code
Feb 25, 2025
Viaarxiv icon

PEARL: Towards Permutation-Resilient LLMs

Add code
Feb 20, 2025
Viaarxiv icon