Wenjie Wang

From Empathy to Personalized Empathy: Adapting Empathetic Strategies to Individual Users

May 30, 2026

Trustworthy Recommendation in the Era of Large Language Models: Opportunities and Challenges

May 30, 2026

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

May 28, 2026

Plant, Persist, Trigger: Sleeper Attack on Large Language Model Agents

May 27, 2026

ARES: Automated Rubric Synthesis for Scalable LLM Reinforcement Learning

May 25, 2026

Unified Data Selection for LLM Reasoning

May 21, 2026

Frequency-Domain Regularized Adversarial Alignment for Transferable Attacks against Closed-Source MLLMs

May 20, 2026

EVA: Editing for Versatile Alignment against Jailbreaks

May 14, 2026

UniCustom: Unified Visual Conditioning for Multi-Reference Image Generation

May 13, 2026

SAGE: Scalable Automated Robustness Augmentation for LLM Knowledge Evaluation

May 12, 2026