Picture for Bryan Hooi

Bryan Hooi

APEX: Autonomous Policy Exploration for Self-Evolving LLM Agents

Add code
May 20, 2026
Viaarxiv icon

WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections

Add code
May 14, 2026
Viaarxiv icon

TACT: Mitigating Overthinking and Overacting in Coding Agents via Activation Steering

Add code
May 07, 2026
Viaarxiv icon

How Creative Are Large Language Models in Generating Molecules?

Add code
Apr 20, 2026
Viaarxiv icon

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies

Add code
Apr 02, 2026
Viaarxiv icon

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

Add code
Mar 30, 2026
Viaarxiv icon

AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Add code
Mar 29, 2026
Viaarxiv icon

Towards Realistic Personalization: Evaluating Long-Horizon Preference Following in Personalized User-LLM Interactions

Add code
Mar 04, 2026
Viaarxiv icon

KLong: Training LLM Agent for Extremely Long-horizon Tasks

Add code
Feb 19, 2026
Viaarxiv icon

Zombie Agents: Persistent Control of Self-Evolving LLM Agents via Self-Reinforcing Injections

Add code
Feb 17, 2026
Viaarxiv icon