Bryan Hooi

Towards Realistic Personalization: Evaluating Long-Horizon Preference Following in Personalized User-LLM Interactions

Mar 04, 2026
Via arXiv

KLong: Training LLM Agent for Extremely Long-horizon Tasks

Feb 19, 2026
Via arXiv

Zombie Agents: Persistent Control of Self-Evolving LLM Agents via Self-Reinforcing Injections

Feb 17, 2026
Via arXiv

EvoClinician: A Self-Evolving Agent for Multi-Turn Medical Diagnosis via Test-Time Evolutionary Learning

Jan 30, 2026
Via arXiv

Autonomous Chain-of-Thought Distillation for Graph-Based Fraud Detection

Jan 30, 2026
Via arXiv

Conversation for Non-verifiable Learning: Self-Evolving LLMs through Meta-Evaluation

Jan 29, 2026
Via arXiv

Just-In-Time Reinforcement Learning: Continual Learning in LLM Agents Without Gradient Updates

Jan 26, 2026
Via arXiv

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Jan 15, 2026
Via arXiv

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Jan 13, 2026
Via arXiv

Echoless Label-Based Pre-computation for Memory-Efficient Heterogeneous Graph Learning

Nov 14, 2025
Via arXiv