Julian McAuley

Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning

Jun 02, 2026
Via arXiv

Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism

May 29, 2026
Via arXiv

Live Music Diffusion Models: Efficient Fine-Tuning and Post-Training of Interactive Diffusion Music Generators

May 21, 2026
Via arXiv

Auto-Dreamer: Learning Offline Memory Consolidation for Language Agents

May 20, 2026
Via arXiv

F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking

May 13, 2026
Via arXiv

MLPs are Efficient Distilled Generative Recommenders

May 12, 2026
Via arXiv

MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization

May 11, 2026
Via arXiv

FERA: Uncertainty-Aware Federated Reasoning for Large Language Models

May 11, 2026
Via arXiv

Skill-R1: Agent Skill Evolution via Reinforcement Learning

May 10, 2026
Via arXiv

Expressiveness Limits of Autoregressive Semantic ID Generation in Generative Recommendation

May 07, 2026
Via arXiv