Picture for Julian McAuley

Julian McAuley

MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization

Add code
May 11, 2026
Viaarxiv icon

FERA: Uncertainty-Aware Federated Reasoning for Large Language Models

Add code
May 11, 2026
Viaarxiv icon

Skill-R1: Agent Skill Evolution via Reinforcement Learning

Add code
May 10, 2026
Viaarxiv icon

Expressiveness Limits of Autoregressive Semantic ID Generation in Generative Recommendation

Add code
May 07, 2026
Viaarxiv icon

From Local Indices to Global Identifiers: Generative Reranking for Recommender Systems via Global Action Space

Add code
Apr 28, 2026
Viaarxiv icon

Sink-Token-Aware Pruning for Fine-Grained Video Understanding in Efficient Video LLMs

Add code
Apr 22, 2026
Viaarxiv icon

CocoaBench: Evaluating Unified Digital Agents in the Wild

Add code
Apr 14, 2026
Viaarxiv icon

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Add code
Apr 07, 2026
Viaarxiv icon

Composer Vector: Style-steering Symbolic Music Generation in a Latent Space

Add code
Apr 03, 2026
Viaarxiv icon

Learning to Hint for Reinforcement Learning

Add code
Apr 01, 2026
Viaarxiv icon