Philip S. Yu

Steve

MEMPROBE: Probing Long-Term Agent Memory via Hidden User-State Recovery

Jun 23, 2026
Via arXiv

RubricsTree: Scalable and Evolving Open-Ended Evaluation of Personal Health Agents across Health Memory and Medical Skills

Jun 16, 2026
Via arXiv

From Chatbot to Digital Colleague: The Paradigm Shift Toward Persistent Autonomous AI

Jun 12, 2026
Via arXiv

OpenSkill: Open-World Self-Evolution for LLM Agents

Jun 04, 2026
Via arXiv

Are Common Substructures Transferable? Riemannian Graph Foundation Model with Neural Vector Bundles

Jun 02, 2026
Via arXiv

Learning to Retrieve: Dual-Level Long-Term Memory for Text-to-SQL Agents

May 30, 2026
Via arXiv

Verifiable Rewards Beyond Math and Code: Lightweight Corpus-Grounded Process Supervision for Factual Question Answering

May 28, 2026
Via arXiv

Distributionally Robust Set Representation Learning Under Inference-Time Element Corruption

May 28, 2026
Via arXiv

Toward User Preference Alignment in LLM Recommendation via Explicit Context Feedback

May 27, 2026
Via arXiv

FinHarness: An Inline Lifecycle Safety Harness for Finance LLM Agents

May 26, 2026
Via arXiv