Picture for Chenyan Xiong

Chenyan Xiong

Microsoft Research

SkillLearnBench: Benchmarking Continual Learning Methods for Agent Skill Generation on Real-World Tasks

Add code
Apr 22, 2026
Viaarxiv icon

EmbodiedMidtrain: Bridging the Gap between Vision-Language Models and Vision-Language-Action Models via Mid-training

Add code
Apr 21, 2026
Viaarxiv icon

Efficient Dataset Selection for Continual Adaptation of Generative Recommenders

Add code
Apr 09, 2026
Viaarxiv icon

Linking Knowledge to Care: Knowledge Graph-Augmented Medical Follow-Up Question Generation

Add code
Mar 01, 2026
Viaarxiv icon

Benchmark Test-Time Scaling of General LLM Agents

Add code
Feb 22, 2026
Viaarxiv icon

Agentic Search in the Wild: Intents and Trajectory Dynamics from 14M+ Real Search Requests

Add code
Jan 24, 2026
Viaarxiv icon

ORBIT -- Open Recommendation Benchmark for Reproducible Research with Hidden Tests

Add code
Oct 30, 2025
Viaarxiv icon

AutoRule: Reasoning Chain-of-thought Extracted Rule-based Rewards Improve Preference Learning

Add code
Jun 18, 2025
Viaarxiv icon

Semi-structured LLM Reasoners Can Be Rigorously Audited

Add code
May 30, 2025
Viaarxiv icon

ConsRec: Denoising Sequential Recommendation through User-Consistent Preference Modeling

Add code
May 28, 2025
Viaarxiv icon