Picture for Yirong Chen

Yirong Chen

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Add code
Jun 15, 2026
Viaarxiv icon

A First-Principles Derivation of LLM Policy Optimization: From Expected Reward to GRPO and Its Structural Extensions

Add code
Jun 15, 2026
Viaarxiv icon

OmniTraffic: A Controllable Generation Pipeline and Benchmark for Spatio-Temporal Traffic Reasoning

Add code
Jun 14, 2026
Viaarxiv icon

CuSearch: Curriculum Rollout Sampling via Search Depth for Agentic RAG

Add code
May 14, 2026
Viaarxiv icon

MedProbeBench: Systematic Benchmarking at Deep Evidence Integration for Expert-level Medical Guideline

Add code
Apr 20, 2026
Viaarxiv icon

Hierarchical Memory Orchestration for Personalized Persistent Agents

Add code
Apr 02, 2026
Viaarxiv icon

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Add code
Mar 29, 2026
Viaarxiv icon

MERRY: Semantically Decoupled Evaluation of Multimodal Emotional and Role Consistencies of Role-Playing Agents

Add code
Feb 24, 2026
Viaarxiv icon

MedAD-R1: Eliciting Consistent Reasoning in Interpretible Medical Anomaly Detection via Consistency-Reinforced Policy Optimization

Add code
Feb 01, 2026
Viaarxiv icon

EpiPlanAgent: Agentic Automated Epidemic Response Planning

Add code
Dec 12, 2025
Figure 1 for EpiPlanAgent: Agentic Automated Epidemic Response Planning
Figure 2 for EpiPlanAgent: Agentic Automated Epidemic Response Planning
Figure 3 for EpiPlanAgent: Agentic Automated Epidemic Response Planning
Figure 4 for EpiPlanAgent: Agentic Automated Epidemic Response Planning
Viaarxiv icon