Picture for Yi Wu

Yi Wu

QP-OneModel: A Unified Generative LLM for Multi-Task Query Understanding in Xiaohongshu Search

Add code
Feb 10, 2026
Viaarxiv icon

Self-Evolving Recommendation System: End-To-End Autonomous Model Optimization With LLM Agents

Add code
Feb 10, 2026
Viaarxiv icon

Learning to Alleviate Familiarity Bias in Video Recommendation

Add code
Feb 08, 2026
Viaarxiv icon

AREAL-DTA: Dynamic Tree Attention for Efficient Reinforcement Learning of Large Language Models

Add code
Jan 31, 2026
Viaarxiv icon

From Self-Evolving Synthetic Data to Verifiable-Reward RL: Post-Training Multi-turn Interactive Tool-Using Agents

Add code
Jan 30, 2026
Viaarxiv icon

Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning

Add code
Jan 29, 2026
Viaarxiv icon

JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG

Add code
Jan 29, 2026
Viaarxiv icon

Scaling Test-time Inference for Visual Grounding

Add code
Jan 20, 2026
Viaarxiv icon

Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models

Add code
Jan 07, 2026
Viaarxiv icon

Generalist vs Specialist Time Series Foundation Models: Investigating Potential Emergent Behaviors in Assessing Human Health Using PPG Signals

Add code
Oct 16, 2025
Figure 1 for Generalist vs Specialist Time Series Foundation Models: Investigating Potential Emergent Behaviors in Assessing Human Health Using PPG Signals
Figure 2 for Generalist vs Specialist Time Series Foundation Models: Investigating Potential Emergent Behaviors in Assessing Human Health Using PPG Signals
Figure 3 for Generalist vs Specialist Time Series Foundation Models: Investigating Potential Emergent Behaviors in Assessing Human Health Using PPG Signals
Figure 4 for Generalist vs Specialist Time Series Foundation Models: Investigating Potential Emergent Behaviors in Assessing Human Health Using PPG Signals
Viaarxiv icon