Wentao Shi

Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models

Jun 10, 2025
Via arXiv

Fine-grained List-wise Alignment for Generative Medication Recommendation

May 26, 2025
Via arXiv

Process-Supervised LLM Recommenders via Flow-guided Tuning

Mar 10, 2025
Via arXiv

Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment

Feb 20, 2025
Via arXiv

Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search

Feb 02, 2025
Via arXiv

Debias Can be Unreliable: Mitigating Bias Issue in Evaluating Debiasing Recommendation

Sep 07, 2024
Via arXiv

Direct Multi-Turn Preference Optimization for Language Agents

Jun 25, 2024
Via arXiv

Sparse Sampling is All You Need for Fast Wrong-way Cycling Detection in CCTV Videos

May 12, 2024
Via arXiv

Uplift Modeling for Target User Attacks on Recommender Systems

Mar 05, 2024
Via arXiv

Prospect Personalized Recommendation on Large Language Model-based Agent Platform

Mar 05, 2024
Via arXiv