Picture for Wentao Shi

Wentao Shi

FMVP: Masked Flow Matching for Adversarial Video Purification

Add code
Jan 05, 2026
Viaarxiv icon

RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models

Add code
Dec 08, 2025
Viaarxiv icon

MGFRec: Towards Reinforced Reasoning Recommendation with Multiple Groundings and Feedback

Add code
Oct 27, 2025
Figure 1 for MGFRec: Towards Reinforced Reasoning Recommendation with Multiple Groundings and Feedback
Figure 2 for MGFRec: Towards Reinforced Reasoning Recommendation with Multiple Groundings and Feedback
Figure 3 for MGFRec: Towards Reinforced Reasoning Recommendation with Multiple Groundings and Feedback
Figure 4 for MGFRec: Towards Reinforced Reasoning Recommendation with Multiple Groundings and Feedback
Viaarxiv icon

AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance

Add code
Aug 27, 2025
Figure 1 for AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance
Figure 2 for AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance
Figure 3 for AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance
Figure 4 for AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance
Viaarxiv icon

VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation

Add code
Jul 09, 2025
Viaarxiv icon

Reinforcement Fine-Tuning for Reasoning towards Multi-Step Multi-Source Search in Large Language Models

Add code
Jun 10, 2025
Viaarxiv icon

Fine-grained List-wise Alignment for Generative Medication Recommendation

Add code
May 26, 2025
Viaarxiv icon

Process-Supervised LLM Recommenders via Flow-guided Tuning

Add code
Mar 10, 2025
Viaarxiv icon

Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment

Add code
Feb 20, 2025
Figure 1 for Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment
Figure 2 for Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment
Figure 3 for Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment
Figure 4 for Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment
Viaarxiv icon

Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search

Add code
Feb 02, 2025
Figure 1 for Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search
Figure 2 for Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search
Figure 3 for Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search
Figure 4 for Efficient Multi-Agent System Training with Data Influence-Oriented Tree Search
Viaarxiv icon