Yaocheng Zhang

CriticSearch: Fine-Grained Credit Assignment for Search Agents via a Retrospective Critic

Nov 15, 2025
Via arXiv

In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning

Dec 12, 2024
Via arXiv