Picture for Yuanfu Wang

Yuanfu Wang

Adversarial Preference Learning for Robust LLM Alignment

Add code
May 30, 2025
Viaarxiv icon

Inference-Time Language Model Alignment via Integrated Value Guidance

Add code
Sep 26, 2024
Viaarxiv icon

Critic-Guided Decision Transformer for Offline Reinforcement Learning

Add code
Dec 21, 2023
Figure 1 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Figure 2 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Figure 3 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Figure 4 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Viaarxiv icon