Picture for Yuanfu Wang

Yuanfu Wang

Native Reasoning Models: Training Language Models to Reason on Unverifiable Data

Add code
Feb 12, 2026
Viaarxiv icon

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Add code
Jul 24, 2025
Figure 1 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Figure 2 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Figure 3 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Figure 4 for SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law
Viaarxiv icon

Adversarial Preference Learning for Robust LLM Alignment

Add code
May 30, 2025
Figure 1 for Adversarial Preference Learning for Robust LLM Alignment
Figure 2 for Adversarial Preference Learning for Robust LLM Alignment
Figure 3 for Adversarial Preference Learning for Robust LLM Alignment
Figure 4 for Adversarial Preference Learning for Robust LLM Alignment
Viaarxiv icon

Inference-Time Language Model Alignment via Integrated Value Guidance

Add code
Sep 26, 2024
Figure 1 for Inference-Time Language Model Alignment via Integrated Value Guidance
Figure 2 for Inference-Time Language Model Alignment via Integrated Value Guidance
Figure 3 for Inference-Time Language Model Alignment via Integrated Value Guidance
Figure 4 for Inference-Time Language Model Alignment via Integrated Value Guidance
Viaarxiv icon

Critic-Guided Decision Transformer for Offline Reinforcement Learning

Add code
Dec 21, 2023
Figure 1 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Figure 2 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Figure 3 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Figure 4 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Viaarxiv icon