Picture for Yufeng Yuan

Yufeng Yuan

SA-WiSense: A Blind-Spot-Free Respiration Sensing Framework for Single-Antenna Wi-Fi Devices

Add code
Jul 24, 2025
Viaarxiv icon

Truncated Proximal Policy Optimization

Add code
Jun 18, 2025
Viaarxiv icon

PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier

Add code
Jun 12, 2025
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Viaarxiv icon

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Add code
Apr 08, 2025
Figure 1 for VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
Figure 2 for VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
Figure 3 for VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks
Viaarxiv icon

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Add code
Mar 18, 2025
Viaarxiv icon

What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret

Add code
Mar 03, 2025
Figure 1 for What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret
Figure 2 for What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret
Figure 3 for What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret
Figure 4 for What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret
Viaarxiv icon

Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots

Add code
Mar 31, 2022
Figure 1 for Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Figure 2 for Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Figure 3 for Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Figure 4 for Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots
Viaarxiv icon

Receptive Multi-granularity Representation for Person Re-Identification

Add code
Aug 31, 2020
Figure 1 for Receptive Multi-granularity Representation for Person Re-Identification
Figure 2 for Receptive Multi-granularity Representation for Person Re-Identification
Figure 3 for Receptive Multi-granularity Representation for Person Re-Identification
Figure 4 for Receptive Multi-granularity Representation for Person Re-Identification
Viaarxiv icon

Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

Add code
Mar 09, 2020
Figure 1 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network
Figure 2 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network
Figure 3 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network
Figure 4 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network
Viaarxiv icon