Hossam Afifi

Long Chain-of-Thought Compression via Fine-Grained Group Policy Optimization

Feb 10, 2026
Via arXiv

Automatic Constraint Policy Optimization based on Continuous Constraint Interpolation Framework for Offline Reinforcement Learning

Jan 30, 2026
Via arXiv

Projection Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning

Jan 15, 2025
Via arXiv