Counterfactual Reward Model Training for Bias Mitigation in Multimodal Reinforcement Learning

Add code
Aug 27, 2025
Figure 1 for Counterfactual Reward Model Training for Bias Mitigation in Multimodal Reinforcement Learning
Figure 2 for Counterfactual Reward Model Training for Bias Mitigation in Multimodal Reinforcement Learning
Figure 3 for Counterfactual Reward Model Training for Bias Mitigation in Multimodal Reinforcement Learning

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: