Chenmien Tan

CHARM: Calibrating Reward Models With Chatbot Arena Scores

Apr 14, 2025

Massive Editing for Large Language Models via Meta Learning

Nov 09, 2023

Learning Rewards to Optimize Global Performance Metrics in Deep Reinforcement Learning

Mar 16, 2023