Alert button

Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model

Jan 23, 2024
Zhiwei He, Xing Wang, Wenxiang Jiao, Zhuosheng Zhang, Rui Wang, Shuming Shi, Zhaopeng Tu

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: