Picture for Mingchuan yang

Mingchuan yang

Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy

Add code
Mar 07, 2024
Viaarxiv icon