Lotte Weerts

Reinforced Self-Training (ReST) for Language Modeling

Aug 21, 2023
Via arXiv