RLP: Reinforcement Learning Pre-training

Karsten Kreis; Ali Hatamizadeh
Publication International Conference on Learning Representations (ICLR)