Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
Ramamurthy, Rajkumar; Ammanabrolu, Prithviraj; Brantley, Kianté; Hessel, Jack; Sifa, Rafet; Bauckhage, Christian; Hajishirzi, Hannaneh; Choi, Yejin, 2023
Published in: The Eleventh International Conference on Learning Representations