Application of Reward Learning to generate news

April 2021

PDF Project

Abstract

This paper examines the usage of proximal policy optimization applied to pre-trained neural language models based on the transformer architecture. This approach is then used to generate convincing News.

Type

Report

reinforcement learning