deterministic policy gradients

Development of a Reinforcement Learning Based Aircraft Autopilot Using Deterministic Policy Gradients

One of the most difficult challenges in reinforcement learning is the control of systems with continuous state and action spaces. The goal of this paper is to design and implement a reinforcement-learning-based airplane autopilot that controls an …