Reinforcement learning allows a self-learning agent to stabilize an unmanned aerial vehicle in uncontrolled flight states. To achieve this, a deep deterministic policy gradient algorithm is applied. Through extensions like experience replay memory, …