Home > Published Issues > 2020 > Volume 9, No. 12, December 2020 >

Path Optimization of a Single Surveillance Drone Based on Reinforcement Learning

Dongoo Lee 1 and Dowan Cha 2
1. Korea Aerospace Research Institute, Daejeon, Korea
2. Department of Drone-Robot Engineering, Pai Chai University, Daejeon, Korea

Abstract—A surveillance drone supervises designated area or sites against dangerous situation. In recent years, it is required to perform autonomous flight to achieve the supervision with path optimization based on minimization of the time lag. In this paper, we propose the reinforcement learning algorithm to optimize path for autonomous flight of surveillance drones. We present a simulation result of a single surveillance drone, which has reinforcement learning algorithm in an unknown grid area. A single surveillance drone finds the optimized path autonomously with minimization of the time lag. This paper provides the following two main contributions for autonomous flight of the surveillance drone. First, the surveillance drone finds the optimized path autonomously using proposed the reinforcement learning algorithm. Second, the traditional reinforcement learning was improved with parameter optimization including learning rate coefficient, convergence criteria, and adaptive error convergence detection for ε-greedy policy process.

Index Terms—path optimization, drone, machine learning, reinforcement learning

Cite: Dongoo Lee and Dowan Cha, "Path Optimization of a Single Surveillance Drone Based on Reinforcement Learning," International Journal of Mechanical Engineering and Robotics Research, Vol. 9, No. 12, pp. 1541-1547, December 2020. DOI: 10.18178/ijmerr.9.12.1541-1547

Copyright © 2020 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.