Welcome to Journal of Graphics share: 

Journal of Graphics

Previous Articles     Next Articles

Training Framework of Distributed Robot Reinforcement Learning  Based on Spark

  

  1. (1. Institute of Cyber Systems and Control, Zhejiang University, Hangzhou Zhejiang 310027, China;  2. Department of Computer Science and Technology, Huaibei Vocational and Technical College, Huaibei Anhui 235000, China;  3. School of Computer Science, Hangzhou Dianzi University, Hangzhou Zhejiang 310018, China;  4. Materials Branch, State Grid Zhejiang Electric Power Company, LTD, Hangzhou Zhejiang 310000, China; 5. Institute of Intelligent Computing and Visualization Based on Big Data, Chongqing University of Arts and Sciences, Chongqing 402160, China)
  • Online:2019-10-31 Published:2019-11-06

Abstract: Through autonomous learning, reinforcement learning can train robots to complete various tasks that are difficult for them to implement with control methods, and this can effectively avoid system designers from systemic modeling or rules making. However, the training cost of reinforcement learning in the field of robot development and application is high, and it takes a large amount of time cost and hardware cost to realize learning and training. Although the hardware cost can be reduced to some extent based on simulation, for the complicated robot training platform such as Gazebo, the working efficiency of simulation process is low, and it takes a long time for data sampling. In order to effectively solve these problems, a distributed reinforcement learning framework based on Spark is put forward, which optimizes the usability and compatibility of platform of robot simulation process, offers distributed support for the training of reinforcement learning and robot simulation sampling, and has the characteristics of high compatibility and robustness. Through analyzing and contrasting the experimental data, the system framework can not only effectively improve the training speed of reinforcement learning model of robot and shorten the training time, but also help with the saving of hardware cost.

Key words: robot, reinforcement learning, Spark, distribute, data pipeline