Training Framework of Distributed Robot Reinforcement Learning Based on Spark

doi:10.11996/JG.j.2095-302X.2019050852

Journal of Graphics

(1. Institute of Cyber Systems and Control, Zhejiang University, Hangzhou Zhejiang 310027, China; 2. Department of Computer Science and Technology, Huaibei Vocational and Technical College, Huaibei Anhui 235000, China; 3. School of Computer Science, Hangzhou Dianzi University, Hangzhou Zhejiang 310018, China; 4. Materials Branch, State Grid Zhejiang Electric Power Company, LTD, Hangzhou Zhejiang 310000, China; 5. Institute of Intelligent Computing and Visualization Based on Big Data, Chongqing University of Arts and Sciences, Chongqing 402160, China)

Online: 2019-10-31 Published: 2019-11-06

Abstract

Through autonomous learning, reinforcement learning can train robots to complete tasks that are difficult to implement with control methods, sparing system designers from building system models or hand-crafting rules. However, reinforcement learning is expensive to apply in robot development: training demands a large amount of both time and hardware. Simulation can reduce the hardware cost to some extent, but on complex robot simulation platforms such as Gazebo the simulation runs inefficiently and data sampling takes a long time. To address these problems, a distributed reinforcement learning framework based on Spark is proposed. The framework improves the usability and platform compatibility of the robot simulation process, provides distributed support for both reinforcement learning training and robot simulation sampling, and offers high compatibility and robustness. Analysis and comparison of the experimental data show that the framework not only speeds up training of the robot's reinforcement learning model and shortens training time, but also helps to reduce hardware cost.
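The distributed sampling idea can be illustrated with a minimal sketch. This is not the authors' implementation: the toy one-dimensional environment, the `rollout` and `collect_samples` functions, and all parameters are hypothetical, and a local thread pool stands in for Spark executors so that the example runs without a cluster. Each worker simulates one episode independently, and the driver merges the resulting transitions into a single training batch.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def rollout(seed, steps=50):
    """Simulate one episode in a toy 1-D environment.

    Hypothetical stand-in for a single robot simulation run (e.g. in Gazebo).
    Returns a list of (state, action, reward, next_state) transitions.
    """
    rng = random.Random(seed)  # per-episode RNG keeps each rollout reproducible
    position = 0
    transitions = []
    for _ in range(steps):
        action = rng.choice((-1, 1))      # random policy: step left or right
        next_position = position + action
        reward = -abs(next_position)      # staying near the origin is rewarded
        transitions.append((position, action, reward, next_position))
        position = next_position
    return transitions


def collect_samples(seeds, workers=4):
    """Run independent rollouts in parallel and flatten them into one batch.

    Assumption: the thread pool plays the role of Spark executors. With PySpark
    the same pattern would look roughly like
    sc.parallelize(seeds, numSlices=workers).map(rollout).collect().
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        episodes = list(pool.map(rollout, seeds))
    return [t for episode in episodes for t in episode]


batch = collect_samples(range(8))  # 8 episodes of 50 steps each
```

Because the episodes do not depend on each other, sampling throughput in this pattern grows with the number of workers until the driver's merge step or the learner becomes the bottleneck.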

Key words: robot, reinforcement learning, Spark, distributed, data pipeline

FANG Wei1,2, HUANG Zeng-qiang3, XU Jian-bin4, HUANG Yi1,5, MA Xin-qiang1,5. Training Framework of Distributed Robot Reinforcement Learning Based on Spark[J]. Journal of Graphics, DOI: 10.11996/JG.j.2095-302X.2019050852.

