Journal of Graphics

• Geometric Design and Computation •

Improvement of task scheduling based on Q-learning

  • Online: 2012-06-29 Published: 2015-07-28

Abstract: This paper builds a Markov decision process (MDP) model of the task scheduling problem
in cooperative work and, on that basis, proposes an improved Q-learning algorithm based on
simulated annealing (the Metropolis rule). The algorithm combines the Metropolis rule with a
greedy strategy and screens candidates in the state space, which significantly accelerates
convergence and shortens the running time. Finally, the algorithm is compared with related
algorithms from other papers and its performance is analyzed; the comparison confirms the
efficiency of the improved Q-learning algorithm.

Key words: task scheduling, Q-learning, reinforcement learning, simulated annealing
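
The abstract does not give the paper's MDP model or update rules, so the following is only a minimal sketch of the general idea: tabular Q-learning in which actions are chosen by a Metropolis acceptance test against the greedy action, with a temperature that cools over episodes. Everything specific here is an assumption for illustration: the toy two-machine problem, the task durations in `DURATIONS`, the reward (negative makespan at the end of an episode), and all hyperparameters. The state-space screening step mentioned in the abstract is not described there and is left out.

```python
import math
import random
from collections import defaultdict

# Hypothetical toy problem: assign tasks, in order, to one of two machines.
DURATIONS = [4, 3, 3, 2]
N_MACHINES = 2


def step(state, action):
    """Assign the next task to machine `action`; return (next_state, reward, done)."""
    idx, loads = state
    new_loads = list(loads)
    new_loads[action] += DURATIONS[idx]
    done = idx + 1 == len(DURATIONS)
    # Assumed reward: negative makespan once every task is placed, 0 before.
    reward = -float(max(new_loads)) if done else 0.0
    return (idx + 1, tuple(new_loads)), reward, done


def metropolis_action(q, state, temperature, rng):
    """Pick a random candidate and accept it by the Metropolis rule, else act greedily."""
    values = q[state]
    greedy = max(range(N_MACHINES), key=lambda a: values[a])
    candidate = rng.randrange(N_MACHINES)
    delta = values[candidate] - values[greedy]  # never positive
    if delta >= 0 or rng.random() < math.exp(delta / temperature):
        return candidate
    return greedy


def train(episodes=2000, alpha=0.5, gamma=1.0, t0=1.0, cooling=0.995,
          t_min=1e-3, seed=0):
    """Run tabular Q-learning with a geometrically cooled temperature."""
    rng = random.Random(seed)
    q = defaultdict(lambda: [0.0] * N_MACHINES)
    temperature = t0
    for _ in range(episodes):
        state, done = (0, (0,) * N_MACHINES), False
        while not done:
            action = metropolis_action(q, state, temperature, rng)
            next_state, reward, done = step(state, action)
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
        temperature = max(t_min, temperature * cooling)  # annealing schedule
    return q


def greedy_makespan(q):
    """Follow the learned greedy policy once and return the resulting makespan."""
    state, done = (0, (0,) * N_MACHINES), False
    while not done:
        action = max(range(N_MACHINES), key=lambda a: q[state][a])
        state, _, done = step(state, action)
    return max(state[1])


if __name__ == "__main__":
    print(greedy_makespan(train()))
```

For these durations the best possible makespan is 6 (4 + 2 on one machine, 3 + 3 on the other), which gives a reference value for the learned schedule. At high temperature nearly every candidate is accepted, so the agent explores; as the temperature falls, the acceptance probability for worse actions shrinks and the policy approaches the purely greedy one.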