AC-HAPE3D: an algorithm for irregular packing based on reinforcement learning

Journal of Graphics, 2022, Vol. 43, Issue 6: 1096-1103. DOI: 10.11996/JG.j.2095-302X.2022061096

• Computer Graphics and Virtual Reality •

School of Computer Science and Engineering, South China University of Technology, Guangzhou, Guangdong 510006, China

Online: 2022-12-30 Published: 2023-01-11

Abstract:

In fields such as 3D printing and express logistics, parts or goods of varied shapes must be placed within a limited space; this task is called irregular packing. The irregular packing problem asks for a placement that fits as many polyhedra as possible into a given container, or that arranges a batch of objects compactly so that the occupied volume is minimal. This is an NP-hard problem that is difficult to solve efficiently. This paper studies the placement of a given set of polyhedra inside a 3D container with one variable dimension, so that the variable dimension of the packed container is minimized. We propose a reinforcement-learning-based algorithm, AC-HAPE3D, which uses the heuristic algorithm HAPE3D to model the problem as a Markov process and then learns with the policy-based reinforcement learning method Actor-Critic. Containers and polyhedra are represented with voxels, which simplifies the state representation, and neural networks represent the value function and the policy function. To handle the variable length of the state information and the variable size of the action space, we mask part of the inputs and outputs and introduce an LSTM to process the variable-length state information. Experiments on five different datasets show that the algorithm achieves good results.

Key words: irregular packing, heuristic algorithm, voxel, reinforcement learning, 3-dimensional printing
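The two ideas the abstract names, a voxel representation and output masking over a variable action space, can be illustrated with a minimal sketch. It is not the paper's implementation: the function names `voxel_fits` and `masked_softmax`, the representation of shapes as sets of integer voxel coordinates, and the choice to derive the mask from a collision test are all assumptions made for illustration.

```python
import math


def voxel_fits(container_dims, occupied, part, offset):
    """Check whether `part`, translated by `offset`, lies inside the
    container and overlaps no already occupied voxel.

    Shapes are sets of integer (x, y, z) voxel coordinates; this encoding
    is an assumption of the sketch, not taken from the paper.
    """
    dx, dy, dz = container_dims
    ox, oy, oz = offset
    for x, y, z in part:
        px, py, pz = x + ox, y + oy, z + oz
        inside = 0 <= px < dx and 0 <= py < dy and 0 <= pz < dz
        if not inside or (px, py, pz) in occupied:
            return False
    return True


def masked_softmax(logits, mask):
    """Softmax restricted to entries where mask is True.

    Masked-out actions receive probability exactly 0, so a fixed-size
    policy output can serve a variable number of valid actions.
    """
    valid = [l for l, m in zip(logits, mask) if m]
    if not valid:
        raise ValueError("at least one action must be valid")
    top = max(valid)  # subtract the maximum for numerical stability
    exps = [math.exp(l - top) if m else 0.0 for l, m in zip(logits, mask)]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical 4x4x4 container with two voxels already filled.
container = (4, 4, 4)
occupied = {(0, 0, 0), (1, 0, 0)}
part = {(0, 0, 0), (0, 1, 0)}  # a bar of two voxels along y
candidate_offsets = [(0, 0, 0), (2, 0, 0), (3, 3, 0), (0, 1, 0)]

# Mask out placements that collide or leave the container.
mask = [voxel_fits(container, occupied, part, off) for off in candidate_offsets]
# Stand-in policy scores; a trained network would produce these.
probs = masked_softmax([2.0, 1.0, 3.0, 1.0], mask)
```

In this example the first offset collides with an occupied voxel and the third leaves the container, so only the second and fourth candidates keep nonzero probability. A learned policy would replace the hard-coded scores, and the mask could equally mark padding entries rather than infeasible placements.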

CLC Number:

TP 391

Cite this article:

ZHU Peng-hui, YUAN Hong-tao, NIE Yong-wei, LI Gui-qing. AC-HAPE3D: an algorithm for irregular packing based on reinforcement learning [J]. Journal of Graphics, 2022, 43(6): 1096-1103.
