欢迎访问《图学学报》 分享到:

图学学报

• 视觉与图像 • 上一篇    下一篇

基于矩阵2-范数池化的卷积神经网络图像识别算法

  

  • 出版日期:2016-10-31 发布日期:2016-10-20

Image Recognition Algorithm of Convolutional Neural Networks Based on Matrix 2-Norm Pooling

  • Online:2016-10-31 Published:2016-10-20

摘要: 卷积神经网络中的池化操作可以实现图像变换的缩放不变性,并且对噪声和杂波有
很好的鲁棒性。针对图像识别中池化操作提取局部特征时忽略了隐藏在图像中的能量信息的问
题,根据图像的能量与矩阵的奇异值之间的关系,并且考虑到图像信息的主要能量集中于奇异值
中数值较大的几个,提出一种矩阵2-范数池化方法。首先将前一卷积层特征图划分为若干个互不
重叠的子块图像,然后分别计算子块图像矩阵的奇异值,将最大奇异值作为每个池化区域的统计
结果。利用5 种不同的池化方法在Cohn-Kanade、Caltech-101、MNIST 和CIFAR-10 数据集上进
行了大量实验,实验结果表明,相比较于其他方法,该方法具有更好地识别效果和稳健性。

关键词: 深度学习, 卷积神经网络, 矩阵2-范数, 池化, 奇异值

Abstract: The pooling operation in convolutional neural networks can achieve the scale invariance of
image transformations, and has better robustness to noise and clutter. In view of the problem that
pooling operation ignores the energy information hidden in the image when it extracts local features for
image recognition, according to the relationship between energy of the image and singular value of the
matrix, and taking into account the image information of the energy mainly concentrates on the larger
singular value, a pooling method based on matrix 2-norm was proposed. The former feature map of
convolutional layer is divided into several non-overlapping sub blocks, and then singular value of the
matrix is calculated. The maximum value is used as the statistical results of each pooling region.
Various numerical experiments has been carried out based on Cohn-Kanade, Caltech-101, MNIST and
CIFAR-10 database using different kinds of pooling method. Experimental results show that the
proposed method is superior in both recognition rate and robustness compared with other methods.

Key words: deep learning, convolutional neural networks, matrix 2-norm, pooling, singular value