欢迎访问《图学学报》 分享到:

图学学报 ›› 2026, Vol. 47 ›› Issue (1): 47-56.DOI: 10.11996/JG.j.2095-302X.2026010047

• 图像处理与计算机视觉 • 上一篇    下一篇

基于不确定性引导的智能强化主动学习图像分类方法

酒明远1,2,3, 吴国伟1, 宋旭光1, 李书攀1,2,3, 徐明亮1,2,3()   

  1. 1 郑州大学计算机与人工智能学院河南 郑州 450001
    2 郑州大学智能集群系统教育部工程研究中心河南 郑州 450001
    3 国家超级计算郑州中心河南 郑州 450001
  • 收稿日期:2025-06-13 接受日期:2025-10-10 出版日期:2026-02-28 发布日期:2026-03-16
  • 通讯作者:徐明亮,E-mail:iexumingliang@zzu.edu.cn
  • 基金资助:
    国家自然科学基金(62272422);国家自然科学基金(U22B2051);国家自然科学基金(62325602);河南省优秀青年基金(252300421225);郑州大学有组织科研团队培育项目(35220549)

Image classification method based on uncertainty-driven smart reinforcement active learning

JIU Mingyuan1,2,3, WU Guowei1, SONG Xuguang1, LI Shupan1,2,3, XU Mingliang1,2,3()   

  1. 1 School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou Henan 450001, China
    2 Engineering Research Center of Intelligent Swarm Systems, Ministry of Education, Zhengzhou University, Zhengzhou Henan 450001, China
    3 National Supercomputing Center in Zhengzhou, Zhengzhou Henan 450001, China
  • Received:2025-06-13 Accepted:2025-10-10 Published:2026-02-28 Online:2026-03-16
  • Supported by:
    National Natural Science Foundation of China(62272422);National Natural Science Foundation of China(U22B2051);National Natural Science Foundation of China(62325602);Natural Science Foundation of Henan Province(252300421225);Organized Young Scientific Research Team Cultivation Foundation of Zhengzhou University(35220549)

摘要:

随着深度学习技术的快速发展,其在图像分类等任务中取得了显著成果。然而,这些模型的成功往往依赖于大量高质量的标注数据,而在实际应用中,标注数据通常稀缺,人工标注过程又极为耗时、费力,限制了模型的推广与应用。近年来,主动学习因其能够在有限标注预算下提升模型性能而受到广泛关注,其核心思想是根据样本的不确定性、多样性或代表性等指标,挑选最有价值的数据进行标注。针对传统主动学习方法多依赖手动设计的启发式采样策略,难以适应不同任务场景,且选择策略难以动态优化等问题,提出一种基于智能强化主动学习(SRAL)的图像分类方法,通过将样本选择过程建模为马尔科夫决策过程,利用强化学习的自适应策略优化能力,引导模型从未标注样本中动态挑选最具价值的样本用于标注。其中,状态由未标注样本提取的特征构成,动作表示是否选择样本进行标注,奖励函数则定义为当前样本加入训练集后模型准确率的变化差值。采用演员-评论家(Actor-Critic)算法进行策略优化,并引入不确定性启发式排序作为辅助信息以提升学习效率。实验结果表明,在CIFAR-10,SVHN和FASHION-MNIST等数据集上,所提出的SRAL方法在相同标注预算下,相比于其他主动学习方法,能够显著提高分类准确率,且在各数据集上均展现出较好的稳定性和泛化能力,验证了SRAL方法在提高图像分类模型性能方面的有效性与优势。

关键词: 深度学习, 强化学习, 主动学习, 图像分类, 策略优化

Abstract:

With the rapid development of deep learning, remarkable achievements have been made in image classification and related tasks. However, the success of these models heavily relies on large amounts of high-quality labeled data. In real-world applications, labeled data is often scarce, and manual annotation is time-consuming, labor-intensive, and costly, which limits the scalability and deployment of deep learning models. In recent years, active learning has gained significant attention due to its ability to improve model performance under limited annotation budgets. The core idea of active learning is to select the most valuable data for labeling based on certain criteria such as uncertainty, diversity, or representativeness. To address the limitations of traditional active learning methods, which often rely on manually designed heuristic sampling strategies that struggle to adapt to different task scenarios and are difficult to dynamically optimize, a Smart Reinforcement Active Learning (SRAL) approach for image classification is proposed. The sample selection process is modeled as a MARKOV DECISION PRocess (MDP), leveraging reinforcement learning’s adaptive strategy optimization ability to guide the model in dynamically selecting the most valuable samples from the unlabeled data for labeling. In this framework, the state is represented by features extracted from the unlabeled samples, the action indicates whether a sample should be selected for labeling, and the reward function is defined as the change in model accuracy after incorporating the selected sample into the training set. The Actor-Critic algorithm is adopted to optimize the sampling policy, and uncertainty-based heuristic ranking is incorporated as auxiliary information to improve the learning efficiency. Experimental results demonstrate that the proposed SRAL method significantly improves classification accuracy under the same labeling budget compared to other active learning approaches on datasets such as CIFAR-10, SVHN, and FASHION-MNIST. Furthermore, SRAL exhibits robust stability and strong generalization ability across these datasets. This confirms the effectiveness and advantages of SRAL in enhancing the performance of image classification models.

Key words: deep learning, reinforcement learning, active learning, image classification, policy optimization

中图分类号: