欢迎访问《图学学报》

图学学报 ›› 2026, Vol. 47 ›› Issue (2): 235-250.DOI: 10.11996/JG.j.2095-302X.2026020235

• 综述 • 上一篇    下一篇

基于生成模型的图像数据增强方法综述

向婷1,2, 唐卓1,2,3, 郑佳丽4, 陈长建1,2(), 吕斐1,2, 李肯立1,2   

  1. 1 湖南大学信息科学与工程学院湖南 长沙 410082
    2 超算与人工智能融合计算教育部重点实验室湖南 长沙 410082
    3 湖南大学深圳研究院广东 深圳 518000
    4 复杂系统控制与智能协同全国重点实验室北京 100074
  • 收稿日期:2025-07-10 接受日期:2025-10-20 出版日期:2026-04-30 发布日期:2026-05-20
  • 通讯作者:陈长建,E-mail:changjianchen@hnu.edu.cn
  • 基金资助:
    国家自然科学基金(62225205);国家自然科学基金(62402167);深圳市自然科学基金面上项目(JCYJ20210324140002006);长沙市科技重大专项(kh2301011);湖南省科技创新计划(2023ZJ1080);湖南省自然科学基金(2025JJ60419);湖南省重大科技专项(2024QK2010);湖南省重大科技专项(2024QK2009);云南省重大科技专项计划项目(202502AD080009);岳麓山实验室种业专项(YLS-2025-ZY01015)

A review of image data augmentation based on generative models

XIANG Ting1,2, TANG Zhuo1,2,3, ZHENG Jiali4, CHEN Changjian1,2(), LYU Fei1,2, LI Kenli1,2   

  1. 1 College of Computer Science and Electronic Engineering, Hunan University, Changsha Hunan 410082, China
    2 The Ministry of Education Key Laboratory of Fusion Computing of Supercomputing and Artificial Intelligence, Changsha Hunan 410082, China
    3 Shenzhen Research Institute, Hunan University, Shenzhen Guangdong 518000, China
    4 National Key Laboratory of Complex System Control and Intelligent Agent Cooperation, Beijing 100074, China
  • Received:2025-07-10 Accepted:2025-10-20 Published:2026-04-30 Online:2026-05-20
  • Contact: CHEN Changjian,E-mail:changjianchen@hnu.edu.cn
  • Supported by:
    National Natural Science Foundation of China(62225205);National Natural Science Foundation of China(62402167);Shenzhen Basic Research Project(JCYJ20210324140002006);Science and Technology Program of Changsha(kh2301011);Science and Technology Innovation Program of Hunan Province(2023ZJ1080);Hunan Natural Science Foundation(2025JJ60419);Major Science and Technology Research Projects of Hunan Province(2024QK2010);Major Science and Technology Research Projects of Hunan Province(2024QK2009);Yunnan Provincial Major Science and Technology Special Plan Projects(202502AD080009);Yuelushan Laboratory Breeding Program(YLS-2025-ZY01015)

摘要:

深度学习在计算机视觉领域展现出巨大的潜力,但其在实际应用中的表现依赖于大量高质量的标注数据。生成模型因其具有生成多样化数据的能力,成为解决数据稀缺问题的有效方法,旨在高效率且有效地为计算机视觉提供训练数据。进而,基于生成模型的图像数据增强技术成为近年来的热点方向。为此,对基于生成模型的图像数据增强方法进行了全面的文献调研,通过三阶段检索方法得到相关的37篇文献,将其方法过程总结归纳为4个步骤,并对每一个步骤采用的方法进行分类与详细描述。首先,从生成模型的选择出发,介绍可用于图像数据增强的各类生成模型;然后,对生成式图像数据增强方法进行分类,并详细介绍每个类别的方法流程和代表性论文,以及存在的问题和亟需优化的方向;考虑到生成数据存在噪声的问题,还介绍了对生成数据的选择和处理方法,以更好地在下游任务中利用生成数据;接着,对数据增强效果验证方法进行分类与描述,以全面验证方法的有效性和鲁棒性。最后,详细阐述了生成式图像数据增强在生成图像的语义一致性、多样性、生成效率和对黑盒模型的应用等方向面临的机遇与挑战,并指出未来潜在的探索方向。

关键词: 数据稀缺, 生成模型, 数据增强, 生成图像数据处理, 效果验证

Abstract:

Deep learning has shown great potential in the field of computer vision, but its performance in practical applications relies heavily on large amounts of high-quality labeled data. Generative models, with their ability to generate diverse data, have become an effective solution to the problem of data scarcity, aiming to provide training data for computer vision efficiently and effectively. Consequently, image data augmentation techniques based on generative models have become a popular research direction in recent years. To this end, a comprehensive literature review was conducted on image data augmentation methods based on generative models. Through a three-stage retrieval process, 37 relevant studies were collected. The methodological processes of these studies were summarized into four main steps, with each step categorized and described in detail. First, various generative models suitable for image data augmentation were introduced, focusing on model selection. Next, generative image data augmentation methods were classified, with elaborations on the workflow, representative studies, existing challenges, and areas in need of optimization for each category. Considering that generated data may contain noise, methods were also discussed for the selection and processing of generated data to better utilize them in downstream tasks. Furthermore, evaluation methods were categorized and described to comprehensively verify the effectiveness and robustness of data augmentation approaches. Finally, the opportunities and challenges faced by generative image data augmentation in aspects were elaborated upon, such as maintaining semantic consistency, ensuring diversity, improving generation efficiency, and applying to black-box models, and pointed out potential directions for future exploration.

Key words: data scarcity, generative model, data augmentation, generated image data processing, effect evaluation

中图分类号: