欢迎访问《图学学报》

图学学报 ›› 2026, Vol. 47 ›› Issue (2): 322-331.DOI: 10.11996/JG.j.2095-302X.2026020322

• 图像处理与计算机视觉 • 上一篇    下一篇

基于大模型的皮肤病图像掩膜生成与分割

陈梦琪1, 赵俊莉1(), 邓晓丹2   

  1. 1 青岛大学计算机科学技术学院山东 青岛 266071
    2 青岛大学自动化学院山东 青岛 266071
  • 收稿日期:2025-10-22 接受日期:2026-01-12 出版日期:2026-04-30 发布日期:2026-05-20
  • 通讯作者:赵俊莉,E-mail:zhaojl@yeah.net
  • 基金资助:
    山东省自然科学基金面上项目(ZR2024MF087);国家自然科学基金面上项目(62172247);山东省自然科学基金(ZR2024QF184)

SAM-based mask generation and segmentation for dermatological images

CHEN Mengqi1, ZHAO Junli1(), DENG Xiaodan2   

  1. 1 College of Computer Science and Technology, Qingdao University, Qingdao Shandong 266071, China
    2 School of Automation, Qingdao University, Qingdao Shandong 266071, China
  • Received:2025-10-22 Accepted:2026-01-12 Published:2026-04-30 Online:2026-05-20
  • Contact: ZHAO Junli,E-mail:zhaojl@yeah.net
  • Supported by:
    Shandong Provincial Natural Science Foundation General Project(ZR2024MF087);National Natural Science Foundation of China General Project(62172247);Shandong Provincial Natural Science Foundation(ZR2024QF184)

摘要:

作为恶性肿瘤中发病率较高的病种,皮肤癌的及时检出具有重要临床意义,其中皮肤病灶的准确识别与分割是计算机辅助诊断的重要前提。尽管深度学习技术在医学图像分割领域展现出了卓越性能,但现有模型普遍面临着病灶边缘分割精度不足、训练数据规模与多样性受限等问题。为解决上述问题,提出了一种名为BESA-Diff边界增强模型系统,采用边界增强扩散模型DermoSegDiff作为核心分割架构,通过模型优化训练流程。其核心技术贡献体现在:①基于扩散模型构建了皮肤病理性图像及掩膜自动生成框架;②设计了一个创新的掩膜精细化流程:创新性地整合了分割任意模型(SAM)与DermosegDiff的边缘细化模块,构建了高质量的合成医学影像数据集。通过在ISIC2018标准数据集、PH2数据集、HAM10000数据集与合成数据集上的实验结果表明,本模型在戴斯相似系数(Dice)和交并比(IoU)等关键分割指标上均显著优于基线模型。消融实验证实,引入SAM进行掩膜精细化是性能提升的关键,该模块有效改善了病灶边缘的分割效果,特别是在边界模糊或对比度低的区域。研究证实,融合扩散模型的数据生成能力与通用分割模型的边界优化能力,能够有效地提升皮肤病灶分割的精确性与鲁棒性,为皮肤癌辅助诊断提供了一种高性能的解决方案,展现了合成数据技术在突破医学人工智能数据瓶颈方面的巨大潜力。

关键词: 皮肤癌, 数据增强, 扩散模型, 边缘分割, 图像分割

Abstract:

As a malignant tumor with a relatively high incidence rate, the timely detection of skin cancer carries substantial clinical significance. Accurate identification and segmentation of skin lesions serve as critical prerequisites for computer-aided diagnosis. Despite the remarkable performance of deep learning techniques in medical image segmentation, existing models commonly encounter challenges such as insufficient segmentation accuracy at lesion edges and constraints on the scale and diversity of training data. To address these issues, a boundary-enhanced model system named BESA-Diff was proposed. The system employed the boundary-enhanced diffusion model DermoSegDiff as its core segmentation architecture and optimized the model training workflow. The core technical contributions of this research were twofold: First, a framework for the automatic generation of pathological skin images and masks was constructed based on diffusion models. Second, an innovative mask refinement pipeline was designed by innovatively integrating the Segment Anything Model (SAM) with the edge refinement module of DermoSegDiff, and a high-quality synthetic medical image dataset was established. Experimental evaluations on the ISIC2018 standard dataset, PH2 dataset, HAM10000 dataset, and the synthetic dataset demonstrated that the proposed model significantly outperformed baseline models in key segmentation metrics, including the Dice Similarity Coefficient (Dice) and Intersection over Union (IoU). Ablation experiments confirmed that the introduction of SAM for mask refinement was the pivotal factor driving performance improvement. This module effectively enhanced the segmentation of lesion edges, particularly in regions with blurred boundaries or low contrast. The findings of this study validated that integrating the data generation capability of diffusion models with the boundary optimization capability of general segmentation models can effectively improve the accuracy and robustness of skin lesion segmentation. This work provided a high-performance solution for auxiliary diagnosis of skin cancer and highlighted the immense potential of synthetic data technology in overcoming the data bottleneck in medical artificial intelligence.

Key words: skin cancer, data enhancement, diffusion model, edge segmentation, image segmentation

中图分类号: