Journal of Graphics ›› 2026, Vol. 47 ›› Issue (2): 322-331.DOI: 10.11996/JG.j.2095-302X.2026020322
• Image Processing and Computer Vision • Previous Articles Next Articles
CHEN Mengqi1, ZHAO Junli1(
), DENG Xiaodan2
Received:2025-10-22
Accepted:2026-01-12
Online:2026-04-30
Published:2026-05-20
Contact:
ZHAO Junli
Supported by:CLC Number:
CHEN Mengqi, ZHAO Junli, DENG Xiaodan. SAM-based mask generation and segmentation for dermatological images[J]. Journal of Graphics, 2026, 47(2): 322-331.
Add to citation manager EndNote|Ris|BibTeX
URL: http://www.txxb.com.cn/EN/10.11996/JG.j.2095-302X.2026020322
| 实验组别 | Dice | IoU |
|---|---|---|
| Baseline (Real) | 0.917 3 | 0.860 1 |
| Syn-DiffMask | 0.911 7 | 0.844 1 |
| Syn-SAMMask | 0.922 8 | 0.887 0 |
Table 1 Mask performance comparison
| 实验组别 | Dice | IoU |
|---|---|---|
| Baseline (Real) | 0.917 3 | 0.860 1 |
| Syn-DiffMask | 0.911 7 | 0.844 1 |
| Syn-SAMMask | 0.922 8 | 0.887 0 |
Fig. 6 Comparison of skin lesion segmentation results ((a), (e) Original synthetic image; (b), (f) Mask generated by the Diffusion model (the red area represents the segmentation result of the Diffusion model); (c), (g) Mask generated by the SAM model (the green area represents the segmentation result of SAM); (d), (h) The superimposed result of the two (the yellow area represents the overlapping part of the two methods))
| 方法 | Dice ↑ | IoU ↑ | BF4 ↑ |
|---|---|---|---|
| BASE-Diff(ISIC2018) | 0.893 2 | 0.819 2 | 0.192 3 |
| BASE-Diff (+ 1 000张合成图像) | 0.918 5 | 0.855 4 | 0.222 9 |
Table 2 Model segmentation performance on ISIC2018
| 方法 | Dice ↑ | IoU ↑ | BF4 ↑ |
|---|---|---|---|
| BASE-Diff(ISIC2018) | 0.893 2 | 0.819 2 | 0.192 3 |
| BASE-Diff (+ 1 000张合成图像) | 0.918 5 | 0.855 4 | 0.222 9 |
| 病灶类型 | 方法 | Dice↑ | IoU↑ |
|---|---|---|---|
| 黑色素瘤 | Baseline | 0.878 3 | 0.805 0 |
| BESA-Diff | 0.901 5 | 0.825 5 | |
| 基底细胞癌 | Baseline | 0.885 1 | 0.812 2 |
| BESA-Diff | 0.912 0 | 0.840 1 | |
| 痣 | Baseline | 0.915 8 | 0.848 9 |
| BESA-Diff | 0.925 3 | 0.862 0 | |
| 光化性角化病 | Baseline | 0.868 8 | 0.798 8 |
| BESA-Diff | 0.895 2 | 0.815 5 |
Table 3 Segmentation performance across different skin lesion types
| 病灶类型 | 方法 | Dice↑ | IoU↑ |
|---|---|---|---|
| 黑色素瘤 | Baseline | 0.878 3 | 0.805 0 |
| BESA-Diff | 0.901 5 | 0.825 5 | |
| 基底细胞癌 | Baseline | 0.885 1 | 0.812 2 |
| BESA-Diff | 0.912 0 | 0.840 1 | |
| 痣 | Baseline | 0.915 8 | 0.848 9 |
| BESA-Diff | 0.925 3 | 0.862 0 | |
| 光化性角化病 | Baseline | 0.868 8 | 0.798 8 |
| BESA-Diff | 0.895 2 | 0.815 5 |
Fig. 9 Comparison of skin lesion segmentation results ((a) Original dermatological image; (b) Ground truth mask; (c) Predicted results (green regions))
| 方法 | PH2 | HAM10000 | ||
|---|---|---|---|---|
| Dice↑ | IoU↑ | Dice↑ | IoU↑ | |
| Baseline (PH2) | 0.865 8 | 0.770 1 | 0.848 5 | 0.752 8 |
| BESA-Diff (生成图像) | 0.881 2 | 0.788 3 | 0.862 7 | 0.763 9 |
Table 4 Model segmentation performance on PH2 and HAM10000
| 方法 | PH2 | HAM10000 | ||
|---|---|---|---|---|
| Dice↑ | IoU↑ | Dice↑ | IoU↑ | |
| Baseline (PH2) | 0.865 8 | 0.770 1 | 0.848 5 | 0.752 8 |
| BESA-Diff (生成图像) | 0.881 2 | 0.788 3 | 0.862 7 | 0.763 9 |
| [1] | CAO H, WANG Y Y, CHEN J, et al. Swin-Unet: Unet-like pure transformer for medical image segmentation[C]// European Conference on Computer Vision. Cham: Springer, 2023: 205-218. |
| [2] | CHEN J N, LU Y Y, YU Q H, et al. TransUNet: Transformers make strong encoders for medical image segmentation[EB/OL]. [2025-07-21]. https://doi.org/10.48550/arXiv.2102.04306. |
| [3] |
HUANG X H, DENG Z F, LI D D, et al. MISSFormer: an effective transformer for 2D medical image segmentation[J]. IEEE Transactions on Medical Imaging, 2023, 42(5): 1484-1494.
DOI PMID |
| [4] | HO J, JAIN A, ABBEEL P. Denoising diffusion probabilistic models[EB/OL]. [2025-07-21]. https://doi.org/10.48550/arXiv.2006.11239. |
| [5] |
AZAD R, KAZEROUNI A, HEIDARI M, et al. Advances in medical image analysis with vision transformers: a comprehensive review[J]. Medical Image Analysis, 2024, 91: 103000.
DOI URL |
| [6] | KUMAR A, KANTHEN K R, JOHN J. GS-TransUNet:integrated 2D Gaussian splatting and transformer UNet for accurate skin lesion analysis[EB/OL]. [2025-07-21]. https://doi.org/10.48550/arXiv.2502.16748. |
| [7] | BOZORGPOUR A, SADEGHEIH Y, KAZEROUNI A, et al. DermoSegDiff: a boundary-aware segmentation diffusion model for skin lesion delineation[C]// The 6th International Workshop on Predictive intelligence in Medicine. Cham: Springer, 2023: 146-158. |
| [8] | CHEN L C, ZHU Y K, GEORGE P, et al. Encoder-decoder with atrous separable convolution for semantic image segmentation[C]// The 15th European Conference on Computer Vision. Cham: Springer, 2018: 833-851. |
| [9] | CODELLA N, ROTEMBERG V, TSCHANDL P, et al. Skin lesion analysis toward melanoma detection 2018:a challenge hosted by the international skin imaging collaboration (ISIC)[EB/OL]. [2025-07-21]. https://doi.org/10.48550/arXiv.1902.03368. |
| [10] | KIRILLOV A, MINTUN E, RAVI N, et al. Segment anything[EB/OL]. [2025-07-21]. https://doi.org/10.48550/arXiv.2304.02643. |
| [11] | FANG L X, XU Y Y, MA X, et al. Minding fuzzy regions: a data-driven alternating learning paradigm for stable lesion segmentation[C]// 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Computer Vision and Pattern Recognition. New York: IEEE Press, 2025: 10425-10434. |
| [12] | JIANG L, ZHOU Z Y, LEUNG T, et al. MentorNet: learning data-driven curriculum for very deep neural networks on corrupted labels[EB/OL]. [2025-07-21]. https://doi.org/10.48550/arXiv.1712.05055. |
| [13] | RONG S H, TU B H, WANG Z L, et al. Boundary-enhanced co-training for weakly supervised semantic segmentation[C]// 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2023: 19574-19584. |
| [14] |
YANG H R, HUA W B, XU Z B, et al. Domain-generalized discrete diffusion model for cross-domain medical image segmentation[J]. IEEE Transactions on Medical Imaging, 2025, 44(11): 4131-4142.
DOI URL |
| [15] |
XIE P T, ZHANG L, JINDAL B, et al. Generative AI enables medical image segmentation in ultra low-data regimes[J]. Nature Communications, 2025, 16(1): 6486.
DOI |
| [16] |
KAZEROUNI A, AGHDAM E K, HEIDARI M, et al. Diffusion models in medical imaging: a comprehensive survey[J]. Medical Image Analysis, 2023, 88: 102846.
DOI URL |
| [17] | KIM B, OH Y, YE J C. Diffusion adversarial representation learning for self-supervised vessel segmentation[EB/OL]. [2025-07-21]. https://doi.org/10.48550/arXiv.2209.14566. |
| [18] | KHAN T M, LIN D, IQBAL S, et al. TAFM-Net: a novel approach to skin lesion segmentation using transformer attention and focal modulation[EB/OL]. [2025-07-21]. https://doi.org/10.48550/arXiv.2411.17556. |
| [19] |
LEI B Y, XIA Z M, JIANG F, et al. Skin lesion segmentation via generative adversarial networks with dual discriminators[J]. Medical Image Analysis, 2020, 64: 101716.
DOI URL |
| [20] |
LIU X W, YANG L, CHEN J G, et al. Region-to-boundary deep learning model with multi-scale feature fusion for medical image segmentation[J]. Biomedical Signal Processing and Control, 2022, 71: 103165.
DOI URL |
| [21] |
BI L, FULHAM M, KIM J. Hyper-fusion network for semi-automatic segmentation of skin lesions[J]. Medical Image Analysis, 2022, 76: 102334.
DOI URL |
| [22] | MOLAEI A, AMINIMEHR A, TAVAKOLI A, et al. Implicit neural representation in medical imaging: a comparative survey[C]// 2023 IEEE/CVF International Conference on Computer Vision Workshops. New York: IEEE Press, 2023: 2373-2383. |
| [23] | NICHOL A, DHARIWAL P. Improved denoising diffusion probabilistic models[EB/OL]. [2025-07-21]. https://doi.org/10.48550/arXiv.2102.09672. |
| [24] | WANG H N, CAO P, WANG J Q, et al. UCTransnet: rethinking the skip connections in U-Net from a channel-wise perspective with transformer[C]// The 36th AAAI Conference on Artificial Intelligence. Philadelphia: AAAI, 2022: 2441-2449. |
| [25] | WANG J C, WEI L, WANG L S, et al. Boundary-aware transformers for skin lesion segmentation[C]// The 24th International Conference on Medical Image Computing and Computer-Assisted Intervention. Cham: Springer, 2021: 206-216. |
| [1] | ZHAO Zhenbing, ZHANG Jingliang, TANG Chenkang, BI Yuxuan, LI Haopeng. Precise-oil leakage segmentation for substation equipment under water-accumulation interference [J]. Journal of Graphics, 2026, 47(2): 296-310. |
| [2] | ZHOU Jin, ZHOU Yi, XU Pengfei, HUANG Hui. PDF-Sketch: layout-based sketch generation via primitive distance fields and discrete diffusion [J]. Journal of Graphics, 2026, 47(2): 380-389. |
| [3] | LIU Jinghao, YOU Zhenguo, DU Dong. Conditional generation of CAD models based on latent diffusion models [J]. Journal of Graphics, 2026, 47(2): 390-401. |
| [4] | YE Wenlong, CHEN Bin. PanoLoRA: an efficient finetuning method for panoramic image generation based on Stable Diffusion [J]. Journal of Graphics, 2025, 46(5): 980-989. |
| [5] | LEI Songlin, ZHAO Zhengpeng, YANG Qiuxia, PU Yuanyuan, GU Jinjing, XU Dan. Zero-shot style transfer based on decoupled diffusion models [J]. Journal of Graphics, 2025, 46(4): 727-738. |
| [6] | ZHANG Tiansheng, ZHU Minfeng, REN Yiwen, WANG Chenhan, ZHANG Lidong, ZHANG Wei, CHEN Wei. BPA-SAM: box prompt augmented SAM for traditional Chinese realistic painting [J]. Journal of Graphics, 2025, 46(2): 322-331. |
| [7] | SUN Heyi, LI Yixiao, TIAN Xi, ZHANG Songhai. Image to 3D vase generation technology combining procedural content generation and diffusion models [J]. Journal of Graphics, 2025, 46(2): 332-344. |
| [8] | LI Jiyuan, GUAN Zheyu, SONG Haichuan, TAN Xin, MA Lizhuang. Human-in-the-loop field-specific logo generation method [J]. Journal of Graphics, 2025, 46(2): 382-392. |
| [9] | TU Qinghao, LI Yuanqi, LIU Yifan, GUO Jie, GUO Yanwen. Generalization optimization method for text to material texture maps based on diffusion model [J]. Journal of Graphics, 2025, 46(1): 139-149. |
| [10] | ZHANG Ji, CUI Wenshuai, ZHANG Ronghua, WANG Wenbin, LI Yaqi. A text-driven 3D scene editing method based on key views [J]. Journal of Graphics, 2024, 45(4): 834-844. |
| [11] | LIU Yi, QIU Junhai, ZHANG Jiaxing, ZHANG Xiaofeng, WANG Hua, ZHANG Caiming. A highly robust image segmentation algorithm based on trade-off factors and multidimensional spatial metrics [J]. Journal of Graphics, 2024, 45(3): 482-494. |
| [12] | WANG Ji, WANG Sen, JIANG Zhi-wen, XIE Zhi-feng, LI Meng-tian. Zero-shot text-driven avatar generation based on depth-conditioned diffusion model [J]. Journal of Graphics, 2023, 44(6): 1218-1226. |
| [13] | WANG Dao-lei, KANG Bo, ZHU Rui. Text detection method for electrical equipment nameplates based on deep learning [J]. Journal of Graphics, 2023, 44(4): 691-698. |
| [14] | XIAO Tian-xing, WU Jing-jing. Segmentation of laser coding characters based on residual and feature-grouped attention [J]. Journal of Graphics, 2023, 44(3): 482-491. |
| [15] | ZENG Wu, ZHU Heng-liang, XING Shu-li, LIN Jiang-hong, MAO Guo-jun. Saliency detection-guided for image data augmentation [J]. Journal of Graphics, 2023, 44(2): 260-270. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||