Journal of Graphics ›› 2026, Vol. 47 ›› Issue (2): 351-359.DOI: 10.11996/JG.j.2095-302X.2026020351
• Image Processing and Computer Vision • Previous Articles Next Articles
ZHANG Zhou, WANG Zeyu(
), SONG Haiyu, LI Wei, GE Mingyu, WANG Jiayu, WANG Wenqi
Received:2025-05-22
Accepted:2025-12-04
Online:2026-04-30
Published:2026-05-20
Contact:
WANG Zeyu
Supported by:CLC Number:
ZHANG Zhou, WANG Zeyu, SONG Haiyu, LI Wei, GE Mingyu, WANG Jiayu, WANG Wenqi. Multi-focus image fusion based on 3D manifold fitting and frequency division-guided attention mechanism[J]. Journal of Graphics, 2026, 47(2): 351-359.
Add to citation manager EndNote|Ris|BibTeX
URL: http://www.txxb.com.cn/EN/10.11996/JG.j.2095-302X.2026020351
Fig. 3 Qualitative comparison of the experimental results between the proposed algorithm and the competing methods ((a) Source image 1;(b) Source image 2;(c) CU-Net;(d) DeFusion;(e) DIF-Net;(f) FusionDiff;(g) U2Fusion;(h) Ours)
| 算法 | QMI | QNICE | VIFP | PSNR | CORR | SSIM |
|---|---|---|---|---|---|---|
| Cu-Net[ | 0.733 5 | 0.820 7 | 0.679 4 | 26.035 3 | 0.970 0 | 0.862 4 |
| DeFusion[ | 0.832 2 | 0.824 8 | 0.734 4 | 29.062 7 | 0.981 4 | 0.912 5 |
| DIF-Net[ | 0.848 6 | 0.825 0 | 0.708 1 | 26.523 2 | 0.982 5 | 0.911 7 |
| FusionDiff[ | 0.901 0 | 0.828 2 | 0.733 4 | 26.872 1 | 0.977 0 | 0.893 0 |
| U2Fusion[ | 0.796 6 | 0.823 0 | 0.724 6 | 25.863 2 | 0.974 9 | 0.874 3 |
| 本文算法 | 0.949 5 | 0.851 1 | 0.793 9 | 32.681 9 | 0.988 5 | 0.932 5 |
Table 1 Quantitative comparison of experimental results between the proposed algorithm and other methods “Lytro”
| 算法 | QMI | QNICE | VIFP | PSNR | CORR | SSIM |
|---|---|---|---|---|---|---|
| Cu-Net[ | 0.733 5 | 0.820 7 | 0.679 4 | 26.035 3 | 0.970 0 | 0.862 4 |
| DeFusion[ | 0.832 2 | 0.824 8 | 0.734 4 | 29.062 7 | 0.981 4 | 0.912 5 |
| DIF-Net[ | 0.848 6 | 0.825 0 | 0.708 1 | 26.523 2 | 0.982 5 | 0.911 7 |
| FusionDiff[ | 0.901 0 | 0.828 2 | 0.733 4 | 26.872 1 | 0.977 0 | 0.893 0 |
| U2Fusion[ | 0.796 6 | 0.823 0 | 0.724 6 | 25.863 2 | 0.974 9 | 0.874 3 |
| 本文算法 | 0.949 5 | 0.851 1 | 0.793 9 | 32.681 9 | 0.988 5 | 0.932 5 |
| 算法 | QMI | QNICE | VIFP | PSNR | CORR | SSIM |
|---|---|---|---|---|---|---|
| Cu-Net[ | 0.673 7 | 0.816 6 | 0.577 7 | 24.020 7 | 0.954 1 | 0.822 5 |
| DeFusion[ | 0.744 5 | 0.810 6 | 0.644 4 | 24.366 1 | 0.968 5 | 0.868 1 |
| DIF-Net[ | 0.773 9 | 0.810 4 | 0.614 4 | 23.908 4 | 0.968 6 | 0.846 8 |
| FusionDiff[ | 0.805 3 | 0.811 2 | 0.658 3 | 23.255 6 | 0.964 8 | 0.842 9 |
| U2Fusion[ | 0.746 7 | 0.818 8 | 0.617 9 | 24.072 8 | 0.960 5 | 0.830 3 |
| 本文算法 | 0.877 4 | 0.831 2 | 0.708 3 | 28.826 3 | 0.975 2 | 0.872 7 |
Table 2 Quantitative comparison of experimental results between the proposed method and other algorithms “MFFW”
| 算法 | QMI | QNICE | VIFP | PSNR | CORR | SSIM |
|---|---|---|---|---|---|---|
| Cu-Net[ | 0.673 7 | 0.816 6 | 0.577 7 | 24.020 7 | 0.954 1 | 0.822 5 |
| DeFusion[ | 0.744 5 | 0.810 6 | 0.644 4 | 24.366 1 | 0.968 5 | 0.868 1 |
| DIF-Net[ | 0.773 9 | 0.810 4 | 0.614 4 | 23.908 4 | 0.968 6 | 0.846 8 |
| FusionDiff[ | 0.805 3 | 0.811 2 | 0.658 3 | 23.255 6 | 0.964 8 | 0.842 9 |
| U2Fusion[ | 0.746 7 | 0.818 8 | 0.617 9 | 24.072 8 | 0.960 5 | 0.830 3 |
| 本文算法 | 0.877 4 | 0.831 2 | 0.708 3 | 28.826 3 | 0.975 2 | 0.872 7 |
| 实验 | QMI | QNICE | VIFP | PSNR | CORR | SSIM |
|---|---|---|---|---|---|---|
| 消除三维流形拟合模块 | 0.632 3 | 0.532 6 | 0.685 8 | 25.373 1 | 0.832 7 | 0.792 3 |
| 消除分频引导注意力模块 | 0.704 7 | 0.642 7 | 0.452 7 | 25.427 4 | 0.842 2 | 0.810 4 |
| 完整模型 | 0.949 5 | 0.851 1 | 0.793 9 | 32.681 9 | 0.988 5 | 0.932 5 |
Table 3 Quantitative ablation comparison on the “Lytro” dataset
| 实验 | QMI | QNICE | VIFP | PSNR | CORR | SSIM |
|---|---|---|---|---|---|---|
| 消除三维流形拟合模块 | 0.632 3 | 0.532 6 | 0.685 8 | 25.373 1 | 0.832 7 | 0.792 3 |
| 消除分频引导注意力模块 | 0.704 7 | 0.642 7 | 0.452 7 | 25.427 4 | 0.842 2 | 0.810 4 |
| 完整模型 | 0.949 5 | 0.851 1 | 0.793 9 | 32.681 9 | 0.988 5 | 0.932 5 |
| 实验 | QMI | QNICE | VIFP | PSNR | CORR | SSIM |
|---|---|---|---|---|---|---|
| 消除三维流形拟合模块 | 0.474 3 | 0.683 4 | 0.623 8 | 24.934 2 | 0.734 7 | 0.812 3 |
| 消除分频引导注意力模块 | 0.694 2 | 0.713 7 | 0.517 1 | 25.421 6 | 0.817 9 | 0.774 5 |
| 完整模型 | 0.877 4 | 0.831 2 | 0.708 3 | 28.826 3 | 0.975 2 | 0.872 7 |
Table 4 Quantitative ablation comparison on the “MFFW” dataset
| 实验 | QMI | QNICE | VIFP | PSNR | CORR | SSIM |
|---|---|---|---|---|---|---|
| 消除三维流形拟合模块 | 0.474 3 | 0.683 4 | 0.623 8 | 24.934 2 | 0.734 7 | 0.812 3 |
| 消除分频引导注意力模块 | 0.694 2 | 0.713 7 | 0.517 1 | 25.421 6 | 0.817 9 | 0.774 5 |
| 完整模型 | 0.877 4 | 0.831 2 | 0.708 3 | 28.826 3 | 0.975 2 | 0.872 7 |
| 算法 | 时间 |
|---|---|
| CU-Net | 0.042 0 |
| DIF-Net | 0.041 0 |
| U2Fusion | 0.041 0 |
| DeFusion | 0.045 2 |
| FusionDiff | 12.572 0 |
| 本文算法 | 0.032 0 |
Table 5 Average running time of various fusion algorithms on the two test sets /s
| 算法 | 时间 |
|---|---|
| CU-Net | 0.042 0 |
| DIF-Net | 0.041 0 |
| U2Fusion | 0.041 0 |
| DeFusion | 0.045 2 |
| FusionDiff | 12.572 0 |
| 本文算法 | 0.032 0 |
| [1] | 李奕, 吴小俊. 香农熵加权稀疏表示图像融合方法研究[J]. 自动化学报, 2014, 40(8): 1819-1835. |
| LI Y, WU X J. Image fusion based on sparse representation using Shannon entropy weighting[J]. Acta Automatica Sinica, 2014, 40(8): 1819-1835 (in Chinese). | |
| [2] | ZHAO L B, ZHANG X L, HUANG B, et al. MFANet: multi-feature aggregation network for multi-focus image fusion[C]// ICASSP 2025-2025 IEEE International Conference on Acoustics, Speech and Signal Processing. New York: IEEE Press, 2025: 1-5. |
| [3] |
HU X Y, JIANG J J, LIU X M, et al. ZMFF: Zero-shot multi-focus image fusion[J]. Information Fusion, 2023, 92: 127-138.
DOI URL |
| [4] |
LIU J Y, LI S T, LIU H B, et al. A lightweight pixel-level unified image fusion network[J]. IEEE Transactions on Neural Networks and Learning Systems, 2024, 35(12): 18120-18132.
DOI URL |
| [5] |
BAI H W, ZHAO Z X, ZHANG J S, et al. ReFusion: learning image fusion from reconstruction with learnable loss via meta-learning[J]. International Journal of Computer Vision, 2025, 133(5): 2547-2567.
DOI |
| [6] |
WANG Z Y, LI X F, ZHAO L B, et al. When multi-focus image fusion networks meet traditional edge-preservation technology[J]. International Journal of Computer Vision, 2023, 131(10): 2529-2552.
DOI |
| [7] |
LIU Y, CHEN X, PENG H, et al. Multi-focus image fusion with a deep convolutional neural network[J]. Information Fusion, 2017, 36: 191-207.
DOI URL |
| [8] |
AMIN-NAJI M, AGHAGOLZADEH A, EZOJI M. Ensemble of CNN for multi-focus image fusion[J]. Information Fusion, 2019, 51: 201-214.
DOI URL |
| [9] |
LI J X, GUO X B, LU G M, et al. DRPL: deep regression pair learning for multi-focus image fusion[J]. IEEE Transactions on Image Processing, 2020, 29: 4816-4831.
DOI URL |
| [10] |
潘树焱, 刘立群. MSFAFuse: 基于多尺度特征信息与注意力机制的SAR和可见光图像融合模型[J]. 图学学报, 2025, 46(2): 300-311.
DOI |
|
PAN S Y, LIU L Q. MSFAFuse: SAR and optical image fusion model based on multi-scale feature information and attention mechanism[J]. Journal of Graphics, 2025, 46(2): 300-311 (in Chinese).
DOI |
|
| [11] | ZHAO Z X, BAI H W, ZHANG J S, et al. CDDFuse: correlation-driven dual-branch feature decomposition for multi-modality image fusion[C]// 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2023: 5906-5916. |
| [12] | ZHAO Z X, XU S, ZHANG C X, et al. DIDFuse: deep image decomposition for infrared and visible image fusion[EB/OL]. [2025-03-22]. https://www.ijcai.org/proceedings/2020/135. |
| [13] |
DENG X, DRAGOTTI P L. Deep convolutional neural network for multi-modal image restoration and fusion[J]. IEEE transactions on pattern analysis and machine intelligence, 2021, 43(10): 3333-3348.
DOI URL |
| [14] | HORÉ A, ZIOU D. Image quality metrics: PSNR vs. SSIM[C]// The 20th International Conference on Pattern Recognition. New York: IEEE Press, 2010: 2366-2369. |
| [15] |
ZHANG J C, LIAO Q M, LIU S J, et al. Real-MFF: a large realistic multi-focus image dataset with ground truth[J]. Pattern Recognition Letters, 2020, 138: 370-377.
DOI URL |
| [16] |
NEJATI M, SAMAVI S, SHIRANI S. Multi-focus image fusion using dictionary-based sparse representation[J]. Information Fusion, 2015, 25: 72-84.
DOI URL |
| [17] | XU S, WEI X L, ZHANG C X, et al. MFFW: a new dataset for multi-focus image fusion[EB/OL]. [2025-12-04]. https://arxiv.org/abs/2002.04780.pdf. |
| [18] |
QU G H, ZHANG D L, YAN P F. Information measure for performance of image fusion[J]. Electronics Letters, 2002, 38(7): 313-315.
DOI URL |
| [19] | WANG Q, SHEN Y, JIN J. Performance evaluation of image fusion techniques[M]//STATHAKI T. Image Fusion: Algorithms and Applications. Amsterdam: Academic Press, 2008: 469-492. |
| [20] | WILLIAMS S. Pearson’s correlation coefficient[J]. The New Zealand Medical Journal, 1996, 109(1015): 38. |
| [21] | LIANG P W, JIANG J J, LIU X M, et al. Fusion from decomposition: a self-supervised decomposition approach for image fusion[C]// The 17th European Conference on Computer Vision. Cham: Springer, 2022: 719-735. |
| [22] |
JUNG H, KIM Y, JANG H, et al. Unsupervised deep image fusion with structure tensor representations[J]. IEEE Transactions on Image Processing, 2020, 29: 3845-3858.
DOI URL |
| [23] |
LI M N, PEI R H, ZHENG T Y, et al. FusionDiff: multi-focus image fusion using denoising diffusion probabilistic models[J]. Expert Systems with Applications, 2024, 238: 121664.
DOI URL |
| [24] |
XU H, MA J Y, JIANG J, et al. U2Fusion: a unified unsupervised image fusion network[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(1): 502-518.
DOI URL |
| [1] | GUO Yecai, HU Xiaowei, AMITAVE Saha, MAO Xiangnan. Multiscale dense interactive attention residual real image denoising network [J]. Journal of Graphics, 2025, 46(2): 279-287. |
| [2] | DONG Jiale, DENG Zhengjie, LI Xiyan, WANG Shiyun. Deepfake detection method based on multi-feature fusion of frequency domain and spatial domain [J]. Journal of Graphics, 2025, 46(1): 104-113. |
| [3] | ZHAI Yongjie, LI Jiawei, CHEN Nianhao, WANG Qianming, WANG Xinying. The vehicle parts detection method enhanced with Transformer integration [J]. Journal of Graphics, 2024, 45(5): 930-940. |
| [4] | WEI Min, YAO Xin. Two-stage storm entity prediction based on multiscale and attention [J]. Journal of Graphics, 2024, 45(4): 696-704. |
| [5] | GUO Zongyang, LIU Lidong, JIANG Donghua, LIU Zixiang, ZHU Shukang, CHEN Jinghua. Human action recognition algorithm based on semantics guided neural networks [J]. Journal of Graphics, 2024, 45(1): 26-34. |
| [6] | WANG Xinyu, LIU Hui, ZHU Jicheng, SHENG Yurui, ZHANG Caiming. Deep multimodal medical image fusion network based on high-low frequency feature decomposition [J]. Journal of Graphics, 2024, 45(1): 65-77. |
| [7] | LIU Bing, YE Cheng-xu. Fine-grained classification model of lung disease for imbalanced data [J]. Journal of Graphics, 2023, 44(3): 513-520. |
| [8] | ZHANG Chen-yang, CAO Yan-hua, YANG Xiao-zhong. Multi-focus image fusion method based on fractional wavelet combined with guided filtering [J]. Journal of Graphics, 2023, 44(1): 77-87. |
| [9] | DONG Zhe-tong, LIN Hong-wei. Computer aided topological design ——survey on geometric design and processing based on persistent homology [J]. Journal of Graphics, 2022, 43(6): 957-966. |
| [10] | GUO Wen , LI Dong , YUAN Fei. 1. School of Information and Electronic Engineering, Shandong Technology and Business University, Yantai Shandong 264005, China; 2. Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100195, China [J]. Journal of Graphics, 2022, 43(6): 1124-1133. |
| [11] | MO Han-lin, HAO You, GUO Rui, HAO Hong-xiang, ZHANG He, LI Qi, LI Hua, . The construction and application of integral invariants and differential invariants of graphics and images [J]. Journal of Graphics, 2022, 43(6): 1182-1192. |
| [12] | CAO Li, WU Yao, XU Yi-ke. 3D model wireframe extraction method based on medial axis expression [J]. Journal of Graphics, 2022, 43(3): 461-468. |
| [13] | QIN Yu , CAO Li, WU Yao , LI Lin, . A novel wire frame generation for triangle meshes [J]. Journal of Graphics, 2021, 42(6): 963-969. |
| [14] | ZHANG Fan , YIN Xin , XU Yu-yang , HAO Peng-yi . Multi-lead heartbeat signal classification based on multi-scale feature extraction [J]. Journal of Graphics, 2021, 42(4): 581-589. |
| [15] | HE Jian1,2, LIAO Jun-jie2, ZHANG Cheng2, WEI Xin2, BAI Jia-hao2, WANG Wei-dong1,2. Visual gesture recognition technology based on long short term memory and deep neural network [J]. Journal of Graphics, 2020, 41(3): 372-381. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||