Journal of Graphics ›› 2026, Vol. 47 ›› Issue (2): 341-350.DOI: 10.11996/JG.j.2095-302X.2026020341
• Image Processing and Computer Vision • Previous Articles Next Articles
YAN Kang, ZENG Li, GU Xiaoqing(
)
Received:2025-08-26
Accepted:2025-12-06
Online:2026-04-30
Published:2026-05-20
Contact:
GU Xiaoqing
Supported by:CLC Number:
YAN Kang, ZENG Li, GU Xiaoqing. Cross-domain structured deep dictionary learning for image classification[J]. Journal of Graphics, 2026, 47(2): 341-350.
Add to citation manager EndNote|Ris|BibTeX
URL: http://www.txxb.com.cn/EN/10.11996/JG.j.2095-302X.2026020341
| 数据集 | 域(Domain) | 类别数 | 样本数 | 迁移任务数 | 数据特点 |
|---|---|---|---|---|---|
| Office31 | Amazon (A) | 31 | 2 817 | 6 | 网络商品图像,背景简洁 |
| Webcam (W) | 795 | 摄像头拍摄,光照变化大 | |||
| DSLR (D) | 498 | 单反拍摄,图像质量高 | |||
| OfficeHome | Art (Ar) | 65 | 2 427 | 12 | 手绘风格,差异显著 |
| Clipart (Cl) | 4 365 | 简化二维图像 | |||
| Product (Pr) | 4 439 | 商品图片,背景干净 | |||
| Real World (Rw) | 4 357 | 真实场景图像,复杂多样 |
Table 1 Detailed information of Office31 and OfficeHome datasets
| 数据集 | 域(Domain) | 类别数 | 样本数 | 迁移任务数 | 数据特点 |
|---|---|---|---|---|---|
| Office31 | Amazon (A) | 31 | 2 817 | 6 | 网络商品图像,背景简洁 |
| Webcam (W) | 795 | 摄像头拍摄,光照变化大 | |||
| DSLR (D) | 498 | 单反拍摄,图像质量高 | |||
| OfficeHome | Art (Ar) | 65 | 2 427 | 12 | 手绘风格,差异显著 |
| Clipart (Cl) | 4 365 | 简化二维图像 | |||
| Product (Pr) | 4 439 | 商品图片,背景干净 | |||
| Real World (Rw) | 4 357 | 真实场景图像,复杂多样 |
| 对比方法 | 核心思想 | 参数列表 |
|---|---|---|
| JAN[ | 通过联合最大均值差异对齐多个层级的特征分布 | 参数 |
| MEDA[ | 在再生核希尔伯特空间中动态对齐特征并利用流形结构进行学习 | 流形子空间维度 |
| DSAN[ | 通过局部最大均值差异对齐相关但分布不同的子域分布 | 参数 |
| MRAN[ | 学习多个域不变表示,并利用注意力机制自适应地聚合 | 多重自适应损失参数 |
| JDDA[ | 联合进行域对齐和判别性特征学习, 通过中心损失等增强类内紧凑性 | 参数 |
| LROT[ | 利用低秩约束的最优传输机制对齐源域与目标域分布,在消除噪声干扰的同时保持类间判别性,实现鲁棒域适应 | 惩罚参数 学习率 |
Table 2 Basic information and parameter settings of the compared methods
| 对比方法 | 核心思想 | 参数列表 |
|---|---|---|
| JAN[ | 通过联合最大均值差异对齐多个层级的特征分布 | 参数 |
| MEDA[ | 在再生核希尔伯特空间中动态对齐特征并利用流形结构进行学习 | 流形子空间维度 |
| DSAN[ | 通过局部最大均值差异对齐相关但分布不同的子域分布 | 参数 |
| MRAN[ | 学习多个域不变表示,并利用注意力机制自适应地聚合 | 多重自适应损失参数 |
| JDDA[ | 联合进行域对齐和判别性特征学习, 通过中心损失等增强类内紧凑性 | 参数 |
| LROT[ | 利用低秩约束的最优传输机制对齐源域与目标域分布,在消除噪声干扰的同时保持类间判别性,实现鲁棒域适应 | 惩罚参数 学习率 |
| 迁移任务 | CAN | JDDA | MEDA | MRAN | LROT | CD-SDDL |
|---|---|---|---|---|---|---|
| A->W | 81.5 | 82.6 | 86.2 | 91.4 | 94.6 | 94.1 |
| D->W | 98.2 | 95.2 | 97.2 | 96.9 | 97.5 | 97.9 |
| W->D | 99.7 | 99.7 | 99.4 | 99.8 | 100.0 | 99.0 |
| A->D | 85.5 | 79.8 | 85.3 | 86.4 | 86.8 | 94.6 |
| D->A | 65.9 | 57.4 | 72.4 | 68.3 | 76.2 | 77.1 |
| W->A | 63.4 | 66.7 | 74.0 | 70.9 | 77.5 | 79.3 |
| Avg | 82.4 | 80.2 | 85.8 | 85.6 | 88.8 | 90.3 |
Table 3 Recognition accuracy of different methods on the Office31 dataset/%
| 迁移任务 | CAN | JDDA | MEDA | MRAN | LROT | CD-SDDL |
|---|---|---|---|---|---|---|
| A->W | 81.5 | 82.6 | 86.2 | 91.4 | 94.6 | 94.1 |
| D->W | 98.2 | 95.2 | 97.2 | 96.9 | 97.5 | 97.9 |
| W->D | 99.7 | 99.7 | 99.4 | 99.8 | 100.0 | 99.0 |
| A->D | 85.5 | 79.8 | 85.3 | 86.4 | 86.8 | 94.6 |
| D->A | 65.9 | 57.4 | 72.4 | 68.3 | 76.2 | 77.1 |
| W->A | 63.4 | 66.7 | 74.0 | 70.9 | 77.5 | 79.3 |
| Avg | 82.4 | 80.2 | 85.8 | 85.6 | 88.8 | 90.3 |
| 迁移任务 | DANN | JAN | MEDA | MRAN | LROT | CD-SDDL |
|---|---|---|---|---|---|---|
| Ar->Cl | 45.6 | 45.9 | 55.2 | 53.8 | 52.3 | 56.9 |
| Ar->Pr | 59.3 | 61.2 | 76.2 | 68.6 | 73.8 | 79.2 |
| Ar->Rw | 70.1 | 68.9 | 77.3 | 75.0 | 76.4 | 77.6 |
| Cl->Ar | 47.0 | 50.4 | 58.0 | 57.3 | 58.5 | 58.5 |
| Cl->Pr | 58.5 | 59.7 | 73.7 | 68.5 | 72.4 | 77.4 |
| Cl->Rw | 60.9 | 61.0 | 71.9 | 68.3 | 66.2 | 74.4 |
| Pr->Ar | 46.1 | 45.8 | 59.3 | 58.5 | 60.1 | 58.7 |
| Pr->Cl | 43.7 | 43.4 | 52.4 | 54.6 | 54.3 | 56.0 |
| Pr->Rw | 68.5 | 70.3 | 77.9 | 77.5 | 80.1 | 78.9 |
| Rw->Ar | 63.2 | 63.9 | 68.2 | 70.4 | 75.3 | 69.1 |
| Rw->Cl | 51.8 | 52.4 | 57.5 | 60.0 | 58.7 | 60.3 |
| Rw->Pr | 76.8 | 76.8 | 81.8 | 82.2 | 85.2 | 83.1 |
| Avg | 57.6 | 58.3 | 67.5 | 66.2 | 67.8 | 69.2 |
Table 4 Recognition accuracy of different methods on the OfficeHome dataset/%
| 迁移任务 | DANN | JAN | MEDA | MRAN | LROT | CD-SDDL |
|---|---|---|---|---|---|---|
| Ar->Cl | 45.6 | 45.9 | 55.2 | 53.8 | 52.3 | 56.9 |
| Ar->Pr | 59.3 | 61.2 | 76.2 | 68.6 | 73.8 | 79.2 |
| Ar->Rw | 70.1 | 68.9 | 77.3 | 75.0 | 76.4 | 77.6 |
| Cl->Ar | 47.0 | 50.4 | 58.0 | 57.3 | 58.5 | 58.5 |
| Cl->Pr | 58.5 | 59.7 | 73.7 | 68.5 | 72.4 | 77.4 |
| Cl->Rw | 60.9 | 61.0 | 71.9 | 68.3 | 66.2 | 74.4 |
| Pr->Ar | 46.1 | 45.8 | 59.3 | 58.5 | 60.1 | 58.7 |
| Pr->Cl | 43.7 | 43.4 | 52.4 | 54.6 | 54.3 | 56.0 |
| Pr->Rw | 68.5 | 70.3 | 77.9 | 77.5 | 80.1 | 78.9 |
| Rw->Ar | 63.2 | 63.9 | 68.2 | 70.4 | 75.3 | 69.1 |
| Rw->Cl | 51.8 | 52.4 | 57.5 | 60.0 | 58.7 | 60.3 |
| Rw->Pr | 76.8 | 76.8 | 81.8 | 82.2 | 85.2 | 83.1 |
| Avg | 57.6 | 58.3 | 67.5 | 66.2 | 67.8 | 69.2 |
| 迁移任务 | CD-SDDL | CD-SDDL | ||
|---|---|---|---|---|
| A->W | 91.2 | 92.5 | 93.0 | 94.1 |
| D->W | 93.2 | 95.6 | 95.4 | 97.9 |
| W->D | 93.1 | 96.8 | 96.4 | 99.0 |
| A->D | 94.0 | 93.7 | 93.4 | 94.6 |
| D->A | 76.3 | 76.9 | 76.8 | 77.1 |
| W->A | 75.3 | 78.0 | 78.0 | 79.3 |
| Avg | 87.2 | 88.9 | 88.8 | 90.3 |
Table 5 Ablation experimental results of CD-SDDL on the Office31 dataset/%
| 迁移任务 | CD-SDDL | CD-SDDL | ||
|---|---|---|---|---|
| A->W | 91.2 | 92.5 | 93.0 | 94.1 |
| D->W | 93.2 | 95.6 | 95.4 | 97.9 |
| W->D | 93.1 | 96.8 | 96.4 | 99.0 |
| A->D | 94.0 | 93.7 | 93.4 | 94.6 |
| D->A | 76.3 | 76.9 | 76.8 | 77.1 |
| W->A | 75.3 | 78.0 | 78.0 | 79.3 |
| Avg | 87.2 | 88.9 | 88.8 | 90.3 |
| 迁移任务 | CD-SDDL | CD-SDDL | ||
|---|---|---|---|---|
| Ar->Cl | 53.5 | 54.4 | 55.2 | 56.9 |
| Ar->Pr | 76.5 | 76.7 | 77.4 | 79.2 |
| Ar->Rw | 76.4 | 76.6 | 76.6 | 77.6 |
| Cl->Ar | 55.9 | 57.8 | 56.0 | 58.5 |
| Cl->Pr | 73.9 | 75.2 | 75.6 | 77.4 |
| Cl->Rw | 70.0 | 71.9 | 70.5 | 74.4 |
| Pr->Ar | 54.9 | 57.9 | 56.9 | 58.7 |
| Pr->Cl | 54.4 | 53.7 | 53.6 | 56.0 |
| Pr->Rw | 76.2 | 76.4 | 76.2 | 78.9 |
| Rw->Ar | 66.7 | 67.5 | 67.3 | 69.1 |
| Rw->Cl | 55.6 | 59.0 | 58.8 | 60.3 |
| Rw->Pr | 81.2 | 81.2 | 81.1 | 83.1 |
| Avg | 66.3 | 67.4 | 67.1 | 69.2 |
Table 6 Ablation experiment results of CD-SDDL on OfficeHome dataset/%
| 迁移任务 | CD-SDDL | CD-SDDL | ||
|---|---|---|---|---|
| Ar->Cl | 53.5 | 54.4 | 55.2 | 56.9 |
| Ar->Pr | 76.5 | 76.7 | 77.4 | 79.2 |
| Ar->Rw | 76.4 | 76.6 | 76.6 | 77.6 |
| Cl->Ar | 55.9 | 57.8 | 56.0 | 58.5 |
| Cl->Pr | 73.9 | 75.2 | 75.6 | 77.4 |
| Cl->Rw | 70.0 | 71.9 | 70.5 | 74.4 |
| Pr->Ar | 54.9 | 57.9 | 56.9 | 58.7 |
| Pr->Cl | 54.4 | 53.7 | 53.6 | 56.0 |
| Pr->Rw | 76.2 | 76.4 | 76.2 | 78.9 |
| Rw->Ar | 66.7 | 67.5 | 67.3 | 69.1 |
| Rw->Cl | 55.6 | 59.0 | 58.8 | 60.3 |
| Rw->Pr | 81.2 | 81.2 | 81.1 | 83.1 |
| Avg | 66.3 | 67.4 | 67.1 | 69.2 |
Fig. 4 t-SNE visualization of source domain (circles) and target domain (squares) features at different training stages ((a) Epoch=6; (b) Epoch=13; (c) Epoch=15; (d) Epoch=16)
| Layers | |||||
|---|---|---|---|---|---|
| Office31 | 86.3 | 88.3 | 90.3 | 90.2 | 90.2 |
| OfficeHome | 66.3 | 68.6 | 69.2 | 69.0 | 69.1 |
Table 7 Classification accuracy for different depths/%
| Layers | |||||
|---|---|---|---|---|---|
| Office31 | 86.3 | 88.3 | 90.3 | 90.2 | 90.2 |
| OfficeHome | 66.3 | 68.6 | 69.2 | 69.0 | 69.1 |
| 对比方法 | 平均训练时间(s/Epoch) | 推理速度/FPS | 平均精度/% |
|---|---|---|---|
| MRAN | 32.9 | 495.4 | 91.4 |
| MEDA | 45.3 | 136.5 | 86.2 |
| DSAN | 29.6 | 245.7 | 93.6 |
| CD-SDDL | 46.4 | 234.1 | 94.1 |
Table 8 Comparison of computational efficiency of different methods on Office31 dataset tasks
| 对比方法 | 平均训练时间(s/Epoch) | 推理速度/FPS | 平均精度/% |
|---|---|---|---|
| MRAN | 32.9 | 495.4 | 91.4 |
| MEDA | 45.3 | 136.5 | 86.2 |
| DSAN | 29.6 | 245.7 | 93.6 |
| CD-SDDL | 46.4 | 234.1 | 94.1 |
| [1] |
黄凯奇, 武美奇, 陈宏昊, 等. 视觉图灵三境界: 大模型时代下视觉智能进展与展望[J]. 图学学报, 2025, 46(5): 919-930.
DOI |
|
HUANG K Q, WU M Q, CHEN H H, et al. The three realms of visual turing: from seeing to imagining in the LLM era[J]. Journal of Graphics, 2025, 46(5): 919-930 (in Chinese).
DOI |
|
| [2] |
时妙文, 范琳伟, 王桦, 等. 基于四元数组稀疏的彩色图像去噪[J]. 图学学报, 2023, 44(2): 298-303.
DOI |
| SHI M W, FAN L W, WANG H, et al. Quaternion patch-group sparse coding for color image denoising[J]. Journal of Graphics, 2023, 44(2): 298-303 (in Chinese). | |
| [3] |
GOU J P, HE X, DU L, et al. Deep class-weighted and class-shared dictionary learning for image classification[J]. Expert Systems with Applications, 2026, 299: 130042.
DOI URL |
| [4] |
YANG M, LING J, CHEN J M, et al. Discriminative semi-supervised learning via deep and dictionary representation for image classification[J]. Pattern Recognition, 2023, 140: 109521.
DOI URL |
| [5] | TAN B Y, LIN J, QIN Y, et al. Accelerated deep nonlinear dictionary learning[C]// The 17th Asian Conference on Computer Vision. Cham: Springer, 2024: 111-127. |
| [6] |
蔡益武, 张雨佳, 张永飞. 面向跨域行人再识别的虚拟数据生成与选择[J]. 图学学报, 2023, 44(4): 775-783.
DOI |
| CAI Y W, ZHANG Y J, ZHANG Y F. Generation and selection of synthetic data for cross-domain person re-identification[J]. Journal of Graphics, 2023, 44(4): 775-783 (in Chinese). | |
| [7] |
LI M Y, LI Y, LI Z M. A comprehensive survey of transfer dictionary learning[J]. Neurocomputing, 2025, 623: 129322.
DOI URL |
| [8] |
ZHAO D D, ZHANG P, YIN H P, et al. A novel multi-layer discriminative dictionary learning approach for image classification[J]. Signal Processing, 2025, 226: 109670.
DOI URL |
| [9] |
ZHENG X, LIN L Y, LIU B, et al. A multi-task transfer learning method with dictionary learning[J]. Knowledge-Based Systems, 2020, 191: 105233.
DOI URL |
| [10] |
FAN Z Z, SHI L R, LIU Q, et al. Discriminative fisher embedding dictionary transfer learning for object recognition[J]. IEEE Transactions on Neural Networks and Learning Systems, 2023, 34(1): 64-78.
DOI URL |
| [11] |
CHEN D L, SONG P, ZHENG W M. Learning transferable sparse representations for cross-corpus facial expression recognition[J]. IEEE Transactions on Affective Computing, 2023, 14(2): 1322-1333.
DOI URL |
| [12] | YUAN X, GOU J P, YU B S, et al. Deep dictionary learning with an intra-class constraint[C]// 2022 IEEE International Conference on Multimedia and Expo. New York: IEEE Press, 2022: 1-6. |
| [13] |
DHAINI M, BERAR M, HONEINE P, et al. Unsupervised domain adaptation for regression using dictionary learning[J]. Knowledge-Based Systems, 2023, 267: 110439.
DOI URL |
| [14] |
SCETBON M, ELAD M, MILANFAR P. Deep K-SVD denoising[J]. IEEE Transactions on Image Processing, 2021, 30: 5944-5955.
DOI URL |
| [15] |
LIU S J, MA J J, CUI C K. FPGA implementation of threshold projection orthogonal matching pursuit algorithm for compressed sensing reconstruction[J]. IEEE Transactions on Circuits and Systems I: Regular Papers, 2024, 71(3): 1184-1197.
DOI URL |
| [16] | SAENKO K, KULIS B, FRITZ M, et al. Adapting visual category models to new domains[C]// The 11th European Conference on Computer Vision. Cham: Springer, 2010: 213-226. |
| [17] | VENKATESWARA H, EUSEBIO J, CHAKRABORTY S, et al. Deep hashing network for unsupervised domain adaptation[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2017: 5385-5394. |
| [18] |
GOU J P, HE X, DU L, et al. Hierarchical locality-aware deep dictionary learning for classification[J]. IEEE Transactions on Multimedia, 2024, 26: 447-461.
DOI URL |
| [19] | LONG M S, ZHU H, WANG J M, et al. Deep transfer learning with joint adaptation networks[EB/OL]. [2025-06-26]. https://dl.acm.org/doi/10.5555/3305890.3305909.https://dl.acm.org/doi/10.5555/3305890.3305909. |
| [20] | WANG J D, FENG W J, CHEN Y Q, et al. Visual domain adaptation with manifold embedded distribution alignment[C]// The 26th ACM International Conference on Multimedia. New York: ACM, 2018: 402-410. |
| [21] |
ZHU Y C, ZHUANG F Z, WANG J D, et al. Deep subdomain adaptation network for image classification[J]. IEEE Transactions on Neural Networks and Learning Systems, 2021, 32(4): 1713-1722.
DOI URL |
| [22] |
ZHU Y C, ZHUANG F Z, WANG J D, et al. Multi- representation adaptation network for cross-domain image classification[J]. Neural Networks, 2019, 119: 214-221.
DOI URL |
| [23] | CHEN C, CHEN Z H, JIANG B Y, et al. Joint domain alignment and discriminative feature learning for unsupervised deep domain adaptation[EB/OL]. [2025-06-26]. https://dl. acm.org/doi/10.1609/aaai.v33i01.33013296.https://dl.acm.org/doi/10.1609/aaai.v33i01.33013296. |
| [24] |
XU B R, YIN J H, LIAN C, et al. Low-rank optimal transport for robust domain adaptation[J]. IEEE/CAA Journal of Automatica Sinica, 2024, 11(7): 1667-1680.
DOI URL |
| [1] | PANG Min, LI Zhentang, ZHANG Yuan, CUI Xiaokang, XIONG Fengguang. 3D model reconstruction based on retrieval and deformation techniques [J]. Journal of Graphics, 2026, 47(2): 368-379. |
| [2] | DONG Wenyi, YANG Weidong, TANG Binghui, WANG Qi, XIAO Hongyu. Review of deep learning based methods for detecting focal liver lesions [J]. Journal of Graphics, 2026, 47(1): 1-16. |
| [3] | ZHAI Yongjie, WANG Zixuan, ZHANG Zhenqi, ZHOU Xunqi, WANG Qianming. A vehicle damage classification model incorporating dual attention and weighted dynamic convolution [J]. Journal of Graphics, 2026, 47(1): 17-28. |
| [4] | PAN Yuxuan, JIN Rui, LIU Yu, ZHANG Lin. Generative model based unsupervised multi-view stereo network [J]. Journal of Graphics, 2026, 47(1): 29-38. |
| [5] | JIU Mingyuan, WU Guowei, SONG Xuguang, LI Shupan, XU Mingliang. Image classification method based on uncertainty-driven smart reinforcement active learning [J]. Journal of Graphics, 2026, 47(1): 47-56. |
| [6] | YANG Biao, WANG Xue, GUAN Zheng, LONG Ping. BSD-YOLO: a small target vehicle detection method based on dynamic sparse attention and adaptive detection head [J]. Journal of Graphics, 2026, 47(1): 99-110. |
| [7] | JU Chen, DING Jiaxin, WANG Zexing, LI Guangzhao, GUAN Zhenxiang, ZHANG Changyou. Graph neural network-based method for approximating finite element shape functions [J]. Journal of Graphics, 2025, 46(6): 1161-1171. |
| [8] | YI Bin, ZHANG Libin, LIU Danying, TANG Jun, FANG Junjun, LI Wenqi. Prediction model of laser drilling ventilation rate in cigarette manufacturing process based on AMTA-Net [J]. Journal of Graphics, 2025, 46(6): 1224-1232. |
| [9] | BO Wen, JU Chen, LIU Weiqing, ZHANG Yan, HU Jingjing, CHENG Jinghan, ZHANG Changyou. Degradation-driven temporal modeling method for equipment maintenance interval prediction [J]. Journal of Graphics, 2025, 46(6): 1233-1246. |
| [10] | ZHAO Zhenbing, Ouyang Wenbin, FENG Shuo, LI Haopeng, MA Jun. A thermal image detection method for insulators incorporating within-class sparse prior knowledge and improved YOLOv8 [J]. Journal of Graphics, 2025, 46(6): 1247-1256. |
| [11] | HE Mengmeng, ZHANG Xiaoyan, LI Hongan. Lightweight skin lesion image segmentation network based on Mamba structure [J]. Journal of Graphics, 2025, 46(6): 1257-1266. |
| [12] | LI Xingchen, LI Zongmin, YANG Chaozhi. Test-time adaptation algorithm based on trusted pseudo-label fine-tuning [J]. Journal of Graphics, 2025, 46(6): 1292-1303. |
| [13] | FAN Lexiang, MA Ji, ZHOU Dengwen. Lightweight blind super-resolution network based on degradation separation [J]. Journal of Graphics, 2025, 46(6): 1304-1315. |
| [14] | WANG Haihan. Multi object detection method for surface defects of steel arch towers based on YOLOv8-OSRA [J]. Journal of Graphics, 2025, 46(6): 1327-1336. |
| [15] | ZHU Hongmiao, ZHONG Guojie, ZHANG Yanci. Semantic segmentation of small-scale point clouds based on integration of mean shift and deep learning [J]. Journal of Graphics, 2025, 46(5): 998-1009. |
| Viewed | ||||||
|
Full text |
|
|||||
|
Abstract |
|
|||||