| [1] | 
																						 
											 吴树生. 纳西族东巴文化艺术结晶: 东巴画[J]. 地方文化研究, 2019(6): 2.
																						 | 
										
																													
																							 | 
																						 
											 WU S S. The crystallization of Naxi Dongba culture and art: Dongba painting[J]. Local Culture Research, 2019(6): 2 (in Chinese).
																						 | 
										
																													
																							| [2] | 
																						 
											 黎克, 钱文华, 王成学, 等. 基于图神经网络的东巴画小样本分类[J]. 计算机辅助设计与图形学学报, 2021, 33(7): 1073-1083.
																						 | 
										
																													
																							 | 
																						 
											 LI K, QIAN W H, WANG C X, et al.  Dongba painting few-shot classification based on graph neural network[J]. Journal of Computer-Aided Design & Computer Graphics, 2021, 33(7): 1073-1083 (in Chinese).
																						 | 
										
																													
																							| [3] | 
																						 
											 CARION N, MASSA F, SYNNAEVE G, et al.  End-to-end object detection with transformers[C]//European Conference on Computer Vision. Cham: Springer International Publishin, 2020: 213-229.
																						 | 
										
																													
																							| [4] | 
																						 
											 MACHAJDIK J, HANBURY A. Affective image classification using features inspired by psychology and art theory[C]// MM’10 - Proceedings of the ACM Multimedia 2010 International Conference. New York: ACM, 2010: 83-92.
																						 | 
										
																													
																							| [5] | 
																						 
											 BORTH D, JI R R, CHEN T, et al.  Large-scale visual sentiment ontology and detectors using adjective noun pairs[C]// MM 2013 - Proceedings of the 2013 ACM Multimedia Conference. New York: ACM, 2013: 223-232.
																						 | 
										
																													
																							| [6] | 
																						 
											 SIERSDORFER S, MINACK E, DENG F, et al.  Analyzing and predicting sentiment of images on the social web[C]//MM'10 - Proceedings of the ACM Multimedia 2010 International Conference. New York: ACM, 2010: 715-718.
																						 | 
										
																													
																							| [7] | 
																						 
											 KRIZHEVSKY A, SUTSKEVER I. Imagenet classification with deep convolutional neural networks[C]//Advances in Neural Information Processing Systems. Lake Tahoe: NIPS, 2012: 25-26.
																						 | 
										
																													
																							| [8] | 
																						 
											 ZHAO S, YAO H, YANG Y, et al.  Affective image retrieval via multi-graph learning[C]// The 22nd ACM International Conference on Multimedia. New York: ACM, 2014: 1025-1028.
																						 | 
										
																													
																							| [9] | 
																						 
											 盛家川, 陈雅琦, 韩亚洪. 深层网络特征聚合重标定的中国画情感分类算法[J]. 计算机辅助设计与图形学学报, 2020, 32(9): 1420-1429.
																						 | 
										
																													
																							 | 
																						 
											 SHENG J C, CHEN Y Q, HAN Y H. Sentiment classification of Chinese paintings via feature recalibration of deep network aggregation[J]. Journal of Computer-Aided Design & Computer Graphics, 2020, 32(9): 1420-1429 (in Chinese).
																						 | 
										
																													
																							| [10] | 
																						 
											 VASWANI A, SHAZEER N, PARMAR N. Attention is all You need[C]// Advances in Neural Information Processing Systems. Lone Beach: NIPS, 2017: 5998-6008.
																						 | 
										
																													
																							| [11] | 
																						 
											 ALMOWALLAD A, SANCHEZ V. Human emotion distribution learning from face images using cnn and lbc features[C]//2020 8th International Workshop on Biometrics and Forensics. New York, IEEE Press, 2020: 1-6.
																						 | 
										
																													
																							| [12] | 
																						 
											 YANG J, SHE D, LAI Y, et al.  Weakly supervised coupled networks for visual sentiment analysis[C]// The IEEE Conference on Computer Vision and Pattern Recognition. New York, IEEE Press, 2018: 7584-7592.
																						 | 
										
																													
																							| [13] | 
																						 
											 钱文华, 徐丹, 徐瑾, 等. 东巴画艺术风格绘制[J]. 系统仿真学报, 2020, 32(7): 1349-1359. 
																							 
																									DOI    
																																																										 | 
										
																													
																							 | 
																						 
											 QIAN W H, XU D, XU J, et al.  Simulation of dongba art style painting[J]. Journal of System Simulation, 2020, 32(7): 1349-1359 (in Chinese). 
																							 
																									DOI    
																																																										 | 
										
																													
																							| [14] | 
																						 
											 HE K M, ZHANG X Y, REN S Q, et al.  Deep residual learning for image recognition[C]//The IEEE Conference on Computer Vision and Pattern Recognition. Los Alamitos: IEEE Computer Society Press, 2016: 770-778.
																						 | 
										
																													
																							| [15] | 
																						 
											 LIU Z, LIN Y, CAO Y, et al.  Swin transformer: hierarchical vision transformer using shifted windows[C]//2021 IEEE/CVF International Conference on Computer Vision. New York, IEEE Press, 2021: 10012-10022.
																						 | 
										
																													
																							| [16] | 
																						 
											 LOSHCHILOV I, HUTTER F. Decoupled weight decay regularization[EB/OL]. (2017-11-05) [2021-08-13].https://arxiv.org/pdf/1711.05101. 
																						 | 
										
																													
																							| [17] | 
																						 
											 DOSOVITSKIY A, BEYER L, KOLESNIKV A, et al.  An image is worth 16x16 words: transformers for image recognition at scale[EB/OL]. (2020-10-11) [2021-11-20]. https://arxiv.org/pdf/2010.11929. 
																						 |