Multimodal emotion recognition with action features

doi:10.11996/JG.j.2095-302X.2022061159

Abstract

In recent years, emotion recognition from multimodal data has become an important research direction in natural human-computer interaction and artificial intelligence. Emotion recognition research that uses visual modality information usually focuses on facial features and rarely considers action features or multimodal features fused with action features. Although action is closely related to emotion, it is difficult to extract valid action information from the visual modality. In this paper, we start from the relationship between action and emotion and introduce action data extracted from the visual modality into the classic multimodal emotion recognition dataset MELD. Body action features are extracted with the ST-GCN model and applied to an LSTM-based single-modal emotion recognition task. In addition, body action features are introduced into bi-modal emotion recognition on the MELD dataset, improving the performance of the LSTM-based fusion model. Combining body action features with text features also improves the recognition accuracy of the context model with pre-trained memory compared with using text features alone. The experimental results show that although body action features achieve emotion recognition accuracy no higher than that of traditional text and audio features, they play an important role in multimodal emotion recognition. The single-modal and multimodal experiments support the view that people use actions to convey their emotions and that body action features have great potential for emotion recognition.
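The bi-modal setup described in the abstract can be illustrated with a small sketch: per-utterance body action and text feature vectors are fused and passed through an LSTM over the utterances of a dialogue, which outputs a distribution over MELD's seven emotion labels. This is a minimal illustration under stated assumptions, not the authors' implementation. The abstract does not specify the fusion scheme, so concatenation is assumed here; the feature dimensions, the random untrained weights, and the names `fuse`, `TinyLSTM`, and `classify_dialogue` are all hypothetical.

```python
import math
import random

# The seven emotion labels used in MELD.
EMOTIONS = ["neutral", "joy", "surprise", "anger", "sadness", "disgust", "fear"]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def matvec(W, v):
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def fuse(action_feat, text_feat):
    """Feature-level fusion by concatenation (an assumption, see lead-in)."""
    return list(action_feat) + list(text_feat)

class TinyLSTM:
    """Single-layer LSTM with a linear classifier; weights are random, not trained."""

    def __init__(self, input_dim, hidden_dim, n_classes, seed=0):
        rng = random.Random(seed)
        self.hidden_dim = hidden_dim
        # One weight matrix per gate (input, forget, output, candidate) acting on [x; h].
        self.W = {g: rand_matrix(hidden_dim, input_dim + hidden_dim, rng) for g in "ifog"}
        self.b = {g: [0.0] * hidden_dim for g in "ifog"}
        self.W_out = rand_matrix(n_classes, hidden_dim, rng)

    def step(self, x, h, c):
        z = x + h  # list concatenation of input and previous hidden state
        pre = {g: [a + b for a, b in zip(matvec(self.W[g], z), self.b[g])] for g in "ifog"}
        i = [sigmoid(v) for v in pre["i"]]
        f = [sigmoid(v) for v in pre["f"]]
        o = [sigmoid(v) for v in pre["o"]]
        g = [math.tanh(v) for v in pre["g"]]
        c_new = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, g)]
        h_new = [ov * math.tanh(cv) for ov, cv in zip(o, c_new)]
        return h_new, c_new

    def classify_dialogue(self, fused_seq):
        """Return one probability vector over EMOTIONS per utterance."""
        h = [0.0] * self.hidden_dim
        c = [0.0] * self.hidden_dim
        outputs = []
        for x in fused_seq:
            h, c = self.step(x, h, c)
            outputs.append(softmax(matvec(self.W_out, h)))
        return outputs

# Toy dialogue of three utterances with made-up feature sizes.
ACTION_DIM, TEXT_DIM, HIDDEN_DIM = 4, 6, 5
rng = random.Random(42)
action_seq = [[rng.gauss(0, 1) for _ in range(ACTION_DIM)] for _ in range(3)]
text_seq = [[rng.gauss(0, 1) for _ in range(TEXT_DIM)] for _ in range(3)]

fused_seq = [fuse(a, t) for a, t in zip(action_seq, text_seq)]
model = TinyLSTM(ACTION_DIM + TEXT_DIM, HIDDEN_DIM, len(EMOTIONS))
probs = model.classify_dialogue(fused_seq)

for p in probs:
    print(EMOTIONS[p.index(max(p))], [round(v, 3) for v in p])
```

In a real pipeline the action vectors would come from a skeleton-based extractor such as ST-GCN and the weights would be learned; the sketch only shows how the two feature streams can share one recurrent model across the utterances of a dialogue.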

Key words: action features, emotion recognition, multimodality, action and emotion, visual modality

CLC Number: TP 391

SUN Ya-nan, WEN Yu-hui, SHU Ye-zhi, LIU Yong-jin. Multimodal emotion recognition with action features[J]. Journal of Graphics, 2022, 43(6): 1159-1169.

