Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2024, Vol. 45 ›› Issue (1): 102-111.DOI: 10.11996/JG.j.2095-302X.2024010102

• Image Processing and Computer Vision • Previous Articles     Next Articles

Diversified generation of theatrical masks based on SASGAN

GU Tianjun1(), XIONG Suya2, LIN Xiao1,3,4()   

  1. 1. The College of information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 200234, China
    2. Institute of Marine Equipment, Shanghai Jiao tong University, Shanghai 200240, China
    3. Shanghai Engineering Research Center of Intelligent Education and Big data, Shanghai Normal University, Shanghai 200234, China
    4. The Research Base of Online Education for Shanghai Middle and Primary Schools, Shanghai 200234, China
  • Received:2023-06-29 Accepted:2023-09-27 Online:2024-02-29 Published:2024-02-29
  • Contact: LIN Xiao (1978-), professor, Ph.D. Her main research interests cover image video editing and processing, artificial intelligence, etc. E-mail:lin6008@shnu.edu.cn
  • About author:

    GU Tianjun (2002-), undergraduate student. His main research interests cover digital image processing and image generation.
    E-mail:TianjunGu_Grady@outlook.com

  • Supported by:
    National Natural Science Foundation of China(61502220);National Natural Science Foundation of China(61572326);National Natural Science Foundation of China(61775139);National Natural Science Foundation of China(61872242)

Abstract:

To address the problem of low resolution and lack of realism in existing automatically generated theatrical masks, a stylized generative adversarial network (SASGAN) based on a self-attentive mechanism was proposed. Firstly, SASGAN introduced the self-attentive mechanism and vector quantization method based on StyleGAN, thereby enhancing the extraction of geometric structure features of mask patterns. Subsequently, the diversified differentiation generation (DDG) method was supplemented with a mask hue-assisted algorithm by expanding the data with DDG to build a theatrical mask dataset containing 12,599 images. The final training was performed on this dataset to generate mask images with both diversity and realism. The experimental results demonstrated significant improvement in data augmentation for theatrical masks using the DDG method compared to the traditional methods, while SASGAN enhanced the resolution and realism of theatrical masks, achieving the desired effect in subjective visualization.

Key words: theatrical masks, generation adversarial network, image generation, attention mechanism, vector quantization

CLC Number: