Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2023, Vol. 44 ›› Issue (1): 120-130.DOI: 10.11996/JG.j.2095-302X.2023010120

• Computer Graphics and Virtual Reality • Previous Articles     Next Articles

CTH-Net: CNN-Transformer hybrid network for garment image generation from sketches and color points

PAN Dong-hui(), JIN Ying-han, SUN Xu, LIU Yu-sheng, ZHANG Dong-liang()   

  1. College of Computer Science and Technology, Zhejiang University, Hangzhou Zhejiang 310000, China
  • Received:2022-04-24 Revised:2022-07-01 Online:2023-10-31 Published:2023-02-16
  • Contact: ZHANG Dong-liang
  • About author:PAN Dong-hui (1997-), master student. His main research interest covers digital image processing. E-mail:417969567@qq.com
  • Supported by:
    National Key R&D Program of China(2022YFB3303100);National Natural Science Foundation of China(61972340);National Natural Science Foundation of China(61732015)

Abstract:

Drawing garment images is an important part of garment design. To address the problems such as low intelligence and high requirements for users' drawing skills and imagination, a CNN-Transformer hybrid network (CTH-Net) was proposed to generate garment images from sketches and color points. CTH-Net combined the advantages of convolutional neural networks (CNN) in extracting local information and Transformer in processing long-range dependencies, efficiently fusing the architectures of these two models. The ToPatch and ToFeatureMap modules were also designed to reduce the amount and dimension of data input into Transformer, thus reducing the consumption of computing resources. CTH-Net consisted of three phases: the first drafting phase, which aimed to predict the color distribution of garments and obtain watercolor images without gradients and shadows; the second refinement phase, which refined the watercolor image into a realistic garment image; the third tuning phase, which combined the outputs of the above two phases to further optimize the generation quality. The experimental results show that CTH-Net could generate high-quality garment images by simply inputting sketches and some color points. The proposed network could outperform the existing methods in the realism and accuracy of the generated images.

Key words: deep learning, convolutional neural network, image generation, Transformer

CLC Number: