Journal of Graphics ›› 2023, Vol. 44 ›› Issue (5): 966-977. DOI: 10.11996/JG.j.2095-302X.2023050966

• Image Processing and Computer Vision •

Stereoscopic image generation considering human perception

CHEN Peng1, JIANG Hao2, XIANG Wei1

  1. School of Computer Science and Technology, Zhejiang University, Hangzhou, Zhejiang 310013, China
  2. Ningbo Research Institute, Zhejiang University, Ningbo, Zhejiang 315048, China
  • Received: 2023-02-07 Accepted: 2023-06-15 Online: 2023-10-31 Published: 2023-10-31
  • Contact: XIANG Wei (1991-), lecturer, Ph.D. His main research interests include intelligent design, etc. E-mail: wxiang@zju.edu.cn
  • About author: CHEN Peng (1996-), master's student. His main research interests include artificial intelligence and digital image processing. E-mail: chen_peng2023@163.com

Abstract:

In recent years, three-dimensional (3D) displays have attracted increasing attention for their superior immersive experience. However, the lack of 3D content limits the development of 3D displays. Two-dimensional (2D)-to-3D conversion is a promising and effective way to obtain this scarce content. The conversion requires adding depth information to 2D content, but existing depth estimation methods are too unstable to meet the requirements of 2D-to-3D conversion. This paper presents a stereoscopic image presentation system that converts a monocular image into a pair of stereoscopic images for 3D displays while considering human perception. The core step of the system is an algorithm called depth optimization considering human perception (DOCHP). It takes semantic segmentation maps as input and generates an optimized depth map by accounting for human perception, including attention mechanisms and depth perception, thereby enhancing the stereoscopic effect of the resulting images. The experimental results show that stereoscopic images generated from the depth map optimized by the system give users a strong 3D effect. These results indicate the need to consider human perceptual characteristics in stereoscopic image production and support the wider adoption of autostereoscopic images.

Key words: 2D-to-3D, 3D displays, human perception, monocular images, stereoscopic sensation enhancement
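
The conversion outlined in the abstract, attaching depth to a 2D image and rendering a stereoscopic pair from it, can be illustrated with a minimal depth-image-based rendering sketch. This is not the DOCHP algorithm, whose details the abstract does not give. The function names `shift_view` and `make_stereo_pair`, the `max_disparity` parameter, the depth convention (0.0 farthest, 1.0 nearest), the shift direction per eye, and the simple hole filling are all assumptions made for illustration.

```python
from typing import List, Optional, Tuple

Image = List[List[int]]        # grayscale rows, values 0-255
DepthMap = List[List[float]]   # same shape; 0.0 = farthest, 1.0 = nearest (assumed)


def shift_view(image: Image, depth: DepthMap,
               max_disparity: int, direction: int) -> Image:
    """Warp `image` horizontally so that nearer pixels move further.

    direction = +1 renders a left-eye view and -1 a right-eye view
    (an assumed convention: the farthest plane stays at screen depth).
    """
    height, width = len(image), len(image[0])
    view: Image = []
    for y in range(height):
        row: List[Optional[int]] = [None] * width
        # Paint far pixels first so that nearer pixels overwrite them.
        for x in sorted(range(width), key=lambda i: depth[y][i]):
            # Each eye receives half of the total disparity, rounded.
            shift = int(depth[y][x] * max_disparity / 2 + 0.5)
            target = x + direction * shift
            if 0 <= target < width:
                row[target] = image[y][x]
        # Fill disocclusion holes from the left neighbour; if there is
        # none, fall back to the source pixel at the same position.
        filled: List[int] = []
        for x in range(width):
            value = row[x]
            if value is None:
                value = filled[-1] if filled else image[y][x]
            filled.append(value)
        view.append(filled)
    return view


def make_stereo_pair(image: Image, depth: DepthMap,
                     max_disparity: int) -> Tuple[Image, Image]:
    """Return (left, right) views rendered from one image and its depth map."""
    left = shift_view(image, depth, max_disparity, direction=+1)
    right = shift_view(image, depth, max_disparity, direction=-1)
    return left, right


if __name__ == "__main__":
    # One-row toy image with a single near pixel in the middle.
    img = [[10, 20, 30, 40, 50]]
    dep = [[0.0, 0.0, 1.0, 0.0, 0.0]]
    left_view, right_view = make_stereo_pair(img, dep, max_disparity=2)
    print(left_view, right_view)
```

In this sketch the quality of the stereo effect depends entirely on the depth map, which is the quantity the abstract says the system optimizes. A practical renderer would also need sub-pixel interpolation and better hole filling than the nearest-neighbour fill used here.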

CLC Number: