模拟真实场景的场景流预测

doi:10.11996/JG.j.2095-302X.2022030404

图学学报 ›› 2022, Vol. 43 ›› Issue (3): 404-412.DOI: 10.11996/JG.j.2095-302X.2022030404

• 图像处理与计算机视觉 • 上一篇下一篇

模拟真实场景的场景流预测

1. 山东大学控制科学与工程学院，山东济南 250061；
2. 中国科学院自动化研究所，北京 100190；
3. 中国科学院大学人工智能学院，北京 100049

出版日期:2022-06-30 发布日期:2022-06-28
基金资助:
国家重点研究计划项目(2016YFA0100900，2016YFA0100902)；NSFC-山东联合基金项目(U1806202)；国家自然科学基金项目(81871442，61876178，61806196，61976229，61872367)；中国科学院青年创新促进会项目(Y201930)

Scene flow prediction with simulated real scenarios

1. School of Control Science and Engineering, Shandong University, Jinan Shandong 250061, China;
2. Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China;
3. School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China

Online:2022-06-30 Published:2022-06-28
Supported by:
National Key Research Programs of China (2016YFA0100900, 2016YFA0100902); Natural Science Foundation of China Under Grant
(U1806202); Chinese National Natural Science Foundation Projects (81871442, 61876178, 61806196, 61976229, 61872367); Youth
Innovation Promotion Association CAS (Y201930)

摘要/Abstract

摘要：

人工智能发展至今正逐渐进入认知时代，计算机对真实物理世界的认知与推理能力亟待提高。有关物体物理属性与运动预测的现有工作多局限于简单的物体和场景，因此尝试拓展常识推理至仿真场景下物体场景流的预测。首先，为了弥补相关领域数据集的短缺，提出了一个基于仿真场景的数据集 ModernCity，从常识推理的角度出发还原了现代都市的街边景象，并提供了包括 RGB 图像、深度图、场景流数据和语义分割图在内的多种标签；此外，设计了一个物体描述子解码模型(ODD)，通过物体属性辅助预测场景流。通过消融实验证明，该模型可以在仿真的场景下通过物体的属性准确地预测物体未来的运动趋势，通过与其他 SOTA 模型的对比实验验证了该模型的性能及 ModernCity 数据集的可靠性。

关键词: 常识推理, 场景流, 仿真场景, 物体物理属性, 运动预测

Abstract:

Artificial intelligence is stepping into the age of cognition, the ability of cognizing and inferring the physical world for machines needs to be improved. Recent works about exploring the physical properties of objects and predicting the motion of objects are mostly constrained by simple objects and scenes. We attempted to predict the scene flow of objects in simulated scenarios to extend common sense cognizing. First, due to the lack of data in the related field, a dataset called ModernCity based on simulated scenarios is proposed, which contains the street scene of modern cities designed from the perspective of cognizing common sense, and provides RGB images, depth maps, scene flow, and semantic segmentations. In addition, we design an object descriptor decoder (ODD) to predict the scene flow through the properties of the objects. The model we proposed is proved to have the ability to predict future motion accurately through the properties of objects in simulated scenarios by experiments. The comparison experiment with other SOTA models demonstrates the performance of the model and the reliability of the ModernCity dataset.

Key words: common sense cognizing, scene flow, simulated scenarios, properties of objects, motion prediction

中图分类号:

TP 391

梅海艺, 朱翔昱, 雷震, 高瑞, 马喜波. 模拟真实场景的场景流预测[J]. 图学学报, 2022, 43(3): 404-412.

MEI Hai-yi, ZHU Xiang-yu, LEI Zhen, GAO Rui, MA Xi-bo. Scene flow prediction with simulated real scenarios[J]. Journal of Graphics, 2022, 43(3): 404-412.

[1]	东辉, 陈鑫凯, 孙浩, 姚立纲. 基于改进 YOLOv4 和图像处理的蔬菜田杂草检测[J]. 图学学报, 2022, 43(4): 559-569.
[2]	张盾, 黄志开, 王欢, 吴义鹏, 王颖, 邹家豪. 基于多尺度特征实现超参进化的野生菌分类研究与应用[J]. 图学学报, 2022, 43(4): 580-589.
[3]	陈昭俊, 储珺, 曾伦杰. 基于动态加权类别平衡损失的多类别口罩佩戴检测[J]. 图学学报, 2022, 43(4): 590-598.
[4]	李海鹏, 徐丹, 付宇婷, 柳雁安, 张婷婷. 基于 FPFH 特征提取的散乱点云精简算法[J]. 图学学报, 2022, 43(4): 599-607.
[5]	刘玉珍, 李楠, 陶志勇. 基于环查询和通道注意力的点云分类与分割[J]. 图学学报, 2022, 43(4): 616-623.
[6]	刘世龙, 马智亮. 基于结构光相机的钢筋骨架整体点云获取算法[J]. 图学学报, 2022, 43(4): 633-640.
[7]	彭国琴, 张浩, 徐丹. 基于域自适应的云南重彩画无监督情感识别[J]. 图学学报, 2022, 43(4): 641-650.
[8]	刘南杉, 裴云强, 蒋皓, 韩永国, 吴亚东, 王赋攀, 易思恒. 基于VD-MobileNet 网络的 WebAR生活垃圾分类信息可视化方法[J]. 图学学报, 2022, 43(4): 667-676.
[9]	蔡兴泉, 霍宇晴, 李发建, 孙海燕. 面向太极拳学习的人体姿态估计及相似度计算[J]. 图学学报, 2022, 43(4): 695-706.
[10]	陈主昕, 杨沁七, 陈瑞, 张严辞, 刘艳丽, 吴志红. 基于虚拟光源的实时半透明材质渲染[J]. 图学学报, 2022, 43(4): 707-714.
[11]	刘佳, 张晶晶, 杨胜强, 乔志杰. 百叶轮抛磨叶片微结构区域识别及路径拼接方法研究 [J]. 图学学报, 2022, 43(4): 715-720.
[12]	李晓英, 余亚平. 基于多模态感官体验的儿童音画交互设计研究[J]. 图学学报, 2022, 43(4): 736-743.
[13]	邓壮林, 张绍兵, 成苗, 何莲. 多模态硬币图像单应性矩阵预测[J]. 图学学报, 2022, 43(3): 361-369.
[14]	王素琴, 任琪, 石敏, 朱登明. 基于异常检测的产品表面缺陷检测与分割[J]. 图学学报, 2022, 43(3): 377-386.
[15]	方洪波, 万广, 陈忠辉, 黄以卫, 张文勇, 谢本亮. 基于改进 YOLOv5s 的离线手写数学符号识别[J]. 图学学报, 2022, 43(3): 387-395.

模拟真实场景的场景流预测

Scene flow prediction with simulated real scenarios

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价