Journal of Graphics ›› 2025, Vol. 46 ›› Issue (4): 739-745.DOI: 10.11996/JG.j.2095-302X.2025040739
• Image Processing and Computer Vision • Previous Articles Next Articles
CHEN Dong(), LI Changlong, DU Zhenlong(
), SONG Shuang, LI Xiaoli
Received:
2024-08-30
Revised:
2025-01-05
Online:
2025-08-30
Published:
2025-08-11
Contact:
DU Zhenlong
About author:
First author contact:CHEN Dong (1978-), lecturer, master. His main research interests cover computer graphics and computer vision. E-mail:chendong@njtech.edu.cn
Supported by:
CLC Number:
CHEN Dong, LI Changlong, DU Zhenlong, SONG Shuang, LI Xiaoli. Intelligent depiction to illumination and shadow: robust video shadow extraction based on SAM[J]. Journal of Graphics, 2025, 46(4): 739-745.
Add to citation manager EndNote|Ris|BibTeX
URL: http://www.txxb.com.cn/EN/10.11996/JG.j.2095-302X.2025040739
方法 | 任务 | MAE↓ | F-measure↑ | IoU↑ | BER↓ | Temporal- level↑ |
---|---|---|---|---|---|---|
FPN | SP | 0.048 | 0.712 | 0.513 | 19.52 | 74.31 |
PSPNet | SP | 0.054 | 0.651 | 0.476 | 19.83 | 76.62 |
DSS | SOD | 0.049 | 0.703 | 0.503 | 19.84 | 75.04 |
MGA | SOD | 0.065 | 0.601 | 0.398 | 25.75 | 77.81 |
PDBM | VOS | 0.066 | 0.623 | 0.465 | 19.77 | 80.03 |
COSNet | VOS | 0.039 | 0.707 | 0.512 | 20.51 | 78.31 |
DSD | ISD | 0.046 | 0.701 | 0.516 | 19.89 | 74.65 |
FSD | ISD | 0.058 | 0.682 | 0.491 | 20.58 | 74.87 |
TVSD | VSD | 0.033 | 0.760 | 0.583 | 17.71 | 78.25 |
SC-Cor | VSD | 0.042 | 0.769 | 0.615 | 13.61 | 81.46 |
STICT | VSD | 0.046 | 0.702 | 0.640 | 16.60 | 79.61 |
SCOTCH | VSD | 0.029 | 0.793 | 0.672 | 9.07 | 80.31 |
本文方法 | VSD | 0.020 | 0.821 | 0.698 | 11.24 | 82.21 |
Table 1 Comparison experimental results
方法 | 任务 | MAE↓ | F-measure↑ | IoU↑ | BER↓ | Temporal- level↑ |
---|---|---|---|---|---|---|
FPN | SP | 0.048 | 0.712 | 0.513 | 19.52 | 74.31 |
PSPNet | SP | 0.054 | 0.651 | 0.476 | 19.83 | 76.62 |
DSS | SOD | 0.049 | 0.703 | 0.503 | 19.84 | 75.04 |
MGA | SOD | 0.065 | 0.601 | 0.398 | 25.75 | 77.81 |
PDBM | VOS | 0.066 | 0.623 | 0.465 | 19.77 | 80.03 |
COSNet | VOS | 0.039 | 0.707 | 0.512 | 20.51 | 78.31 |
DSD | ISD | 0.046 | 0.701 | 0.516 | 19.89 | 74.65 |
FSD | ISD | 0.058 | 0.682 | 0.491 | 20.58 | 74.87 |
TVSD | VSD | 0.033 | 0.760 | 0.583 | 17.71 | 78.25 |
SC-Cor | VSD | 0.042 | 0.769 | 0.615 | 13.61 | 81.46 |
STICT | VSD | 0.046 | 0.702 | 0.640 | 16.60 | 79.61 |
SCOTCH | VSD | 0.029 | 0.793 | 0.672 | 9.07 | 80.31 |
本文方法 | VSD | 0.020 | 0.821 | 0.698 | 11.24 | 82.21 |
Fig. 4 Comparison within results of video shadow detection produced by different methods ((a) Input; (b) PSPNet; (c) DSSt; (d) COSNet; (e) TVSD; (f) STICT; (g) SC-Cor; (h) Ours; (i) Ground truth)
方法 | 模型大小/MB | 计算复杂度 | 推理时间/min |
---|---|---|---|
TVSD | 243.32 | 158.89 | 32.4 |
STCIT | 104.68 | 40.99 | 13.5 |
SC-Cor | 232.63 | 218.40 | 21.8 |
SCOTCH | 211.79 | 122.46 | 9.2 |
本文方法 | 93.73 | 16.32 | 7.8 |
Table 2 Performance comparison with VSD method
方法 | 模型大小/MB | 计算复杂度 | 推理时间/min |
---|---|---|---|
TVSD | 243.32 | 158.89 | 32.4 |
STCIT | 104.68 | 40.99 | 13.5 |
SC-Cor | 232.63 | 218.40 | 21.8 |
SCOTCH | 211.79 | 122.46 | 9.2 |
本文方法 | 93.73 | 16.32 | 7.8 |
[1] | ZHU L, XU K, KE Z H, et al. Mitigating intensity bias in shadow detection via feature decomposition and reweighting[C]// 2021 IEEE/CVF International Conference on Computer Vision. New York: IEEE Press, 2021: 4682-4691. |
[2] | ZHU Y R, FU X Y, CAO C Z, et al. Single image shadow detection via complementary mechanism[C]// The 30th ACM International Conference on Multimedia. New York: ACM, 2022: 6717-6726. |
[3] | ZHANG X E, BARRON J T, TSAI Y T, et al. Portrait shadow manipulation[J]. ACM Transactions on Graphics (TOG), 2020, 39(4): 78. |
[4] | CHEN Z H, ZHU L, WAN L, et al. A multi-task mean teacher for semi-supervised shadow detection[C]// 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2020: 5610-5619. |
[5] | LIAO J W, LIU Y L, XING G Y, et al. Shadow detection via predicting the confidence maps of shadow detection methods[C]// The 29th ACM International Conference on Multimedia. New York: ACM, 2021: 704-712. |
[6] | LIN J H, WANG L S. Spatial-temporal fusion network for fast video shadow detection[C]// The 18th ACM SIGGRAPH International Conference on Virtual-Reality Continuum and Its Applications in Industry. New York: ACM, 2022: 2. |
[7] | 魏后胜, 黄雯嘉, 董琦, 等. 面向增强现实的移动视点下室外视频的阴影检测[J]. 计算机辅助设计与图形学学报, 2019, 31(6): 997-1006. |
WEI H S, HUANG W J, DONG Q, et al. Detecting shadows from outdoor videos under moving viewpoints for augmented reality[J]. Journal of Computer-Aided Design & Computer Graphics, 2019, 31(6): 997-1006 (in Chinese). | |
[8] | CHEN Z H, WAN L, ZHU L, et al. Triple-cooperative video shadow detection[C]// 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2021: 2714-2723. |
[9] | DING X P, YANG J W, HU X W, et al. Learning shadow correspondence for video shadow detection[C]// The 17th European Conference on Computer Vision. Cham: Springer, 2022: 705-722. |
[10] | LIU L H, PROST J, ZHU L, et al. SCOTCH and SODA: a transformer video shadow detection framework[C]// 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2023: 10449-10458. |
[11] | WEI H S, XING G Y, LIAO J W, et al. Structure-aware spatial-temporal interaction network for video shadow detection[C]// The 33rd International Joint Conference on Artificial Intelligence. Jeju: IJCAI, 2024: 158 |
[12] | 牟琦, 张寒, 何志强, 等. 基于深度估计和特征融合的尺度自适应目标跟踪算法[J]. 图学学报, 2021, 42(4): 563-571. |
MU Q, ZHANG H, HE Z Q, et al. Scale adaptive target tracking algorithm based on depth estimation and feature fusion[J]. Journal of Graphics, 2021, 42(4): 563-571 (in Chinese). | |
[13] | XU X H, WANG J L, LI X, et al. Reliable propagation- correction modulation for video object segmentation[C]// The 36th AAAI Conference on Artificial Intelligence. Palo Alto: AAAI, 2022: 2946-2954. |
[14] | KIRILLOV A, MINTUN E, RAVI N, et al. Segment anything[C]// IEEE/CVF International Conference on Computer Vision. New York: IEEE Press, 2023: 4015-4026. |
[15] | CHENG H K, SCHWING A G. XMem: long-term video object segmentation with an atkinson-shiffrin memory model[C]// The 17th European Conference on Computer Vision. Cham: Springer, 2022: 640-658. |
[16] | DEB K, SUNY A H. Shadow detection and removal based on YCbCr color space[J]. Smart Computing Review, 2014, 4(1): 23-33. |
[17] |
KHAN S H, BENNAMOUN M, SOHEL F, et al. Automatic shadow detection and removal from a single image[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(3): 431-446.
DOI PMID |
[18] |
曲海成, 佟畅, 刘万军. 注意力与多尺度融合的图像阴影去除算法[J]. 计算机工程与应用, 2022, 58(16): 234-241.
DOI |
QU H C, TONG C, LIU W J. Image shadow removal algorithm based on attention and multi-scale fusion[J]. Computer Engineering and Applications, 2022, 58(16): 234-241 (in Chinese).
DOI |
|
[19] | INOUE N, YAMASAKI T. Learning from synthetic shadows for shadow detection and removal[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2021, 31(11): 4187-4197. |
[20] | 仇栋, 吴云超, 李蔚清, 等. 面向移动增强现实的室外阴影实时检测技术[J]. 图学学报, 2022, 43(1): 85-92. |
QIU D, WU Y C, LI W Q, et al. Real time outdoor shadow detection technology for mobile augmented reality[J]. Journal of Graphics, 2022, 43(1): 85-92 (in Chinese). | |
[21] | ZHU X Z, DAI J F, YUAN L, et al. Towards high performance video object detection[C]// 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. New York: IEEE Press, 2018: 7210-7218. |
[22] | YANG Z X, WEI Y C, YANG Y. Collaborative video object segmentation by foreground-background integration[C]// The 16th European Conference on Computer Vision. Cham: Springer, 2020: 332-348. |
[23] | OH S W, LEE J Y, XU N, et al. Video object segmentation using space-time memory networks[C]// 2019 IEEE/CVF International Conference on Computer Vision. New York: IEEE Press, 2019: 9225-9234. |
[1] | GUO Mingce, HUANG Bei, CHENG Lechao, WANG Zhangye. Acceleration method for neural implicit surface reconstruction with joint point cloud priors [J]. Journal of Graphics, 2025, 46(4): 807-817. |
[2] | WANG Suqin, DU Yujie, SHI Min, ZHU Dengming. Detection of apparent defects in a small sample of industrial products with category imbalance [J]. Journal of Graphics, 2025, 46(3): 568-577. |
[3] | CUI Lisha, SONG Zhiwen, JIANG Xiaoheng, MA Xin, CHEN Enqing, XU Mingliang. An edge and sematic-aware segmentation network for defect detection [J]. Journal of Graphics, 2025, 46(3): 578-587. |
[4] | WANG Changchang, JIANG Kun, JIANG Kai, ZHANG Peng, SU Zhiyong. Feedback-based iterative sampling denoising framework for point clouds with high-level noise [J]. Journal of Graphics, 2025, 46(3): 614-624. |
[5] | LI Zhihuan, NING Xiaojuan, LV Zhiyong, SHI Zhenghao, JIN Haiyan, WANG Yinghui, ZHOU Wenming. DEMF-Net: dual-branch feature enhancement and multi-scale fusion for semantic segmentation of large-scale point clouds [J]. Journal of Graphics, 2025, 46(2): 259-269. |
[6] | LIU Gaoyi, HU Ruizhen, LIU Ligang. 3D Gaussian splatting semantic segmentation and editing based on 2D feature distillation [J]. Journal of Graphics, 2025, 46(2): 312-321. |
[7] | WANG Yaru, FENG Lilong, SONG Xiaoke, QU Zhuo, YANG Ke, WANG Qianming, ZHAI Yongjie. TFD-YOLOv8: a transmission line foreign object detection method [J]. Journal of Graphics, 2024, 45(5): 901-912. |
[8] | HAN Yazhen, YIN Mengxiao, MA Weizhao, YANG Shigeng, HU Jinfei, ZHU Congyang. DGOA: point cloud upsampling based on dynamic graph and offset attention [J]. Journal of Graphics, 2024, 45(1): 219-229. |
[9] | YAN Guang-wei, LIU Run-ze, JIAO Run-hai, HE Hui. Detection method of dropped anti-vibration hammer for transmission line based on improved Cascade RCNN [J]. Journal of Graphics, 2023, 44(5): 849-860. |
[10] | ZHANG Gui-mei, TAO Hui, LU Fei-fei, PENG Kun. Domain adaptive urban scene semantic segmentation based on dual-source discriminator [J]. Journal of Graphics, 2023, 44(5): 907-917. |
[11] | ZHAO Zhen-bing, MA Di-ya, SHI Ying, Li Gang. Appearance defect detection algorithm of substation instrument based on improved YOLOX [J]. Journal of Graphics, 2023, 44(5): 937-946. |
[12] | ZHANG Yun-peng, ZHOU Pu-cheng, XUE Mo-gen. Snow removal in video based on low-rank tensor decomposition and non-subsampled shearlet transform [J]. Journal of Graphics, 2023, 44(5): 947-954. |
[13] | LIU Jian-xiu, SU Wen-zhe, SU Zhi-dong. A comparation method of BIM model based on the shape context description of contour key points [J]. Journal of Graphics, 2023, 44(5): 1034-1040. |
[14] | HU Xin, ZHOU Yun-qiang, XIAO Jian, YANG Jie. Surface defect detection of threaded steel based on improved YOLOv5 [J]. Journal of Graphics, 2023, 44(3): 427-437. |
[15] | WU Wen-huan, ZHANG Hao-kun. Semantic segmentation with fusion of spatial criss-cross and channel multi-head attention [J]. Journal of Graphics, 2023, 44(3): 531-539. |
Viewed | ||||||
Full text |
|
|||||
Abstract |
|
|||||