Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2025, Vol. 46 ›› Issue (4): 739-745.DOI: 10.11996/JG.j.2095-302X.2025040739

• Image Processing and Computer Vision • Previous Articles     Next Articles

Intelligent depiction to illumination and shadow: robust video shadow extraction based on SAM

CHEN Dong(), LI Changlong, DU Zhenlong(), SONG Shuang, LI Xiaoli   

  1. College of Computer and Information Engineering (College of Artificial Intelligence), Nanjing Tech University, Nanjing Jiangsu 211816, China
  • Received:2024-08-30 Revised:2025-01-05 Online:2025-08-30 Published:2025-08-11
  • Contact: DU Zhenlong
  • About author:First author contact:

    CHEN Dong (1978-), lecturer, master. His main research interests cover computer graphics and computer vision. E-mail:chendong@njtech.edu.cn

  • Supported by:
    National Natural Science Foundation of China(62202221);National Natural Science Foundation of China(61672279)

Abstract:

A video shadow detection method based on the segmented anything model (SAM) is proposed to address the problem of low accuracy and robustness of traditional methods in handling complex and dynamic shadows caused by lighting variations and object occlusions.. The SAM decoder is fine tuned to better adopt to shadow detection, leveraging SAM’s accurate segmentation ability to extract shadow area in key frames, XMem model, incorporatingsensory memory, short-term memory, and long-term memory, is introduced to integrate information from adjacent frames, thereby optimizing and stabilizing shadow detection results. Experimental results show that the proposed method reduces the mean absolute error by approximately 31.8% and improves the intersection over-union ratio by about 19.7% compared to traditional approaches. Both qualitative and quantitative evaluations indicate that the proposed method not only improves the accuracy of video shadow detection but also exhibits superior robustness.

Key words: video shadow detection, semantic segmentation, VOS, SAM, XMem

CLC Number: