Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2025, Vol. 46 ›› Issue (3): 588-601.DOI: 10.11996/JG.j.2095-302X.2025030588

• Computer Graphics and Virtual Reality • Previous Articles     Next Articles

CLIP-based semantic offset transferable attacks on 3D point clouds

MA Yang1(), HUANG Lujie1, PENG Weilong2, WU Zhize3, TANG Keke1(), FANG Meie2   

  1. 1. Cyberspace Institute of Advanced Technology, Guangzhou University, Guangzhou Guangdong 510006, China
    2. School of Computer Science and Cyber Engineering, Guangzhou University, Guangzhou Guangdong 510006, China
    3. School of Artificial Intelligence and Big Data, Hefei University, Hefei Anhui 230601, China
  • Received:2024-09-30 Accepted:2024-12-17 Online:2025-06-30 Published:2025-06-13
  • Contact: TANG Keke
  • About author:First author contact:

    MA Yang (1998-), master student. His main research interest covers computer vision security. E-mail:gzhumayang@gmail.com

  • Supported by:
    National Natural Science Foundation of China(62102105)

Abstract:

Deep learning-based 3D point cloud understanding has received increasing attention in various applications such as autonomous driving, robotics, surveillance, etc., and the study of adversarial attacks on point cloud deep learning models helps to evaluate and improve their adversarial robustness. However, most of the existing attack methods are aimed at white-box attacks, generating adversarial samples that have very low success rate and are easily defensible against transferable attacks on black-box models with unknown model parameters. These methods only consider optimization in the geometric space to mislead specific classifiers and fail to essentially change the deep intrinsic semantic structure of point cloud data, resulting in their limited ability to transferable attacks under different classifiers. To address these issues, the proposed algorithm leveraged the rich semantic comprehension capability of large multimodal models to incorporate the semantic information of the point clouds into the attack, thereby ensuring that the adversarial samples diverged significantly from the original semantic attributes to a remarkable extent to enhance transferability. In addition, considering that the current adversarial samples with high attack transferability often exhibited insufficient imperceptibility, the algorithm integrated the above semantic adversarial attack into the spectral domain space, achieving a delicate balance between transferability and imperceptibility. Extensive evaluations demonstrated the 3D CLIP-based semantic offset attack (3DCLAT) can significantly improve the transferability of the adversarial samples and is more robust to defense methods.

Key words: CLIP, point cloud, adversarial attack, attack transferability, spectral domains

CLC Number: