Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2025, Vol. 46 ›› Issue (5): 980-989.DOI: 10.11996/JG.j.2095-302X.2025050980

• Image Processing and Computer Vision • Previous Articles     Next Articles

PanoLoRA: an efficient finetuning method for panoramic image generation based on Stable Diffusion

YE Wenlong1,3(), CHEN Bin2,3()   

  1. 1 School of Earth and Space Sciences, Peking University, Beijing 100871, China
    2 School of Computer Science, Peking University, Beijing 100871, China
    3 National Key Laboratory of Intelligent Parallel Technology, Beijing 100871, China
  • Received:2024-12-11 Accepted:2025-02-20 Online:2025-10-30 Published:2025-09-10
  • Contact: CHEN Bin
  • About author:First author contact:

    YE Wenlong (2000-), master student. His main research interest covers diffusion model. E-mail:2397726787@qq.com

  • Supported by:
    Fund of National Key Laboratory(2024JK19)

Abstract:

Panoramic images, which can express the overall information of the surrounding environment, have become an important way to construct virtual scenes. However, amidst the rise of artificial intelligence generated content (AIGC) technology, especially diffusion models trained on large-scale text image datasets and parameter-efficient fine-tuning (PEFT) techniques, research on the generation and rapid transfer of panoramic images is still insufficient. To address the challenges posed by the scarcity and spatial distortion of panoramic image datasets, 14 000 open-source panoramic image datasets were collected, finely annotated, and filtered through projection transformation. Based on this, the PanoLoRA method was proposed. In the process of extracting spatial features from the original convolution and self-attention modules, PanoLoRA additionally incorporated spherical convolution and LoRA (low-rank adaptation) modules. This enabled the explicit extraction of spherical features from panoramic images, which were then fused with the original planar features, thereby achieving efficient transfer learning for panoramic image generation while retaining the strong image generation ability of Stable Diffusion. The experimental results demonstrated that PanoLoRA outperformed the latest 5 Parameter-Efficient Fine-Tuning methods in comparison tests using the collected text panoramic image dataset, achieving comprehensive advantages and improving the quality of image generation and graphic consistency. A series of ablation experiments were conducted to verify the effectiveness of each algorithm module.

Key words: diffusion model, panoramic image, parameter-efficient fine-tuning, transfer learning, LoRA

CLC Number: