Deep learning has shown great potential in the field of computer vision, but its performance in practical applications relies heavily on large amounts of high-quality labeled data. Generative models, with their ability to generate diverse data, have become an effective solution to the problem of data scarcity, aiming to provide training data for computer vision efficiently and effectively. Consequently, image data augmentation techniques based on generative models have become a popular research direction in recent years. To this end, a comprehensive literature review was conducted on image data augmentation methods based on generative models. Through a three-stage retrieval process, 37 relevant studies were collected. The methodological processes of these studies were summarized into four main steps, with each step categorized and described in detail. First, various generative models suitable for image data augmentation were introduced, focusing on model selection. Next, generative image data augmentation methods were classified, with elaborations on the workflow, representative studies, existing challenges, and areas in need of optimization for each category. Considering that generated data may contain noise, methods were also discussed for the selection and processing of generated data to better utilize them in downstream tasks. Furthermore, evaluation methods were categorized and described to comprehensively verify the effectiveness and robustness of data augmentation approaches. Finally, the opportunities and challenges faced by generative image data augmentation in aspects were elaborated upon, such as maintaining semantic consistency, ensuring diversity, improving generation efficiency, and applying to black-box models, and pointed out potential directions for future exploration.