Welcome to Journal of Graphics share: 

Journal of Graphics ›› 2022, Vol. 43 ›› Issue (4): 685-694.DOI: 10.11996/JG.j.2095-302X.2022040685

• Computer Graphics and Virtual Reality • Previous Articles     Next Articles

A visual analysis approach for domain literature data based on word representation model

  

  1. Beijing Key Laboratory of Big Data Technology for Food Safety, School of Computer Science and Engineering, Beijing Technology and Business University, Beijing 100048, China
  • Online:2022-08-31 Published:2022-08-15
  • Contact: CEHN Yi (1963), professor, Ph.D. Her main research interests cover visualization, visual analysis, machine learning etc
  • Supported by:
    National Natural Science Foundation of China (61972010); National Key R&D Program of China (2018YFC1603602)

Abstract:

With the development of science and technology, scientific literature is mounting to an increasingly large scale. How to quickly and accurately seek the research topics, influential scholars, and high-level papers in a specific domain from the vast amount of publications remains an enormous challenge. The visual analysis method for domain literature data based on word representation model employed word2vec to recommend domain-related keywords by the similarity between word vectors, and filters the domain-related papers according to these keywords. Then it utilized the BERTopic model to extract topics from the abstracts of domain papers. Next, the values for paper impact were calculated using PageRank, and the values for author influence were calculated using Author-Rank, the author impact evaluation method, taking into account the order of authorship, the number of publications, and the impact of papers. Finally, the multi-view collaborative and interactive visualization approach could help researchers gain a quick understanding and analysis of specific areas from multiple perspectives, such as topics word frequency, topics evolution, literature impact, citation relationships, and author impact. The method can be applied to literature data analysis in the field of “food safety”, and the results and user tests can validate this method.

Key words: visualization, bibliometric analysis, word2vec, BERTopic, Author-Rank, food safety

CLC Number: