
Journal of Graphics, 2022, Vol. 43, Issue (3): 530-536. DOI: 10.11996/JG.j.2095-302X.2022030530

• BIM/CIM

Disaster tweets classification method based on pretrained BERT model

  

  1. Department of Civil Engineering, Tsinghua University, Beijing 100084, China
  • Online: 2022-06-30 Published: 2022-06-28
  • Supported by:
    National Natural Science Foundation of China (72091512, 51908323)

Abstract:

Social media has become an important medium for releasing and disseminating disaster information, and identifying and using that information effectively is of great significance to disaster emergency management. To address the shortcomings of traditional text classification models, this study proposes a disaster tweet classification method based on the pre-trained bidirectional encoder representations from transformers (BERT) model. After data cleaning and preprocessing, a BERT-based text classification model with a long short-term memory-convolutional neural network (LSTM-CNN) was constructed through comparative analysis. Experiments on the tweet datasets of the Kaggle competition platform showed that the proposed model outperforms both the traditional naive Bayes classifier and the common fine-tuning model, with a recognition rate of up to 85%. This study could help improve the accuracy of identifying real disaster information and the efficiency of disaster emergency response.
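
The abstract mentions data cleaning and preprocessing but does not list the steps. As a rough illustration only, the sketch below shows one plausible cleaning pass for tweets before tokenization. The function name `clean_tweet` and every rule in it (dropping URLs and @-mentions, keeping hashtag words, lowercasing, stripping punctuation) are assumptions chosen for this example, not the authors' documented pipeline.

```python
import html
import re

# All rules below are illustrative assumptions, not the paper's actual steps.
URL_RE = re.compile(r"https?://\S+|www\.\S+")   # web links
MENTION_RE = re.compile(r"@\w+")                # user mentions such as @name
NON_TEXT_RE = re.compile(r"[^a-z0-9\s]")        # anything but letters, digits, spaces


def clean_tweet(text: str) -> str:
    """Return a lowercased tweet with links, mentions, and punctuation removed."""
    text = html.unescape(text)          # turn entities such as &amp; into characters
    text = URL_RE.sub(" ", text)        # drop links
    text = MENTION_RE.sub(" ", text)    # drop @-mentions
    text = text.replace("#", " ")       # keep the hashtag word, drop the symbol
    text = text.lower()
    text = NON_TEXT_RE.sub(" ", text)   # drop remaining punctuation and symbols
    return " ".join(text.split())       # collapse runs of whitespace


if __name__ == "__main__":
    sample = "Forest fire near La Ronge Sask. Canada http://t.co/abc #wildfire @user &amp; more"
    print(clean_tweet(sample))
```

Whether hashtags, casing, or punctuation should be kept depends on the tokenizer and model variant used downstream, so these choices would need to be checked against the full paper.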

Key words: text classification, deep learning, BERT, pre-trained model, fine-tuning, disaster, emergency management
