Vietnamese SMS Spam Detection with Deep Learning and Pre-trained Language Model

Authors

  • Vu Minh Tuan, Hanoi University

Keywords:

deep learning neural network, PhoBERT, SMS spam, spam detection, transfer learning, Vietnamese spam

Abstract

Despite the strong growth of OTT messaging applications and social networks, the Short Message Service (SMS) retains its role in the marketing industry. As a highly effective and cost-saving advertising tool, SMS has also given rise to SMS spam. To contribute to the fight against SMS spam, we propose a model that combines a deep learning neural network with PhoBERT, a pre-trained language model that is a variant of BERT. Making full use of the Vietnamese pre-training data, the proposed model achieves an accuracy of 99.53% in detecting Vietnamese spam messages.

Published

2022-06-30