Documente Academic
Documente Profesional
Documente Cultură
necessity.
Abstract — SMS spam refers to undesired text message. SMS spam is a problem that doesn’t have clear and
Machine Learning methods for anti-spam filters have been simple solution yet and many efforts have been made to
noticeably effective in categorizing spam messages. Dataset make a model that will detect (classify) SMS spam.
used in this research is known as Tiago’s dataset. Crucial step
in the experiment was data preprocessing, which involved
Although these models can be helpful, there are still
reducing text to lower case, tokenization, removing opportunities for further enhancements. In this paper an
stopwords. Convolutional Neural Network was the proposed experiment was conducted in order to classify spam and
method for classification. Overall model’s accuracy was non-spam messages, by using Convolutional Neural
98.4%. Obtained model can be used as a tool in many Network.
applications. This research confirms that CNN can be used to make a
model for spam detection that categorize messages with
Keywords — CNN, Cost-sensitive classification,
Imbalanced dataset, Machine Learning, SMS Spam high accuracy. Moreover, adjusted model can be applied to
sentiment analysis, text categorization, or spam detecting
I. INTRODUCTION in other types of communication media such as emails,