UMSurabaya Repository

Comparative Analysis of Text Classification Using Naive Bayes and Support Vector Machine in Detecting Negative Content in Indonesian Twitter

Andana, Erie Kresna and Othman, Muhaini and Ibrahim, Rosziati (2019) Comparative Analysis of Text Classification Using Naive Bayes and Support Vector Machine in Detecting Negative Content in Indonesian Twitter. Comparative Analysis of Text Classification Using Naive Bayes and Support Vector Machine in Detecting Negative Content in Indonesian Twitter, 8 (1.3). pp. 356-362. ISSN 2278-3091

[img] PDF (Hasil Cek Plagiasi) - Supplemental Material
Download (2MB)
[img] PDF (Artikel Jurnal) - Published Version
Download (148kB)

Abstract

Research on the detection of social media content, especially Twitter, has been done. Twitter content detection is based on classifying content or words made by users (tweets) into two groups, namely positive and negative. Research to detect negative content or harsh words in Indonesian tweets is still rare. There are several studies that have been conducted, the detection of negative words is only limited to certain categories, such as pornography, hate speech, and others, so that if the negative word only includes one category, then if there are other negative words that do not belong to that category, this word will not be detected. This is a challenge for researchers to classify texts in Indonesian. In some research, to be able to detect and separate negative words and positive words in Indonesian, the Naive Bayes (NB) and Support Vector Machine (SVM) proved to produce better performance among other algorithms. Therefore, this paper aims to analyze the comparison of the results that have been achieved about detecting negative content on Indonesian twitter using NB and SVM. First, this paper will briefly explain the NB and SVM, then proceed with an explanation of the general research framework that has been carried out. In the results and discussion section, a comparison of the results achieved by existing researchers is explained. And based on these results, another approach will be proposed to detect negative content on Indonesian twitter.

Item Type: Article
Uncontrolled Keywords: Indonesian twitter, text classification, Naive Bayes, Support Vector Machine
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Jurnal > Fakultas Teknik
Depositing User: ERIE KRESNA ANDANA
Date Deposited: 05 May 2023 06:47
Last Modified: 05 May 2023 06:47
URI: http://repository.um-surabaya.ac.id/id/eprint/6960

Actions (login required)

View Item View Item