Sentiment Classification Analysis on Phone Identity Card Data Leak Issues on Twitter

Muh Ichlasul Amal; Elsa Syafira Rahmasita; Edward Suryaputra; Nur Aini Rakhmawati

doi:10.28932/jutisi.v8i3.5483

Published: Dec 21, 2022

Keywords:

IndoBERT, Logistic Regression, Random Forest, SIM Card Data Leak, Support-Vector Machine

Muh Ichlasul Amal

Institut Teknologi Sepuluh Nopember

Elsa Syafira Rahmasita

Institut Teknologi Sepuluh Nopember

Edward Suryaputra

Institut Teknologi Sepuluh Nopember

Nur Aini Rakhmawati

Institut Teknologi Sepuluh Nopember

Abstract

Technological developments bring serious threats to the privacy and security of personal data. In September 2022, 1.3 billion SIM card registration records containing users' personal data were leaked and uploaded to the dark web. Indonesians voiced their opinions on the incident on Twitter. This study examines the word distribution and the sentiment classification of public opinion on Twitter about the issue. Sentiment classification was carried out with a machine learning approach using four methods: Random Forest, Logistic Regression, Support-Vector Machine, and the IndoBERT model. The four methods are compared to determine which performs best. Crawling yielded 957 tweets, of which 609 were labeled and used to train the four models. The labeled data are imbalanced: the positive class contains far fewer tweets than the other classes. Terms that appear frequently in the tweets include SIM card, data SIM, bocor data ("data leak"), miliar data ("billion data"), and kominfo (the Ministry of Communication and Informatics). The Support-Vector Machine performs best with an F1-score of 0.81, followed by Random Forest at 0.78, IndoBERT at 0.76, and Logistic Regression at 0.74. Class imbalance and the small amount of training data leave IndoBERT's performance below that of the Support-Vector Machine and Random Forest. The authorities can use these results to evaluate policies on data security issues by taking the opinions of the Indonesian public into account.
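The abstract reports F1-scores on class-imbalanced data but does not say how the score was averaged across classes. As an illustration only, the following minimal sketch computes per-class, macro-averaged, and support-weighted F1 in plain Python. The function names, the three class labels, and the toy label lists are assumptions made for this example; they are not the study's data or code.

```python
from collections import Counter


def f1_per_class(y_true, y_pred):
    """Return {label: F1} for every label seen in y_true or y_pred."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted p, but the true label was different
            fn[t] += 1  # true label t was missed
    scores = {}
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1: every class counts equally."""
    scores = f1_per_class(y_true, y_pred)
    return sum(scores.values()) / len(scores)


def weighted_f1(y_true, y_pred):
    """Mean of per-class F1 weighted by each class's share of y_true."""
    scores = f1_per_class(y_true, y_pred)
    support = Counter(y_true)
    return sum(scores[label] * n for label, n in support.items()) / len(y_true)


# Hypothetical, imbalanced toy labels (NOT from the study):
# the rare "positive" class is never predicted correctly.
y_true = ["negative"] * 6 + ["neutral"] * 3 + ["positive"] * 1
y_pred = ["negative"] * 6 + ["neutral"] * 2 + ["negative"] * 2

if __name__ == "__main__":
    print(f1_per_class(y_true, y_pred))
    print("macro F1:   ", round(macro_f1(y_true, y_pred), 4))
    print("weighted F1:", round(weighted_f1(y_true, y_pred), 4))
```

On this toy input the macro average is noticeably lower than the weighted average, because the missed minority class counts as much as the majority classes. This is one reason the averaging scheme matters when comparing models on imbalanced sentiment data.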

How to Cite

[1] M. I. Amal, E. S. Rahmasita, E. Suryaputra, and N. A. Rakhmawati, “Sentiment Classification Analysis On Phone Identity Card Data Leaks Issues On Twitter”, JuTISI, vol. 8, no. 3, pp. 645 –, Dec. 2022.

Issue

Vol. 8 No. 3 (2022): JuTISI

Section

Articles

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium.
