Preprint has been submitted for publication in journal
Preprint / Version 1

Ensemble Machine Learning to Detect Sarcams in English on Twitter Social Media

Ensemble Machine Learning untuk Mendeteksi Sarkasme Dalam Bahasa Inggris pada Sosial Twitter

##article.authors##

  • Muhammad Arginanta Kafi Sambada Universitas Muhammadiyah Sidoarjo image/svg+xml
  • Mochammad Alfan Rosid Universitas Muhammadiyah Sidoarjo image/svg+xml

DOI:

https://doi.org/10.21070/ups.746

Keywords:

Sarcasm, Sosial Media, Detection Sarcasme, Machine Learning, Sentiment Analysis

Abstract

Sarcasm on social media is often used to mock or hurt someone using language or words that appear positive but have a negative meaning. This poses a challenge in sentiment analysis on social media because it's difficult to detect sarcasm even for humans.  The aim of this study is to compare the performance of four machine learning methods, namely Logistic Regression, Naive Bayes, Decision Tree, and Support Vector Machine, in detecting sarcastic sentences in news headlines. The dataset used contains news headlines in English. The results show that the Support Vector Machine method has the best performance with a model of 20 that has an accuracy score of 80% and an f1-score of 80%, compared to the Logistic Regression, Naive Bayes, and Decision Tree methods. Therefore, it can be concluded that the Support Vector Machine method is the best solution for detecting sarcasm.

Downloads

Download data is not yet available.

References

Y. V. Aritonang, D. P. Napitupulu, M. H. Sinaga, and J. Amalia, “Pengaruh Hyperparameter pada Fasttext

terhadap Performa Model Deteksi Sarkasme Berbasis Bi-LSTM,” JATISI (Jurnal Tek. Inform. dan Sist.

Informasi), vol. 9, no. 3, pp. 2612–2625, 2022, doi: 10.35957/jatisi.v9i3.1331.

Y. Yunitasari, A. Musdholifah, and A. K. Sari, “Sarcasm Detection For Sentiment Analysis in Indonesian

Tweets,” IJCCS (Indonesian J. Comput. Cybern. Syst., vol. 13, no. 1, p. 53, 2019, doi: 10.22146/ijccs.41136.

A. Muhaddisi, B. N. Prastowo, D. Utami, and K. Putri, “Sentiment Analysis With Sarcasm Detection On P

olitician ’ s Instagram,” vol. 15, no. 4, pp. 349–358, 2021.

V. Govindan and V. Balakrishnan, “A machine learning approach in analysing the effect of hyperboles using

negative sentiment tweets for sarcasm detection,” J. King Saud Univ. - Comput. Inf. Sci., vol. 34, no. 8, pp.

–5120, 2022, doi: 10.1016/j.jksuci.2022.01.008.

F. Ugm and F. Ugm, “Analisis Sentimen Twitter untuk Teks Berbahasa Indonesia dengan Maximum Entropy

dan Support Vector Machine,” vol. 8, no. 1, pp. 91–100, 2014.

A. F. Hidayatullah et al., “Analisis sentimen dan klasifikasi kategori terhadap tokoh publik pada twitter,” vol.

, no. semnasIF, pp. 115–122, 2014.

P. Arsi and R. Waluyo, “Analisis Sentimen Wacana Pemindahan Ibu Kota Indonesia Menggunakan Algoritma

Support Vector Machine (SVM),” J. Teknol. Inf. dan Ilmu Komput., vol. 8, no. 1, p. 147, 2021, doi:

25126/jtiik.0813944.

A. Syahadati, N. C. Lengkong, O. Safitri, S. Machsus, Y. R. Putra, and R. Nooraeni, “ANALISIS SENTIMEN

PENERAPAN PSBB DI DKI JAKARTA DAN DAMPAKNYA TERHADAP PERGERAKAN IHSG,” vol.

, no. 1, pp. 20–25, 2021.

M. Shandy, T. Putra, and Y. Azhar, “Perbandingan Model Logistic Regression dan Artificial Neural Network

pada Prediksi Pembatalan Hotel,” vol. 6, no. 1, pp. 29–37, 2021.

R. Rahmanda and D. S. Informasi, “Rancang bangun aplikasi berbasis microservice untuk klasifikasi

sentimen. studi kasus: pt. yesboss group indonesia (kata.ai),” 2018.

A. Setiawan, L. W. Santoso, R. Adipranata, U. K. Petra, and J. Siwalankerto, “Klasifikasi Artikel Berita

Bahasa Indonesia Dengan Naive Bayes Classifier,” pp. 3–8.

U. Verawardina, F. Edi, and R. Watrianthos, “Analisis Sentimen Pembelajaran Daring Pada Twitter di Masa

Pandemi COVID-19 Menggunakan Metode Naïve Bayes,” vol. 5, pp. 157–163, 2021, doi:

30865/mib.v5i1.2604.

A. Subekti, “Analisis Sentiment pada Ulasan Film Dengan Optimasi Ensemble Learning,” vol. 7, no. 1, pp.

–8, 2020.

M. Ma, A. Prayogo, P. Subarkah, and F. Nida, “Sentiment analysis of customer satisfaction levels on

smartphone products using Ensemble Learning,” vol. 14, no. 3, pp. 339–347, 2022.

J. Nasional, S. Informasi, M. Kamil, T. Endra, and E. Tju, “Naïve Bayes dan Confusion Matrix untuk Efisiensi

Analisa Intrusion Detection System Alert,” vol. 02, pp. 81–88, 2022.

Posted

2023-04-11