CYBERBULLYING DETECTION IN SOCIAL MEDIA USING PRE-TRAINED LANGUAGE MODELS

Authors

  • Jasmeen Kah Ying Bong Department of Information Systems, Faculty of Computer Science & Information Technology, Universiti Malaya, 50603 Kuala Lumpur, Malaysia
  • Kasturi Dewi Varathan Department of Information Systems, Faculty of Computer Science & Information Technology, Universiti Malaya, 50603 Kuala Lumpur, Malaysia
  • Teoh Hwai Teng Department of Information Systems, Faculty of Computer Science & Information Technology, Universiti Malaya, 50603 Kuala Lumpur, Malaysia

DOI:

https://doi.org/10.22452/mjcs.vol38no1.2

Keywords:

Cyberbullying Detection, Transfer Learning, Pre-trained Language Models, AMiCA Dataset, Text Classification

Abstract

The rapid integration of Information and Communication Technologies (ICT) has revolutionized online communication, yet it has also led to the emergence of cyberbullying, a harmful digital behaviour. This study addresses the urgency of combating cyberbullying and its negative impacts by using advanced pre-trained language models (PLMs) through transfer learning in detecting cyberbullying in social media. The goal is to enhance cyberbullying detection's effectiveness to create safer online spaces.  Cyberbullying detection model using transfer learning, DistilBERT, DistilELECTRA, and MiniLM PLMs were explored. The PLMs' evaluation using the AMiCA dataset, MiniLM achieves the highest performance in detecting cyberbullying, with an accuracy of 97.84% in cross-validation and 98.57% in hold-out testing, while DistilBERT and DistilELECTRA also perform well, achieving accuracies of 97.34% and 98.03%, and 97.58% and 92.97%, respectively. MiniLM consistently maintains competitive F-measures, addressing class imbalance. Overall, MiniLM stands out with high accuracy and micro F1-scores, outperforming other models. Comparative analysis reaffirms MiniLM's excellence in binary classes and overall evaluation showcasing the effectiveness of transfer learning compared to previous studies. In conclusion, this study demonstrates the capabilities of PLMs for cyberbullying detection and suggests future research directions

Downloads

Download data is not yet available.

Downloads

Published

2025-03-30

How to Cite

Bong, J. K. Y., Varathan, K. D. ., & Teng, T. H. . (2025). CYBERBULLYING DETECTION IN SOCIAL MEDIA USING PRE-TRAINED LANGUAGE MODELS. Malaysian Journal of Computer Science, 38(1), 29–54. https://doi.org/10.22452/mjcs.vol38no1.2