CYBERBULLYING DETECTION IN SOCIAL MEDIA USING PRE-TRAINED LANGUAGE MODELS

Jasmeen Kah Ying Bong; Kasturi Dewi  Varathan; Teoh Hwai  Teng

doi:10.22452/mjcs.vol38no1.2

Authors

Jasmeen Kah Ying Bong Department of Information Systems, Faculty of Computer Science & Information Technology, Universiti Malaya, 50603 Kuala Lumpur, Malaysia
Kasturi Dewi Varathan Department of Information Systems, Faculty of Computer Science & Information Technology, Universiti Malaya, 50603 Kuala Lumpur, Malaysia
Teoh Hwai Teng Department of Information Systems, Faculty of Computer Science & Information Technology, Universiti Malaya, 50603 Kuala Lumpur, Malaysia

DOI:

https://doi.org/10.22452/mjcs.vol38no1.2

Keywords:

Cyberbullying Detection, Transfer Learning, Pre-trained Language Models, AMiCA Dataset, Text Classification

Abstract

The rapid integration of Information and Communication Technologies (ICT) has revolutionized online communication, yet it has also led to the emergence of cyberbullying, a harmful digital behaviour. This study addresses the urgency of combating cyberbullying and its negative impacts by using advanced pre-trained language models (PLMs) through transfer learning in detecting cyberbullying in social media. The goal is to enhance cyberbullying detection's effectiveness to create safer online spaces. Cyberbullying detection model using transfer learning, DistilBERT, DistilELECTRA, and MiniLM PLMs were explored. The PLMs' evaluation using the AMiCA dataset, MiniLM achieves the highest performance in detecting cyberbullying, with an accuracy of 97.84% in cross-validation and 98.57% in hold-out testing, while DistilBERT and DistilELECTRA also perform well, achieving accuracies of 97.34% and 98.03%, and 97.58% and 92.97%, respectively. MiniLM consistently maintains competitive F-measures, addressing class imbalance. Overall, MiniLM stands out with high accuracy and micro F1-scores, outperforming other models. Comparative analysis reaffirms MiniLM's excellence in binary classes and overall evaluation showcasing the effectiveness of transfer learning compared to previous studies. In conclusion, this study demonstrates the capabilities of PLMs for cyberbullying detection and suggests future research directions

Downloads

Download data is not yet available.

CYBERBULLYING DETECTION IN SOCIAL MEDIA USING PRE-TRAINED LANGUAGE MODELS

Authors

DOI:

Keywords:

Abstract

Downloads

Downloads

Published

How to Cite

Issue

Section

Most read articles by the same author(s)

Editorial Information

Scope

Submission Guidelines

Indexing

Article Publication Charge

Journal Template

Special Issue

In Press Publication

Awards

Information

Conference

Articles

Top Cited Articles

Most View Articles

Publishing Timeline