Please use this identifier to cite or link to this item: http://repository.aaup.edu/jspui/handle/123456789/3704
Title: AraBERT for Arabic cyberbullying detection in Facebook comments
Authors: Hamarsheh, Mohammad M N$AAUP$Palestinian
Hithnawi, Rania Ibrahim$AAUP$Palestinian
Maree, MOhammed$AAUP$Palestinian
Keywords: AraBERT
cyberbullying detection
deep learning
natural language processing
neural network layers freezing
social media platforms
pre-trained language models
Issue Date: 4-Nov-2025
Publisher: Journal of Cybersecurity
Citation: R. I. Hithnawi, M. M. N. Hamarsheh and M. Maree, "AraBERT for Arabic cyberbullying detection in Facebook comments", Journal of Cybersecurity 2025 Vol. 11 Issue 1 Pages tyaf030 DOI: 10.1093/cybsec/tyaf030
Series/Report no.: Vol. 11 Issue 1;tyaf030
Abstract: Cyberbullying is a significant issue on social media platforms. It poses serious emotional consequences and harassment to victims. Conventional pre-trained language models, such as Bidirectional Encoder Representations from Transformers (BERT), have achieved significant success in detecting cyberbullying through the analysis of natural language texts, especially with resource-rich languages such as English. However, for low-resource languages, such as Arabic, there has been limited attention given to the detection of cyberbullying. This research investigates the effectiveness of Arabic BERT (AraBERT), a pre-trained language model, for detecting Arabic cyberbullying comments. It also explores the trade-off between computational resources and model performance through various fine-tuning and freezing strategies. From an initial pool of >40 000 collected comments, we constructed a high-quality, balanced dataset of 20 000 Facebook comments written in Arabic. This subset was then manually labeled as either bullying or non-bullying to ensure data reliability and to facilitate robust model training. We employed fine-tuning techniques to adapt AraBERTv2 to the cyberbullying detection task. Through experimentation with layer freezing technique, we explored the trade-off between leveraging pre-trained knowledge and adapting the model to the specific task. Our findings demonstrate that fine-tuning all layers of AraBERTv2, which involves adjusting the weights and biases of each layer during training, achieved the highest performance. This approach offers a flexible method for applying a pre-trained model to new problems, resulting in an accuracy of 91.9% and an F1 score of 92.8%.
URI: http://repository.aaup.edu/jspui/handle/123456789/3704
Appears in Collections:Faculty & Staff Scientific Research publications

Files in This Item:
File Description SizeFormat 
tyaf030.pdf1.29 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Admin Tools