Please use this identifier to cite or link to this item: http://repository.aaup.edu/jspui/handle/123456789/3704
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHamarsheh, Mohammad M N$AAUP$Palestinian-
dc.contributor.authorHithnawi, Rania Ibrahim$AAUP$Palestinian-
dc.contributor.authorMaree, MOhammed$AAUP$Palestinian-
dc.date.accessioned2025-11-30T18:34:17Z-
dc.date.available2025-11-30T18:34:17Z-
dc.date.issued2025-11-04-
dc.identifier.citationR. I. Hithnawi, M. M. N. Hamarsheh and M. Maree, "AraBERT for Arabic cyberbullying detection in Facebook comments", Journal of Cybersecurity 2025 Vol. 11 Issue 1 Pages tyaf030 DOI: 10.1093/cybsec/tyaf030en_US
dc.identifier.urihttp://repository.aaup.edu/jspui/handle/123456789/3704-
dc.description.abstractCyberbullying is a significant issue on social media platforms. It poses serious emotional consequences and harassment to victims. Conventional pre-trained language models, such as Bidirectional Encoder Representations from Transformers (BERT), have achieved significant success in detecting cyberbullying through the analysis of natural language texts, especially with resource-rich languages such as English. However, for low-resource languages, such as Arabic, there has been limited attention given to the detection of cyberbullying. This research investigates the effectiveness of Arabic BERT (AraBERT), a pre-trained language model, for detecting Arabic cyberbullying comments. It also explores the trade-off between computational resources and model performance through various fine-tuning and freezing strategies. From an initial pool of >40 000 collected comments, we constructed a high-quality, balanced dataset of 20 000 Facebook comments written in Arabic. This subset was then manually labeled as either bullying or non-bullying to ensure data reliability and to facilitate robust model training. We employed fine-tuning techniques to adapt AraBERTv2 to the cyberbullying detection task. Through experimentation with layer freezing technique, we explored the trade-off between leveraging pre-trained knowledge and adapting the model to the specific task. Our findings demonstrate that fine-tuning all layers of AraBERTv2, which involves adjusting the weights and biases of each layer during training, achieved the highest performance. This approach offers a flexible method for applying a pre-trained model to new problems, resulting in an accuracy of 91.9% and an F1 score of 92.8%.en_US
dc.description.sponsorshipNoneen_US
dc.language.isoenen_US
dc.publisherJournal of Cybersecurityen_US
dc.relation.ispartofseriesVol. 11 Issue 1;tyaf030-
dc.subjectAraBERTen_US
dc.subjectcyberbullying detectionen_US
dc.subjectdeep learningen_US
dc.subjectnatural language processingen_US
dc.subjectneural network layers freezingen_US
dc.subjectsocial media platformsen_US
dc.subjectpre-trained language modelsen_US
dc.titleAraBERT for Arabic cyberbullying detection in Facebook commentsen_US
Appears in Collections:Faculty & Staff Scientific Research publications

Files in This Item:
File Description SizeFormat 
tyaf030.pdf1.29 MBAdobe PDFView/Open
Show simple item record


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Admin Tools