Please use this identifier to cite or link to this item:
http://repository.aaup.edu/jspui/handle/123456789/3704| Title: | AraBERT for Arabic cyberbullying detection in Facebook comments |
| Authors: | Hamarsheh, Mohammad M N$AAUP$Palestinian Hithnawi, Rania Ibrahim$AAUP$Palestinian Maree, MOhammed$AAUP$Palestinian |
| Keywords: | AraBERT cyberbullying detection deep learning natural language processing neural network layers freezing social media platforms pre-trained language models |
| Issue Date: | 4-Nov-2025 |
| Publisher: | Journal of Cybersecurity |
| Citation: | R. I. Hithnawi, M. M. N. Hamarsheh and M. Maree, "AraBERT for Arabic cyberbullying detection in Facebook comments", Journal of Cybersecurity 2025 Vol. 11 Issue 1 Pages tyaf030 DOI: 10.1093/cybsec/tyaf030 |
| Series/Report no.: | Vol. 11 Issue 1;tyaf030 |
| Abstract: | Cyberbullying is a significant issue on social media platforms. It poses serious emotional consequences and harassment to victims. Conventional pre-trained language models, such as Bidirectional Encoder Representations from Transformers (BERT), have achieved significant success in detecting cyberbullying through the analysis of natural language texts, especially with resource-rich languages such as English. However, for low-resource languages, such as Arabic, there has been limited attention given to the detection of cyberbullying. This research investigates the effectiveness of Arabic BERT (AraBERT), a pre-trained language model, for detecting Arabic cyberbullying comments. It also explores the trade-off between computational resources and model performance through various fine-tuning and freezing strategies. From an initial pool of >40 000 collected comments, we constructed a high-quality, balanced dataset of 20 000 Facebook comments written in Arabic. This subset was then manually labeled as either bullying or non-bullying to ensure data reliability and to facilitate robust model training. We employed fine-tuning techniques to adapt AraBERTv2 to the cyberbullying detection task. Through experimentation with layer freezing technique, we explored the trade-off between leveraging pre-trained knowledge and adapting the model to the specific task. Our findings demonstrate that fine-tuning all layers of AraBERTv2, which involves adjusting the weights and biases of each layer during training, achieved the highest performance. This approach offers a flexible method for applying a pre-trained model to new problems, resulting in an accuracy of 91.9% and an F1 score of 92.8%. |
| URI: | http://repository.aaup.edu/jspui/handle/123456789/3704 |
| Appears in Collections: | Faculty & Staff Scientific Research publications |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| tyaf030.pdf | 1.29 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
Admin Tools