AraBERT for Arabic cyberbullying detection in Facebook comments

Hamarsheh, Mohammad M N$AAUP$Palestinian; Hithnawi, Rania Ibrahim$AAUP$Palestinian; Maree, MOhammed$AAUP$Palestinian

Please use this identifier to cite or link to this item: http://repository.aaup.edu/jspui/handle/123456789/3704

Full metadata record

DC Field	Value	Language
dc.contributor.author	Hamarsheh, Mohammad M N$AAUP$Palestinian	-
dc.contributor.author	Hithnawi, Rania Ibrahim$AAUP$Palestinian	-
dc.contributor.author	Maree, MOhammed$AAUP$Palestinian	-
dc.date.accessioned	2025-11-30T18:34:17Z	-
dc.date.available	2025-11-30T18:34:17Z	-
dc.date.issued	2025-11-04	-
dc.identifier.citation	R. I. Hithnawi, M. M. N. Hamarsheh and M. Maree, "AraBERT for Arabic cyberbullying detection in Facebook comments", Journal of Cybersecurity 2025 Vol. 11 Issue 1 Pages tyaf030 DOI: 10.1093/cybsec/tyaf030	en_US
dc.identifier.uri	http://repository.aaup.edu/jspui/handle/123456789/3704	-
dc.description.abstract	Cyberbullying is a significant issue on social media platforms. It poses serious emotional consequences and harassment to victims. Conventional pre-trained language models, such as Bidirectional Encoder Representations from Transformers (BERT), have achieved significant success in detecting cyberbullying through the analysis of natural language texts, especially with resource-rich languages such as English. However, for low-resource languages, such as Arabic, there has been limited attention given to the detection of cyberbullying. This research investigates the effectiveness of Arabic BERT (AraBERT), a pre-trained language model, for detecting Arabic cyberbullying comments. It also explores the trade-off between computational resources and model performance through various fine-tuning and freezing strategies. From an initial pool of >40 000 collected comments, we constructed a high-quality, balanced dataset of 20 000 Facebook comments written in Arabic. This subset was then manually labeled as either bullying or non-bullying to ensure data reliability and to facilitate robust model training. We employed fine-tuning techniques to adapt AraBERTv2 to the cyberbullying detection task. Through experimentation with layer freezing technique, we explored the trade-off between leveraging pre-trained knowledge and adapting the model to the specific task. Our findings demonstrate that fine-tuning all layers of AraBERTv2, which involves adjusting the weights and biases of each layer during training, achieved the highest performance. This approach offers a flexible method for applying a pre-trained model to new problems, resulting in an accuracy of 91.9% and an F1 score of 92.8%.	en_US
dc.description.sponsorship	None	en_US
dc.language.iso	en	en_US
dc.publisher	Journal of Cybersecurity	en_US
dc.relation.ispartofseries	Vol. 11 Issue 1;tyaf030	-
dc.subject	AraBERT	en_US
dc.subject	cyberbullying detection	en_US
dc.subject	deep learning	en_US
dc.subject	natural language processing	en_US
dc.subject	neural network layers freezing	en_US
dc.subject	social media platforms	en_US
dc.subject	pre-trained language models	en_US
dc.title	AraBERT for Arabic cyberbullying detection in Facebook comments	en_US
Appears in Collections:	Faculty & Staff Scientific Research publications

Files in This Item:

File	Description	Size	Format
tyaf030.pdf		1.29 MB	Adobe PDF	View/Open

Show simple item record

Admin Tools

ARAB AMERICAN UNIVERSITY Repository