Detecting Hate Speech and Offensive Language using Machine Learning in Published Online Content

Sinyangwe Clement Mulenga; Douglas Kunda; William Abwino Phiri

Detecting Hate Speech and Offensive Language using Machine Learning in Published Online Content

dc.contributor.author	Sinyangwe Clement Mulenga
dc.contributor.author	Douglas Kunda
dc.contributor.author	William Abwino Phiri
dc.date.accessioned	2025-12-01T15:01:32Z
dc.date.issued	2023
dc.description	RESEARCH PAPERS AND JOURNAL ARTICLES
dc.description.abstract	Businesses are more concerned than ever about hate speech content as most brand communication and advertising move online. Different organisations may be incharge of their products and services but they do not have complete control over their content posted online via their website and social media channels, they have no control over what online users post or comment about their brand. As a result, it became imperative in our study to develop a model that will identify hate speechand, offensive language and detect cyber offence in online published content using machine learning. This study employed an experimental design to develop a detection model for determining which agile methodologies were preferred as a suitable development methodology. Deep learning and HateSonar was used to detect hate speech and offensive language in posted content. This study used data from Twitter and Facebook to detect hate speech. The text was classified as either hate speech, offensive language, or both. During the reconnaissance phase, the combined data (structured and unstructured) was obtained from kaggle.com. The combined data was stored in the database as raw data. This revealed that hate speech and offensive language exist everywhere in the world, and the trend of the vices is on the rise. Using machine learning, the researchers successfully developed a model for detecting offensive language and hate speech on online social media platforms. The labelling in the model makes it simple to categorise data in a meaningful and readable manner. The study establishes that in fore model to detect hate speech and offensive language on online social media platforms, the data set must be categorised and presented in statistical form after running the model; the count indicates the total number of data sets imported. The mean for each category, as well as the standard deviation and the minimum and maximum number of tweets in each category, are also displayed. The study established that preventing online platform abuse in Zambia requires a comprehensive approach that involves government law, responsible platform policies and practices, as well as individual responsibility and accountability. In accordance with this goal, the research was effective in developing the detection model. To guarantee that the model was completely functional, it was trained on the English dataset before being applied to the local language dataset. This was because of the fact that training deep learning models with local datasets can present a number of challenges, such as limited, biased data, data privacy, resource requirements, and model maintenance. However, the efficacy of these systems varies, and there have been concerns raised about the inherent biases and limitations of automatic moderation techniques. The study recommends that future studies consider other sources of information such as Facebook, WhatsApp, Instagram, and other social media platforms, as well as consider harvesting local data sets for training machines rather than relying on foreign data sets; the local data set can then be used to detect offences targeting Zambian citizens on local platforms.
dc.description.sponsorship	ZCAS UNIVERSITY
dc.identifier.citation	HARVARD REFRENCING
dc.identifier.uri	http://dspace.zcas.edu.zm/handle/123456789/109
dc.language.iso	en_US
dc.publisher	ZAMBIA INFORMATION COMMUNICATION TECHNOLOGY (ICT) JOURNAL
dc.subject	Hate Speech
dc.subject	Offensive Language
dc.subject	Online Content
dc.subject	Machine Learning
dc.title	Detecting Hate Speech and Offensive Language using Machine Learning in Published Online Content
dc.type	Article

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Detecting Hate Speech and Offensive Language using Machine.pdf
Size:: 402.3 KB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed to upon submission
Description:

Download

Collections

Research Papers and Journal Articles