Research & Papers

Using Machine Learning to Enhance the Detection of Obfuscated Abusive Words in Swahili: A Focus on Child Safety

A new AI model is tackling cyberbullying for over 100 million Swahili speakers.

Deep Dive

Researchers have developed machine learning models to detect obfuscated abusive language in Swahili, a low-resource language with over 100 million speakers, focusing on child safety. Using SVM, Logistic Regression, and Decision Trees optimized with SMOTE, the study addresses a critical gap in online safety for East Africa. While promising, the models' generalizability is limited by small, imbalanced datasets. The work advocates for expanded data and advanced techniques to improve cyberbullying detection systems.

Why It Matters

This directly impacts online safety for a massive, underserved population, setting a precedent for protecting children in low-resource language communities.