Languages hub

AI Must Speak Africa

The future of AI in Africa cannot happen only in English, French, Arabic, or Portuguese. This hub collects African language AI resources: datasets, models, translation, speech, and benchmarks.

Featured languages

Swahili, Yoruba, Igbo, Hausa, Amharic, Zulu, Xhosa, Shona, Kinyarwanda, Wolof, Twi, Ga, Ewe, Somali, Lingala, Oromo, Bambara, Tigrinya, Fulfulde, Setswana, Sesotho, Luganda, Kikuyu, Pulaar, Tamazight

Datasets, models, and tools

Named Entity Recognition

MasakhaNER

Coverage: 20+ languages

Speech & Translation

Vulavula API

Coverage: 11 languages

Speech-to-Text dataset

AfriSpeech-200

Coverage: 200+ accents

Language resource directory

Lanfrica

Coverage: All languages

Open speech corpus

Common Voice Africa

Coverage: 30+ languages

Multilingual LM

AfriBERTa

Coverage: 11 languages

Benchmark leaderboard

Model            Task             Score
AfroLM-large     MasakhaNER       84.2 F1
Vulavula-MT      EN→Zulu BLEU     31.7
AfriBERTa-large  Topic class.     82.4
InkubaLM         Multilingual QA  71.5