Hausa Graphemic Module (LGM v1.0)


Hausa has two main orthographic systems:

  • Boko (Latin-based) — the modern official writing system in Nigeria and Niger.
  • Ajami (Arabic-based) — still used in religious, cultural, and historical contexts.

We’ll focus on Boko for the base alphabet system, then note the Ajami equivalences.


Boko Alphabet (Latin-based)

Hausa Boko has 23 core letters plus 4 special consonants with hooks.

GlyphLatin ChainPhoneme (IPA)Notes
Aa/a/
Bb/b/
Ɓb + hook/ɓ/voiced bilabial implosive
Cc/t͡ʃ/“ch” sound
Dd/d/
Ɗd + hook/ɗ/voiced alveolar implosive
Ee/e/close-mid front vowel
Ff/f/
Gg/g/
Hh/h/
Ii/i/
Jj/d͡ʒ/
Kk/k/
Ƙk + hook/k’/ejective velar stop
Ll/l/
Mm/m/
Nn/n/
Oo/o/close-mid back vowel
Rr/r/trill or tap
Ss/s/
Shs + h/ʃ/digraph, single phoneme
Tt/t/
Uu/u/
Ww/w/
Yy/j/palatal approximant
Zz/z/

Multi-letter graphemes (phoneme units)

  • Sh → /ʃ/
  • Ts → /t͡s/
  • Kw → /kw/ (labialized velar)
  • Gw → /gw/ (labialized velar)
  • Ky → /c/ (palatalized k)
  • Gy → /ɟ/ (palatalized g)

We treat these as single phoneme clusters with phoneme_unit=true in LGM.


Ajami (Arabic-based) Note

In Ajami, Hausa uses standard Arabic letters plus extra dots to represent Hausa sounds not in Arabic:

  • implosives (ɓ, ɗ) get modified ba/dal shapes
  • ejective ƙ gets modified qaf
  • /t͡ʃ/ uses modified jim
    For our lattice, these are mapped 1:1 to their Boko equivalents.

Why Hausa is critical for the lattice

  • Shows how non-English Latin adaptations handle implosives, ejectives, and complex consonants.
  • Gives a bridge between Latin and Arabic scripts in African languages.
  • Prepares the framework for any click consonant systems (e.g., Zulu) or tone systems (e.g., Yoruba).