Spanish Graphemic Module (LGM v1.0)


Spanish Alphabet (27 letters)

Latin script, A–Z plus Ñ.
K and W appear only in loanwords (mainly foreign names and technical terms).


Vowels (5) — no length distinction, but stress is marked orthographically.

GlyphLatin ChainPhoneme (IPA)Notes
Aa/a/open front vowel
Ee/e/close-mid front vowel
Ii/i/close front vowel
Oo/o/close-mid back vowel
Uu/u/close back vowel

Consonants (22 native letters + Ñ)

GlyphLatin ChainPhoneme (IPA)Notes
Bb/b ~ β/[b] after pause or nasal; [β] intervocalic
Cc/k/ before a, o, u; /θ/ (Castilian) or /s/ (Latin American) before e, i
Dd/d ~ ð/[d] after pause or nasal/lateral; [ð] intervocalic
Ff/f/
Gg/ɡ ~ ɣ/ before a, o, u; /x/ before e, i; [ɣ] intervocalic
Hhsilentmarks orthographic position
Jj/x/velar/uvular fricative; stronger in Spain
Kk/k/loans only
Ll/l/
Mm/m/
Nn/n/assimilates to place of articulation before consonants
Ñn + tilde/ɲ/palatal nasal
Pp/p/
Qq/k/always with u (qu) before e, i; u silent unless with dieresis
Rr/r/trilled r in initial position or rr medial
RRr + r/r/trilled; written only between vowels
Ss/s/
Tt/t/
Vv/b ~ β/identical to b in pronunciation in most dialects
Ww/w/loans only
Xx/ks/ in most words; /x/ in some Mexican toponyms (México)
Yy/ʝ ~ ʒ/varies by dialect; in some regions merged with ll
Zz/θ/ (Castilian) or /s/ (Latin American)

Diacritics and Orthographic Marks

  • Acute accent (´) — marks stressed syllable when it breaks normal stress rules (camión, sofá).
  • Diaeresis (¨) over U in gue/gui — indicates U is pronounced (vergüenza /berˈɡwensa/).

Key Digraphs (phoneme units)

  • Ch → /t͡ʃ/ (considered a separate letter historically)
  • Ll → /ʎ/ (traditional) or /ʝ/ (yeísmo; modern common)
  • Qu → /k/ before e, i (u silent unless diaeresis)
  • Gu → /ɡ/ before e, i (u silent unless diaeresis)
  • Rr → /r/ trill (only between vowels)

System Integration Notes

  • Spanish orthography is largely phonemic, with predictable consonant/vowel correspondence.
  • Variations like /θ/ vs /s/ (distinción vs seseo) and /ʎ/ vs /ʝ/ (lleísmo vs yeísmo) are dialectal features — in our lattice, these get feature flags for dialect.
  • B/V merger means these letters map to the same phoneme node in most dialect modules.