German Graphemic Module


(Literal–Graphemic Module — German)

Script Base

  • Script: Latin (A–Z)
  • Orthography type: Moderately phonemic, with predictable grapheme–phoneme rules but some irregularities in loanwords.
  • Locale: de (supports Austria, Switzerland, Germany variants)

1) Base Alphabet (26 letters)

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z


2) Additional Graphemes / Diacritics

  • Ä ä → A + diaeresis
  • Ö ö → O + diaeresis
  • Ü ü → U + diaeresis
  • ẞ ß → sharp s (eszett); uppercase form officially recognized, also legacy SS in all-caps.

3) Vowels

GlyphLatin ChainPhoneme (IPA)Notes
Aa/a/ ~ /aː/length distinguished by context
Äa + ¨/ɛ/ ~ /ɛː/
Ee/ɛ/ ~ /eː/
Ii/ɪ/ ~ /iː/
Oo/ɔ/ ~ /oː/
Öo + ¨/œ/ ~ /øː/
Uu/ʊ/ ~ /uː/
Üu + ¨/ʏ/ ~ /yː/
Yy/y/ or /i/ in loans
Vowel lengthdoubled vowels (ee), h after vowel, open syllablefeature flag {length: long}

4) Consonants

GlyphLatin ChainPhoneme (IPA)Notes
Bb/b/ → /p/ final devoicing
Cc/k/ before a, o, u; /t͡s/ in loans from Latin/Greek
CHc + h/x/ after a, o, u; /ç/ after e, i, ä, ö, ü
Dd/d/ → /t/ final devoicing
Ff/f/
Gg/g/ → /k/ final devoicing; /ç/ in -ig endings (northern)
Hh/h/ or silent length marker
Jj/j/
Kk/k/
Ll/l/
Mm/m/
Nn/n/
NGn + g/ŋ/velar nasal
Pp/p/
Qq + u/kv/always qu sequence
Rr/ʁ/ uvular fricative (standard); /r/ trill in some dialects; vowel coloring in coda
Ss/z/ word-initial before vowel; /s/ elsewhere
ßß/s/ long vowel or diphthong preceding
Tt/t/
Vv/f/ in native words; /v/ in loans
Ww/v/
Xx/ks/
Zz/t͡s/

5) Digraphs & Trigraphs

  • Sch → /ʃ/
  • Sp, St → /ʃp/, /ʃt/ word-initial
  • Pf → /pf/ (affricate)
  • Ph → /f/ (mostly in Greek loans)
  • Th → /t/ (loans)
  • Eu, Äu → /ɔʏ/ diphthong
  • Ei, Ai, Ey, Ay → /aɪ/ diphthong
  • Au → /aʊ/ diphthong

6) Orthographic Rules

  • Final devoicing: b/d/g → p/t/k in coda
  • Capitalization: All nouns capitalized; ẞ uppercases to ẞ (preferred) or SS (legacy)
  • Length marking: vowel + h (Bahn), doubled vowels (Meer), or open syllable
  • Diphthong spelling: fixed grapheme sequences, phonemic value consistent

7) Locale Casing & Normalization

  • Preserve as distinct grapheme; fold to ss only when locale rules demand legacy form.
  • Diaeresis letters Ä/Ö/Ü are single graphemes, not base+modifier for collation purposes in de.
  • NFC/NFD round-trip safe.

8) Engine Feature Flags

  • {length: long|short} for vowels
  • {voice: voiced|voiceless} auto-updated by final devoicing rules
  • {allophone: [ç|x]} for CH
  • {capitalization: noun} for grammar-aware modules

Mint Status: German Graphemic Module now fully defined and integrated with the Latin Graphemic Module. Ready for export into the lattice.