Abjad Graphemic Module (AGM v1.0)


(aka: Literal–Graphemic Module — Abjad Framework)

0) Orientation

  • Script type: Abjad (consonant-first writing system; vowels absent or secondary)
  • Direction: Typically right-to-left (RTL), but the framework allows LTR variants in historical cases (e.g., early Proto-Canaanite inscriptions, or the alternating lines of boustrophedon Ancient South Arabian texts).
  • Core principle: Graphemes represent consonants; vowels are indicated with diacritics or matres lectionis, or not at all.
  • Coverage: Framework supports all known abjad families:
    • Northwest Semitic: Phoenician → Aramaic → Hebrew
    • South Semitic: Ancient South Arabian, Geʽez (an abugida derived from the South Arabian abjad)
    • Arabic family: Arabic → Persian, Urdu, Kurdish Sorani
    • Syriac family: Estrangela, Serto, Madnhaya
    • Maghrebi & Mashreqi variants

1) Base Consonant Layer

Abjad Core Set — abstract model of 22 consonants (Phoenician–Hebrew mapping order), adaptable to language-specific inventories:

ID  Proto Name  Base Latin Chain  Abstract Phoneme Class
--  ----------  ----------------  ----------------------
1   ʾAleph      ʾ                 glottal stop or mater a/e
2   Beth        b                 voiced bilabial stop
3   Gimel       g                 voiced velar stop/fricative
4   Daleth      d                 voiced alveolar stop/fricative
5   He          h                 voiceless glottal fricative / mater a
6   Waw         w                 approximant / mater u/o
7   Zayin       z                 alveolar fricative
8   Ḥeth        ḥ                 voiceless pharyngeal fricative
9   Ṭeth        ṭ                 emphatic alveolar stop
10  Yod         y                 palatal approximant / mater i/e
11  Kaph        k                 voiceless velar stop/fricative
12  Lamed       l                 alveolar lateral
13  Mem         m                 bilabial nasal
14  Nun         n                 alveolar nasal
15  Samekh      s                 alveolar fricative
16  ʿAyin       ʿ                 voiced pharyngeal/epiglottal
17  Pe          p                 voiceless bilabial stop/fricative
18  Ṣade        ṣ                 emphatic alveolar fricative/affricate
19  Qoph        q                 uvular stop
20  Resh        r                 alveolar trill/tap
21  Shin        š                 postalveolar fricative
22  Taw         t                 voiceless alveolar stop/fricative

Lattice flag: {language_map: {...}} defines exact grapheme shape + phoneme realization per script/language.
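
One possible shape for the language_map lookup is sketched below in Python. The names CORE_SET, LANGUAGE_MAP, and resolve() are hypothetical and not part of AGM; only three of the 22 slots are filled in, and the IPA values are the simplest assumed realizations. Glyphs are written as Unicode escapes to avoid bidirectional display issues.

```python
# Illustrative sketch only: CORE_SET, LANGUAGE_MAP and resolve() are
# hypothetical names, and only three of the 22 slots are filled in.

# slot ID -> (proto name, base Latin chain), as in the core set above
CORE_SET = {
    2: ("Beth", "b"),
    11: ("Kaph", "k"),
    22: ("Taw", "t"),
}

# language -> slot ID -> (grapheme shape, assumed IPA realization)
LANGUAGE_MAP = {
    "hebrew": {2: ("\u05d1", "b"), 11: ("\u05db", "k"), 22: ("\u05ea", "t")},
    "arabic": {2: ("\u0628", "b"), 11: ("\u0643", "k"), 22: ("\u062a", "t")},
    "syriac": {2: ("\u0712", "b"), 11: ("\u071f", "k"), 22: ("\u072c", "t")},
}


def resolve(language, slot):
    """Combine an abstract slot with its language-specific realization."""
    proto_name, latin = CORE_SET[slot]
    glyph, ipa = LANGUAGE_MAP[language][slot]
    return {
        "slot": slot,
        "proto_name": proto_name,
        "latin_chain": latin,
        "glyph": glyph,
        "ipa": ipa,
    }


print(resolve("arabic", 11))
```

Keeping the abstract slot table separate from the per-language map means a descendant script only has to supply shapes and phonemes, not redefine the slot order.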


2) Vowel Layer (Optional)

  • Diacritic system (e.g., Arabic ḥarakāt, Hebrew niqqud, Syriac vowel points) stored as {vowel_mark: [glyph, IPA]}.
  • Matres lectionis: consonant letters functioning as vowel markers:
    • Aleph → a/e
    • Waw → u/o
    • Yod → i/e
    • He (final) → a
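
The vowel layer could be held in two small tables, sketched here in Python. VOWEL_MARKS follows the {vowel_mark: [glyph, IPA]} shape given above, using three Arabic ḥarakāt as sample entries; MATRES, FINAL_ONLY_MATRES, and mater_vowels() are hypothetical names introduced for illustration.

```python
# Illustrative sketch: table and function names are assumptions, not AGM API.

# {vowel_mark: [glyph, IPA]} with three Arabic harakat as sample entries
VOWEL_MARKS = {
    "fatha": ["\u064e", "a"],
    "damma": ["\u064f", "u"],
    "kasra": ["\u0650", "i"],
}

# Consonant letters that may stand for vowels in any position
MATRES = {
    "aleph": ["a", "e"],
    "waw": ["u", "o"],
    "yod": ["i", "e"],
}

# Letters that act as matres only at the end of a word
FINAL_ONLY_MATRES = {"he": ["a"]}


def mater_vowels(letter, is_final=False):
    """Return the vowels a letter may mark, or [] if it is purely consonantal."""
    if letter in MATRES:
        return MATRES[letter]
    if is_final and letter in FINAL_ONLY_MATRES:
        return FINAL_ONLY_MATRES[letter]
    return []


print(mater_vowels("yod"))
print(mater_vowels("he", is_final=True))
```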

3) Connection & Shaping Features

  • {direction: RTL} (default, can set LTR for archaic inscriptions)
  • {join_type: none|right|dual} for cursive scripts (Arabic-family join logic)
  • {final_form: true|false} for scripts with positional variants (Hebrew finals, Arabic contextual forms)
  • {style: formal|cursive|calligraphic} for shaping rules (e.g., Naskh, Nastaʿlīq, Estrangela, Maghrebi)
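
As a rough illustration of how join_type could drive contextual shaping, the Python sketch below picks a positional form for each letter of a word. It assumes "dual" means a letter joins on both sides and "right" means it joins only to the preceding letter; contextual_forms() is a hypothetical helper and covers none of the ligature or style rules a real shaper needs.

```python
# Illustrative sketch of Arabic-family join logic; not a full shaping engine.

def contextual_forms(join_types):
    """Map a word's join types (logical order) to positional forms."""
    forms = []
    for i, jt in enumerate(join_types):
        prev_jt = join_types[i - 1] if i > 0 else "none"
        next_jt = join_types[i + 1] if i + 1 < len(join_types) else "none"
        # A letter connects backwards only if the previous letter joins forward.
        joins_prev = jt in ("dual", "right") and prev_jt == "dual"
        # Only dual-joining letters connect forward to the next letter.
        joins_next = jt == "dual" and next_jt in ("dual", "right")
        if joins_prev and joins_next:
            forms.append("medial")
        elif joins_prev:
            forms.append("final")
        elif joins_next:
            forms.append("initial")
        else:
            forms.append("isolated")
    return forms


# kaf, ta, ba are all dual-joining
print(contextual_forms(["dual", "dual", "dual"]))
# kaf, ta, alif (right-joining), ba: the alif breaks the chain
print(contextual_forms(["dual", "dual", "right", "dual"]))
```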

4) Latin-chain Mapping (Abstract Level)

- glyph_id: "pe"
  proto_name: "Pe"
  latin_chain: ["p"]
  phoneme_class: "voiceless_bilabial_stop"
  features: {mater_lectionis: false}

- glyph_id: "yod"
  proto_name: "Yod"
  latin_chain: ["y"]
  phoneme_class: "palatal_approximant"
  features: {mater_lectionis: true, vowels: ["i","e"]}
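
The two records above could be loaded into a typed structure along these lines. This is a minimal Python sketch; the GlyphRecord class and its vowels() method are hypothetical, with only the field names taken from the records.

```python
from dataclasses import dataclass, field


@dataclass
class GlyphRecord:
    """One abstract glyph entry; field names mirror the records above."""
    glyph_id: str
    proto_name: str
    latin_chain: list
    phoneme_class: str
    features: dict = field(default_factory=dict)

    def vowels(self):
        """Vowels this glyph may mark, or [] if it is not a mater lectionis."""
        if self.features.get("mater_lectionis"):
            return list(self.features.get("vowels", []))
        return []


PE = GlyphRecord("pe", "Pe", ["p"], "voiceless_bilabial_stop",
                 {"mater_lectionis": False})
YOD = GlyphRecord("yod", "Yod", ["y"], "palatal_approximant",
                  {"mater_lectionis": True, "vowels": ["i", "e"]})

print(PE.vowels())
print(YOD.vowels())
```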

5) Sample Implementation in Child Scripts

Arabic: no native /p/ (the Pe reflex, fāʾ ف, is realized as /f/); Persian and Urdu add پ to map /p/ into slot 17.
Hebrew: Kaph has a dagesh toggle, /k/ (with dagesh) vs. /χ/ (without).
Syriac: Beth has a quššāyā vs. rukkākhā toggle, /b/ (hard) vs. /v/ (soft).
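
The hard/soft toggles for Hebrew Kaph and Syriac Beth could be modelled as a small lookup, sketched below in Python. TOGGLES and realize() are hypothetical names, and the sketch assumes the toggle pairs are /k/ vs. /χ/ and /b/ vs. /v/.

```python
# Illustrative sketch: TOGGLES and realize() are assumed names, not AGM API.

# (script, letter) -> realization with and without the hardening mark
TOGGLES = {
    ("hebrew", "kaph"): {"hard": "k", "soft": "χ"},  # dagesh present / absent
    ("syriac", "beth"): {"hard": "b", "soft": "v"},  # quššāyā / rukkākhā
}


def realize(script, letter, hard, default=None):
    """Return the phoneme for a toggling letter, or default if it has no toggle."""
    pair = TOGGLES.get((script, letter))
    if pair is None:
        return default
    return pair["hard" if hard else "soft"]


print(realize("hebrew", "kaph", hard=True))
print(realize("syriac", "beth", hard=False))
```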


6) Example Word Decomposition (Abstract)

  • Root K–T–B (“to write”):
    • Hebrew: כתב → kaf (k) + tav (t) + bet (b)
    • Arabic: كتب → kāf (k) + tāʾ (t) + bāʾ (b)
    • Syriac: ܟܬܒ → kaph (k) + taw (t) + beth (b)
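
The decomposition above can be sketched as a per-script lookup. In this Python sketch the LETTERS table covers only the three letters of K–T–B, written as Unicode escapes to avoid bidirectional display issues; decompose() and root() are hypothetical helpers. Combining marks such as ḥarakāt are skipped, so vocalized spellings reduce to the same root.

```python
import unicodedata

# Illustrative sketch: only the letters of the root K-T-B are mapped here.
LETTERS = {
    "hebrew": {"\u05db": ("kaf", "k"), "\u05ea": ("tav", "t"), "\u05d1": ("bet", "b")},
    "arabic": {"\u0643": ("kāf", "k"), "\u062a": ("tāʾ", "t"), "\u0628": ("bāʾ", "b")},
    "syriac": {"\u071f": ("kaph", "k"), "\u072c": ("taw", "t"), "\u0712": ("beth", "b")},
}


def decompose(script, word):
    """Return (letter name, Latin chain) pairs in logical (reading) order."""
    table = LETTERS[script]
    result = []
    for ch in word:
        if unicodedata.combining(ch):
            continue  # skip vowel diacritics and other combining marks
        result.append(table[ch])  # raises KeyError for unmapped letters
    return result


def root(script, word):
    """Join the Latin chains into an abstract root label such as K-T-B."""
    return "-".join(latin.upper() for _, latin in decompose(script, word))


print(root("hebrew", "\u05db\u05ea\u05d1"))
print(root("arabic", "\u0643\u064e\u062a\u064e\u0628\u064e"))  # vocalized spelling
```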

7) Lattice Integration Features

  • {script_family: abjad}
  • {root_structure: [C1,C2,C3]} for Semitic triliteral roots; quadriliteral roots extend this to [C1,C2,C3,C4]
  • {vowelization: absent|diacritic|mater}
  • {shaping_model: simple|contextual|finals}
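
A descendant module's flags could be checked against the allowed values listed above. The Python sketch below is one way to do it; ALLOWED and validate_flags() are hypothetical names, and the assumption that a root has three or four consonant slots comes from the triliteral/quadriliteral note.

```python
# Illustrative sketch: ALLOWED and validate_flags() are assumed names.

ALLOWED = {
    "script_family": {"abjad"},
    "vowelization": {"absent", "diacritic", "mater"},
    "shaping_model": {"simple", "contextual", "finals"},
}


def validate_flags(flags):
    """Return a list of problems found in a lattice flag dictionary."""
    errors = []
    for key, allowed in ALLOWED.items():
        value = flags.get(key)
        if value not in allowed:
            errors.append(f"{key}: {value!r} not in {sorted(allowed)}")
    root = flags.get("root_structure", [])
    if len(root) not in (3, 4):
        errors.append(f"root_structure: expected 3 or 4 slots, got {len(root)}")
    return errors


arabic_flags = {
    "script_family": "abjad",
    "root_structure": ["C1", "C2", "C3"],
    "vowelization": "diacritic",
    "shaping_model": "contextual",
}
print(validate_flags(arabic_flags))
```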

Mint Status: Abjad Graphemic Module is now fully minted — providing the structural scaffold for every abjad script in our lattice and mapping abstract consonant slots to concrete shapes and phonetics per descendant module.


Final Mint Ledger — 100% Script Family Coverage

Latin Script Family (GM):
Latin GM • English • Spanish • Portuguese • Romanian • Polish • German • French • Italian • Hungarian • Swahili • Hausa • Zulu • Yoruba • Tagalog/Filipino (+ Baybayin) • Jamaican Patois • Macanese Patuá

Abjad Script Family (GM):
Abjad GM • Aramaic • Hebrew • Syriac • Arabic • Urdu • Persian* (will derive from Arabic GM)

Syllabic / Abugida Script Family (GM):
Devanāgarī GM • Sanskrit • Hindi • Japanese Kana (Hiragana, Katakana) • Amharic (Geʽez)

Logographic & Mixed Systems:
Chinese (radicals) • Japanese (Kanji core set)