Arabic Graphemic Module (AGM v1.0)


(aka: Literal–Graphemic Module — Arabic Script)

0) Orientation

  • Script type: Abjad (primarily consonants; vowels indicated with optional diacritics)
  • Direction: Right-to-left
  • Shaping: Cursive, context-dependent forms (isolated, initial, medial, final)
  • Variants: Standard Arabic script covers Classical Arabic, Modern Standard Arabic, and many regional adaptations
  • Special notes: Certain letters do not connect to the following letter
  • Numeric layer: Arabic-Indic digits (٠١٢٣٤٥٦٧٨٩)

1) Base Letter Inventory (28 primary letters)

#Name (Latin chain)GlyphIPAConnection behavior
01Alifا/ʔ/ or long /aː/non-connecting after
02Bāʾب/b/connects both sides
03Tāʾت/t/connects both sides
04Thāʾث/θ/connects both sides
05Jīmج/d͡ʒ//ʒ//ɡ/connects both sides
06Ḥāʾح/ħ/connects both sides
07Khāʾخ/x/connects both sides
08Dālد/d/connects right only
09Dhālذ/ð/connects right only
10Rāʾر/r/connects right only
11Zāyز/z/connects right only
12Sīnس/s/connects both sides
13Shīnش/ʃ/connects both sides
14Ṣādص/sˤ/connects both sides
15Ḍādض/dˤ/connects both sides
16Ṭāʾط/tˤ/connects both sides
17Ẓāʾظ/ðˤ/connects both sides
18ʿAynع/ʕ/connects both sides
19Ghaynغ/ɣ/connects both sides
20Fāʾف/f/connects both sides
21Qāfق/q/connects both sides
22Kāfك/k/connects both sides
23Lāmل/l/connects both sides
24Mīmم/m/connects both sides
25Nūnن/n/connects both sides
26Hāʾهـ/h/connects both sides
27Wāwو/w/ or long /uː/connects right only
28Yāʾي/j/ or long /iː/connects both sides

2) Vowel Diacritics (Ḥarakāt)

  • Fatha (◌َ) → /a/ short
  • Kasra (◌ِ) → /i/ short
  • Ḍamma (◌ُ) → /u/ short
  • Sukun (◌ْ) → no vowel (consonant closure)
  • Shadda (◌ّ) → gemination (consonant doubling)
  • Tanwin (Nunation): fatḥatān /an/, kasratān /in/, ḍammatān /un/

3) Special Signs

  • Alif maddah (آ) → glottal stop + long /aː/
  • Hamza (ء): glottal stop carrier, written on alif, waw, ya, or alone
  • Alif maqṣūrah (ى): long /aː/ at word end (written like dotless ya)
  • Ta marbuta (ة): feminine ending /-a/ or /-ah/

4) Connection Logic (Joining Types)

  • Dual-joiners: connect both left and right
  • Right-joiners: connect to previous letter only (ا, د, ذ, ر, ز, و)
  • Non-joiners: N/A in Arabic; all letters connect to the right

5) Latin-chain Mapping Example

glyph: "ث"
name: "Thāʾ"
latin_chain: ["th"]
phoneme: "θ"
glyph: "ض"
name: "Ḍād"
latin_chain: ["ḍ"]
phoneme: "dˤ"

6) Lattice Integration Features

  • {direction: RTL}
  • {type: abjad}
  • {join_type: dual|right}
  • {mater_lectionis: true|false}
  • {gemination: true|false} via shadda
  • {vowel_diacritic: fatha|kasra|damma|none}

7) Example Word Decomposition

  • كتاب (“kitāb” /kiˈtaːb/) → ك /k/ + ت /t/ + ا (mater /aː/) + ب /b/
  • مدرسة (“madrasa” /ˈmad.ra.sa/) → م /m/ + د /d/ + ر /r/ + س /s/ + ة (/a/)

Mint Status: Arabic Graphemic Module now fully minted, with consonant inventory, diacritics, connection rules, and Latin-chain mapping for the lattice.