French Graphemic Module (FGM v1.0)


(aka: Literal–Graphemic Module — French Script)

0) Orientation

  • Script: Latin (A–Z + diacritics + ligatures)
  • Orthography type: Deep (non-phonemic), many historical spellings retained
  • Direction: Left-to-right
  • Diacritics: Acute, grave, circumflex, diaeresis, cedilla
  • Ligatures: Æ, Œ (treated as single graphemes in legacy orthography, mostly in formal/loan words)

1) Base Alphabet (26 letters)

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z


2) Diacritics & Modified Letters

Letter  Diacritic    Glyph  Usage / IPA effect
E       acute        É é    /e/ (close-mid front vowel)
E       grave        È è    /ɛ/ (open-mid front vowel)
E       circumflex   Ê ê    /ɛ/ or /e/ (historical length)
E       diaeresis    Ë ë    vowel separation (Noël)
A       grave        À à    distinguishes homophones (à = preposition, a = "has")
A       circumflex   Â â    /ɑ/ in formal speech; /a/ in modern French
I       circumflex   Î î    /i/ (historical length)
I       diaeresis    Ï ï    vowel separation (naïf)
O       circumflex   Ô ô    /o/
O       diaeresis    Ö ö    loanwords and foreign names only
U       grave        Ù ù    distinguishes homophones (où = where, ou = or)
U       circumflex   Û û    /y/ (historical length)
U       diaeresis    Ü ü    /y/ (loanwords, separation)
C       cedilla      Ç ç    /s/ before a, o, u
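
The letter + diacritic pairs in the table above line up with Unicode canonical decomposition (NFD), so a precomposed glyph can be split back into its base letter and mark. The following is a minimal sketch, not part of the module itself; the function name `split_diacritic` and the `COMBINING` table are illustrative assumptions.

```python
import unicodedata
from typing import Optional, Tuple

# Combining marks for the five French diacritics listed in section 0.
# (Hypothetical lookup table, introduced here for illustration only.)
COMBINING = {
    "\u0301": "acute",
    "\u0300": "grave",
    "\u0302": "circumflex",
    "\u0308": "diaeresis",
    "\u0327": "cedilla",
}

def split_diacritic(ch: str) -> Tuple[str, Optional[str]]:
    """Return (base letter, diacritic name or None) for a single character."""
    decomposed = unicodedata.normalize("NFD", ch)
    base = decomposed[0]
    # Keep only the marks this module knows about.
    marks = [COMBINING[c] for c in decomposed[1:] if c in COMBINING]
    return base, (marks[0] if marks else None)

for ch in "éèêëàïùç":
    print(ch, split_diacritic(ch))
```

Plain letters come back with `None` as the diacritic. The ligatures Æ and Œ have no canonical decomposition, so they pass through unchanged and would need separate handling.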

3) Vowel Inventory & Core Grapheme Mappings

Grapheme                    IPA           Notes
A                           /a/
Â                           /ɑ/ or /a/    formal vs modern
E                           /ə/           unstressed; silent in final position unless in monosyllables
É                           /e/
È, Ê                        /ɛ/
I, Î, Ï                     /i/
O                           /o/ or /ɔ/    depending on context
Ô                           /o/
U, Û, Ü                     /y/
Y                           /i/ or /j/    also part of diphthongs
AU, EAU                     /o/
OU                          /u/
EU, ŒU                      /ø/ or /œ/    depending on openness
AN, AM, EN, EM              /ɑ̃/           nasal
IN, IM, EIN, AIN, AIM, EIM  /ɛ̃/           nasal
ON, OM                      /ɔ̃/           nasal
UN, UM                      /œ̃/           nasal
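
Because several vowel graphemes span two or three letters (EAU, AU, OU, EU, ŒU), a reader of this table has to segment a word before looking anything up. One common approach is greedy longest-match segmentation, sketched below. The names `VOWEL_MULTIGRAPHS` and `segment` are assumptions made for this example, and the nasal sequences are deliberately left out because they depend on the following letter.

```python
from typing import List

# Oral vowel multigraphs from the table above, longest first so that
# "eau" wins over "au". (Illustrative list, lowercased.)
VOWEL_MULTIGRAPHS = sorted(["eau", "œu", "au", "ou", "eu"], key=len, reverse=True)

def segment(word: str) -> List[str]:
    """Split a word into graphemes using greedy longest-match."""
    w = word.lower()
    out = []
    i = 0
    while i < len(w):
        for g in VOWEL_MULTIGRAPHS:
            if w.startswith(g, i):
                out.append(g)
                i += len(g)
                break
        else:
            # No multigraph starts here: emit a single letter.
            out.append(w[i])
            i += 1
    return out

print(segment("beaucoup"))  # ['b', 'eau', 'c', 'ou', 'p']
print(segment("cœur"))      # ['c', 'œu', 'r']
```

This is a simplification: it ignores morpheme boundaries and lexical exceptions, so it should be read as a starting point rather than a complete segmenter.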

4) Consonant Inventory & Core Grapheme Mappings

Grapheme  IPA                      Notes
B         /b/
C         /k/ or /s/               /k/ before a, o, u; /s/ before e, i, y
Ç         /s/                      before a, o, u
CH        /ʃ/
D         /d/                      often silent at word end unless liaison
F         /f/
G         /ɡ/ or /ʒ/               /ɡ/ before a, o, u; /ʒ/ before e, i, y
GN        /ɲ/
H         (silent)                 aspirated h prevents liaison
J         /ʒ/
K         /k/                      loanwords
L         /l/
M         /m/                      nasalizes preceding vowel before b, p
N         /n/                      nasalizes preceding vowel before consonant
P         /p/                      often silent final unless liaison
PH        /f/
Q(U)      /k/                      qu = /k/, or /kw/ in some loans
R         /ʁ/                      uvular fricative
S         /s/ or /z/               /z/ between vowels
T         /t/                      often silent final
TH        /t/                      Greek loans
V         /v/
W         /v/ or /w/               in loans
X         /ks/, /ɡz/, /s/, /z/     /s/ in names; /z/ in liaison
Z         /z/                      /s/ in some final positions
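
The C, Ç, and G rows describe a context rule: the phoneme depends on the letter that follows. A minimal sketch of that rule is given below. The function name `c_or_g_phoneme` is hypothetical, and treating the accented variants of e and i (é, è, ê, ë, î, ï) as front vowels is an assumption that the table does not state explicitly.

```python
# Letters that trigger the "soft" reading of c and g.
# Accented forms are an assumption added for this sketch.
FRONT_VOWELS = set("eiyéèêëîï")

SOFT_G = "\u0292"  # ʒ
HARD_G = "\u0261"  # ɡ (IPA script g, as used in the table)

def c_or_g_phoneme(letter: str, following: str) -> str:
    """Return the phoneme for c, ç, or g given the next letter ('' at word end)."""
    letter = letter.lower()
    following = following.lower()
    if letter == "ç":
        return "s"  # the cedilla forces /s/ regardless of context
    soft = following in FRONT_VOWELS
    if letter == "c":
        return "s" if soft else "k"
    if letter == "g":
        return SOFT_G if soft else HARD_G
    raise ValueError(f"unsupported letter: {letter!r}")

print(c_or_g_phoneme("c", "a"))  # k
print(c_or_g_phoneme("c", "e"))  # s
print(c_or_g_phoneme("ç", "a"))  # s
```

Digraphs such as CH, GN, and QU would need to be matched before this rule is applied, since they override the single-letter reading.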

5) Ligatures

  • Æ æ → /e/ or /ɛ/ in loans (cæcum, Æschyle)
  • Œ œ → /ø/ or /œ/ (œuf, cœur)

6) Orthographic Rules

  • Final consonants: many are silent unless liaison (grand ami → /ɡʁɑ̃.t‿ami/)
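  • Nasal vowels: written as vowel + m/n; the vowel is nasal and the m/n silent when a consonant or the word end follows (bon /bɔ̃/), but the vowel is oral and the m/n pronounced when a vowel or a second m/n follows (bonne /bɔn/)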
  • Diacritics: acute for /e/, grave for /ɛ/, circumflex for historical vowel length or lost consonant
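  • H aspiré: blocks liaison and elision (e.g., le héros /lə eʁo/, not l’héros; les héros /le eʁo/, not /lez‿eʁo/)
  • Elision: the final vowel of short function words such as le, la, je, ne, de, que is dropped and replaced by an apostrophe before a word beginning with a vowel or mute h (l’ami, l’homme)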
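
The nasal-vowel rule in the list above can be sketched as a small check on the letter after the m/n. This example assumes the usual reading of the rule: the vowel is nasal only when the m/n ends the word or is followed by a consonant other than m/n. The names `VOWELS` and `nasal_positions` are illustrative, and lexical exceptions (for example, silent verb endings) are not handled.

```python
from typing import List

# Vowel letters considered by this sketch (an assumption, not an exhaustive list).
VOWELS = set("aeiouyàâéèêëîïôùûüœ")
NASAL_CONSONANTS = set("mn")

def nasal_positions(word: str) -> List[int]:
    """Return indices of vowel letters directly followed by a nasalizing m/n."""
    w = word.lower()
    hits = []
    for i in range(len(w) - 1):
        if w[i] in VOWELS and w[i + 1] in NASAL_CONSONANTS:
            after = w[i + 2] if i + 2 < len(w) else ""
            # Nasal only if m/n is word-final or precedes a non-nasal consonant.
            if after == "" or (after not in VOWELS and after not in NASAL_CONSONANTS):
                hits.append(i)
    return hits

print(nasal_positions("bon"))      # [1]
print(nasal_positions("bonne"))    # []
print(nasal_positions("chanter"))  # [2]
```

For trigraphs such as AIN, the returned index points at the letter immediately before the m/n, not at the start of the grapheme.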

7) Latin Chain Mapping Example

- glyph: "Ç"
  name: "C-cedilla"
  latin_chain: ["c"]
  features: {cedilla: true}
  phoneme: "s"
  context: {following: ["a","o","u"]}

- glyph: "GN"
  name: "Gn"
  latin_chain: ["g","n"]
  phoneme: "ɲ"
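
To show how records like the two above might be held in code, here is a minimal sketch using a dataclass. The class name `GlyphMapping`, the `applies` method, and the choice to treat a missing context as "no restriction" are assumptions made for illustration; the module does not specify an in-memory representation.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

@dataclass
class GlyphMapping:
    """One glyph-to-phoneme record, mirroring the keys of the example."""
    glyph: str
    name: str
    latin_chain: List[str]
    phoneme: str
    features: Dict[str, bool] = field(default_factory=dict)
    following: Optional[List[str]] = None  # None = no context restriction (assumed)

    def applies(self, next_letter: str) -> bool:
        """True if this mapping is valid before the given letter."""
        return self.following is None or next_letter.lower() in self.following

C_CEDILLA = GlyphMapping(
    glyph="Ç", name="C-cedilla", latin_chain=["c"], phoneme="s",
    features={"cedilla": True}, following=["a", "o", "u"],
)
GN = GlyphMapping(glyph="GN", name="Gn", latin_chain=["g", "n"], phoneme="ɲ")

print(C_CEDILLA.applies("a"))  # True
print(C_CEDILLA.applies("e"))  # False
print(GN.applies("e"))         # True
```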

8) Lattice Integration Features

  • {direction: LTR}
  • {type: alphabetic}
  • {nasal_vowel: true} for nasal vowel + N/M sequences (AN, EN, IN, ON, UN and their M variants)
  • {liaison: required|blocked|optional}
  • {h_type: aspirated|mute}
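
The feature set above could be modelled with enumerations so that only the listed values are accepted. The sketch below is one possible encoding, not the module's own; `Liaison`, `HType`, `LatticeFeatures`, and `features_for_h_word` are hypothetical names. The only rule it applies is the one stated in section 6, that an aspirated h blocks liaison.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional

class Liaison(Enum):
    REQUIRED = "required"
    BLOCKED = "blocked"
    OPTIONAL = "optional"

class HType(Enum):
    ASPIRATED = "aspirated"
    MUTE = "mute"

@dataclass(frozen=True)
class LatticeFeatures:
    direction: str = "LTR"
    type: str = "alphabetic"
    nasal_vowel: bool = False
    liaison: Optional[Liaison] = None
    h_type: Optional[HType] = None

def features_for_h_word(h_type: HType,
                        default: Liaison = Liaison.OPTIONAL) -> LatticeFeatures:
    """Build features for an h-initial word; aspirated h forces liaison=blocked."""
    liaison = Liaison.BLOCKED if h_type is HType.ASPIRATED else default
    return LatticeFeatures(liaison=liaison, h_type=h_type)

print(features_for_h_word(HType.ASPIRATED).liaison)  # Liaison.BLOCKED
print(features_for_h_word(HType.MUTE).liaison)       # Liaison.OPTIONAL
```

Constructing an enum from an unlisted string, such as `Liaison("sometimes")`, raises `ValueError`, which gives basic validation of the feature values.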

Mint Status: French Graphemic Module is now fully minted with diacritics, nasalization rules, liaison handling, and digraph logic integrated into the Latin GM.