
Latin writing system: Difference between revisions

Move technical info to Unicode
(→‎Alphabet: simpler sentences, also mention w vy y)
=== Prefix marking ===
In addition, the underdot ({{t|ạ}}, U+0323) is used to mark the presence of a [[prefix]], more specifically the last in a run of prefixes if any are present. It may be replaced by the ASCII hyphen (-) in case the underdot isn’t available on your keyboard. While the underdot falls on the first vowel of the prefix [[raku]] (so where a tone mark would’ve gone), the hyphen should be placed between the last prefix and the word’s stem. For example, {{t|kı-}} + {{t|ne-}} + {{t|shı}} may be written as {{t|kınẹshı}} or {{t|kıne-shı}}; {{t|hao-}} + {{t|chuq}} = {{t|hạochuq}} or {{t|hao-chuq}}.
==== Tone–underdot combos ====
The new [[Delta]] orthography poses a slight challenge for fonts: there is no uniform set of precomposed tone+underdot characters to choose from, so rendering has to rely on combining diacritics. In particular, {{t|ı̣}} (ı with underdot) renders poorly in some fonts because the <code>ı</code> glyph may be missing a [https://fontforge.org/docs/tutorial/editexample6.html#anchoring-marks mark anchor]. In fact, of the 20 possible vowel+diacritic combinations, only 7 have precomposed forms:
{| class="wikitable toaq" style="text-align: center;"
!
! {{done|1}}
! {{done|2}}
! {{done|3}}
! {{done|4}}
|-
! a
| style="background-color: lightgreen;" |    ạ
| style="background-color: lightpink;" | '''ạ́'''
| style="background-color: lightpink;" | '''ạ̈'''
| style="background-color: lightgreen;" |    ậ
|-
! u
| style="background-color: lightgreen;" |    ụ
| style="background-color: lightpink;" | '''ụ́'''
| style="background-color: lightpink;" | '''ụ̈'''
| style="background-color: lightpink;" | '''ụ̂'''
|-
! ı
| style="background-color: lightpink;" | '''ı̣'''
| style="background-color: lightpink;" | '''ị́'''
| style="background-color: lightpink;" | '''ị̈'''
| style="background-color: lightpink;" | '''ị̂'''
|-
! o
| style="background-color: lightgreen;" |    ọ
| style="background-color: lightpink;" | '''ọ́'''
| style="background-color: lightpink;" | '''ọ̈'''
| style="background-color: lightgreen;" |    ộ
|-
! e
| style="background-color: lightgreen;" |    ẹ
| style="background-color: lightpink;" | '''ẹ́'''
| style="background-color: lightpink;" | '''ẹ̈'''
| style="background-color: lightgreen;" |    ệ
|}
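The grapheme clusters in the bold cells with a pink background each consist of a precomposed vowel+underdot character followed by a combining tone diacritic. The exception is {{t|ı̣}}, which has no precomposed form at all and is a dotless ı followed by the combining underdot. Each cell was normalized to [[wikipedia:Unicode equivalence#Normalization|Unicode normalization form C]] (NFC).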
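The count of 7 can be checked with a short script. The following is a minimal Python sketch, not part of any Toaq tooling: the names <code>nfc_cluster</code> and <code>precomposed_cells</code> are hypothetical, and it assumes that toned {{t|ı}} is built on a dotted <code>i</code> base, as the second table on this page does.

```python
import unicodedata

UNDERDOT = "\u0323"  # COMBINING DOT BELOW
# Tone number -> combining tone mark (tone 1 is unmarked).
TONES = {1: "", 2: "\u0301", 3: "\u0308", 4: "\u0302"}
VOWELS = ["a", "u", "\u0131", "o", "e"]  # "\u0131" is dotless ı


def nfc_cluster(vowel: str, tone: int) -> str:
    """Return the NFC form of vowel + tone mark + underdot."""
    mark = TONES[tone]
    # Assumption: a toned ı is written on a dotted i base (í, ï, î).
    base = "i" if vowel == "\u0131" and mark else vowel
    return unicodedata.normalize("NFC", base + mark + UNDERDOT)


def precomposed_cells() -> list[tuple[str, int]]:
    """List the (vowel, tone) cells whose NFC form is a single code point."""
    return [
        (vowel, tone)
        for vowel in VOWELS
        for tone in TONES
        if len(nfc_cluster(vowel, tone)) == 1
    ]


if __name__ == "__main__":
    for vowel, tone in precomposed_cells():
        print(vowel, tone, nfc_cluster(vowel, tone))
```

A cell counts as precomposed here when NFC collapses it to one code point; every other cell keeps at least one combining mark after normalization.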
It appears that the most consistent as well as font- and input-friendly approach is to precompose the vowel with the tone mark and then add a combining underdot (U+0323):
{| class="wikitable toaq" style="text-align: center;"
!
! {{done|1}}
! {{done|2}}
! {{done|3}}
! {{done|4}}
|-
! a
| a&#x323; || á&#x323; || ä&#x323; || â&#x323;
|-
! u
| u&#x323; || ú&#x323; || ü&#x323; || û&#x323;
|-
! ı
| ı&#x323; || í&#x323; || ï&#x323; || î&#x323;
|-
! o
| o&#x323; || ó&#x323; || ö&#x323; || ô&#x323;
|-
! e
| e&#x323; || é&#x323; || ë&#x323; || ê&#x323;
|}
: '''MediaWiki note:''' The wiki software has been normalizing all page content [https://www.mediawiki.org/wiki/Unicode_normalization_considerations since time immemorial], so the table above has to use HTML entities to get the desired effect (e.g., <code>í&amp;#x323;</code> for {{t|ị́}}). [[Template:T]] will do this for you.
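The entity workaround can also be generated mechanically. The sketch below is an illustration only and does not reflect how [[Template:T]] is actually implemented: the function names are hypothetical, and the dotted <code>i</code> base for toned {{t|ı}} is an assumption taken from the table above. It precomposes the vowel with its tone mark and appends the underdot as a character reference, so that normalization of the page source cannot reorder it.

```python
import html
import unicodedata

# Tone number -> combining tone mark (tone 1 is unmarked).
TONE_MARKS = {1: "", 2: "\u0301", 3: "\u0308", 4: "\u0302"}
UNDERDOT_ENTITY = "&#x323;"  # character reference for U+0323


def wikitext_prefix_vowel(vowel: str, tone: int) -> str:
    """Precompose vowel + tone, then append the underdot as an entity."""
    mark = TONE_MARKS[tone]
    # Assumption: a toned ı is written on a dotted i base (í, ï, î).
    base = "i" if vowel == "\u0131" and mark else vowel
    toned = unicodedata.normalize("NFC", base + mark)
    return toned + UNDERDOT_ENTITY


def rendered(wikitext: str) -> str:
    """Decode entities the way a browser would, without renormalizing."""
    return html.unescape(wikitext)


if __name__ == "__main__":
    source = wikitext_prefix_vowel("\u0131", 2)
    print(source)                 # text to paste into the page source
    print(len(rendered(source)))  # code points in the displayed cluster
```

The entity is left as plain ASCII in the page source, which is why it survives normalization; only the browser turns it back into the combining character.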


== See also ==
* [[Input methods]] for writing Toaq's diacritics.
* [[Deranı]], the other, non-Latin writing system.
* [[Unicode]]