Part I of the SimHebrew Bible project is complete.
What began last September as the conversion of one chapter a day, starting with Genesis 1 (based on the Masoretic version in ktiv malé), has now reached Deut. 34, and the SimHebrew Torah (Pentateuch) is concluded:
This is a very good question, as it relates to one of several unresolved problems in using traditional (‘Square’) Hebrew text in computer environments:
Mixing RTL (right-to-left) text such as Hebrew or Arabic with LTR text such as English usually wreaks havoc on the display order of the text. One reason is the conflict between two competing standards of encoding in Hebrew—Logical, and Visual:
There are two approaches: the bi-directional way (logical method) and the visual method. In the logical method, characters are stored in the electronic document in the order in which a person would normally type them; in the visual method, the characters are stored on the assumption that the display device will lay them out left-to-right. In HTML, only the logical method is a real standard.
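The difference between the two storage orders can be illustrated with a minimal Python sketch. It assumes a single all-Hebrew run with no digits, punctuation, or embedded LTR text; real renderers apply the full Unicode Bidirectional Algorithm, which this does not attempt. The function name `visual_order` is hypothetical, chosen for illustration only. The word is written with escape codes so that the source itself is not reordered by the editor's display.

```python
import unicodedata

# "shalom" in logical order, i.e. the order in which it is typed and read:
# shin, lamed, vav, final mem
LOGICAL = "\u05e9\u05dc\u05d5\u05dd"


def visual_order(rtl_run: str) -> str:
    """Return the order a naive left-to-right device would need in storage.

    Assumption: the input is one pure right-to-left run, so visual order
    is simply the reverse of logical order.
    """
    return rtl_run[::-1]


# Each Hebrew letter carries the bidirectional class "R" (right-to-left),
# which is what lets a bidi-aware renderer reorder logical text for display.
bidi_classes = [unicodedata.bidirectional(ch) for ch in LOGICAL]
letter_names = [unicodedata.name(ch) for ch in LOGICAL]

print(letter_names)
print(bidi_classes)
print([unicodedata.name(ch) for ch in visual_order(LOGICAL)])
```

Text stored in visual order looks right only on a device that does no reordering at all; once it is mixed with logically stored text, or searched and sorted, the reversed character sequence no longer matches the word as typed.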
[A2A] Fun exercise.
In biblical (Old Testament) times, the Canaanite alphabet was common to all Canaanite nations—from Moab and Edom in the southeast to Phoenicia in the northwest. The language was substantially the same, as well, throughout that period, with dialectal differences that widened over the centuries, so that by the time of the late First Temple period (750–580 BCE) mutual comprehension was only partial.
There are plenty of references to letter frequency in modern Hebrew (a good English-language source is Stefan Trost’s Character Frequency: Hebrew). However, although modern Hebrew is linguistically very similar to biblical Hebrew, it contains many words of foreign origin and tends to use ‘full spelling’ (i.e. it uses the letters vav and yod to indicate the vowels /o/ or /u/, and /i/, respectively), which can skew the results somewhat.
So a better test would be a certified ancient text of sufficient length to be indicative.
The Mesha Stele—a commemoration by King Mesha of Moab of the liberation of his people from the “yoke” of Israelite rule—is a good candidate in that regard, as it dates to around 840 BCE, and is written in the Moabite language, in the Canaanite script (see right).
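A letter count of this kind can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the sample string is Genesis 1:1 in Square Hebrew, used as a stand-in because the stele text is not reproduced here, and the function name `letter_frequencies` is hypothetical. Final (sofit) letter forms are folded into their base letters, since final forms belong to the Square script and have no counterpart in the Canaanite alphabet.

```python
from collections import Counter

# Fold Square-script final forms into their base letters.
FINAL_TO_BASE = str.maketrans({
    "\u05da": "\u05db",  # final kaf   -> kaf
    "\u05dd": "\u05de",  # final mem   -> mem
    "\u05df": "\u05e0",  # final nun   -> nun
    "\u05e3": "\u05e4",  # final pe    -> pe
    "\u05e5": "\u05e6",  # final tsadi -> tsadi
})


def letter_frequencies(text: str) -> list[tuple[str, int]]:
    """Count Hebrew letters in text, most frequent first."""
    folded = text.translate(FINAL_TO_BASE)
    # Keep only the Hebrew letters alef..tav; drop spaces and punctuation.
    letters = [ch for ch in folded if "\u05d0" <= ch <= "\u05ea"]
    return Counter(letters).most_common()


# Stand-in sample (Genesis 1:1); replace with the inscription text to test it.
SAMPLE = "בראשית ברא אלהים את השמים ואת הארץ"

freqs = letter_frequencies(SAMPLE)
total = sum(count for _, count in freqs)
for letter, count in freqs:
    print(f"{letter}  {count:2d}  {count / total:.1%}")
```

A sample this short is far too small to be indicative; the point of using a longer certified text is that the percentages stabilize once the letter count runs into the hundreds or more.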
Typically, it doesn’t: by and large, there is just one way to spell any given word. There are three types of exception, though:
I don’t know Kazakh, but I do know that the proliferation of letters with apostrophes in the new proposed alphabet is a mistake.
The Square Hebrew / SimHebrew™ Converter will convert any Square Hebrew text into Roman characters (and back again), with full fidelity with regard to spelling and distinction between seemingly homophonous characters:
As with French, German, Italian, or any other Roman-based language, however, you would have to bear in mind that certain characters have different phonetic values compared with English:
Imagine a Latin text, e.g. the opening verses of the Latin Vulgate Bible:
In principio creavit Deus caelum et terram. Terra autem erat inanis et vacua et tenebrae super faciem abyssi et spiritus Dei ferebatur super aquas.
Most of us don’t know Latin, but at least we can read it, and guess at the meaning of some words—or look them up.