HIERO (Hebrew Idiom in English Roots) is a partial translation of the Old Testament, in which Hebrew word order and vocabulary choices are transparent to the reader.
HIERO consists of an English lexicon, an annotated Hebrew text, a Hebrew parsing dictionary, and the program code.
The English lexicon is the most important part of HIERO—and the part that has taken the most time by far to develop. The primary goal of the English lexicon is to link each Hebrew root with a single English root and vice versa. HIERO’s English lexicon contains about 9,500 definitions of Hebrew and Aramaic words. The lexicon is written in XML code. The XML code, part-of-speech mappings, and Strongs number mappings that structure the lexicon are based on the OSHB Hebrew Lexicon by the Open Scriptures Hebrew Bible Project. All of the definitions are my own. The following is a partial sample from the lexicon:
<entry><w xlit="bārāʾ">בָּרָא</w><pos>V</pos><qalAct>shape</qalAct>
<pielAct>reshape</pielAct>
<hiphilAct>[keep] °[in] shape</hiphilAct><xref strong="1254A,1254B" />
<entry><w xlit="bĕrî">בְּרִי</w><pos>A</pos><def>shapely</def><xref strong="1274" /></entry>
<entry><w xlit="bārîʾ">בָּרִיא</w><pos>A</pos><def>shapely</def><xref strong="1277" /></entry>
<entry><w xlit="bĕrîʾâ">בְּרִיאָה</w><pos>N</pos><def>shaping</def><xref strong="1278" /></entry>
<entry><w xlit="bĕrāʾyâ">בְּרָאיָה</w><pos>Np</pos><def>shaped-of-Yʜ</def><xref strong="1256" /></entry>
</entry>
The lexicon tags follow the following rules:
<entry>
element. Related words derived from the root will have <w>
element contains the entry’s Hebrew root.<pos>
element describes the entry’s part of speech.<def>
element contains the entry’s English translation.A verb entry does not contain a <def>
element. Instead, each verb entry contains up to eight different translations corresponding to the eight Hebrew functional verb stem types (nine in Aramaic). In this example, the stem translations are contained in the <qalAct>
, <pielAct>
, and <hiphilAct>
elements.
Any English translation of a stem that appears in the Hebrew text but not in the English lexicon is derived regularly from the translation of another stem. For example, HIERO can automatically derive the translation of the Piel passive stem (“be reshaped”) from the translation provided for the Piel active stem (“reshape”).
<xref>
element contains a list of Hebrew lexical numbers that HIERO will represent using the English entry. Entry numbers are based on the Strongs numbering system but include letter suffixes to differentiate senses that were not differentiated in the original Strongs system.Further reading:
HIERO translates from the four-part Translators Amalgamated Hebrew OT (TAHOT) by STEP Bible. TAHOT is an edition of the Leningrad codex, following the qere of the Masoretic Text.
I have extracted the relevant data from TAHOT and reformatted it for use with HIERO. The following is a sample from the reformatted text:
Word number | Hebrew | Parsing variant |
---|---|---|
Gen.1.1#01=L | בְּרֵאשִׁ֖ית | 1 |
Gen.1.1#02=L | בָּרָ֣א | 1 |
Gen.1.1#03=L | אֱלֹהִ֑ים | |
Gen.1.1#04=L | אֵ֥ת | |
Gen.1.1#05=L | הַשָּׁמַ֖יִם | |
Gen.1.1#06=L | וְאֵ֥ת | |
Gen.1.1#07=L | הָאָֽרֶץ׃ |
The Hebrew text is formatted with one Hebrew word per line. Each word is numbered (two digits, preceded by #), and its source text is noted (“L” indicates the Leningrad codex). Words that could be parsed in multiple ways are annotated with a number to indicate the appropriate parsing variant. In this example, word #01 could be followed by the word “of,” but since the context makes that impossible, the annotation “1” indicates to translate it without the word “of.” Word #02 could be parsed as an Aramaic word, but since this is a Hebrew text, the annotation “1” indicates to translate it as Hebrew.
Further reading:
HIERO includes a Hebrew parsing dictionary of 53,000 unique word forms found in the text, along with the corresponding morphological and lexical tags. This parsing dictionary includes parsing information taken from TAHOT as well as additional corrections of my own. The following is a sample from the parsing dictionary:
Hebrew | Lexicon | Morphology | Lex. variant 1 | Morph. var. 1 | Lex. var. 2 | Morph. var. 2 |
---|---|---|---|---|---|---|
גָדוֹל | 1419A | HAamsa | 1431 | Vqaa | ||
גָדְלָה | 1431 | HVqp3fs | ||||
גָּדַלְתָּ | 1431 | HVqp2ms | ||||
הִגְדִּיל | 1431 | HVhp3ms | ||||
גִּדַּל | 1431 | HVpp3ms | ||||
יַגְדִּיל | 1431 | HVhi3ms | ||||
גָּדְלוּ | 1431 | HVqp3cp |
Lexical tags refer to entry numbers found in the <xref>
tags in the English lexicon. Morphology codes describe the word’s part of speech, form, person, gender, number, and state. Alternative parsing variants are given in additional columns.
Further reading:
HIERO is written in Visual Basic and contains 2,100 lines of executable code. Much of the code is available in HIERO’s GitHub repository under “code samples.” HIERO does not use artificial intelligence. Translating the Hebrew Old Testament takes 24 seconds on an ordinary laptop.
To begin translation, HIERO iterates through the annotated Hebrew text, one word at a time. For each word, HIERO looks up the corresponding parsing from the Hebrew parsing dictionary, using the annotations from the Hebrew text, if any exist. Once the parsing has been obtained, HIERO uses the word’s lexicon tag to look up its translation in the English lexicon. HIERO uses the word’s morphology tag to inflect the English translation and apply appropriate formatting via CSS. HIERO extracts cantillation marks from the Hebrew word and uses them to apply English punctuation or line breaks via XHTML. Finally, HIERO outputs the result to an HTML file. HIERO then moves to the next word and repeats.
The user views the output by opening the HTML output files. Read about HIERO’s formatting here.