All Topics  
Romanization of Armenian

 

   Email Print
   Bookmark   Link






 

Romanization of Armenian



 
 
There are various systems of romanization
Romanization

In linguistics, romanization is the representation of a written word or spoken speech with the Latin alphabet, or a system for doing so, where the original word or language uses a different writing system ....
 of the Armenian alphabet
Armenian alphabet

The Armenian alphabet is an alphabet that has been used to write the Armenian language since the year 405 or 406. Up to the 19th century, Classical Armenian had been the literary language; since then, the Armenian alphabet has been used to write the two modern dialects of Eastern Armenian and Western Armenian....
.

inguistic literature on Classical Armenian, the commonly used transliteration is that of Hübschmann-Meillet (1913).

It uses a dot above mark to express the aspirates, t?, ch?, c?, p?, k?. However, the correct support of this diacritic has been poor for long in the past and was not very common on many usual applications and computer fonts or rendering systems. Some documents were published using the ASCII backquote ` U+0060 as a fallback (or even the ASCII apostrophe ' U+0027 when there was no confusion possible), but the preferred character today is the left half-ring modifier letter (see below).

And some ambiguities were not solved to work with modern vernacular Armenian, that has two dialects, both using two possible orthographies (in addition the modern orthography will be used for Classical Armenian in modern publications).

lass="link1" onMouseover='showByLink("m1526714",this)' onMouseout='hide("m1526714")'href="http://www.absoluteastronomy.com/topics/BGN_slash_PCGN_romanization">BGN/PCGN romanization
BGN/PCGN romanization

BGN/PCGN romanization refers to the systems for romanization and Roman-script spelling conventions adopted by the United States Board on Geographic Names and the Permanent Committee on Geographical Names for British Official Use ....
 (1981) uses a right single quotation mark
Quotation mark

Quotation marks or inverted commas are punctuation marks used in pairs to set off speech, a quotation, a phrase or a word. The pair consists of an opening quotation mark and a closing quotation mark, which may or may not be the same character....
 to express aspirates, t’, ch’, ts’, p’, k’, diverging from the original spiritus asper
Spiritus asper

The spiritus asper , is a diacritic used in the polytonic orthography. In ancient Greek, it indicates initial aspiration , or the presence of the voiceless glottal fricative at the beginning of a word....
 motivation.

This romanization was taken up by ISO (1996), and considered obsolete.






Discussion
Ask a question about 'Romanization of Armenian'
Start a new discussion about 'Romanization of Armenian'
Answer questions from other users
Full Discussion Forum



Encyclopedia


There are various systems of romanization
Romanization

In linguistics, romanization is the representation of a written word or spoken speech with the Latin alphabet, or a system for doing so, where the original word or language uses a different writing system ....
 of the Armenian alphabet
Armenian alphabet

The Armenian alphabet is an alphabet that has been used to write the Armenian language since the year 405 or 406. Up to the 19th century, Classical Armenian had been the literary language; since then, the Armenian alphabet has been used to write the two modern dialects of Eastern Armenian and Western Armenian....
.

Transliteration systems


Hübschmann-Meillet (1913)

In linguistic literature on Classical Armenian, the commonly used transliteration is that of Hübschmann-Meillet (1913).

It uses a dot above mark to express the aspirates, t?, ch?, c?, p?, k?. However, the correct support of this diacritic has been poor for long in the past and was not very common on many usual applications and computer fonts or rendering systems. Some documents were published using the ASCII backquote ` U+0060 as a fallback (or even the ASCII apostrophe ' U+0027 when there was no confusion possible), but the preferred character today is the left half-ring modifier letter (see below).

And some ambiguities were not solved to work with modern vernacular Armenian, that has two dialects, both using two possible orthographies (in addition the modern orthography will be used for Classical Armenian in modern publications).

BGN/PCGN (1981)

BGN/PCGN romanization
BGN/PCGN romanization

BGN/PCGN romanization refers to the systems for romanization and Roman-script spelling conventions adopted by the United States Board on Geographic Names and the Permanent Committee on Geographical Names for British Official Use ....
 (1981) uses a right single quotation mark
Quotation mark

Quotation marks or inverted commas are punctuation marks used in pairs to set off speech, a quotation, a phrase or a word. The pair consists of an opening quotation mark and a closing quotation mark, which may or may not be the same character....
 to express aspirates, t’, ch’, ts’, p’, k’, diverging from the original spiritus asper
Spiritus asper

The spiritus asper , is a diacritic used in the polytonic orthography. In ancient Greek, it indicates initial aspiration , or the presence of the voiceless glottal fricative at the beginning of a word....
 motivation.

This romanization was taken up by ISO (1996), and considered obsolete. This system is a loose transcription and is not reversible (without using dictionnary lookup), notably for single Armenian letters romanized into digraphs (these non reversible, or ambiguous romanizations are shown in a red cell in the table below).

Some Armenian letters have several romanizations, depending on their context:
  • the Armenian vowel letter ?/? should be romanized as ye initially or after the vowel characters ?/?, ?/?, ?/?, ?/?, ?/?, ??/?? and ?/?; in all other cases it should be romanized as e;
  • the Armenian vowel letter ?/? should be romanized as vo initially, except in the word ?? where it should be romanized as ov; in all other cases it should be romanized as o;
  • the Armenian consonant letter ?/? should be romanized yev initially, in isolation or after the vowel characters ?/?, ?/?, ?/?, ?/?, ?/?, ??/?? and ?/?; in all other cases it should be romanized as ev.


ISO 9985 (1996)

ISO 9985 (1996) is the international standard for transliteration of the modern Armenian alphabet. Like with the BGN/PCGN romanization, the right single quotation mark is used to denote the aspirates.

This system is reversible because it avoids the use of digraphs and returns to the Hübschmann-Meillet (however some diacritics for vowels are also modified).

Note that in this scheme, c (signifying ?) collides with the Hübschmann-Meillet transliteration (where it signifies ?). This system is recommended for international bibliographic text interchange (it is also the base of simplified romanizations found to localize the Armenian toponomy of for transliterating human names), where it works very well with the common ISO 8859-2 Latin encoding used in Central Europe.

ALA-LC (1997)

ALA-LC (1997) is largely compatible with BGN/PCGN, but returns to expressing aspirates with a left single quotation mark
Quotation mark

Quotation marks or inverted commas are punctuation marks used in pairs to set off speech, a quotation, a phrase or a word. The pair consists of an opening quotation mark and a closing quotation mark, which may or may not be the same character....
 (in fact the modifier letter left half-ring ? U+02BF, US-MARC hexadecimal code B0, that is also used to denote ayin
Ayin

' or ' is the sixteenth letter in many Semitic abjads, including Phoenician alphabet, Aramaic language, Hebrew language and Arabic alphabet ....
 in Arabic, so some documents may contain either the preferred left half-ring, or sometimes the ASCII backquote ` U+0060).

This standard changes the transliteration scheme used between Classical/Eastern Armenian and Western Armenian for the Armenian consonnants represented by swapping the pairs b vs. p, g vs. k, d vs. t, dz vs. ts and ch vs. j.

In all cases, and to make this romanizatrion less ambiguous and reversible,
  • a soft sign (a prime, US-MARC hexadecimal code A7) is inserted between two separate letters that would otherwise be interpreted as a digraph (in red in the table below); no prime is present in the middle of romanized digraphs zh, kh, ts, dz, gh and ch representing a single Armenian letter;
  • with the Classical Armenian orthography only, the vowel represented by e will be represented by y instead, when it is at the initial position in a name and followed by another vowel; this difficulty has disappeared in modern Armenian with the reformed orthography that changed the original Armenian letter in such case;
  • with the Classical Armenian orthography only, the vowel represented by y will be represented by h instead, when it is at the initial position of a word or of a radical in a compound word; this difficulty has disappeared in modern Armenian with the reformed orthography that changed the original Armenian letter in such case.


ASCII-only input methods

On various Armenian websites, non-standard transliterators have appeared to allow inputting modern Western or Eastern Armenian text using ASCII only characters. It is not a proper transliterator but can be convenient for users that don't have Armenian keyboards.

Despite these input methods are commonly used, they are not obeying to any approved international or Armenian standard, so they are not recommended for the romanization of Armenian. Note that the input methods recognize the Latin digraphs zh, dz, gh, tw, sh, vo, ch, rr for Classic or Eastern Armenian, and zh, dz, tz, gh, vo, ch, rr for Western Armenian, but offer no way to disambiguate words where the digraphs should not be recognized.

Some Armenian letters are entered as Latin digraphs, and may also be followed by the input of an ASCII single quote (which acts as the only letter modifier recognized) but this quote does not always mean that the intended Armenian letter should be aspirated (this may be the reverse for the input ch'), it is also used as a vowel modifier. Due to ambiguities, texts must be corrected by entering an intermediate dummy character before entering the second Latin letter or quote, then removing the dummy character, so that the automatic input converter keeps the Armenian letters distinct.

Transliteration table

Some Armenian letters have very different phonetic between Classical or Eastern Armenian and Western Armenian, so that the usage of Armenian letters is swapped between the two sub-branches of the language.

This is made visible in the table below by coloring transliterations specific to Classical or Eastern Armenian on green background, and those for Western Armenian on blue background. Other letters are transliterated independently of the language branch. However, cells with red background contain transliterations that are context dependent.

Armenian script capital ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
0531 0532 0533 0534 0535 0536 0537 0538 0539 053A 053B 053C 053D 053E 053F 0540 0541 0542 0543 0544
small ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?
0561 0562 0563 0564 0565 0566 0567 0568 0569 056A 057B 056C 056D 056E 057F 0570 0571 0572 0573 0574
Romanization of Classical or Eastern Armenian ASCII
ASCII

American Standard Code for Information Interchange , is a coding standard that can be used for interchanging information, if the information is expressed mainly by the written form of English words....
 input
a b g d e z e' y' t' zh i l x c' k h dz gh tw m
Hübschmann-Meillet ê ? t? ž c j l c
ISO 9985 e ë t’ ç g c?
BGN/PCGN
BGN/PCGN romanization

BGN/PCGN romanization refers to the systems for romanization and Roman-script spelling conventions adopted by the United States Board on Geographic Names and the Permanent Committee on Geographical Names for British Official Use ....
e, ye e y zh kh ts dz gh ch
ALA-LC e, y e e t?
Romanization of Western Armenian ALA-LC p k t dz g ts j
ASCII
ASCII

American Standard Code for Information Interchange , is a coding standard that can be used for interchanging information, if the information is expressed mainly by the written form of English words....
 input
e e' y t' x tz
 
Armenian script capital ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ??  
0545 0546 0547 0548 0549 054A 054B 054C 054D 054E 054F 0550 0551 0552 0553 0554 0555 0556 0548
0552
small ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?? ?
0575 0576 0577 0578 0579 057A 057B 057C 057D 057E 057F 0580 0581 0582 0583 0584 0585 0586 0578
0582
0587
Romanization of Classical or Eastern Armenian ASCII
ASCII

American Standard Code for Information Interchange , is a coding standard that can be used for interchanging information, if the information is expressed mainly by the written form of English words....
 input
y n sh vo ch p j rr s v t r c w p' k', q o f u ev
Hübschmann-Meillet š o c? j c? p? k? ô
ISO 9985 c ? c’ p’ k’ ò ow ew
BGN/PCGN
BGN/PCGN romanization

BGN/PCGN romanization refers to the systems for romanization and Roman-script spelling conventions adopted by the United States Board on Geographic Names and the Permanent Committee on Geographical Names for British Official Use ....
sh o, vo ch’ j rr ts’ o u ev, yev
ALA-LC y, h o ch? ? ts? p? k? o ew, ev
Romanization of Western Armenian ALA-LC b ch d
ASCII
ASCII

American Standard Code for Information Interchange , is a coding standard that can be used for interchanging information, if the information is expressed mainly by the written form of English words....
 input
h' vo ch ch' rr c p' k', q o ev


Note that in the table above, the last two columns are referring to digraphs, not isolated letters. However the last one displays the ligature that is used only as an isolated symbol for the short Armenian word ew (meaning and) in a way similar to the ampersand (&) in the Latin script (the ligature should not be used within other Armenian words so it is not really ambiguous); the same transliteration to ew (classical Armenian) or ev (reformed orthography) will be used for the letters this ligature represents, when they are used as digraphs: it used to refer to the w consonant, now it refers to the v consonant.

The Armenian script also uses some other digraphs that are often written as optional ligatures, in lowercase only (five of them are encoded in Unicode only for full roundtrip compatibility with some legacy encodings); when present, these ligatures (which are purely typographic and carry no semantic distinction in normal Armenian texts) must be romanized by decomposing their component letters.

See also


Bibliographic references

  • Antoine Meillet and Heinrich Hübschmann, Altarmenisches Elementarbuch, Heidelberg, 1913 (2nd edition, 1980).


External links

  • Supports both Eastern and Western pronunciations of Armenian, includes a spell checker.
  • by Thomas T. Pedersen, in KNAB (Kohanimeandmebaas, Place Names Database) of Eesti Keele Instituut (Institute of the Estonian Language).
  • for mobile phones and computers which can't display or type Armenian letters.


Related articles

  • Armenian language
    Armenian language

    The 'Armenian language' is an Indo-European language spoken by the Armenians. It is the official language of the Armenia as well as in the region of Nagorno-Karabakh....
    • Classical Armenian
    • Western Armenian language
      Western Armenian language

      Western Armenian is one of the two modern dialects of Armenian language, an Indo-European languages spoken by the Armenian diaspora, mainly in North America and South America, Europe and most of the Middle East except for Iran....
    • Eastern Armenian language
      Eastern Armenian language

      Eastern Armenian is one of the two dialects of modern Armenian language . It is spoken in the Caucasus Mountains and by the Armenian community in Iran....
  • Armenian alphabet
    Armenian alphabet

    The Armenian alphabet is an alphabet that has been used to write the Armenian language since the year 405 or 406. Up to the 19th century, Classical Armenian had been the literary language; since then, the Armenian alphabet has been used to write the two modern dialects of Eastern Armenian and Western Armenian....
    • Traditional Armenian orthography
      Traditional Armenian orthography

      Traditional Armenian orthography is the orthography developed during the early 19th century for the two modern dialects of the Armenian language - Eastern Armenian language and Western Armenian language....
    • Reformed Armenian orthography
    • ArmSCII
      ARMSCII

      ARMSCII or ArmSCII is the acronym of the Armenian alphabet Standard Code for Information Interchange. It refers to several single-byte character encodings defined by Armenian national standard 166-97....
       (national standard for the single-byte encodings of the Armenian alphabet, and mappings to international standards ISO 10585 and Unicode/ISO/IEC 10646)
  • List of ISO transliterations
    List of ISO transliterations

    List of ISO standards for transliterations and transcriptions :*ISO 9 — Cyrillic*ISO 233 — Arabic*ISO 259 — Hebrew*ISO 843 — Greek...
    • Romanization of Georgian
  • Wikipedia:Naming conventions (Armenian)