|
|
|
|
Romanization of Japanese
|
| |
|
| |
The romanization of Japanese or is the use of the Latin alphabet to write the Japanese language. Japanese is normally written in logographic characters borrowed from Chinese (kanji) and syllabic scripts (kana). The romanization of Japanese is done in any context where Japanese text is targeted at those who do not know the language, such as for names on street signs and passports, and in dictionaries and textbooks for foreign learners of the language.

Discussion
Ask a question about 'Romanization of Japanese'
Start a new discussion about 'Romanization of Japanese'
Answer questions from other users
|
Encyclopedia
The romanization of Japanese or is the use of the Latin alphabet to write the Japanese language. Japanese is normally written in logographic characters borrowed from Chinese (kanji) and syllabic scripts (kana). The romanization of Japanese is done in any context where Japanese text is targeted at those who do not know the language, such as for names on street signs and passports, and in dictionaries and textbooks for foreign learners of the language. The word "romaji" is sometimes incorrectly transliterated as romanji or romanji.
There are several different romanization systems. The three main ones are Hepburn romanization, Kunrei-shiki Romaji (ISO 3602), and Nihon-shiki Romaji (ISO 3602 Strict). Variants of the Hepburn system are the most widely used.
All Japanese who have attended elementary school since World War II have been taught to read and write romanized Japanese. Romanization is also the most common way to input Japanese into word processors and computers. Therefore, almost all Japanese are able to read and write Japanese using romaji. The primary usage of romaji is on computers and other electronic devices that do not support the display or input of Japanese characters, in educational materials for foreigners, and in academic papers in English (or other Western languages) written on Japanese linguistics, literature, history, and culture.
History The earliest Japanese romanization system was based on the orthography of Portuguese. It was developed around 1548 by a Japanese Catholic named Yajiro. Jesuit presses used the system in a series of printed Catholic books so that missionaries could preach and teach their converts without learning to read Japanese orthography. The most useful of these books for the study of early modern Japanese pronunciation and early attempts at romanization was the Nippo jisho, a Japanese-Portuguese dictionary written in 1603. In general, the early Portuguese system was similar to Nihon-shiki in its treatment of vowels. Some consonants were transliterated differently: for instance, the /k/ consonant was rendered, depending on context, as either c or q, and the consonant (now pronounced /h/) as f, so Nihon no kotoba ("The language of Japan") was spelled Nifon no cotoba. The Jesuits also printed some secular books in romanized Japanese, including the first printed edition of the Japanese classic The Tale of the Heike, romanized as Feiqe no monogatari, and a collection of Aesop's Fables (romanized as Esopo no fabulas). The latter continued to be printed and read after the suppression of Christianity in Japan (Chibbett, 1977).
Following the expulsion of Christians from Japan in the late 1590s and early 1600s, romaji fell out of use, and were only used sporadically in foreign texts until the mid-19th century, when Japan opened up again. The systems used today all developed in the latter half of the 19th century.
The first system to be developed was the Hepburn system, developed by James Curtis Hepburn for his dictionary of Japanese words and intended for foreigners to use. Hepburn's system included representation of some sounds that have since changed. For example, Lafcadio Hearn's book Kwaidan shows the older kw- pronunciation; in modern Hepburn romanization, this would be written Kaidan (lit., ghost tales.)
As a replacement for Japanese writing system
In the Meiji era, some Japanese scholars advocated abolishing the Japanese writing system entirely and using romaji (lit., Roman letters) instead. The Nihon shiki romanization was an outgrowth of this movement. Several Japanese texts were published entirely in romaji during this period, but it failed to catch on. Later, in the early 20th century, some scholars devised syllabary systems with characters derived from Latin (rather like the Cherokee syllabary); these were even less popular, because they were not based on any historical use of the Latin alphabet. Today, the use of Nihon shiki for writing Japanese is advocated by Oomoto and some independent organizations.
Modern systems
Hepburn The Revised Hepburn system of romanization uses a macron to indicate some long vowels, and an apostrophe to note the separation of easily-confused phonemes (usually, syllabic n ? from a following naked vowel or semivowel). For example, the name ???????, is written with the kana characters ju-n-i-chi-ro-u, and romanized as Jun'ichiro in Revised Hepburn. Without the apostrophe, it would not be possible to distinguish this correct reading from the incorrect ju-ni-chi-ro-u. This system is widely used in Japan and among foreign students and academics.
Hepburn romanization generally follows English phonology with Romance vowels, and is an intuitive method of showing Anglophones the pronunciation of a word in Japanese. It was standardized in the USA as American National Standard System for the Romanization of Japanese (Modified Hepburn), but this status was abolished on October 6, 1994. Hepburn is the most common romanization system in use today, especially in the English-speaking world.
Nihon-shiki Nihon-shiki is probably the least used of the three main systems. It was originally invented as a method for the Japanese to write their own language. It follows Japanese phonology and the syllabary order very strictly and is hence the only major system of romanization that allows lossless mapping to and from kana. It has also been standardized as ISO 3602 strict form.
Kunrei-shiki Kunrei-shiki is a slightly modified version of Nihon-shiki which eliminates differences between the kana syllabary and modern pronunciation. For example, when the words kana ?? and tsukai ??? are combined, the result is written in kana as ????? with a dakuten (voicing sign) ?on the ? (tsu) kana to indicate that the tsu ? is now voiced. The ? kana is pronounced in the same way as a different kana, ? (su), with dakuten, ?. Kunrei-shiki and Hepburn ignore the difference in kana and represent the sound in the same way, as kanazukai, using the same letters "zu" as are used to romanize ?. Nihon-shiki retains the difference, and romanizes the word as kanadukai, differentiating the ? kana from the ? kana, which is romanized as zu, even though they are pronounced identically. Similarly for the pair ? and ?, which are both zi in Kunrei-shiki and both ji in Hepburn romanization, but are zi and di respectively in Nihon-shiki. See the table below for full details.
Kunrei-shiki has been standardized by the Japanese Government and the International Organisation for Standardisation (ISO 3602). Kunrei-shiki is taught to Japanese elementary school students in their fourth year of education.
Other variants It is possible to elaborate these romanizations to enable non-native speakers to pronounce Japanese words more correctly. Typical additions include tone marks to note the Japanese pitch accent and diacritic marks to distinguish phonological changes, such as the assimilation of the moraic nasal /n/ (see Japanese phonology).
JSL JSL is a romanization system based on Japanese phonology, designed using the linguistic principles used by linguists in designing writing systems for languages that do not have any. It is a purely phonemic system, using exactly one symbol for each phoneme, and marking pitch accent using diacritics. It was created for Eleanor Harz Jorden's system of Japanese language teaching. Its principle is that such a system enables students to better internalize the phonology of Japanese. Since it does not have any of the advantages for non-native speakers that the other romaji systems have, and the Japanese already have a writing system for their language, JSL is not widely used outside the educational environment.
Non-standard romanization In addition to the standardized systems above, there are many variations in romanization, used either for simplification, in error or confusion between different systems, or for deliberate stylistic reasons.
Notably, the various mappings that Japanese input methods use to convert keystrokes on a Roman keyboard to kana often combine features of all of the systems; when used as plain text rather than being converted, these are usually known as wapuro romaji. (Wapuro is a blend of wado purosessa word processor.) Unlike the standard systems, wapuro romaji requires no characters from outside the ASCII character set.
While there may be arguments in favour of some of these variant romanizations in specific contexts, their use, especially if mixed, leads to confusion when romanized Japanese words are indexed. Note that this confusion never occurs when inputting Japanese characters with word processor, because inputted roman alphabets are transcribed into Japanese kana characters as soon as IME decides what character is input.
The following variant romanizations are common:
- Japanese words and names that have established English spellings, such as kudzu and jiu jitsu, or loanwords such as
kyatto for "cat", are sometimes written as they are in English, without regard for the rules of romanization.Jya for ??, which is ja in Hepburn and zya in Nihon-shiki and Kunrei-shiki, and similarly jyu for ?? and jyo for ??. The extraneous y seems to be the result of confusion between the romanization systems.Cchi for ?? (Hepburn tchi) and so on. This is wapuro romaji, but is often used for stylistic reasons when rendering nicknames (for example, ??? Akiko becoming ????? Acchan rather than Atchan).La for ? (Hepburn ra) and so on. The Japanese consonant has a sound (IPA ) that is near, but not identical to, both of English "r" and "l". "R" and "l" are both transcribed into Japanese using the Japanese . Examples of "l" in romanized Japanese include Japanese children's doll ??, romanized as Licca.Na for ?? (Hepburn n'a) and so on. This form of romanized Japanese is used in public information such as road and railway signs in Japan.Nn for ? (Hepburn n). This is also an example of wapuro romaji (although many Japanese input methods also accept the Hepburn n'). This leads to ambiguity with the more widespread Hepburn system. For example, the cluster nna, which is ?? in Hepburn, represents ?? in this system. The double n is sometimes seen in names.
Long vowels The most common variant romanization is to omit the macrons or circumflexes used to indicate a long vowel. This is extremely common in the romanized version of Japanese words used in English. For example the capital city of Japan, correctly written Tokyo in romanized Japanese, is universally written as Tokyo. In Japan, since romanized Japanese is seen mostly as a convenience for foreigners to be able to read signs easily, macrons and circumflexes are usually omitted for simplification.
Many typewriters, word processors, and computerized systems cannot easily deal with the macron used in Hepburn romanization. Nihon-shiki and Kunrei-shiki use a circumflex accent (thus, Tôkyô). This may allow for easier input, since all of â, î, û, ê, and ô are in the ISO-8859-1 character set, and may be easily input on a variety of systems.
In addition, the following three methods of representing long vowels are authorized by the Japanese Foreign Ministry for use in passports.
Oh for ?? or ?? (Hepburn o).Oo for ?? or ??. This is valid JSL and modified Hepburn.Ou for ??. This is also an example of wapuro romaji.
Archaic variants In older texts, other variant romanizations which are now no longer used are sometimes seen. Some of them have survived to the present day, although few of them are still actively used. Examples include:
i plus o was sometimes used to represent the Japanese yoon sound: hence Tokyo becomes "Tokio" and Kyoto becomes "Kioto". This romanization can still be seen in the species name "mioga" of the Japanese vegetable myoga. The kana ? was rendered as ye. The actual pronunciation of this kana was once we, but the w had already been lost by the time that (e.g.) ?? "Wedo" was first romanized as Yedo. The kana ? (Nihon-shiki du) was romanized as dzu, as seen in the plant names adzuki and kudzu. This enjoys some currency even today as Hepburn-like wapuro romaji, and has a phonetic value distinct from zu in many dialects of Japanese. "e" has sometimes been rendered "ye"—e.g. "Iyeyasu" instead of "Ieyasu", "Inouye" instead of "Inoue", and "yen" instead of "en." This usage, like ye for the kana ? (we), reflects the older pronunciation of e as ye. This pronunciation was lost sometime in the late 19th to early 20th centuries.
Romanization of Japanese names Names can be subject to even more variation, with spellings depending on the individual's preference. For example, the manga artist Yasuhiro Nightow's family name would be more conventionally written in Hepburn romanization as Naito.
Other variations seen in names include the substitution of K with C, as in the name of television celebrity Ricaco or the snack food Jagarico.
Example words written in each romanization system
| English | Japanese | Kana spelling | Romanization |
|---|
| Revised Hepburn | Kunrei-shiki | Nihon-shiki |
|---|
| Roman characters | ???? | ???? | romaji | rômazi | rômazi | | Mount Fuji | ??? | ???? | Fujisan | Huzisan | Huzisan | | tea | ?? | ??? | ocha | otya | otya | | governor | ?? | ?? | chiji | tizi | tizi | | to shrink | ?? | ??? | chijimu | tizimu | tidimu | | to continue | ?? | ??? | tsuzuku | tuzuku | tuduku |
Chart of romanizations This chart shows the significant differences between the major romanization systems.
| Kana | Revised Hepburn | Kunrei-shiki | Nihon-shiki |
|---|
| ?? | u | û | | ??, ?? | o | ô | | ? | shi | si | | ?? | sha | sya | | ?? | shu | syu | | ?? | sho | syo | | ? | ji | zi | | ?? | ja | zya | | ?? | ju | zyu | | ?? | jo | zyo | | ? | chi | ti | | ? | tsu | tu | | ?? | cha | tya | | ?? | chu | tyu | | ?? | cho | tyo | | ? | ji | zi | di | | ? | zu | zu | du | | ?? | ja | zya | dya | | ?? | ju | zyu | dyu | | ?? | jo | zyo | dyo | | ? | fu | hu |
Historical romanizations
1603: Vocabvlario da Lingoa de Iapam (1603)1604: Arte da Lingoa de Iapam (1604–1608)1620: Arte Breve da Lingoa Iapoa (1620)
| | ? | ? | ? | ? | ? | | | | |
|---|
| 1603 | a | i, j, y | v, u | ye | vo, uo | | | | | | 1604 | i | v | vo | | | | | | 1620 | y | | | | | | | ? | ? | ? | ? | ? | ?? | | ?? | ?? |
|---|
| 1603 | ca | qi, qui | cu, qu | qe,que | co | qia | | qio, qeo | qua | | 1604 | qui | que | quia | | quio | | 1620 | ca, ka | ki | cu, ku | ke | kia | | kio | | | ? | ? | ? | ? | ? | ?? | ?? | ?? | ?? |
|---|
| 1603 | ga | gui | gu, gv | gue | go | guia | guiu | guio | gua | | 1604 | gu | | | 1620 | ga, gha | ghi | gu, ghu | ghe | go, gho | ghia | ghiu | ghio | | | ? | ? | ? | ? | ? | ?? | ?? | ?? | |
|---|
| 1603 | sa | xi | su | xe | so | xa | xu | xo | | | 1604 | | | 1620 | | | | ? | ? | ? | ? | ? | ?? | ?? | ?? | |
|---|
| 1603 | za | ii, ji | zu | ie, ye | zo | ia, ja | iu, ju | io, jo | | | 1604 | ji | | ia | ju | jo | | | 1620 | ie | iu | io | | | | ? | ? | ? | ? | ? | ?? | ?? | ?? | |
|---|
| 1603 | ta | chi | tçu | te | to | cha | chu | cho | | | 1604 | | | 1620 | | | | ? | ? | ? | ? | ? | ?? | ?? | ?? | |
|---|
| 1603 | da | gi | zzu | de | do | gia | giu | gio | | | 1604 | dzu | | | 1620 | | | | ? | ? | ? | ? | ? | ?? | ?? | ?? | |
|---|
| 1603 | na | ni | nu | ne | no | nha | nhu, niu | nho, neo | | | 1604 | nha | nhu | nho | | | 1620 | | | | ? | ? | ? | ? | ? | ?? | ?? | ?? | |
|---|
| 1603 | fa | fi | fu | fe | fo | fia | fiu | fio, feo | | | 1604 | | | fio | | | 1620 | | | | | | ? | ? | ? | ? | ? | ?? | ?? | ?? | |
|---|
| 1603 | ba | bi | bu | be | bo | bia | biu | bio, beo | | | 1604 | | | | | | 1620 | bia | biu | | | | | ? | ? | ? | ? | ? | ?? | ?? | ?? | |
|---|
| 1603 | pa | pi | pu | pe | po | pia | | pio | | | 1604 | | | | | | 1620 | pia | | | | | | ? | ? | ? | ? | ? | ?? | | ?? | |
|---|
| 1603 | ma | mi | mu | me | mo | mia, mea | | mio, meo | | | 1604 | | | | | | 1620 | | | mio | | | | ? | | ? | | ? | | | | |
|---|
| 1603 | ya | | yu | | yo | | | | | | 1604 | | | | | | | | 1620 | | | | | | | | | ? | ? | ? | ? | ? | ?? | ?? | ?? | |
|---|
| 1603 | ra | ri | ru | re | ro | ria, rea | riu | rio, reo | | | 1604 | | | rio | | | 1620 | | riu | | | | ? | ? | | ? | ? | | | | |
|---|
| 1603 | va, ua | | | | vo, uo | | | | | | 1604 | va | y | | ye | vo | | | | | | 1620 | | | | | | | | ? |
|---|
| 1603 | n, m, ~ (tilde) | | 1604 | n | | 1620 | n, m | | | ? |
|---|
| 1603 | -t, -cc-, -cch-, -cq-, -dd-, -pp-, -ss-, -tt, -xx-, -zz- | | 1604 | -t, -cc-, -cch-, -pp-, -cq-, -ss-, -tt-, xx- | | 1620 | -t, -cc-, -cch-, -pp-, -ck-, -cq-, -ss-, -tt-, -xx- |
Alphabet letter names in Japanese The list below shows how to spell Latin character words or acronyms in Japanese. For example, NHK is spelled enu-eichi-kei, (???????). The following pronunciations are based on English letter names; otherwise, for example, A would likely be called a in Japanese.
e or ei (?? or ??)B; bi (??, alternative pronunciation be, ??)C; shi (?? or ???, sometimes pronounced si, ???)D; di (???, alternative pronunciation de, ??)E; iF; efuG; jiH; eichi or etchi (??? or ???)I; aiJ; je or jei (??? or ???)K; ke or kei (?? or ??)L; eruM; emuN; enuO; oP; pi (??, alternative pronunciation pe, ??)Q; kyuR; aruS; esuT; ti (???, though sometimes pronounced chi, ??, and alternatively pronounced te, ??)U; yuV; vi (??, though often pronounced bui, ??)W; daburyu (?????, often pronounced daburu, ???)X; ekkusuY; waiZ; zetto, zeddo, or zi (???, ???, or ???, though sometimes pronounced ji, ??)
Kana without romanized forms There is no generally accepted form of romanization for some forms of kana. In particular there is no form of romanization for full-sized kana combined with smaller versions of the vowel kana, "?", "?", "?", "?" and "?", the smaller versions of the y kana, "?", "?", and "?", and the sokuon or small tsu kana "?". Although these are usually regarded as merely phonetic marks or diacritics, they do appear on their own, such as at the end of sentences or in some names.
There is also no commonly accepted way of romanizing common combinations such as "??" of katakana to and small u, used to represent sounds as in the English word "too". Some people write this pair as tu, but this is likely to be confused with the tu Nihon-shiki and Kunrei-shiki romanizations of the kana ?, romanized as tsu in Hepburn romanization.
On a computer or word processor, these smaller kana may be produced in various ways. For example, an "x" or an "l" preceding the romanization of the full-sized kana produces a small version on some systems, thus xtu gives "?" on Microsoft Windows. However this is not standardized, and these forms are restricted to use in input systems; they are not used to represent the smaller kana in romanized Japanese.
See also
External links
- is an open source software written in C and can convert Japanese text into romaji, furigana, hiragana or katakana.
- based on KAKASI.
-
- (in Japanese) contains an extremely extensive and accurate collection of materials relating to romaji, including standards documents and even HTML versions of Hepburn's original dictionaries.
- from Andrew Horvat's contains a discussion of the problems caused by the variety of confusing romanization systems in use in Japan today.
-
- , Converts Japanese web pages or text into one of three formats for easier reading: furigana, kana or romaji
-
- Multi-language phonetic reading site that can add phonetic reading to any Japanese site or texts in five different alphabets, Hiragara, Roman, Hangul, Devanagari and Cyrillic letters for easier reading
|
| |
|
|