Okina



 
 
The okina, also called by several other names (see examples below), is a unicameral consonant letter used within the Latin script to mark the phonetic glottal stop, as it is used in many Polynesian languages.











Okina letter forms
The Tongan fakaua letter or Hawaiian okina, encoded as U+02BB in Unicode, derived from the Lucida Sans font.
The Tahitian eta letter (or Wallisian fakamoga), currently not encoded correctly, derived from the Lucida Sans font.

Area                | Vernacular name                  | Literal meaning             | Notes
--------------------+----------------------------------+-----------------------------+------------------------------------------------------------------
Hawaiian            | okina                            | separator                   | transitionally formalised
Tongan              | fakaua (honorific for fakamonga) | throat maker                | officially formalised
Wallisian (in Uvea) | fakamoga                         | throat maker                | no official or traditional status, may use ' or ‘ or ’
Tahitian            | eta                              | etaeta = to harden          | no official or traditional status, may use ' or ‘ or ’
Cook Islands Maori  | amata or akairo amata            | "hamsah" or "hamsah mark"   | no official or traditional status, may use ' or ‘ or ’ or nothing


Encoding and displaying the Polynesian glottal


Old conventions

In plain ASCII the glottal is sometimes represented by the apostrophe character ('), ASCII value 39 in decimal and 27 in hexadecimal. In most fonts currently in use this renders as a straight, data-processing, typewriter apostrophe, as is also specified in Unicode. In some older fonts, however, especially those used on Unix-like and related platforms and on an MS-DOS screen, it renders as a right single quotation mark (which is the wrong shape).
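The decimal and hexadecimal values quoted above can be checked directly. A minimal Python sketch (the variable names are illustrative only):

```python
# The ASCII apostrophe that often stands in for the glottal stop.
apostrophe = "'"

code_point = ord(apostrophe)       # numeric value of the character
print(code_point)                  # decimal value
print(format(code_point, "x"))     # the same value written in hexadecimal
print(apostrophe.isascii())        # whether it fits in 7-bit ASCII
```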

A "hypercorrect" (that is, incorrect) method for plain ASCII text is to use U+0060 grave accent (`) (incorrectly termed "back-quote character"), which in some older fonts does display a glyph similar to a left single quotation mark. However, in most newer fonts it has a pronounced lean to the left and can look inappropriate. A (partial) advantage is that when a wordlist is sorted alphabetically, the "`" often comes after the "z", exactly where it should be in the Tongan language (admittedly not so in most other Polynesian languages, where it should be ignored). This holds when the sort compares upper-cased text, because in ASCII order the grave accent follows uppercase "Z" but precedes lowercase "a". The grave accent is still useful as a fallback when words are to be entered into a database with limited character-set ability and the character must stay distinct from the apostrophe.
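Where the grave accent lands in a sorted wordlist depends on how the sort compares letters. The sketch below uses an invented wordlist and two common comparison rules; neither is claimed to be the rule of any particular database or tool.

```python
# Hypothetical wordlist; the grave accent stands in for the glottal stop.
words = ["tolu", "`aho", "ua", "ano"]

# Plain code-point order: "`" (0x60) sorts just before lowercase "a" (0x61).
plain = sorted(words)

# Comparing on upper-cased keys: "`" (0x60) now sorts after "Z" (0x5A).
folded = sorted(words, key=str.upper)

print(plain)
print(folded)
```

Only the second ordering puts the glottal words after "z", so the advantage described above is not guaranteed by every sort.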

The new standard and transitional problems

According to Unicode, the code point for the okina is U+02BB MODIFIER LETTER TURNED COMMA ( ʻ ), which can be rendered in HTML by the entity &#699; (or in hexadecimal form &#x02BB;).
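As a quick check of the code point and its two numeric HTML forms, here is a minimal sketch using only the Python standard library:

```python
import html
import unicodedata

okina = "\u02bb"

print(unicodedata.name(okina))             # official Unicode character name
print(ord(okina))                          # decimal code point, as used in the entity
print(html.unescape("&#699;") == okina)    # decimal numeric character reference
print(html.unescape("&#x02BB;") == okina)  # hexadecimal numeric character reference
```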

But lack of support for this character in older fonts (and many newer fonts), along with the large amount of legacy data and the expense in time and money of converting it, has prevented easy and universal use of the new character. Computers running Apple Mac OS X have no problem with the glyph. Microsoft Windows in particular still does, although it is no longer the problem in Internet Explorer 7 that it was in previous versions. U+02BB should be the value used in encoding new data when the expected use of the data permits.
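Converting legacy data could be sketched as a plain character translation. The helper `to_okina` and the table `LEGACY_TO_OKINA` are hypothetical names. The sketch also assumes that the input contains no genuine apostrophes or quotation marks, which real text often violates, so an actual conversion would need review.

```python
OKINA = "\u02bb"  # MODIFIER LETTER TURNED COMMA

# Stand-ins sometimes used for the glottal: ASCII apostrophe, grave accent,
# and left single quotation mark. Assumption: none occurs as real punctuation.
LEGACY_TO_OKINA = str.maketrans({"'": OKINA, "`": OKINA, "\u2018": OKINA})

def to_okina(text: str) -> str:
    """Replace legacy glottal stand-ins with U+02BB (hypothetical helper)."""
    return text.translate(LEGACY_TO_OKINA)

print(to_okina("Hawai'i"))
```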

This character is also a proper one for a Latin-letter transliteration of the Hebrew letter ayin and the Arabic letter ayin. These are sometimes also rendered by a superscript half ring with the opening to the right ( ʿ ) or even, as a typographical fallback, a superscript c ( ᶜ ).

Unicode encodes a glottal stop at U+02C0 MODIFIER LETTER GLOTTAL STOP, but this looks like an undotted question mark, which is inappropriate for okina.

The orientation and curve of the okina should not depend on the font style used for apostrophes (so using a left apostrophe is wrong too, because it can be drawn either as a superscript non-curved mirrored comma or as a superscript 6-shaped apostrophe).

True Polynesian texts, however, draw the okina very differently: it looks like none of the apostrophe, the mirrored apostrophe, the turned comma, or an accent. The Polynesian okina letter is more like a 9-shaped left apostrophe, turned about 60 to 90 degrees counter-clockwise.

Tentative approximations


A display work-around

Because this character is not found in many fonts, it may not appear properly on all computer systems and in all configurations. Accordingly, where U+02BB should properly be used, the Unicode punctuation character U+2018 LEFT SINGLE QUOTATION MARK, ‘, represented by the HTML entity &lsquo;, is sometimes used instead. It is nearly identical in appearance to U+02BB, but is treated as a punctuation mark rather than a letter by applications.

In practical terms, this only matters with regard to page breaks, hyphenation, and capitalisation, and these usually cause few problems. This symbol is also used instead of the recommended turned comma letter in transliterations from Semitic languages, to ensure proper display in the widest range of browsers.
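The letter-versus-punctuation distinction is visible in the Unicode general categories and in how a word-matching pattern treats each character. The following is a minimal sketch, with Python's `\w` standing in for whatever word-boundary logic a given application uses:

```python
import re
import unicodedata

with_letter = "Hawai\u02bbi"   # U+02BB MODIFIER LETTER TURNED COMMA
with_quote = "Hawai\u2018i"    # U+2018 LEFT SINGLE QUOTATION MARK

print(unicodedata.category("\u02bb"))   # general category of the letter
print(unicodedata.category("\u2018"))   # general category of the quotation mark

# \w matches letters, so only the punctuation mark splits the word in two.
print(re.findall(r"\w+", with_letter))
print(re.findall(r"\w+", with_quote))
```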

The problem with this left single quotation mark character is that, depending on the font design, it may have two very different shapes, one of which is incompatible with the okina:
  • a superscript straight mirrored comma, drawn from bottom to top and normally thicker on the bottom right than on the top left. The thicker end on the bottom is incompatible.
  • the modifier letter turned comma, but it may still be wrong as it could be drawn in some font designs as an oblique straight line or a wedge without the needed curve, or the curve may be made so that its center is on the left or top right, when the okina curve should be centered and opened on the bottom or bottom left.


A work-around problem
Nowadays many word processors are equipped with 'smart quotes', which automatically change the straight apostrophe (') and the straight quotation mark (") into curly ones. If a quotation mark occurs after a space, it is assumed to be an open quote (the left quote); if it occurs elsewhere, a close quote (the right quote). This policy also allows the apostrophe to be dealt with in the same way. Clearly this is not the behaviour one wants for the glottal: one would end up with text full of 'drunken' glottals, some pointing left, some pointing right. If a special Polynesian keyboard layout is not available, a workaround to the workaround is to insert a ‘dummy’ space before typing the quote (thus making it a left, open quote), then delete the space.

Also, the standard undo function of word processors removes a bad autocorrection, for example by using the undo icon on the toolbar or pressing Ctrl+Z in the most widely used office suites immediately after the autocorrection happens.
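The smart-quote rule described in this section can be sketched in a few lines. The function name `naive_smart_quotes` is hypothetical, and treating the start of the text like a preceding space is an assumption. Real word processors apply more elaborate rules.

```python
def naive_smart_quotes(text: str) -> str:
    """Curl straight apostrophes: left after a space, right elsewhere.

    Assumption: the start of the text counts like a preceding space.
    """
    out = []
    for i, ch in enumerate(text):
        if ch == "'":
            opens = i == 0 or text[i - 1].isspace()
            out.append("\u2018" if opens else "\u2019")
        else:
            out.append(ch)
    return "".join(out)

print(naive_smart_quotes("'okina Hawai'i"))
```

The word-initial glottal becomes a left quote and the word-internal one a right quote, which is the 'drunken' mixture described above.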

Another problem
In some sans-serif fonts, at normal size and weight, the left single quotation character does not appear distinctly different from the straight apostrophe or from the right single quotation character. In Hawaiian, where only one of these curly quotation forms is used as a letter, this matters little. It is more problematic in displaying transliterations from Semitic languages, where both left-quotation and right-quotation characters are used with different meanings. Luckily, nearly all scholars of native Hawaiian agree that the apostrophe is an accurate proxy when available and that the okina provides no additional value and thus is unnecessary. The standard symbol now used to separate the final two i's in Hawaii is an apostrophe. This is also the officially approved symbol according to the AP Style Guide.

See also

  • Glottal stop (letter)
  • Saltillo (linguistics)


External links

  • The correct Unicode values and HTML entities for Hawaiian in Unicode
  • Apple Compatibility with Hawaiian added in OS 10.2
  • (On slow progress in using proper Hawaiian spellings instead of makeshift English spelling.)
  • A graphic example at the top of the page of the official website of the commune of Faa'a in French Polynesia (this explains why the INSEE still encodes it like the French apostrophe).