Information about Okina

The correct title of this article is . This article's title contains characters or symbols not found in Unicode.
ʻOkina letter forms
Enlarge picture
Hawaiian ʻokina
The Tongan fakauʻa letter or Hawaiian ʻokina encoded as U+02BB (in Unicode 5.0[1]), derived from the Lucida Sans font.
Enlarge picture
Tahitian ʻeta
The Tahitian ʻeta letter (or Wallisian fakamoga), currently not encoded correctly, derived from the Lucida Sans font.
The ʻokina, also called by several other names (see examples below), is a unicameral consonant letter used within the Latin script to mark the phonetic glottal stop, as it is used in many Polynesian languages.

Area Vernacular name Literal meaning Notes
Hawaiianʻokinaseparatortransitionally formalised
Tonganfakauʻa
(honorific for fakamonga)
throat makerofficially formalised
Wallisian (in ʻUvea)fakamogathroat makerno official or traditional status, may use ' or or
Tahitianʻetaʻetaʻeta = to hardenno official or traditional status, may use ' or or
Cook Islands Maoriʻamata or ʻakairo ʻamata"Hamsah" or "Hamsah mark"no official or traditional status, may use ' or or or nothing

Encoding and displaying the Polynesian glottal

Old conventions

In plain ASCII the glottal is sometimes represented by the apostrophe character ('), ASCII value 39 in decimal and 27 in hexadecimal, which in most fonts currently used renders as a straight, data-processing, typewriter apostrophe as is also specified in Unicode. But in some older fonts, especially those used on Unix-like platforms and related platforms and on an MS-DOS screen, it renders as a right single quotation mark (which is the wrong shape).

A hypercorrect (but actually incorrect) method for plain ASCII text is to use U+0060 grave accent (incorrectly termed "back-quote character" (`), which in some older fonts does display a glyph similar to a left single quotation mark. However, in most newer fonts, it has a pronounced lean to the left and can look inappropriate. A (partial) advantage is when a wordlist is alphabetically sorted, the "`" often comes after the "z", exactly where it should be in the Tongan language (admittedly not so in most other Polynesian languages, where it should be ignored). It is still useful as a fallback when words are to be entered into a database with limited character-set ability to have the character distinct from the apostrophe.

The new standard and transitional problems

Unicode 5.0[2] (issued July 14, 2006) says the codepoint for ʻokina is Unicode character U+02BB MODIFIER LETTER TURNED COMMAʻ ) which can be rendered in HTML by the entity ʻ (or in hexadecimal form ʻ).

But lack of support for this character in older fonts (and many newer fonts) along with the large amount of legacy data and expense in time and money to convert has prevented easy and universal use of the new character. As of 2006 Apple Mac OS X based computers have no problem with the glyph, but Microsoft Windows especially when using Internet Explorer still has. U+02BB should be the value used in encoding new data when the expected use of the data permits.

This character is also a proper one for a Latin-letter transliteration of the Hebrew letter ʻyin and the Arabic letter ʻayn. They are sometimes also rendered by a superscript half ring with the opening to the right ( ʿ ) or even, as a typographical fallback, a superscript cc ).

Unicode encodes a glottal stop at U+02C0 MODIFIER LETTER GLOTTAL STOP (ˀ), but this looks like an undotted question mark, which is inappropriate for ʻokina.

Its orientation and curve should not depend on the font style for apostrophes (so using a left apostrophe is wrong too, because it can be drawn either like a superscript non-curved mirrored comma, or a superscript 6-shaped apostrophe).

True Polynesian texts however draw the okina very differently, and this looks as none of the apostrophe, mirrored apostrophe, turned comma, or accent letter. The Polynesian ʻokina letter is more like 9-shaped left apostrophe, turned about 60 to 90 degrees counter-clockwise.

Tentative approximants

A display work-around

Because this character is not found in many fonts, it may not appear properly on all computer systems and in all configurations. Accordingly, where U+02BB should properly be used, the Unicode punctuation character U+2018 LEFT SINGLE QUOTATION MARK, ‘, represented by the HTML entity ‘, is sometimes used instead. It is nearly identical in appearance to U+02BB, but is treated as a punctuation mark rather than a letter by applications.

In practical terms, this only matters with regard to page breaks, hyphenation, and capitalisation; these usually cause few problems. This symbol is also used instead of the recommended turned comma letter symbol in transliterations from Semitic languages to assure proper display on the widest number of browsers.

The problem with this left single quotation mark character is that, depending on font style design, the single quotation mark may have two very different shapes, one of which is incompatible with the okina :
  • a superscript straight mirrored comma, drawn from bottom to top and normally thicker on the bottom right than on the top left. The thicker end on the bottom is incompatible.
  • the modifier letter turned comma, but it may still be wrong as it could be drawn in some font designs as an oblick strait line or a wedge without the needed curve, or the curve will be made so that its center will be on the left or top right, when the okina curve should be centered and opened on the bottom or bottom left.

A work-around problem

Nowadays many word-processors are equipped with 'smart quotes', which automatically change the straight apostrophe (') and the straight quotation mark (") into curly ones. If a quotation mark occurs after a space, it is assumed to be an open quote (the left quote), if elsewhere a close quote (the right quote). This policy also allows the apostrophe to be dealt with in the same way. Clearly this is not the behaviour one wants for the glottal. One would end up with text full with 'drunken' glottals, some pointing left, some pointing right. If a special Polynesian keyboard layout is not available, a workaround to the workaround is to insert a ‘dummy’ space before typing the quote (thus making it a left, open quote), then delete the space.

Another problem

In some sans-serif fonts non-bolded and at normal size, the left single quotation character does not appear distinctly different from the straight apostrophe or from the right single quotation character. In Hawaiian, where only one of these curly quotation forms is used as a letter, this matters little. It is more problematic in displaying transliterations from Semitic languages where both left-quotation and right-quotation characters are used with different meanings.

See also

External links

The ISO basic Latin alphabet
AaBbCcDdEeFfGgHhIiJjKkLlMmNnOoPpQqRrSsTtUuVvWwXxYyZz
Z?
Typography is the art and techniques of type design, modifying type glyphs, and arranging type. Type glyphs (characters) are created and modified using a variety of illustration techniques.
..... Click the link for more information.
Symbols are objects, characters, or other concrete representations of ideas, concepts, or other abstractions. For example, in the United States, Canada and Great Britain, a red octagon is a symbol for the traffic sign meaning "STOP".
..... Click the link for more information.
Unicode is an industry standard allowing computers to consistently represent and manipulate text expressed in any of the world's writing systems. Developed in tandem with the Universal Character Set standard and published in book form as The Unicode Standard
..... Click the link for more information.
Tongan}}} 
Official status
Official language of: Tonga
Regulated by: no official regulation
Language codes
ISO 639-1: to
ISO 639-2: ton
ISO 639-3: ton

Tongan (lea fakatonga
..... Click the link for more information.
Hawaiian}}} 
Writing system: Latin 
Official status
Official language of: Hawaiʻi (with English)
Regulated by: no official regulation
Language codes
ISO 639-1: none
..... Click the link for more information.
Lucida Sans <nowiki /> Category Various
<nowiki />
Designer(s) Charles Bigelow
Kris Holmes <nowiki /> <nowiki />
Foundry Bigelow & Holmes <nowiki /> <nowiki /> <nowiki /> <nowiki /> <nowiki /> <nowiki />
..... Click the link for more information.
Tahitian, a Tahitic language, is one of the two official languages of French Polynesia (along with French). It is an Eastern Polynesian language closely related to Rarotongan, New Zealand Māori, and Hawaiian.
..... Click the link for more information.
ʻUvean (Fakaʻuvea in the vernacular) is the Polynesian language spoken on ʻ
..... Click the link for more information.
Lucida Sans <nowiki /> Category Various
<nowiki />
Designer(s) Charles Bigelow
Kris Holmes <nowiki /> <nowiki />
Foundry Bigelow & Holmes <nowiki /> <nowiki /> <nowiki /> <nowiki /> <nowiki /> <nowiki />
..... Click the link for more information.
consonant is a sound in spoken language that is characterized by a closure or stricture of the vocal tract sufficient to cause audible turbulence. The word consonant
..... Click the link for more information.
Latin alphabet
Child systems Numerous: see Alphabets derived from the Latin
Sister systems Cyrillic
Coptic
Armenian
Runic/Futhark
Unicode range See Latin characters in Unicode
ISO 15924 Latn

Note
..... Click the link for more information.
glottal stop or voiceless glottal plosive is a type of consonantal sound, used in many spoken languages. The symbol in the International Phonetic Alphabet that represents this sound is ʔ.
..... Click the link for more information.
Polynesian languages are a language family spoken in the region known as Polynesia. They are classified as part of the Austronesian family, belonging to the Eastern Eastern Malayo-Polynesian branch of that family. They fall into two branches: Tongic and Nuclear Polynesian.
..... Click the link for more information.
Hawaiian}}} 
Writing system: Latin 
Official status
Official language of: Hawaiʻi (with English)
Regulated by: no official regulation
Language codes
ISO 639-1: none
..... Click the link for more information.
Tongan}}} 
Official status
Official language of: Tonga
Regulated by: no official regulation
Language codes
ISO 639-1: to
ISO 639-2: ton
ISO 639-3: ton

Tongan (lea fakatonga
..... Click the link for more information.
ʻUvean (Fakaʻuvea in the vernacular) is the Polynesian language spoken on ʻ
..... Click the link for more information.
Tahitian, a Tahitic language, is one of the two official languages of French Polynesia (along with French). It is an Eastern Polynesian language closely related to Rarotongan, New Zealand Māori, and Hawaiian.
..... Click the link for more information.
Cook Islands Maori language, also called Māori Kūki 'Āirani or Rarotongan, is the official language of the Cook Islands. Most Cook Islanders also call it Te reo Ipukarea, literally "the language of the Ancestral Homeland".
..... Click the link for more information.
American Standard Code for Information Interchange (ASCII), generally pronounced ask-ee IPA: /ˈæski/ ( [1] ), is a character encoding based on the English alphabet.
..... Click the link for more information.
apostrophe  or  ' ) is a punctuation mark, and sometimes a diacritic mark, in languages written in the Latin alphabet.
..... Click the link for more information.
decimal (base ten or occasionally denary) numeral system has ten as its base. It is the most widely used numeral system, perhaps because humans have four fingers and a thumb on each hand, giving a total of ten digits over both hands.
..... Click the link for more information.
hexadecimal, base-16, or simply hex, is a numeral system with a radix, or base, of 16, usually written using the symbols 0–9 and A–F, or a–f.
..... Click the link for more information.
typewriter is a mechanical, electromechanical, or electronic device with a set of "keys" that, when pressed, cause characters to be printed on a document, usually paper.
..... Click the link for more information.
Unix-like operating system is one that behaves in a manner similar to a Unix system, while not necessarily conforming to or being certified to any version of the Single UNIX Specification.
..... Click the link for more information.
Hypercorrection comprises four linguistic phenomena:
  1. an elaborate, prescriptively based correction of common usage, often introduced in an attempt to avoid vulgarity or informality, that results in wording commonly considered clumsier than the usual, colloquial usage.

..... Click the link for more information.

..... Click the link for more information.
Unicode is an industry standard allowing computers to consistently represent and manipulate text expressed in any of the world's writing systems. Developed in tandem with the Universal Character Set standard and published in book form as The Unicode Standard
..... Click the link for more information.
HTML (Hypertext Markup Language)

File extension: .html, .htm
MIME type: text/html
Type code: TEXT
..... Click the link for more information.
20th century - 21st century - 22nd century
1970s  1980s  1990s  - 2000s -  2010s  2020s  2030s
2003 2004 2005 - 2006 - 2007 2008 2009

2006 by topic:
News by month
Jan - Feb - Mar - Apr - May - Jun
..... Click the link for more information.
Mac OS X (IPA: /mæk.oʊ.ɛs.tɛn/) is a line of graphical operating systems developed, marketed, and sold by Apple Inc., the latest of which is pre-loaded on all currently shipping Macintosh computers.
..... Click the link for more information.


This article is copied from an article on Wikipedia.org - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.
Herod_Archelaus


page counter