All Topics  
Diacritic

 

   Email Print
   Bookmark   Link






 

Diacritic



 
 
A diacritic (also diacritical mark, diacritical point, diacritical sign) is a small sign added to a letter
Letter (alphabet)

A letter is an element in an alphabetic system of writing, such as the Greek alphabet and its descendants. Each letter in the written language is usually associated with one phoneme in the spoken form of the language....
 to alter pronunciation or to distinguish between similar words. The term derives from the Greek
Greek language

Greek is an Indo-European languages native to the southern Balkan peninsula, the language of the Greek people. It forms an independent branch within Indo-European....
 d?a???t???? (diakritikós, "distinguishing"). Diacritic is both an adjective
Adjective

In grammar, an adjective is a word whose main syntax role is to grammatical modifier a noun or pronoun, giving more information about the noun or pronoun's definition....
 and a noun
Noun

In linguistics, a noun is a member of a large, open class lexical category whose members can occur as the main word in the subject of a clause, the object of a verb, or the object of a preposition....
, whereas diacritical is only an adjective
Adjective

In grammar, an adjective is a word whose main syntax role is to grammatical modifier a noun or pronoun, giving more information about the noun or pronoun's definition....
. Some diacritical marks, such as the grave
Grave accent

The grave accent is a diacritical mark used in written Catalan language, French language, Greek language until 1982 , Italian language, Norwegian language, Occitan language, Portuguese language, Scottish Gaelic language, Vietnamese language, Welsh language, Dutch language, and other languages....
 and acute
Acute accent

The acute accent is a diacritic mark used in many modern written languages with alphabets based on the Latin alphabet, Cyrillic alphabet and Greek alphabet writing systems....
 accents, but not the cedilla
Cedilla

A cedilla or cedille is a hook added under certain consonant letters as a diacritic to modify their pronunciation....
, are often called accents.

A diacritical mark can appear above or below a letter or in some other position.






Discussion
Ask a question about 'Diacritic'
Start a new discussion about 'Diacritic'
Answer questions from other users
Full Discussion Forum



Encyclopedia


A diacritic (also diacritical mark, diacritical point, diacritical sign) is a small sign added to a letter
Letter (alphabet)

A letter is an element in an alphabetic system of writing, such as the Greek alphabet and its descendants. Each letter in the written language is usually associated with one phoneme in the spoken form of the language....
 to alter pronunciation or to distinguish between similar words. The term derives from the Greek
Greek language

Greek is an Indo-European languages native to the southern Balkan peninsula, the language of the Greek people. It forms an independent branch within Indo-European....
 d?a???t???? (diakritikós, "distinguishing"). Diacritic is both an adjective
Adjective

In grammar, an adjective is a word whose main syntax role is to grammatical modifier a noun or pronoun, giving more information about the noun or pronoun's definition....
 and a noun
Noun

In linguistics, a noun is a member of a large, open class lexical category whose members can occur as the main word in the subject of a clause, the object of a verb, or the object of a preposition....
, whereas diacritical is only an adjective
Adjective

In grammar, an adjective is a word whose main syntax role is to grammatical modifier a noun or pronoun, giving more information about the noun or pronoun's definition....
. Some diacritical marks, such as the grave
Grave accent

The grave accent is a diacritical mark used in written Catalan language, French language, Greek language until 1982 , Italian language, Norwegian language, Occitan language, Portuguese language, Scottish Gaelic language, Vietnamese language, Welsh language, Dutch language, and other languages....
 and acute
Acute accent

The acute accent is a diacritic mark used in many modern written languages with alphabets based on the Latin alphabet, Cyrillic alphabet and Greek alphabet writing systems....
 accents, but not the cedilla
Cedilla

A cedilla or cedille is a hook added under certain consonant letters as a diacritic to modify their pronunciation....
, are often called accents.

A diacritical mark can appear above or below a letter or in some other position. Its main usage is to change the phonetic value of the letter to which it is added; but it may also be used to modify the pronunciation of a whole word or syllable, as in the tone
Tone (linguistics)

Tone is the use of pitch in language to distinguish lexical or grammatical meaning?that is, to distinguish or inflection words. All languages use pitch to express emotional and other paralinguistic information, and to convey emphasis, contrast, and other such features in what is called intonation , but not all languages use tones to distingu...
 marks of tonal languages; to distinguish between homograph
Homonym

In linguistics, a homonym is one of a group of words that share the same spelling and the same pronunciation but have different meanings, usually as a result of the two words having different origins....
s; to make abbreviations, as the titlo
Titlo

Titlo is an extended diacritic symbol first used in old Cyrillic manuscripts, e.g., in Old Church Slavonic and Old East Slavic language languages....
 in old Slavic
Slavic languages

File:Slavic europe.svgThe Slavic languages , a group of closely related languages of the Slavic peoples and a subgroup of Indo-European languages, have speakers in most of Eastern Europe, in much of the Balkans, in parts of Central Europe, and in the northern part of Asia....
 texts does; and to change the meaning of a character, such as to denote numerals in numeral system
Numeral system

A numeral system is a writing system for expressing numerals , and a mathematical notation for representing numbers of a given set, using graphemes or symbols in a consistent manner....
s, such as early Greek numerals
Greek numerals

Greek numerals are a numeral system using letters of the Greek alphabet. They are also known by the names Milesian numerals, Alexandrian numerals, or alphabetic numerals....
.

In orthography
Orthography

The orthography of a language specifies the correct way of using a specific writing system to write the language. Orthography is derived from Greek language ????? orth?s and ???fe?? gr?phein ....
 and collation
Collation

Collation is the assembly of written information into a standard order. One common type of collation is called alphabetisation, though collation is not limited to ordering letters of the alphabet....
, a letter modified by a diacritic may be treated either as a new, distinct letter or as a letter–diacritic combination. This varies from language to language and, in some cases, from symbol to symbol within a single language.

Types

Among the types of diacritic are these:
  • accent marks (thus called because the acute, the grave and the circumflex accent were originally used to indicate different types of pitch accent
    Pitch accent

    Pitch accent is a linguistics term of convenience for a variety of restricted tone systems that use variations in Pitch to give prominence to a syllable or Mora_ within a word....
    s, in the polytonic orthography of Greek
    Greek language

    Greek is an Indo-European languages native to the southern Balkan peninsula, the language of the Greek people. It forms an independent branch within Indo-European....
    )
    • ( ´ ) acute accent
      Acute accent

      The acute accent is a diacritic mark used in many modern written languages with alphabets based on the Latin alphabet, Cyrillic alphabet and Greek alphabet writing systems....
       (Latin apex
      Apex (diacritic)

      The apex is a mark roughly with the shape of an acute accent which is placed over vowels to indicate that they are long vowel.Although hardly known by most contemporary Latinists, the use of the sign was actually quite widespread during classical and postclassical times....
      )
    • ( ` ) grave accent
      Grave accent

      The grave accent is a diacritical mark used in written Catalan language, French language, Greek language until 1982 , Italian language, Norwegian language, Occitan language, Portuguese language, Scottish Gaelic language, Vietnamese language, Welsh language, Dutch language, and other languages....
    • ( ) circumflex accent
    • ( ) caron
      Caron

      A caron or h?cek , also known as a wedge, inverted circumflex, inverted hat, is a diacritic placed over certain letters to indicate present or historical palatalization, iotation, or postalveolar consonant pronunciation in the orthography of some Baltic languages, Slavic languages, Finno-Lappic languages, and other la...
      , inverted circumflex, (Czech hácek, Slovak mäkcen, Slovene strešica)
    • ( ? ) double acute accent
      Double acute accent

      The double acute accent is a diacritic mark of the Latin script used primarily in Hungarian alphabet. Consequently, it is sometimes referred to as Hungarumlaut or Hungarian umlaut, although it is more similar to a trema or diaresis than an umlaut....
    • (  ? ) double grave accent
      Double grave accent

      The double grave accent is a diacritic used in scholarly discussions of the Serbo-Croatian language complex and sometimes Slovenian language. It is also used in the International Phonetic Alphabet....
  • dots
    • ( ), ( . ) dot
      Dot (diacritic)

      When used as a diacritic mark, the term dot is usually reserved for the Interpunct , or to the glyphs 'combining dot above' and 'combining dot below' which may be combined with some Letter s of the extended Latin alphabets in use in Central European languages and Vietnamese language....
       (Indic anusvara
      Anusvara

      Anusvara is the diacritic used to mark a type of nasalization used in a number of Indic languages. Depending on the location of the anusvara in the word, and on the language within which it is used, its exact pronunciation can vary greatly....
      )
    • ( · ) Interpunct
      Interpunct

      An interpunct is a small dot used for interword separation in ancient Latin alphabet, being perhaps the first consistent visual representation of word boundaries in written language....
    • tittle
      Tittle

      A tittle is a small distinguishing mark, such as a diacritic or the dot on a lowercase i or j. The tittle is an integral part of the glyph of i and j, but dot s can appear over other letters in various languages....
      , the dot used by default in the modern lowercase form of the Latin letters "i" and "j"
    • ( ¨ ) trema, diaeresis
      Diaeresis

      In linguistics, diaeresis, or dieresis, is the pronunciation of two adjacent vowels in two separate syllables rather than as a diphthong, and it is also the name of the diacritic mark used to prompt the reader to pronounce adjacent vowels in this manner....
      , or umlaut
      Umlaut (diacritic)

      The word umlaut is the name of a type of sound shift in spoken language and of the diacritic mark used to represent it Orthography. The diacritic mark comprises a pair of dots or lines placed over the letter that represents the affected Vowel....
       sign
    • ( ) colon
      Colon (punctuation)

      The colon is a punctuation mark, consisting of two equally sized dots centered on the same vertical line....
      , used in the International Phonetic Alphabet
      International Phonetic Alphabet

      The International Phonetic Alphabet "The acronym 'IPA' strictly refers [...] to the 'International Phonetic Association'. But it is now such a common practice to use the acronym also to refer to the alphabet itself that resistance seems pedantic....
       to mark long vowels
      Vowel length

      In linguistics, vowel length is the perceived length of a vowel sound. Often the chroneme, or the "longness", acts like a consonant, and may etymologically be one such as in Australian English....
      .
  • ( ° ) ring
    Ring (diacritic)

    A ring diacritic may appear above or below letters. It may be combined with some Letter of the extended Latin alphabets in various contexts....
     (Czech kroužek)
  • macron or line
    • ( ¯ ) macron
      Macron

      A macron, from Greek language meaning "long", is a diacritic ? placed over or under a vowel which was originally used to mark a Long syllable#Syllable weight in classical poetry in Meter #Greek and Latin, but has now been taken also to indicate that the vowel is long vowel....
       (Hawaiian kahako)
    • ( ) macron below
      Combining macron below

      "Combining macron below" is a Unicode combining diacritical mark used in various orthographies; see the precomposed charactersNot to be confused are "combining minus below" ? , "underline" and "low line" _ )....
  • overlays
    • ( | ) bar
      Bar (diacritic)

      A bar or stroke is a modification consisting of a line drawn through a grapheme. It may be used as a diacritic to derive new letters from old ones, or simply as an addition to make a grapheme more distinct from others....
       through the basic letter
    • ( / ) slash
      Bar (diacritic)

      A bar or stroke is a modification consisting of a line drawn through a grapheme. It may be used as a diacritic to derive new letters from old ones, or simply as an addition to make a grapheme more distinct from others....
       through the basic letter
    • () stroke
      Bar (diacritic)

      A bar or stroke is a modification consisting of a line drawn through a grapheme. It may be used as a diacritic to derive new letters from old ones, or simply as an addition to make a grapheme more distinct from others....
       through the basic letter
  • curves
    • ( ? ) breve
      Breve

      A breve is a diacritical mark ?, shaped like the bottom half of a circle. It looks similar to caron , but the caron has a sharp tip, whilst the breve is rounded....
    • sicilicus
      Sicilicus

      In Old Latin a sicilicus is a diacritical mark like a laterally inverted C placed above a letter and evidently deriving its name from its shape like a little sickle ....
      , a palaeographic diacritic similar to a caron or breve
    • ( ) tilde
      Tilde

      The tilde is a grapheme with several uses. The name of the character comes from Spanish language, from the Latin wikt:titulus meaning a title or superscription, though the term ?tilde? has evolved in that language and now has a different meaning in Linguistics....
    • ( ) titlo
      Titlo

      Titlo is an extended diacritic symbol first used in old Cyrillic manuscripts, e.g., in Old Church Slavonic and Old East Slavic language languages....
  • curls above
    • () apostrophe
    • ( ) hook
      Hook (diacritic)

      In typesetting, the hook is a diacritic mark placed on top of vowels in the Vietnamese alphabet. In shape it looks like a tiny question mark without the dot underneath....
       (Vietnamese d?u h?i)
    • ( ) horn
      Horn (diacritic)

      The horn is a diacritic mark attached to the top right corner of the letters o and u in the Vietnamese alphabet to give o and u, vowel roundedness variants of the vowel represented by the basic letter....
       (Vietnamese d?u móc)
  • curls below
    • ( , ) comma
      Comma (punctuation)

      The comma is a punctuation mark. It has the same shape as an apostrophe or single closing quotation mark in many typefaces, but it differs from them in being placed on the baseline of the text....
    • ( ¸ ) cedilla
      Cedilla

      A cedilla or cedille is a hook added under certain consonant letters as a diacritic to modify their pronunciation....
    • ( ? ) ogonek
      Ogonek

      The ogonek is a diacritic hook placed under the lower right corner of a vowel in the Latin alphabet used in several European and Native American languages....


Some of these marks are sometimes diacritics, but also have other uses: that applies to the tilde, dot, comma, titlo, apostrophe, bar, and colon.

The dot on the letter i of the Latin alphabet originated as a diacritic to clearly distinguish i from the vertical strokes of adjacent letters. It first appeared in the sequence ii (as in ingeníí) in Latin manuscripts of the 11th century, then spread to i adjacent to m, n, u before being used for all lower-case i. The j, which separated from the i later, inherited the "dot". The shape of the diacritic developed from initially resembling today's acute accent to a long flourish by the 15th century. With the advent of Roman type
Roman type

In Typography, "roman" type has two principal meanings, both stemming from the stylistic origin of text typefaces from Roman square capitals used in ancient Rome:...
 it was reduced to the round dot we have today.

Diacritics specific to non-Latin alphabets


Arabic

  • (? ? ? ? and stand alone ?) hamza
    Hamza

    Hamza is a letter in the Arabic alphabet, representing the glottal stop . Hamza is not one of the 28 "full" letters, and owes its existence to historical orthographical inconsistencies in early Islamic times....
    : indicates a glottal stop
    Glottal stop

    The glottal stop, or more fully, the voiceless glottal plosive, is a type of consonantal sound which is used in many Speech communication languages....
    .
  • (????????? ) tanwin symbols. Serve a grammatical role in Arabic
    Arabic language

    Arabic is a Central Semitic language, thus related to and classified alongside other Semitic languages languages such as Hebrew language and Aramaic language....
    . The sign ???? is most commonly written in combination with ?? (Alif), ?e.g. ??? ?.
  • shadda
    Shadda

    Shadda , is one of the diacritics used with the Arabic alphabet, marking a long consonant . It is functionally equivalent to writing a consonant twice in the orthographies of languages like Latin, Italian language, Swedish language, and Ancient Greek, and is thus rendered in Latin script in most schemes of Arabic transliteration, e.g....
    : Gemination (doubling) of consonants.
  • (? ) wa?la: Indicates where an alif is not pronounced.
  • (? ) madda. A replacement for two alifs in a row. Read as a glottal stop followed by a long /a?/.
  • superscript alif (also "short" or "dagger alif". A rare replacement for "long" Alif in some words.
  • ?arakat (In Arabic: ????? also called ????? tashkil).
    • (?? ) fat?a (a)
    • (?? ) kasra (i)
    • (?? ) ?amma (u)
    • (??? ) sukun (no vowel)
  • The ?arakat or vowel points serve two purposes:
    • They serve as a phonetical guide. They indicate the presence of short vowels (fat?a, kasra or ?amma) or their absence (s ukun).
    • At the last letter of a word, the vowel point reflects the inflection
      Inflection

      In grammar, inflection or inflexion is the way language handles grammatical relations and relational categories such as grammatical tense, grammatical mood, grammatical voice, grammatical aspect, grammatical person, grammatical number, grammatical gender, grammatical case....
       case or conjugation mood.
      • For nouns, The '?amma' is for the nominative, 'fat?a' for the accusative, and 'kasra' for the genitive.
      • For verbs, the '?amma' is for the imperfective, 'fat?a' for the perfective, and the 'sukun' is for verbs in the imperative or jussive moods.
  • Vowel points or tashkil should not be confused with consonant points or i?jam - one, two or three dots written above or below a consonant to distinguish between letters of the same or similar form
    Rasm

    Rasm is an Arabic term that signifies "sketch, pattern, mark, design, form". When speaking of the Qur'an, it stands for either the basic 18 letters used in early manuscripts, that is, without i'jam diacritics, or for one of the various ways of depicting the precise vocalization of the Qur'an....
    .


Greek


These diacritics are used in addition to the acute, grave, and circumflex accents and the diaeresis:
  • Iota subscript
    Iota subscript

    Iota subscript in Greek language polytonic orthography is a way of writing the letter iota as a small vertical stroke beneath a vowel. It was used in the so-called "long diphthongs" in Ancient Greek phonology, that is, diphthongs of which the first part is a long vowel: and ....
  • ( ) rough breathing, aspiration, spiritus asper
    Spiritus asper

    The spiritus asper , is a diacritic used in the polytonic orthography. In ancient Greek, it indicates initial aspiration , or the presence of the voiceless glottal fricative at the beginning of a word....
  • ( ) smooth (or soft) breathing, spiritus lenis
    Spiritus lenis

    The spiritus lenis is a diacritic used in the polytonic orthography. In ancient Greek, it indicates the lack of initial aspiration , or the absence of the voiceless glottal fricative from the beginning of a word....


Hebrew

  • Niqqud
    Niqqud

    In Hebrew language orthography, niqqud or nikkud is the system of diacritical signs used to represent vowels or distinguish between alternative pronunciations of consonants of the Hebrew alphabet....
  • ( ? ) Dagesh
    Dagesh

    The dagesh is a diacritic used in the Hebrew alphabet. It was added to the Hebrew language orthography at the same time as the Masoretic system of niqqud ....
  • ( ? ) Mappiq
    Mappiq

    The mappiq is a diacritic used in the Hebrew alphabet. It is part of the Masoretes' system of niqqud , and was added to Hebrew language orthography at the same time....
  • ( ? ) Rafe
    Rafe

    In Hebrew language orthography the rafe, also raphe, , is a diacritic ? : a short horizontal overbar placed above certain letters to indicate that they are to be pronunciation as fricatives....
  • ( ? ) Shin dot (at top right corner)
    Shin (letter)

    Shin is the twenty-first letter in many Semitic abjads, including Phoenician language, Aramaic language, Hebrew language , and Arabic alphabet ....
  • ( ? ) Sin dot (at top left corner)
    Shin (letter)

    Shin is the twenty-first letter in many Semitic abjads, including Phoenician language, Aramaic language, Hebrew language , and Arabic alphabet ....
  • ( ? ) Shva
  • ( ? ) Kubutz
  • ( ? ) Holam
  • ( ? ) Kamatz
  • ( ? ) Patakh
  • ( ? ) Segol
  • ( ? ) Tzeire
  • ( ? ) Hiriq


  • Other
  • ( ? ) Geresh
    Geresh

    Geresh is a sign in Hebrew writing. It has two meanings.1. An apostrophe-like sign placed after a letter. It is used:Usage...

Non-alphabetic scripts

Some non-alphabetic scripts also employ symbols that function essentially as diacritics.
  • Non-pure abjad
    Abjad

    An abjad is a type of writing system in which each symbol stands for a consonant; the reader must supply the appropriate vowel. It is a term suggested by Peter T....
    s (such as Hebrew
    Hebrew language

    Hebrew is a Semitic languages of the Afro-Asiatic languages. Modern Hebrew is spoken by more than seven million people in Israel and Classical Hebrew is used for prayer or study in Jews communities around the world....
     and Arabic
    Arabic language

    Arabic is a Central Semitic language, thus related to and classified alongside other Semitic languages languages such as Hebrew language and Aramaic language....
     script) and abugida
    Abugida

    An 'abugida' is a segment writing system which is based on consonants but in which vowel notation is obligatory. About half the writing systems in the world are abugidas, including the extensive Brahmic family of scripts used in South and Southeast Asia....
    s use diacritics for denoting vowel
    Vowel

    In phonetics, a vowel is a sound in spoken language, such as English ah! or oh! , pronounced with an open vocal tract so that there is no build-up of air pressure at any point above the glottis....
    s. Hebrew and Arabic also indicate consonant doubling and change with diacritics; Hebrew and Devanagari
    Devanagari

    , or 'Nagari', is an abugida alphabet of India and Nepal. It is written from left to right, lacks distinct letter cases, and is recognizable by a distinctive horizontal line running along the tops of the letters that links them together....
     use them for foreign sounds. Devanagari and related abugidas also use a diacritical mark called a virama
    Virama

    Virama, or virama, is a generic term for the diacritic in many Brahmic scripts that is used to suppress the inherent vowel that otherwise occurs with every consonant letter....
     to mark the absence of a vowel. In addition, Devanagari uses the moon-dot chandrabindu
    Chandrabindu

    Chandrabindu is a diacritic sign having the form of a dot inside the lower half of a circle. It is used in the Devanagari , Bengali script , Gujarati script , Oriya script and Telugu script scripts....
     ( ? ).
  • The Japanese hiragana
    Hiragana

    is a Japanese language syllabary, one component of the Japanese writing system, along with katakana, kanji, and the romanization of Japanese. Hiragana and katakana are both kana systems, in which each symbol represents one mora ....
     and katakana
    Katakana

    is a Japanese language syllabary, one component of the Japanese writing system along with hiragana, kanji, and in some cases the Latin alphabet. The word katakana means "fragmentary kana", as the katakana scripts are derived from components of more complex kanji....
     syllabaries
    Syllabary

    A syllabary is a set of written symbols that represent syllables, which make up words. A symbol in a syllabary typically represents an optional consonant sound followed by a vowel sound....
     use the dakuten and handakuten
    Dakuten

    , colloquially ten-ten , is a diacritic sign most often used in the Japanese language kana syllabaries to indicate that the consonant of a syllable should be pronounced voiced consonant....
     (in Japanese: ?? and ???) symbols, also known as nigori or ten-ten and maru, to indicate voiced consonants or other phonetical changes.
  • Emoticons are commonly created with diacritic symbols, especially Japan
    Japan

    Japan is an island country in East Asia. Located in the Pacific Ocean, it lies to the east of the Sea of Japan, People's Republic of China, North Korea, South Korea and Russia, stretching from the Sea of Okhotsk in the north to the East China Sea and Taiwan in the south....
    ese emoticons on popular boards such as 2chan and the many other imageboards suffixed -chan.


Alphabetization or collation

Different languages use different rules to put diacritic characters in alphabet
Alphabet

An alphabet is a standardized set of letter basic written symbols each of which roughly represents a phoneme, a spoken language, either as it exists now or as it was in the past....
ical order. French treats letters with diacritical marks the same as the underlying letter for purposes of ordering and dictionaries.

The Scandinavian languages, by contrast, treat the characters with diacritics ä, ö and å as new and separate letters of the alphabet, and sort them after z. Usually ä is sorted as equal to æ (ash) and ö is sorted as equal to ø (o-slash). Also, aa, when used as an alternative spelling to å, is sorted as such. Other letters modified by diacritics are treated as variants of the underlying letter, with the exception that ü is frequently sorted as y.

Languages that treat accented letters as variants of the underlying letter usually alphabetize words with such symbols immediately after similar unmarked words. For instance, in German where two words differ only by an umlaut, the word without it is sorted first in German dictionaries (e.g. schon and then schön, or fallen and then fällen). However, when names are concerned (e.g. in phone books or in author catalogues in libraries), umlauts are often treated as combinations of the vowel with a suffixed e; Austrian phone books now treat characters with umlauts as separate letters (immediately following the underlying vowel).

In Spanish, the grapheme ñ is considered a new letter different from n and collated between n and o, as it denotes a different sound from that of a plain n. But the accented vowels á, é, í, ó, ú are not separated from the unaccented vowels a, e, i, o, u as the acute accent in Spanish only modifies stress
Stress (linguistics)

In linguistics, stress is the relative emphasis that may be given to certain syllables in a word. The term is also used for similar patterns of phonetic prominence inside syllables....
 within the word, not the sound of a letter.

For a comprehensive list of the collating orders in various languages, see Collating sequence
Collating sequence

The term collating sequence refers to the order in which character strings should be placed when sorting them.A common example is the familiar "alphabetic order", in which "Alfred" occurs before "Zeus" because "A" occurs before "Z" in the English alphabet....
.

Generation with computers

Modern computer technology was developed mostly in the English-speaking countries, so data formats, keyboard layouts, etc., were developed with a bias favouring English, a language with a "simple" alphabet, one without diacritical marks. This has led to fears internationally that the marks and accents may be made obsolete to facilitate the worldwide exchange of data. Efforts have been made to create internationalized domain names that further extend the English alphabet (e.g., "pokémon.com").

Depending on the keyboard layout
Keyboard layout

A keyboard layout is any specific mechanical, visual, or functional arrangement of the keys, legends, or key?meaning associations of a Computer keyboard, typewriter, or other alphanumeric keyboard keyboard....
, which differs amongst countries, it is more or less easy to enter letters with diacritics on computers and typewriters. Some have their own keys; some are created by first pressing the key with the diacritic mark followed by the letter to place it on. Such a key is sometimes referred to as a dead key
Dead key

A dead key is a key on a typewriter or a computer keyboard that allows modification on the following letter. For example, Option key-`, e produces ? on the Macintosh U.S....
, as it produces no output of its own, but modifies the output of the key pressed after it.

In modern Microsoft Windows operating systems, the keyboard layout US International allows one to type Latin letters with the acute, grave, circumflex, diæresis, tilde, and cedilla found in Western European languages (specifically, those combinations found in the ISO Latin-1 character set) directly: "+e gives ë, ~+o gives õ, etc. On Apple Macintosh computers, there are keyboard shortcuts for the most common diacritics; Option-e followed by a vowel places an acute accent, Option-u followed by a vowel gives an umlaut, option-c gives a cedilla, etc. Diacritics can be compose
Compose key

On some computer systems, a compose key is a key which is designated to signal the software to interpret the next keystrokes as a combination in order to produce a character not found on the keyboard....
d in most X Window System
X Window System

The X Window System is a computing software system and network protocol that provides a graphical user interface for networked computers. It implements the X Window System protocols and architecture and provides windowing system on raster graphics Visual display units and manages Keyboard and pointing device control functions....
 keyboard layouts.

On computers, the availability of code page
Code page

Code page is the traditional International Business Machines term used to map a specific set of characters to numerical code point values . This is slightly different in meaning than the related terms character encoding and character set....
s determines whether one can use certain diacritics. Unicode
Unicode

Unicode is a computing industry standard allowing computers to consistently represent and manipulate Character expressed in most of the world's writing systems....
 solves this problem by assigning every known character its own code; if this code is known, most modern computer systems provide a method to input it
Unicode

Unicode is a computing industry standard allowing computers to consistently represent and manipulate Character expressed in most of the world's writing systems....
. With Unicode, it is also possible to combine diacritical marks with most characters.

Languages with letters containing diacritics

The following languages have letters which contain diacritics.

Germanic and Celtic
  • Danish and Norwegian
    Norwegian language

    Norwegian is a North Germanic languages language spoken primarily in Norway, where it is an official language. It is also spoken as a second language among Norwegian-Americans in the United States of America, especially in the central northern states....
     uses additional characters like the o-slash ø and the a-circle å. These letters are collated after z and æ, in the order ø, å.
  • Faroese
    Faroese alphabet

    The Faroese alphabet consists of 29 letters derived from the Latin alphabet:...
     uses acute accents and other special letters. All are considered separate letters and have their own place in the alphabet: á, í, ó, ú, ý, and ø.
  • Icelandic
    Icelandic alphabet

    The modern Icelandic language alphabet consists of the following 33 letters:It is based upon a Latin alphabet with diacritics, in addition it includes the character eth ?? and the Runes Thorn ?? ....
     uses acute accents and other special letters. All are considered separate letters, and have their own place in the alphabet: á, é, í, ó, ú, ý, and ö.
  • Among the Scandinavian languages, Danish and Norwegian have long used o-slash
    Ø

    ? , is a vowel and a Letter used in the Danish and Norwegian alphabet, Faroese language#Alphabet and Danish and Norwegian alphabet languages....
     (ø), but have more recently incorporated a-ring (å) after Swedish example. Historically the å has developed from a ligature by writing a small a on top of the letter a; if an å character is unavailable, some Scandinavian languages allow the substitution of a doubled a. The Scandinavian languages collate these letters after z, but have different collation
    Collation

    Collation is the assembly of written information into a standard order. One common type of collation is called alphabetisation, though collation is not limited to ordering letters of the alphabet....
     standards.
  • Swedish
    Swedish alphabet

    The Swedish language alphabet consists of the following 29 letters:Upper CaseLower CaseIn addition to the commonly occurring letters of the Latin alphabet, A-Z, the Swedish alphabet has the three letter , "?", "?" and "?"....
     uses characters identical to a-diaeresis (ä) and o-diaeresis (ö) in the place of ash and o-slash in addition to the a-circle (å). Historically the diaresis for the Swedish letters ä and ö, like the German umlaut, has developed from a small gothic e written on top of the letters. These letters are collated after z, in the order å, ä, ö.


Romance
  • Galician
    Galician language

    Galician is a language of the Iberian Romance languages branch, spoken in Galicia , an Autonomous communities of Spain located in northwestern Spain, as well as in small bordering zones in the neighbouring autonomous communities of Asturias and Castile and Le?n and in Northern Portugal....
    : as in Spanish, the character ñ
    N

    N is the fourteenth letter in the Latin alphabet. Its name in English language is spelled en ....
     is a letter and collated between n and o
  • Romanian
    Romanian alphabet

    The Romanian alphabet is a modification of the Latin alphabet and consists of 31 letters:The letters Q , W , and Y were officially introduced in the Romanian alphabet in 1982, although they had been used earlier....
     uses a breve on the letter a (a) to indicate the sound schwa
    Schwa

    In linguistics, specifically phonetics and phonology, schwa can mean the following*An stress and tone neutral vowel sound in any language, often but not necessarily a mid-central vowel....
     , as well as a circumflex over the letters a (â) and i (î) for the sound . Romanian also writes a comma
    Comma

    A comma is a type of punctuation mark .Comma may also refer to* Comma , a type of interval in music theory* Comma , a species of butterfly...
     below the letters s () and t () to represent the sounds and , respectively. These characters are collated after their non-diacritic equivalent.
  • Spanish: the character ñ
    N

    N is the fourteenth letter in the Latin alphabet. Its name in English language is spelled en ....
     is considered a letter, and collated between n and o.


Slavic
  • Bosnian and Croatian
    Croatian alphabet

    Gaj's Latin alphabet is a variant of the Croatian language Latin alphabet devised by Croat Ljudevit Gaj, in his 1830 book, Kratka osnova horvatsko-slavenskog pravopisanja ....
     have the symbols c
    C

    C or c is a consonant in Esperanto orthography, representing a voiceless postalveolar affricate , and is equivalent to the voiceless postalveolar affricate, , or the voiceless retroflex affricate, ...
    , c
    C

    C or c is a consonant in Esperanto orthography, representing a voiceless postalveolar affricate , and is equivalent to the voiceless postalveolar affricate, , or the voiceless retroflex affricate, ...
    , d
    D

    D is the fourth letter in the Latin alphabet. Its name in English language is spelled dee , plural dees....
    , š and ž, which are considered separate letters and are listed as such in dictionaries and other contexts in which words are listed according to alphabetical order. Bosnian and Croatian
    Croatian alphabet

    Gaj's Latin alphabet is a variant of the Croatian language Latin alphabet devised by Croat Ljudevit Gaj, in his 1830 book, Kratka osnova horvatsko-slavenskog pravopisanja ....
     also have one digraph
    Digraph

    Digraph may refer to* Digraph , a pair of characters used to write one phoneme or a sequence of phonemes that does not correspond to the normal values of the two characters combined...
     including a diacritic,

    D? is the seventh Letter of the Croatian alphabet and Bosnian alphabet alphabets, and the Latin forms of Serbian language#Alphabets and Macedonian alphabet, after D and before D with stroke....
     which is also alphabetised independently, and follows d
    D

    D is the fourth letter in the Latin alphabet. Its name in English language is spelled dee , plural dees....
     and precedes d
    D

    D is the fourth letter in the Latin alphabet. Its name in English language is spelled dee , plural dees....
     in the alphabetical order. The Serbian
    Serbian language

    name=Serbian|nativename=|pronunciation=['sr?pski?]|familycolor=Indo-European|map=|states=See below under "Official status", besides that in Croatia and as an immigrant's language spread over Central Europe and Western Europe, as well as Northern America...
     Latin alphabet contains the same letters, but the Serbian Cyrillic alphabet has no diacritics.
  • The Czech alphabet
    Czech alphabet

    The Czech alphabet is a version of the Latin alphabet, used when writing Czech language. Its basic principles are "one sound - one letter" and the addition of diacritical marks above letters to represent sounds alien to Latin....
     contains 27 graphemes (letters) when written without diacritics and 42 graphemes when written including them. Czech uses the acute (á é í ó ú ý), the hácek (c d e n r š t ž), and for one letter (u) the ring.
  • Polish
    Polish alphabet

    The Polish alphabet is the writing system of the Polish language. It is based on the Latin alphabet but uses diacritics such as the kreska, which is graphically similar to an acute accent , the dot , the ogonek , and the bar ....
     has the following letters: a c e l n ó s z z. These are considered to be separate letters, each of them is placed in alphabet right after its Latin counterpart (i.e. a between a and b), z and z are placed after z in this order.
  • The Slovak alphabet
    Slovak alphabet

    The Slovak alphabet uses a modification of the Latin alphabet. The modifications include the four diacriticals placed above certain letters....
     uses the acute (á é í ó ú ý l r), caron (c d l n š t ž), umlaut (ä) and circumflex accent (ô).
  • The Slovene alphabet has the symbols c
    C

    C or c is a consonant in Esperanto orthography, representing a voiceless postalveolar affricate , and is equivalent to the voiceless postalveolar affricate, , or the voiceless retroflex affricate, ...
    , š and ž, which are considered separate letters and are listed as such in dictionaries and other contexts in which words are listed according to alphabetical order.


Baltic
  • Latvian
    Latvian alphabet

    AlphabetThe Latvian language alphabet is based on the Latin alphabet and consists of 33 letters. 22 of them are from the Latin alphabet; the remaining 11 are obtained from Latin letters by using diacritic marks....
     has the following letters: a e i u r l k n g š ž c.
  • Lithuanian
    Lithuanian alphabet

    Lithuanian language employs a modified Roman alphabet. It is composed of 32 Letter . The collation order presents one surprise: "Y" is moved to occur between I Ogonek and J....
    . In general usage, where letters appear with the caron
    Caron

    A caron or h?cek , also known as a wedge, inverted circumflex, inverted hat, is a diacritic placed over certain letters to indicate present or historical palatalization, iotation, or postalveolar consonant pronunciation in the orthography of some Baltic languages, Slavic languages, Finno-Lappic languages, and other la...
     (c, š and ž) they are considered as separate letters from c, s or z and collated separately; letters with the ogonek
    Ogonek

    The ogonek is a diacritic hook placed under the lower right corner of a vowel in the Latin alphabet used in several European and Native American languages....
     (a, e, i and u), the macron
    Macron

    A macron, from Greek language meaning "long", is a diacritic ? placed over or under a vowel which was originally used to mark a Long syllable#Syllable weight in classical poetry in Meter #Greek and Latin, but has now been taken also to indicate that the vowel is long vowel....
     (u) and the superdot (e) are considered as separate letters as well, but not given a unique collation order.


Finno-Ugric
  • Estonian
    Estonian alphabet

    The Estonian alphabet is used for writing the Estonian language and is based on the Latin alphabet, with German alphabet influence. As such, the Estonian alphabet has the letters ? , ? and ? , which stand for the vowels , and , respectively....
     has a distinct letter õ
    O

    O is the fifteenth letter of the modern Latin alphabet. Its name in English language is spelled o , plural oes ....
     which contains a tilde. Estonian "dotted vowels" ä, ö, ü are similar to German, but these are also distinct letters, not like German umlauted letters. All four have their own place in the alphabet, between w and x. Carons in š or ž appear only in foreign proper names and loanword
    Loanword

    A loanword is a word directly taken into one language from another with little or no translation. By contrast, a calque or loan translation is a related concept whereby it is the Meaning or idiom that is borrowed rather than the lexical item itself....
    s. Also these are distinct letters, placed in the alphabet between s and t.
  • Finnish
    Finnish alphabet

    The Finnish language alphabet is based on the Latin alphabet, and especially :A, B, C, D, E, F, G, H, I, J, K, L, M, N, O, P, Q, R, S, T, U, V, X, Y, Z, ?, ?, ?...
     uses dotted vowels (ä and ö). As in Swedish and Estonian, these are regarded as individual letters, rather than vowel + umlaut combinations (as happens in German). It also uses the characters å, š and ž in foreign names and loanwords. In the Finnish alphabet, å, ä and ö collate as separate letters after z, the others as variants of their base letter.
  • Hungarian
    Hungarian alphabet

    The Hungarian alphabet is an extension of the Latin alphabet.One sometimes speaks of the smaller and greater Hungarian alphabet, depending on whether the letters Q, W, X, Y which can only be found in foreign words and traditional orthography of names are listed, or not....
     uses the umlaut, the acute and double acute accent (unique to Hungarian): ö ü, á é í ó ú and o u. The acute accent indicates the long form of a vowel (in case of i/í, o/ó, u/ú) while the double acute performs the same function for ö and ü. The acute accent can also indicate a different sound (more open, like in case of a/á, e/é). Both long and short forms of the vowels are listed separately in the Hungarian alphabet
    Hungarian alphabet

    The Hungarian alphabet is an extension of the Latin alphabet.One sometimes speaks of the smaller and greater Hungarian alphabet, depending on whether the letters Q, W, X, Y which can only be found in foreign words and traditional orthography of names are listed, or not....
     but members of the pairs a/á, e/é, i/í, o/ó, ö/o, u/ú and ü/u are collated in dictionaries as the same letter.
  • Livonian
    Livonian language

    Livonian belongs to the Baltic-Finnic languages branch of the Uralic languages. It is a moribund language now spoken by some 35 people, of whom only 10 are fluent....
     has the following letters: a, ä, , , e, i, l, n, o, , , õ, , r, š, , u, ž.


Turkic
  • Azerbaijani
    Azerbaijani alphabet

    In Republic of Azerbaijan, the Azerbaijani alphabet may refer to either of two alphabets used to write the Azerbaijani language: one Cyrillic-based alphabet and one Latin-based alphabet....
     includes the distinct Turkish alphabet
    Turkish alphabet

    The Turkish alphabet is a Latin-based alphabet used for writing the Turkish language, consisting of 29 letters, a certain number of which have been adapted or modified for the phonetic requirements of the language....
     letters Ç
    C

    C or c is a consonant in Esperanto orthography, representing a voiceless postalveolar affricate , and is equivalent to the voiceless postalveolar affricate, , or the voiceless retroflex affricate, ...
    , G
    G

    G is the seventh letter in the Latin alphabet. Its name in English language is spelled gee....
    , I, I
    Turkish dotted and dotless I

    The Turkish alphabet, which is a variant of the Latin alphabet, includes two distinct versions of I, one dotted and the other dotless....
    , Ö
    Ö

    "?", or "?", is a character used in several extended Latin alphabets, or the letter O with umlaut ....
    , S
    S

    S is the nineteenth letter in the modern Latin alphabet. Its name in English language is spelled ess or generally es- when part of a compound word, plural esses....
     and Ü
    Y

    The letter Y is the twenty-fifth letter in the modern Latin alphabet. Its name in English language is spelled wye or occasionally wy' , plural wyes....
    .
  • Crimean Tatar
    Crimean Tatar language

    The Crimean Tatar language , also known as Crimean and Crimean Turkish is the language of the Crimean Tatars. It is spoken in Crimea, Central Asia , and the Crimean Tatar diasporas in Turkey, Romania, Bulgaria....
     includes the distinct Turkish alphabet
    Turkish alphabet

    The Turkish alphabet is a Latin-based alphabet used for writing the Turkish language, consisting of 29 letters, a certain number of which have been adapted or modified for the phonetic requirements of the language....
     letters Ç
    C

    C or c is a consonant in Esperanto orthography, representing a voiceless postalveolar affricate , and is equivalent to the voiceless postalveolar affricate, , or the voiceless retroflex affricate, ...
    , G
    G

    G is the seventh letter in the Latin alphabet. Its name in English language is spelled gee....
    , I, I
    Turkish dotted and dotless I

    The Turkish alphabet, which is a variant of the Latin alphabet, includes two distinct versions of I, one dotted and the other dotless....
    , Ö
    Ö

    "?", or "?", is a character used in several extended Latin alphabets, or the letter O with umlaut ....
    , S
    S

    S is the nineteenth letter in the modern Latin alphabet. Its name in English language is spelled ess or generally es- when part of a compound word, plural esses....
     and Ü
    Y

    The letter Y
    is the twenty-fifth letter in the modern Latin alphabet. Its name in English language is spelled wye or occasionally wy' , plural wyes....
    . Unlike Standard Turkish (but like Cypriot Turkish
    Cypriot Turkish

    Cypriot Turkish is a dialect of Turkish language spoken by Turkish Cypriots....
    ), Crimean Tatar also has the letter Ñ
    N

    N is the fourteenth letter in the Latin alphabet. Its name in English language is spelled en ....
    .
  • Gagauz
    Gagauz alphabet

    The modern Gagauz alphabet, used for the Gagauz language, is a 32-letter Latin-based alphabet modelled on the Turkish alphabet.Previously, Gagauz used the Greek alphabet and until 1996 the Cyrillic alphabet....
     includes the distinct Turkish alphabet
    Turkish alphabet

    The Turkish alphabet is a Latin-based alphabet used for writing the Turkish language, consisting of 29 letters, a certain number of which have been adapted or modified for the phonetic requirements of the language....
     letters Ç
    C

    C or c is a consonant in Esperanto orthography, representing a voiceless postalveolar affricate , and is equivalent to the voiceless postalveolar affricate, , or the voiceless retroflex affricate, ...
    , G
    G

    G is the seventh letter in the Latin alphabet. Its name in English language is spelled gee....
    , I, I
    Turkish dotted and dotless I

    The Turkish alphabet, which is a variant of the Latin alphabet, includes two distinct versions of I, one dotted and the other dotless....
    , Ö
    Ö

    "?", or "?", is a character used in several extended Latin alphabets, or the letter O with umlaut ....
    , S
    S

    S is the nineteenth letter in the modern Latin alphabet. Its name in English language is spelled ess or generally es- when part of a compound word, plural esses....
     and Ü
    Y

    The letter Y
    is the twenty-fifth letter in the modern Latin alphabet. Its name in English language is spelled wye or occasionally wy' , plural wyes....
    . Unlike Turkish, Gagauz also has the letters Ä
    Æ

    ? is a grapheme formed from the letters a and e. Originally a ligature representing a Latin diphthong, it has been promoted to the full status of a letter in the alphabets of many languages....
    , Ê
    E

    E is the fifth letter in the Latin alphabet. Its name in English language is spelled e , plural ees . The letter E is the most commonly used letter in the Czech language, Danish language, Dutch language, English language, French language, German language, Hungarian language, Latin language, Norwegian language, Spanish language...
     and T
    T

    T is the twentieth letter in the modern Latin alphabet. Its name in English language is spelled tee . It is the most commonly used consonant and the second most common letter in the English language....
    . T
    T

    T is the twentieth letter in the modern Latin alphabet. Its name in English language is spelled tee . It is the most commonly used consonant and the second most common letter in the English language....
     is derived from the Romanian alphabet
    Romanian alphabet

    The Romanian alphabet is a modification of the Latin alphabet and consists of 31 letters:The letters Q , W , and Y were officially introduced in the Romanian alphabet in 1982, although they had been used earlier....
     for the same sound.
  • Turkish
    Turkish alphabet

    The Turkish alphabet is a Latin-based alphabet used for writing the Turkish language, consisting of 29 letters, a certain number of which have been adapted or modified for the phonetic requirements of the language....
     uses a G with a breve (G
    G

    G is the seventh letter in the Latin alphabet. Its name in English language is spelled gee....
    ), two letters with a diaeresis (Ö
    Ö

    "?", or "?", is a character used in several extended Latin alphabets, or the letter O with umlaut ....
     and Ü
    Y

    The letter Y
    is the twenty-fifth letter in the modern Latin alphabet. Its name in English language is spelled wye or occasionally wy' , plural wyes....
    , representing two rounded front vowels), two letters with a cedilla (Ç
    C

    C or c is a consonant in Esperanto orthography, representing a voiceless postalveolar affricate , and is equivalent to the voiceless postalveolar affricate, , or the voiceless retroflex affricate, ...
     and S, representing the affricate and the fricative ), and also possesses a dotted capital I (and a dotless lowercase i representing a high unrounded back vowel). In Turkish each of these are separate letters, rather than versions of other letters, where dotted
    Turkish dotted and dotless I

    The Turkish alphabet, which is a variant of the Latin alphabet, includes two distinct versions of I, one dotted and the other dotless....
     capital I and lower case i are the same letter, as are dotless capital I and lowercase i. Typographically
    Typeface

    In typography, a typeface is a set of one or more fonts, in one or more sizes, designed with stylistic unity, each comprising a coordinated set of glyphs....
    , Ç and S are often rendered with a subdot, as in ; when a hook is used, it tends to have more a comma shape than the usual cedilla. The new Azerbaijani
    Azerbaijani alphabet

    In Republic of Azerbaijan, the Azerbaijani alphabet may refer to either of two alphabets used to write the Azerbaijani language: one Cyrillic-based alphabet and one Latin-based alphabet....
    , Crimean Tatar
    Crimean Tatar language

    The Crimean Tatar language , also known as Crimean and Crimean Turkish is the language of the Crimean Tatars. It is spoken in Crimea, Central Asia , and the Crimean Tatar diasporas in Turkey, Romania, Bulgaria....
    , and Gagauz
    Gagauz alphabet

    The modern Gagauz alphabet, used for the Gagauz language, is a 32-letter Latin-based alphabet modelled on the Turkish alphabet.Previously, Gagauz used the Greek alphabet and until 1996 the Cyrillic alphabet....
     alphabets are based on the Turkish alphabet and its same diacriticized letters, with some additions.


Other
  • Albanian
    Albanian alphabet

    The modern Albanian language alphabet is based on the Latin alphabet, and consists of 36 letters:Note: The vowels are shown in bold....
     has two special letters Ç
    C

    C or c is a consonant in Esperanto orthography, representing a voiceless postalveolar affricate , and is equivalent to the voiceless postalveolar affricate, , or the voiceless retroflex affricate, ...
     and ?
    ?

    or is a letter derived from the Latin alphabet. Both glyphs of the majuscule and Lower case forms of this letter are based on the rotated form of a minuscule e; a similar letter with identical minuscule is used in the Pan-Nigerian Alphabet, but has the capital form majuscule , based on a horizontally flipped majuscule E....
     upper and lowercase. They are placed next to the most similar letters in the alphabet, c and e correspondingly.
  • Esperanto has the symbols u
    U

    U is the twenty-first letter in the modern Latin alphabet. Its name in English language is spelled u ....
    , c
    C

    C or c is a consonant in Esperanto orthography, representing a voiceless postalveolar affricate , and is equivalent to the voiceless postalveolar affricate, , or the voiceless retroflex affricate, ...
    , g
    G

    G is the seventh letter in the Latin alphabet. Its name in English language is spelled gee....
    , h
    H

    H is the eighth letter in the Latin alphabet. Its name in both British English and American English is aitch , though it is also pronounced haitch in some dialects ....
    , j
    J

    J or j is a consonant in Esperanto orthography, representing a voiced postalveolar fricative , and is equivalent to the voiced postalveolar fricative, , or the voiced retroflex fricative, ....
     and s
    S

    S is the nineteenth letter in the modern Latin alphabet. Its name in English language is spelled ess or generally es- when part of a compound word, plural esses....
    , which are included in the alphabet, and considered separate letters.
  • Hawaiian uses the kahakô (macron
    Macron

    A macron, from Greek language meaning "long", is a diacritic ? placed over or under a vowel which was originally used to mark a Long syllable#Syllable weight in classical poetry in Meter #Greek and Latin, but has now been taken also to indicate that the vowel is long vowel....
    ) over vowels, although there is some disagreement over considering them as individual letters. The kahakô over a vowel can completely change the meaning of a word that is spelled the same but without the kahakô.
  • Kurdish
    Kurdish alphabet

    The Kurdish alphabet is a writing system for the Kurdish language. Three systems currently exist. The form used in Turkey was derived from the Latin alphabet by Jaladat Ali Badirkhan in 1932, and thus is also called the Bedirxan script or more properly Hawar....
     uses the symbols Ç
    C

    C or c is a consonant in Esperanto orthography, representing a voiceless postalveolar affricate , and is equivalent to the voiceless postalveolar affricate, , or the voiceless retroflex affricate, ...
    , Ê
    E

    E is the fifth letter in the Latin alphabet. Its name in English language is spelled e , plural ees . The letter E is the most commonly used letter in the Czech language, Danish language, Dutch language, English language, French language, German language, Hungarian language, Latin language, Norwegian language, Spanish language...
    , Î
    I

    I is the ninth Letter of the Latin alphabet. Its English language name is i ....
    , S
    S

    S is the nineteenth letter in the modern Latin alphabet. Its name in English language is spelled ess or generally es- when part of a compound word, plural esses....
     and Û
    U

    U is the twenty-first letter in the modern Latin alphabet. Its name in English language is spelled u ....
     with other 26 standard Latin alphabet symbols.
  • Maltese
    Maltese alphabet

    The Maltese alphabet is based on the Latin alphabet with the addition of some letters with diacritic marks and digraphs. It is used to write the Maltese language....
     uses a C, G, and Z with a dot over them (C, G, Z), and also has an H with an extra horizontal bar. For uppercase H, the extra bar is written slightly above the usual bar. For lowercase H, the extra bar is written crossing the vertical, like a t, and not touching the lower part (H, h). The above characters are considered separate letters. The letter 'c' without a dot has fallen out of use due to redundancy. 'C' is pronounced like the English 'ch' and 'k' is used as a hard c as in 'cat'. The digraph 'gh' (called ghajn after the Arabic
    Arabic alphabet

    The Arabic alphabet is the writing system used for writing several languages of Asia and Africa, such as Arabic language, Persian language, and Urdu language....
     letter name ?ayn for ?) is considered separate, and sometimes ordered after 'g', whilst in other volumes it is placed between 'n' and 'o' (the Latin letter 'o' originally evolved from the shape of Phoenician
    Phoenician alphabet

    The Phoenician alphabet is a continuation of the Proto-Canaanite alphabet, by convention taken to originate around 1050 BC. It was used for the writing of Phoenician language, a Northern Semitic languages language, used by the civilization of Phoenicia....
     ?ayin which was traditionally collated after Phoenician nun).
  • Vietnamese
    Vietnamese alphabet

    The Vietnamese alphabet has the following 29 letters, in collation order:Vietnamese also uses the ten Digraph s and one Trigraph below.These groups were formerly considered single letters and are treated as such in older dictionaries....
     uses the horn diacritic
    Horn (diacritic)

    The horn is a diacritic mark attached to the top right corner of the letters o and u in the Vietnamese alphabet to give o and u, vowel roundedness variants of the vowel represented by the basic letter....
     for the letters o and u; the circumflex
    Circumflex

    The circumflex is a diacritic mark used in written Serbian language, Croatian language, Esperanto, French language, West Frisian language, Norwegian language, Romanian language, Slovak language, Vietnamese language, Romaji, Romanization of Persian, Welsh language, Portuguese language, Italian language, Afrikaans language, Turkish language...
     for the letters â, ê, and ô; the breve
    Breve

    A breve is a diacritical mark ?, shaped like the bottom half of a circle. It looks similar to caron , but the caron has a sharp tip, whilst the breve is rounded....
     for the letter a; and a bar through the letter d.


Cyrillic alphabet
Cyrillic alphabet

The Cyrillic alphabet is a family of alphabets, subsets of which are used by five Slavic languages national languages as well as non-Slavic . It is also used by many other languages of Eastern Europe, the Caucasus, Siberia and other languages in the past....
s
  • Belarusian
    Belarusian alphabet

    The Belarusian alphabet is based on the Cyrillic alphabet and is derived from the alphabet of the Old Church Slavonic language. The alphabet has existed in its modern form since 1918 and consists of thirty-two letters....
     has a letter ?.
  • Belarusian, Bulgarian
    Bulgarian language

    Bulgarian is an Indo-European languages, a member of the Slavic languages linguistic group.Bulgarian demonstrates several linguistic innovations that set it apart from all other Slavic languages except Macedonian language, such as the elimination of grammatical case, the development of a suffixed definite article , the lack of a verb infin...
    , Russian and Ukrainian have the letter ?
    Short I

    Short I is a letter in the Cyrillic alphabet. It is made of the Cyrillic I , with a breve.It is the eleventh letter in the Russian alphabet, and in Russian language is called ? ??????? ....
    .
  • Belarusian and Russian
    Russian alphabet

    The modern Russian alphabet is a variant of the Cyrillic alphabet. It was introduced into Kievan Rus' at the time of Vladimir I of Kiev's conversion to Christianity date....
     have the letter ?
    Yo (Cyrillic)

    Yo ,a letter of the Cyrillic alphabet. In Belarusian language it is the seventh letter of the alphabet and in Rusyn language the ninth. Its status in Russian language, the language in which it was first used, is ambiguous: although it indicates a distinct sound from ?, it is treated as the same letter for purposes of alphabetisation and sort...
    . In Russian, this letter is usually replaced in print by ?
    E (Cyrillic)

    E is a letter found amongst Slavonic languages only in Russian language and Belarusian language , representing the sounds [e] and . In other Slavonic languages using Cyrillic alphabet, these sounds are represented by ?, which in Russian and Belarusian represents [je] in initial and post-vocalic position or else [e] after a palatalized cons...
    , although it has a different pronunciation. The use of ? instead of ? in print does not affect the pronunciation. ? is still used in children's books and in handwriting, is always used in dictionaries. A minimal pair
    Minimal pair

    In phonology, minimal pairs are pairs of words or phrases in a particular language, which differ in only one phonological element, such as a Phone , phoneme, toneme or chroneme and have a distinct meaning....
     is ??? (vse, "all" pl.) and ??? (vsyo, "everything" n. sg.). In Belarusian the replacement by ? is a mistake, in Russian, it is possible to use either ? and ? in print in place of ? but the former is more common.
  • Ukrainian
    Ukrainian alphabet

    The Ukrainian alphabet is the alphabet used to write Ukrainian language, the official language of Ukraine. It is one of the national variations of the Cyrillic alphabet writing system....
     has the letter ï
    Yi (Cyrillic)

    Yi is a letter of the Cyrillic alphabet, used in the Ukrainian language and Rusyn language languages. It represents the iotated vowel sound . In Rusyn language, it can also represent a palatalized sound....
    .
  • Macedonian
    Macedonian language

    Macedonian is the official language of the Republic of Macedonia and is a part of the Eastern group of South Slavic languages. Macedonian is closely related to and shares a high degree of mutual intelligibility with the Bulgarian language, Serbian language, Bosnian language, and Croatian language languages....
     has the letters ?
    Kje

    Heading = Cyrillic letter Kje|Image=...
     and ?
    Gje

    eading=Cyrillic letter Gje|Image=...
    .
  • The acute accent " ´" above any vowel in Cyrillic alphabets is used in dictionaries, books for children and foreign learners to indicate the word stress, it also can be used for disambiguation of similarly spelled words with different lexical stresses.

Diacritics that do not produce new letters


English
English alphabet

The modern English alphabet is a Latin-based alphabet consisting of 26 letters, like in the Basic modern Latin alphabet:The exact shape of printed letters varies depending on the typeface....
 is one of the few European languages that does not have many words which contain diacritical marks (with the exception of the apostrophe in word contractions). Exceptions are unassimilated foreign loanwords, including borrowings from French
French language

French is a Romance language spoken around the world by around 80 million people as first language, by 190 million as second language, and by about another 200 million people as an acquired tongue, with significant speakers in 54 countries....
 and increasingly Spanish
Spanish language

Spanish or Castilian is a Romance languages that originated in northern Spain, and gradually spread in the Kingdom of Castile and evolved into the principal language of government and trade....
; however, the diacritic is also sometimes omitted from such words. Loanwords that frequently appear with the diacritic in English include café, résumé (a usage that helps distinguish it from the verb resume, though the former is often written resumé), and naïveté (see List of English words with diacritics
List of English words with diacritics

Some English language words have letters with diacritical marks. Most of the words are loanwords from French language, with others coming from Spanish language, German language, or other languages....
). In older practice (and even among some orthographically conservative modern writers) one may see examples such as élite and rôle.

English once used the diaeresis more often than not in words such as coöperation (from Fr. coopération) and zoölogy (from Lat. zoologia), but this practice has become far less common (The New Yorker
The New Yorker

The New Yorker is an United States magazine that publishes reportage, commentary, criticism, essays, fiction, satire, cartoons, and poetry. Starting as a weekly in the mid-1920s, the magazine is now published 47 times per year, with five of these issues covering two-week spans....
's house style is one of the few major publications to retain this feature, and various individual writers still use it). It is not competely extinct, but not common. English has also added a diacritic to a common spelling of maté
Mate

Mate and similar may refer to:* One of a pair of animals involved in mating* Mate , a colloquialism used to refer to a friend* Mahte, a goddess in Latvian mythology, also spelled Mate...
 (from Sp. and Port. mate).

A few English words can only be distinguished from others by a diacritic or modified letter, including animé
Animé (oleo-resin)

Anim? or Zanzibar copal is an Resin which is discharged from the Jatob? tree and other species of Hymenaea growing in tropical South America....
, exposé
Expose

Expose, expos?, or exposed may refer to:* Investigative journalism, a form of investigative journalism* Expos? , a window management tool for Mac OS X...
, lamé
LAME

LAME is a free software software application used to encoder audio into the MP3 file format. The name LAME is a recursive acronym for LAME Ain't an MP3 Encoder, reflecting LAME's early history when it was not actually an encoder, but merely a set of patches against the freely available ISO demonstration source code....
, maté
Mate

Mate and similar may refer to:* One of a pair of animals involved in mating* Mate , a colloquialism used to refer to a friend* Mahte, a goddess in Latvian mythology, also spelled Mate...
, öre
Öre

?re is the one-hundredth subdivision of the Swedish krona currency unit. The plural and singular are the same in the indefinite forms, whereas the singular definite form is ?ret and the plural form is ?ren....
, øre
Øre

?re is the one-hundredth subdivision of the Norwegian krone, Danish krone, Swedish krona and Icelandic kr?na currency units. ?re is the Norwegian language and Danish language spelling, where as in Swedish language it is spelt ?re, in the Faroese language oyra and in Icelandic language eyrir ....
, pâté
Pâté

P?t? is a mixture of minced meat and fat in the form of spreadable paste, generally made from a finely ground or chunky mixture of meats and liver, and often with additional fat, vegetables, herbs, spices or wine....
, piqué
Pique

Piqu?, or marcella, refers to a weaving style, normally used with cotton yarn, which is characterized by raised parallel cords or fine ribbing....
, rosé
Rose

A rose is a perennial plant flower shrub or vine of the genus Rosa, within the family Rosaceae, that contains over 100 species and comes in a variety of colors....
.
The same is true of résumé
Résumé

A r?sum? is a document that contains a summary or listing of relevant job experience and education. The r?sum? or CV is typically the first item that a potential employer encounters regarding the job seeker and is typically used to screen applicants, often followed by an interview, when seeking employment....
,
alternately resumé, but nevertheless it is sometimes spelled resume in the US, and saké
Sake

Sake is a Japanese alcoholic beverage made from rice.This beverage is called sake in English, but in Japanese language, sake or Honorific speech in Japanese refers to alcoholic drinks in general....
, which is more commonly spelled sake.

The acute and grave accents are occasionally used in poetry and lyrics: the acute to indicate stress overtly where it might be ambiguous (rébel vs. rebél) or nonstandard for metrical reasons (caléndar), the grave to indicate that an ordinarily silent or elided syllable is pronounced (warnèd, parlìament). In certain personal names such as Renée and Zoë, the diacritical marks are included more often than omitted.

Other languages

The following languages have letter-diacritic combinations that are not considered independent letters.
  • Afrikaans
    Afrikaans

    Afrikaans is an Indo-European language, derived from Dutch language and thus classified as Low Franconian languages West Germanic languages. It is mainly spoken in South Africa and Namibia, with smaller numbers of speakers living in Botswana, Angola, Swaziland, Zimbabwe, Lesotho, Zambia, Australia, New Zealand, United States of America, Taiwa...
     uses diaeresis to mark vowels that are pronounced separately and not as one would expect where they occur together, for example voel (to feel) as opposed to voël (bird). The circumflex is used in ê, î, ô and û generally to indicate long close-mid
    Close-mid vowel

    A close-mid vowel is a type of vowel sound used in some spoken languages. The defining characteristic of a close-mid vowel is that the tongue is positioned two-thirds of the way from a close vowel to a mid vowel....
    , as opposed to open-mid
    Open-mid vowel

    The open-mid vowels make a class of vowel sounds used in some spoken languages. The defining characteristic of an open-mid vowel is that the tongue is positioned two-thirds of the way from an open vowel to a mid vowel....
     vowels, for example in the words wêreld (world) and môre (morning, tomorrow). The acute accent is used to add emphasis in the same way as underlining or writing in bold or italics in English, for example Dit is jóú boek (It is your book). The grave accent is used to distinguish between words that are different only in placement of the stress, for example appel (apple) and appèl (appeal) and in a few cases where it makes no difference to the pronunciation but distinguishes between homophones. The two most usual cases of the latter are the in the sayings òf... òf (either... or) and nòg... nòg (neither... nor) to distinguish them from of (or) and nog (again, still).
  • Aymara
    Aymara language

    Aymara is an Aymaran languages language spoken by the Aymara ethnic group of the Andes. It is one of only a handful of Indigenous languages of the Americas with over a million speakers....
     uses a diacritical horn over p, q, t, k, ch.
  • Catalan has the following composite characters: à, ç, é, è, í, ï, ó, ò, ú, ü, l·l. The acute and the grave accent indicate stress
    Stress (linguistics)

    In linguistics, stress is the relative emphasis that may be given to certain syllables in a word. The term is also used for similar patterns of phonetic prominence inside syllables....
     and vowel height, the cedilla marks the result of a historical palatalization
    Palatalization

    Palatalization or palatalisation generally refers to two phenomena:*As a process or the result of a process, the effect that front vowels and the palatal approximant frequently have on consonants;...
    , the diaeresis mark indicates either a hiatus
    Hiatus (linguistics)

    Hiatus in linguistics is the separate pronunciation of two adjacent vowels, sometimes with an intervening glottal stop. In poetic metre , hiatus can also refer to the failure of two vowels straddling a word boundary to coalesce, for example by elision of the first vowel....
    , or that the letter u is pronounced when the graphemes gü, qü are followed by e or i, the interpunct
    Interpunct

    An interpunct is a small dot used for interword separation in ancient Latin alphabet, being perhaps the first consistent visual representation of word boundaries in written language....
     (·) distinguishes the different values of ll/l·l.
  • Czech
    Czech alphabet

    The Czech alphabet is a version of the Latin alphabet, used when writing Czech language. Its basic principles are "one sound - one letter" and the addition of diacritical marks above letters to represent sounds alien to Latin....
     has the following composite characters: á, d, é, e, í, n, ó, t, ú, u, ý.
  • Dutch
    Dutch alphabet

    The Dutch alphabet has 26 letters, five or six of which are vowels. The alphabet used for the Dutch language is the Alphabets derived from the Latin....
     uses the diaeresis. For example in ruïne it means that the u and the i are separately pronounced in their usual way, and not in the way that the combination ui is normally pronounced. Thus it works as a separation sign and not as an indication for an alternative version of the i. Diacritics can be used for emphasis (érg koud for very cold) or for disambiguation between a number of words that are spelled the same when context doesn't indicate the correct meaning (één appel = one apple, een appel = an apple; vóórkomen = to occur, voorkómen = to prevent). Grave and acute accents are used on a very small number of words, mostly loanwords. The ç also appears in some loanwords.
  • Faroese
    Faroese alphabet

    The Faroese alphabet consists of 29 letters derived from the Latin alphabet:...
    . Non-Faroese accented letters are not added to the Faroese alphabet. These include é, ö, ü, å and recently also letters like š, l, and c.
  • Finnish
    Finnish alphabet

    The Finnish language alphabet is based on the Latin alphabet, and especially :A, B, C, D, E, F, G, H, I, J, K, L, M, N, O, P, Q, R, S, T, U, V, X, Y, Z, ?, ?, ?...
    . Carons in š and ž appear only in foreign proper names and loanwords, but may be substituted with sh or zh if and only if it is technically impossible to produce accented letters in the medium. Contrary to Estonian, š and ž are not considered distinct letters in Finnish.
  • French
    French alphabet

    The French alphabet is based on the Latin alphabet. It uses the standard 26 letters. The words in the column "Letter name in French" are sometimes used when discussing the letters ....
     uses the grave accent (accent grave), the acute accent (accent aigu), the circumflex (accent circonflexe), the cedilla (cédille), and the diaeresis.
  • Galician
    Galician language

    Galician is a language of the Iberian Romance languages branch, spoken in Galicia , an Autonomous communities of Spain located in northwestern Spain, as well as in small bordering zones in the neighbouring autonomous communities of Asturias and Castile and Le?n and in Northern Portugal....
     vowels can bear a grave accent (á, é, í, ó, ú) to indicate stress or difference between two otherwise same written words (é, '(he/she) is' vs. e, 'and'), but trema
    Umlaut (diacritic)

    The word umlaut is the name of a type of sound shift in spoken language and of the diacritic mark used to represent it Orthography. The diacritic mark comprises a pair of dots or lines placed over the letter that represents the affected Vowel....
     is only used with ï and ü to show diaeresis
    Diaeresis

    In linguistics, diaeresis, or dieresis, is the pronunciation of two adjacent vowels in two separate syllables rather than as a diphthong, and it is also the name of the diacritic mark used to prompt the reader to pronounce adjacent vowels in this manner....
     in pronunciation. Only in foreign words Galician may use of another diacritics as ç (widely used in the Middle Age) ê or à.
  • German
    German alphabet

    The German language alphabet consists of the same 26 letters as the basic modern Latin alphabet:...
     uses the three so-called umlauts ä, ö and ü. These diacritics indicate vowel changes. For instance the word Ofen /'o:f?n/ (English: oven) has the plural Öfen /'ø:f?n/ (ovens). The sign originated in a superscript e; a handwritten Sütterlin
    Sütterlin

    S?tterlinschrift , or S?tterlin for short, is the last widely used form of the old Kurrent . In Germany, the old German cursive script developed in the 16th century replacing the Gothic handwriting at the same time that bookletters developed into the Fraktur ....
     e resembles two parallel vertical lines, like an umlaut.
  • Hebrew
    Hebrew alphabet

    The Hebrew alphabet consists of 22 letters used for writing the Hebrew language. Five of these letters have a different form when appearing as the last letter in a word....
     has many various diacritic marks known as niqqud
    Niqqud

    In Hebrew language orthography, niqqud or nikkud is the system of diacritical signs used to represent vowels or distinguish between alternative pronunciations of consonants of the Hebrew alphabet....
     that are used above and below script to represent vowels. These must be distinguished from cantillation
    Cantillation

    Cantillation is the ritual chanting of readings from the Bible in synagogue Jewish services.The chants are rendered in accordance with the special signs or marks printed in the Masoretic text of the Hebrew Bible to complement the letters and vowel points....
    , which are keys to pronunciation and syntax.
  • The International Phonetic Alphabet
    International Phonetic Alphabet

    The International Phonetic Alphabet "The acronym 'IPA' strictly refers [...] to the 'International Phonetic Association'. But it is now such a common practice to use the acronym also to refer to the alphabet itself that resistance seems pedantic....
     uses diacritic symbols and diacritic letters to indicate phonetic features or secondary articulations.
  • Irish uses acute accent to indicate that the vowel is long
    Vowel length

    In linguistics, vowel length is the perceived length of a vowel sound. Often the chroneme, or the "longness", acts like a consonant, and may etymologically be one such as in Australian English....
    . It is known as síneadh fada (long sign) or simply fada (long) in Irish.
  • Italian
    Italian alphabet

    The Italian alphabet is a variant of the Latin alphabet used by the Italian language. The standard contemporary Italian alphabet has 21 letters, shown in the table below....
     mainly has the acute
    Acute accent

    The acute accent is a diacritic mark used in many modern written languages with alphabets based on the Latin alphabet, Cyrillic alphabet and Greek alphabet writing systems....
     and the grave
    Grave accent

    The grave accent is a diacritical mark used in written Catalan language, French language, Greek language until 1982 , Italian language, Norwegian language, Occitan language, Portuguese language, Scottish Gaelic language, Vietnamese language, Welsh language, Dutch language, and other languages....
     (à, è/é, ì, ò/ó, ù), typically to indicate a stressed syllable which would not be stressed under the normal rules of pronunciation but sometimes also to distinguish between words which are otherwise spelled the same way (e.g. "e", and; "è", is).
  • Lithuanian
    Lithuanian alphabet

    Lithuanian language employs a modified Roman alphabet. It is composed of 32 Letter . The collation order presents one surprise: "Y" is moved to occur between I Ogonek and J....
     uses the acute
    Acute accent

    The acute accent is a diacritic mark used in many modern written languages with alphabets based on the Latin alphabet, Cyrillic alphabet and Greek alphabet writing systems....
    , grave
    Grave accent

    The grave accent is a diacritical mark used in written Catalan language, French language, Greek language until 1982 , Italian language, Norwegian language, Occitan language, Portuguese language, Scottish Gaelic language, Vietnamese language, Welsh language, Dutch language, and other languages....
     and tilde
    Tilde

    The tilde is a grapheme with several uses. The name of the character comes from Spanish language, from the Latin wikt:titulus meaning a title or superscription, though the term ?tilde? has evolved in that language and now has a different meaning in Linguistics....
     in dictionaries to indicate stress types in the language's pitch accent
    Pitch accent

    Pitch accent is a linguistics term of convenience for a variety of restricted tone systems that use variations in Pitch to give prominence to a syllable or Mora_ within a word....
     system.
  • Maltese
    Maltese alphabet

    The Maltese alphabet is based on the Latin alphabet with the addition of some letters with diacritic marks and digraphs. It is used to write the Maltese language....
     sometimes uses diacritics on some vowels to indicate stress or long vowels, but this is restricted to pronunciation assistance in dictionaries.
  • Occitan
    Occitan alphabet

    The Occitan language alphabet consists of the following 23 Latin alphabet:|- |bgcolor="#EFEFEF" align="center" colspan="26" | Capital letters ...
     has the following composite characters: á, à, ç, é, è, í, ï, ó, ò, ú, ü, n·h, s·h. The acute and the grave accent indicate stress
    Stress (linguistics)

    In linguistics, stress is the relative emphasis that may be given to certain syllables in a word. The term is also used for similar patterns of phonetic prominence inside syllables....
     and vowel height, the cedilla marks the result of a historical palatalization
    Palatalization

    Palatalization or palatalisation generally refers to two phenomena:*As a process or the result of a process, the effect that front vowels and the palatal approximant frequently have on consonants;...
    , the diaeresis mark indicates either a hiatus
    Hiatus (linguistics)

    Hiatus in linguistics is the separate pronunciation of two adjacent vowels, sometimes with an intervening glottal stop. In poetic metre , hiatus can also refer to the failure of two vowels straddling a word boundary to coalesce, for example by elision of the first vowel....
    , or that the letter u is pronounced when the graphemes gü, qü are followed by e or i, the interpunct
    Interpunct

    An interpunct is a small dot used for interword separation in ancient Latin alphabet, being perhaps the first consistent visual representation of word boundaries in written language....
     (·) distinguishes the different values of nh/n·h and sh/s·h.
  • Portuguese
    Portuguese alphabet

    The Portuguese alphabet, , consists of the following 23 or 26 Latin alphabet:|- |bgcolor="#EFEFEF" align="center" colspan="26" | Capital letters ...
     has the following composite characters: à, á, â, ã, ç, é, ê, í, ó, ô, õ, ú. The acute and the circumflex accent indicate stress and vowel height, the grave accent indicates crasis, the tilde represents nasalization, and the cedilla marks the result of a historical palatalization.
  • Acute accents are also used in Slavic language dictionaries and textbooks to indicate lexical stress, placed over the vowel of the stressed syllable. This can also serve to disambiguate meaning (e.g., in Russian ????´?? (pisát) means "to write", but ??´???? (písat) means "to piss"), or "??´????? ?????" (the biggest part) vs "??????´? ?????" (the big part).
  • Slovak
    Slovak alphabet

    The Slovak alphabet uses a modification of the Latin alphabet. The modifications include the four diacriticals placed above certain letters....
     has the acute (á, é, í, l, ó, r, ú, ý), the caron (c, d, dž, l, n, š, t, ž), the circumflex (only above o - ô) and the diaeresis (only above a - ä).
  • Spanish uses the acute accent and the diaeresis. The acute is used on a vowel in a stressed syllable in words with irregular stress patterns. It can also be used to "break up" a diphthong
    Diphthong

    In phonetics, a diphthong, or , is a contour vowel?that is, a unitary vowel that changes vowel quality during its pronunciation, or "glides", with a glissando of the tongue from one articulation to another, as in the English words eye, boy, and cow. This contrasts with "pure" vowels, or monophthongs, where the tongue is held s...
     as in tío (pronounced , rather than as it would be without the accent). Moreover, the acute can be used to distinguish words that otherwise are spelt alike, such as si ("if") and ("yes"), and also to distinguish interrogative and exclamative pronouns from homophones with a different grammatical function, such as donde/¿dónde? ("where"/"where?") or como/¿cómo? ("as"/"how?") The diaeresis is used only over u (ü) so that it be pronounced in the combinations gue and gui (where u is normally silent), for example ambigüedad. In poetry, the diaeresis may be used on i and u as a way to force a hiatus. Very rarely, the "dotted l" may also be found, especially in loanwords, in order to indicate that the letters "ll" should not be pronounced as the Castilian letter "elle," but rather, as the letter "ele," in much the way that a diaeresis might be used to "break" a diphthong in other languages. This is accent mark, however, derives from other Iberian languages.
  • Swedish
    Swedish alphabet

    The Swedish language alphabet consists of the following 29 letters:Upper CaseLower CaseIn addition to the commonly occurring letters of the Latin alphabet, A-Z, the Swedish alphabet has the three letter , "?", "?" and "?"....
     uses the acute accent
    Acute accent

    The acute accent is a diacritic mark used in many modern written languages with alphabets based on the Latin alphabet, Cyrillic alphabet and Greek alphabet writing systems....
     to show non-standard stress, for example in (café) and (résumé). This occasionally helps resolve ambiguities, such as ide (hibernation) versus idé (idea). In these words, the acute accent is not optional. Some proper names use non-standard diacritics, such as Carolina Klüft
    Carolina Klüft

    Carolina Evelyn Kl?ft is a Swedish Track and field athletics competing in triple jump, long jump and formerly in heptathlon and women's pentathlon....
     and Staël von Holstein
    Staël von Holstein

    According to tradition the family came to Sweden from Livonia during the 1600s via the Polish Major Hildebrand Sta?l. The family was naturalized as Swedish Nobility on the 14th of October 1652, and the family was introduced at the Swedish House of Knights on the 20th of September 1675....
    . For foreign loanwords the original accents are strongly recommended, unless the word has been infused into the language, in which case they are optional. Hence crème fraîche but ampere.
  • Tamil does not have any diacritics in itself, but uses the Hindu numerals
    Arabic numerals

    The 'arabic numerals', or 'Hindu numerals' are the ten digits , which?along with Decimal Number System by which a sequence was read as a number?were originally defined by Indian mathematics, later modified and transferred to North African Islamic mathematics and transmitted to Europe in the Middle Ages, whence they spread around the wo...
     2, 3 and 4 as diacritics to represent aspirated, voiced, and voiced-aspirated consonants when the Tamil script
    Tamil script

    The Tamil script is a Vatteluttu that is used to write the Tamil language. With the use of special diacritics to represent aspiration and voice consonants not represented in the basic script, it is also used to write Saurashtra language and, by Tamil people, to write Sanskrit....
     is used to write long passages in Sanskrit.
  • Thai
    Thai alphabet

    The Thai alphabet is used to write the Thai language and other :Category:Languages of Thailands in Thailand. It has forty-four consonants , fifteen vowel symbols that combine into at least twenty-eight vowel forms, and four tone marks ....
     has its own system of diacritics
    Thai alphabet

    The Thai alphabet is used to write the Thai language and other :Category:Languages of Thailands in Thailand. It has forty-four consonants , fifteen vowel symbols that combine into at least twenty-eight vowel forms, and four tone marks ....
     derived from Indic
    Indic

    Indic can refer to:* Indo-Aryan languages* Indic scripts* Related to South Asia* of or related to India ; see Indica...
     numerals, which denote different tones
    Tone (linguistics)

    Tone is the use of pitch in language to distinguish lexical or grammatical meaning?that is, to distinguish or inflection words. All languages use pitch to express emotional and other paralinguistic information, and to convey emphasis, contrast, and other such features in what is called intonation , but not all languages use tones to distingu...
    .
  • Vietnamese
    Vietnamese alphabet

    The Vietnamese alphabet has the following 29 letters, in collation order:Vietnamese also uses the ten Digraph s and one Trigraph below.These groups were formerly considered single letters and are treated as such in older dictionaries....
     uses the acute accent (d?u s?c), the grave accent (d?u huy?n), the tilde (d?u ngã), the dot below (d?u n?ng) and the hook (d?u h?i) on vowels as tone
    Tone (linguistics)

    Tone is the use of pitch in language to distinguish lexical or grammatical meaning?that is, to distinguish or inflection words. All languages use pitch to express emotional and other paralinguistic information, and to convey emphasis, contrast, and other such features in what is called intonation , but not all languages use tones to distingu...
     indicators.
  • Welsh
    Welsh alphabet

    The modern Welsh language alphabet contains 28 letters, of which eight are digraph s:The acute accent, the grave accent, the circumflex and the Umlaut are also used on vowels, but accented letters are not regarded as part of the alphabet....
     uses the circumflex, diaeresis, acute and grave accents on its seven vowels a, e, i, o, u, w, y. The most common is the circumflex (which it calls to bach, meaning "little roof", or acen grom "crooked accent", or hirnod "long sign") to denote a long vowel, usually to disambiguate it from a similar word with a short vowel. The rarer grave accent has the opposite effect, shortening vowel sounds which would usually be pronounced long. The acute accent and diaeresis are also occasionally used, to denote stress and vowel separation respectively. The w-circumflex and the y-circumflex are among the most commonly accented characters in Welsh, but unusual in languages generally, and were until recently very hard to obtain in word-processed and HTML documents.


Transliteration

Several languages which are not written with the Roman alphabet are transliterated
Transliteration

Transliteration is the practice of transcribing a word or text written in one writing system into another writing system or system of rules for such practice....
, or romanized, using diacritics. Examples:
  • Chinese
    Chinese language

    Chinese or the Sinitic language is a language family consisting of language mutually unintelligible to varying degrees. Originally the indigenous languages spoken by the Han Chinese in China, it forms one of the two branches of Sino-Tibetan languages of languages....
     has several romanization
    Romanization of Chinese

    The romanization of Chinese is the use of the Latin alphabet to write Chinese. Chinese has been written in Chinese characters since about 1500 B.C....
    s that use the umlaut, but only on u (ü). In Hanyu Pinyin, the four tones
    Tone (linguistics)

    Tone is the use of pitch in language to distinguish lexical or grammatical meaning?that is, to distinguish or inflection words. All languages use pitch to express emotional and other paralinguistic information, and to convey emphasis, contrast, and other such features in what is called intonation , but not all languages use tones to distingu...
     of Mandarin Chinese are denoted by the macron (first tone), acute (second tone), caron (third tone) and grave (fourth tone) diacritics. Example: a, á, a, à.
  • Romanized Japanese
    Japanese language

    IPA: [n?iho?go] is a language spoken by over 130 million people in Japan and in Japanese emigrant communities. It is related to the Ryukyuan languages....
     (Romaji) uses diacritics to mark long vowels. The Hepburn romanization
    Hepburn romanization

    The is named after James Curtis Hepburn, who used it to transcribe the sounds of the Japanese language into the Latin alphabet in the third edition of his Japanese?English dictionary, published in 1887....
     system uses a macron
    Macron

    A macron, from Greek language meaning "long", is a diacritic ? placed over or under a vowel which was originally used to mark a Long syllable#Syllable weight in classical poetry in Meter #Greek and Latin, but has now been taken also to indicate that the vowel is long vowel....
     to mark long vowels, and the Kunrei-shiki
    Kunrei-shiki

    is a romanization system, i.e. a system for transcribing the Japanese language into the Latin alphabet. It is abbreviated as Kunrei-shiki. Its name is rendered Kunreisiki using Kunrei-shiki itself....
     and Nihon-shiki
    Nihon-shiki

    Nihon-shiki or Nippon-shiki Romaji is a romanization system for transcribing the Japanese language into the Latin alphabet. In discussion about Romaji, it is abbreviated as Nihon-shiki or Nippon-shiki....
     systems use a circumflex
    Circumflex

    The circumflex is a diacritic mark used in written Serbian language, Croatian language, Esperanto, French language, West Frisian language, Norwegian language, Romanian language, Slovak language, Vietnamese language, Romaji, Romanization of Persian, Welsh language, Portuguese language, Italian language, Afrikaans language, Turkish language...
    .
  • Sanskrit
    Sanskrit

    Sanskrit is a historical Indo-Aryan language, one of the liturgical languages of Hinduism and Buddhism, and one of the 22 official languages of India....
    , as well as many of its descendants, like Hindi
    Hindi

    Standard Hindi, also known as High Hindi, Nagari Hindi or Literary Hindi is a Standard language register of Hindi. It is one of the 22 official languages of India, and is used, along with English language, for administration of the central government....
     and Bengali
    Bengali language

    Bengali or Bangla is an Indo-European languages language of the eastern Indian subcontinent, evolved from the Magadhi Prakrit and Sanskrit languages....
    , uses a lossless transliteration
    Transliteration

    Transliteration is the practice of transcribing a word or text written in one writing system into another writing system or system of rules for such practice....
     system for representing words in the Roman alphabet. This includes several letters with diacritical markings, such as horizontal lines above vowels (a, i, u), dots above and below consonants (?, ?, ?, ?, ?, ?, ?) as well as a few others (s, ñ).
  • Arabic
    Arabic language

    Arabic is a Central Semitic language, thus related to and classified alongside other Semitic languages languages such as Hebrew language and Aramaic language....
     has several romanisations, depending on the type of the application, region, intended audience, country, etc. many of them extensively use diacritics, e.g. some methods use a dot for rendering emphatic consonant
    Emphatic consonant

    Emphatic consonant is a term widely used in Semitic languages linguistics to describe one of a series of obstruent consonants which originally contrasted with series of both voiced and voiceless obstruents....
    s (?, ?, ?, ?, ?). Macron is often used to render long vowels. š is often used for /?/. g for /?/.


See also

  • Alphabets derived from the Latin
    Alphabets derived from the Latin

    A Latin-derived alphabet is an alphabetical writing system that uses letters of the original Roman Latin alphabet and extensions. Extending can be done by adding diacritics to existing letters, joining multiple letters together to make ligature , creating completely new forms, or assigning a special function to pairs or triplets of letters....
Category:Latin-derived alphabets
Category:Specific letter-diacritic combinations
  • Collating sequence
    Collating sequence

    The term collating sequence refers to the order in which character strings should be placed when sorting them.A common example is the familiar "alphabetic order", in which "Alfred" occurs before "Zeus" because "A" occurs before "Z" in the English alphabet....
  • Combining character
    Combining character

    In digital typography, combining characters are Character that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritical marks ....
  • Heavy metal umlaut
    Heavy metal umlaut

    A metal umlaut is an umlaut that is sometimes used gratuitously or decoratively over letters in the names of heavy metal music bands, such as in M?tley Cr?e or Mot?rhead....
  • Latin alphabet
    Latin alphabet

    The Latin alphabet, also called the Roman alphabet, is the most widely used alphabetic writing system in the world today. It evolved from the western variety of the Greek alphabet called the Cumae alphabet, and was initially developed by the Ancient Romes to write the Latin....
  • List of English words with diacritics
    List of English words with diacritics

    Some English language words have letters with diacritical marks. Most of the words are loanwords from French language, with others coming from Spanish language, German language, or other languages....
  • List of Latin letters
  • List of U.S. cities with diacritics
    List of U.S. cities with diacritics

    This is a list of U.S. cities whose official names have diacritics.* Ca?on City, Colorado, Fremont County, Colorado* Do?a Ana, New Mexico, Do?a Ana County, New Mexico...
  • Romanization
    Romanization

    In linguistics, romanization is the representation of a written word or spoken speech with the Latin alphabet, or a system for doing so, where the original word or language uses a different writing system ....
  • Typing accents

External links

  • PDF at Adobe.com
  • — Learn how to create world language accent marks and other diacritics on a computer
  • — A text editor which makes it easier to type diacritics.