Indian Standard Code for Information Interchange (
ISCII) is a coding scheme for representing various writing systems of
IndiaIndia , officially the Republic of India , is a country in South Asia. It is the seventh-largest country by geographical area, the second-most populous country with over 1.2 billion people, and the most populous democracy in the world...
. It encodes the main Indic scripts and a Roman transliteration. The supported scripts are:
AssameseThe Assamese script is a variant of the Eastern Nagari script also used for Bengali and Bishnupriya Manipuri. The Eastern Nagari script belongs to the Brahmic family of scripts and has a continuous history of development from Nagari script, a precursor of Devanagari...
,
Bengali (Bengla)The Bengali alphabet is the writing system for the Bengali language. The script with variations is used for Assamese and is basis for Meitei, Bishnupriya Manipuri, Kokborok, Garo and Mundari alphabets. All these languages are spoken in the eastern region of South Asia. Historically, the script has...
,
DevanagariDevanagari |deva]]" and "nāgarī" ), also called Nagari , is an abugida alphabet of India and Nepal...
,
GujaratiThe Gujarati script , which like all Nāgarī writing systems is strictly speaking an abugida rather than an alphabet, is used to write the Gujarati and Kutchi languages...
, Gurmukhi,
KannadaThe Kannada script is an alphasyllabary of the Brahmic family, used primarily to write the Kannada language, one of the Dravidian languages of southern India and also Sanskrit in the past. The Telugu script is derived from Old Kannada, and resembles Kannada script...
,
MalayalamThe Malayalam script is a Brahmic script used commonly to write the Malayalam language—which is the principal language of the Indian state of Kerala, spoken by 36 million people in the world. Like many other Indic scripts, it is an abugida, or a writing system that is partially “alphabetic” and...
,
Oriya (Odia)The Oriya script or Utkala Lipi or Utkalakshara is used to write the Oriya language, and can be used for several other Indian languages, for example, Sanskrit.- History :...
,
TamilThe Tamil script is a script that is used to write the Tamil language as well as other minority languages such as Badaga, Irulas, and Paniya...
, and
TeluguTelugu script, an abugida from the Brahmic family of scripts, is used to write the Telugu language, a language found in the South-Central Indian state of Andhra Pradesh as well as several other neighboring states. The Telugu script is derived from the Bhattiprolu script...
. ISCII does not encode the writing systems of India based on Arabic, but its writing system switching codes nonetheless provide for
KashmiriKashmiri is a language from the Dardic sub-group and it is spoken primarily in the Kashmir Valley, in Jammu and Kashmir. There are approximately 5,554,496 speakers in Jammu and Kashmir, according to the Census of 2001. Most of the 105,000 speakers or so in Pakistan are émigrés from the Kashmir...
,
SindhiSindhi is the language of the Sindh region of Pakistan that is spoken by the Sindhi people. In India, it is among 22 constitutionally recognized languages, where Sindhis are a sizeable minority. It is spoken by 53,410,910 people in Pakistan, according to the national government's Statistics Division...
,
UrduUrdu is a register of the Hindustani language that is identified with Muslims in South Asia. It belongs to the Indo-European family. Urdu is the national language and lingua franca of Pakistan. It is also widely spoken in some regions of India, where it is one of the 22 scheduled languages and an...
,
PersianPersian is an Iranian language within the Indo-Iranian branch of the Indo-European languages. It is primarily spoken in Iran, Afghanistan, Tajikistan and countries which historically came under Persian influence...
, Pashto and Arabic. The Arabic-based writing systems were subsequently encoded in the
PASCIIPerso-Arabic Script Code for Information Interchange is one of the Indian government standards for encoding languages using writing systems based on that of Arabic, in particular Kashmiri, Persian, Sindhi, and Urdu...
encoding.
The Brahmi-derived writing systems are mostly rather similar in structure, but have different letter shapes. So ISCII encodes letters with the same phonetic value at the same codepoint, overlaying the various scripts. For example, the ISCII codes 0xB3 0xDB represent [ki]. This will be rendered as कि in Devanagari, as ਕਿ in Gurmukhi, and as கி in Tamil. The writing system can be selected in rich text by markup or in plain text by means of the ATR code described below.
One motivation for the use of a single encoding is the idea that it will allow easy
transliterationTransliteration is a subset of the science of hermeneutics. It is a form of translation, and is the practice of converting a text from one script into another...
from one writing system to another. However, there are enough incompatibilities that this is not really a practical idea. See
About ISCII.
ISCII is a stateful 8-bit encoding. The lower 128 codepoints are plain ASCII, the upper 128 codepoints are ISCII-specific. In addition to the codepoints representing characters, ISCII makes use of a codepoint with mnemonic ATR that indicates that the following byte contains one of two kinds of information. One set of values changes the writing system until the next writing system indicator or end-of-line. Another set of values select display modes, such as bold and italic. ISCII does not provide a means of indicating the default writing system.
ISCII has not been widely used outside of certain government institutions and has now been rendered largely obsolete by
UnicodeUnicode is a computing industry standard for the consistent encoding, representation and handling of text expressed in most of the world's writing systems...
. Unicode uses a separate block for each Indic writing system, and largely preserves the ISCII layout within each block.
Codepage layout
The following table shows the character set for
DevanagariDevanagari |deva]]" and "nāgarī" ), also called Nagari , is an abugida alphabet of India and Nepal...
. The code sets for Assamese, Bengali, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu are similar, with each Devanagari form replaced by the
equivalent form in each writing systemThe Brahmic or Indic scripts are a family of abugida writing systems. They are used throughout South Asia , Southeast Asia, and parts of Central and East Asia, and are descended from the Brāhmī script of the ancient Indian subcontinent...
. Each character is shown with its decimal code and its
UnicodeUnicode is a computing industry standard for the consistent encoding, representation and handling of text expressed in most of the world's writing systems...
equivalent.
| Windows-1251 |
|
—0 |
—1 |
—2 |
—3 |
—4 |
—5 |
—6 |
—7 |
—8 |
—9 |
—A |
—B |
—C |
—D |
—E |
—F |
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
| | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Special code points
INV character—code point D9 (217): The INV character is used as a pseudo-consonant to display combining elements in isolation. For example, क (ka) + ् (halant) + INV = क् (half ka). The Unicode equivalent is no break space 00A0 or dotted circle ◌ 25CC.
ATR character—code point EF (239): The ATR character followed by a byte code is used to switch to a different font attribute (such as bold) or language (such as Bengali), up to the next ATR sequence or the end of the line. This has no direct Unicode equivalent, as font attributes are not part of Unicode, and each script has a distinct set of code points.
EXT character—code point F0 (240): The EXT character followed by a byte code indicates a Vedic accent. This has no direct Unicode equivalent, as Vedic accents are assigned to distinct code points.
Halant character ़—code point E8 (232): The halant character removes the implicit vowel from a consonant and is used between consonants to represent conjunct consonants. For example, क (ka) + ् (halant) + त (ta) = क्त (kta). The sequence ् (halant) + ् (halant) displays a conjunct with an explicit halant, for example क (ka) + ् (halant) + ् (halant) + त (ta) = क्त. The sequence ् (halant) + ़ (nukta) displays a conjunct with half consonants, if available, for example क (ka) + ् (halant) + ़ (nukta) + त (ta) = क्त.
| ISCII | | Unicode |
| single halant |
E8 |
halant |
094D |
| halant + halant |
E8 E8 |
halant + ZWNJ The zero-width non-joiner is a non-printing character used in the computerization of writing systems that make use of ligatures. When placed between two characters that would otherwise be connected into a ligature, a ZWNJ causes them to be printed in their final and initial forms, respectively... |
094D 200C |
| halant + nukta |
E8 E9 |
halant + ZWJ The zero-width joiner is a non-printing character used in the computerized typesetting of some complex scripts, such as the Arabic script or any of the Indic scripts. When placed between two characters that would otherwise not be connected, a ZWJ causes them to be printed in their connected... |
094D 200D |
Nukta character ़—code point E9 (233): The
nuktaNukta is a generic term for the diacritic mark in several Brahmic scripts, like Devanagari that is used to represent sounds from other languages by being applied to an existing character. The word nukta, originates from the Arabic word nuqta ....
character after another ISCII character is used for a number of rarer characters which don't exist in the main ISCII set. For example क (ka) + ़ (nukta) = क़ (qa). These characters have precomposed forms in Unicode, as shown in the following table.
ISCII code point | Original character | Character with nukta | Unicode code point |
| A1 (161) |
ँ |
ॐ |
0950 |
| A6 (166) |
इ |
ऌ |
090C |
| A7 (167) |
ई |
ॡ |
0961 |
| AA (176) |
ऋ |
ॠ |
0960 |
| B3 (179) |
क |
क़ |
0958 |
| B4 (180) |
ख |
ख़ |
0959 |
| B5 (181) |
ग |
ग़ |
095A |
| BA (186) |
ज |
ज़ |
095B |
| BF (191) |
ड |
ड़ |
095C |
| C0 (192) |
ढ |
ढ़ |
095D |
| C9 (201) |
फ |
फ़ |
095E |
| DB (219) |
ि |
ॢ |
0962 |
| DC (220) |
ी |
ॣ |
0963 |
| DF (223) |
ृ |
ॄ |
0944 |
| EA (234) |
। |
ऽ |
0964 |
Code points for all languages
Each alphabet is listed in the order of its ISCII code point. Code points with asterisks (*) indicate the code point followed by nukta, e.g. क (ka) + ़ = क़ (qa); इ (i) + ़ = ऌ (ḷ). Each character is listed along with its Unicode code point.
| Code-set for all alphabets using ISCII |
EWLINE
| Hex | Official Listing | ISO 15919 ISO 15919 Transliteration of Devanagari and related Indic scripts into Latin characters is an international standard for the transliteration of Indic scripts to the Latin alphabet formed in 2001...
| DevanagariDevanagari |deva]]" and "nāgarī" ), also called Nagari , is an abugida alphabet of India and Nepal...
| BengaliThe Bengali alphabet is the writing system for the Bengali language. The script with variations is used for Assamese and is basis for Meitei, Bishnupriya Manipuri, Kokborok, Garo and Mundari alphabets. All these languages are spoken in the eastern region of South Asia. Historically, the script has...
| Gurmukhi Gurmukhi is the most common script used for writing the Punjabi language. An abugida derived from the Laṇḍā script and ultimately descended from Brahmi, Gurmukhi was standardized by the second Sikh guru, Guru Angad Dev Ji, in the 16th century. The whole of the Sri Guru Granth Sahib Ji's 1430...
| GujaratiThe Gujarati script , which like all Nāgarī writing systems is strictly speaking an abugida rather than an alphabet, is used to write the Gujarati and Kutchi languages...
| OriyaThe Oriya script or Utkala Lipi or Utkalakshara is used to write the Oriya language, and can be used for several other Indian languages, for example, Sanskrit.- History :...
| Tamil The Tamil script is a script that is used to write the Tamil language as well as other minority languages such as Badaga, Irulas, and Paniya...
| Telugu Telugu script, an abugida from the Brahmic family of scripts, is used to write the Telugu language, a language found in the South-Central Indian state of Andhra Pradesh as well as several other neighboring states. The Telugu script is derived from the Bhattiprolu script...
| KannadaThe Kannada script is an alphasyllabary of the Brahmic family, used primarily to write the Kannada language, one of the Dravidian languages of southern India and also Sanskrit in the past. The Telugu script is derived from Old Kannada, and resembles Kannada script...
| MalayalamThe Malayalam script is a Brahmic script used commonly to write the Malayalam language—which is the principal language of the Indian state of Kerala, spoken by 36 million people in the world. Like many other Indic scripts, it is an abugida, or a writing system that is partially “alphabetic” and...
|
| A0 |
Sign OMOm or Aum Om or Aum Om or Aum (also , written in Devanāgari as and as , in Sanskrit known as (lit. "to sound out loudly"), ', or ' (also as ') (lit. "Auṃ form/syllable"), is a sacred/mystical syllable in the Dharmic or Indian religions, i.e... |
|
ॐ |
0950 |
|
|
ૐ |
0AD0 |
|
|
|
|
|
| A1 |
Vowel-modifier CHANDRABINDU Chandrabindu is a diacritic sign having the form of a dot inside the lower half of a circle. It is used in the Devanagari , Bengali , Gujarati , Oriya and Telugu scripts.It usually means that the previous vowel is nasalized... |
|
ँ |
0901 |
ঁ |
0981 |
ਁ |
0A01 |
ઁ |
0A81 |
ଁ |
0B01 |
|
ఁ |
0C01 |
|
|
| A2 |
Vowel-modifier ANUSWAR Anusvara is the diacritic used to mark a type of nasalization used in a number of Indic languages. Depending on the location of the anusvara in the word and the language within which it is used, its exact pronunciation can vary greatly.... |
ṁ |
ं |
0902 |
ং |
0982 |
ਂ |
0A02 |
ં |
0A82 |
ଂ |
0B02 |
ஂ |
0B82 |
ం |
0C02 |
ಂ |
0C82 |
ം |
0D02 |
| A3 |
Vowel-modifier VISARG Visarga is a Sanskrit word meaning "sending forth, discharge". In Sanskrit phonology , is the name of a phone, , written as IAST , Harvard-Kyoto , Devanagari . Visarga is an allophone of and in pausa... |
ḥ |
ः |
0903 |
ঃ |
0983 |
ਃ |
0A03 |
ઃ |
0A83 |
ଃ |
0B03 |
ஃ |
0B83 |
ః |
0C03 |
ಃ |
0C83 |
ഃ |
0D03 |
| A4 |
Vowel A |
a |
अ |
0905 |
অ |
0985 |
ਅ |
0A05 |
અ |
0A85 |
ଅ |
0B05 |
அ |
0B85 |
అ |
0C05 |
ಅ |
0C85 |
അ |
0D05 |
| A5 |
Vowel AA |
ā |
आ |
0906 |
আ |
0986 |
ਆ |
0A06 |
આ |
0A86 |
ଆ |
0B06 |
ஆ |
0B86 |
ఆ |
0C06 |
ಆ |
0C86 |
ആ |
0D06 |
| A6 |
Vowel I |
i |
इ |
0907 |
ই |
0987 |
ਇ |
0A07 |
ઇ |
0A87 |
ଇ |
0B07 |
இ |
0B87 |
ఇ |
0C07 |
ಇ |
0C87 |
ഇ |
0D07 |
| A6* |
Vowel LI (Sanskrit) |
ḷ |
ऌ |
090C |
ঌ |
098C |
|
ઌ |
0A8C |
ଌ |
0B0C |
|
ఌ |
0C0C |
ಌ |
0C8C |
ഌ |
0D0C |
| A7 |
Vowel II |
ī |
ई |
0908 |
ঈ |
0988 |
ਈ |
0A08 |
ઈ |
0A88 |
ଈ |
0B08 |
ஈ |
0B88 |
ఈ |
0C08 |
ಈ |
0C88 |
ഈ |
0D08 |
| A7* |
Vowel LII (Sanskrit) |
ḹ |
ॡ |
0961 |
ৡ |
09E1 |
|
ૡ |
0AE1 |
ୡ |
0B61 |
|
ౡ |
0C61 |
ೡ |
0CE1 |
ൡ |
0D61 |
| A8 |
Vowel U |
u |
उ |
0909 |
উ |
0989 |
ਉ |
0A09 |
ઉ |
0A89 |
ଉ |
0B09 |
உ |
0B89 |
ఉ |
0C09 |
ಉ |
0C89 |
ഉ |
0D09 |
| A9 |
Vowel UU |
ū |
ऊ |
090A |
ঊ |
098A |
ਊ |
0A0A |
ઊ |
0A8A |
ଊ |
0B0A |
ஊ |
0B8A |
ఊ |
0C0A |
ಊ |
0C8A |
ഊ |
0D0A |
| AA |
Vowel RI |
r̥ |
ऋ |
090B |
ঋ |
098B |
|
ઋ |
0A8B |
ଋ |
0B0B |
|
ఋ |
0C0B |
ಋ |
0C8B |
ഋ |
0D0B |
| AA* |
Vowel RII (Sanskrit) |
ṝ |
ॠ |
0960 |
ৠ |
09E0 |
|
ૠ |
0AE0 |
ୠ |
0B60 |
|
ౠ |
0C60 |
ೠ |
0CE0 |
ൠ |
0D60 |
| AB |
Vowel E (Southern Scripts) |
e |
ऎ |
090E |
|
|
|
|
எ |
0B8E |
ఎ |
0C0E |
ಎ |
0C8E |
എ |
0D0E |
| AC |
Vowel EY |
ē |
ए |
090F |
এ |
098F |
ਏ |
0A0F |
એ |
0A8F |
ଏ |
0B0F |
ஏ |
0B8F |
ఏ |
0C0F |
ಏ |
0C8F |
ഏ |
0D0F |
| AD |
Vowel AI |
ai |
ऐ |
0910 |
ঐ |
0990 |
ਐ |
0A10 |
ઐ |
0A90 |
ଐ |
0B10 |
ஐ |
0B90 |
ఐ |
0C10 |
ಐ |
0C90 |
ഐ |
0D10 |
| AE |
Vowel AYE (Devanagari Script) |
ê |
ऍ |
090D |
|
|
ઍ |
0A8D |
|
|
|
|
|
| AF |
Vowel O (Southern Scripts) |
o |
ऒ |
0912 |
|
|
|
|
ஒ |
0B92 |
ఒ |
0C12 |
ಒ |
0C92 |
ഒ |
0D12 |
| B0 |
Vowel OW |
ō |
ओ |
0913 |
ও |
0993 |
ਓ |
0A13 |
ઓ |
0A93 |
ଓ |
0B13 |
ஓ |
0B93 |
ఓ |
0C13 |
ಓ |
0C93 |
ഓ |
0D13 |
| B1 |
Vowel AU |
au |
औ |
0914 |
ঔ |
0994 |
ਔ |
0A14 |
ઔ |
0A94 |
ଔ |
0B14 |
ஔ |
0B94 |
ఔ |
0C14 |
ಔ |
0C94 |
ഔ |
0D14 |
| B2 |
Vowel AWE (Devanagari Script) |
ô |
ऑ |
0911 |
|
|
ઑ |
0A91 |
|
|
|
|
|
| B3 |
Consonant KA |
k |
क |
0915 |
ক |
0995 |
ਕ |
0A15 |
ક |
0A95 |
କ |
0B15 |
க |
0B95 |
క |
0C15 |
ಕ |
0C95 |
ക |
0D15 |
| B3* |
Consonant QA (Urdu) |
q |
क़ |
0958 |
|
|
|
|
|
|
|
|
| B4 |
Consonant KHA |
kh |
ख |
0916 |
খ |
0996 |
ਖ |
0A16 |
ખ |
0A96 |
ଖ |
0B16 |
|
ఖ |
0C16 |
ಖ |
0C96 |
ഖ |
0D16 |
| B4* |
Consonant KHHA (Urdu) |
kh |
ख़ |
0959 |
|
ਖ਼ |
0A59 |
|
|
|
|
|
|
| B5 |
Consonant GA |
g |
ग |
0917 |
গ |
0997 |
ਗ |
0A17 |
ગ |
0A97 |
ଗ |
0B17 |
|
గ |
0C17 |
ಗ |
0C97 |
ഗ |
0D17 |
| B5* |
Consonant GHHA (Urdu) |
ġ |
ग़ |
095A |
|
ਗ਼ |
0A5A |
|
|
|
|
|
|
| B6 |
Consonant GHA |
gh |
घ |
0918 |
ঘ |
0998 |
ਘ |
0A18 |
ઘ |
0A98 |
ଘ |
0B18 |
|
ఘ |
0C18 |
ಘ |
0C98 |
ഘ |
0D18 |
| B7 |
Consonant NGA |
ṅ |
ङ |
0919 |
ঙ |
0999 |
ਙ |
0A19 |
ઙ |
0A99 |
ଙ |
0B19 |
ங |
0B99 |
ఙ |
0C19 |
ಙ |
0C99 |
ങ |
0D19 |
| B8 |
Consonant CHA |
c |
च |
091A |
চ |
099A |
ਚ |
0A1A |
ચ |
0A9A |
ଚ |
0B1A |
ச |
0B9A |
చ |
0C1A |
ಚ |
0C9A |
ച |
0D1A |
| B9 |
Consonant CHHA |
ch |
छ |
091B |
ছ |
099B |
ਛ |
0A1B |
છ |
0A9B |
ଛ |
0B1B |
|
ఛ |
0C1B |
ಛ |
0C9B |
ഛ |
0D1B |
| BA |
Consonant JA |
j |
ज |
091C |
জ |
099C |
ਜ |
0A1C |
જ |
0A9C |
ଜ |
0B1C |
ஜ |
0B9C |
జ |
0C1C |
ಜ |
0C9C |
ജ |
0D1C |
| BA* |
Consonant ZA (Urdu) |
z |
ज़ |
095B |
|
ਜ਼ |
0A5B |
|
|
|
|
|
|
| BB |
Consonant JHA |
jh |
झ |
091D |
ঝ |
099D |
ਝ |
0A1D |
ઝ |
0A9D |
ଝ |
0B1D |
|
ఝ |
0C1D |
ಝ |
0C9D |
ഝ |
0D1D |
| BC |
Consonant JNA |
ñ |
ञ |
091E |
ঞ |
099E |
ਞ |
0A1E |
ઞ |
0A9E |
ଞ |
0B1E |
ஞ |
0B9E |
ఞ |
0C1E |
ಞ |
0C9E |
ഞ |
0D1E |
| BD |
Consonant Hard TA |
ṭ |
ट |
091F |
ট |
099F |
ਟ |
0A1F |
ટ |
0A9F |
ଟ |
0B1F |
ட |
0B9F |
ట |
0C1F |
ಟ |
0C9F |
ട |
0D1F |
| BE |
Consonant Hard THA |
ṭh |
ठ |
0920 |
ঠ |
09A0 |
ਠ |
0A20 |
ઠ |
0AA0 |
ଠ |
0B20 |
|
ఠ |
0C20 |
ಠ |
0CA0 |
ഠ |
0D20 |
| BF |
Consonant Hard DA |
ḍ |
ड |
0921 |
ড |
09A1 |
ਡ |
0A21 |
ડ |
0AA1 |
ଡ |
0B21 |
|
డ |
0C21 |
ಡ |
0CA1 |
ഡ |
0D21 |
| BF* |
Consonant Flapped DA |
ṛ |
ड़ |
095C |
ড় |
09DC |
ੜ |
0A5C |
|
ଡ଼ |
0B5C |
|
|
|
|
| C0 |
Consonant Hard DHA |
ḍh |
ढ |
0922 |
ঢ |
09A2 |
ਢ |
0A22 |
ઢ |
0AA2 |
ଢ |
0B22 |
|
ఢ |
0C22 |
ಢ |
0CA2 |
ഢ |
0D22 |
| C0* |
Consonant Flapped DHA |
ṛh |
ढ़ |
095D |
ঢ় |
09DD |
|
|
ଢ଼ |
0B5D |
|
|
|
|
| C1 |
Consonant Hard NA |
ṇ |
ण |
0923 |
ণ |
09A3 |
ਣ |
0A23 |
ણ |
0AA3 |
ଣ |
0B23 |
ண |
0BA3 |
ణ |
0C23 |
ಣ |
0CA3 |
ണ |
0D23 |
| C2 |
Consonant Soft TA |
t |
त |
0924 |
ত |
09A4 |
ਤ |
0A24 |
ત |
0AA4 |
ତ |
0B24 |
த |
0BA4 |
త |
0C24 |
ತ |
0CA4 |
ത |
0D24 |
| C3 |
Consonant Soft THA |
th |
थ |
0925 |
থ |
09A5 |
ਥ |
0A25 |
થ |
0AA5 |
ଥ |
0B25 |
|
థ |
0C25 |
ಥ |
0CA5 |
ഥ |
0D25 |
| C4 |
Consonant Soft DA |
d |
द |
0926 |
দ |
09A6 |
ਦ |
0A26 |
દ |
0AA6 |
ଦ |
0B26 |
|
ద |
0C26 |
ದ |
0CA6 |
ദ |
0D26 |
| C5 |
Consonant Soft DHA |
dh |
ध |
0927 |
ধ |
09A7 |
ਧ |
0A27 |
ધ |
0AA7 |
ଧ |
0B27 |
|
ధ |
0C27 |
ಧ |
0CA7 |
ധ |
0D27 |
| C6 |
Consonant Soft NA |
n |
न |
0928 |
ন |
09A8 |
ਨ |
0A28 |
ન |
0AA8 |
ନ |
0B28 |
ந |
0BA8 |
న |
0C28 |
ನ |
0CA8 |
ന |
0D28 |
| C7 |
Consonant NA (Tamil) |
ṉ |
ऩ |
0929 |
|
|
|
|
ன |
0BA9 |
|
|
|
| C8 |
Consonant PA |
p |
प |
092A |
প |
09AA |
ਪ |
0A2A |
પ |
0AAA |
ପ |
0B2A |
ப |
0BAA |
ప |
0C2A |
ಪ |
0CAA |
പ |
0D2A |
| C9 |
Consonant PHA |
ph |
फ |
092B |
ফ |
09AB |
ਫ |
0A2B |
ફ |
0AAB |
ଫ |
0B2B |
|
ఫ |
0C2B |
ಫ |
0CAB |
ഫ |
0D2B |
| C9* |
Consonant FA (Urdu) |
f |
फ़ |
095E |
|
ਫ਼ |
0A5E |
|
|
|
|
ೞ |
0CDE |
|
| CA |
Consonant BA |
b |
ब |
092C |
ব |
09AC |
ਬ |
0A2C |
બ |
0AAC |
ବ |
0B2C |
|
బ |
0C2C |
ಬ |
0CAC |
ബ |
0D2C |
| CB |
Consonant BHA |
bh |
भ |
092D |
ভ |
09AD |
ਭ |
0A2D |
ભ |
0AAD |
ଭ |
0B2D |
|
భ |
0C2D |
ಭ |
0CAD |
ഭ |
0D2D |
| CC |
Consonant MA |
m |
म |
092E |
ম |
09AE |
ਮ |
0A2E |
મ |
0AAE |
ମ |
0B2E |
ம |
0BAE |
మ |
0C2E |
ಮ |
0CAE |
മ |
0D2E |
| CD |
Consonant YA |
y |
य |
0930 |
র |
09B0 |
ਰ |
0A30 |
ર |
0AB0 |
ର |
0B30 |
ர |
0BB0 |
ర |
0C30 |
ರ |
0CB0 |
ര |
0D30 |
| CE |
Consonant JYA (Bengali, Assamese & Oriya) |
ẏ |
य़ |
095F |
য় |
09DF |
|
|
ୟ |
0B5F |
|
|
|
|
| CF |
Consonant RA |
r̥ |
र |
092F |
য |
09AF |
ਯ |
0A2F |
ય |
0AAF |
ଯ |
0B2F |
ய |
0BAF |
య |
0C2F |
ಯ |
0CAF |
യ |
0D2F |
| D0 |
Consonant Hard RA (Southern Scripts) |
ṟ |
ऱ |
0931 |
|
|
|
|
ற |
0BB1 |
ఱ |
0C31 |
ಱ |
0CB1 |
റ |
0D31 |
| D1 |
Consonant LA |
l |
ल |
0932 |
ল |
09B2 |
ਲ |
0A32 |
લ |
0AB2 |
ଲ |
0B32 |
ல |
0BB2 |
ల |
0C32 |
ಲ |
0CB2 |
ല |
0D32 |
| D2 |
Consonant Hard LA |
ḷ |
ळ |
0933 |
|
ਲ਼ |
0A33 |
ળ |
0AB3 |
ଳ |
0B33 |
ள |
0BB3 |
ళ |
0C33 |
ಳ |
0CB3 |
ള |
0D33 |
| D3 |
Consonant ZHA (Tamil & Malayalam) |
ḻ |
ऴ |
0934 |
|
|
|
|
ழ |
0BB4 |
|
|
ഴ |
0D34 |
| D4 |
Consonant VA |
v |
व |
0935 |
|
ਵ |
0A35 |
વ |
0AB5 |
ଵ |
0B35 |
வ |
0BB5 |
వ |
0C35 |
ವ |
0CB5 |
വ |
0D35 |
| D5 |
Consonant SHA |
ś |
श |
0936 |
শ |
09B6 |
ਸ਼ |
0A36 |
શ |
0AB6 |
ଶ |
0B36 |
ஶ |
0BB6 |
శ |
0C36 |
ಶ |
0CB6 |
ശ |
0D36 |
| D6 |
Consonant Hard SHA |
ṣ |
ष |
0937 |
ষ |
09B7 |
|
ષ |
0AB7 |
ଷ |
0B37 |
ஷ |
0BB7 |
ష |
0C37 |
ಷ |
0CB7 |
ഷ |
0D37 |
| D7 |
Consonant SA |
s |
स |
0938 |
স |
09B8 |
ਸ |
0A38 |
સ |
0AB8 |
ସ |
0B38 |
ஸ |
0BB8 |
స |
0C38 |
ಸ |
0CB8 |
സ |
0D38 |
| D8 |
Consonant HA |
h |
ह |
0939 |
হ |
09B9 |
ਹ |
0A39 |
હ |
0AB9 |
ହ |
0B39 |
ஹ |
0BB9 |
హ |
0C39 |
ಹ |
0CB9 |
ഹ |
0D39 |
| D9 |
Consonant INVISIBLE |
|
|
|
|
|
|
|
|
|
|
| DA |
Vowel Sign AA |
ā |
ा |
093E |
া |
09BE |
ਾ |
0A3E |
ા |
0ABE |
ା |
0B3E |
ா |
0BBE |
ా |
0C3E |
ಾ |
0CBE |
ാ |
0D3E |
| DB |
Vowel Sign I |
i |
ि |
093F |
ি |
09BF |
ਿ |
0A3F |
િ |
0ABF |
ି |
0B3F |
ி |
0BBF |
ి |
0C3F |
ಿ |
0CBF |
ി |
0D3F |
| DB* |
Vowel Sign LI (Sanskrit) |
ḷ |
ॢ |
0962 |
ৢ |
09E2 |
|
ૢ |
0AE2 |
ୢ |
0B62 |
|
ౢ |
0C62 |
ೢ |
0CE2 |
ൢ |
0D62 |
| DC |
Vowel Sign II |
ī |
ी |
0940 |
ী |
09C0 |
ੀ |
0A40 |
ી |
0AC0 |
ୀ |
0B40 |
ீ |
0BC0 |
ీ |
0C40 |
ೀ |
0CC0 |
ീ |
0D40 |
| DC* |
Vowel Sign LII (Sanskrit) |
ḹ |
ॣ |
0963 |
ৣ |
09E3 |
|
ૣ |
0AE3 |
ୣ |
0B63 |
|
ౣ |
0C63 |
ೣ |
0CE3 |
ൣ |
0D63 |
| DD |
Vowel Sign U |
u |
ु |
0941 |
ু |
09C1 |
ੁ |
0A41 |
ુ |
0AC1 |
ୁ |
0B41 |
ு |
0BC1 |
ు |
0C41 |
ು |
0CC1 |
ു |
0D41 |
| DE |
Vowel Sign UU |
ū |
ू |
0942 |
ূ |
09C2 |
ੂ |
0A42 |
ૂ |
0AC2 |
ୂ |
0B42 |
ூ |
0BC2 |
ూ |
0C42 |
ೂ |
0CC2 |
ൂ |
0D42 |
| DF |
Vowel Sign RI |
r̥ |
ृ |
0943 |
ৃ |
09C3 |
|
ૃ |
0AC3 |
ୃ |
0B43 |
|
ృ |
0C43 |
ೃ |
0CC3 |
ൃ |
0D43 |
| DF* |
Vowel Sign RII (Sanskrit) |
ṝ |
ॄ |
0944 |
ৄ |
09C4 |
|
ૄ |
0AC4 |
ୄ |
0B44 |
|
ౄ |
0C44 |
ೄ |
0CC4 |
ൄ |
0D44 |
| E0 |
Vowel Sign E (Southern Scripts) |
e |
ॆ |
0946 |
|
|
|
|
ெ |
0BC6 |
ె |
0C46 |
ೆ |
0CC6 |
െ |
0D46 |
| E1 |
Vowel Sign EY |
ē |
े |
0947 |
ে |
09C7 |
ੇ |
0A47 |
ે |
0AC7 |
େ |
0B47 |
ே |
0BC7 |
ే |
0C47 |
ೇ |
0CC7 |
േ |
0D47 |
| E2 |
Vowel Sign AI |
ai |
ै |
0948 |
ৈ |
09C8 |
ੈ |
0A48 |
ૈ |
0AC8 |
ୈ |
0B48 |
ை |
0BC8 |
ై |
0C48 |
ೈ |
0CC8 |
ൈ |
0D48 |
| E3 |
Vowel Sign AYE (Devanagari Script) |
ê |
ॅ |
0945 |
|
|
ૅ |
0AC5 |
|
|
|
|
|
| E4 |
Vowel Sign O (Southern Scripts) |
o |
ॊ |
094A |
|
|
|
|
ொ |
0BCA |
ొ |
0C4A |
ೊ |
0CCA |
ൊ |
0D4A |
| E5 |
Vowel Sign OW |
ō |
ो |
094B |
ো |
09CB |
ੋ |
0A4B |
ો |
0ACB |
ୋ |
0B4B |
ோ |
0BCB |
ో |
0C4B |
ೋ |
0CCB |
ോ |
0D4B |
| E6 |
Vowel Sign AU |
au |
ौ |
094C |
ৌ |
09CC |
ੌ |
0A4C |
ૌ |
0ACC |
ୌ |
0B4C |
ௌ |
0BCC |
ౌ |
0C4C |
ೌ |
0CCC |
ൌ |
0D4C |
| E7 |
Vowel Sign AWE (Devanagari Script) |
ô |
ॉ |
0949 |
|
|
ૉ |
0AC9 |
|
|
|
|
|
| E8 |
Vowel Omission Sign (Halant) |
|
् |
094D |
্ |
09CD |
੍ |
0A4D |
્ |
0ACD |
୍ |
0B4D |
் |
0BCD |
్ |
0C4D |
್ |
0CCD |
് |
0D4D |
| E9 |
Diacritic Sign (Nukta) |
|
़ |
093C |
় |
09BC |
਼ |
0A3C |
઼ |
0ABC |
଼ |
0B3C |
|
|
಼ |
0CBC |
|
| EA |
Full Stop (Viram, Northern Scripts) |
|
। |
0964 |
|
|
|
|
|
|
|
|
| EA* |
Vowel Stress Sign AVAGRAH Avagraha is a Devanāgarī symbol used to indicate prodelision of an . It is usually transliterated with an apostrophe, as in the Sanskrit philosophical expression ‘I am Shiva’. The avagraha is also used for prolonging vowel sounds in modern languages, for example Hindi for ‘Mãããã!’ when... |
|
ऽ |
093D |
ঽ |
09BD |
|
ઽ |
0ABD |
ଽ |
0B3D |
|
ఽ |
0C3D |
ಽ |
0CBD |
ഽ |
0D3D |
| EB |
Unused |
| EC |
Unused |
| ED |
Unused |
| EE |
Unused |
| EF |
Attribute Code |
|
|
|
|
|
|
|
|
|
|
| F0 |
Extension Code |
|
|
|
|
|
|
|
|
|
|
| F1 |
Digit 0 |
|
० |
0966 |
০ |
09E6 |
੦ |
0A66 |
૦ |
0AE6 |
୦ |
0B66 |
௦ |
0BE6 |
౦ |
0C66 |
೦ |
0CE6 |
൦ |
0D66 |
| F2 |
Digit 1 |
|
१ |
0967 |
১ |
09E7 |
੧ |
0A67 |
૧ |
0AE7 |
୧ |
0B67 |
௧ |
0BE7 |
౧ |
0C67 |
೧ |
0CE7 |
൧ |
0D67 |
| F3 |
Digit 2 |
|
२ |
0968 |
২ |
09E8 |
੨ |
0A68 |
૨ |
0AE8 |
୨ |
0B68 |
௨ |
0BE8 |
౨ |
0C68 |
೨ |
0CE8 |
൨ |
0D68 |
| F4 |
Digit 3 |
|
३ |
0969 |
৩ |
09E9 |
੩ |
0A69 |
૩ |
0AE9 |
୩ |
0B69 |
௩ |
0BE9 |
౩ |
0C69 |
೩ |
0CE9 |
൩ |
0D69 |
| F5 |
Digit 4 |
|
४ |
096A |
৪ |
09EA |
੪ |
0A6A |
૪ |
0AEA |
୪ |
0B6A |
௪ |
0BEA |
౪ |
0C6A |
೪ |
0CEA |
൪ |
0D6A |
| F6 |
Digit 5 |
|
५ |
096B |
৫ |
09EB |
੫ |
0A6B |
૫ |
0AEB |
୫ |
0B6B |
௫ |
0BEB |
౫ |
0C6B |
೫ |
0CEB |
൫ |
0D6B |
| F7 |
Digit 6 |
|
६ |
096C |
৬ |
09EC |
੬ |
0A6C |
૬ |
0AEC |
୬ |
0B6C |
௬ |
0BEC |
౬ |
0C6C |
೬ |
0CEC |
൬ |
0D6C |
| F8 |
Digit 7 |
|
७ |
096D |
৭ |
09ED |
੭ |
0A6D |
૭ |
0AED |
୭ |
0B6D |
௭ |
0BED |
౭ |
0C6D |
೭ |
0CED |
൭ |
0D6D |
| F9 |
Digit 8 |
|
८ |
096E |
৮ |
09EE |
੮ |
0A6E |
૮ |
0AEE |
୮ |
0B6E |
௮ |
0BEE |
౮ |
0C6E |
೮ |
0CEE |
൮ |
0D6E |
| FA |
Digit 9 |
|
९ |
096F |
৯ |
09EF |
੯ |
0A6F |
૯ |
0AEF |
୯ |
0B6F |
௯ |
0BEF |
౯ |
0C6F |
೯ |
0CEF |
൯ |
0D6F |
| FB |
Unused |
| FC |
Unused |
| FD |
Unused |
| FE |
Unused |
| FF |
Unused |
|
External links