Devanagari (/ˌdeɪvəˈnɑːɡəri/ DAY-və-NAH-gə-ree;[6] देवनागरी, IAST: Devanāgarī, Sanskrit pronunciation: [deːʋɐˈnaːɡɐriː]) is an Indic script used in northern India and Nepal. Also simply called Nāgari (Sanskrit: नागरि, Nāgari),[7] it is a left-to-right abugida (a type of segmental writing system),[8] based on the ancient Brāhmi script.[9] It is one of the official scripts of the Republic of India and Nepal. It was developed and in regular use by the 8th century CE[7] and achieved its modern form by 1200 CE.[10] The Devanāgari script, composed of 48 primary characters, including 14 vowels and 34 consonants,[11] is the fourth most widely adopted writing system in the world,[12][13] being used for over 120 languages.[14]
Devanāgari देवनागरी | |
---|---|
Script type | |
Time period | 12th century to present |
Direction | Left-to-right |
Official script | |
Languages | Apabhramsha, Angika, Awadhi, Bajjika, Bhili, Bhojpuri, Boro, Braj, Chhattisgarhi, Dogri, Garhwali, Haryanvi, Hindi, Khandeshi, Konkani, Kumaoni, Magahi, Maithili, Marathi, Marwari, Mundari, Nagpuri, Newari, Nepali, Pāli, Pahari, Prakrit, Rajasthani, Sanskrit, Santali, Sherpa, Surjapuri, and many more. |
Related scripts | |
Parent systems | |
Sister systems | Nandināgarī Kaithi Gujarātī Moḍī |
ISO 15924 | |
ISO 15924 | Deva (315), Devanagari (Nagari) |
Unicode | |
Unicode alias | Devanagari |
U+0900–U+097F Devanagari, U+A8E0–U+A8FF Devanagari Extended, U+11B00–11B5F Devanagari Extended-A, U+1CD0–U+1CFF Vedic Extensions | |
Part of a series on | |
---|---|
| |
Writing systems used in India | |
Brahmic scripts | |
Arabic derived scripts | |
Alphabetical scripts | |
Related | |
The orthography of this script reflects the pronunciation of the language.[14] Unlike the Latin alphabet, the script has no concept of letter case.[15] It is written from left to right, has a strong preference for symmetrical rounded shapes within squared outlines, and is recognisable by a horizontal line, known as a शिरोरेखा śirorekhā, that runs along the top of full letters.[8] In a cursory look, the Devanāgarī script appears different from other Indic scripts, such as Bengali-Assamese or Gurmukhi, but a closer examination reveals they are very similar except for angles and structural emphasis.[8]
Among the languages using it as a primary or secondary script are Marathi, Pāḷi, Sanskrit,[16] Hindi,[17] Boro, Nepali, Sherpa, Prakrit, Apabhramsha, Awadhi, Bhojpuri, Braj Bhasha,[18] Chhattisgarhi, Haryanvi, Magahi, Nagpuri, Rajasthani, Khandeshi, Bhili, Dogri, Maithili, Konkani, Nepal Bhasa, Mundari, Angika, Bajjika and Santali.[14] Kashmiri can also be written in Devanāgarī, but is predominantly written in the Perso-Arabic script, both in Pakistan administered Kashmir, and often by Kashmiri muslims in Indian administed Kashmir. Similarly, while Sindhi language is most commonly written in the perso-arabic based Sindhi script in Sindh, Pakistan, the migrant Sindhi community in India writes Sindhi in Devanagri script. The Devanāgarī script is closely related to the Nandināgarī script commonly found in numerous ancient manuscripts of South India,[19][20] and it is distantly related to a number of southeast Asian scripts.[14]
Etymology
Devanāgarī is formed by the addition of the word deva (देव) to the word nāgarī (नागरी). Nāgarī is an adjective derived from nagara (नगर), a Sanskrit word meaning "town" or "city," and literally means "urban" or "urbane".[21] The word Nāgarī (implicitly modifying lipi, "script") was used on its own to refer to a North Indian script, or perhaps a number of such scripts, as Al-Biruni attests in the 11th century; the form Devanāgarī is attested later, at least by the 18th century.[22] The name of the Nandināgarī script is also formed by adding a prefix to the generic script name nāgarī. The precise origin and significance of the prefix deva remains unclear.
History
Devanāgarī is part of the Brahmic family of scripts of India, Nepal, Tibet, and Southeast Asia.[23][24] It is a descendant of the 3rd century BCE Brāhmī script, which evolved into the Nagari script which in turn gave birth to Devanāgarī and Nandināgarī. Devanāgarī has been widely adopted across India and Nepal to write Sanskrit, Marathi, Hindi, Central Indo-Aryan languages, Konkani, Boro, and various Nepalese languages.
Some of the earliest epigraphic evidence attesting to the developing Sanskrit Nāgarī script in ancient India is from the 1st to 4th century CE inscriptions discovered in Gujarat.[9] Variants of script called nāgarī, recognisably close to Devanāgarī, are first attested from the 1st century CE Rudradaman inscriptions in Sanskrit, while the modern standardised form of Devanāgarī was in use by about 1000 CE.[10][25] Medieval inscriptions suggest widespread diffusion of Nāgarī-related scripts, with biscripts presenting local script along with the adoption of Nāgarī scripts. For example, the mid 8th-century Pattadakal pillar in Karnataka has text in both Siddha Matrika script, and an early Telugu-Kannada script; while, the Kangra Jawalamukhi inscription in Himachal Pradesh is written in both Sharada and Devanāgarī scripts.[26]
The Nāgarī script was in regular use by the 7th century CE, and it was fully developed by about the end of first millennium.[7][10] The use of Sanskrit in Nāgarī script in medieval India is attested by numerous pillar and cave-temple inscriptions, including the 11th-century Udayagiri inscriptions in Madhya Pradesh,[27] and an inscribed brick found in Uttar Pradesh, dated to be from 1217 CE, which is now held at the British Museum.[28] The script's prototypes and related versions have been discovered with ancient relics outside India, in places such as Sri Lanka, Myanmar and Indonesia. In East Asia, the Siddhaṃ matrika script (considered as the closest precursor to Nāgarī) was in use by Buddhists.[16][29] Nāgarī has been the primus inter pares of the Indic scripts.[16] It has long been used traditionally by religiously educated people in South Asia to record and transmit information, existing throughout the land in parallel with a wide variety of local scripts (such as Moḍī, Kaithi, and Mahajani) used for administration, commerce, and other daily uses.
Sharada remained in parallel use in Kashmir. An early version of Devanāgarī is visible in the Kutila inscription of Bareilly dated to VS 1049 (992 CE), which demonstrates the emergence of the horizontal bar to group letters belonging to a word.[30] One of the oldest surviving Sanskrit texts from the early post-Maurya period consists of 1,413 Nāgarī pages of a commentary by Patanjali, with a composition date of about 150 BCE, the surviving copy transcribed about 14th century CE.[31]
k- | kh- | g- | gh- | ṅ- | c- | ch- | j- | jh- | ñ- | ṭ- | ṭh- | ḍ- | ḍh- | ṇ- | t- | th- | d- | dh- | n- | p- | ph- | b- | bh- | m- | y- | r- | l- | v- | ś- | ṣ- | s- | h- | |
Brahmi | 𑀓 | 𑀔 | 𑀕 | 𑀖 | 𑀗 | 𑀘 | 𑀙 | 𑀚 | 𑀛 | 𑀜 | 𑀝 | 𑀞 | 𑀟 | 𑀠 | 𑀡 | 𑀢 | 𑀣 | 𑀤 | 𑀥 | 𑀦 | 𑀧 | 𑀨 | 𑀩 | 𑀪 | 𑀫 | 𑀬 | 𑀭 | 𑀮 | 𑀯 | 𑀰 | 𑀱 | 𑀲 | 𑀳 |
Gupta | |||||||||||||||||||||||||||||||||
Devanagari | क | ख | ग | घ | ङ | च | छ | ज | झ | ञ | ट | ठ | ड | ढ | ण | त | थ | द | ध | न | प | फ | ब | भ | म | य | र | ल | व | श | ष | स | ह |
East Asia
In the 7th century, under the rule of Songtsen Gampo of the Tibetan Empire, Thonmi Sambhota was sent to Nepal to open marriage negotiations with a Nepali princess and to find a writing system suitable for the Tibetan language. He then invented the Tibetan script based on the Nāgarī used in Kashmir. He added 6 new characters for sounds that did not exist in Sanskrit.[33]
Other scripts closely related to Nāgarī (such as Siddhaṃ) were introduced throughout East and Southeast Asia from the 7th to the 10th centuries CE: notably in Indonesia, Vietnam, and Japan.[34][35]
Most of the Southeast Asian scripts have roots in Dravidian scripts, but a few found in south-central regions of Java and isolated parts of southeast Asia resemble Devanāgarī or its prototypes. The Kawi script in particular is similar to the Devanāgarī in many respects, though the morphology of the script has local changes. The earliest inscriptions in the Devanāgarī-like scripts are from around the 10th century CE, with many more between the 11th and 14th centuries.[36][37]
Some of the old-Devanāgarī inscriptions are found in Hindu temples of Java, such as the Prambanan temple.[38] The Ligor and the Kalasan inscriptions of central Java, dated to the 8th century, are also in the Nāgarī script of north India. According to the epigraphist and Asian Studies scholar Lawrence Briggs, these may be related to the 9th century copper plate inscription of Devapaladeva (Bengal) which is also in early Devanāgarī script.[39] The term kawi in Kawi script is a loan word from kāvya (poetry). According to anthropologists and Asian studies scholars John Norman Miksic and Goh Geok Yian, the 8th century version of early Nāgarī or Devanāgarī script was adopted in Java, Bali, and Khmer around the 8th–9th centuries, as evidenced by the many contemporaneous inscriptions of this period.[40]
- Uṣṇīṣa Vijaya Dhāraṇī Sūtra in Siddhaṃ on palm leaf in 609 CE found in Hōryū-ji, Japan. The last line is a complete Sanskrit syllabary in Siddhaṃ script.
Letters
The letter order of Devanāgarī, like nearly all Brāhmic scripts, is based on phonetic principles that consider both the manner and place of articulation of the consonants and vowels they represent. This arrangement is usually referred to as the varṇamālā ("garland of letters").[41] The format of Devanāgarī for Sanskrit serves as the prototype for its application, with minor variations or additions, to other languages.[42]
Vowels
The vowels and their arrangement are:[43]
Independent form | IAST | ISO | IPA | As diacritic with प (Barakhadi) | Independent form | IAST | ISO | IPA | As diacritic with प (Barakhadi) | |||
---|---|---|---|---|---|---|---|---|---|---|---|---|
kaṇṭhya (Guttural) |
अ | a | [ɐ] | प | आ | ā | [aː] | पा | ||||
tālavya (Palatal) |
इ | i | [i] | पि | ई | ī | [iː] | पी | ||||
oṣṭhya (Labial) |
उ | u | [u] | पु 6 | ऊ | ū | [uː] | पू 6 | ||||
mūrdhanya (Retroflex) |
ऋ | ṛ | r̥ | [r̩] | पृ | ॠ 4 | ṝ | r̥̄ | [r̩ː] | पॄ | ||
dantya (Dental) |
ऌ 4 | ḷ | l̥ | [l̩] | पॢ | ॡ 4, 5 | ḹ | l̥̄ | [l̩ː] | पॣ | ||
kaṇṭhatālavya (Palatoguttural) |
ए | e | ē | [eː] | पे | ऐ | ai | [ɑj] | पै | |||
kaṇṭhoṣṭhya (Labioguttural) |
ओ | o | ō | [oː] | पो | औ | au | [ɑw] | पौ | |||
अं / ं 1,2 | ṃ | ṁ | [◌̃] | पं | अः / ः 1 | ḥ | [h] | पः | ||||
ॲ / ऍ 7 | ê | [æ] | पॅ | ऑ 7 | ô | [ɒ] | पॉ |
- Arranged with the vowels are two consonantal diacritics, the final nasal anusvāra ं ṃ and the final fricative visarga ः ḥ (called अं aṃ and अः aḥ). Masica (1991:146) notes of the anusvāra in Sanskrit that "there is some controversy as to whether it represents a homorganic nasal stop ..., a nasalised vowel, a nasalised semivowel, or all these according to context". The visarga represents post-vocalic voiceless glottal fricative [h], in Sanskrit an allophone of s, or less commonly r, usually in word-final position. Some traditions of recitation append an echo of the vowel after the breath:[44] इः [ihi]. Masica (1991:146) considers the visarga along with letters ङ ṅa and ञ ña for the "largely predictable" velar and palatal nasals to be examples of "phonetic overkill in the system".
- Another diacritic is the candrabindu/anunāsika ँ अँ. Salomon (2003:76–77) describes it as a "more emphatic form" of the anusvāra, "sometimes ... used to mark a true [vowel] nasalization". In a new Indo-Aryan language such as Hindi the distinction is formal: the candrabindu indicates vowel nasalisation[45] while the anusvār indicates a homorganic nasal preceding another consonant:[46] e.g., हँसी [ɦə̃si] "laughter", गंगा [ɡəŋɡɑ] "the Ganges". When an akṣara has a vowel diacritic above the top line, that leaves no room for the candra ("moon") stroke candrabindu, which is dispensed with in favour of the lone dot:[47] हूँ [ɦũ] "am", but हैं [ɦɛ̃] "are". Some writers and typesetters dispense with the "moon" stroke altogether, using only the dot in all situations.[48]
- The avagraha (ऽ अऽ) (usually transliterated with an apostrophe) is a Sanskrit punctuation mark for the elision of a vowel in sandhi: एकोऽयम् eko'yam ( ← एकस् ekas + अयम् ayam) ("this one"). An original long vowel lost to coalescence is sometimes marked with a double avagraha: सदाऽऽत्मा sadā'tmā ( ← सदा sadā + आत्मा ātmā) "always, the self".[49] In Hindi, Snell (2000:77) states that its "main function is to show that a vowel is sustained in a cry or a shout": आईऽऽऽ! āīīī!. In Madhyadeshi languages like Bhojpuri, Awadhi, Maithili, etc. which have "quite a number of verbal forms that end in that inherent vowel",[50] the avagraha is used to mark the non-elision of word-final inherent a, which otherwise is a modern orthographic convention: बइठऽ baiṭha "sit" versus बइठ baiṭh
- The syllabic consonants ॠ ṝ, ऌ ḷ, and ॡ ḹ are specific to Sanskrit and not included in the varṇamālā of other languages. The sound represented by ṛ has also been largely lost in the modern languages, and its pronunciation now ranges from [ɾɪ] (Hindi) to [ɾu] (Marathi).
- ḹ is not an actual phoneme of Sanskrit, but rather a graphic convention included among the vowels in order to maintain the symmetry of short–long pairs of letters.[42]
- There are non-regular formations of रु ru, रू rū, and हृ hṛ.
- There are two more vowels in Marathi, ॲ and ऑ, that respectively represent [æ], similar to the RP English pronunciation of ⟨a⟩ in act, and [ɒ], similar to the RP pronunciation of ⟨o⟩ in cot. These vowels are sometimes used in Hindi too, as in डॉलर dôlar ("dollar").[51] IAST transliteration is not defined. In ISO 15919, the transliteration is ê and ô, respectively.
- Kashmiri Devanagari uses letters like ॳ, ॴ, ॶ, ॷ, ऎ, ऒ, औ, ॵ to represent its vowels (see Kashmiri language#Devanagari).
Consonants
The table below shows the consonant letters (in combination with inherent vowel a) and their arrangement. To the right of the Devanāgarī letter it shows the Latin script transliteration using International Alphabet of Sanskrit Transliteration,[52] and the phonetic value (IPA) in Hindi.[53][54]
Phonetics → | sparśa (Occlusive) |
anunāsika (Nasal) |
antastha (Approximant) |
ūṣman/saṃgharṣī (Fricative) | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Voicing → | aghoṣa | saghoṣa | aghoṣa | saghoṣa | ||||||||||||
Aspiration → | alpaprāṇa | mahāprāṇa | alpaprāṇa | mahāprāṇa | alpaprāṇa | mahāprāṇa | ||||||||||
kaṇṭhya (Velar) |
क | ka [k] |
ख | kha [kʰ] |
ग | ga [ɡ] |
घ | gha [ɡʱ] |
ङ | ṅa [ŋ] |
ह | ha [ɦ] | ||||
tālavya (Palatal) |
च | ca [tʃ] |
छ | cha [tʃʰ] |
ज | ja [dʒ] |
झ | jha [dʒʱ] |
ञ | ña [ɲ] |
य | ya [j] |
श | śa [ʃ] |
||
mūrdhanya (Retroflex) |
ट | ṭa [ʈ] |
ठ | ṭha [ʈʰ] |
ड | ḍa [ɖ] |
ढ | ḍha [ɖʱ] |
ण | ṇa [ɳ] |
र | ra [r] |
ष | ṣa [ʂ] | ||
dantya (Dental) |
त | ta [t̪] |
थ | tha [t̪ʰ] |
द | da [d̪] |
ध | dha [d̪ʱ] |
न | na [n] |
ल | la [l] |
स | sa [s] | ||
oṣṭhya (Labial) |
प | pa [p] |
फ | pha [pʰ] |
ब | ba [b] |
भ | bha [bʱ] |
म | ma [m] |
व | va [ʋ] |
- Additionally, there is ळ ḷa (IPA: [ɭ] or [ɭ̆]), the intervocalic lateral flap allophone of the voiced retroflex stop in Vedic Sanskrit, which is a phoneme in languages such as Marathi, Konkani, Garhwali, and Rajasthani.[55]
- Beyond the Sanskritic set, new shapes have rarely been formulated. Masica (1991:146) offers the following, "In any case, according to some, all possible sounds had already been described and provided for in this system, as Sanskrit was the original and perfect language. Hence it was difficult to provide for or even to conceive other sounds, unknown to the phoneticians of Sanskrit". Where foreign borrowings and internal developments did inevitably accrue and arise in New Indo-Aryan languages, they have been ignored in writing, or dealt through means such as diacritics and ligatures (ignored in recitation).
- The most prolific diacritic has been the subscript dot (nuqtā) ़. Hindi uses it for the Persian, Arabic and English sounds क़ qa /q/, ख़ xa /x/, ग़ ġa /ɣ/, ज़ za /z/, झ़ zha /ʒ/, and फ़ fa /f/, and for the allophonic developments ड़ ṛa /ɽ/ and ढ़ ṛha /ɽʱ/.[56] (Although ऴ ḻa /ɻ/ could also exist, it is not used in Hindi.)
- Devanagari used to write Mahl dialect of Dhivehi uses nukta on च़, त़, द़, ल़, श़, स़, ह़ to represent other Perso-Arabic phonemes (see Maldivian writing systems#Devanagari script for Mahl).
- Sindhi's and Saraiki's implosives are accommodated with a line attached below: ॻ [ɠə], ॼ [ʄə], ॾ [ɗə], ॿ [ɓə].
- Aspirated sonorants may be represented as conjuncts/ligatures with ह ha: म्ह mha, न्ह nha, ण्ह ṇha, व्ह vha, ल्ह lha, ळ्ह ḷha, र्ह rha.
- Masica (1991:147) notes Marwari as using ॸ for ḍa [ɗə] (while ड represents [ɽə]).
- When used to write Avestan, Devanagari uses letters like ॹ /ʒ/ to represent its sounds.
Vowel diacritics
Table: Consonants with vowel diacritics. Vowels in their independent form on the top and in their corresponding dependent form (vowel sign) combined with the consonant 'k' on the bottom. 'ka' is without any added vowel sign, where the vowel 'a' is inherent.
a | ā | i | ī | u | ū | e | ê | ē | ai | o | ô | ō | au | r̥ | r̥̄ | l̥ | l̥̄ | ṁ | ḥ | m̐ | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
अ | आ | ॲ | ऑ | इ | ई | उ | ऊ | ऎ | ए | ऐ | ऒ | ओ | औ | ऋ | ॠ | ऌ | ॡ | अं | अः | अँ | |
ा | ि | ी | ु | ू | ॆ | ॅ | े | ै | ॊ | ॉ | ो | ौ | ृ | ॄ | ॢ | ॣ | ं | ः | ् | ँ | |
ka | kā | ki | kī | ku | kū | ke | kê | kē | kai | ko | kô | kō | kau | kr̥ | kr̥̄ | kl̥ | kl̥̄ | kaṁ | kaḥ | k | kam̐ |
क | का | कॅ | कॉ | कि | की | कु | कू | कॆ | के | कै | कॊ | को | कौ | कृ | कॄ | कॢ | कॣ | कं | कः | क् | कँ |
A vowel combines with a consonant in their diacritic form. For example, the vowel आ (ā) combines with the consonant क् (k) to form the syllabic letter का (kā), with halant (cancel sign) removed and added vowel sign which is indicated by diacritics. The vowel अ (a) combines with the consonant क् (k) to form क (ka) with halant removed. But the diacritic series of क, ख, ग, घ (ka, kha, ga, gha, respectively) is without any added vowel sign, as the vowel अ (a) is inherent.
The combinations of all consonants and vowels, each in alphabetical order, are laid out in the bārākhaḍī (बाराखडी) or bārahkhaṛī (बारहखड़ी) table. In the following barakhadi table, the transliteration of each combination will appear on mouseover:
a | ā | i | ī | u | ū | e | ai | o | au | aṁ | aḥ | |
अ | आ | इ | ई | उ | ऊ | ए | ऐ | ओ | औ | अं | अः | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
k- | क | का | कि | की | कु | कू | के | कै | को | कौ | कं | कः |
kh- | ख | खा | खि | खी | खु | खू | खे | खै | खो | खौ | खं | खः |
g- | ग | गा | गि | गी | गु | गू | गे | गै | गो | गौ | गं | गः |
gh- | घ | घा | घि | घी | घु | घू | घे | घै | घो | घौ | घं | घः |
ṅ- | ङ | ङा | ङि | ङी | ङु | ङू | ङे | ङै | ङो | ङौ | ङं | ङः |
c- | च | चा | चि | ची | चु | चू | चे | चै | चो | चौ | चं | चः |
ch- | छ | छा | छि | छी | छु | छू | छे | छै | छो | छौ | छं | छः |
j- | ज | जा | जि | जी | जु | जू | जे | जै | जो | जौ | जं | जः |
jh- | झ | झा | झि | झी | झु | झू | झे | झै | झो | झौ | झं | झः |
ñ- | ञ | ञा | ञि | ञी | ञु | ञू | ञे | ञै | ञो | ञौ | ञं | ञः |
ṭ- | ट | टा | टि | टी | टु | टू | टे | टै | टो | टौ | टं | टः |
ṭh- | ठ | ठा | ठि | ठी | ठु | ठू | ठे | ठै | ठो | ठौ | ठं | ठः |
ḍ- | ड | डा | डि | डी | डु | डू | डे | डै | डो | डौ | डं | डः |
ḍh- | ढ | ढा | ढि | ढी | ढु | ढू | ढे | ढै | ढो | ढौ | ढं | ढः |
ṇ- | ण | णा | णि | णी | णु | णू | णे | णै | णो | णौ | णं | णः |
t- | त | ता | ति | ती | तु | तू | ते | तै | तो | तौ | तं | तः |
th- | थ | था | थि | थी | थु | थू | थे | थै | थो | थौ | थं | थः |
d- | द | दा | दि | दी | दु | दू | दे | दै | दो | दौ | दं | दः |
dh- | ध | धा | धि | धी | धु | धू | धे | धै | धो | धौ | धं | धः |
n- | न | ना | नि | नी | नु | नू | ने | नै | नो | नौ | नं | नः |
p- | प | पा | पि | पी | पु | पू | पे | पै | पो | पौ | पं | पः |
ph- | फ | फा | फि | फी | फु | फू | फे | फै | फो | फौ | फं | फः |
b- | ब | बा | बि | बी | बु | बू | बे | बै | बो | बौ | बं | बः |
bh- | भ | भा | भि | भी | भु | भू | भे | भै | भो | भौ | भं | भः |
m- | म | मा | मि | मी | मु | मू | मे | मै | मो | मौ | मं | मः |
y- | य | या | यि | यी | यु | यू | ये | यै | यो | यौ | यं | यः |
r- | र | रा | रि | री | रु | रू | रे | रै | रो | रौ | रं | रः |
l- | ल | ला | लि | ली | लु | लू | ले | लै | लो | लौ | लं | लः |
v- | व | वा | वि | वी | वु | वू | वे | वै | वो | वौ | वं | वः |
ś- | श | शा | शि | शी | शु | शू | शे | शै | शो | शौ | शं | शः |
ṣ- | ष | षा | षि | षी | षु | षू | षे | षै | षो | षौ | षं | षः |
s- | स | सा | सि | सी | सु | सू | से | सै | सो | सौ | सं | सः |
h- | ह | हा | हि | ही | हु | हू | हे | है | हो | हौ | हं | हः |
Old forms
The following letter variants are also in use, particularly in older texts and in specific regions:[58]
Conjunct consonants
As mentioned, successive consonants lacking a vowel in between them may physically join as a conjunct consonant or ligature. When Devanāgarī is used for writing languages other than Sanskrit, conjuncts are used mostly with Sanskrit words and loan words. Native words typically use the basic consonant and native speakers know to suppress the vowel when it is conventional to do so. For example, the native Hindi word karnā is written करना (ka-ra-nā).[59] The government of these clusters ranges from widely to narrowly applicable rules, with special exceptions within. While standardised for the most part, there are certain variations in clustering, of which the Unicode used on this page is just one scheme. The following are a number of rules:
- 24 out of the 36 consonants contain a vertical right stroke (य ya, न na, ग ga etc.). As first or middle fragments/members of a cluster (when letters are to be written as half pronounced), they lose that stroke. e.g. त् + व = त्व tva, ण् + ढ = ण्ढ ṇḍha, स् + थ = स्थ stha. In Unicode, as in Hindi, these consonants without their vertical stems are called "half forms".[60] श śa appears as a different, simple ribbon-shaped fragment preceding व va, न na, च ca, ल la, and र ra, causing these second members to be shifted down and reduced in size. Thus श्व śva, श्न śna, श्च śca, श्ल śla, श्र śra, and शृ śṛi.
- र ra as a first member takes the form of a curved upward dash above the final character or its ā- diacritic. e.g. र्व rva, र्वा rvā, र्स्प rspa, र्स्पा rspā. In Marathi and Nepali, र ra as a first member of a conjunct also takes on an eyelash form when in front of glides and semivowels. e.g. र्य rya, र्व rva. As a final member with ट ṭa, ठ ṭha, ड ḍa, ढ ḍha, ड़ ṛa, छ cha, it is two lines together below the character pointed downwards. Thus ट्र ṭra, ठ्र ṭhra, ड्र ḍra, ढ्र ḍhra, ड़्र ṛra, छ्र chra. Elsewhere as a final member it is a diagonal stroke extending leftwards and down. e.g. क्र ग्र भ्र ब्र. त ta is shifted up to make the conjunct त्र tra.
- As first members, remaining characters lacking vertical strokes such as द da and ह ha may have their second member, reduced in size and lacking its horizontal stroke, placed underneath. क ka, छ cha, and फ pha shorten their right hooks and join them directly to the following member.
- The conjuncts for kṣa and jña are not clearly derived from the letters making up their components. The conjunct for kṣa is क्ष (क् + ष) and for jña it is ज्ञ (ज् + ञ).
Accent marks
The pitch accent of Vedic Sanskrit is written with various symbols depending on shakha. In the Rigveda, anudātta is written with a bar below the line (◌॒), svarita with a stroke above the line (◌॑) while udātta is unmarked.
Punctuation
The end of a sentence or half-verse may be marked with the "।" symbol (called a daṇḍa, meaning "bar", or called a pūrṇa virām, meaning "full stop/pause"). The end of a full verse may be marked with a double-daṇḍa, a "॥" symbol. A comma (called an alpa virām, meaning "short stop/pause") is used to denote a natural pause in speech.[61][62] Punctuation marks of Western origin, such as the colon, semicolon, exclamation mark, dash, and question mark have been in use in Devanāgarī script since at least the 1900s,[citation needed] matching their use in European languages.[63]
Fonts
A variety of Unicode fonts are in use for Devanāgarī. These include Akshar,[64] Annapurna,[65] Arial,[66] CDAC-Gist Surekh,[67] CDAC-Gist Yogesh,[68] Chandas,[69] Gargi,[70] Gurumaa,[71] Jaipur,[72] Jana,[73] Kalimati,[74] Kanjirowa,[75] Lohit Devanagari, Mangal,[76] Kokila,[77] ,Preeti,[78] Raghu,[79] Sanskrit2003,[80] Santipur OT,[81] Siddhanta, and Thyaka.[82]
The form of Devanāgarī fonts vary with function. According to Harvard College for Sanskrit studies:[81]
Uttara [companion to Chandas] is the best in terms of ligatures but, because it is designed for Vedic as well, requires so much vertical space that it is not well suited for the "user interface font" (though an excellent choice for the "original field" font). Santipur OT is a beautiful font reflecting a very early [medieval era] typesetting style for Devanagari. Sanskrit 2003[83] is a good all-around font and has more ligatures than most fonts, though students will probably find the spacing of the CDAC-Gist Surekh[67] font makes for quicker comprehension and reading.
The Google Fonts project has a number of Unicode fonts for Devanāgarī in a variety of typefaces in serif, sans-serif, display and handwriting categories.
Numerals
० | १ | २ | ३ | ४ | ५ | ६ | ७ | ८ | ९ |
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
Transliteration
There are several methods of Romanisation or transliteration from Devanāgarī to the Roman script.[84]
Hunterian system
The Hunterian system is the national system of romanisation in India, officially adopted by the Government of India.[85][86][87]
ISO 15919
A standard transliteration convention was codified in the ISO 15919 standard of 2001. It uses diacritics to map the much larger set of Brāhmic graphemes to the Latin script. The Devanāgarī-specific portion is nearly identical to the academic standard for Sanskrit, IAST.[88]
IAST
The International Alphabet of Sanskrit Transliteration (IAST) is the academic standard for the romanisation of Sanskrit. IAST is the de facto standard used in printed publications, like books, magazines, and electronic texts with Unicode fonts. It is based on a standard established by the Congress of Orientalists at Athens in 1912. The ISO 15919 standard of 2001 codified the transliteration convention to include an expanded standard for sister scripts of Devanāgarī.[88]
The National Library at Kolkata romanisation, intended for the romanisation of all Indic scripts, is an extension of IAST.
Harvard-Kyoto
Compared to IAST, Harvard-Kyoto looks much simpler. It does not contain all the diacritic marks that IAST contains. It was designed to simplify the task of putting large amount of Sanskrit textual material into machine readable form, and the inventors stated that it reduces the effort needed in transliteration of Sanskrit texts on the keyboard.[89] This makes typing in Harvard-Kyoto much easier than IAST. Harvard-Kyoto uses capital letters that can be difficult to read in the middle of words.
ITRANS
ITRANS is a lossless transliteration scheme of Devanāgarī into ASCII that is widely used on Usenet. It is an extension of the Harvard-Kyoto scheme. In ITRANS, the word devanāgarī is written "devanaagarii" or "devanAgarI". ITRANS is associated with an application of the same name that enables typesetting in Indic scripts. The user inputs in Roman letters and the ITRANS pre-processor translates the Roman letters into Devanāgarī (or other Indic languages). The latest version of ITRANS is version 5.30 released in July 2001. It is similar to Velthuis system and was created by Avinash Chopde to help print various Indic scripts with personal computers.[89]
Velthuis
The disadvantage of the above ASCII schemes is case-sensitivity, implying that transliterated names may not be capitalised. This difficulty is avoided with the system developed in 1996 by Frans Velthuis for TeX, loosely based on IAST, in which case is irrelevant.
ALA-LC Romanisation
ALA-LC[90] romanisation is a transliteration scheme approved by the Library of Congress and the American Library Association, and widely used in North American libraries. Transliteration tables are based on languages, so there is a table for Hindi,[91] one for Sanskrit and Prakrit,[92] etc.
WX
WX is a Roman transliteration scheme for Indian languages, widely used among the natural language processing community in India. It originated at IIT Kanpur for computational processing of Indian languages. The salient features of this transliteration scheme are as follows.
- Every consonant and every vowel has a single mapping into Roman. Hence it is a prefix code, advantageous from computation point of view.
- Lower-case letters are used for unaspirated consonants and short vowels, while capital letters are used for aspirated consonants and long vowels. While the retroflex stops are mapped to 't, T, d, D, N', the dentals are mapped to 'w, W, x, X, n'. Hence the name 'WX', a reminder of this idiosyncratic mapping.
Encodings
ISCII
ISCII is an 8-bit encoding. The lower 128 codepoints are plain ASCII, the upper 128 codepoints are ISCII-specific.
It has been designed for representing not only Devanāgarī but also various other Indic scripts as well as a Latin-based script with diacritic marks used for transliteration of the Indic scripts.
ISCII has largely been superseded by Unicode, which has, however, attempted to preserve the ISCII layout for its Indic language blocks.
Unicode
The Unicode Standard defines four blocks for Devanāgarī: Devanagari (U+0900–U+097F), Devanagari Extended (U+A8E0–U+A8FF), Devanagari Extended-A (U+11B00–11B5F), and Vedic Extensions (U+1CD0–U+1CFF).
Devanagari[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+090x | ऀ | ँ | ं | ः | ऄ | अ | आ | इ | ई | उ | ऊ | ऋ | ऌ | ऍ | ऎ | ए |
U+091x | ऐ | ऑ | ऒ | ओ | औ | क | ख | ग | घ | ङ | च | छ | ज | झ | ञ | ट |
U+092x | ठ | ड | ढ | ण | त | थ | द | ध | न | ऩ | प | फ | ब | भ | म | य |
U+093x | र | ऱ | ल | ळ | ऴ | व | श | ष | स | ह | ऺ | ऻ | ़ | ऽ | ा | ि |
U+094x | ी | ु | ू | ृ | ॄ | ॅ | ॆ | े | ै | ॉ | ॊ | ो | ौ | ् | ॎ | ॏ |
U+095x | ॐ | ॑ | ॒ | ॓ | ॔ | ॕ | ॖ | ॗ | क़ | ख़ | ग़ | ज़ | ड़ | ढ़ | फ़ | य़ |
U+096x | ॠ | ॡ | ॢ | ॣ | । | ॥ | ० | १ | २ | ३ | ४ | ५ | ६ | ७ | ८ | ९ |
U+097x | ॰ | ॱ | ॲ | ॳ | ॴ | ॵ | ॶ | ॷ | ॸ | ॹ | ॺ | ॻ | ॼ | ॽ | ॾ | ॿ |
Notes
|
Devanagari Extended[1] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+A8Ex | ꣠ | ꣡ | ꣢ | ꣣ | ꣤ | ꣥ | ꣦ | ꣧ | ꣨ | ꣩ | ꣪ | ꣫ | ꣬ | ꣭ | ꣮ | ꣯ |
U+A8Fx | ꣰ | ꣱ | ꣲ | ꣳ | ꣴ | ꣵ | ꣶ | ꣷ | ꣸ | ꣹ | ꣺ | ꣻ | ꣼ | ꣽ | ꣾ | ꣿ |
Notes
|
Devanagari Extended-A[1][2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+11B0x | 𑬀 | 𑬁 | 𑬂 | 𑬃 | 𑬄 | 𑬅 | 𑬆 | 𑬇 | 𑬈 | 𑬉 | ||||||
U+11B1x | ||||||||||||||||
U+11B2x | ||||||||||||||||
U+11B3x | ||||||||||||||||
U+11B4x | ||||||||||||||||
U+11B5x | ||||||||||||||||
Notes |
Vedic Extensions[1][2] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+1CDx | ᳐ | ᳑ | ᳒ | ᳓ | ᳔ | ᳕ | ᳖ | ᳗ | ᳘ | ᳙ | ᳚ | ᳛ | ᳜ | ᳝ | ᳞ | ᳟ |
U+1CEx | ᳠ | ᳡ | ᳢ | ᳣ | ᳤ | ᳥ | ᳦ | ᳧ | ᳨ | ᳩ | ᳪ | ᳫ | ᳬ | ᳭ | ᳮ | ᳯ |
U+1CFx | ᳰ | ᳱ | ᳲ | ᳳ | ᳴ | ᳵ | ᳶ | ᳷ | ᳸ | ᳹ | ᳺ | |||||
Notes |
Devanāgari keyboard layouts
InScript layout
InScript is the standard keyboard layout for Devanāgarī as standardized by the Government of India. It is inbuilt in all modern major operating systems. Microsoft Windows supports the InScript layout, which can be used to input unicode Devanāgarī characters. InScript is also available in some touchscreen mobile phones.
Typewriter
This layout was used on manual typewriters when computers were not available or were uncommon. For backward compatibility some typing tools like Indic IME still provide this layout.
Phonetic
Such tools work on phonetic transliteration. The user writes in the Latin alphabet and the IME automatically converts it into Devanāgarī. Some popular phonetic typing tools are Akruti, Baraha IME and Google IME.
The Mac OS X operating system includes two different keyboard layouts for Devanāgarī: one resembles the INSCRIPT/KDE Linux, while the other is a phonetic layout called "Devanāgarī QWERTY".
Any one of the Unicode fonts input systems is fine for the Indic language Wikipedia and other wikiprojects, including Hindi, Bhojpuri, Marathi, and Nepali Wikipedia. While some people use InScript, the majority uses either Google phonetic transliteration or the input facility Universal Language Selector provided on Wikipedia. On Indic language wikiprojects, the phonetic facility provided initially was java-based, and was later supported by Narayam extension for phonetic input facility. Currently Indic language Wiki projects are supported by Universal Language Selector (ULS), that offers both phonetic keyboard (Aksharantaran, Marathi: अक्षरांतरण, Hindi: लिप्यंतरण, बोलनागरी) and InScript keyboard (Marathi: मराठी लिपी).
The Ubuntu Linux operating system supports several keyboard layouts for Devanāgarī, including Harvard-Kyoto, WX notation, Bolanagari and phonetic. The 'remington' typing method in Ubuntu IBUS is similar to the Krutidev typing method, popular in Rajasthan. The 'itrans' method is useful for those who know English (and the English keyboard) well but are not familiar with typing in Devanāgarī.
See also
- Languages of India
- Clip font
- Devanāgarī transliteration
- Devanāgarī Braille
- ISCII
- Nagari Pracharini Sabha
- Nepali
- Schwa deletion in Indo-Aryan languages
- Shiksha – the Vedic study of sound, focusing on the letters of the Sanskrit alphabet
References
External links
Wikiwand in your browser!
Seamless Wikipedia browsing. On steroids.
Every time you click a link to Wikipedia, Wiktionary or Wikiquote in your browser's search results, it will show the modern Wikiwand interface.
Wikiwand extension is a five stars, simple, with minimum permission required to keep your browsing private, safe and transparent.