Loading AI tools
Form of the Latin script used to write Czech language From Wikipedia, the free encyclopedia
Czech orthography is a system of rules for proper formal writing (orthography) in Czech. The earliest form of separate Latin script specifically designed to suit Czech was devised by Czech theologian and church reformist Jan Hus, the namesake of the Hussite movement, in one of his seminal works, De orthographia bohemica (On Bohemian orthography).
This article needs additional citations for verification. (September 2022) |
Czech alphabet Česká abeceda | |
---|---|
Script type | |
Time period | Since Jan Hus' Orthographia bohemica (early 15th century – present) |
Languages | Czech |
Related scripts | |
Parent systems | |
Child systems | Slovak alphabet Gaj's Latin alphabet Latvian alphabet Lithuanian alphabet |
Unicode | |
Subset of Latin | |
The modern Czech orthographic system is diacritic, having evolved from an earlier system which used many digraphs (although one digraph has been kept - ch). The caron is added to standard Latin letters to express sounds which are foreign to Latin. The acute accent is used for long vowels.
The Czech orthography is considered the model for many other Balto-Slavic languages using the Latin alphabet; Slovak orthography being its direct revised descendant, while the Serbo-Croatian Gaj's Latin alphabet and its Slovene descendant system are largely based on it. The Baltic languages, such as Latvian and Lithuanian, are also largely based on it. All of them make use of similar diacritics and also have a similar, usually interchangeable, relationship between the letters and the sounds they are meant to represent.[1]
The Czech alphabet consists of 42 letters.
Majuscule forms (uppercase/capital letters) | ||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
A | Á | B | C | Č | D | Ď | E | É | Ě | F | G | H | Ch | I | Í | J | K | L | M | N |
Ň | O | Ó | P | Q | R | Ř | S | Š | T | Ť | U | Ú | Ů | V | W | X | Y | Ý | Z | Ž |
Minuscule forms (lowercase/small letters) | ||||||||||||||||||||
a | á | b | c | č | d | ď | e | é | ě | f | g | h | ch | i | í | j | k | l | m | n |
ň | o | ó | p | q | r | ř | s | š | t | ť | u | ú | ů | v | w | x | y | ý | z | ž |
Letter | Name | Letter | Name | ||
---|---|---|---|---|---|
Uppercase | Lowercase | Uppercase | Lowercase | ||
A | a | á | Ň | ň | eň |
Á | á | dlouhé á; á s čárkou | O | o | ó |
B | b | bé | Ó | ó[lower-alpha 1] | dlouhé ó; ó s čárkou |
C | c | cé | P | p | pé |
Č | č | čé | Q | q | kvé |
D | d | dé | R | r | er |
Ď | ď | ďé | Ř | ř | eř |
E | e | é | S | s | es |
É | é | dlouhé é; é s čárkou | Š | š | eš |
Ě[lower-alpha 2] | ě | ije; é s háčkem | T | t | té |
F | f[lower-alpha 1] | ef | Ť | ť | ťé |
G | g[lower-alpha 1] | gé | U | u | ú |
H | h | há | Ú | ú | dlouhé ú; ú s čárkou |
Ch | ch | chá | Ů[lower-alpha 2] | ů | ů s kroužkem |
I | i | í; měkké i | V | v | vé |
Í | í | dlouhé í; dlouhé měkké í; í s čárkou; měkké í s čárkou | W | w | dvojité vé |
J | j | jé | X | x | iks |
K | k | ká | Y | y | ypsilon; krátké tvrdé ý |
L | l | el | Ý[lower-alpha 2] | ý | dlouhé ypsilon; dlouhé tvrdé ý; ypsilon s čárkou; tvrdé ý s čárkou |
M | m | em | Z | z | zet |
N | n | en | Ž | ž | žet |
The letters Q, W, and X are used exclusively in foreign words, and the former two are respectively replaced with KV and V once the word becomes "naturalized" (assimilated into Czech); the digraphs dz and dž are also used mostly for foreign words and are not considered to be distinct letters in the Czech alphabet.
Czech orthography is primarily phonemic (rather than phonetic) because an individual grapheme usually corresponds to an individual phoneme (rather than a sound). However, some graphemes and letter groups are remnants of historical phonemes which were used in the past but have since merged with other phonemes. Some changes in the phonology have not been reflected in the orthography.
Grapheme | IPA value | Notes |
---|---|---|
a | /a/ | |
á | /aː/ | |
e | /ɛ/ | |
é | /ɛː/ | |
ě | /ɛ/, /ʲɛ/ | Marks palatalization of preceding consonant; see usage rules below |
i | /ɪ/ | Palatalizes preceding ⟨d⟩, ⟨t⟩, or ⟨n⟩; see usage rules below |
í | /iː/ | Palatalizes preceding ⟨d⟩, ⟨t⟩, or ⟨n⟩; see usage rules below |
o | /o/ | |
ó | /oː/ | Occurs mostly in words of foreign origin. |
u | /u/ | |
ú | /uː/ | See usage rules below |
ů | /uː/ | See usage rules below |
y | /ɪ/ | See usage rules below |
ý | /iː/ | See usage rules below |
Grapheme | IPA value | Notes |
---|---|---|
b | /b/ | |
c | /t͡s/ [n 1] | |
č | /t͡ʃ/ [n 1] | |
d | /d/ | Represents /ɟ/ before ⟨i í ě⟩; see below |
ď | /ɟ/ | |
f | /f/ | Occurs mostly in words of foreign origin. |
g | /ɡ/ | Occurs mostly in words of foreign origin. [citation needed] |
h | /ɦ/ | |
ch | /x/ | |
j | /j/ | |
k | /k/ | |
l | /l/ | |
m | /m/ | |
n | /n/ | Represents /ɲ/ before ⟨i í ě⟩; see below |
ň | /ɲ/ | |
p | /p/ | |
r | /r/ | |
ř | /r̝/ [n 2] | |
s | /s/ | |
š | /ʃ/ | |
t | /t/ | Represents /c/ before ⟨i í ě⟩; see below |
ť | /c/ | |
v | /v/ | |
x | /ks/, /ɡz/ | Occurs only in words of foreign origin; pronounced /ɡz/ in words with the prefix 'ex-' before vowels or voiced consonants. |
z | /z/ | |
ž | /ʒ/ |
All the obstruent consonants are subject to voicing (before voiced obstruents except ⟨v⟩) or devoicing (before voiceless consonants and at the end of words); spelling in these cases is morphophonemic (i.e. the morpheme has the same spelling as before a vowel). An exception is the cluster ⟨sh⟩, in which the /s/ is voiced to /z/ only in Moravian dialects, while in Bohemia the /ɦ/ is devoiced to /x/ instead (e.g. shodit /sxoɟɪt/, in Moravia /zɦoɟɪt/). Devoicing /ɦ/ changes its articulation place: it becomes [x]. After unvoiced consonants ⟨ř⟩ is devoiced: for instance, in tři 'three', which is pronounced . Written voiced or voiceless counterparts are kept according to the etymology of the word, e.g. odpadnout [ˈotpadnoʊ̯t] (to fall away) - od- is a prefix; written /d/ is devoiced here because of the following voiceless /p/.
For historical reasons, the consonant [ɡ] is written k in Czech words like kde ('where', < Proto-Slavic *kъdě) or kdo ('who', < Proto-Slavic *kъto). This is because the letter g was historically used for the consonant [j]. The original Slavic phoneme /ɡ/ changed into /h/ in the Old-Czech period. Thus, /ɡ/ is not a separate phoneme (with a corresponding grapheme) in words of domestic origin; it occurs only in foreign words (e.g. graf, gram, etc.).
Unlike in English but like German and Russian, voiced consonants are pronounced voicelessly in the final position in words. In declension, they are voiced in cases where the words take on endings.
Compare:
The letters ⟨i⟩ and ⟨y⟩ are both pronounced [ɪ], while ⟨í⟩ and ⟨ý⟩ are both pronounced [iː]. ⟨y⟩ was originally pronounced [ɨ] as in contemporary Polish. However, in the 14th century, this difference in standard pronunciation disappeared, though it has been preserved in some Moravian dialects.[2] In words of native origin "soft" ⟨i⟩ and ⟨í⟩ cannot follow "hard" consonants, while "hard" ⟨y⟩ and ⟨ý⟩ cannot follow "soft" consonants; "neutral" consonants can be followed by either vowel:
Soft | ž, š, č, ř, c, j, ď, ť, ň |
---|---|
Neutral | b, f, l, m, p, s, v, z |
Hard | h, ch, k, r, d, t, n, g |
When ⟨i⟩ or ⟨í⟩ is written after ⟨d, t, n⟩ in native words, these consonants are soft, as if they were written ⟨ď, ť, ň⟩. That is, the sounds [ɟɪ, ɟiː, cɪ, ciː, ɲɪ, ɲiː] are written ⟨di, dí, ti, tí, ni, ní⟩ instead of ⟨ďi, ďí, ťi, ťí, ňi, ňí⟩, e.g. in čeština [ˈt͡ʃɛʃcɪna]. The sounds [dɪ, diː, tɪ, tiː, nɪ, niː] are denoted, respectively, by ⟨dy, dý, ty, tý, ny, ný⟩. In words of foreign origin, ⟨di, ti, ni⟩ are pronounced [dɪ, tɪ, nɪ]; that is, as if they were written ⟨dy, ty, ny⟩, e.g. in diktát, dictation.
Historically the letter ⟨c⟩ was hard, but this changed in the 19th century. However, in some words it is still followed by the letter ⟨y⟩: tác (plate) – tácy (plates).
Because neutral consonants can be followed by either ⟨i⟩ or ⟨y⟩, in some cases they distinguish homophones, e.g. být (to be) vs. bít (to beat), mýt (to wash) vs. mít (to have). At school pupils must memorize word roots and prefixes where ⟨y⟩ is written; ⟨i⟩ is written in other cases. Writing ⟨i⟩ or ⟨y⟩ in endings is dependent on the declension patterns.
The letter ⟨ě⟩ is a vestige of Old Czech palatalization. The originally palatalizing phoneme /ě/ [ʲɛ] became extinct, changing to [ɛ] or [jɛ], but it is preserved as a grapheme which can never appear in the initial position.
There are two ways in Czech to write long [uː]: ⟨ú⟩ and ⟨ů⟩. ⟨ů⟩ cannot occur in an initial position, while ⟨ú⟩ occurs almost exclusively in the initial position or at the beginning of a word root in a compound.
Historically, long ⟨ú⟩ changed into the diphthong ⟨ou⟩ [ou̯] (as also happened in the English Great Vowel Shift with words such as "house"), though not in word-initial position in the prestige form. In 1848 ⟨ou⟩ at the beginning of word-roots was changed into ⟨ú⟩ in words like ouřad to reflect this. Thus, the letter ⟨ú⟩ is written at the beginning of word-roots only: úhel (angle), trojúhelník (triangle), except in loanwords: skútr (scooter).
Meanwhile, historical long ⟨ó⟩ [oː] changed into the diphthong ⟨uo⟩ [ʊo]. As was common with scribal abbreviations, the letter ⟨o⟩ in the diphthong was sometimes written as a ring above the letter ⟨u⟩, producing ⟨ů⟩, e.g. kóň > kuoň > kůň (horse), like the origin of the German umlaut. Later, the pronunciation changed into [uː], but the grapheme ⟨ů⟩ has remained. It never occurs at the beginning of words: dům (house), domů (home, homeward).
The letter ⟨ů⟩ now has the same pronunciation as the letter ⟨ú⟩ (long [uː]), but alternates with a short ⟨o⟩ when a word is inflected (e.g. nom. kůň → gen. koně, nom. dům → gen. domu), thus showing the historical evolution of the language.
The predicate must be always in accordance with the subject in the sentence - in number and person (personal pronouns), and with past and passive participles also in gender. This grammatical principle affects the orthography (see also "Soft" I and "Hard" Y) – it is especially important for the correct choice and writing of plural endings of the participles.
Examples:
Gender | Sg. | Pl. | English |
---|---|---|---|
masculine animate | pes byl koupen | psi byli koupeni | a dog was bought/dogs were bought |
masculine inanimate | hrad byl koupen | hrady byly koupeny | a castle was bought/castles were bought |
feminine | kočka byla koupena | kočky byly koupeny | a cat was bought/cats were bought |
neuter | město bylo koupeno | města byla koupena | a town was bought/towns were bought |
The mentioned example shows both past (byl, byla ...) and passive (koupen, koupena ...) participles. The accordance in gender takes effect in the past tense and the passive voice, not in the present and future tenses in active voice.
If the complex subject is a combination of nouns of different genders, masculine animate gender is prior to others and the masculine inanimate and feminine genders are prior to the neuter gender.
Examples:
Priority of genders:
The use of the full stop (.), the colon (:), the semicolon (;), the question mark (?) and the exclamation mark (!) is similar to their use in other European languages. The full stop is placed after a number if it stands for ordinal numerals (as in German), e.g. 1. den (= první den) – the 1st day.
The comma is used to separate individual parts in complex-compound sentences, lists, isolated parts of sentences, etc. Its use in Czech is different from English. Subordinate (dependent) clauses must be always separated from their principal (independent) clauses, for instance. A comma is not placed before a (and), i (as well as), ani (nor) and nebo (or) when they connect parts of sentences or clauses in copulative conjunctions (on a same level). It must be placed in non-copulative conjunctions (consequence, emphasis, exclusion, etc.). A comma can, however, occur in front of the word a (and) if the former is part of comma-delimited parenthesis: Jakub, můj mladší bratr, a jeho učitel Filip byli příliš zabráni do rozhovoru. Probírali látku, která bude u zkoušky, a též, kdo na ní bude. A comma also separates subordinate conjunctions introduced by composite conjunctions a proto (and therefore) and a tak (and so).
Examples:
Quotation marks. The first one preceding the quoted text is placed to the bottom line:
Other types of quotation marks: ‚‘ »«
Apostrophes are used rarely in Czech. They can denote a missing sound in non-standard speech, but it is optional, e.g. řek' or řek (= řekl, he said).
The first word of every sentence and all proper names are capitalized. Special cases are:
In the 9th century, the Glagolitic script was used, during the 11th century it was replaced by Latin script. There are five periods in the development of the Czech Latin-based orthographic system:
In computing, several different coding standards have existed for this alphabet, among them:
Seamless Wikipedia browsing. On steroids.
Every time you click a link to Wikipedia, Wiktionary or Wikiquote in your browser's search results, it will show the modern Wikiwand interface.
Wikiwand extension is a five stars, simple, with minimum permission required to keep your browsing private, safe and transparent.