Loading AI tools
Glyph combining two or more letterforms From Wikipedia, the free encyclopedia
In writing and typography, a ligature occurs where two or more graphemes or letters are joined to form a single glyph. Examples are the characters ⟨æ⟩ and ⟨œ⟩ used in English and French, in which the letters ⟨a⟩ and ⟨e⟩ are joined for the first ligature and the letters ⟨o⟩ and ⟨e⟩ are joined for the second ligature. For stylistic and legibility reasons, ⟨f⟩ and ⟨i⟩ are often merged to create ⟨fi⟩ (where the tittle on the ⟨i⟩ merges with the hood of the ⟨f⟩); the same is true of ⟨s⟩ and ⟨t⟩ to create ⟨st⟩. The common ampersand, ⟨&⟩, developed from a ligature in which the handwritten Latin letters ⟨e⟩ and ⟨t⟩ (spelling et, Latin for 'and') were combined.[1]
The earliest known script Sumerian cuneiform and Egyptian hieratic both include many cases of character combinations that gradually evolve from ligatures into separately recognizable characters. Other notable ligatures, such as the Brahmic abugidas and the Germanic bind rune, figure prominently throughout ancient manuscripts. These new glyphs emerge alongside the proliferation of writing with a stylus, whether on paper or clay, and often for a practical reason: faster handwriting. Merchants especially needed a way to speed up the process of written communication and found that conjoining letters and abbreviating words for lay use was more convenient for record keeping and transaction than the bulky long forms.[citation needed]
Around the 9th and 10th centuries, monasteries became a fountainhead for these type of script modifications. Medieval scribes who wrote in Latin increased their writing speed by combining characters and by introducing notational abbreviations. Others conjoined letters for aesthetic purposes. For example, in blackletter, letters with right-facing bowls (⟨b⟩, ⟨o⟩, and ⟨p⟩) and those with left-facing bowls (⟨c⟩, ⟨e⟩, ⟨o⟩, ⟨d⟩, ⟨g⟩ and ⟨q⟩) were written with the facing edges of the bowls superimposed. In many script forms, characters such as ⟨h⟩, ⟨m⟩, and ⟨n⟩ had their vertical strokes superimposed.[citation needed] Scribes also used notational abbreviations to avoid having to write a whole character in one stroke. Manuscripts in the fourteenth century employed hundreds of such abbreviations.[citation needed]
In handwriting, a ligature is made by joining two or more characters in an atypical fashion by merging their parts, or by writing one above or inside the other. In printing, a ligature is a group of characters that is typeset as a unit, so the characters do not have to be joined. For example, in some cases the ⟨fi⟩ ligature prints the letters ⟨f⟩ and ⟨i⟩ with a greater separation than when they are typeset as separate letters. When printing with movable type was invented around 1450,[4] typefaces included many ligatures and additional letters, as they were based on handwriting. Ligatures made printing with movable type easier because one sort would replace frequent combinations of letters and also allowed more complex and interesting character designs which would otherwise collide with one another.[citation needed]
Because of their complexity, ligatures began to fall out of use in the 20th century. Sans serif typefaces, increasingly used for body text, generally avoid ligatures, though notable exceptions include Gill Sans and Futura. Inexpensive phototypesetting machines in the 1970s (which did not require journeyman knowledge or training to operate) also generally avoid them. A few, however, became characters in their own right, see below the sections about German ß, various Latin accented letters, & et al.
The trend against digraph use was further strengthened by the desktop publishing revolution. Early computer software in particular had no way to allow for ligature substitution (the automatic use of ligatures where appropriate), while most new digital typefaces did not include ligatures. As most of the early PC development was designed for the English language (which already treated ligatures as optional at best) dependence on ligatures did not carry over to digital. Ligature use fell as the number of traditional hand compositors and hot metal typesetting machine operators dropped because of the mass production of the IBM Selectric brand of electric typewriter in 1961. A designer active in the period commented: "some of the world's greatest typefaces were quickly becoming some of the world's worst fonts."[5]
Ligatures have grown in popularity in the 21st century because of an increasing interest in creating typesetting systems that evoke arcane designs and classical scripts. One of the first computer typesetting programs to take advantage of computer-driven typesetting (and later laser printers) was Donald Knuth's TeX program. Now the standard method of mathematical typesetting, its default fonts are explicitly based on nineteenth-century styles. Many new fonts feature extensive ligature sets; these include FF Scala, Seria and others by Martin Majoor and Hoefler Text by Jonathan Hoefler. Mrs Eaves by Zuzana Licko contains a particularly large set to allow designers to create dramatic display text with a feel of antiquity. A parallel use of ligatures is seen in the creation of script fonts that join letterforms to simulate handwriting effectively. This trend is caused in part by the increased support for other languages and alphabets in modern computing, many of which use ligatures somewhat extensively. This has caused the development of new digital typesetting techniques such as OpenType, and the incorporation of ligature support into the text display systems of macOS, Windows, and applications like Microsoft Office. An increasing modern trend is to use a "Th" ligature which reduces spacing between these letters to make it easier to read, a trait infrequent in metal type.[6][7][8]
Today, modern font programming divides ligatures into three groups, which can be activated separately: standard, contextual and historical. Standard ligatures are needed to allow the font to display without errors such as character collision. Designers sometimes find contextual and historic ligatures desirable for creating effects or to evoke an old-fashioned print look.[citation needed]
Many ligatures combine ⟨f⟩ with the following letter. A particularly prominent example is ⟨fi⟩ (or ⟨fi⟩, rendered with two normal letters). The tittle of the ⟨i⟩ in many typefaces collides with the hood of the ⟨f⟩ when placed beside each other in a word, and are combined into a single glyph with the tittle absorbed into the ⟨f⟩. Other ligatures with the letter f include ⟨fj⟩,[lower-alpha 1] ⟨fl⟩ (fl), ⟨ff⟩ (ff), ⟨ffi⟩ (ffi), and ⟨ffl⟩ (ffl). Ligatures for ⟨fa⟩, ⟨fe⟩, ⟨fo⟩, ⟨fr⟩, ⟨fs⟩, ⟨ft⟩, ⟨fb⟩, ⟨fh⟩, ⟨fu⟩, ⟨fy⟩, and for ⟨f⟩ followed by a full stop, comma, or hyphen are also used, as well as the equivalent set for the doubled ⟨ff⟩.[citation needed]
These arose because with the usual type sort for lowercase ⟨f⟩, the end of its hood is on a kern, which would be damaged by collision with raised parts of the next letter.[citation needed]
Ligatures crossing the morpheme boundary of a composite word are sometimes considered incorrect, especially in official German orthography as outlined in the Duden. An English example of this would be ⟨ff⟩ in shelfful; a German example would be Schifffahrt ("boat trip").[lower-alpha 2] Some computer programs (such as TeX) provide a setting to disable ligatures for German, while some users have also written macros to identify which ligatures to disable.[9][10]
Turkish distinguishes dotted and dotless "I". In a ligature with f (in words such as fırın and fikir), this contrast would be obscured. The ⟨fi⟩ ligature is therefore not used in Turkish typography, and neither are other ligatures like that for ⟨fl⟩, which would be rare anyway because of Turkish phonotactics.[citation needed]
Remnants of the ligatures ⟨ſʒ⟩ /⟨ſz⟩ ("sharp s", eszett) and ⟨tʒ⟩/⟨tz⟩ ("sharp t", tezett) from Fraktur, a family of German blackletter typefaces, originally mandatory in Fraktur but now employed only stylistically, can be seen to this day on street signs for city squares whose name contains Platz or ends in -platz. Instead, the "sz" ligature has merged into a single character, the German ß – see below.[citation needed]
Sometimes, ligatures for ⟨st⟩ (st), ⟨ſt⟩ (ſt), ⟨ch⟩, ⟨ck⟩, ⟨ct⟩, ⟨Qu⟩ and ⟨Th⟩ are used (e.g. in the typeface Linux Libertine).[citation needed]
Besides conventional ligatures, in the metal type era some newspapers commissioned custom condensed single sorts for the names of common long names that might appear in news headings, such as "Eisenhower", "Chamberlain", and others. In these cases the characters did not appear combined, just more tightly spaced than if printed conventionally.[11]
The German letter ⟨ß⟩ (Eszett, also called the scharfes S, meaning sharp s) is an official letter of the alphabet in Germany and Austria. There is no general consensus about its history. Its name Es-zett (meaning S-Z) suggests a connection of "long s and z" (ſʒ) but the Latin script also knows a ligature of "long s over round s" (ſs). The latter is used as the design principle for the character in most of today's typefaces. Since German was mostly set in blackletter typefaces until the 1940s, and those typefaces were rarely set in uppercase, a capital version of the Eszett never came into common use, even though its creation has been discussed since the end of the 19th century. Therefore, the common replacement in uppercase typesetting was originally SZ (Maße "measure" → MASZE, different from Masse "mass" → MASSE) and later SS (Maße → MASSE). Until 2017, the SS replacement was the only valid spelling according to the official orthography in Germany and Austria. In Switzerland, the ß is omitted altogether in favour of ss. The capital version (ẞ) of the Eszett character was occasionally used since 1905/06, has been part of Unicode since 2008, and has appeared in more and more typefaces. Since the end of 2010, the Ständiger Ausschuss für geographische Namen (StAGN) has suggested the new upper case character for "ß" rather than replacing it with "SS" or "SZ" for geographical names.[12] A new standardized German keyboard layout (DIN 2137-T2) has included the capital ß since 2012. The new character entered the official orthographic rules in June 2017.[citation needed]
A prominent feature of the colonial orthography created by John Eliot (later used in the first Bible printed in the Americas, the Massachusett-language Mamusse Wunneetupanatamwe Up-Biblum God, published in 1663) was the use of the double-o ligature ⟨ꝏ⟩ to represent the /u/ of food as opposed to the /ʊ/ of hook (although Eliot himself used ⟨oo⟩ and ⟨ꝏ⟩ interchangeably).[clarification needed] In the orthography in use since 2000 in the Wampanoag communities participating in the Wôpanâak Language Reclamation Project (WLRP), the ligature was replaced with the numeral ⟨8⟩, partly because of its ease in typesetting and display as well as its similarity to the o-u ligature ⟨Ȣ⟩ used in Abenaki. For example, compare the colonial-era spelling seepꝏash[13] with the modern WLRP spelling seep8ash.[14]
As the letter ⟨W⟩ is an addition to the Latin alphabet that originated in the seventh century, the phoneme it represents was formerly written in various ways. In Old English, the runic letter wynn ⟨Ƿ⟩) was used, but Norman influence forced wynn out of use. By the 14th century, the "new" letter ⟨W⟩, originated as two ⟨V⟩ glyphs or ⟨U⟩ glyphs joined, developed into a legitimate letter with its own position in the alphabet. Because of its relative youth compared to other letters of the alphabet, only a few European languages (English, Dutch, German, Polish, Welsh, Maltese, and Walloon) use the letter in native words.[citation needed]
The character ⟨Æ⟩ (lower case ⟨æ⟩; in ancient times named æsc) when used in Danish, Norwegian, Icelandic, or Old English is not a typographic ligature. It is a distinct letter — a vowel — and when collated, may be given a different place in the alphabetical order than Ae.[citation needed]
In modern English orthography, ⟨Æ⟩ is not considered an independent letter but a spelling variant, for example: "encyclopædia" versus "encyclopaedia" or "encyclopedia". In this use, ⟨Æ⟩ comes from Medieval Latin, where it was an optional ligature in some specific words that had been transliterated and borrowed from Ancient Greek, for example, "Æneas". It is still found as a variant in English and French words descended or borrowed from Medieval Latin, but the trend has recently been towards printing the ⟨A⟩ and ⟨E⟩ separately.[15]
Similarly, ⟨Œ⟩ and ⟨œ⟩, while normally printed as ligatures in French, are replaced by component letters if technical restrictions require it.[citation needed]
In German orthography, the umlauted vowels ⟨ä⟩, ⟨ö⟩, and ⟨ü⟩ historically arose from ⟨ae⟩, ⟨oe⟩, ⟨ue⟩ ligatures (strictly, from these vowels with a small letter ⟨e⟩ written as a diacritic, for example ⟨aͤ⟩, ⟨oͤ⟩, ⟨uͤ⟩). It is common practice to replace them with ⟨ae⟩, ⟨oe⟩, ⟨ue⟩ digraphs when the diacritics are unavailable, for example in electronic conversation. Phone books treat umlauted vowels as equivalent to the relevant digraph (so that a name Müller will appear at the same place as if it were spelled Mueller; German surnames have a strongly fixed orthography, either a name is spelled with ⟨ü⟩ or with ⟨ue⟩); however, the alphabetic order used in other books treats them as equivalent to the simple letters ⟨a⟩, ⟨o⟩ and ⟨u⟩. The convention in Scandinavian languages and Finnish is different: there the umlaut vowels are treated as independent letters with positions at the end of the alphabet.[citation needed]
In Middle English, the word the (written þe) was frequently abbreviated as a ⟨þ⟩ (thorn) with a small ⟨e⟩ written as a diacritic. Similarly, the word that was abbreviated to ⟨þ⟩ with a small ⟨t⟩ written as a diacritic. During the latter Middle English and Early Modern English periods, the thorn in its common script, or cursive, form came to resemble a ⟨y⟩ shape. With the arrival of movable type printing, the substitution of ⟨y⟩ for ⟨Þ⟩ became ubiquitous, leading to the common "ye", as in 'Ye Olde Curiositie Shoppe'. One major reason for this was that ⟨y⟩ existed in the printer's types that William Caxton and his contemporaries imported from Belgium and the Netherlands, while ⟨Þ⟩ did not.[16]
The ring diacritic used in vowels such as ⟨å⟩ likewise originated as an ⟨o⟩ -ligature.[17] Before the replacement of the older "aa" with "å" became a de facto practice, an "a" with another "a" on top (aͣ) could sometimes be used, for example in Johannes Bureus's, Runa: ABC-Boken (1611).[18] The ⟨uo⟩ ligature ů in particular saw use in Early Modern High German, but it merged in later Germanic languages with ⟨u⟩ (e.g. MHG fuosz, ENHG fuͦß, Modern German Fuß "foot"). It survives in Czech, where it is called kroužek.
The tilde diacritic, used in Spanish as part of the letter ⟨ñ⟩, representing the palatal nasal consonant, and in Portuguese for nasalization of a vowel, originated in ligatures where ⟨n⟩ followed the base letter: Espanna → España.[19] Similarly, the circumflex in French spelling stems from the ligature of a silent ⟨s⟩.[20]
The letter hwair (ƕ), used only in transliteration of the Gothic language, resembles a ⟨hw⟩ ligature. It was introduced by philologists around 1900 to replace the digraph ⟨hv⟩ formerly used to express the phoneme in question, e.g. by Migne in the 1860s (Patrologia Latina vol. 18).
The Byzantines had a unique o-u ligature ⟨Ȣ⟩ that, while originally based on the Greek alphabet's ο-υ, carried over into Latin alphabets as well. This ligature is still seen today on icon artwork in Greek Orthodox churches, and sometimes in graffiti or other forms of informal or decorative writing.[citation needed]
Gha ⟨ƣ⟩, a rarely used letter based on Q and G, was misconstrued by the ISO to be an OI ligature because of its appearance, and is thus known (to the ISO and, in turn, Unicode) as "Oi". Historically, it was used in many Latin-based orthographies of Turkic (e.g., Azerbaijani) and other central Asian languages.[citation needed]
The International Phonetic Alphabet formerly used ligatures to represent affricate consonants, of which six are encoded in Unicode: ʣ, ʤ, ʥ, ʦ, ʧ and ʨ. One fricative consonant is still represented with a ligature: ɮ, and the extensions to the IPA contain three more: ʩ, ʪ and ʫ.[citation needed]
The Initial Teaching Alphabet, a short-lived alphabet intended for young children, used a number of ligatures to represent long vowels: ⟨ꜷ⟩, ⟨æ⟩, ⟨œ⟩, ⟨ᵫ⟩, ⟨ꭡ⟩, and ligatures for ⟨ee⟩, ⟨ou⟩ and ⟨oi⟩ that are not encoded in Unicode. Ligatures for consonants also existed, including ligatures of ⟨ʃh⟩, ⟨ʈh⟩, ⟨wh⟩, ⟨ʗh⟩, ⟨ng⟩ and a reversed ⟨t⟩ with ⟨h⟩ (neither the reversed t nor any of the consonant ligatures are in Unicode).[citation needed]
Rarer ligatures also exist, including ⟨ꜳ⟩; ⟨ꜵ⟩; ⟨ꜷ⟩; ⟨ꜹ⟩; ⟨ꜻ⟩ (barred ⟨av⟩); ⟨ꜽ⟩; ⟨ꝏ⟩, which is used in medieval Nordic languages for /oː/ (a long close-mid back rounded vowel),[21] as well as in some orthographies of the Massachusett language to represent uː (a long close back rounded vowel); ᵺ; ỻ, which was used in Medieval Welsh to represent ɬ (the voiceless lateral fricative);[21] ꜩ; ᴂ; ᴔ; and ꭣ have Unicode codepoints (in code block Latin Extended-E for characters used in German dialectology (Teuthonista),[22] the Anthropos alphabet, Sakha and Americanist usage).[citation needed]
The most common ligature in modern usage is the ampersand ⟨&⟩ . This was originally a ligature of ⟨E⟩ and ⟨t⟩ , forming the Latin word "et", meaning "and". It has exactly the same use in French and in English. The ampersand comes in many different forms. Because of its ubiquity, it is generally no longer considered a ligature, but a logogram. Like many other ligatures, it has at times been considered a letter (e.g., in early Modern English); in English it is pronounced "and", not "et", except in the case of &c, pronounced "et cetera". In most typefaces, it does not immediately resemble the two letters used to form it, although certain typefaces use designs in the form of a ligature (examples include the original versions of Futura and Univers, Trebuchet MS, and Civilité, known in modern times as the italic of Garamond).[citation needed]
Similarly, the number sign ⟨#⟩ originated as a stylized abbreviation of the Roman term libra pondo, written as ℔.[23] Over time, the number sign was simplified to how it is seen today, with two horizontal strokes across two slash-like strokes.[24] Now a logogram, the symbol is used mainly to denote (in the US) numbers, and weight in pounds.[25] It has also been used popularly on push-button telephones and as the hashtag indicator.[26]
The at sign ⟨@⟩ is potentially a ligature, but there are many different theories about the origin. One theory says that the French word à (meaning at), was simplified by scribes who, instead of lifting the pen to write the grave accent, drew an arc around the "a". Another states that it is short for the Latin word for "toward", "ad", with the ⟨d⟩ being represented by the arc. Another says it is short for an abbreviation of the term each at, with the ⟨e⟩ encasing the ⟨a⟩.[27] Around the 18th century, it started being used in commerce to indicate price per unit, as "15 units @ $1".[28] After the popularization of Email, this fairly unpopular character became widely known, used to tag specific users.[29] Lately, it has been used to de-gender nouns in Spanish with no agreed pronunciation.
The dollar sign ⟨$⟩ possibly originated as a ligature (for "pesos", although there are other theories as well) but is now a logogram.[30] At least once, the United States dollar used a symbol resembling an overlapping U-S ligature, with the right vertical bar of the U intersecting through the middle of the S ( US ) to resemble the modern dollar sign.[31]
The Spanish peseta was sometimes symbolized by a ligature ⟨₧⟩ (from Pts), and the French franc was often symbolized by the ligature ⟨₣⟩ (from Fr).[citation needed]
In astronomy, the planetary symbol for Mercury (☿) may be a ligature of Mercury's caduceus and a cross (which was added in the 16th century to Christianize the pagan symbol),[32] though other sources disagree;[33] the symbol for Venus ♀ may be a ligature of the Greek letters ⟨ϕ⟩ (phi) and ⟨κ⟩ (kappa).[33] The symbol for Jupiter (♃) descends from a Greek zeta with a horizontal stroke, ⟨Ƶ⟩, as an abbreviation for Zeus.[32] [34] Saturn's astronomical symbol (♄) has been traced back to the Greek Oxyrhynchus Papyri, where it can be seen to be a Greek kappa-rho with a horizontal stroke, as an abbreviation for Κρονος (Cronus), the Greek name for the planet.[32] It later came to look like a lower-case Greek eta, with the cross added at the top in the 16th century to Christianize it. The dwarf planet Pluto is symbolized by a PL ligature, ♇.
A different PL ligature, ⅊, represents the property line in surveying. [citation needed]
In engineering diagrams, a CL ligature, ℄, represents the center line of an object.[citation needed]
The interrobang ⟨‽⟩ is an unconventional punctuation meant to combine the interrogation point (or the question mark) and the bang (printer's slang for exclamation mark) into one symbol, used to denote a sentence which is both a question and is exclaimed. For example, the sentence "Are you really coming over to my house on Friday‽" shows that the speaker is surprised while asking their question.[35]
Alchemy used a set of mostly standardized symbols, many of which were ligatures: 🜇 (AR, for aqua regia); 🜈 (S inside a V, for aqua vitae); 🝫 (MB, for balneum Mariae [Mary's bath], a double boiler); 🝬 (VB, for balneum vaporis, a steam bath); and 🝛 (aaa with overline, for amalgam).[citation needed]
Digraphs, such as ⟨ll⟩ in Spanish or Welsh, are not ligatures in the general case as the two letters are displayed as separate glyphs: although written together, when they are joined in handwriting or italic fonts the base form of the letters is not changed and the individual glyphs remain separate. Like some ligatures discussed above, these digraphs may or may not be considered individual letters in their respective languages. Until the 1994 spelling reform, the digraphs ⟨ch⟩ and ⟨ll⟩ were considered separate letters in Spanish for collation purposes. Catalan makes a difference between "Spanish ll" or palatalized l, written ll as in llei (law), and "French ll" or geminated l, written l·l as in col·lega (colleague).[citation needed]
The difference can be illustrated with the French digraph œu, which is composed of the ligature œ and the simplex letter u.[citation needed]
In Dutch, ⟨ij⟩ can be considered a digraph, a ligature, or a letter in itself, depending on the standard used. Its uppercase and lowercase forms are often available as a single glyph with a distinctive ligature in several professional typefaces (e.g. Zapfino). Sans serif uppercase ⟨IJ⟩ glyphs, popular in the Netherlands, typically use a ligature resembling a ⟨U⟩ with a broken left-hand stroke. Adding to the confusion, Dutch handwriting can render ⟨y⟩ (which is not found in native Dutch words, but occurs in words borrowed from other languages) as a ⟨ij⟩-glyph without the dots in its lowercase form and the ⟨IJ⟩ in its uppercase form looking virtually identical (only slightly bigger). When written as two separate letters, both should be capitalized – or both not – to form a correctly spelled word, like IJs or ijs (ice).[citation needed]
Ligatures are not limited to Latin script:
Written Chinese has a long history of creating new characters by merging parts or wholes of other Chinese characters. However, a few of these combinations do not represent morphemes but retain the original multi-character (multiple morpheme) reading and are therefore not considered true characters themselves. In Chinese, these ligatures are called héwén (合文) or héshū (合書); see polysyllabic Chinese characters for more.
One popular ligature used on chūntiē decorations used for Chinese Lunar New Year is a combination of the four characters for zhāocái jìnbǎo (招財進寶), meaning "ushering in wealth and fortune" and used as a popular New Year's greeting.
In 1924, Du Dingyou (杜定友; 1898–1967) created the ligature 圕 from two of the three characters 圖書館 (túshūguǎn), meaning "library".[37] Although it does have an assigned pronunciation of tuān and appears in many dictionaries, it is not a morpheme and cannot be used as such in Chinese. Instead, it is usually considered a graphic representation of túshūguǎn.
In recent years, a Chinese internet meme, the Grass Mud Horse, has had such a ligature associated with it combining the three relevant Chinese characters 草, 泥, and 马 (Cǎonímǎ).
Similar to the ligatures were several "two-syllable Chinese characters" (雙音節漢字) created in the 19th century as Chinese characters for SI units. In Chinese these units are disyllabic and standardly written with two characters, as 厘米 límǐ "centimeter" (厘 centi-, 米 meter) or 千瓦 qiānwǎ "kilowatt". However, in the 19th century these were often written via compound characters, pronounced disyllabically, such as 瓩 for 千瓦 or 糎 for 厘米 – some of these characters were also used in Japan, where they were pronounced with borrowed European readings instead. These have now fallen out of general use, but are occasionally seen.[38]
The CJK Compatibility Unicode block features characters that have been combined into one square character in legacy character set so that it matches Japanese text. For example, the Japanese equivalent of "stock company", 株式会社 (kabushiki gaisha) can be represented in 1 Unicode character ⟨㍿⟩. Its romanized abbreviation K.K. can also be 1 character ⟨㏍⟩. There are other Latin abbreviations such as kg for "kilogram" that can be ligated into 1 square character ⟨㎏⟩.
The OpenType font format includes features for associating multiple glyphs to a single character, used for ligature substitution. Typesetting software may or may not implement this feature, even if it is explicitly present in the font's metadata. XeTeX is a TeX typesetting engine designed to make the most of such advanced features. This type of substitution used to be needed mainly for typesetting Arabic texts, but ligature lookups and substitutions are being put into all kinds of Western Latin OpenType fonts. In OpenType, there are standard liga
, historical hlig
, contextual clig
, discretionary dlig
and required rlig
ligatures.
Opinion is divided over whether it is the job of writers or typesetters to decide where to use ligatures. TeX is an example of a computer typesetting system that makes use of ligatures automatically. The Computer Modern Roman typeface provided with TeX includes the five common ligatures ⟨ff⟩ , ⟨fi⟩ , ⟨fl⟩ , ⟨ffi⟩ , and ⟨ffl⟩ . When TeX finds these combinations in a text, it substitutes the appropriate ligature, unless overridden by the typesetter.
CSS3 provides control over these properties using font-feature-settings
,[39] though the CSS Fonts Module Level 4 draft standard indicates that authors should prefer several other properties.[40] Those include font-variant-ligatures
, common-ligatures
, discretionary-ligatures
, historical-ligatures
, and contextual
.[41]
This table below shows discrete letter pairs on the left, the corresponding Unicode ligature in the middle column, and the Unicode code point on the right. Provided you are using an operating system and browser that can handle Unicode, and have the correct Unicode fonts installed, some or all of these will display correctly. See also the provided graphic.
Unicode maintains that ligaturing is a presentation issue rather than a character definition issue, and that, for example, "if a modern font is asked to display 'h' followed by 'r', and the font has an 'hr' ligature in it, it can display the ligature." Accordingly, the use of the special Unicode ligature characters is "discouraged", and "no more will be encoded in any circumstances".[42] (Unicode has continued to add ligatures, but only in such cases that the ligatures were used as distinct letters in a language or could be interpreted as standalone symbols. For example, ligatures such as æ and œ are not used to replace arbitrary "ae" or "oe" sequences; it is generally considered incorrect to write "does" as "dœs".)
Microsoft Word disables ligature substitution by default, largely for backward compatibility when editing documents created in earlier versions of Word. Users can enable automatic ligature substitution on the Advanced tab of the Font dialog box.
LibreOffice Writer enables standard ligature substitution by default for OpenType fonts, user can enable or disable any ligature substitution on the Features dialog box, which is accessible via the Features button of the Character dialog box, or alternatively, input a syntax with font name and feature into the Font Name input box, for example: Noto Sans:liga=0.
Non-ligature | Ligature[42] | Unicode | HTML |
---|---|---|---|
AA, aa | Ꜳ, ꜳ[21] | U+A732, U+A733 | Ꜳ ꜳ |
AE, ae | Æ, æ | U+00C6, U+00E6 | Æ æ |
AO, ao | Ꜵ, ꜵ[21] | U+A734, U+A735 | Ꜵ ꜵ |
AU, au | Ꜷ, ꜷ[21] | U+A736, U+A737 | Ꜷ ꜷ |
AV, av | Ꜹ, ꜹ[21] | U+A738, U+A739 | Ꜹ ꜹ |
AV, av (with bar) | Ꜻ, ꜻ[21] | U+A73A, U+A73B | Ꜻ ꜻ |
AY, ay | Ꜽ, ꜽ[21] | U+A73C, U+A73D | Ꜽ ꜽ |
et | 🙰 | U+1F670 | 🙰 |
ff | ff | U+FB00 | ff |
ffi | ffi | U+FB03 | ffi |
ffl | ffl | U+FB04 | ffl |
fi | fi | U+FB01 | fi |
fl | fl | U+FB02 | fl |
Hv, hv | Ƕ, ƕ | U+01F6, U+0195 | Ƕ ƕ |
lb | ℔ | U+2114 | ℔ ℔ |
lL, ll | Ỻ, ỻ | U+1EFA, U+1EFB | Ỻ ỻ |
OE, oe | Œ, œ | U+0152, U+0153 | Œ œ |
OO, oo | Ꝏ, ꝏ[21] | U+A74E, U+A74F | Ꝏ ꝏ |
ɔe | ꭢ | U+AB62 | ꭢ |
ſs, ſz | ẞ, ß | U+1E9E, U+00DF | ß |
st | st | U+FB06 | st |
ſt | ſt | U+FB05 | ſt |
TZ, tz | Ꜩ, ꜩ | U+A728, U+A729 | Ꜩ ꜩ |
ue | ᵫ | U+1D6B | ᵫ |
uo | ꭣ[43] | U+AB63 | ꭣ |
VV, vv | W, w | U+0057, U+0077 | W w |
VY, vy | Ꝡ, ꝡ[21] | U+A760, U+A761 | Ꝡ ꝡ |
ſs | Ꟗ ꟗ | U+A7D6, U+A7D7 | ꟗ ꟗ |
ƿƿ | ꟕ | U+A7D5 | ꟕ ꟕ |
þþ | ꟓ | U+A7D3 | ꟓ ꟓ |
There are separate code points for the digraph DZ, the Dutch digraph IJ, and for the Serbo-Croatian digraphs DŽ, LJ, and NJ. Although similar, these are digraphs, not ligatures. See Digraphs in Unicode.
Ligature[42] | Unicode | HTML | |
---|---|---|---|
superscript small capital AA | 𐞀[44][45] | U+10780 | 𐞀 |
superscript ae | 𐞃[46] | U+10783 | 𐞃 |
aə | ꬱ[43] | U+AB31 | ꬱ |
əø | ꭁ | U+AB41 | ꭁ |
db[lower-alpha 3] | ȸ | U+0238 | ȸ |
dz | ʣ | U+02A3 | ʣ |
dʐ | ꭦ[47] | U+AB66 | ꭦ |
dʑ (or dz curl) | ʥ | U+02A5 | ʥ |
dʒ (or dezh) | ʤ | U+02A4 | ʤ |
dʒ with palatal hook | 𝼒[48][45] | U+1DF12 | 𝼒 |
dʒ with retroflex hook | 𝼙[49] | U+1DF19 | 𝼙 |
fŋ (or feng) | ʩ | U+02A9 | ʩ |
Superscript fŋ | 𐞐[44][45] | U+10790 | 𐞐 |
fŋ with trill | 𝼀[44][45] | U+1DF00 | 𝼀 |
ls (or less) | ʪ | U+02AA | ʪ |
superscript ls | 𐞙[44][45] | U+10799 | 𐞙 |
lz | ʫ | U+02AB | ʫ |
superscript lz | 𐞚[44][45] | U+1079A | 𐞚 |
lʒ (or lezh) | ɮ | U+026E | ɮ |
superscript lʒ | 𐞞[44] | U+1079E | 𐞞 |
lʒ with retroflex hook | 𝼅[44][45] | U+1DF05 | 𝼅 |
superscript lʒ with retroflex hook | 𐞟[44] | U+1079F | 𐞟 |
oə | ꭀ | U+AB40 | ꭀ |
qp[lower-alpha 3] | ȹ | U+0239 | ȹ |
tɕ (or tc curl) | ʨ | U+02A8 | ʨ |
superscript tɕ | 𐞫[46] | U+107AB | 𐞫 |
ts (or tess) | ʦ | U+02A6 | ʦ |
superscript ts | 𐞬[46] | U+107AC | 𐞬 |
ts with retroflex hook | ꭧ | U+AB67 | ꭧ |
superscript ts with retroflex hook | 𐞭[46] | U+107AD | 𐞭 |
tʂ | ꭧ[47] | U+AB67 | ꭧ |
tʃ (or tesh) | ʧ | U+02A7 | ʧ |
superscript tʃ | 𐞮[46] | U+107AE | 𐞮 |
tʃ with retroflex hook | 𝼜[49] | U+1DF1C | 𝼜 |
tʃ with palatal hook | 𝼗[48][45] | U+1DF17 | 𝼗 |
ui | ꭐ[51] | U+AB50 | ꭐ |
turned ui | ꭑ[51] | U+AB51 | ꭑ |
uu | ɯ | U+026F | ɯ |
Four "ligature ornaments" are included from U+1F670 to U+1F673 in the Ornamental Dingbats block: regular and bold variants of ℯT (script e and T) and of ɛT (open E and T).
Typographic ligatures are used in a form of contemporary art,[52] as can be illustrated by Chinese artist Xu Bing's work in which he combines Latin letters to form characters that resemble Chinese.[53] Croatian designer Maja Škripelj also created a ligature that combined Glagolitic letters ⰘⰓ for euro coins.[54]
Seamless Wikipedia browsing. On steroids.
Every time you click a link to Wikipedia, Wiktionary or Wikiquote in your browser's search results, it will show the modern Wikiwand interface.
Wikiwand extension is a five stars, simple, with minimum permission required to keep your browsing private, safe and transparent.