In writing, a space ( ) is a blank area that separates words, sentences, syllables (in syllabification) and other written or printed glyphs (characters). Conventions for spacing vary among languages, and in some languages the spacing rules are complex.[citation needed] Inter-word spaces ease the reader's task of identifying words, and avoid outright ambiguities such as "now here" vs. "nowhere". They also provide convenient guides for where a human or program may start new lines.
Space | |
---|---|
U+0020 SPACE (Note: Representations here of a regular space are replaced with a no-break space) | |
See also | |
U+00A0 NO-BREAK SPACE ( ,  ) Other types of spaces |
Typesetting can use spaces of varying widths, just as it can use graphic characters of varying widths. Unlike graphic characters, typeset spaces are commonly stretched in order to align text. The typewriter, on the other hand, typically has only one width for all characters, including spaces. Following widespread acceptance of the typewriter, some typewriter conventions influenced typography and the design of printed works.[citation needed]
Computer representation of text facilitates getting around mechanical and physical limitations such as character widths in at least two ways:
- Character encodings such as Unicode provide spaces of several widths, which are encoded using distinct numeric code points. For example, Unicode U+0020 is the "normal" space character, but U+00A0 adds the meaning that a new line should not be started there, while U+2003 represents a space with a fixed width of one em. Collectively, such characters are called Whitespace characters.
- Formatting and drawing languages and software commonly provide much more flexibility in spacing. For example, SVG, PostScript, and countless other languages enable drawing characters at specific (x,y) coordinates on a screen or page. By drawing each word at a specific starting coordinate, such programs need not "draw" spaces at all (this can lead to difficulties in extracting the correct text back out). Similarly, word processors can "fully justify" text, stretching inter-word spaces to make all lines the same length (as can mechanical Linotype machines). Precision is limited by physical capabilities of output devices.
Use in natural languages
Between words
This article needs attention from an expert in Linguistics. The specific problem is: At some point, this subsection contained a lot of incorrect claims about Semitic languages, and weasel-wording. It was clearly written by someone who doesn't know any of the relevant languages. Some claims remain largely unsupported and Euro-centric. This needs attention from a historical linguist. If such an expert can't be found, it is better to rewrite this with a focus on developments within individual writing systems - which is not how it's structured at the moment.. (August 2023) |
Modern English uses a space to separate words, but not all languages follow this practice. Spaces were not used to separate words in Latin until roughly 600–800 AD. Ancient Hebrew and Arabic did use spaces partly to compensate in clarity for the lack of vowels.[1] The earliest Greek script also used interpuncts to divide words rather than spacing, although this practice was soon displaced by the scriptura continua.
Word spacing was later used by Irish and Anglo-Saxon scribes, beginning after the creation of the Carolingian minuscule by Alcuin of York and the scribes' adoption of it. Spacing would become standard in Renaissance Italy and France, and then Byzantium by the end of the 16th century; then entering into the Slavic languages in Cyrillic in the 17th century, and only in modern times entering modern Sanskrit.[2][dubious – discuss]
CJK languages do not use spaces when dealing with text containing mostly Chinese characters and kana. In Japanese, spaces may occasionally be used to separate people's family names from given names, to denote omitted particles (especially the topic particle wa), and for certain literary or artistic effects. Modern Korean, however, has spaces as an essential part of its writing system (because of Western influence), given the phonetic nature of the hangul script that requires word dividers to avoid ambiguity, as opposed to Chinese characters which are mostly very distinguishable from each other. In Korean, spaces are used to separate chunks of nouns, nouns and particles, adjectives, and verbs; for certain compounds or phrases, spaces may be used or not, for example the phrase for "Republic of Korea" is usually spelled without spaces as 대한민국 rather than with a space as 대한 민국.
Runic texts use either an interpunct-like or a colon-like punctuation mark to separate words. There are two Unicode characters dedicated for this: U+16EB ᛫ RUNIC SINGLE PUNCTUATION and U+16EC ᛬ RUNIC MULTIPLE PUNCTUATION.
Between sentences
Languages with a Latin-derived alphabet have used various methods of sentence spacing since the advent of movable type in the 15th century.
- One space (some times called French spacing, q.v.). This is a common convention in most countries that use the ISO basic Latin alphabet for published and final written work, as well as digital (World Wide Web) media.[3] Web browsers usually do not differentiate between single and multiple spaces in source code when displaying text, unless the text is given a "white-space" CSS attribute. Without this being set, collapsing strings of spaces to a single space allow HTML source code to be spaced in a more machine-readable way, at the expense of control over the spacing of the rendered page.[4]
- Double space (English spacing). It is sometimes claimed that this convention stems from the use of the monospaced font on typewriters.[5] However, instructions to use more spacing between sentences than words date back centuries, and two spaces on a typewriter was the closest approximation to typesetters' previous rules aimed at improving readability.[6] Wider spacing continued to be used by both typesetters and typists until the Second World War, after which typesetters gradually transitioned to word spacing between sentences in published print, while typists continued the practice of using two spaces.[7][8][9][10][11][12][13][14][15][16][17]
- One widened space, typically one-and-a-third to slightly less than twice as wide as a word space. This spacing was sometimes used in typesetting before the 19th century. It has also been used in other non-typewriter typesetting systems such as the Linotype machine[18] and the TeX system.[19] Modern computer-based digital fonts can adjust the spacing after terminal punctuation as well, creating a space slightly wider than a standard word space.[20]
There has been some controversy regarding the proper amount of sentence spacing in typeset material. The Elements of Typographic Style states that only a single word space is required for sentence spacing.[21] Psychological studies suggest "readers benefit from having two spaces after periods."[22]
Unit symbols and numbers
The International System of Units (SI) prescribes inserting a space between a number and a unit of measurement (the space being regarded as an implied multiplication sign) but never between a prefix and a base unit; a space (or a multiplication dot) should also be used between units in compound units.[23]
- 5.0 cm, not 5.0cm or 5.0 c m or 5.0 cms
- 45 kg, not 45kg or 45 k g or 45 kgs
- 32 °C, not 32°C or 32° C
- 20 kN m or 20 kN⋅m, not 20 kNm or 20 k Nm
- π/2 rad, not π/2rad or π / 2 rad
- 50 %, not 50% or 50 percent (Note: % is not an SI unit, and many style guides do not follow this recommendation; 50% is used as adjective, e.g. to express concentration as in 50% acetic acid.)
The only exception to this rule is the traditional symbolic notation of angles: degree (e.g., 30°), minute of arc (e.g., 22′), and second of arc (e.g., 8″).
The SI also prescribes the use of a space[24] (often typographically a thin space) as a thousands separator where required. Both the point and the comma are reserved as decimal markers.
- 1 000 000 000 000 (thin space) or 1000000 not 1,000,000 or 1.000.000
- 1 000 000 000 000 (regular space which is significantly wider)
Sometimes a narrow non-breaking space or non-breaking space, respectively, is recommended (as in, for example, IEEE Standards[25] and IEC standards[26]) to avoid the separation of units and values or parts of compounds units, due to automatic line wrap and word wrap.
Encoding
Unicode defines many variants of a single whitespace character, with various properties; the more commonly encountered variations include:
- U+0020 SPACE
- U+00A0 NO-BREAK SPACE
- U+2002 EN SPACE
- U+2003 EM SPACE
In URLs, spaces are percent encoded with its ASCII/UTF-8 representation %20
.
Types of spaces
See also
References
Further reading
Wikiwand - on
Seamless Wikipedia browsing. On steroids.