Home  

Random  

Nearby  



Log in  



Settings  



Donate  



About Wikipedia  

Disclaimers  



Wikipedia





Zero-width space





Article  

Talk  



Language  

Watch  

Edit  





The zero-width space (ZWSP) is a non-printing character used in computerized typesetting to indicate where the word boundaries are, without actually displaying a visible space in the rendered text. This enables text-processing systems for scripts that do not use explicit spacing to recognize where word boundaries are for the purpose of handling line breaks appropriately. Zero-width space is unicode character U+200B, and is located in the unicode General Punctuation block, and can be represented by HTML entities ​or​.

Purpose

edit

The zero-width space marks a potential line break without hyphenation. Its semantics and HTML implementation are similar to the soft hyphen, but soft hyphens display a hyphen character at the point where the line is broken.

The zero-width space can be used to mark word breaks in languages without visible space between words, such as Thai, Myanmar, Khmer, and Japanese.[1][2]

Unlike fixed-width spaces, in justified text that increases spacing between letters, characters adjacent to the zero-width space are spaced as if it was not present.[2]

Example

edit

To show the effect of the zero-width space in text, the following words have been separated with zero-width spaces:

Lorem​Ipsum​Dolor​Sit​Amet​Consectetur​Adipiscing​Elit​Sed​Do​Eiusmod​Tempor​Incididunt​Ut​Labore​Et​Dolore​Magna​Aliqua​Ut​Enim​Ad​Minim​Veniam​Quis​Nostrud​Exercitation​Ullamco​Laboris​Nisi​Ut​Aliquip​Ex​Ea​Commodo​Consequat​Duis​Aute​Irure​Dolor​In​Reprehenderit​In​Voluptate​Velit​Esse​Cillum​Dolore​Eu​Fugiat​Nulla​Pariatur​Excepteur​Sint​Occaecat​Cupidatat​Non​Proident​Sunt​In​Culpa​Qui​Officia​Deserunt​Mollit​Anim​Id​Est​Laborum

And the following words have not been separated with these spaces:

LoremIpsumDolorSitAmetConsecteturAdipiscingElitSedDoEiusmodTemporIncididuntUtLaboreEtDoloreMagnaAliquaUtEnimAdMinimVeniamQuisNostrudExercitationUllamcoLaborisNisiUtAliquipExEaCommodoConsequatDuisAuteIrureDolorInReprehenderitInVoluptateVelitEsseCillumDoloreEuFugiatNullaPariaturExcepteurSintOccaecatCupidatatNonProidentSuntInCulpaQuiOfficiaDeseruntMollitAnimIdEstLaborum

The first text only breaks at word boundaries, while the second text will not be broken at all. Resizing the browser window will re-break the text accordingly.

Usage

edit

HTML

edit

InHTML pages, the HTML element <wbr> functions as a zero-width space. In Internet Explorer 6, the zero-width space was not supported in some fonts.[3]

Prohibition in domain names

edit

ICANN rules prohibit domain names from containing non-displayed characters, including the zero-width space, and most browsers prohibit their use within domain names because they can be used to create a homograph attack, where a malicious URL is visually indistinguishable from a legitimate one.[4][5]

Encoding

edit

The zero-width space character is encoded in UnicodeasU+200B ZERO WIDTH SPACE,[6] and input in HTML as &ZeroWidthSpace;, &#8203;or&#x200B;. Contrary to what their names suggest, the character entities &NegativeThickSpace;, &NegativeMediumSpace;, &NegativeThinSpace;, and &NegativeVeryThinSpace; also refer to the zero-width space.[7]

The TeX representation is \hskip0pt; the LaTeX representation is \hspace{0pt};[8] and the groff representation is \:.[9]

See also

edit

References

edit

Citations

edit
  1. ^ "Zones spéciales et caractères de formatage" [Special areas and formatting characters] (PDF). Hapax Quebec (in French). p. 3. Archived from the original (PDF) on 27 December 2005. Retrieved 31 July 2019. Les espaces sans chasse sont conçues pour les langues qui ne séparent pas les mots à l'aide d'espaces visibles, comme le thaï ou le japonais.
  • ^ a b The Unicode® Standard Version 15.0 – Core Specification (PDF). The Unicode Consortium. September 2022. p. 918. ISBN 978-1-936213-32-0.
  • ^ Dunae, Alex. "Better Web Typography with Spaces and Hyphens". dunae.ca. Archived from the original on December 14, 2010. Retrieved December 3, 2009.
  • ^ "Network.IDN.blacklist_chars". mozillaZine. Retrieved 2018-02-07.
  • ^ "Unicode Character 'Zero Width Space'". FileFormat.Info. Retrieved 2018-02-07.
  • ^ "General Punctuation – Unicode" (PDF). Retrieved 2013-07-20.
  • ^ Entities/ZeroWidthSpace in MathML Version 2.0
  • ^ "The LaTeX Companion. Chapter 3: Basic Formatting Tools" (PDF). Retrieved 2019-07-16.
  • ^ "groff(7) – Linux manual page". Retrieved 2014-02-08.
  • Sources

    edit

    Retrieved from "https://en.wikipedia.org/w/index.php?title=Zero-width_space&oldid=1235734030"
     



    Last edited on 20 July 2024, at 23:15  





    Languages

     



    Deutsch
    Español
    Esperanto
    Français

    Italiano
    Nederlands

    Português
    Русский


     

    Wikipedia


    This page was last edited on 20 July 2024, at 23:15 (UTC).

    Content is available under CC BY-SA 4.0 unless otherwise noted.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Terms of Use

    Desktop