Caron






From Wikipedia, the free encyclopedia
 


◌̌

Caron

U+030C ◌̌ COMBINING CARON

A caron[a] is a diacritic mark (◌̌) commonly placed over certain letters in the orthography of some languages to indicate a change in the pronunciation of the letter.

The symbol is common in the Baltic, Slavic, Finnic, Samic and Berber languages.

The use of the caron differs according to the orthographic rules of a language. In most Slavic and other European languages it indicates present or historical palatalization (e → ě; [e] → [ʲe]), iotation, or postalveolar articulation (c → č; [ts] → [tʃ]). In Salishan languages, it often represents a uvular consonant (x → x̌; [x] → [χ]). When placed over vowel symbols, the caron can indicate a contour tone, for instance the falling and then rising tone in the Pinyin romanization of Mandarin Chinese. It is also used to decorate symbols in mathematics, where it is often pronounced /tʃɛk/ ("check").

The caron is shaped approximately like a small letter "v". For serif typefaces, the caron generally has one of two forms: either symmetrical, essentially identical to a rotated circumflex; or with the left stroke thicker than the right, like the usual serif form of the letter "v" (but without serifs). The latter form is often preferred by Czech designers for use in Czech, while for other uses the symmetrical form tends to predominate,[2] as it does also among sans-serif fonts.

Caron vs. breve
Caron Breve
ǎ ă

The caron is not to be confused with the breve (◌̆), which has a curved bottom, while the caron is pointed (see illustration).

Names

Different disciplines generally refer to this diacritic mark by different names. Typography tends to use the term caron. Linguistics more often uses haček (with no long mark[citation needed]), largely due to the influence of the Prague School (particularly on Structuralist linguists who subsequently developed alphabets for previously unwritten languages of the Americas). Pullum and Ladusaw's Phonetic Symbol Guide (Chicago, 1996) uses the term wedge.

The term caron is used in the official names of Unicode characters (e.g., "LATIN CAPITAL LETTER C WITH CARON"). The Unicode Consortium explicitly states[3] that the reason for this is unknown, but its earliest known use was in the United States Government Printing Office Style Manual of 1967, and it was later used in character sets such as DIN 31624 (1979), ISO 5426 (1980), ISO/IEC 6937 (1983) and ISO/IEC 8859-2 (1985).[4] Its actual origin remains obscure, but some have suggested that it may derive from a fusion of caret and macron.[5] Though this may be folk etymology, it is plausible, particularly in the absence of other suggestions. A Unicode technical note states that the name "hacek" should have been used instead.[6]

The Oxford English Dictionary gives 1953 as the earliest appearance in English for háček. In Czech, háček ([ˈɦaːtʃɛk]) means 'small hook', the diminutive form of hák ([ˈɦaːk], 'hook'). The name appears in most English dictionaries, but they treat the long mark (acute accent) differently. British dictionaries, such as the OED, ODE and CED, write háček (with the mark) in the headwords, while American ones, such as Merriam-Webster, NOAD and AHD, omit the acute and write haček; the NOAD, however, gives háček as an alternative spelling.

In Slovak it is called mäkčeň ([ˈmɛɐktʂeɲ], i.e., 'softener' or 'palatalization mark'), in Serbo-Croatian kvaka or kvačica ('angled hook' or 'small angled hook'), in Slovenian strešica ('little roof') or kljukica ('little hook'), in Lithuanian paukščiukas ('little bird') or varnelė ('little jackdaw'), in Estonian katus ('roof'), in Finnish hattu ('hat'), and in Lakota ičášleče ('wedge').[citation needed]

Origin

The caron evolved from the dot above diacritic, which Jan Hus introduced into Czech orthography (along with the acute accent) in his De Orthographia Bohemica (1412). The original form still exists in Polish ż. However, Hus's work was hardly known at that time, and háček became widespread only in the 16th century with the introduction of printing.[7]

Usage

Most northwestern Uralic languages that use the Latin alphabet, such as Karelian, Veps, Northern Sami and Inari Sami (though not Southern Sami), use the caron only for the fricatives š [ʃ] and ž [ʒ] and the affricate č [tʃ]. Estonian and Finnish use š and ž (but not č), and only for transcribing foreign names and loanwords (albeit common loanwords such as šekki or tšekk 'check'); the sounds (and letters) are native and common in Karelian, Veps and Sami.

In Italian, š, ž, and č are routinely used as in Slovenian to transcribe Slavic names in the Cyrillic script since in native Italian words, the sounds represented by these letters must be followed by a vowel, and Italian uses ch for /k/, not /tʃ/. Other Romance languages, by contrast, tend to use their own orthographies, or in a few cases such as Spanish, borrow English sh or zh.

The caron is also used in the Romany alphabet. The Faggin-Nazzi writing system for Friulian makes use of the caron over the letters c, g, and s.[8]

The caron is also often used as a diacritical mark on consonants for romanization of text from non-Latin writing systems, particularly in the scientific transliteration of Slavic languages. Philologists and the standard Finnish orthography often prefer using it to express sounds for which English requires a digraph (sh, ch, and zh) because most Slavic languages use only one character to spell the sounds (the key exceptions are Polish sz and cz). Its use for that purpose can even be found in the United States because certain atlases use it in romanization of foreign place names. On the typographical side, Š/š and Ž/ž are likely the easiest among non-Western European diacritic characters to adopt for Westerners because the two are part of the Windows-1252 character encoding.

Esperanto uses the circumflex over c, g, j and s in similar ways; the circumflex was chosen because there was no caron on most Western European typewriters, but the circumflex existed on French ones.

It is also used as an accent mark on vowels to indicate the tone of a syllable. The main example is in Pinyin for Chinese in which it represents a falling-rising tone. It is used in transliterations of Thai to indicate a rising tone.

Phonetics

The caron ⟨ǎ⟩ represents a rising tone in the International Phonetic Alphabet. It is used in the Uralic Phonetic Alphabet for indicating postalveolar consonants and in Americanist phonetic notation to indicate various types of pronunciation.

The caron below ⟨◌̬⟩ represents voicing.

Writing and printing carons

In printed Czech and Slovak text, the caron combined with certain letters (lower-case ť, ď, ľ, and upper-case Ľ) is reduced to a small stroke. That is optional in handwritten text. Latin fonts are typically set to display this way by default. Some fonts have an option to display a normal caron over these letters, but for those that don't, an option is to combine the letter and caron with the combining grapheme joiner, U+034F, resulting in t͏̌, d͏̌, l͏̌. However, using CGJ in this way can result in the caron mark being misaligned with respect to its letter, as is true for the font Gentium Plus, for instance.
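The combining grapheme joiner approach described above can be sketched in Python. This is a minimal illustration, not a rendering guarantee: the helper name `with_full_caron` is hypothetical, and whether a full caron is actually displayed (and how well it is aligned) depends on the font. The sketch only shows that inserting U+034F keeps the letter and the caron from being merged into the precomposed character during normalization.

```python
import unicodedata

CARON = "\u030C"  # COMBINING CARON
CGJ = "\u034F"    # COMBINING GRAPHEME JOINER

def with_full_caron(letter: str) -> str:
    """Hypothetical helper: letter + CGJ + combining caron.

    The CGJ sits between the base letter and the caron, so NFC
    normalization does not fuse them into the precomposed letter.
    """
    return letter + CGJ + CARON

# Without CGJ, NFC composes t + caron into the single character U+0165.
plain = unicodedata.normalize("NFC", "t" + CARON)
# With CGJ, the sequence stays as three separate code points.
joined = unicodedata.normalize("NFC", with_full_caron("t"))

print([f"U+{ord(c):04X}" for c in plain])
print([f"U+{ord(c):04X}" for c in joined])
```

The same helper can be applied to d and l; the visual result should be checked in the target font, since the mark may end up misaligned.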

In Lazuri orthography, the lower-case k with caron sometimes has its caron reduced to a stroke while the lower-case t with caron preserves its caron shape.[9]

Although the stroke looks similar to an apostrophe, there is a significant difference in kerning. Using an apostrophe in place of a caron can be perceived as very unprofessional, though nevertheless it is still often found on imported goods meant for sale in the Czech Republic and Slovakia (compare t’ to ť, L’ahko to Ľahko). (Apostrophes appearing as palatalization marks in some Finnic languages, such as Võro and Karelian, are not forms of caron either.) Foreigners also sometimes mistake the caron for the acute accent (compare Ĺ to Ľ, ĺ to ľ).

In Balto-Slavic languages

The following are the Czech and Slovak letters and digraphs with the caron (Czech: háček, Slovak: mäkčeň):

In Lower Sorbian and Upper Sorbian, the following letters and digraphs have the caron:

Balto-Slavic Serbo-Croatian, Slovenian, Latvian and Lithuanian use č, š and ž. The digraph dž is also used in these languages but is considered a separate letter only in Serbo-Croatian. The Belarusian Lacinka alphabet also contains the digraph dž (as a separate letter), and Latin transcriptions of Bulgarian and Macedonian may use them at times, for transcription of the letter-combination ДЖ (Bulgarian) and the letter Џ (Macedonian).

In Uralic languages

In the Finnic languages, Estonian (and transcriptions to Finnish) uses Š/š and Ž/ž, and Karelian uses Č/č, Š/š and Ž/ž. Dž is not a separate letter. Č is present because it may be phonemically geminate: in Karelian, the phoneme 'čč' is found, and is distinct from 'č', which is not the case in Finnish or Estonian, for which only one length is recognized for 'tš'. (Incidentally, in transcriptions, Finnish orthography has to employ complicated notations like mettšä or even mettshä to express Karelian meččä.) On some Finnish keyboards, it is possible to write those letters by typing s or z while holding the right Alt key or AltGr key, though that is not supported by the Microsoft Windows keyboard device driver KBDFI.DLL for the Finnish language. The Finnish multilingual keyboard layout allows typing the letters Š/š and Ž/ž by pressing AltGr+'+S for š and AltGr+'+Z for ž.

In Estonian, Finnish and Karelian these are not palatalized but postalveolar consonants. For example, Estonian Nissi (palatalized) is distinct from nišši (postalveolar). Palatalization is typically ignored in spelling, but some Karelian and Võro orthographies use an apostrophe (') or an acute accent (´). In Finnish and Estonian, š and ž (and in Estonian, very rarely č) appear in loanwords and foreign proper names only. When the letters are not available, they can be replaced in print by a digraph with h, such as sh for š.

In the orthographies of the Sami languages, the letters Č/č, Š/š and Ž/ž appear in Northern Sami, Inari Sami and Skolt Sami. Skolt Sami also uses three other consonants with the caron: Ǯ/ǯ (ezh-caron) to mark the voiced postalveolar affricate [dʒ] (plain Ʒ/ʒ marks the alveolar affricate [dz]), Ǧ/ǧ to mark the voiced palatal affricate [ɟʝ] and Ǩ/ǩ the corresponding voiceless palatal affricate [cç]. More often than not, they are geminated: vuäǯǯad "to get". The orthographies of the more southern Sami languages of Sweden and Norway such as Lule Sami do not use caron, and prefer instead the digraphs tj and sj.

Finno-Ugric transcription

Most other Uralic languages (including Kildin Sami) are normally written with Cyrillic instead of the Latin script. In their scientific transcription, the Finno-Ugric Transcription / Uralic Phonetic Alphabet however employs the letters š, ž and occasionally č, ǯ (alternately , ) for the postalveolar consonants. These serve as basic letters, and with further diacritics are used to transcribe also other fricative and affricate sounds. Retroflex consonants are marked by a caron and an underdot (ṣ̌, ẓ̌ = IPA [ʂ], [ʐ]), alveolo-palatal (palatalized postalveolar) consonants by a caron and an acute (š́, ž́ = IPA [ɕ], [ʑ]). Thus, for example, the postalveolar consonants of the Udmurt language, normally written as Ж/ж, Ӝ/ӝ, Ӵ/ӵ, Ш/ш are in Uralic studies normally transcribed as ž, ǯ, č, š respectively, and the alveolo-palatal consonants normally written as Зь/зь, Ӟ/ӟ, Сь/сь, Ч/ч are normally transcribed as ž́, ǯ́, š́, č́ respectively.[10]
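The Udmurt correspondences listed above can be written down as a small lookup table. The following is only a sketch under stated assumptions: the table `UDMURT_CONSONANTS` and the function `transcribe_consonants` are hypothetical names, only the eight lower-case consonants from the paragraph are covered, and every other character is passed through unchanged. It is not a complete Uralic Phonetic Alphabet transcriber.

```python
# Lower-case correspondences taken from the Udmurt example in the text.
# Letters with caron plus acute are written as base + U+0301 (combining acute).
UDMURT_CONSONANTS = {
    "ж": "\u017E",          # ž
    "ӝ": "\u01EF",          # ǯ
    "ӵ": "\u010D",          # č
    "ш": "\u0161",          # š
    "зь": "\u017E\u0301",   # ž + acute
    "ӟ": "\u01EF\u0301",    # ǯ + acute
    "сь": "\u0161\u0301",   # š + acute
    "ч": "\u010D\u0301",    # č + acute
}

def transcribe_consonants(text: str) -> str:
    """Replace the listed Udmurt consonants; leave everything else as is."""
    lowered = text.lower()
    out = []
    i = 0
    while i < len(lowered):
        pair = lowered[i:i + 2]
        if pair in UDMURT_CONSONANTS:          # digraphs зь, сь come first
            out.append(UDMURT_CONSONANTS[pair])
            i += 2
        elif lowered[i] in UDMURT_CONSONANTS:  # single letters
            out.append(UDMURT_CONSONANTS[lowered[i]])
            i += 1
        else:                                   # vowels and other letters
            out.append(lowered[i])
            i += 1
    return "".join(out)

print(transcribe_consonants("ш"), transcribe_consonants("сь"))
```

Checking two-letter keys before single letters is what keeps зь and сь from being split into a plain consonant followed by a soft sign.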

In other languages

In the Berber Latin alphabet of the Berber language (North Africa) the following letters and digraphs are used with the caron:

Finnish Kalo uses Ȟ/ȟ.

Lakota uses Č/č, Š/š, Ž/ž, Ǧ/ǧ (voiced post-velar fricative) and Ȟ/ȟ (plain post-velar fricative).

Indonesian uses ě (e with caron) informally to mark the schwa (Indonesian: pepet).

Many alphabets of African languages use the caron to mark the rising tone, as in the African reference alphabet.

Outside of the Latin alphabet, the caron is also used for Cypriot Greek letters that have a different sound from Standard Modern Greek: σ̌ κ̌ π̌ τ̌ ζ̌ in words like τζ̌αι ('and'), κάτ̌τ̌ος ('cat').

Other transcription and transliteration systems

The DIN 31635 standard for transliteration of Arabic uses Ǧ/ǧ to represent the letter ج (ǧīm), on account of the inconsistent pronunciation of J in European languages, the variable pronunciation of the letter in educated Arabic [d͡ʒ~ʒ~ɟ~ɡ], and the desire of the DIN committee to have a one-to-one correspondence of Arabic to Latin letters in its system.

Romanization of Pashto uses Č/č, Š/š, Ž/ž, X̌/x̌, to represent the letters ‎چ‎, ‎ش‎, ‎ژ‎, ‎ښ‎, respectively. Additionally, Ṣ̌/ṣ̌ and Ẓ̌/ẓ̌ are used by the southern Pashto dialect only (replaced by X̌/x̌ and Ǵ/ǵ in the north). [citation needed]

The latter Š/š is also used to transcribe the /ʃ/ phoneme in Sumerian and Akkadian cuneiform, and the /ʃ/ phoneme in Semitic languages represented by the letter shin (Phoenician and its descendants).

The caron is also used in Mandarin Chinese pinyin romanization and orthographies of several other tonal languages to indicate the "falling-rising" tone (similar to the pitch made when asking "Huh?"). The caron can be placed over the vowels: ǎ, ě, ǐ, ǒ, ǔ, ǚ. The alternative to a caron is a number 3 after the syllable: hǎo = hao3, as the "falling-rising" tone is the third tone in Mandarin.
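The equivalence hǎo = hao3 can be illustrated with a short Python sketch. It assumes a single syllable as input and handles only the third tone (the caron); the function name `third_tone_to_number` is hypothetical, and other tone marks are left untouched.

```python
import unicodedata

CARON = "\u030C"  # COMBINING CARON

def third_tone_to_number(syllable: str) -> str:
    """Rewrite a caron-marked pinyin syllable in tone-number notation."""
    # NFD splits a precomposed vowel into its base letter and marks.
    decomposed = unicodedata.normalize("NFD", syllable)
    if CARON not in decomposed:
        return syllable  # no third-tone mark: return the input unchanged
    stripped = decomposed.replace(CARON, "")
    # NFC recombines remaining marks, e.g. u + diaeresis back into ü.
    return unicodedata.normalize("NFC", stripped) + "3"

print(third_tone_to_number("h\u01CEo"))  # hǎo
print(third_tone_to_number("n\u01DA"))   # nǚ
```

Decomposing first means the same code path covers all six vowels listed above, including ǚ, where the diaeresis has to survive the removal of the caron.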

The caron is used in the New Transliteration System of D'ni in the symbol š to represent the sound [ʃ] (English "sh").

A-caron (ǎ) is also used to transliterate the Cyrillic letter Ъ (er golyam) in Bulgarian—it represents the mid back unrounded vowel [ɤ̞].

The caron marks a falling and rising tone (bǔ, bǐ) in Fon languages.

Letters with caron

Unicode encodes a number of cases of "letter with caron" as precomposed characters, which are displayed below. In addition, many more symbols may be composed using the combining character facility (U+030C ◌̌ COMBINING CARON and U+032C ◌̬ COMBINING CARON BELOW), which may be used with any letter or other diacritic to create a customised symbol. Such combinations do not necessarily have any real-world application and are not shown in the table.

  • Ǎ ǎ
  • Č č
  • Ď ď
  • Ě ě
  • Ǧ ǧ
  • Ȟ ȟ
  • Ǐ ǐ
  • ǰ
  • Ǩ ǩ
  • Ľ ľ
  • Ň ň
  • Ǒ ǒ
  • Ř ř
  • Š š
  • Ṧ ṧ
  • Ť ť
  • Ǔ ǔ
  • Ǚ ǚ
  • Ž ž
  • Ǯ ǯ

There are a number of Cyrillic letters with caron, but they do not have precomposed characters and thus must be generated using the combining character method. These are: В̌ в̌; Ǯ ǯ; Г̌ г̌; Ғ̌ ғ̌; Д̌ д̌; З̌ з̌; Р̌ р̌; Т̌ т̌; Х̌ х̌
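Whether a given base letter has a precomposed caron form can be checked with Unicode normalization. The sketch below assumes Python's standard `unicodedata` module; the function name `has_precomposed_caron` is hypothetical. It composes the base letter with U+030C under NFC and checks whether the result collapses to a single code point.

```python
import unicodedata

CARON = "\u030C"  # COMBINING CARON

def has_precomposed_caron(base: str) -> bool:
    """True if base + combining caron composes to one code point under NFC."""
    composed = unicodedata.normalize("NFC", base + CARON)
    return len(composed) == 1

# Latin c composes to a single character; Cyrillic ve does not.
print(has_precomposed_caron("c"))       # Latin small letter c
print(has_precomposed_caron("\u0432"))  # Cyrillic small letter ve
```

For letters where the check fails, the two-code-point sequence is the only available representation, which is the combining character method the text refers to.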

Software

Unicode

For legacy reasons, most letters that carry carons are precomposed characters in Unicode, but a caron can also be added to any letter by using the combining character U+030C ◌̌ COMBINING CARON, for example: b̌ q̌ J̌.

The characters Č, č, Ě, ě, Š, š, Ž, ž and the other Czech and Slovak caron letters are part of the Unicode Latin Extended-A block because they occur in Czech and other official languages in Europe, while letters such as Ǎ, Ǧ and Ǩ are in Latin Extended-B, which often causes an inconsistent appearance.
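The block a character belongs to follows directly from its code point. The sketch below hard-codes the two block ranges (U+0100 to U+017F for Latin Extended-A, U+0180 to U+024F for Latin Extended-B), since Python's `unicodedata` module offers no block lookup; `BLOCKS` and `block_of` are hypothetical names introduced for illustration.

```python
# Block ranges hard-coded from the Unicode block list; only the two
# blocks discussed in the text are included.
BLOCKS = [
    (0x0100, 0x017F, "Latin Extended-A"),
    (0x0180, 0x024F, "Latin Extended-B"),
]

def block_of(ch: str) -> str:
    """Return the block name for a single character, or "other"."""
    cp = ord(ch)
    for start, end, name in BLOCKS:
        if start <= cp <= end:
            return name
    return "other"

# Precomposed caron letters given as escapes: Č Ě Š Ž, then Ǎ Ǧ Ǩ.
samples = ["\u010C", "\u011A", "\u0160", "\u017D", "\u01CD", "\u01E6", "\u01E8"]
for ch in samples:
    print(ch, f"U+{ord(ch):04X}", block_of(ch))
```

A font that covers only the first range will fall back to another font for the second, which is one way the inconsistent appearance arises.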

Unicode also encodes U+032C ◌̬ COMBINING CARON BELOW, for example: p̬.

A combining double caron was proposed for inclusion in April 2024.[11]


Notes

a. /ˈkærən/ KARR-ən.[1] Also known as a háček or haček (/ˈhɑːtʃɛk, ˈhætʃɛk, ˈheɪtʃɛk/ HAH-chek, HATCH-ek, HAY-chek; plural háčeks or háčky), hachek, wedge, check, kvačica, strešica, mäkčeň, varnelė, paukščiukas, inverted circumflex, inverted hat, flying bird, or inverted chevron.

References

1. Wells, John C. (1990). "caron". Longman pronunciation dictionary. Harlow, England: Longman. p. 121. ISBN 0582053838.
2. Gaultney, Victor (2002). "Problems of diacritic design for Latin text faces". Dissertation submitted in partial fulfilment of the requirements for the Master of Arts in Typeface Design, University of Reading, pp. 16–18. Archived 2020-10-03 at the Wayback Machine.
3. "FAQ - Character Properties, Case Mappings and Names".
4. Andrew West, Antedating the Caron.
5. Unicode.org.
6. Freytag, Asmus; McGowan, Rick; Whistler, Ken. "UTN #27: Known anomalies in Unicode Character Names". www.unicode.org. Retrieved 8 February 2023.
7. Baddeley, Susan; Voeste, Anja (2012). Orthographies in Early Modern Europe. Berlin: De Gruyter Mouton. pp. 258–261. ISBN 9783110288179.
8. "Norme ortografiche della Grafia Faggin-Nazzi" (in Italian). Friul.net. Retrieved 2013-10-06.
9. Lazuri Font / Lazca Font, Lazca yazı karakterleri, Lazuri.com.
10. Rédei, Karoly (1973). "A votják nyelvjárások fonematikus átírása". In Posti, Lauri (ed.). FU-transcription yksinkertaistaminen. Helsinki. pp. 88–91.
11. "Unicode request for double caron" (PDF).


    Retrieved from "https://en.wikipedia.org/w/index.php?title=Caron&oldid=1231666727"

     



    This page was last edited on 29 June 2024, at 15:27 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.


