Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Character repertoire  





2 Encoding  





3 Legal status  





4 See also  





5 References  



5.1  Works cited  







6 Sources  





7 External links  














CJK characters






العربية
Bahasa Indonesia

Português

 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 

(Redirected from CJKV)

Translation of "That old man is 72 years old" in Vietnamese, Cantonese, Mandarin (insimplified and traditional characters), Japanese, and Korean.

Ininternationalization, CJK characters is a collective term for graphemes used in the Chinese, Japanese, and Korean writing systems, which each include Chinese characters. The term CJKV also includes Chữ Nôm, the Chinese-origin logographic script formerly used for the Vietnamese language.

Character repertoire

[edit]

Standard Mandarin Chinese and Standard Cantonese are written almost exclusively in Chinese characters. Over 3,000 characters are required for general literacy, with up to 40,000 characters for reasonably complete coverage. Japanese uses fewer characters—general literacy in Japanese can be expected with 2,136 characters. The use of Chinese characters in Korea is increasingly rare, although idiosyncratic use of Chinese characters in proper names requires knowledge (and therefore availability) of many more characters. Even today, however, South Korean students are taught 1,800 characters.

Other scripts used for these languages, such as bopomofo and the Latin-based pinyin for Chinese, hiragana and katakana for Japanese, and hangul for Korean, are not strictly "CJK characters", although CJK character sets almost invariably include them as necessary for full coverage of the target languages.

The sinologist Carl Leban (1971) produced an early survey of CJK encoding systems.

Until the early 20th century, Classical Chinese was the written language of government and scholarship in Vietnam. Popular literature in Vietnamese was written in the chữ Nôm script, consisting of Chinese characters with many characters created locally. Since the 1920s, the script since then used for recording literature has been the Latin-based Vietnamese alphabet.[1][2]

Encoding

[edit]

The number of characters required for complete coverage of all these languages' needs cannot fit in the 256-character code space of 8-bit character encodings, requiring at least a 16-bit fixed width encoding or multi-byte variable-length encodings. The 16-bit fixed width encodings, such as those from Unicode up to and including version 2.0, are now deprecated due to the requirement to encode more characters than a 16-bit encoding can accommodate—Unicode 5.0 has some 70,000 Han characters—and the requirement by the Chinese government that software in China support the GB 18030 character set.

Although CJK encodings have common character sets, the encodings often used to represent them have been developed separately by different East Asian governments and software companies, and are mutually incompatible. Unicode has attempted, with some controversy, to unify the character sets in a process known as Han unification.

CJK character encodings should consist minimally of Han characters plus language-specific phonetic scripts such as pinyin, bopomofo, hiragana, katakana and hangul.

CJK character encodings include:

  • Big5 (the most prevalent encoding before Unicode was implemented)
  • CCCII
  • CNS 11643 (official standard of Republic of China)
  • EUC-JP
  • EUC-KR
  • GB 2312 (subset and predecessor of GB 18030)
  • GB 18030 (mandated standard in the People's Republic of China)
  • Giga Character Set (GCS)
  • ISO 2022-JP
  • KS C 5861
  • Shift-JIS
  • TRON
  • Unicode
  • The CJK character sets take up the bulk of the assigned Unicode code space. There is much controversy among Japanese experts of Chinese characters about the desirability and technical merit of the Han unification process used to map multiple Chinese and Japanese character sets into a single set of unified characters.[citation needed]

    All three languages can be written both left-to-right and top-to-bottom (right-to-left and top-to-bottom in ancient documents), but are usually considered left-to-right scripts when discussing encoding issues.

    [edit]

    Libraries cooperated on encoding standards for JACKPHY characters in the early 1980s. According to Ken Lunde, the abbreviation "CJK" was a registered trademarkofResearch Libraries Group[3] (which merged with OCLC in 2006). The trademark owned by OCLC between 1987 and 2009 has now expired.[4]

    See also

    [edit]

    References

    [edit]
    1. ^ Coulmas (1991), pp. 113–115.
  • ^ DeFrancis (1977).
  • ^ Ken Lunde, 1996
  • ^ Justia listing
  • Works cited

    [edit]

    Sources

    [edit]
  • Hannas, William C. Asia's Orthographic Dilemma. Honolulu: University of Hawaii Press, 1997. ISBN 0-8248-1892-X (paperback); ISBN 0-8248-1842-3 (hardcover).
  • Lemberg, Werner: The CJK package for LATEX2ε—Multilingual support beyond babel. TUGboat, Volume 18 (1997), No. 3—Proceedings of the 1997 Annual Meeting.
  • Leban, Carl. Automated Orthographic Systems for East Asian Languages (Chinese, Japanese, Korean), State-of-the-art Report, Prepared for the Board of Directors, Association for Asian Studies. 1971.
  • Lunde, Ken. CJKV Information Processing. Sebastopol, Calif.: O'Reilly & Associates, 1998. ISBN 1-56592-224-7.
  • [edit]
    Retrieved from "https://en.wikipedia.org/w/index.php?title=CJK_characters&oldid=1234001764"

    Categories: 
    Encodings of Asian languages
    Languages of East Asia
    Natural language and computing
    Chinese-language computing
    Japanese-language computing
    Korean-language computing
    Writing systems using Chinese characters
    Hidden categories: 
    Articles with short description
    Short description is different from Wikidata
    Articles containing Vietnamese-language text
    All articles with unsourced statements
    Articles with unsourced statements from March 2011
     



    This page was last edited on 12 July 2024, at 02:04 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki