Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Character set for OCR-A  





2 Character set for OCR-B  





3 Character set for JIS X 9008 (JIS C 6257)  





4 Character set for E-13B  





5 References  





6 External links  














ISO 2033







Add links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


The ISO 2033:1983 standard ("Coding of machine readable characters (MICR and OCR)")[1] defines character sets for use with Optical Character RecognitionorMagnetic Ink Character Recognition systems. The Japanese standard JIS X 9010:1984 ("Coding of machine readable characters (OCR and MICR)", originally designated JIS C 6229-1984) is closely related.[2]

Character set for OCR-A[edit]

The version of the encoding for the OCR-A font registered with the ISO-IR registry as ISO-IR-91 is the Japanese (JIS X 9010 / JIS C 6229) version, which differs from the encoding defined by ISO 2033 only in the addition of a Yen sign at 5C.[2]

ISO 2033 and JIS C 6229 OCR-A set
0 1 2 3 4 5 6 7 8 9 A B C D E F
0x NUL SOH STX ETX EOT ENQ ACK BEL BS HT LF VT FF CR SO SI
1x DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN EM SUB ESC FS GS RS US
2x  SP  " £
00A3
$ % & ' {
007B
}
007D
* + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ;
2440
=
2441
?
4x A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z ¥
00A5

2442
6x
7x | DEL
  Redefined compared to JIS-Roman

Character set for OCR-B[edit]

The version of the G0 set for the OCR-B font registered with the ISO-IR registry as ISO-IR-92 is the Japanese (JIS X 9010 / JIS C 6229) version, which differs from the encoding defined by ISO 2033 only in being based on JIS-Roman (with a dollar sign at 0x24 and a Yen sign at 0x5C) rather than on the ISO 646 IRV (with a backslash at 0x5C and, at the time, a universal currency sign (¤) at 0x24).[3] Besides those code points, it differs from ASCII only in omitting the backtick (`) and tilde (~).[3] An additional supplementary set registered as ISO-IR-93 assigns the pound sign (£), universal currency sign (¤) and section sign (§) to their ISO-8859-1 codepoints, and the backslash to the ISO-8859-1 codepoint for the Yen sign.[4]

Character set for JIS X 9008 (JIS C 6257)[edit]

JIS X 9010 (JIS C 6229) also defines character sets for the JIS X 9008:1981 (formerly JIS C 6257-1981) "hand-printed" OCR font.[5]: fn1  These include subsets of the JIS X 0201 Roman set (registered as ISO-IR-94 and omitting the backtick (`), lowercase letters, curly braces ({, }) and overline (‾)),[5] and kana set (registered as ISO-IR-96 and omitting the East Asian style comma (、) and full stop (。), the interpunct (・) and the small kana),[6] in addition to a set (registered as ISO-IR-95) containing only the backslash, which is assigned to the same code point as in ISO-IR-93.[7]

The JIS C 6527 font stylises the slash[5] and backslash[7] characters with a doubled appearance. The character names given are "Solidus"[5] and "Reverse Solidus",[7] matching the Unicode character names for the ASCII slash and backslash.[8] However, the Unicode Optical Character Recognition block includes an additional code point for an "OCR Double Backslash" (⑊), although not for a double (forward) slash,[9] although a double slash is available elsewhere, as U+2AFD DOUBLE SOLIDUS OPERATOR.

Character set for E-13B[edit]

The MICR E-13B font, showing the ISO-IR-98 character repertoire.

The ISO-IR-98 encoding defined by ISO 2033 encodes the character repertoire of the E13B font, as used with magnetic ink character recognition.[10] Although ISO 2033 also specifies other encodings, the encoding for E-13B is the encoding referred to as ISO_2033_1983byPerl libintl,[11] and as ISO_2033-1983orcsISO2033 by the IANA.[12] Other registered labels include iso-ir-98, its ISO-IR registration number, and simply e13b.[12]

The digits are preserved in their ASCII locations. Letters and symbols unavailable in the E13B font are omitted, while specialised punctuation for bank cheques included in the E13B font is added. The same symbols are available in Unicode in the Optical Character Recognition block.

ISO 2033:1983 E-13B set[11]
0 1 2 3 4 5 6 7 8 9 A B C D E F
0x NUL SOH STX ETX EOT ENQ ACK BEL BS HT LF VT FF CR SO SI
1x DLE DC1 DC2 DC3 DC4 NAK SYN ETB CAN EM SUB ESC FS GS RS US
2x  SP 
3x 0 1 2 3 4 5 6 7 8 9
2446

2447

2448

2449
4x
5x
6x
7x DEL
  Redefined compared to ASCII

References[edit]

  1. ^ ISO/IEC JTC 1/SC 2 (1983). Information processing — Coding of machine readable characters (MICR and OCR). ISO. ISO 2033:1983.{{citation}}: CS1 maint: numeric names: authors list (link)
  • ^ a b ISO/TC97/SC2 (1985-08-01). ISO-IR-91: Japanese OCR-A Graphic Character Set (PDF). ITSCJ/IPSJ.{{citation}}: CS1 maint: numeric names: authors list (link)
  • ^ a b ISO/TC97/SC2 (1985-08-01). ISO-IR-92: Japanese OCR-B Basic Graphic Character Set (PDF). ITSCJ/IPSJ.{{citation}}: CS1 maint: numeric names: authors list (link)
  • ^ ISO/TC97/SC2 (1985-08-01). ISO-IR-93: Japanese OCR-B - Additional Graphic Character Set (PDF). ITSCJ/IPSJ.{{citation}}: CS1 maint: numeric names: authors list (link)
  • ^ a b c d ISO/TC97/SC2 (1985-08-01). ISO-IR-94: Japanese Basic Hand-printed Graphic Character Set for OCR (PDF). ITSCJ/IPSJ.{{citation}}: CS1 maint: numeric names: authors list (link)
  • ^ ISO/TC97/SC2 (1985-08-01). ISO-IR-96: Katakana Hand-printed Graphic Character Set for OCR (PDF). ITSCJ/IPSJ.{{citation}}: CS1 maint: numeric names: authors list (link)
  • ^ a b c ISO/TC97/SC2 (1985-08-01). ISO-IR-95: Japanese Additional Hand-printed Graphic Character Set for OCR (PDF). ITSCJ/IPSJ.{{citation}}: CS1 maint: numeric names: authors list (link)
  • ^ Unicode Consortium. "C0 Controls and Basic Latin" (PDF). The Unicode Standard.
  • ^ Unicode Consortium. "Optical Character Recognition" (PDF). The Unicode Standard.
  • ^ ISO/TC97/SC2 (1985-08-01). ISO-IR-98: A set of 14 graphic characters of the E13B font (PDF). ITSCJ/IPSJ.{{citation}}: CS1 maint: numeric names: authors list (link)
  • ^ a b Flohr, Guido. "Conversion routines for ISO_2033_1983". libintl. Locale::RecodeData::ISO_2033_1983.
  • ^ a b "Character Sets". IANA.
  • External links[edit]


    Retrieved from "https://en.wikipedia.org/w/index.php?title=ISO_2033&oldid=1226590645"

    Categories: 
    Character sets
    ISO standards
    Optical character recognition
    OCR typefaces
    Hidden categories: 
    CS1 maint: numeric names: authors list
    Articles with short description
    Short description is different from Wikidata
     



    This page was last edited on 31 May 2024, at 16:35 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki