EBCDIC



 
 
Extended Binary Coded Decimal Interchange Code (EBCDIC) is an 8-bit character encoding (code page) used on IBM mainframe operating systems such as z/OS, OS/390, VM and VSE, as well as IBM midrange computer operating systems such as OS/400 and i5/OS (see also Binary Coded Decimal). It is also employed on various non-IBM platforms such as Fujitsu-Siemens' BS2000/OSD, HP MPE/iX, and Unisys MCP. It descended from punched cards and the corresponding six-bit binary-coded decimal code that most of IBM's computer peripherals of the late 1950s and early 1960s used.








History

EBCDIC was devised in 1963 and 1964 by IBM and was announced with the release of the IBM System/360 line of mainframe computers. It was created to extend the Binary-Coded Decimal encoding that existed at the time. It is an 8-bit character encoding, in contrast to, and developed separately from, the 7-bit ASCII encoding scheme.

IBM was a chief proponent of the ASCII standardization committee. However, IBM did not have time to prepare ASCII peripherals (such as card punch machines) to ship with its System/360 computers, so the company settled on EBCDIC. The System/360 became wildly successful, and thus so did EBCDIC.

All IBM mainframe peripherals and operating systems (except Linux on zSeries or iSeries) use EBCDIC as their inherent encoding, but software can translate to and from other encodings. Many hardware peripherals provide translation as well, and modern mainframes (such as IBM zSeries) include processor instructions, at the hardware level, to accelerate translation between character sets.
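As a rough illustration of software translation, the sketch below uses the EBCDIC codecs that ship with Python. The code pages `cp037` and `cp500` and the helper names `to_ebcdic` and `from_ebcdic` are assumptions chosen for the example; they are not prescribed by anything above.

```python
# Translate text between Python strings and two EBCDIC code pages using
# the standard library codecs. The code pages chosen are illustrative.

def to_ebcdic(text, codepage="cp037"):
    """Encode a Python string into EBCDIC bytes for the given code page."""
    return text.encode(codepage)

def from_ebcdic(data, codepage="cp037"):
    """Decode EBCDIC bytes back into a Python string."""
    return data.decode(codepage)

hello = to_ebcdic("HELLO")
print(hello.hex())           # c8c5d3d3d6
print(from_ebcdic(hello))    # HELLO

# Letters encode identically in both code pages, but some punctuation
# (here "[") does not, so the code page of the data must be known.
print(to_ebcdic("A", "cp037") == to_ebcdic("A", "cp500"))   # True
print(to_ebcdic("[", "cp037") == to_ebcdic("[", "cp500"))   # False
```

The last two lines hint at why a translation routine generally needs to be told which EBCDIC variant it is dealing with, rather than assuming a single "EBCDIC" mapping.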

At the time it was devised, EBCDIC made it relatively easy to enter data into a computer with punch cards. Since punch cards are no longer used on mainframes, EBCDIC is used in modern mainframes solely for backwards compatibility. It has no real technical advantage over ASCII-based code pages such as the ISO-8859 series or Unicode. There are some technical niceties in each, e.g., ASCII and EBCDIC both have one bit which indicates upper or lower case. But there are some aspects of EBCDIC which make it much less pleasant to work with than ASCII (such as a non-contiguous alphabet). As with single-byte extended ASCII codepages, most EBCDIC codepages only allow up to 2 languages (English and one other language) to be used in a database or text file.
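The single case bit can be checked with a short sketch. It assumes Python's `cp037` codec as a representative EBCDIC code page; the function names are hypothetical and exist only for this example.

```python
# Compare the bit that separates upper and lower case in ASCII and in
# EBCDIC (cp037 is an assumed, illustrative code page).

def case_bit(codec):
    """Return the XOR of the codes for 'a' and 'A' in the given codec."""
    lower = "a".encode(codec)[0]
    upper = "A".encode(codec)[0]
    return lower ^ upper

def same_case_bit_for_all_letters(codec):
    """Check that every letter pair differs by the same single bit."""
    bit = case_bit(codec)
    letters = "abcdefghijklmnopqrstuvwxyz"
    return all(
        c.encode(codec)[0] ^ c.upper().encode(codec)[0] == bit
        for c in letters
    )

print(hex(case_bit("ascii")))   # 0x20
print(hex(case_bit("cp037")))   # 0x40
print(same_case_bit_for_all_letters("ascii"))   # True
print(same_case_bit_for_all_letters("cp037"))   # True
```

One difference the sketch does not print: in ASCII the bit is set for lower case, while in this EBCDIC code page it is set for upper case.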

Where true support for multilingual text is desired, a system supporting far more characters is needed. Generally this is done with some form of Unicode support. There is an EBCDIC Unicode Transformation Format called UTF-EBCDIC proposed by the Unicode Consortium, but it is not intended to be used in open interchange environments and, even on EBCDIC-based systems, it is almost never used. IBM mainframes support UTF-16, but they do not support UTF-EBCDIC natively.

Arabic EBCDIC versions are typically in presentation order, that is, the left-to-right order displayed by an older mainframe or line printer, rather than the right-to-left logical order used by modern encodings such as Unicode.

Codepage layout

The table below is derived from CCSID 500, one of the code page variants of EBCDIC, showing only the basic (English) EBCDIC characters. Characters 00–3F and FF are controls, 40 is space, 41 is no-break space (RSP: "Required Space"), E1 is numeric space (NSP: "Numeric Space"), and CA is soft hyphen. Characters are shown with their equivalent Unicode codes. Invariant alphanumeric, punctuation, and control characters common to all EBCDIC code pages are shown in color. Unassigned codes are typically filled with international or region-specific characters in the various EBCDIC code page variants.

[Table: EBCDIC code chart for CCSID 500, a 16 × 16 grid indexed by high and low hexadecimal digits 0 to F; cell contents not reproduced]
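A chart of this shape can be approximated from a codec table. The sketch below assumes that Python's `cp500` codec corresponds to CCSID 500 and keeps only visible ASCII characters as a stand-in for the "basic" set; the function name `ebcdic_chart` is hypothetical.

```python
# Rebuild a 16 x 16 EBCDIC chart from Python's cp500 codec, keeping only
# visible ASCII characters and leaving every other cell blank.

def ebcdic_chart(codec="cp500"):
    """Return 16 rows (high hex digit) of 16 cells (low hex digit)."""
    rows = []
    for high in range(16):
        row = []
        for low in range(16):
            char = bytes([high * 16 + low]).decode(codec)
            # Controls, spaces and national characters become a blank.
            row.append(char if "!" <= char <= "~" else " ")
        rows.append(row)
    return rows

chart = ebcdic_chart()
print("   " + " ".join(f"_{low:X}" for low in range(16)))
for high, row in enumerate(chart):
    print(f"{high:X}_ " + "  ".join(row))
```

Rows 0 to 3 come out blank because those codes are controls, and the letters appear in three separate runs in rows 8 to A (lower case) and C to E (upper case).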


Criticism and humor

Open-source-software advocate and hacker Eric S. Raymond writes in his Jargon File that EBCDIC was almost universally loathed by early hackers and programmers because of its multitude of different versions, none of which resembled the other versions, and that IBM produced it in direct competition with the already-established ASCII.

The Jargon File 4.4.7 gives the following definition:

Another popular complaint is that the EBCDIC alphabetic characters follow an archaic punch card encoding rather than a linear ordering like ASCII. The upshot of this is that incrementing the character code for "I" does not produce the code for "J", and likewise there is a gap between the codes for "R" and "S". Thus programming a simple control loop to cycle through only the alphabetic characters is problematic.
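The gaps are easy to see in a short sketch. It assumes Python's `cp037` codec as the EBCDIC code page; `code` and `alphabet_codes` are hypothetical helper names used only here.

```python
# Show the gaps in the EBCDIC alphabet (cp037 is an assumed code page)
# and a loop over the letters that does not rely on contiguous codes.
import string

CODEC = "cp037"

def code(char):
    """Return the single-byte EBCDIC code for a character."""
    return char.encode(CODEC)[0]

print(hex(code("I")), hex(code("J")))   # 0xc9 0xd1
print(hex(code("R")), hex(code("S")))   # 0xd9 0xe2

# Incrementing the code for "I" does not give "J".
after_i = bytes([code("I") + 1]).decode(CODEC)
print(after_i == "J")                   # False

def alphabet_codes():
    """Iterate over the letters themselves instead of a numeric range."""
    return [code(c) for c in string.ascii_uppercase]

# A naive numeric range from "A" to "Z" spans more than 26 codes.
naive_span = code("Z") - code("A") + 1
print(len(alphabet_codes()), naive_span)   # 26 41
```

Iterating over a list of letters, as `alphabet_codes` does, sidesteps the problem; a loop over the numeric range from "A" to "Z" would also visit 15 codes that are not letters.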

These incompatibilities were also the source of many jokes. A popular one went:
Professor: "So the American government went to IBM to come up with an encryption standard, and they came up with—"
Student: "EBCDIC!"


A reference to the EBCDIC character set is made in the classic Infocom adventure game Zork 2. In the "Machine Room", there is a collection of ancient computers and other machines of uncertain purpose. The following is the description of the room, with EBCDIC used to imply an incomprehensible language:

See also

  • EBCDIC-codepages with Latin-1-charset
  • codepage 037 (English, Portuguese)
  • codepage 285 (Ireland, United Kingdom)
  • UTF-EBCDIC



External links

  • Contains IBM's official information on codepages and charsets.
    • Codepage 37: ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP00037.pdf
    • Codepage 1047: ftp://ftp.software.ibm.com/software/globalization/gcoc/attachments/CP01047.pdf
  • Contains more information about EBCDIC derived from IBM's CDRA, including DBCS EBCDIC (Double Byte Character Set EBCDIC)
  • Contains computer readable Unicode mapping tables for EBCDIC and many other character sets
  • character list, including decimal and hex values, symbolic name, and character/function
  • Online tool to convert from ASCII to/from EBCDIC