All Topics  
Byte

 

   Email Print
   Bookmark   Link






 

Byte



 
 
A byte (pronounced ) is a basic unit of measurement of information storage
Computer storage

Computer data storage, often called storage or memory, refers to computer components, devices, and recording medium that retain digital data used for computing for some interval of time....
 in computer science
Computer science

Computer science is the study of the theoretical foundations of information and computation, and of practical techniques for their implementation and application in computer systems....
. In many computer architecture
Computer architecture

Computer architecture in computer engineering is the conceptual design and fundamental operational structure of a computer system. It is a blueprint and functional description of requirements and design implementations for the various parts of a computer, focusing largely on the way by which the central processing unit performs internally an...
s it is a unit of memory addressing
Address space

In computing, an address space defines a range of discrete addresses, each of which may correspond to a physical or virtual memory register, a Node , peripheral device, disk sector or other logical or physical entity....
. There is no standard but a byte most often consists of eight bit
Bit

A bit is a binary numeral system numerical digit, taking a value of either 0 or 1. Binary digits are a basic unit of information Computer data storage and transmission in digital computing and digital information theory....
s.

A byte is an ordered collection of bits, with each bit denoting a single binary
Binary

Binary means composed of two parts or two pieces. It contrasts with Unary, Ternary, Quaternary , and so on.Binary may also refer to:* Binary option, also known as digital option OR all-or-nothing option...
 value of 1 or 0. The byte most often consists of 8 bits in modern systems; however, the size of a byte can vary and is generally determined by the underlying computer operating system
Operating system

An operating system is an interface between hardware and applications; it is responsible for the management and coordination of activities and the sharing of the limited resources of the computer....
 or hardware.






Discussion
Ask a question about 'Byte'
Start a new discussion about 'Byte'
Answer questions from other users
Full Discussion Forum



Encyclopedia


A byte (pronounced ) is a basic unit of measurement of information storage
Computer storage

Computer data storage, often called storage or memory, refers to computer components, devices, and recording medium that retain digital data used for computing for some interval of time....
 in computer science
Computer science

Computer science is the study of the theoretical foundations of information and computation, and of practical techniques for their implementation and application in computer systems....
. In many computer architecture
Computer architecture

Computer architecture in computer engineering is the conceptual design and fundamental operational structure of a computer system. It is a blueprint and functional description of requirements and design implementations for the various parts of a computer, focusing largely on the way by which the central processing unit performs internally an...
s it is a unit of memory addressing
Address space

In computing, an address space defines a range of discrete addresses, each of which may correspond to a physical or virtual memory register, a Node , peripheral device, disk sector or other logical or physical entity....
. There is no standard but a byte most often consists of eight bit
Bit

A bit is a binary numeral system numerical digit, taking a value of either 0 or 1. Binary digits are a basic unit of information Computer data storage and transmission in digital computing and digital information theory....
s.

A byte is an ordered collection of bits, with each bit denoting a single binary
Binary

Binary means composed of two parts or two pieces. It contrasts with Unary, Ternary, Quaternary , and so on.Binary may also refer to:* Binary option, also known as digital option OR all-or-nothing option...
 value of 1 or 0. The byte most often consists of 8 bits in modern systems; however, the size of a byte can vary and is generally determined by the underlying computer operating system
Operating system

An operating system is an interface between hardware and applications; it is responsible for the management and coordination of activities and the sharing of the limited resources of the computer....
 or hardware. Historically, byte size was determined by the number of bits required to represent a single character from a Western character set. Its size was generally determined by the number of possible characters in the supported character set and was chosen to be a divisor of the computer's word size. Historically bytes have ranged from five to twelve bits.

The popularity of IBM's System/360
System/360

The IBM System/360 is a mainframe computer system family announced by IBM on April 7, 1964. It was the first family of computers making a clear distinction between computer architecture and implementation, allowing IBM to release a suite of compatible designs at different price points....
 architecture
Computer architecture

Computer architecture in computer engineering is the conceptual design and fundamental operational structure of a computer system. It is a blueprint and functional description of requirements and design implementations for the various parts of a computer, focusing largely on the way by which the central processing unit performs internally an...
 starting in the 1960s and the explosion of microcomputer
Microcomputer

A microcomputer is a computer with a microprocessor as its central processing unit. Another general characteristic of these computers is that they occupy physically small amounts of space when compared to mainframe computer and minicomputers....
s based on 8-bit microprocessor
Microprocessor

A microprocessor incorporates most or all of the functions of a central processing unit on a single integrated circuit . The first microprocessors emerged in the early 1970s and were used for electronic calculators, using Binary-coded decimal arithmetic on 4-bit Word ....
s in the 1980s has made eight bits by far the most common size for a byte. The term octet
Octet (computing)

In computing, an octet is a grouping of eight bits.Octet, with the only exception noted below, always refers to an entity having exactly eight bits....
 is widely used as a more precise synonym where ambiguity is undesirable (for example, in protocol
Protocol (computing)

In computer science, a protocol is a convention or standard that controls or enables the connection, communication, and data transfer between computing endpoints....
 definitions).

There has been considerable confusion about the meanings of SI (or metric) prefixes
SI prefix

An SI prefix is a name or associated symbol that precedes a basic unit of measure to form a decimal multiple . The abbreviation SI is from the French language name Syst?me International d?Unit?s ....
 used with the word "byte", especially concerning prefixes such as kilo- (k or K) and mega- (M) as shown in the chart Prefixes for bit and byte. Since computer memory is designed with dual logic, multiples are expressed in power of two
Power of two

In mathematics, a power of two is any of the integer exponentiation of the number 2 ; in other words, two multiplication by itself a certain number of times....
, rather than 10, the software and computer industries often use binary estimates of the SI-prefixed quantities, while producers of computer storage devices prefer the SI values. This is the reason for specifying computer hard drive capacities of, say, "100 GB" when it contains 93 GiB (or 93 GB in traditional units) of addressable storage. Because of the confusion, a contract specifying a quantity of bytes must define what the prefixes mean in terms of the contract (i.e., the alternative binary equivalents or the actual decimal values, or a binary estimate based on the actual values).

To make the meaning of the table absolutely clear: A kibibyte
Kibibyte

A kibibyte is a unit of information or computer storage, established by the International Electrotechnical Commission in 2000. Its symbol is KiB....
 (KiB) is 1,024 bytes. A mebibyte
Mebibyte

The Mebibyte is a standards-based binary prefix of the byte, a unit of Computer data storage. Mebibyte is abbreviated MiB.The unit prefix mebi was defined by the International Electrotechnical Commission in December 1998....
 (MiB) is 1,024 × 1,024 or 1,048,576 bytes. The figures in the column using 1,024 raised to powers of 1, 2, 3, 4 and so on are in units of bytes.

Meanings

The word "byte" has two closely related meanings:
  1. A contiguous sequence of a fixed number of bit
    Bit

    A bit is a binary numeral system numerical digit, taking a value of either 0 or 1. Binary digits are a basic unit of information Computer data storage and transmission in digital computing and digital information theory....
    s (binary digits). The use of a byte to mean 8 bits has become nearly ubiquitous.
  2. A contiguous sequence of bits within a binary computer that comprises the smallest addressable sub-field of the computer's natural word-size. That is, the smallest unit of binary data on which meaningful computation, or natural data boundaries, could be applied. For example, the CDC 6000 series
    CDC 6000 series

    The CDC 6000 series was a family of mainframe computers manufactured by Control Data Corporation in the 1960s. It consisted of CDC 6400, CDC 6500, CDC 6600 and CDC 6700 computers, which all were extremely rapid and efficient for their time....
     scientific mainframes divided their 60-bit floating-point words into 10 six-bit bytes. These bytes conveniently held Hollerith data from punched cards, typically the upper-case alphabet and decimal digits. CDC also often referred to 12-bit quantities as bytes, each holding two 6-bit display code
    Display code

    Display code is the 6-bit character set used by many computer systems manufactured by Control Data Corporation, notably the CDC 6600 in 1964. The CDC 6000 series, and their followons, had 60 bit words....
     characters, due to the 12-bit I/O architecture of the machine. The PDP-10
    PDP-10

    The PDP-10 was a mainframe computer manufactured by Digital Equipment Corporation from the late 1960s on; the name stands for "Programmed Data Processor model 10"....
     used assembly instructions LDB and DPB to extract bytes — these operations survive today in Common Lisp
    Common Lisp

    Common Lisp, commonly abbreviated CL, is a dialect of the Lisp programming language, published in American National Standards Institute standard document Information Technology - Programming Language - Common Lisp, formerly X3.226-1994 ....
    . Bytes of six, seven, or nine bits were used on some computers, for example within the 36-bit word of the PDP-10
    PDP-10

    The PDP-10 was a mainframe computer manufactured by Digital Equipment Corporation from the late 1960s on; the name stands for "Programmed Data Processor model 10"....
    . The UNIVAC 1100/2200 series
    UNIVAC 1100/2200 series

    The UNIVAC 1100/2200 series is a series of compatible 36-bit computer systems, beginning with the UNIVAC 1107 in 1962, initially made by UNIVAC....
     computers (now Unisys
    Unisys

    Unisys Corporation , based in Blue Bell, Pennsylvania, Pennsylvania, United States, and incorporated in Delaware, is a global provider of information technology services and programs....
    ) addressed in both 6-bit (Fieldata
    Fieldata

    Fieldata was a pioneering computer project run by the United States Army Signal Corps in the late 1950s that intended to create a single standard for collecting and distributing battlefield information....
    ) and 9-bit (ASCII
    ASCII

    American Standard Code for Information Interchange , is a coding standard that can be used for interchanging information, if the information is expressed mainly by the written form of English words....
    ) modes within its 36-bit word.


History

The term byte was coined by Dr. Werner Buchholz in July 1956, during the early design phase for the IBM Stretch
IBM 7030

The IBM 7030, also known as Stretch, was IBM's first transistorized supercomputer. The first one was delivered to Los Alamos National Laboratory in 1961....
 computer. Originally it was defined in instructions by a 4-bit byte-size field, allowing from one to sixteen bits (the production design reduced this to a 3-bit byte-size field, allowing from one to eight bits to be represented by a byte); typical I/O equipment of the period used six-bit bytes. A fixed eight-bit byte size was later adopted and promulgated as a standard by the System/360
System/360

The IBM System/360 is a mainframe computer system family announced by IBM on April 7, 1964. It was the first family of computers making a clear distinction between computer architecture and implementation, allowing IBM to release a suite of compatible designs at different price points....
. The term "byte" comes from "bite," as in the smallest amount of data a computer could "bite" at once. The spelling change not only reduced the chance of a "bite" being mistaken for a "bit," but also was consistent with the penchant of early computer scientists to make up words and change spellings. A byte was also often referred to as "an 8-bit byte", reinforcing the notion that it was a tuple of n bits, and that other sizes were possible.

  1. A contiguous sequence of binary bits in a serial data stream, such as in modem or satellite communications, or from a disk-drive head, which is the smallest meaningful unit of data. These bytes might include start bits, stop bits, or parity bits, and thus could vary from 7 to 12 bits to contain a single 7-bit ASCII code.
  2. A datatype or synonym for a datatype in certain programming language
    Programming language

    A programming language is a machine-readable artificial language designed to express computations that can be performed by a machine, particularly a computer....
    s. C
    C (programming language)

    C is a general-purpose computer programming language originally developed in 1972 by Dennis Ritchie at the Bell Telephone Laboratories to implement the Unix operating system....
     and C++, for example, defines byte as "addressable unit of data large enough to hold any member of the basic character set of the execution environment" (clause 3.6 of the C standard). Since the C char integral data type must contain at least 8 bits (clause 5.2.4.2.1), a byte in C is at least capable of holding 256 different values (signed or unsigned char does not matter). Various implementations of C and C++ define a "byte" as 8, 9, 16, 32, or 36 bits. The actual number of bits in a particular implementation is documented as CHAR_BIT as implemented in the limits.h
    Limits.h

    limits.h is the header of the general purpose standard library of the C programming language which includes definitions of the characteristics of common variable types....
    file. Java's
    Java (programming language)

    Java is a programming language originally developed by James Gosling at Sun Microsystems and released in 1995 as a core component of Sun Microsystems' Java ....
     primitive byte data type is always defined as consisting of 8 bits and being a signed data type, holding values from -128 to 127.


Early microprocessors, such as Intel 8008
Intel 8008

The Intel 8008 was an early byte-oriented microprocessor designed and manufactured by Intel and introduced in April 1972. Originally known as the 1201, the chip was commissioned by Computer Terminal Corporation to implement an instruction set designed for their Datapoint 2200 programmable terminal....
 (the direct predecessor of the 8080, and then 8086
Intel 8086

The 8086 is a 16-bit microprocessor chip designed by Intel and introduced on the market in 1978, which gave rise to the x86 architecture. Intel 8088, released in 1979, was essentially the same chip, but with an external 8-bit bus , and is notable as the processor used in the original IBM PC....
) could perform a small number of operations on four bits, such as the DAA (decimal adjust) instruction, and the "half carry" flag, that were used to implement decimal arithmetic routines. These four-bit quantities were called "nybbles," in homage to the then-common 8-bit "bytes."

Alternative words

Following "bit," "byte," and "nybble," there have been some analogical attempts to construct unambiguous terms for bit blocks of other sizes. All of these are strictly jargon
Jargon

Jargon is terminology which has been especially defined in relationship to a specific activity, profession, or group. In other words, the term covers the language used by people who work in a particular area or who have a common interest....
, and not very common.

  • 1 bit: sniff
  • 2 bits: lick, crumb, quad, quarter, tayste, tydbit
  • 4 bits: nibble
    Nibble

    A nibble is the computing term for a four-bit aggregation, or half an octet . As a nibble contains 4 bits, there are sixteen possible values, so a nibble corresponds to a single hexadecimal digit ....
    , nybble
  • 5 bits: nickel, nyckle
  • 10 bits: deckle
    Deckle

    In papermaking, a deckle is a belt used along with a Molding to gather up woodpulp from a vat for pressing and drying into sheets. It helps to control the size of the paper produced....
    , dyme bag
  • 16 bits: plate, playte, chomp
    Chomp (disambiguation)

    Chomp is a colloquial term for the act of biting.It may also refer to:* Chomp, a two-player game played on a rectangular chocolate bar made up of smaller square blocks...
    , chawmp (on a 32-bit machine)
  • 18 bits: chomp, chawmp (on a 36-bit machine)
  • 32 bits: dinner, dynner, gawble (on a 286 ryan m-bit machine)
  • 48 bits: gobble
    Gobble

    Gobble may refer to:* Jimmy Gobble , Major League Baseball pitcher* Gobble Goblin, a fictional character in Piers Anthony's The Goblin family of Xanth...
    , gawble (under circumstances that remain obscure)


Abbreviation/Symbol

IEEE 1541
IEEE 1541

IEEE 1541-2002 is a standard issued by the Institute of Electrical and Electronics Engineers concerning the use of prefixes for binary multiples of units of measurement related to digital electronics and computing....
 and specify "B" as the symbol for byte (e.g. MB means megabyte), while IEC 60027
IEC 60027

IEC 60027 is the International Electrotechnical Commission's standard on Letter symbols to be used in electrical technology. It consists of several parts:...
 seems silent on the subject. Furthermore, B means bel (see decibel
Decibel

The decibel is a logarithmic units of measurement that expresses the magnitude of a physical quantity relative to a specified or implied reference level....
), another (logarithmic) unit used in the same field. The use of B to stand for bel is consistent with the metric system convention that capitalized symbols are for units named after a person (in this case Alexander Graham Bell
Alexander Graham Bell

Alexander Graham Bell was an eminent scientist, Innovation and innovator who is credited with inventing the first practical telephone.Bell's father, grandfather, and brother had all been associated with work on elocution and speech, and both his mother and wife were deaf, profoundly influencing Bell's life's work....
); usage of a capital B to stand for byte is not consistent with this convention. There is little danger of confusing a byte with a bel because the bel's sub-multiple the decibel
Decibel

The decibel is a logarithmic units of measurement that expresses the magnitude of a physical quantity relative to a specified or implied reference level....
 (dB) is usually preferred, while use of the decibyte (dB) is extremely rare.

The unit symbol "KB" is a commonly used abbreviation for "kilobyte
Kilobyte

Kilobyte is a unit of Computer data storage equal to either 1,024 bytes or 1,000 bytes , depending on context.It is abbreviated in a number of ways: KB, kB, K and Kbyte....
" but is often confused with the use of "kb" to mean "kilobit
Kilobit

A kilobit is an expression of grouped bits meaning 1,000 bits. Use of the term to denote a kibibit is deprecated and contrary to international standard....
". IEEE 1541 specifies "b" as the symbol for bit
Bit

A bit is a binary numeral system numerical digit, taking a value of either 0 or 1. Binary digits are a basic unit of information Computer data storage and transmission in digital computing and digital information theory....
; however the IEC
International Electrotechnical Commission

The International Electrotechnical Commission is a Non-profit organization, non-governmental international standards organization that prepares and publishes International Standards for all electrical, electronic and related technologies ? collectively known as "electrotechnology"....
 60027 and Metric-Interchange-Format specify "bit" (e.g. Mbit for megabit) for the symbol, achieving maximum disambiguation from byte.

Lowercase "o" for "octet
Octet (computing)

In computing, an octet is a grouping of eight bits.Octet, with the only exception noted below, always refers to an entity having exactly eight bits....
" is a commonly used symbol in several non-English-speaking countries, and is also used with metric prefixes (for example, "ko" and "Mo").

Today the harmonized ISO/IEC
IEC

IEC may refer to:In education:* International Education Centre* Information, Education, and Communication* International Electrotechnical Commission - An international standards organization dealing with electrical, electronic and related technologies....
 IEC 80000-13:2008 - Quantities and units -- Part 13: Information science and technology
ISO/IEC 80000

International standard ISO 80000 or IEC 80000 , successor of ISO 31 and partially of IEC 60027, is the most widely respected style guide for the use of physical quantities and units of measurement, and formulas involving them, in scientific and educational documents worldwide....
 standard cancels and replaces subclauses 3.8 and 3.9 of IEC 60027-2:2005 (those related to Information theory and Prefixes for binary multiples). See Units of information#Byte
Units of information

Units of information are a way of measuring a quantity of information or an entropy....
 for detailed discussion on names for derived units.

See also

  • Bit
    Bit

    A bit is a binary numeral system numerical digit, taking a value of either 0 or 1. Binary digits are a basic unit of information Computer data storage and transmission in digital computing and digital information theory....
  • Octet (computing)
    Octet (computing)

    In computing, an octet is a grouping of eight bits.Octet, with the only exception noted below, always refers to an entity having exactly eight bits....
  • Word (computing)