DOC (computing)
Encyclopedia
In computing
Computing
Computing is usually defined as the activity of using and improving computer hardware and software. It is the computer-specific part of information technology...

, DOC or doc (an abbreviation of 'document
Document
The term document has multiple meanings in ordinary language and in scholarship. WordNet 3.1. lists four meanings :* document, written document, papers...

') is a filename extension
Filename extension
A filename extension is a suffix to the name of a computer file applied to indicate the encoding of its contents or usage....

 for word processing
Word processing
Word processing is the creation of documents using a word processor. It can also refer to advanced shorthand techniques, sometimes used in specialized contexts with a specially modified typewriter.-External links:...

 documents; most commonly for Microsoft Word
Microsoft Word
Microsoft Word is a word processor designed by Microsoft. It was first released in 1983 under the name Multi-Tool Word for Xenix systems. Subsequent versions were later written for several other platforms including IBM PCs running DOS , the Apple Macintosh , the AT&T Unix PC , Atari ST , SCO UNIX,...

. Historically, the extension was used for documentation in plain-text
Plain text
In computing, plain text is the contents of an ordinary sequential file readable as textual material without much processing, usually opposed to formatted text....

 format
File format
A file format is a particular way that information is encoded for storage in a computer file.Since a disk drive, or indeed any computer storage, can store only bits, the computer must have some way of converting information to 0s and 1s and vice-versa. There are different kinds of formats for...

, particularly of programs or computer hardware, on a wide range of operating system
Operating system
An operating system is a set of programs that manage computer hardware resources and provide common services for application software. The operating system is the most important type of system software in a computer system...

s. During the 1980s, WordPerfect
WordPerfect
WordPerfect is a word processing application, now owned by Corel.Bruce Bastian, a Brigham Young University graduate student, and BYU computer science professor Dr. Alan Ashton joined forces to design a word processing system for the city of Orem's Data General Corp. minicomputer system in 1979...

 used DOC as the extension of their proprietary format. Later, in the 1990s, Microsoft
Microsoft
Microsoft Corporation is an American public multinational corporation headquartered in Redmond, Washington, USA that develops, manufactures, licenses, and supports a wide range of products and services predominantly related to computing through its various product divisions...

 chose to use the DOC extension for their proprietary Microsoft Word
Microsoft Word
Microsoft Word is a word processor designed by Microsoft. It was first released in 1983 under the name Multi-Tool Word for Xenix systems. Subsequent versions were later written for several other platforms including IBM PCs running DOS , the Apple Macintosh , the AT&T Unix PC , Atari ST , SCO UNIX,...

 word processing formats. The original uses for the extension have largely disappeared from the PC
Personal computer
A personal computer is any general-purpose computer whose size, capabilities, and original sales price make it useful for individuals, and which is intended to be operated directly by an end-user with no intervening computer operator...

 world.

Microsoft's DOC binary file format

Binary DOC file
Computer file
A computer file is a block of arbitrary information, or resource for storing information, which is available to a computer program and is usually based on some kind of durable storage. A file is durable in the sense that it remains available for programs to use after the current program has finished...

s often contain more text formatting information (as well as scripts and undo information) than some other document file format
Document file format
A document file format is a text or binary file format for storing documents on a storage media, especially for use by computers.There currently exist a multitude of incompatible document file formats....

s like Rich Text Format
Rich Text Format
The Rich Text Format is a proprietary document file format with published specification developed by Microsoft Corporation since 1987 for Microsoft products and for cross-platform document interchange....

 and HyperText Markup Language, but are usually less widely compatible
Computer compatibility
A family of computer models is said to be compatible if certain software that runs on one of the models can also be run on all other models of the family. The computer models may differ in performance, reliability or some other characteristic...

.

The DOC format varies among Microsoft Office Word Formats. Word versions up to 97 used a different format from Microsoft Word version between 97 and 2003.

In Microsoft Office Word 2007 the binary file format was replaced as the default format by the new XML
XML
Extensible Markup Language is a set of rules for encoding documents in machine-readable form. It is defined in the XML 1.0 Specification produced by the W3C, and several other related specifications, all gratis open standards....

 based Office Open XML format. The filename extensions of this format are .docx or .docm. Microsoft Office Word 2010 uses a newer version of Office Open XML format, and is supported by Office 2007.

Application support

The DOC format is native to Microsoft Office Word, but other word processor
Word processor
A word processor is a computer application used for the production of any sort of printable material....

s, such as OpenOffice.org Writer
OpenOffice.org Writer
OpenOffice.org Writer is the word processor component of the OpenOffice.org software package. Writer is a word processor similar to Microsoft Word and Corel's WordPerfect, with some of their features....

, IBM Lotus Symphony
IBM Lotus Symphony
IBM Lotus Symphony is a suite of applications for creating, editing, and sharing text, spreadsheet, presentations and other documents, and is currently distributed as freeware. First released in 2007, the suite has a name similar to the 1980s DOS suite Lotus Symphony, but the two programs are...

, Google Docs, Apple Pages and AbiWord
AbiWord
AbiWord is a free and open source software word processor. It was originally started by SourceGear Corporation as the first part of a proposed AbiSuite. Development stopped when SourceGear changed their focus to Internet appliances. AbiWord was adopted by some open source developers and AbiWord...

, can create and read .doc files, although with some limitations. Command line programs for Unix-like
Unix-like
A Unix-like operating system is one that behaves in a manner similar to a Unix system, while not necessarily conforming to or being certified to any version of the Single UNIX Specification....

 operating systems which can convert files from the DOC format to plain text or other standard formats include the wv
Wv (software)
The software library wv, also known as wvware or by its previous name mswordview, is a set of free software programs licensed under the GNU General Public License which can be used for viewing and/or converting files in the Microsoft .doc format to plain text, LaTeX, html or other formats.The wv...

 library, which itself is used directly by AbiWord and as a source of ideas and some coding by KWord.

Specification

Because the .doc file format was a closed specification for many years, inconsistent handling of the format persists and may cause some loss of formatting information when handling the same file with multiple word processing programs. Some specifications for MS Office 97 binary file formats were published in 1997 under a restrictive license, but these specifications were removed from online download in 1999. Specifications of later versions of MS Office binary file formats were not publicly available. The DOC format specification was available from Microsoft on request since 2006 under restrictive RAND-Z terms until February 2008. Following reverse engineering
Reverse engineering
Reverse engineering is the process of discovering the technological principles of a device, object, or system through analysis of its structure, function, and operation...

 the documentation which was done by Sun
Sun Microsystems
Sun Microsystems, Inc. was a company that sold :computers, computer components, :computer software, and :information technology services. Sun was founded on February 24, 1982...

 and OpenOffice.org
OpenOffice.org
OpenOffice.org, commonly known as OOo or OpenOffice, is an open-source application suite whose main components are for word processing, spreadsheets, presentations, graphics, and databases. OpenOffice is available for a number of different computer operating systems, is distributed as free software...

, Microsoft released a .DOC format specification under the Microsoft Open Specification Promise
Microsoft Open Specification Promise
The Microsoft Open Specification Promise , is a promise by Microsoft, published in September 2006, to not assert legal rights over certain Microsoft patents on implementations of an included list of technologies....

. However, this specification does not describe all of the features used by DOC format and reverse engineering remains necessary.

Other file formats

Some historical documentations may use the .doc filename extension for plain-text file format. The .doc filename extension was also used in historical versions of WordPerfect for its proprietary format.

Some software applications use the name "DOC" in combination with other words (such as the name of software manufacturer) for different file formats. As an example, on the Palm OS
Palm OS
Palm OS is a mobile operating system initially developed by Palm, Inc., for personal digital assistants in 1996. Palm OS is designed for ease of use with a touchscreen-based graphical user interface. It is provided with a suite of basic applications for personal information management...

, DOC is shorthand for PalmDoc, a completely unrelated format (commonly using filename extension ".pdb") used to encode text files such as ebooks.

External links

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK