In computing, it may have the following attributes: possibly containing errors, not validated; in sfferent (colloquial) formats; uncoded
A code is a rule for converting a piece of information into another form or representation , not necessarily of the same type....
Format may refer to:* File format, layout for electronic files* Text formatting, typesetting of text elements* Format , a command-line utility in many computer operating systems* Format , a computer command to prepare hard disks...
; and suspect
In the parlance of criminal justice, a suspect is a known person suspected of committing a crime.Police and reporters often incorrectly use the word suspect when referring to the...
, requiring confirmation or citation
Broadly, a citation is a reference to a published or unpublished source . More precisely, a citation is an abbreviated alphanumeric expression Broadly, a citation is a reference to a published or unpublished source (not always the original source). More precisely, a citation is an abbreviated...
. For example, a data input sheet might contain dates as raw data in many forms: "31st January 1999", "31/01/1999", "31/1/99", "31 Jan", or "today". Once captured, this raw data may be processed
Computer data processing is any process that a computer program does to enter data and summarise, analyse or otherwise convert data into usable information. The process may be automated and run on a computer. It involves recording, analysing, sorting, summarising, calculating, disseminating and...
stored as a single format, perhaps a Julian date, so as to be easier for computers and humans to interpret during later processing.
Raw data (sometimes called "sourcey" data or "eggy" data) is the data input to processing. A distinction is sometimes made between data
to the effect that information is the end
product of data
processing. Raw data
that has undergone processing is sometimes referred to as "cooked" data.
Although raw data has the potential to become "information," it requires selective extraction, organization, and sometimes analysis and formatting for presentation.
For example, a point-of-sale terminal (POS terminal) in a busy supermarket collects huge volumes of raw data each day, but that data doesn't yield much information until it is processed. Once processed, the data may indicate the particular items that each customer buys, when they buy them, and at what price. Such information
could then become data
for processing predictive marketing campaigns. As a result of processing, raw data sometimes ends up in a database, which enables the raw data to become accessible for further processing and analysis in any number of different ways.
Sir Timothy John "Tim" Berners-Lee, , also known as "TimBL", is a British computer scientist, MIT professor and the inventor of the World Wide Web...
(inventor of the World Wide Web) proposes that sharing raw data is important. His call to action is "Raw Data Now"
, meaning that everyone should demand that governments and businesses share their information as raw data. He points out that "data drives a huge amount of what happens in our lives… because somebody takes the data and does something with it." To Berners-Lee, it is essentially from this sharing of raw data, that advances in science will emerge.