Raw data
Encyclopedia
'\putang ina

In computing, it may have the following attributes: possibly containing errors, not validated; in sfferent (colloquial) formats; uncoded
Code
A code is a rule for converting a piece of information into another form or representation , not necessarily of the same type....

 or unformatted
Format
Format may refer to:* File format, layout for electronic files* Text formatting, typesetting of text elements* Format , a command-line utility in many computer operating systems* Format , a computer command to prepare hard disks...

; and suspect
Suspect
In the parlance of criminal justice, a suspect is a known person suspected of committing a crime.Police and reporters often incorrectly use the word suspect when referring to the...

, requiring confirmation or citation
Citation
Broadly, a citation is a reference to a published or unpublished source . More precisely, a citation is an abbreviated alphanumeric expression Broadly, a citation is a reference to a published or unpublished source (not always the original source). More precisely, a citation is an abbreviated...

. For example, a data input sheet might contain dates as raw data in many forms: "31st January 1999", "31/01/1999", "31/1/99", "31 Jan", or "today". Once captured, this raw data may be processed
Data processing
Computer data processing is any process that a computer program does to enter data and summarise, analyse or otherwise convert data into usable information. The process may be automated and run on a computer. It involves recording, analysing, sorting, summarising, calculating, disseminating and...

 stored as a single format, perhaps a Julian date, so as to be easier for computers and humans to interpret during later processing.

Raw data (sometimes called "sourcey" data or "eggy" data) is the data input to processing. A distinction is sometimes made between data and information to the effect that information is the end product of data processing. Raw data that has undergone processing is sometimes referred to as "cooked" data.
Although raw data has the potential to become "information," it requires selective extraction, organization, and sometimes analysis and formatting for presentation.

For example, a point-of-sale terminal (POS terminal) in a busy supermarket collects huge volumes of raw data each day, but that data doesn't yield much information until it is processed. Once processed, the data may indicate the particular items that each customer buys, when they buy them, and at what price. Such information could then become data for processing predictive marketing campaigns. As a result of processing, raw data sometimes ends up in a database, which enables the raw data to become accessible for further processing and analysis in any number of different ways.

Tim Berners-Lee
Tim Berners-Lee
Sir Timothy John "Tim" Berners-Lee, , also known as "TimBL", is a British computer scientist, MIT professor and the inventor of the World Wide Web...

(inventor of the World Wide Web) proposes that sharing raw data is important. His call to action is "Raw Data Now", meaning that everyone should demand that governments and businesses share their information as raw data. He points out that "data drives a huge amount of what happens in our lives… because somebody takes the data and does something with it." To Berners-Lee, it is essentially from this sharing of raw data, that advances in science will emerge.

Further reading

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK