All Topics  
Spreadsheet

 

   Email Print
   Bookmark   Link






 

Spreadsheet



 
 
A spreadsheet is a computer application that simulates a paper worksheet
Worksheet

A worksheet is a piece of paper, often preprinted in a way designed to help organize material for learning or clear understanding. Students in a school may have "fill-in-the-blank" sheets of questions, diagrams or maps to help them with their exercises....
. It displays multiple cells that together make up a grid consisting of rows and columns, each cell containing either alphanumeric
Alphanumeric

Alphanumeric is a portmanteau of alphabetic and numeric and is used to describe the collection of Latin alphabet and Arabic numerals used by much of western society....
 text or numeric values. A spreadsheet cell may alternatively contain a formula
Formula

In mathematics and in the sciences, a formula is a concise way of expressing information symbolically , or a general relationship between quantities....
 that defines how the contents of that cell is to be calculated from the contents of any other cell (or combination of cells) each time any cell is updated.






Discussion
Ask a question about 'Spreadsheet'
Start a new discussion about 'Spreadsheet'
Answer questions from other users
Full Discussion Forum



Recent Posts









Encyclopedia


A spreadsheet is a computer application that simulates a paper worksheet
Worksheet

A worksheet is a piece of paper, often preprinted in a way designed to help organize material for learning or clear understanding. Students in a school may have "fill-in-the-blank" sheets of questions, diagrams or maps to help them with their exercises....
. It displays multiple cells that together make up a grid consisting of rows and columns, each cell containing either alphanumeric
Alphanumeric

Alphanumeric is a portmanteau of alphabetic and numeric and is used to describe the collection of Latin alphabet and Arabic numerals used by much of western society....
 text or numeric values. A spreadsheet cell may alternatively contain a formula
Formula

In mathematics and in the sciences, a formula is a concise way of expressing information symbolically , or a general relationship between quantities....
 that defines how the contents of that cell is to be calculated from the contents of any other cell (or combination of cells) each time any cell is updated. Spreadsheets are frequently used for financial
FINANCIAL

FINANCIAL is the weekly English language-language newspaper with offices in Tbilisi, Georgia and Kiev, Ukraine. Published by Intelligence Group LLC, FINANCIAL is focused on opinion leaders and top business decision-makers; It's about world?s largest companies, investing, careers, and small business....
 information because of their ability to re-calculate the entire sheet automatically after a change to a single cell is made.

The related term spreadmart
Spreadmart

A spreadmart is a concept describing the tendency that spreadsheets go havoc in organizations. Typically a spreadmart is created by individuals at different times using different data sources and rules for defining metrics in an organization, creating a fractured view of the enterprise....
 describes the situation that occurs when one or more business analysts develop a system of linked spreadsheets to perform a business analysis, then grow it to a size and degree of complexity that makes it nearly impossible to maintain. Re-implementing the analysis using a datamart architecture often improves both the reliability of the analysis and its maintainability.

Visicalc
VisiCalc

VisiCalc was the first spreadsheet program available for personal computers. It is often considered the application that turned the microcomputer from a hobby for computer hobby into a serious business tool....
 is usually considered the first electronic spreadsheet (although this has been challenged), and it helped turn the Apple II computer into a success and greatly assisted in their widespread application. Lotus 1-2-3
Lotus 1-2-3

Lotus 1-2-3 is a spreadsheet program from Lotus Software . It was the IBM PC's first "killer application"; its huge popularity in the mid-1980s contributed significantly to the success of the IBM PC in the corporate environment....
 was the leading spreadsheet when DOS
DOS

DOS, short for "Disk Operating System", is a shorthand term for several closely related operating systems that dominated the IBM PC compatible market between 1981 and 1995, or until about 2000 if one includes the partially DOS-based Microsoft Windows versions Windows 95, Windows 98, and Windows Me....
 was the dominant operating system. Excel
Microsoft Excel

Microsoft Excel is a spreadsheet-application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot tables and a macro programming language called VBA ....
 is now considered to have the largest market share on the Windows and Macintosh platforms.

History


Paper spreadsheets


The word "spreadsheet" came from "spread" in its sense of a newspaper or magazine item (text and/or graphics) that covers two facing pages, extending across the center fold and treating the two pages as one large one. The compound word "spread-sheet" came to mean the format used to present book-keeping ledgers
General ledger

The general ledger, sometimes known as the nominal ledger, is the main accounting record of a business which uses double-entry bookkeeping....
—with columns for categories of expenditures across the top, invoices listed down the left margin, and the amount of each payment in the cell where its row and column intersect—which were, traditionally, a "spread" across facing pages of a bound ledger (book for keeping accounting records) or on oversized sheets of paper ruled into rows and columns in that format and approximately twice as wide as ordinary paper.

Early implementations


Batch spreadsheet report generators

The concept of an electronic spreadsheet was outlined in the 1961 paper "Budgeting Models and System Simulation" by Richard Mattessich. The subsequent work by Mattessich (1964a, Chpt. 9, Accounting and Analytical Methods) and its companion volume, Mattessich (1964b, Simulation of the Firm through a Budget Computer Program) applied computerized spreadsheets to accounting and budgeting systems (on main-frame computers in FORTRAN IV). Batch Spreadsheets dealt primarily with the addition or subtraction of entire columns or rows - rather than individual cells.

LANPAR spreadsheet compiler
Key invention in the development of electronic spreadsheets was made by Rene K. Pardo and Remy Landau, who filed in 1971 on spreadsheet automatic natural order recalculation algorithm
Algorithm

In mathematics, computing, linguistics and related subjects, an algorithm is a sequence of finite instructions, often used for calculation and data processing....
 in 1970. While the patent was initially rejected by the patent office as being a purely mathematical invention, following 12 years of Appeals, Pardo and Landau won a landmark court case at the CCPA (Predecessor Court of the Federal Circuit) overturning the Patent Office in 1983 - establishing that "something does not cease to become patentable merely because the point of novelty is in an algorithm."

The actual software was called LANPAR - LANguage for Programming Arrays at Random. This was conceived and entirely developed in the summer of 1969 following Pardo and Landau's recent graduation from Harvard University. Co-inventor Rene Pardo recalls that he felt that one manager at Bell Canada should not have to depend on programmers to program and modify budgeting forms, and he thought of letting users type out forms in any order and having computer calculating results in the right order. The software was developed in 1969.

LANPAR was used by Bell Canada, AT&T and the 18 operating telcos nationwide for their local and national budgeting operations. LANPAR was also used by General Motors. Its uniqueness was the incorporation of natural order recalculation, as opposed to left-to-right, top to bottom sequence for calculating the results in each cell that was used by Visicalc, Supercalc and the first version of Multiplan. Without natural order recalculation the users had to manually recalculate the spreadsheet as many times as necessary until the values in all the cells had stopped changing.

The LANPAR system was implemented on GE400 and Honeywell 6000 online timesharing systems enabling users to program remotely via computer terminals and modems. Data could be entered dynamically either by paper tape, specific file access, on line, or even external data bases. Sophisticated mathematical expressions including logical comparisons and "if/then" statements could be used in any cell, and cells could be presented in any order.

Autoplan/Autotab spreadsheet programming language
In 1968, three former employees from the General Electric
General Electric

The General Electric Company, or GE is a multinational corporation United States technology and Service s conglomerate incorporated in the State of New York....
 computer company headquartered in Phoenix, Arizona
Phoenix, Arizona

Phoenix is the capital and largest city in the U.S. state of Arizona, as well as the fifth most populous city in the United States. Phoenix is home to 1,552,259 residents, and is the anchor of the Phoenix Metropolitan Area with 4,179,427 residents....
 set out to start their own software development house. A. Leroy Ellison, Harry N. Cantrell, and Russell E. Edwards found themselves doing a large number of calculations when making tables for the business plans that they were presenting to venture capitalists. They decided to save themselves a lot of effort and wrote a computer program that produced their tables for them. This program, originally conceived as a simple utility for their personal use, would turn out to be the first software product offered by the company that would become known as Capex Corporation
Capex Corporation

Capex Corporation was a software house based in Phoenix, Arizona founded by three former employees of General Electric.It was subsequently acquired by Computer Associates....
. "AutoPlan" ran on GE’s Time-sharing
Time-sharing

Time-sharing refers to sharing a computing resource among many users by Computer multitasking. Its introduction in the 1960s, and emergence as the prominent model of computing in the 1970s, represents a major historical shift in the history of computing....
 service; afterward, a version that ran on IBM mainframe
IBM mainframe

IBM mainframes, though perceived as synonymous with mainframe computers in general due to their marketshare, are now technically and specifically IBM's line of business computers that can all trace their design evolution to the IBM System/360....
s was introduced under the name "AutoTab". (National CSS
National CSS

National CSS, Inc. was a time-sharing firm in the 1960-80s, until its acquisition by Dun & Bradstreet in 1979. NCSS was originally headquartered in Norwalk, Connecticut, but relocated to Wilton, Connecticut in 1978....
 offered a similar product, CSSTAB, which had a moderate timesharing user base by the early 70s. A major application was opinion research tabulation.) AutoPlan/AutoTab was not a WYSIWYG
WYSIWYG

WYSIWYG , is an acronym for What You See Is What You Get, used in computing to describe a system in which content displayed during editing appears very similar to the final output, which might be a printed document, web page, slide presentation or even the lighting for a theatrical event....
 interactive spreadsheet program, it was a simple scripting language for spreadsheets. The user defined the names and labels for the rows and columns, then the formulas that defined each row or column.

Interactive spreadsheets
Interactive spreadsheets became possible when computer display
Computer display

A visual display unit, often called simply a monitor or display, is a piece of electrical equipment which displays images generated from the video output of devices such as computers, without producing a permanent record....
s became available. Except for LANPAR- which allowed many users in real time and online to timesharing systems to simultaneously program or run their spreadsheets from their computer terminals across the country - earlier implementations were mainly designed around batch programs and certainly none of these batch programs allowed for forward referencing of cells or natural order recalculation. In the early 1970s text based computer displays began to be used as input/output devices for interactive transaction processes. This was sufficient for early interactive spreadsheets to be implemented. The lack of on-line historical material relating to such systems, and their limited coverage in academic and commercial publications, makes it hard to assess their level of innovation and ultimate impact.

APLDOT modeling language
An example of an early "industrial weight" spreadsheet was APLDOT, developed in 1976 at the United States Railway Association
United States Railway Association

The United States Railway Association was the Corporation created by federal law that oversaw Conrail.President Nixon signed the Regional Rail Reorganization Act of 1973 into law....
 on an IBM 360/91, running at The Johns Hopkins University Applied Physics Laboratory in Laurel, MD. The application was used successfully for many years in developing such applications as financial and costing models for the US Congress and for Conrail. APLDOT was dubbed a "spreadsheet" because financial analysts and strategic planners used it to solve the same problems they addressed with paper spreadsheet pads.

The spreadsheet concept became widely known in the late 1970s and early 1980s because of Dan Bricklin
Dan Bricklin

Daniel S. Bricklin is the co-creator, with Bob Frankston, of the VisiCalc spreadsheet program. He also founded Software Garden, Inc., of which he is currently president, and Trellix Corporation, which is currently owned by Web.com....
's implementation of VisiCalc
VisiCalc

VisiCalc was the first spreadsheet program available for personal computers. It is often considered the application that turned the microcomputer from a hobby for computer hobby into a serious business tool....
. VisiCalc was the first spreadsheet that combined all essential features of modern spreadsheet applications, such as WYSIWYG
WYSIWYG

WYSIWYG , is an acronym for What You See Is What You Get, used in computing to describe a system in which content displayed during editing appears very similar to the final output, which might be a printed document, web page, slide presentation or even the lighting for a theatrical event....
 interactive user interface, automatic recalculation, status and formula lines, range copying with relative and absolute references, formula building by selecting referenced cells. PC World magazine
PC World (magazine)

PC World is a global computer magazine published monthly by IDG. It offers advice on various aspects of PCs and related items, the Internet, and other personal-technology products and services....
 has called VisiCalc the first electronic spreadsheet.

Bricklin has spoken of watching his university professor create a table of calculation results on a blackboard. When the professor found an error, he had to tediously erase and rewrite a number of sequential entries in the table, triggering Bricklin to think that he could replicate the process on a computer, using the blackboard as the model to view results of underlying formulas. His idea became VisiCalc
VisiCalc

VisiCalc was the first spreadsheet program available for personal computers. It is often considered the application that turned the microcomputer from a hobby for computer hobby into a serious business tool....
, the first application
Application software

Application software is any tool that functions and is operated by means of a computer, with the purpose of supporting or improving the software user 's work....
 that turned the personal computer
Personal computer

A personal computer is any general-purpose computer whose original sales price, size, and capabilities make it useful for individuals, and which is intended to be operated directly by an end user, with no intervening computer operator....
 from a hobby for computer enthusiasts into a business tool.

Visicalc (ibm Pc's Killer Application)
VisiCalc went on to become the first "killer app", an application that was so compelling, people would buy a particular computer just to own it. In this case the computer was the Apple II, and VisiCalc was no small part in that machine's success. The program was later ported to a number of other early computers, notably CP/M
CP/M

CP/M is an operating system originally created for Intel 8080/Intel 8085 based microcomputers by Gary Kildall of Digital Research. Initially confined to single tasking on 8-bit processors and no more than 64 kilobytes of memory, later versions of CP/M added multi-user variations, and were migrated to 16-bit processors....
 machines, the Atari 8-bit family
Atari 8-bit family

The Atari 8-bit family is a series of 8-bit home computers manufactured from 1979 to 1992. All are based on the MOS Technology MOS Technology 6502 central processing unit and were the first home computers designed with custom coprocessor chips, giving them the most powerful graphic, sound and I/O subsystems of any 8 bit machine of their time...
 and various Commodore
Commodore International

Commodore, the commonly used name for Commodore International, was a United States electronics company based in West Chester, Pennsylvania which was a vital player in the home computer/personal computer field in the 1980s....
 platforms. Nevertheless, VisiCalc remains best known as "an Apple II program".

Multi-dimensional Spreadsheets
In the late 1980s and early 1990s, first Javelin Software
Javelin Software

Javelin Software Corporation was a company in Cambridge, Massachusetts, Massachusetts which developed an innovative business model and data analysis product, also called Javelin, and later Javelin Plus....
 and later Lotus Improv appeared and unlike models in a conventional spreadsheet
Spreadsheet

A spreadsheet is a computer application that simulates a paper worksheet. It displays multiple cells that together make up a grid consisting of rows and columns, each cell containing either alphanumeric text or numeric values....
, they utilized models built on objects called variables, not on data in cells of a report. These multi-dimensional spreadsheets enabled viewing data and algorithm
Algorithm

In mathematics, computing, linguistics and related subjects, an algorithm is a sequence of finite instructions, often used for calculation and data processing....
s in various self-documenting ways, including simultaneous multiple synchronized views. For example, users of Javelin could move through the connections between variables on a diagram while seeing the logical roots and branches of each variable. This is an example of what is perhaps its primary contribution of the earlier Javelin--the concept of traceability of a user's logic or model structure through its twelve views. A complex model can be dissected and understood by others who had no role in its creation, and this remains unique even today. Javelin was used primarily for financial modeling, but was also used to build instructional models in college chemistry courses, to model the world's economies, and by the military in the early Star Wars project. It is still in use by institutions for which model integrity is mission critical.

In these programs, a time series
Time series

In statistics, signal processing, and many other fields, a time series is a sequence of data points, measured typically at successive times, spaced at time intervals....
, or any variable, was an object in itself, not a collection of cells which happen to appear in a row or column. Variables could have many attributes, including complete awareness of their connections to all other variables, data references, and text and image notes. Calculations were performed on these objects, as opposed to a range of cells, so adding two time series automatically aligns them in calendar time, or in a user-defined time frame. Data were independent of worksheets—variables, and therefore data, could not be destroyed by deleting a row, column or entire worksheet. For instance, January's costs are subtracted from January's revenues, regardless of where or whether either appears in a worksheet. This permits actions later used in pivot table
Pivot table

A pivot table is a data summarization tool found in data visualization programs such as spreadsheets . Among other functions, they can automatically sort, count, and total the data stored in one table or spreadsheet and create a second table displaying the summarized data....
s, except that flexible manipulation of report tables was but one of many capabilities supported by variables. Moreover, if costs were entered by week and revenues by month, Javelin's program could allocate or interpolate as appropriate. This object design enabled variables and whole models to reference each other with user-defined variable names, and to perform multidimensional analysis and massive, but easily editable consolidations.

Lotus 1-2-3 and other MS-DOS spreadsheets

The acceptance of the IBM PC
IBM PC

The IBM Personal Computer, commonly known as the IBM PC, is the original version and progenitor of the IBM PC compatible hardware platform ....
 following its introduction in August, 1981, began slowly, because most of the programs available for it were ports from other 8-bit platforms. Things changed dramatically with the introduction of Lotus 1-2-3
Lotus 1-2-3

Lotus 1-2-3 is a spreadsheet program from Lotus Software . It was the IBM PC's first "killer application"; its huge popularity in the mid-1980s contributed significantly to the success of the IBM PC in the corporate environment....
 in November, 1982, and release for sale in January, 1983. It became that platform's killer app, and drove sales of the PC due to the improvements in speed and graphics compared to VisiCalc.

Lotus 1-2-3, along with its competitor Borland
Borland

Borland Software Corporation is a Computer software company headquartered in Austin, Texas. It was founded in 1983 by Niels Jensen, Ole Henriksen, Mogens Glad and Philippe Kahn....
 Quattro
Quattro Pro

Quattro Pro is a spreadsheet program developed by Borland and now sold by Corel, most often as part of Corel's WordPerfect Office.Historically, Quattro Pro used keyboard command similar to Lotus 1-2-3....
, soon displaced VisiCalc. Lotus 1-2-3 was released on January 26, 1983, started outselling then-most-popular VisiCalc
VisiCalc

VisiCalc was the first spreadsheet program available for personal computers. It is often considered the application that turned the microcomputer from a hobby for computer hobby into a serious business tool....
 the very same year, and for a number of years was the leading spreadsheet for DOS
DOS

DOS, short for "Disk Operating System", is a shorthand term for several closely related operating systems that dominated the IBM PC compatible market between 1981 and 1995, or until about 2000 if one includes the partially DOS-based Microsoft Windows versions Windows 95, Windows 98, and Windows Me....
.

Microsoft Excel

Microsoft
Microsoft

Microsoft Corporation is a multinational corporation computer technology corporation that develops, manufactures, licenses, and supports a wide range of computer software products for computing devices....
 had been developing Excel
Microsoft Excel

Microsoft Excel is a spreadsheet-application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot tables and a macro programming language called VBA ....
 on the Macintosh platform for several years at this point, where it had developed into a fairly powerful system. A port of Excel to Windows 2.0 resulted in a fully functional Windows spreadsheet. The more robust Windows 3.x platforms of the early 1990s made it possible for Excel to take market share from Lotus. By the time Lotus responded with usable Windows products, Microsoft had started compiling their Office
Microsoft Office

Microsoft Office is a popular set of interrelated desktop applications, servers and services. Microsoft Office is collectively referred to as an office suite, for the Microsoft Windows and Mac OS X operating systems....
 suite. Starting in the mid 1990s continuing through the present, Microsoft Excel has dominated the commercial electronic spreadsheet market.

Apple Numbers

Numbers
Numbers (software)

Numbers is a spreadsheet application developed by Apple Inc. as part of the iWork productivity suite alongside Keynote and Pages. Numbers 1.0 was announced on August 7 2007 and runs on Mac OS X v10.4 and Mac OS X v10.5 only....
 is Apple Inc.'s spreadsheet software, part of iWork
IWork

iWork is a office suite of applications created by Apple Inc., which contains Pages, a word processing and desktop publishing application; Keynote , a presentation package; and Numbers , a spreadsheet application....
. It focuses on usability and the elegance of chart presentation. Numbers completed Apple's productivity suite, making it a viable competitor to Microsoft Office
Microsoft Office

Microsoft Office is a popular set of interrelated desktop applications, servers and services. Microsoft Office is collectively referred to as an office suite, for the Microsoft Windows and Mac OS X operating systems....
. It lacks features such as pivot table
Pivot table

A pivot table is a data summarization tool found in data visualization programs such as spreadsheets . Among other functions, they can automatically sort, count, and total the data stored in one table or spreadsheet and create a second table displaying the summarized data....
s.

OpenOffice.org Calc

OpenOffice.org Calc
OpenOffice.org Calc

OpenOffice.org Calc is the spreadsheet component of the OpenOffice.org software package.Calc is similar to Microsoft Excel, with a roughly equivalent range of features....
 is a freely available, open-source program modelled after Microsoft Excel
Microsoft Excel

Microsoft Excel is a spreadsheet-application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot tables and a macro programming language called VBA ....
. Calc can both open and save in the Excel (XLS) file format. Calc can be acquired as both an installation file and a portable program, capable of being run from a device such as a USB memory drive. It can be downloaded from the OpenOffice.org website.

Gnumeric

Gnumeric
Gnumeric

Gnumeric is a free software spreadsheet program that is part of the GNOME desktop and has Windows installers available. It is intended to be a free replacement for proprietary software spreadsheet programs such as Microsoft Excel, which it broadly and openly emulates....
 is a free
Free software

Free Software or software libre is software that can be used, studied, and modified without restriction, and which can be copied and redistributed in modified or unmodified form either without restriction, or with minimal restrictions only to ensure that further recipients can also do these things and to prevent consumer-facing hardware...
 spreadsheet
Spreadsheet

A spreadsheet is a computer application that simulates a paper worksheet. It displays multiple cells that together make up a grid consisting of rows and columns, each cell containing either alphanumeric text or numeric values....
 program that is part of the GNOME
Gnome

A gnome is a mythical creature characterized by its extremely small size and wiktionary:subterranean lifestyle. The word gnome is derived from the New Latin gnomus....
 desktop and has Windows installers available. It is intended to be a free replacement for proprietary
Proprietary software

Proprietary software is a term coined by advocates of the free software movement to describe computer software which is the legal property of one party....
 spreadsheet programs such as Microsoft Excel
Microsoft Excel

Microsoft Excel is a spreadsheet-application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot tables and a macro programming language called VBA ....
, which it broadly and openly emulates. Gnumeric was created and developed by Miguel de Icaza
Miguel de Icaza

Miguel de Icaza is a Mexico free software programmer, best known for starting the GNOME and Mono projects....
, and the current maintainer is Jody Goldberg.

Gnumeric has the ability to import and export data in several file formats, including CSV
Comma-separated values

A Comma separated values file is a computer data file used for implementing the tried and true organizational tool, the Comma Separated List....
, Microsoft Excel
Microsoft Excel

Microsoft Excel is a spreadsheet-application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot tables and a macro programming language called VBA ....
, HTML
HTML

HTML, an Acronym and initialism of HyperText Markup Language, is the predominant markup language for Web pages. It provides a means to describe the structure of text-based information in a document?by denoting certain text as links, headings, paragraphs, lists, and so on?and to supplement that text with interactive forms, embedded '...
, LaTeX
LaTeX

LaTeX is a document markup language and Word processor for the TeX typesetting program. Within the typesetting system, its name is styled as ....
, Lotus 1-2-3
Lotus 1-2-3

Lotus 1-2-3 is a spreadsheet program from Lotus Software . It was the IBM PC's first "killer application"; its huge popularity in the mid-1980s contributed significantly to the success of the IBM PC in the corporate environment....
, OpenDocument
OpenDocument

The OpenDocument format is a file format for electronic office documents such as spreadsheets, charts, presentation programs and word processor documents....
 and Quattro Pro
Quattro Pro

Quattro Pro is a spreadsheet program developed by Borland and now sold by Corel, most often as part of Corel's WordPerfect Office.Historically, Quattro Pro used keyboard command similar to Lotus 1-2-3....
; its native format is the Gnumeric file format (.gnm or .gnumeric), an XML file compressed with gzip
Gzip

gzip is a software application used for file compression. gzip is short for GNU zip; the program is a free software replacement for the compress program used in early Unix systems, intended for use by the GNU Project....
. It includes all of the spreadsheet function
Subroutine

In computer science, a subroutine or subprogram is a portion of computer code within a larger computer program, which performs a specific task and is relatively independent of the remaining code....
s of the North America
North America

North America is the northern continent of the Americas, situated in the Earth's northern hemisphere and almost totally in the western hemisphere....
n edition of Microsoft Excel
Microsoft Excel

Microsoft Excel is a spreadsheet-application written and distributed by Microsoft for Microsoft Windows and Mac OS X. It features calculation, graphing tools, pivot tables and a macro programming language called VBA ....
 and many functions unique to Gnumeric. Pivot table
Pivot table

A pivot table is a data summarization tool found in data visualization programs such as spreadsheets . Among other functions, they can automatically sort, count, and total the data stored in one table or spreadsheet and create a second table displaying the summarized data....
s and conditional formatting are not yet supported but are planned for future versions. Gnumeric's accuracy has helped it to establish a niche among people using it for statistical analysis and other scientific tasks. For improving the accuracy of Gnumeric, the developers are cooperating with the R Project.

Web based spreadsheets

With the advent of advanced web
World Wide Web

The World Wide Web is a very large set of interlinked hypertext documents accessed via the Internet. With a Web browser, one can view Web pages that may contain writing, s, videos, and other multimedia and navigate between them using hyperlinks....
 technologies such as Ajax
Ajax (programming)

Ajax, or AJAX , is a group of interrelated web development techniques used to create interactive web applications or rich Internet applications....
 circa 2005, a new generation of online spreadsheets
List of online spreadsheets

This is a list of online spreadsheets. Online spreadsheets are online applications that allow users to create a spreadsheet, have multiple persons edit it and share it with the world....
 has emerged. Equipped with a rich Internet application
Rich Internet application

Rich Internet applications are web applications that have some of the characteristics of Application software, typically delivered by way of an Ajax framework, Proprietary software web browser Plug-in s, advanced javascript compiler technology, or independently via Sandbox es or virtual machines....
 user experience, the best web based online spreadsheets have many of the features seen in desktop spreadsheet applications. Some of them have strong multi-user collaboration features. Some of them offer real time
Real Time

Real Time is a webcast based on the long-running United Kingdom science fiction television series Doctor Who which was then subsequently released on CD....
 updates from remote sources such as stock prices and currency exchange rate
Exchange rate

In finance, the exchange rates between two currency specifies how much one currency is worth in terms of the other. It is the value of a foreign nation?s currency in terms of the home nation?s currency....
s.

Other spreadsheets

  • A list of current spreadsheet software
    • Lotus Symphony
      Lotus Symphony

      IBM Lotus Symphony is a software suite for creating, editing, and sharing text, spreadsheet, presentations and other documents. Originally released in July 1984 as an integrated software application for DOS, IBM revived the Symphony name for a new office suite that was first released free of charge in 2007....
       (2007)
    • KSpread
      KSpread

      KSpread is a free software spreadsheet application that is part of KOffice, an integrated office suite for the KDE Desktop Environment.Among KSpread features are multiple sheets per document, assorted formatting possibilities, support for more than 100 built-in functions, templates, chart, spell-check, hyperlinks, data sorting and scripting...
    • ZCubes
      ZCubes

      ZCubes is free web-based software that is intended for the creation of documents including web pages, web sites, spreadsheets, presentations, drawings, albums, and Web portal....
      -Calci
    • Resolver One
      Resolver One

      Resolver One is a Spreadsheet program created by Resolver Systems with the IronPython programming language. It is a desktop application, with a web server version, and intended as a platform for the rapid development of business applications....
  • Discontinued spreadsheet software
    • Advantage
      Advantage

      Advantage may refer to:*A beneficial factor or combination of factors.*In tennis, advantage is when one player wins a point from a deuce and needs one more point to win the game....
    • Lotus Improv
      Lotus Improv

      Lotus Improv was a spreadsheet program from Lotus Development that attempted to re-define the way a spreadsheet should work....
    • Javelin Software
      Javelin Software

      Javelin Software Corporation was a company in Cambridge, Massachusetts, Massachusetts which developed an innovative business model and data analysis product, also called Javelin, and later Javelin Plus....
    • Lotus Jazz
      Lotus Jazz

      Lotus Jazz was an Office suite for the Apple, Inc. Macintosh, released in 1985 for $595, after the substantial success of Lotus 1-2-3 for the IBM PC compatible....
       for Macintosh
    • MultiPlan
      MultiPlan

      Multiplan was an early spreadsheet program developed by Microsoft. Known initially by the List of computer technology code names "EP" , it was introduced in 1982 as a competitor for VisiCalc....
    • Borland's Quattro Pro
      Quattro Pro

      Quattro Pro is a spreadsheet program developed by Borland and now sold by Corel, most often as part of Corel's WordPerfect Office.Historically, Quattro Pro used keyboard command similar to Lotus 1-2-3....
    • Silk
      Silk

      Silk is a natural protein fiber, some forms of which can be weaving into textiles. The best-known type of silk is obtained from Pupa#Cocoons made by the larvae of the mulberry silkworm Bombyx mori reared in captivity ....
    • SuperCalc
      SuperCalc

      SuperCalc was a spreadsheet application published by Sorcim in 1980, and originally bundled as part of the CP/M software package included with the Osborne 1 portable computer....
    • Lotus Symphony
      Lotus Symphony

      IBM Lotus Symphony is a software suite for creating, editing, and sharing text, spreadsheet, presentations and other documents. Originally released in July 1984 as an integrated software application for DOS, IBM revived the Symphony name for a new office suite that was first released free of charge in 2007....
       (1984)
    • Wingz
      Informix Wingz

      Wingz was a spreadsheet program sold by Informix in the late 1980s and early 1990s. Originally developed for the Apple Macintosh, it was later distributed on Microsoft Windows, OS/2, NextStep and several other commercial flavors of Unix....
       for Macintosh
    • Target Planner Calc for CP/M and TRS-DOS


Other products

A number of companies have attempted to break into the spreadsheet market with programs based on very different paradigms. Lotus introduced what is likely the most successful example, Lotus Improv
Lotus Improv

Lotus Improv was a spreadsheet program from Lotus Development that attempted to re-define the way a spreadsheet should work....
, which saw some commercial success, notably in the financial world where its powerful data mining
Data mining

Data mining is the process of extracting hidden patterns from data. As more data is gathered, with the amount of data doubling every three years, data mining is becoming an increasingly important tool to transform this data into information....
 capabilities remain well respected to this day. Spreadsheet 2000
Spreadsheet 2000

Spreadsheet 2000 was a spreadsheet program for Apple Macintosh computers, published by Casady & Greene, a distributor of many "smaller" Mac releases....
 attempted to dramatically simplify formula construction, but was generally not successful.

Concepts


Cells

A "cell" can be thought of as a box for holding a datum
Datum

A geodetic datum is a reference from which measurements are made. In surveying and geodesy,a datum is a set of reference points on the earth's surface against which position measurements are made, and an associated model of the shape of the earth to define a geographic coordinate system....
. A single cell is usually referenced by its column and row (A2 would represent the cell below containing the value 10). Its physical size can usually be tailored for its content by dragging its height or width at box intersections (or for entire columns or rows by dragging the column or rows headers).

An array
Array

In computer science, an array is a data structure consisting of a group of element s that are accessed by index . In most programming languages each element has the same data type and the array occupies a contiguous area of computer memory....
 of cells is called a "sheet" or "worksheet". It is analogous to an array of variable
Variable

A variable is a symbol that stands for a value that may vary; the term usually occurs in opposition to constant, which is a symbol for a non-varying value, i.e....
s in a conventional computer program
Computer program

Computer programs are Instruction for a computer. A computer requires programs to function. Moreover, a computer program does not run unless its instructions are executed by a Central processing unit; however, a program may communicate an Algorithm#Formalization of algorithms to people without running....
 (although certain unchanging values, once entered, could be considered, by the same analogy, constants). In most implementations, many worksheets may be located within a single spreadsheet. A worksheet is simply a subset of the spreadsheet divided for the sake of clarity. Functionally, the spreadsheet operates as a whole and all cells operate as global variable
Global variable

In computer programming, a global variable is a variable that is accessible in every scope . Interaction mechanisms with global variables are called global environment mechanisms....
s within the spreadsheet ('read' access only except its own containing cell).

A cell may contain a value
Value (computer science)

In computer science, a value is a sequence of bits that is interpreted according to some data type. It is possible for the same sequence of bits to have different values, depending on the type used to interpret its meaning....
 or a formula
Formula

In mathematics and in the sciences, a formula is a concise way of expressing information symbolically , or a general relationship between quantities....
, or it may simply be left empty. By convention, formulas usually begin with = sign.

Values
A value can be entered from the computer keyboard by directly typing into the cell itself. Alternatively, a value can be based on a formula (see below), which might perform a calculation, display the current date or time, or retrieve external data such as a stock quote or a database value.

The Spreadsheet Value Rule

Computer scientist Alan Kay
Alan Kay

Alan Curtis Kay is an United States computer scientist, known for his early pioneering work on object-oriented programming and Window graphical user interface design....
 used the term
value rule to summarize a spreadsheet's operation: a cell's value relies solely on the formula the user has typed into the cell. The formula may rely on the value of other cells, but those cells are likewise restricted to user-entered data or formulas. There are no 'side effects' to calculating a formula: the only output is to display the calculated result inside its occupying cell. There is no natural mechanism for permanently modifying the contents of a cell unless the user manually modifies the cell's contents. In the context of programming languages, this yields a limited form of first-order functional programming
Functional programming

In computer science, functional programming is a programming paradigm that treats computation as the evaluation of function s and avoids program state and immutable object data....
.


Automatic recalculation
A standard of spreadsheets since the mid 80s , this optional feature eliminates the need to manually request the spreadsheet program to recalculate values (nowadays typically the default option unless specifically 'switched off' for large spreadsheets, usually to improve performance). Some earlier spreadsheets required a manual request to recalculate, since recalculation of large or complex spreadsheets often reduced data entry speed. Many modern spreadsheets still retain this option.

Real time update
This feature refers to updating a cell's contents periodically when its value is derived from an external source - such as a cell in another "remote" spreadsheet. For shared, web-based spreadsheets, it applies to "immediately" updating cells that have been altered by another user. All dependent cells have to be updated also.

Formula

A formula identifies the calculation
Calculation

A calculation is a deliberate process for transforming one or more inputs into one or more results, with variable change.The term is used in a variety of senses, from the very definite arithmetical calculation using an algorithm to the vague heuristics of calculating a strategy in a competition or calculating the chance of a successful rela...
 needed to place the result in the cell it is contained within. A cell containing a formula therefore has two display components; the formula itself and the resulting value. The formula is normally only shown when the cell is selected by "clicking" the mouse over a particular cell; otherwise it contains the result of the calculation.

A formula assigns values to a cell or range of cells, and typically has the format:
=expression
where the expression
Expression (programming)

An expression in a programming language is a combination of value s, variables, operator s, and function s that are interpreted according to the particular Order of operations and of association for a particular programming language, which computes and then produces another value....
 consists of:
  • values
    Value (computer science)

    In computer science, a value is a sequence of bits that is interpreted according to some data type. It is possible for the same sequence of bits to have different values, depending on the type used to interpret its meaning....
    , such as 2, 9.14 or 6.67E-11;
  • references
    Reference (computer science)

    In computer science, a reference is an object containing information about how to locate and access the particular data item, as opposed to containing the data itself....
     to other cells, such as, e.g., A1 for a single cell or B1:B3 for a range;
  • arithmetic operators
    Operator (programming)

    Programming languages generally support a set of operators that are similar to operator. A language may contain a fixed number of built-in operators or it may allow the creation of programmer-defined operators ....
    , such as +, -, *, /, and others;
  • relational operator
    Relational operator

    In computer science a relational operator is a programming language construct or Operator that tests some kind of relation between Binary function....
    s, such as >=, <, and others; and,
  • functions, such as SUM, TAN, and many others.


When a cell contains a formula, it often contains references to other cells. Such a cell reference is a type of variable. Its value is the value of the referenced cell or some derivation of it. If that cell in turn references other cells, the value depends on the values of those. References can be relative (e.g., A1, or B1:B3), absolute (e.g., $A$1, or $B$1:$B$3) or mixed row-wise or column-wise absolute/relative (e.g., $A1 is column-wise absolute and A$1 is row-wise absolute).

The available options for valid formulas depends on the particular spreadsheet implementation but, in general, most arithmetic operations and quite complex nested conditional operations can be performed by most of today's commercial spreadsheets. Modern implementations also offer functions to access custom-build functions, remote data, and applications.

A formula may contain a condition (or nested conditions) - with or without an actual calculation - and is sometimes used purely to identify and
highlight errors. In the example below, it is assumed the sum of a column of percentages (A1 through A6) is tested for validity and an explicit message put into the adjacent right-hand cell.

=IF(SUM(A1:A6) > 100, "More than 100%", SUM(A1:A6))

A spreadsheet does not, in fact, have to contain any formulas at all, in which case it could be considered merely a collection of data arranged in rows and columns (a database
Database

A database is a structured collection of records or data that is stored in a computer system. The structure is achieved by organizing the data according to a database model....
) like a calendar, timetable or simple list. Because of its ease of use, formatting and hyperlinking capabilities, many spreadsheets are used solely for this purpose.

Locked cell
Once entered, selected cells (or the entire spreadsheet) can optionally be "locked" to prevent accidental overwriting. Typically this would apply to cells containing formulas but might be applicable to cells containing "constants" such as a kilogram/pounds conversion factor (2.20462262 to eight decimal places). Even though individual cells are marked as locked, the spreadsheet data is not protected until the feature is activated in the file preferences.

Data format
A cell or range can optionally be defined to specify how the value is displayed. The default display format is usually set by its initial content if not specifically previously set, so that for example "31/12/2007" or "31 Jan 2007" would default to the cell format of "date". Similarly adding a % sign after a numeric value would tag the cell as a percentage
Percentage

In mathematics, a percentage is a way of expressing a number as a fraction of 100 . It is often denoted using the percent sign, "%". For example, 45% is equal to 45 / 100, or 0.45....
 cell format. The cell contents are not changed by this format, only the displayed value.

Some cell formats such as "numeric" or "currency
Currency

A currency is a Medium of exchange, facilitating the trade of goods and/or Service s. It is coins and paper bills used as money. It is one form of money, where money is anything that serves as a medium of exchange, a store of value, and a standard of value....
" can also specify the number of decimal places.

This can allow invalid operations (such as doing multiplication on a cell containing a date), resulting in illogical results without an appropriate warning.

Text format
Each cell (like its counterpart the "word" in a word processor
Word processor

A word processor is a computer Application software used for the production of any sort of printable material.Word processor may also refer to an obsolete type of stand-alone office machine, popular in the 1970s and 80s, combining the keyboard text-entry and printing functions of an electric typewriter with a dedicated computer for th...
) can be separately defined in terms of its displayed format. Any cell or range of cells can be highlighted in several different ways such as use of
bold text, colour, font, text size and so on.

These attribute
Attribute

The word "attribute" can refer to:* In philosophy, property , an abstraction of a characteristic of an entity or substance* In art, an object that identifies a figure, most commonly referring to objects held by saints - see emblem...
s typically do not alter the data content in any way and some formatting may be lost or altered when copying spreadsheet data between different implementations or software versions. In some implementations, the format may be conditional upon the data within the cell - for example, a value may be displayed red if it is negative.

Named cells
In most implementations, a cell can be "named" so that even if the cell is "cut and pasted" to a new location within the spreadsheet, its reference always remains intact. Names must be unique within the spreadsheet and, once defined, can then be used instead of a "normal" cell reference. When using multiple sheets in a spreadsheet file, an identically named cell range on each sheet can be used if it is distinguised by adding the sheet name. A primary reason for this usage is for creating or running macros that repeat a command across many sheets.

Cell reference
A cell reference is the name of some cell in some spreadsheet. Most cell references indicate another cell in the same spreadsheet, but a cell reference can also refer to a cell in a different sheet within the same spreadsheet, or (depending on the implementation) to a cell in another spreadsheet entirely, or to a value from a remote application.

A typical
cell reference in "A1" style consists of one or two case-insensitive letters to identify the column (if there are up to 256 columns: A-Z and AA-IV) followed by a row number (e.g. in the range 1-65536). Either part can be relative (it changes when the formula it is in is moved or copied), or absolute (indicated with $ in front of the part concerned of the cell reference). The older "R1C1" reference style consists of the letter R, the row number, the letter C, and the column number; relative row or column numbers are indicated by enclosing the number in square brackets. Most current spreadsheets use the A1 style, some providing the R1C1 style as a compatibility option.

When the computer calculates a formula in one cell to update the displayed value of that cell, cell reference(s) in that cell, naming some other cell(s), cause the computer to fetch the value of the named cell(s).

A cell on the same "sheet" is usually addressed as:-

=A1

A cell on a different sheet of the same spreadsheet is usually addressed as:-

=SHEET2!A1 (that is; the first cell in sheet 2 of same spreadsheet).

Some spreadsheet implementations allow a cell references to another spreadsheet (not the current open and active file) on the same computer or a local network. It may also refer to a cell in another open and active spreadsheet on the same computer or network that is defined as shareable. These references contain the complete filename, such as:-

='C:\Documents and Settings\Username\My spreadsheets\[main sheet]Sheet1!A1

In a spreadsheet, references to cells are automatically updated when new rows or columns are inserted or deleted. Care must be taken however when adding a row immediately before a set of column totals to ensure that the totals reflect the additional rows values - which often they do not!

A circular reference
Circular reference

A circular reference, sometimes referred to as a run-around, is a series of references where the last object references the first, thus causing the whole series of references to be unusable....
 occurs when the formula in one cell has a reference that directly -- or indirectly, through a chain of references, each one pointing to another cell that has another reference to the next cell on the chain -- points to the one cell. Many common kinds of errors cause such circular references. However, there are some valid techniques that use such circular references. Such techniques, after many recalculations of the spreadsheet, (usually) converge on the correct values for those cells.

Cell Ranges
A reference to a range of cells is typically of the form (A1:A6) which specifies all the cells in the range A1 through to A6. A formula such as "=SUM(A1:A6)" would add all the cells specified and put the result in the cell containing the formula itself.

Sheets

In the earliest spreadsheets, cells were a simple two-dimensional grid. Over time, the model has been expanded to include a third dimension, and in some cases a series of named grids, called
sheets. The most advanced examples allow inversion and rotation operations which can slice and project the data set in various ways.

Remote Spreadsheet

Whenever a reference is made to a cell or group of cells that are not located within the current physical spreadsheet file, it is considered as accessing a "remote" spreadsheet. The contents of the referenced cell may be accessed either on first reference with a manual update or more recently in the case of web based spreadsheets, as a near real time value with a specified automatic refresh interval.

Charts

Many spreadsheet applications permit chart
Chart

and A chart is a visual representation of data, in which the data are represented by symbols such as bars in a bar chart or lines in a line chart....
s, graphs
Graph (data structure)

In computer science, a graph is a kind of data structure, specifically an abstract data type , that consists of a Set of nodes and a set of edges that establish relationships between the nodes....
 or histogram
Histogram

In statistics, a histogram is a graphical display of tabulated frequency , shown as bars. It shows what proportion of cases fall into each of several Categorization....
s to be generated from specified groups of cells which are dynamically re-built as cell contents change. The generated graphic component can either be embedded within the current sheet or added as a separate object.

Programming issues

Just as the early programming languages were designed to generate spreadsheet printouts, programming techniques themselves have evolved to process tables (also known as spreadsheets or matrices
Matrix (mathematics)

In mathematics, a matrix is a rectangular array of numbers, as shown at the right. In addition to a number of elementary, entrywise operations such as matrix addition a key notion is matrix multiplication....
) of data more efficiently in the computer itself.

Spreadsheets have evolved to use powerful programming languages like VBA
Visual Basic for Applications

Visual Basic for Applications is an implementation of Microsoft Event-driven programming programming language Visual Basic, and associated integrated development environment , which is built into most Microsoft Office applications....
; specifically, they are functional
Functional programming

In computer science, functional programming is a programming paradigm that treats computation as the evaluation of function s and avoids program state and immutable object data....
, visual
Visual programming language

A visual programming language is any programming language that lets users create computer program by manipulating program elements graphically rather than by specifying them textually....
, and multiparadigm
Multi-paradigm programming language

A multi-paradigm programming language is a programming language that supports more than one programming paradigm. As Lead designer Tim Budd holds it: The idea of a multiparadigm language is to provide a framework in which programmers can work in a variety of styles, freely intermixing constructs from different paradigms. The design goal...
 languages.

Many people find it easier to perform calculations in spreadsheets than by writing the equivalent sequential program. This is due to two traits of spreadsheets.
  • They use spatial
    Space

    Space is the boundless, three-dimensional extent in which Physical body and events occur and have relative position and direction. Physical space is often conceived in three linear dimensions, although modern physics usually consider it, with time, to be part of the boundless four-dimensional continuum known as spacetime....
     relationships to define program relationships. Like all animals, humans have highly developed intuition
    Intuition (knowledge)

    Intuition is the apparent ability to acquire knowledge without inference or the use of reason.?The word ?intuition? comes from the Latin word 'intueri', which is often roughly translated as meaning ?to look inside? or ?to contemplate?."...
    s about spaces, and of dependencies between items. Sequential programming usually requires typing line after line of text, which must be read slowly and carefully to be understood and changed.
  • They are forgiving, allowing partial results and functions to work. One or more parts of a program can work correctly, even if other parts are unfinished or broken. This makes writing and debugging programs much easier, and faster . Sequential programming usually needs every program line and character to be correct for a program to run. One error usually stops the whole program and prevents any result.


A
'spreadsheet program' is designed to perform general computation tasks using spatial relationships rather than time as the primary organizing principle..

It is often convenient to think of a spreadsheet as a mathematical graph
Graph theory

In mathematics and computer science, graph theory is the study of graph : mathematical structures used to model pairwise relations between objects from a certain collection....
, where the nodes are spreadsheet cells, and the edges are references to other cells specified in formulas. This is often called the dependency graph of the spreadsheet. References between cells can take advantage of spatial concepts such as relative position and absolute position, as well as named locations, to make the spreadsheet formulas easier to understand and manage.

Spreadsheets usually attempt to automatically update cells when the cells on which they depend have been changed. The earliest spreadsheets used simple tactics like evaluating cells in a particular order, but modern spreadsheets compute a minimal recomputation order from the dependency graph. Later spreadsheets also include a limited ability to propagate values in reverse, altering source values so that a particular answer is reached in a certain cell. Since spreadsheet cells formulas are not generally invertible, though, this technique is of somewhat limited value.

Many of the concepts common to sequential programming models have analogues in the spreadsheet world. For example, the sequential model of the indexed loop
For loop

In computer science a for loop is a programming language statement which allows code to be repeatedly execution . A for loop is classified as an iteration statement....
 is usually represented as a table of cells, with similar formulas (normally differing only in which cells they reference).

Shortcomings


While spreadsheets are a great step forward in quantitative modeling, they have deficiencies. At the level of overall user benefits, spreadsheets have four main shortcomings.
  • Spreadsheets have significant reliability problems. Research studies estimate that roughly 94% of spreadsheets deployed in the field contain errors, and 5.2% of cells in unaudited spreadsheets contain errors.


  • The practical expressiveness of spreadsheets is limited. Several factors contribute to this limitation. Implementing a complex model requires implementing detailed layouts, cell-at-a-time. Authors have difficulty remembering the meanings of hundreds or thousands of cell addresses that appear in formulas.


  • Collaboration in authoring spreadsheet formulas is difficult because such collaboration must occur at the level of cells and cell addresses. By comparison, programming languages aggregate cells with similar meaning into indexed variables with names that indicate meaning. Although some spreadsheets have good collaboration features, authoring at the level of cells and cell formulas remains a significant obstacle to collaboration in authoring spreadsheet models. On the other hand, many people collaborate on entering numerical data and many people can use the same spreadsheet.


  • Productivity of spreadsheet modelers is reduced by the cell-level focus of spreadsheets. Even conceptually simple changes in spreadsheets (such as changing starting or ending time or time grain, adding new members or a level of hierarchy to a dimension, or changing one conceptual formula that is represented as hundreds of cell formulas) often require large numbers of manual cell-level operations (such as inserting or deleting cells/rows/columns, editing and copying formulas, re-laying out worksheets). Each of these manual corrections increases the risk of introducing further mistakes.


These four deficiencies in high-level benefits have deeper causes that, ironically, flow directly from the signature strength of spreadsheets (that they capture the structure of models in terms of WYSIWYG sheet layout for authors and report users).

  • Spreadsheets capture model logic in terms of sheet layout, especially contiguous layout of cells in a table. Spreadsheets have weak or nonexistent methods to capture higher level structures such as named variables, segmentation dimensions, and time series.


  • Formulas are subordinated to the cell layout. This forces the sheet layout to carry the structure of the model, not variables and formulas that relate variables. This also causes a large proliferation of cells, formulas and cell-level tasks even when only a few basic concepts are involved in a model. This forces authors to think and work at the level of cells instead of at the level of the natural concepts and structures of the model.


  • Formulas expressed in terms of cell addresses are hard to keep straight and hard to audit. Research shows that spreadsheet auditors who check numerical results and cell formulas find no more errors than auditors who only check numerical results .


  • Proliferation of error-prone manual cell-level operations contributes to all four of the high-level problems listed above.


Other problems associated with spreadsheets include:

  • Some sources advocate the use of specialized software instead of spreadsheets for some applications (budgeting, statistics)Excel's critics say that Excel is fundamentally unsuited for budgeting, forecasting, and other activities that involve collaboration or consolidation. Are they correct?
  • Many spreadsheet software products, such as Excel (versions prior to 2007) and OpenOffice.org Calc, have a capacity limit of about 65,000 rows. This can present a problem for people using very large datasets, and may result in lost data.
  • Lack of auditing and revision control
    Revision control

    Revision control is the management of multiple revisions of the same unit of information. It is most commonly used in engineering and software development to manage ongoing development of digital documents like application source code, art resources such as blueprints or electronic models, and other projects that may be worked on by a team o...
    . This makes it difficult to determine who changed what and when. This can cause problems with regulatory compliance. Lack of revision control greatly increases the risk of errors due the inability to track, isolate and test changes made to a document.
  • Lack of security
    Computer security

    Computer security is a branch of technology known as information security as applied to computers. The objective of computer security can include protection of information from theft or corruption, or the preservation of availability, as defined in the security policy....
    . Generally, if one has permission to open a spreadsheet, one has permission to modify any part of it. This, combined with the lack of auditing above, can make it easy for someone to commit fraud
    Fraud

    In the broadest sense, a fraud is a deception made for personal gain or to damage another individual. The specific legal definition varies by legal jurisdiction....
    .
  • Because they are loosely structured, it is easy for someone to introduce an error
    Error

    The word error has different meanings and usages relative to how it is conceptually applied. The concrete meaning of the Latin word error means "wandering" or "straying"....
    , either accidentally or intentionally, by entering information in the wrong place or expressing dependencies among cells (such as in a formula) incorrectly.
  • The results of a formula (example "=A1*B1") applies only to a single cell (that is, the cell the formula is actually located in - in this case perhaps C1), even though it can "extract" data from many other cells, and even real time
    Real Time

    Real Time is a webcast based on the long-running United Kingdom science fiction television series Doctor Who which was then subsequently released on CD....
     dates and actual times. This means that to cause a similar calculation on an array of cells, an almost identical formula (but residing in its own "output" cell) must be repeated for each row of the "input" array. This differs from a "formula" in a conventional computer program which would typically have one calculation which would then apply to all of the input in turn. With current spreadsheets, this forced repetition of near identical formulas can have detrimental consequences from a quality assurance
    Quality Assurance

    Quality assurance, or QA for short, refers to planned and systematic production processes that provide confidence in a product's suitability for its intended purpose....
     standpoint and is often the cause of many spreadsheet errors. Some spreadsheets have array formulas to address this issue.
  • Trying to manage the sheer volume of spreadsheets which sometimes exists within an organization without proper security, audit trails, the unintentional introduction of errors and other items listed above can become overwhelming.


While there are built-in and third-party tools for desktop spreadsheet applications that address some of these shortcomings, awareness and use of these is generally low.

See also

  • List of spreadsheets
    List of spreadsheets

    The following is a list of spreadsheets....
  • List of online spreadsheets
    List of online spreadsheets

    This is a list of online spreadsheets. Online spreadsheets are online applications that allow users to create a spreadsheet, have multiple persons edit it and share it with the world....
  • Comparison of spreadsheet software
    Comparison of spreadsheet software

    Online versus DesktopWeb-based spreadsheets do not in general depend on a particular operating system or similar constraints and are therefore generally more portable by their nature than desktop applications....
  • Summation in spreadsheets
    Summation

    Summation is the addition of a set of numbers; the result is their sum or total. An interim or present total of a summation process is termed the running total....
  • Moving and copying in spreadsheets
  • Attribute-value system
    Attribute-value system

    An attribute-value system is a basic knowledge representation framework comprising a table with columns designating "attributes" and rows designating "objects" ....
  • Model Audit
    Model Audit

    A Model Audit is the colloquial term for the tasks performed when conducting due diligence on a financial model, in order to eliminate spreadsheet error....


External links


History of spreadsheets

  • by D.J. Power
  • by Ken Dakin
  • The History of Mathematical Tables: From Sumer to Spreadsheets by Martin Campbell-Kelly, Mary Croarken, Raymond Flood, Eleanor Robson (Editors). ()


General information

  • by Russell Schulz
  • by Jocelyn Paine
  • by Christopher Browne; much general information on spreadsheets, and some on related Linux
    Linux

    Linux is a generic term referring to Unix-like computer operating systems based on the Linux kernel. Their development is one of the most prominent examples of free and open source software collaboration; typically all the underlying source code can be used, freely modified, and redistributed by anyone under the terms of the GNU GPL license...
     issues
  • on the Open Directory Project
    Open Directory Project

    The Open Directory Project , also known as Dmoz , is a multilingual open content Web directory of World Wide Web links owned by Netscape that is constructed and maintained by a virtual community of volunteer editors....
  • by Richard Mattessich
  • by Bob Yelavich
  • by Creative Karma