All Topics  
Optical character recognition

 
Optical Character Recognition

   Email Print
   Bookmark   Link






 

Optical character recognition



 
 
Optical character recognition, usually abbreviated to OCR, is the mechanical
Mechanical

* Mechanical engineering, a branch of engineering concerned with the application of physical mechanics* HVAC , the mechanical systems of a building* Mechanical , one of several characters in Shakespeare's A Midsummer Night's Dream...
 or electronic
Electronics

Electronics refers to the flow of charge through nonmetal electrical conductor , whereas electrical refers to the flow of charge through metal electrical conductor....
 translation of image
Image

An image is an artifact, usually two-dimensional , that has a similar appearance to some subject —usually a physical object or a person....
s of handwritten, typewritten or printed text (usually captured by a scanner
Image scanner

In computing, a scanner is a device that optically scans images, printed text, handwriting, or an object, and converts it to a digital image. Common examples found in offices are variations of the desktop scanner where the document is placed on a glass window for scanning....
) into machine-editable text.

OCR is a field of research in pattern recognition
Pattern recognition

Pattern recognition is a sub-topic of machine learning. It is "the act of taking in raw data and taking an action based on the Category of the data"....
, artificial intelligence
Artificial intelligence

Artificial intelligence is the intelligence of machines and the branch of computer science which aims to create it. Major AI textbooks define the field as "the study and design of intelligent agents,"...
 and machine vision
Machine vision

Machine vision is the application of computer vision to industry and manufacturing. Whereas computer vision is mainly focused on machine-based image processing, machine vision most often requires also digital input/output devices and computer networks to control other manufacturing equipment such as robotic arms....
. Though academic research in the field continues, the focus on OCR has shifted to implementation of proven techniques. Optical character recognition (using optical techniques such as mirrors and lenses) and digital character recognition (using scanners and computer algorithms) were originally considered separate fields.






Discussion
Ask a question about 'Optical character recognition'
Start a new discussion about 'Optical character recognition'
Answer questions from other users
Full Discussion Forum



Encyclopedia


Optical character recognition, usually abbreviated to OCR, is the mechanical
Mechanical

* Mechanical engineering, a branch of engineering concerned with the application of physical mechanics* HVAC , the mechanical systems of a building* Mechanical , one of several characters in Shakespeare's A Midsummer Night's Dream...
 or electronic
Electronics

Electronics refers to the flow of charge through nonmetal electrical conductor , whereas electrical refers to the flow of charge through metal electrical conductor....
 translation of image
Image

An image is an artifact, usually two-dimensional , that has a similar appearance to some subject —usually a physical object or a person....
s of handwritten, typewritten or printed text (usually captured by a scanner
Image scanner

In computing, a scanner is a device that optically scans images, printed text, handwriting, or an object, and converts it to a digital image. Common examples found in offices are variations of the desktop scanner where the document is placed on a glass window for scanning....
) into machine-editable text.

OCR is a field of research in pattern recognition
Pattern recognition

Pattern recognition is a sub-topic of machine learning. It is "the act of taking in raw data and taking an action based on the Category of the data"....
, artificial intelligence
Artificial intelligence

Artificial intelligence is the intelligence of machines and the branch of computer science which aims to create it. Major AI textbooks define the field as "the study and design of intelligent agents,"...
 and machine vision
Machine vision

Machine vision is the application of computer vision to industry and manufacturing. Whereas computer vision is mainly focused on machine-based image processing, machine vision most often requires also digital input/output devices and computer networks to control other manufacturing equipment such as robotic arms....
. Though academic research in the field continues, the focus on OCR has shifted to implementation of proven techniques. Optical character recognition (using optical techniques such as mirrors and lenses) and digital character recognition (using scanners and computer algorithms) were originally considered separate fields. Because very few applications survive that use true optical techniques, the OCR term has now been broadened to include digital image processing
Digital image processing

Digital image processing is the use of computer algorithms to perform on digital images. As a subfield of digital signal processing, digital image processing has many advantages over analog image processing; it allows a much wider range of algorithms to be applied to the input data, and can avoid problems such as the build-up of noise and si...
 as well.

Early systems required training (the provision of known samples of each character) to read a specific font
Typeface

In typography, a typeface is a set of one or more fonts, in one or more sizes, designed with stylistic unity, each comprising a coordinated set of glyphs....
. "Intelligent" systems with a high degree of recognition accuracy for most fonts are now common. Some systems are even capable of reproducing formatted output that closely approximates the original scanned page including images, columns and other non-textual components.

History


In 1929, Gustav Tauschek obtained a patent on OCR in Germany, followed by Handel who obtained a US patent on OCR in USA in 1933 (U.S. Patent 1,915,993). In 1935 Tauschek was also granted a US patent on his method (U.S. Patent 2,026,329).

Tauschek's machine was a mechanical device that used templates. A photodetector
Photodetector

Photosensors or photodetectors are sensors of light or other electromagnetic energy. There are several varieties:*optics detectors, which are mostly quantum devices in which an individual photon produces a discrete effect....
 was placed so that when the template and the character to be recognised were lined up for an exact match and a light was directed towards them, no light would reach the photodetector.

In 1950, David H. Shepard
David H. Shepard

David Hammond Shepard was a prolific United States inventor, who invented amongst other things, the first optical character recognition device....
, a cryptanalyst at the Armed Forces Security Agency in the United States
United States

The United States of America is a Federal government constitutional republic comprising U.S. state and a federal district. The country is situated mostly in central North America, where its Contiguous United States and Washington, D.C., the Capital districts and territories, lie between the Pacific Ocean and Atlantic Oceans, Borders of the U...
, was asked by Frank Rowlett
Frank Rowlett

Frank Byron Rowlett was an American cryptologist.Rowlett was born in Rose Hill, Virginia and attended Emory & Henry College in Emory, Virginia, where he was a member of the Beta Lambda Zeta fraternity....
, who had broken the Japanese PURPLE diplomatic code, to work with Dr. Louis Tordella to recommend data automation procedures for the Agency. This included the problem of converting printed messages into machine language for computer processing. Shepard decided it must be possible to build a machine to do this, and, with the help of Harvey Cook, a friend, built "Gismo" in his attic during evenings and weekends. This was reported in the Washington Daily News on 27 April 1951 and in the New York Times on 26 December 1953 after his U.S. Patent Number 2,663,758 was issued. Shepard then founded Intelligent Machines Research Corporation
Intelligent Machines Research Corporation

Intelligent Machines Research Corporation was founded by David H. Shepard and William Lawless, Jr. in 1952 for the purpose of commercializing the work Shepard had done with the help of Harvey Cook in building "Gismo", a machine later called the "Analyzing Reader"....
 (IMR), which went on to deliver the world's first several OCR systems used in commercial operation. While both Gismo and the later IMR systems used image analysis, as opposed to character matching, and could accept some font variation, Gismo was limited to reasonably close vertical registration, whereas the following commercial IMR scanners analyzed characters anywhere in the scanned field, a practical necessity on real world documents.

The first commercial system was installed at the Readers Digest in 1955, which, many years later, was donated by Readers Digest to the Smithsonian, where it was put on display. The second system was sold to the Standard Oil
Standard Oil

Standard Oil was a predominant United States integrated petroleum producing, transporting, refining, and marketing company. Established in 1870 as an Ohio Corporation, it was the largest oil refiner in the world and operated as a major company trust and was one of the world's first and largest multinational corporations until it was broken up...
 Company of California
California

California is a U.S. state on the West Coast of the United States of the United States, along the Pacific Ocean. It is bordered by Oregon to the north, Nevada to the east, Arizona to the southeast, and to the south the Mexico state of Baja California....
 for reading credit card
Credit card

A credit card is part of a system of payments named after the small plastic card issued to users of the system. It is a card entitling its holder to buy goods and services based on the holders promise to pay for these goods and services....
 imprints for billing purposes, with many more systems sold to other oil companies. Other systems sold by IMR during the late 1950s included a bill stub reader to the Ohio Bell Telephone Company and a page scanner to the United States Air Force
United States Air Force

The United States Air Force is the aerial warfare branch of the Military of the United States and one of the uniformed services of the United States....
 for reading and transmitting by teletype typewritten messages. IBM
IBM

International Business Machines Corporation, abbreviated IBM and nicknamed "Big Blue" , is a multinational corporation computer technology and consulting corporation headquartered in Armonk, New York, New York, United States....
 and others were later licensed on Shepard's OCR patents.

In about 1965 Readers Digest and RCA collaborated to build an OCR Document reader designed to digitize the serial numbers on Reader Digest coupons returned from advertisements. The font used on the documents were printed by an RCA Drum printer using the OCR-A font
OCR-A font

In the early days of computer Optical Character Recognition, there was a need for a font thatcould be recognized by the slow computers of that day, and by...
. The reader was connected directly to an RCA 301 computer (one of the first solid state computers). This reader was followed by a specialized document reader installed at TWA where the reader processed Airline Ticket stock (a task made more difficult by the carbonized backing on the ticket stock). The readers processed document at a rate of 1500 documents per minute and checked each document rejecting those it was not able to process correctly. The product became part of the RCA product line as a reader designed to process "Turn around Documents" such as those Utility and insurance bills returned with payments.

The United States Postal Service
United States Postal Service

The United States Postal Service is an Independent agencies of the United States government responsible for providing postal service in the United States....
 has been using OCR machines to sort mail since 1965 based on technology devised primarily by the prolific inventor Jacob Rabinow
Jacob Rabinow

Jacob Rabinow was an engineer who led a truly prolific career as an inventor. He earned a total of 230 U.S. patents on a variety of mechanical, optical and electrical devices....
. The first use of OCR in Europe was by the British General Post Office or GPO. In 1965 it began planning an entire banking system, the National Giro, using OCR technology, a process that revolutionized bill payment systems in the UK. Canada Post
Canada Post

Canada Post Corporation, known more simply as Canada Post , is the Canada Crown corporations of Canada which functions as the country's primary Postal administration....
 has been using OCR systems since 1971. OCR systems read the name and address of the addressee at the first mechanized sorting center, and print a routing bar code on the envelope based on the postal code
Postal code

A postal code is a series of letters and/or numerical digits appended to a address for the purpose of sorting mail.Germany was the first country to introduce a postal code system, in 1941....
. After that the letters need only be sorted at later centers by less expensive sorters which need only read the bar code. To avoid interference with the human-readable address field which can be located anywhere on the letter, special ink is used that is clearly visible under ultraviolet light. This ink looks orange in normal lighting conditions. Envelopes marked with the machine readable bar code may then be processed.

In 1974, Ray Kurzweil started the company Kurzweil Computer Products, Inc. and led development of the first omni-font
Typeface

In typography, a typeface is a set of one or more fonts, in one or more sizes, designed with stylistic unity, each comprising a coordinated set of glyphs....
 optical character recognition system--a computer program capable of recognizing text printed in any normal font. He decided that the best application of this technology would be to create a reading machine for the blind, which would allow blind people to understand written text by having a computer read it to them out loud. However, this device required the invention of two enabling technologies--the CCD
Charge-coupled device

A charge-coupled device is an analog signal shift register that enables the transportation of analog signals through successive stages , controlled by a clock signal....
 flatbed scanner and the text-to-speech synthesizer. On January 13 1976, the finished product was unveiled during a widely reported news conference headed by Kurzweil and the leaders of the National Federation of the Blind
National Federation of the Blind

The National Federation of the Blind is an organization of blind people in the United States. It is the oldest and most likely largest national organization to be led by blind people....
. Called the Kurzweil Reading Machine, the device covered an entire tabletop, but functioned exactly as intended. On the day of the machine's unveiling, Walter Cronkite
Walter Cronkite

Walter Leland Cronkite, Jr. is a retired United States Broadcast journalism, best known as anchorman for the The CBS Evening News for 19 years ....
 used the machine to give his signature soundoff, "And that's the way it was, January 13, 1976." While listening to The Today Show, musician Stevie Wonder
Stevie Wonder

Stevie Wonder is an American singer-songwriter, multi-instrumentalist, and record producer. A prominent figure in popular music during the latter half of the 20th century, Wonder has recorded more than thirty US top ten hits, won twenty-two Grammy Awards , plus one for Grammy Lifetime Achievement Award, won an Academy Award for Best Song, an...
 heard a demonstration of the device and personally purchased the first production version of the Kurzweil Reading Machine.

In 1978 Kurzweil Computer Products began selling a commercial version of the optical character recognition computer program. LexisNexis
LexisNexis

LexisNexis is a popular searchable archive of content from newspapers, magazines, legal documents and other printed sources. LexisNexis claims to be the "world?s largest collection of public records, unpublished opinions, forms, legal, news, and business information" while offering their products to a wide range of professionals in the lega...
 was one of the first customers, and bought the program to upload paper legal and news documents onto its nascent online databases. Two years later, Kurzweil sold his company to Xerox
Xerox

Xerox Corporation is a global document management company which manufactures and sells a range of color and black-and-white Computer printer, multifunction systems, photo copiers, digital production printing presses, and related consulting services and supplies....
, which had an interest in further commercializing paper-to-computer text conversion. Kurzweil Computer Products thus became a subsidiary of Xerox known as Scansoft (now Nuance
Nuance Communications

Nuance Communications is a multinational computer software technology corporation, headquartered in Burlington, Massachusetts, USA, that provides speech and imaging applications....
).

Current state of OCR technology


The accurate recognition of Latin-script
Latin alphabet

The Latin alphabet, also called the Roman alphabet, is the most widely used alphabetic writing system in the world today. It evolved from the western variety of the Greek alphabet called the Cumae alphabet, and was initially developed by the Ancient Romes to write the Latin....
, typewritten text is now considered largely a solved problem. Typical accuracy rates exceed 99%, although certain applications demanding even higher accuracy require human review for errors. Other areas--including recognition of hand printing, cursive
Cursive

Cursive is any style of penmanship that is designed for writing down notes and letters quickly by hand. In the Arabic, Latin languages, and Cyrillic writing systems, the letters in a word are connected, making a word one single complex stroke....
 handwriting, and printed text in other scripts (especially those with a very large number of characters)--are still the subject of active research.

Note:
  • Accuracy rates can be measured in several ways, and how they are measured can greatly affect the reported accuracy rate. For example, without the use of word context (basically a dictionary of words) to correct "spelling" errors, an error rate of 1% (or 99% accuracy) measured letter-by-letter may result in an error rate of 5% or more (or 95% accuracy), if the measurement is based instead on whether each whole word was recognized with no incorrect letters.


Optical Character Recognition (OCR) is sometimes confused with on-line character recognition (see Handwriting recognition
Handwriting recognition

Handwriting recognition is the ability of a computer to receive and interpret intelligible handwritten input from sources such as paper documents, photographs, touch-screens and other devices....
). OCR is an instance of off-line character recognition, where the system recognizes the fixed static shape of the character, while on-line character recognition instead recognizes the dynamic motion during handwriting. For example, on-line recognition, such as that used for gestures in the Penpoint OS
PenPoint OS

The PenPoint OS was a product of GO Corporation and was one of the earliest operating systems written specifically for graphical tablets and personal digital assistants....
 or the Tablet PC
Tablet PC

A Tablet PC is a laptop or slate-shaped Mobile computing, equipped with a touchscreen or graphics tablet/screen hybrid to operate the computer with a stylus or digital pen, or a fingertip, instead of a Computer keyboard or Mouse ....
 can tell whether a horizontal mark was drawn right-to-left, or left-to-right. On-line character recognition is also referred to by other terms such as dynamic character recognition, real-time character recognition, and Intelligent Character Recognition
Intelligent Character Recognition

In computer science, intelligent character recognition is an advanced optical character recognition or - rather more specific - Handwriting recognition system that allows fonts and different styles of handwriting to be learned by a computer during processing to improve accuracy and recognition levels....
 or ICR.

On-line systems for recognizing hand-printed text on the fly have become well-known as commercial products in recent years (see Tablet_PC#History
Tablet PC

A Tablet PC is a laptop or slate-shaped Mobile computing, equipped with a touchscreen or graphics tablet/screen hybrid to operate the computer with a stylus or digital pen, or a fingertip, instead of a Computer keyboard or Mouse ....
). Among these are the input devices for personal digital assistant
Personal digital assistant

A personal digital assistant is a handheld computer, also known as a palmtop computer. Newer PDAs also have both color screens and audio capabilities, enabling them to be used as mobile phones, , web browsers, or portable media players....
s such as those running Palm OS
Palm OS

Palm OS is an embedded operating system operating system initially developed by U.S. Robotics Corp.-owned Palm, Inc. for personal digital assistants in 1996....
. The Apple Newton
Apple Newton

The MessagePad was the first series of personal digital assistant devices developed by Apple Inc. for the Newton . Some electronic engineering and the manufacture of Apple's MessagePad devices was done in Japan by the Sharp Corporation....
 pioneered this product. The algorithms used in these devices take advantage of the fact that the order, speed, and direction of individual lines segments at input are known. Also, the user can be retrained to use only specific letter shapes. These methods cannot be used in software that scans paper documents, so accurate recognition of hand-printed documents is still largely an open problem. Accuracy rates of 80% to 90% on neat, clean hand-printed characters can be achieved, but that accuracy rate still translates to dozens of errors per page, making the technology useful only in very limited applications.

Recognition of cursive text is an active area of research, with recognition rates even lower than that of hand-printed text. Higher rates of recognition of general cursive script will likely not be possible without the use of contextual or grammatical information. For example, recognizing entire words from a dictionary is easier than trying to parse individual characters from script. Reading the Amount line of a cheque
Cheque

A cheque or check is a negotiable instrument instructing a financial institution to pay a specific amount of a specific currency from a specified demand account held in the maker/depositor's name with that institution....
 (which is always a written-out number) is an example where using a smaller dictionary can increase recognition rates greatly. Knowledge of the grammar of the language being scanned can also help determine if a word is likely to be a verb or a noun, for example, allowing greater accuracy. The shapes of individual cursive characters themselves simply do not contain enough information to accurately (greater than 98%) recognize all handwritten cursive script.

It is necessary to understand that OCR technology is a basic technology also used in advanced scanning applications. Due to this, an advanced scanning solution can be unique and patented and not easily copied despite being based on this basic OCR technology.

For more complex recognition problems, intelligent character recognition
Intelligent Character Recognition

In computer science, intelligent character recognition is an advanced optical character recognition or - rather more specific - Handwriting recognition system that allows fonts and different styles of handwriting to be learned by a computer during processing to improve accuracy and recognition levels....
 systems are generally used, as artificial neural network
Artificial neural network

An artificial neural network , often just called a "neural network" , is a mathematical model or computational model based on biological neural networks....
s can be made indifferent to both affine
Affine

Affine may refer to:*Affine cipher, a special case of the more general substitution cipher*Affine combination, a certain kind of constrained linear combination...
 and non-linear transformations.

Music OCR


Early research into recognition of printed sheet music was performed in the mid 1970s at MIT
Massachusetts Institute of Technology

The Massachusetts Institute of Technology is a private university research university located in Cambridge, Massachusetts, Massachusetts, United States....
 and other institutions. Successive efforts were made to localize and remove musical staff lines leaving symbols to be recognized and parsed. The first proprietary music-scanning program, MIDISCAN, was released in 1991. Three proprietary products are currently available. At this time (December 2007), Neuratron's Photoscore Ultimate 5 is the only OCR software that recognizes handwritten scores, within certain parameters.

Magnetic ink character recognition

One area where accuracy and speed of computer input of character information exceeds that of humans is in the area of magnetic ink character recognition
Magnetic ink character recognition

Magnetic Ink Character Recognition, or MICR, is a character recognition technology adopted mainly by the banking industry to facilitate the processing of Cheque....
, where the error rates range around one read error for every 20,000 to 30,000 checks. In the 1950s, Bank of America was the first bank to harness OCR to automate check processing; the result was ERMA
Electronic Recording Machine, Accounting

ERMA, for Electronic Recording Machine-Accounting, was a pioneering computer development project run at SRI International under contract to Bank of America in order to automate banking bookkeeping....
.

Optical Character Recognition in Unicode


In Unicode
Unicode

Unicode is a computing industry standard allowing computers to consistently represent and manipulate Character expressed in most of the world's writing systems....
, Optical Character Recognition symbol characters are placed in the hexadecimal
Hexadecimal

In mathematics and computer science, hexadecimal is a numeral system with a radix, or base, of 16. It uses sixteen distinct symbols, most often the symbols 09 to represent values zero to nine, and A, B, C, D, E, F to represent values ten to fifteen....
 range 0x2440–0x245F, as shown below (see also Unicode Symbols
Unicode Symbols

In computing, in addition to encoding characters for the various writing systems used throughout the World, Unicode also devotes several blocks of characters to symbols that have a well-defined place in plain text....
). These characters have special meanings within the OCR systems OCR-A
OCR-A font

In the early days of computer Optical Character Recognition, there was a need for a font thatcould be recognized by the slow computers of that day, and by...
 and E-13B
Magnetic ink character recognition

Magnetic Ink Character Recognition, or MICR, is a character recognition technology adopted mainly by the banking industry to facilitate the processing of Cheque....
.

  Symbol Name  
Hex
Symbol's Picture
? OCR Hook ? OCR Chair ? OCR Fork ? OCR Inverted Fork ? OCR Belt Buckle
0x2440 0x2441 0x2442 0x2443 0x2444
U+2440
U+2441
U+2442
U+2443
U+2444
? OCR Bow Tie ? OCR Branch Bank Identification ? OCR Amount Of Check ? OCR Customer Account Number ? OCR Dash
0x2445 0x2446 0x2447 0x2448 0x2449
U+2445
U+2446
U+2447
U+2448
U+2449
? OCR Double Backslash   Not Defined   Not Defined   Not Defined   Not Defined
0x244A 0x244B 0x244C 0x244D 0x244E
U+244a
- - - -


OCR software


NameLicenseOperating systemsNotes
ExperVision
ExperVision

File:ExperVision_logo.jpgExperVision, Inc. DBA ExperExchange, Inc. is a technology company in California which was founded in 1987 in Silicon Valley....
 TypeReader & RTK
Commercial Windows,Mac OS X,Unix,Linux,OS/2 ExperVision Inc. was founded in 1987, its OCR technology and product won the highest marks in the independent testing performed by UNLV for the consecutive years that ExperVision participated.

”TypeReader® has one big advantage: speed. This corporate-level OCR application processes faster than any product of its type we've ever tested: It converted a scanned image of a 700-page book into an editable Word file in a startling 6 minutes!” “TypeReader is worth considering for enterprise-level high-volume, high-speed OCR” Gary Berline, PC Magazine, 08.12.08
ABBYY
ABBYY

ABBYY is a software house based in Moscow, Russia. The company was founded in 1989 by David Yang. ABBYY had over 600 employees, as of November 2006, including offices in Russia , the USA , Ukraine , the UK , Germany and Japan ....
 FineReader OCR
Commercial Windows, Mac OS X For working with localized interfaces, corresponding language support is required.
OmniPage
OmniPage

OmniPage is an Optical character recognition application available from Nuance Communications. Nuance Communications was acquired by ScanSoft, which also took over its name in October 2005....
 
Commercial (Nuance EULA) Windows, Mac OS Product of Nuance Communications
Nuance Communications

Nuance Communications is a multinational computer software technology corporation, headquartered in Burlington, Massachusetts, USA, that provides speech and imaging applications....
Readiris
Readiris

Readiris is optical character recognition software for Microsoft Windows and Mac OS. It is produced by Belgian company Image Recognition Integrated Systems Group S.A....
 
Commercial Windows, Mac OS Product of I.R.I.S. Group
I.R.I.S. Group

IRIS : Image recognition integrated systems is a computer software technology company that provides text recognition and document management solutions....
 of Belgium. Asian and Middle Eastern editions.
CVision Technologies PdfCompressor and Maestro Recognition Server Commercial Windows Fast, Accurate, High-Volume OCR
Top Image Systems Commercial WindowsSpecialize in Invoice Readers.
Zonal OCR
Zonal OCR

Zonal OCR is the process by which Optical Character Recognition applications "read" specifically zoned text from a scanned image. Many batch document imaging applications allow the end user to identify and draw a "zone" on a sample image to be recognized....
 
Commercial Windows Zonal OCR is the process by which Optical Character Recognition (OCR) applications "read" specifically zoned text from a scanned image. Many batch document imaging applications allow the end user to identify and draw a "zone" on a sample image to be recognized. Once the zone has been established on the sample image, this zone will be applied to each image processed so that the data can be extracted from the image file and converted to a ASCII format. Zonal OCR helps to automate data extraction from digital images. However, zonal OCR, and OCR in general, is not entirely accurate and review of the extracted data will be required.
Computhink
Computhink

Computhink, Inc. provides Electronic Content Management / Document Management solutions for secure information sharing and compliance, targeting small and medium enterprises/business ....
's ViewWise
ViewWise

ViewWise is a Content/Document Management solution offered from Computhink that provides Electronic/Enterprise Content Management for secure information sharing and compliance, targeting small and medium size organizations ....
 
Commercial Windows Document Management system
CuneiForm
CuneiForm (software)

In computer software, CuneiForm is an Optical character recognition tool. It was originally developed at Cognitive Technologies and after few years with no development released as freeware on December 12, 2007....
 
BSD variant Windows, Linux, BSD, MacOSX. Enterprise-class system, multi language, can save text formatting and recognizes complicated tables of any structure
GOCR
GOCR

GOCR is a free software optical character recognition program, initially written by J?rg Schulenburg. It can be used to convert or scan image files into text files....
 
GPL Many (open source) Early development
Microsoft Office Document Imaging
Microsoft Office Document Imaging

Microsoft Office Document Imaging is a Microsoft Office application that supports editing documents scanned by Microsoft Office Document Scanning....
 
Commercial Windows, Mac OS X  
Microsoft Office OneNote 2007 Commercial Windows  
NovoDynamics
NovoDynamics

NovoDynamics is a software development company specializing in , Pattern_recognition and Data_mining.Though NovoDynamics is perhaps best known for developing Arabic Optical_character_recognition and image enhancement applications for Middle Eastern languages, the company?s data mining and pattern recognition capabilities have also be...
 VERUS
Commercial? ? Specializes in languages of the Middle East
Ocrad
Ocrad

Ocrad is an optical character recognition program, developed as part of the GNU Project. Based on a feature extraction method, it reads images in portable pixmap formats and produces text in byte or UTF-8 formats....
 
GPL Unix-like, OS/2  
Brainware
Brainware

Brainware, Inc is a privately-held American software company that provides data capture and search solutions, improving control of data-driven business processes....
 
Commercial Windows Template-free data extraction and processing of data from documents into any backend system; sample document types include invoices, remittance statements, bills of lading and POs
HOCR
HOCR (software)

In computer software, HOCR is a free software Hebrew optical character recognition software. It is based on the libhocr Hebrew optical character recognition engine....
 
GPL Linux Hebrew OCR
InstantOCR Freeware Online A multi language online recognition system.It can process files online , send results instantly.
OCRopus
OCRopus

OCRopus is a free Software document analysis and OCR system released under the Apache License, Version 2.0 with a very modular design through the use of plugins....
 
Apache Linux Pluggable framework which can use Tesseract
ReadSoft
ReadSoft

ReadSoft is a Swedish software company, established in 1991, that develops software for Document Automation and process optimization, including document and data capture, integration with ERP and subsequent workflow application....
 
Commercial Windows Scan, capture and classify business documents such forms, invoices and POs.
Recogniform Technologies Commercial Windows Advanced form processing solution to recognize and classify any kind of form easily. Also recognizes barcodes, OMR check-boxes, ICR handwritten text, OCR-A/B and CMC7/E13B codelines. Capture data from forms, release output on DBMS or files.
Alt-N Technologies'
RelayFax Network Fax Manager
RelayFax

RelayFax is a fax server for Microsoft Windows computer systems, produced by Alt-N Technologies. It supports the sending and receiving of faxes, on any scale from low to high volumes....
 
Commercial Windows Multi-language OCR Plug-in is used to convert faxed pages into editable document formats (doc, pdf, etc...) in many different languages.
Scantron
Scantron

Scantron is a company, based in Irvine, California, USA, that manufactures and sells machine-readable papers on which students mark answers to academic test questions, the machines to analyze those answers, survey and test scoring systems, the taking of school attendance and image-based data collection Computer software and ....
 Cognition
Commercial Windows For working with localized interfaces, corresponding language support is required.
SimpleOCR
SimpleOCR

SimpleOCR is a proprietary optical character recognition application developed originally by Cyril Cambien of France under the title WOCAR. It converts black and white scans or TIFF images to editable text files or Microsoft Word documents....
 
Freeware and commercial versions Windows
OCR Terminal Freeware and commercial versions Windows, Mac OS X, Linux Web-based OCR Service.
SmartScore
SmartScore

SmartScore is a music OCR and scorewriter program, written by Musitek Corporation based in Ojai, California. SmartScore runs on Windows and Macintosh computers....
 
Commercial Windows, Mac OS For musical scores
Tesseract
Tesseract (software)

In computer software, Tesseract is a free software optical character recognition engine. It was originally developed as proprietary software at Hewlett-Packard between 1985 until 1995....
 
Apache Windows, Mac OS X, Linux, OS/2 Under development by Google
Google

Google Inc. is an United States public company, earning revenue from AdWords related to its Google search, Gmail, Google Maps, Google Apps, Orkut, and YouTube services as well as selling advertising-free versions of the Google Search Appliance....
Freeware Online Online OCR Service, based on Tesseract.
Freeware Windows scan a single image or a list of images and search words in your images


See also

  • Automatic number plate recognition
    Automatic number plate recognition

    Automatic number plate recognition is a mass surveillance method that uses optical character recognition on images to read the license plate on vehicles....
  • CAPTCHA
    CAPTCHA

    A CAPTCHA or Captcha is a type of challenge-response authentication test used in computing to ensure that the response is not generated by a computer....
  • Computational linguistics
    Computational linguistics

    Computational linguistics is an interdisciplinary field dealing with the Statistics and/or rule-based modeling of natural language from a computational perspective....
  • Computer vision
    Computer vision

    Computer vision is the science and technology of machines that see. As a scientific discipline, computer vision is concerned with the theory for building artificial systems that obtain information from images....
  • Machine learning
    Machine learning

    Machine learning is the subfield of artificial intelligence that is concerned with the design and development of algorithms that allow computers to improve their performance over time based on data, such as from sensor data or databases....
  • OCR SDK
    OCR SDK

    An OCR SDK is a Software Development Kit for developers adding Optical_character_recognition technology into software applications, OCR SDK provide methods for incorporating Optical Character Recognition technology into forms processing applications, document imaging management system, e-discovery system, or records management solution to ex...
  • Optical mark recognition
    Optical mark recognition

    Optical Mark Recognition is the process of capturing human-marked data from such as surveys and tests....
  • Raster to vector
    Raster to vector

    In computer graphics, vectorization refers to the process of using software and hardware technology/services to convert raster graphics into vector graphics....
  • Raymond Kurzweil
    Raymond Kurzweil

    Raymond Kurzweil is an inventor and futurist. He has been a pioneer in the fields of optical character recognition , speech synthesis, speech recognition technology, and electronic keyboard instruments....
  • Speech recognition
    Speech recognition

    Speech recognition converts spoken words to machine-readable input . The term "voice recognition" is sometimes incorrectly used to refer to speech recognition, when actually referring to speaker recognition, which attempts to identify the person speaking, as opposed to what is being said....
  • Book scanning
    Book scanning

    Book scanning is the process of converting physical books into digital images or e-book via image scanning. This is a much less time-intensive method than re-typing all of the text; before scanning became feasible, re-typing was generally the only option....
  • Institutional Repository
    Institutional repository

    An Institutional Repository is an online locus for collecting, preserving, and disseminating -- in digital form -- the intellectual output of an institution, particularly a research institution....
  • Digital Library
    Digital library

    A digital library is a library in which collections are stored in digital formats and accessible by computers. The digital content may be stored locally, or accessed remotely via computer networks....


External links

  • , , a comprehensive conference on all aspects of document recognition
  • Explanation of basic handwriting recognition principles and history