All Topics  
Image scanner

 
Image Scanner

   Email Print
   Bookmark   Link






 

Image scanner



 
 


In computing, a scanner is a device that optically scans images, printed text, handwriting, or an object, and converts it to a digital image
Digital image

A digital image is a representation of a two-dimensional using ones and zeros . Depending on whether or not the is fixed, it may be of vector graphics or raster graphics type....
.






Discussion
Ask a question about 'Image scanner'
Start a new discussion about 'Image scanner'
Answer questions from other users
Full Discussion Forum



Recent Posts









Encyclopedia


Scanner
Scanner


In computing, a scanner is a device that optically scans images, printed text, handwriting, or an object, and converts it to a digital image
Digital image

A digital image is a representation of a two-dimensional using ones and zeros . Depending on whether or not the is fixed, it may be of vector graphics or raster graphics type....
. Common examples found in offices are variations of the desktop (or flatbed) scanner where the document is placed on a glass window for scanning. Hand-held scanners, where the device is moved by hand, have evolved from text scanning "wands" to 3D scanners
3D scanner

A 3D scanner is a device that analyzes a real-world object or environment to collect data on its shape and possibly its appearance . The collected data can then be used to construct digital, three dimensional models useful for a wide variety of applications....
 used for industrial design, reverse engineering, test and measurement, orthotics, gaming and other applications. Mechanically driven scanners that move the document are typically used for large-format documents, where a flatbed design would be impractical.

Modern scanners typically use a charge-coupled device
Charge-coupled device

A charge-coupled device is an analog signal shift register that enables the transportation of analog signals through successive stages , controlled by a clock signal....
 (CCD) or a Contact Image Sensor
Contact Image Sensor

Contact Image Sensors are a relatively recent technological innovation in the field of optical that are rapidly replacing Charge-coupled device in low power and portable applications....
 (CIS) as the image sensor, whereas older drum scanners use a photomultiplier
Photomultiplier

Photomultiplier tubes , members of the class of vacuum tubes, and more specifically phototubes, are extremely sensitive detectors of light in the ultraviolet, visible light, and near-infrared ranges of the electromagnetic spectrum....
 tube as the image sensor. A rotary scanner, used for high-speed document scanning, is another type of drum scanner, using a CCD array instead of a photomultiplier. Other types of scanners are planetary scanner
Planetary scanner

A planetary scanner is a type of for making scans of rare books and other easily damaged documents. In essence, such a scanner is a mounted camera taking photograph of a well-lit environment....
s, which take photographs of books and documents, and 3D scanners, for producing three-dimensional models of objects.

Another category of scanner is digital camera
Digital camera

A digital camera is a camera that takes video or still photographs, or both, digitally by recording digital image via an electronics .Many compact digital still cameras can record sound and moving video as well as still photographs....
 scanners, which are based on the concept of reprographic
Reprography

Reprography is the reproduction of graphics through mechanical or electrical means, such as photography or xerography. Reprography is commonly used in catalogs and archives, as well as in the architectural reprography....
 cameras. Due to increasing resolution and new features such as anti-shake, digital cameras have become an attractive alternative to regular scanners. While still having disadvantages compared to traditional scanners (such as distortion, reflections, shadows, low contrast), digital cameras offer advantages such as speed, portability, gentle digitizing of thick documents without damaging the book spine. New scanning technologies are combining 3D scanners with digital cameras to create full-color, photo-realistic 3D models of objects.

Historical precedent

Scanners can be considered the successors of early telephotography input devices, consisting of a rotating drum with a single photodetector at a standard speed of 60 or 120 rpm (later models up to 240 rpm). They send a linear analog signal|analog Amplitude Modulation|AM signal through standard telephone voice lines to receptors, which synchronously print the proportional intensity on special paper. This system was in use in press from the 1920s to the mid-1990s. Color photos were sent as three separated RGB
RGB color model

The RGB color model is an additive color in which red, green, and blue light are added together in various ways to reproduce a broad array of colors....
 filtered images consecutively, but this was used only for special events due to transmission costs.

Types


Drum

Drum scanners capture image information with photomultiplier tubes
Photomultiplier

Photomultiplier tubes , members of the class of vacuum tubes, and more specifically phototubes, are extremely sensitive detectors of light in the ultraviolet, visible light, and near-infrared ranges of the electromagnetic spectrum....
 (PMT), rather than the charge-coupled device
Charge-coupled device

A charge-coupled device is an analog signal shift register that enables the transportation of analog signals through successive stages , controlled by a clock signal....
 (CCD) arrays found in flatbed scanners and inexpensive film scanners. Reflective and transmissive originals are mounted on an acrylic cylinder, the scanner drum, which rotates at high speed while it passes the object being scanned in front of precision optics that deliver image information to the PMTs. Most modern color drum scanners use 3 matched PMTs, which read red, blue, and green light respectively. Light from the original artwork is split into separate red, blue, and green beams in the optical bench of the scanner.

The drum scanner gets its name from the clear acrylic cylinder, the drum, on which the original artwork is mounted for scanning. Depending on size it is possible to mount originals up to 11"x17", but maximum size varies by manufacturer. One of the unique features of drum scanners is the ability to control sample area and aperture size independently. The sample size is the area that the scanner encoder reads to create an individual pixel. The aperture is the actual opening that allows light into the optical bench of the scanner. The ability to control aperture and sample size separately is particularly useful for smoothing film grain when scanning black-and white and color negative originals.

While drum scanners are capable of scanning both reflective and transmissive artwork, a good-quality flatbed scanner can produce excellent scans from reflective artwork. As a result, drum scanners are rarely used to scan prints now that high quality inexpensive flatbed scanners are readily available. Film, however, is where drum scanners continue to be the tool of choice for high-end applications. Because film can be wet-mounted to the scanner drum and because of the exceptional sensitivity of the PMTs, drum scanners are capable of capturing very subtle details in film originals.

Only a few companies continue to manufacture drum scanners. While prices of both new and used units have come down over the last decade, they still require a considerable monetary investment when compared to CCD flatbed and film scanners. However, drum scanners remain in demand due to their capacity to produce scans that are superior in resolution, color gradation, and value structure. Also, since drum scanners are capable of resolutions up to 12,000 PPI
Pixels per inch

Pixels per inch or pixel density is a measurement of the resolution of devices in various contexts; typically computer displays, s or digital photography s....
, their use is generally recommended when a scanned image is going to be enlarged.

In most graphic-arts operations, very-high-quality flatbed scanners have replaced drum scanners, being both less expensive and faster. However, drum scanners continue to be used in high-end applications, such as museum-quality archiving of photographs and print production of high-quality books and magazine advertisements. In addition, due to the greater availability of pre-owned units many fine-art photographers are acquiring drum scanners, which has created a new niche market for the machines.

The first image scanner ever developed was a drum scanner. It was built in 1957 at the US National Bureau of Standards by a team led by Russell Kirsch. The first image ever scanned on this machine was a 5 cm square photograph of Kirsch's then-three-month-old son, Walden. The black and white image had a resolution of 176 pixel
Pixel

In digital imaging, a pixel is the smallest item of information in an image. Pixels are normally arranged in a 2-dimensional grid, and are often represented using dots, squares, or rectangles....
s on a side.

Flatbed

A flatbed scanner is usually composed of a glass pane (or platen
Platen

A platen is typically a flat metal plate pressed against a medium to cause an impression in letterpress printing. Platen may also refer to a typewriter roller which friction-feeds paper into position below the typebars or print head....
), under which there is a bright light (often xenon
Xenon

Xenon is a chemical element represented by the chemical symbol Xe. Its atomic number is 54. A colorless, heavy, odorless noble gas, xenon occurs in the Earth's atmosphere in trace amounts....
 or cold cathode fluorescent) which illuminates the pane, and a moving optical array in CCD
Charge-coupled device

A charge-coupled device is an analog signal shift register that enables the transportation of analog signals through successive stages , controlled by a clock signal....
 scanning. CCD type scanners typically contain three rows (arrays) of sensors with red, green, and blue filters. CIS
Contact Image Sensor

Contact Image Sensors are a relatively recent technological innovation in the field of optical that are rapidly replacing Charge-coupled device in low power and portable applications....
 scanning consists of a moving set of red, green and blue LEDs strobed for illumination and a connected monochromatic photodiode
Photodiode

A photodiode is a type of photodetector capable of converting light into either electric current or voltage, depending upon the mode of operation....
 array for light collection. Images to be scanned are placed face down on the glass, an opaque cover is lowered over it to exclude ambient light, and the sensor array and light source move across the pane, reading the entire area. An image is therefore visible to the detector only because of the light it reflects. Transparent images do not work in this way, and require special accessories that illuminate them from the upper side. Many scanners offer this as an option.

Film

"Slide" (positive) or negative
Negative (photography)

In photography, a negative may refer to three different things, although they are all related....
 film can be scanned in equipment specially manufactured for this purpose. Usually, uncut film strips of up to six frames, or four mounted slides, are inserted in a carrier, which is moved by a stepper motor
Stepper motor

A stepper motor is a Brushless DC electric motor, synchronous electric motor that can divide a full rotation into a large number of steps. The motor's position can be controlled precisely, without any feedback mechanism ....
 across a lens and CCD sensor inside the scanner. Some models mainly used for same-size scans.

Hand

Hand scanners come in two forms: document and 3D scanners. Hand held document scanners are manual devices that are dragged across the surface of the image to be scanned. Scanning documents in this manner requires a steady hand, as an uneven scanning rate would produce distorted images - a little light on the scanner would indicate if the motion was too fast. They typically have a "start" button, which is held by the user for the duration of the scan; some switches to set the optical resolution
Optical resolution

Optical resolution describes the ability of an imaging system to resolve detail in the object that is being imaged.An imaging system may have many individual components including a lens and recording and display components....
; and a roller, which generates a clock pulse for synchronization with the computer. Most hand scanners were monochrome
Monochrome

Monochrome comes from the Greek language ?????????? , meaning ?of one color?, which is a combination of ????? , meaning ?alone? or ?solitary?, and ????a , meaning ?color?....
, and produced light from an array of green LEDs to illuminate the image. A typical hand scanner also had a small window through which the document being scanned could be viewed. They were popular during the early 1990s and usually had a proprietary interface module specific to a particular type of computer, usually an Atari ST
Atari ST

The Atari ST is a home computer/personal computer that was commercially available from 1985 to the early 1990s. It was released by Atari Corporation in 1985....
 or Commodore Amiga
Amiga

The Amiga is a family of personal computers originally developed by Amiga Corporation. Development on the Amiga began in 1982 with Jay Miner as the principal hardware designer....
.

While popularity for document scanning has waned, use of hand held 3D scanners remains popular for many applications, including industrial design, reverse engineering, inspection & analysis, digital manufacturing and medical applications. To compensate for the uneven motion of the human hand, most 3D scanning systems rely on the placement of reference markers – typically adhesive reflective tabs that the scanner uses to align elements and mark positions in space.

Quality

Scanners typically read red-green-blue color
RGB color model

The RGB color model is an additive color in which red, green, and blue light are added together in various ways to reproduce a broad array of colors....
 (RGB) data from the array. This data is then processed with some proprietary algorithm to correct for different exposure conditions, and sent to the computer via the device's input/output
Input/output

In computing, input/output, or I/O, refers to the communication between an information processing system , and the outside world ? possibly a human, or another information processing system....
 interface (usually SCSI
SCSI

Small Computer System Interface, or SCSI , is a set of standards for physically connecting and transferring data between computers and peripheral devices....
 or bidirectional parallel port
Parallel port

A parallel port is a type of interface found on computers for connecting various peripherals. It is also known as a printer port or Centronics#The interface....
 in machines pre-dating the USB standard). Color depth
Color depth

Color depth or bit depth, is a computer graphics term describing the number of bits used to represent the color of a single pixel in a Raster graphicsped image or video frame buffer....
 varies depending on the scanning array characteristics, but is usually at least 24 bits. High quality models have 48 bits or more color depth. The other qualifying parameter for a scanner is its resolution
Image resolution

Image resolution describes the detail an holds. The term applies equally to digital images, film images, and other types of images. Higher resolution means more image detail....
, measured in pixels per inch
Pixels per inch

Pixels per inch or pixel density is a measurement of the resolution of devices in various contexts; typically computer displays, s or digital photography s....
 (ppi), sometimes more accurately referred to as Samples per inch
Samples per inch

Samples per inch is a measurement of the resolution of an , in particular the number of individual sample that are taken in the space of one linear inch....
 (spi). Instead of using the scanner's true optical resolution, the only meaningful parameter, manufacturers like to refer to the interpolated resolution, which is much higher thanks to software interpolation
Interpolation

In the mathematics subfield of numerical analysis, interpolation is a method of constructing new data points within the range of a discrete set of known data points....
. , a good flatbed scanner has an optical resolution of 1600–3200 ppi, high-end flatbed scanners can scan up to 5400 ppi, and a good drum scanner has an optical resolution of 8000–14,000 ppi.

Manufacturers often claim interpolated resolutions as high as 19,200 ppi; but such numbers carry little meaningful value, because the number of possible interpolated pixels
Digital zoom

Digital zoom is a method of decreasing the apparent angle of view of a digital photographic or video image. Digital zoom is accomplished by Cropping an image down to a centered area with the same aspect ratio as the original, and usually also interpolation the result back up to the pixel dimensions of the original....
 is unlimited.

The size of the file created increases with the square of the resolution; doubling the resolution quadruples the file size. A resolution must be chosen that is within the capabilities of the equipment, preserves sufficient detail, and does not produce a file of excessive size. The file size can be reduced for a given resolution by using "lossy" compression methods such as JPEG
JPEG

In computing, JPEG is a commonly used method of for photographic images. The degree of compression can be adjusted, allowing a selectable tradeoff between storage size and image quality....
, at some cost in quality. If the best possible quality is required lossless compression should be used; reduced-quality files of smaller size can be produced from such an image when required (e.g., image designed to be printed on a full page, and a much smaller file to be displayed as part of a fast-loading web page).

The third important parameter for a scanner is its density range. A high density range means that the scanner is able to reproduce shadow details and brightness details in one scan.

By combining full-color imagery with 3D models, modern hand-held scanners are able to completely reproduce objects electronically. The addition of 3D color printers enables accurate miniaturization of these objects, with applications across many industries and professions.

Computer connection

Scanning the document is only one part of the process. For the scanned image to be useful, it must be transferred from the scanner to an application running on the computer. There are two basic issues: (1) how the scanner is physically connected to the computer and (2) how the application retrieves the information from the scanner.

Physical connection to a computer


The amount of data generated by a scanner can be very large: a 600 DPI 9"x11" (slightly larger than A4 paper) uncompressed 24-bit image is about 100 megabyte
Megabyte

Megabyte is a SI prefix-multiple of the unit byte for digital information computer storage or transmission and is equal to 106 bytes....
s of data which must be transferred and stored. Recent scanners can generate this volume of data in a matter of seconds, making a fast connection desirable.

Four standard connections are used by scanners:

  • Parallel
    Parallel

    From Greek language: pa???????? Parallel may refer to:...
     - Connecting through a parallel port
    Parallel port

    A parallel port is a type of interface found on computers for connecting various peripherals. It is also known as a printer port or Centronics#The interface....
     is the slowest common transfer method. Early scanners had parallel port
    Parallel port

    A parallel port is a type of interface found on computers for connecting various peripherals. It is also known as a printer port or Centronics#The interface....
     connections that could not transfer data faster than 70 kilobyte
    Kilobyte

    Kilobyte is a unit of Computer data storage equal to either 1,024 bytes or 1,000 bytes , depending on context.It is abbreviated in a number of ways: KB, kB, K and Kbyte....
    s/second
    Second

    The second , sometimes abbreviated sec., is the name of a units of measurement of time, and is the International System of Units SI base unit of time....
    . The primary advantage of the parallel port connection was economic: it avoided adding an interface card to the computer.


  • Small Computer System Interface (SCSI)
    SCSI

    Small Computer System Interface, or SCSI , is a set of standards for physically connecting and transferring data between computers and peripheral devices....
    , which is supported by most computers only via an additional SCSI interface card. Some SCSI scanners are supplied together with a dedicated SCSI card for a PC, although any SCSI controller can be used. During the evolution of the SCSI standard speeds increased, with backwards compatibility; a SCSI connection can transfer data at the highest speed which both the controller and the device support. SCSI has been largely replaced by USB and Firewire, one or both of which are directly supported by most computers, and which are easier to set up than SCSI.


  • Universal Serial Bus (USB) scanners can transfer data quickly, and they are easier to use and cheaper than SCSI devices. The early USB 1.1 standard could transfer data at only 1.5 megabytes per second (slower than SCSI), but the later USB 2.0 standard can theoretically transfer up to 60 megabytes per second (although everyday rates are much lower), resulting in faster operation.


  • FireWire
    FireWire

    The IEEE 1394 interface is a serial communications interface standard for high-speed communications and isochronous real-time data transfer, frequently used by personal computers, as well as in digital audio, digital video, automotive, and aeronautics applications....
     is an interface that is much faster than USB 1.1 and comparable to USB 2.0. FireWire speeds are 25, 50, and 100, 400 and 800 megabits per second (but a device may not support all speeds).


  • Some early scanners used a proprietary interface card rather than a standard interface.


Applications Programming Interface


An application such as Adobe Photoshop must communicate with the scanner. There are many different scanners, and many of those scanners use different protocols. In order to simplify applications programming, some Applications Programming Interfaces ("API") were developed. The API presents a uniform interface to the scanner. This means that the application does not need to know the specific details of the scanner in order to access it directly. For example, Adobe Photoshop supports the TWAIN
TWAIN

TWAIN is a standard software protocol and applications programming interface that regulates communication between software applications and imaging devices such as and digital cameras....
 standard; consequently, (in an ideal world) Photoshop can acquire an image from any scanner that also supports TWAIN.

In practice, there are often problems with an application communicating with a scanner. Either the application or the scanner manufacturer (or both) may have faults in their implementation of the API.

Typically, the API is implemented as a dynamically linked library. Each scanner manufacturer provides software that translates the API procedure calls into primitive commands that are issued to a hardware controller (such as the SCSI, USB, or FireWire controller). The manufacturer's part of the API is commonly called a device driver
Device driver

In computing, a device driver or software driver is a computer program allowing higher-level computer programs to interact with a hardware device....
, but that designation is not strictly accurate: the API does not run in kernel mode and does not directly access the device.

Some scanner manufacturers will offer more than one API.

Most scanners use the TWAIN
TWAIN

TWAIN is a standard software protocol and applications programming interface that regulates communication between software applications and imaging devices such as and digital cameras....
 API. The TWAIN API, originally used for low-end and home-use equipment, is now widely used for large-volume scanning.

Other scanner API's are

ISIS
ISIS

ISIS is an industry standard interface for technologies, developed by Pixel Translations in 1990 .ISIS is an open standard for scanner control and a complete image-processing framework....
, created by Pixel Translations, which still uses SCSI-II for performance reasons, is used by large, departmental-scale, machines.

SANE
Scanner Access Now Easy

Scanner Access Now Easy is an application programming interface that provides standardized access to any raster hardware . The SANE API is public domain and its discussion and development is open to everybody....
 (Scanner Access Now Easy) is a free
Free software

Free Software or software libre is software that can be used, studied, and modified without restriction, and which can be copied and redistributed in modified or unmodified form either without restriction, or with minimal restrictions only to ensure that further recipients can also do these things and to prevent consumer-facing hardware...
/open source
Open source

Open source is an approach to design, development, and distribution offering practical accessibility to a product's source . Some consider open source as one of various possible design approaches, while others consider it a critical Strategy element of their business operations....
 API for accessing scanners. Originally developed for Unix
Unix

Unix is a computer operating system originally developed in 1969 by a group of American Telephone & Telegraph employees at Bell Labs, including Ken Thompson , Dennis Ritchie, Douglas McIlroy, and Joe Ossanna....
 and Linux
Linux

Linux is a generic term referring to Unix-like computer operating systems based on the Linux kernel. Their development is one of the most prominent examples of free and open source software collaboration; typically all the underlying source code can be used, freely modified, and redistributed by anyone under the terms of the GNU GPL license...
 operating systems, it has been ported to OS/2
OS/2

OS/2 is a computer operating system, initially created by Microsoft and IBM, then later developed by IBM exclusively. The name stands for "Operating System/2," because it was introduced as part of the same generation change release as IBM's "IBM Personal System/2 " line of second-generation personal computers....
, Mac OS X
Mac OS X

Mac OS X is a line of computer operating systems developed, marketed, and sold by Apple Inc., and since 2002 has been included with all new Macintosh computer systems....
, and Microsoft Windows
Microsoft Windows

Microsoft Windows is a series of software operating systems and graphical user interfaces produced by Microsoft. Microsoft first introduced an operating environment named Windows in November 1985 as an add-on to MS-DOS in response to the growing interest in graphical user interfaces ....
. Unlike TWAIN, SANE does not handle the user interface. This allows batch scans and transparent network access without any special support from the device driver.

Windows Image Acquisition
Windows Image Acquisition

Windows Image Acquisition is a Microsoft driver model and application programming interface for Microsoft Windows 2000 and later operating systems that enables graphics software to communicate with imaging hardware such as scanners, digital cameras and Digital Video-equipment....
 ("WIA") is an API provided by Microsoft.

Bundled applications


Although no software beyond a scanning utility is a feature of any scanner, many scanners come bundled with software. Typically, in addition to the scanning utility, some type of image-editing application (such as Photoshop), and optical character recognition
Optical character recognition

Optical character recognition, usually abbreviated to OCR, is the mechanical or Electronics translation of s of handwritten, typewritten or printed text into machine-editable text....
 (OCR) software are supplied. OCR software converts graphical images of text into standard text that can be edited using common word-processing and text-editing software; accuracy is rarely perfect.

Output data

The scanned result is a non-compressed RGB image, which can be transferred to a computer's memory. Some scanners compress and clean up the image using embedded firmware
Firmware

Firmware is a term sometimes used to denote the fixed, usually rather small, programs that internally control various electronic devices. Typical examples range from end user products such as remote controls or calculators, via computer parts and devices like harddisks, keyboard s, TFT screens or memory cards, all the way to scientific instr...
. Once on the computer, the image can be processed with a raster graphics
Raster graphics

In computer graphics, a raster graphics image or bitmap, is a data structure representing a generally Rectangle grid of pixels, or points of color, viewable via a Computer display, paper, or other display medium....
 program (such as Photoshop or the GIMP
GIMP

The GIMP is a free software, raster graphics editor used to process digital graphics and photographs. Typical uses include creating graphics and logos, resizing and cropping photos, altering colors, combining multiple images, removing unwanted image components, and converting between different image formats....
) and saved on a storage device (such as a hard disk
Hard disk

A hard disk drive , commonly referred to as a hard drive, hard disk, or fixed disk drive, is a non-volatile storage device which stores digitally encoded data on rapidly rotating hard disk platters with magnetic surfaces....
).

Images are usually stored on a hard disk
Hard disk

A hard disk drive , commonly referred to as a hard drive, hard disk, or fixed disk drive, is a non-volatile storage device which stores digitally encoded data on rapidly rotating hard disk platters with magnetic surfaces....
. Pictures are normally stored in image formats such as uncompressed Bitmap
Bitmap

In computer graphics, a bitmap or pixmap is a type of computer storage organization or used to store digital images. The term bitmap comes from the computer programming terminology, meaning just a map of bits, a spatially mapped bit array....
, "non-lossy" (lossless) compressed TIFF and PNG, and "lossy" compressed JPEG
JPEG

In computing, JPEG is a commonly used method of for photographic images. The degree of compression can be adjusted, allowing a selectable tradeoff between storage size and image quality....
. Documents are best stored in TIFF or PDF format; JPEG is particularly unsuitable for text. Optical character recognition
Optical character recognition

Optical character recognition, usually abbreviated to OCR, is the mechanical or Electronics translation of s of handwritten, typewritten or printed text into machine-editable text....
 (OCR) software allows a scanned image of text to be converted into editable text with reasonable accuracy, so long as the text is cleanly printed and in a typeface and size that can be read by the software. OCR capability may be integrated into the scanning software, or the scanned image file can be processed with a separate OCR program.

Document processing

The scanning or digitization of paper documents for storage makes different requirements of the scanning equipment used than scanning of pictures for reproduction. While documents can be scanned on general-purpose scanners, it is more efficiently performed on dedicated document scanners manufactured by Atiz Innovation, Böwe Bell & Howell
Böwe Bell & Howell

B?we Bell & Howell is a United States-based former manufacturer of motion picture machinery.According to its charter, Bell & Howell Company was incorporated February 17, 1907....
, Canon, Epson, Fujitsu
Fujitsu

is a Japanese company specializing in semiconductors, air conditioners, computers , telecommunications, and Service , and is headquartered in Minato, Tokyo, Tokyo....
, HP
Hewlett-Packard

The Hewlett-Packard Company , commonly referred to as HP, is a technology corporation headquartered in Palo Alto, California, United States....
, Kodak and other companies.

When scanning large quantities of documents, speed and paper-handling is very important, but the resolution of the scan will normally be much lower than for good reproduction of pictures.

Document scanners have document feeders, usually larger than those sometimes found on copiers or all-purpose scanners. Scans are made at high speed, perhaps 20 to 150 pages per minute, often in grayscale, although many scanners support color. Many scanners can scan both sides of double-sided originals (duplex operation). Sophisticated document scanners have firmware
Firmware

Firmware is a term sometimes used to denote the fixed, usually rather small, programs that internally control various electronic devices. Typical examples range from end user products such as remote controls or calculators, via computer parts and devices like harddisks, keyboard s, TFT screens or memory cards, all the way to scientific instr...
 of software that cleans up scans of text as they are produced, eliminating accidental marks and sharpening type; this would be unacceptable for photographic work, where marks cannot reliably be distinguished from desired fine detail. Files created are compressed as they are made.

The resolution used is usually from 150 to 300 dpi
Dots per inch

Dots per inch is a measure of spatial printing or video dot density, in particular the number of individual dots that can be placed within the span of one linear inch The DPI value tends to correlate with , but is related only indirectly....
, although the hardware may be capable of somewhat higher resolution; this produces images of text good enough to read and for optical character recognition
Optical character recognition

Optical character recognition, usually abbreviated to OCR, is the mechanical or Electronics translation of s of handwritten, typewritten or printed text into machine-editable text....
 (OCR), without the higher demands on storage space required by higher-resolution images.

Document scans are often processed using OCR
Optical character recognition

Optical character recognition, usually abbreviated to OCR, is the mechanical or Electronics translation of s of handwritten, typewritten or printed text into machine-editable text....
 technology to create editable and searchable files. Most scanners use ISIS
ISIS

ISIS is an industry standard interface for technologies, developed by Pixel Translations in 1990 .ISIS is an open standard for scanner control and a complete image-processing framework....
 or Twain
TWAIN

TWAIN is a standard software protocol and applications programming interface that regulates communication between software applications and imaging devices such as and digital cameras....
 device drivers to scan documents into TIFF format so that the scanned pages can be fed into a document management system
Document management system

A document management system is a computer system used to track and store electronic documents and/or digital images of paper documents. The term has some overlap with the concepts of content management systems and is often viewed as a component of enterprise content management systems and related to digital asset management, document ima...
 that will handle the archiving and retrieval of the scanned pages. Lossy JPEG compression, which is very efficient for pictures, is undesirable for text documents, as slanted straight edges take on a jagged appearance, and solid black (or other color) text on a light background compresses well with lossless compression formats.

While paper feeding and scanning can be done automatically and quickly, preparation and indexing are necessary and require much work by humans. Preparation involves manually inspecting the papers to be scanned and making sure that they are in order, unfolded, without staples or anything else that might jam the scanner. Additionally, some industries such as legal and medical may require documents to have Bates Numbering
Bates numbering

Bates numbering is used in the legal, medical, and business fields to place identifying numbers and/or date/time-marks on images and documents as they are scanned or processed ....
 or some other mark giving a document identification number and date/time of the document scan.

Indexing involves associating keywords to files so that they can be retrieved by content. This process can sometimes be automated to some extent, but is likely to involve manual labour. One common practice is the use of barcode
Barcode

A bar code is an optical machine-readable representation of data. Originally, bar codes represented data in the widths and the spacings of parallel lines and may be referred to as linear or 1D barcodes or symbologies....
-recognition technology: during preparation, barcode sheets with folder names are inserted into the document files, folders, and document groups. Using automatic batch scanning, the documents are saved into the appropriate folders, and an index is created for integration into document-management software systems.

A specialized form of document scanning is book scanning
Book scanning

Book scanning is the process of converting physical books into digital images or e-book via image scanning. This is a much less time-intensive method than re-typing all of the text; before scanning became feasible, re-typing was generally the only option....
. Technical difficulties arise from the books usually being bound and sometimes fragile and irreplaceable, but some manufacturers have developed specialized machinery to deal with this. For instance, uses a V-shaped cradle and a V-shaped transparent platen to handle brittle books. Often special robotic
Robotics

Robotics is the science and technology of robots, and their design, manufacture, and application. Robotics has connections to electronics, mechanics, and software....
 mechanisms are used to automate the page turning and scanning process.

Infrared cleaning

Infrared cleaning is a technique used to remove dust and scratches from film, and most modern scanners incorporate this feature. It works by scanning the film with infrared light. From this, it is possible to detect dust and scratches that cut off the infrared light; and they can then be automatically removed, by considering their position, size, shape, and surroundings.

Scanner manufacturers usually have their own name attached to this technique. For example, Epson, Nikon
Nikon

, also known as Nikon or Nikon Corp., is a multinational corporation headquartered in Tokyo, Japan specializing in optics and imaging....
, Microtek
Microtek

Microtek International Inc. is a Taiwan-based manufacturer of digital imaging products and other consumer electronics. Among its current product line are Microtek branded Plasma display and Liquid crystal display television, LCD flat panel monitors, LCD and DLP projectors, and digital cameras as well as their flagship line of affordable that...
, and others use Digital ICE
Digital ICE

Digital Image Correction and Enhancement is a set of technologies related to producing an altered image in a variety of frequency spectra. The objective of these technologies is to render an image more usable by Fourier or other filtering techniques....
, while Canon
Canon Inc.

is a Japanese multinational corporation that specialises in the manufacture of imaging and optical products, including cameras, Photocopying and computer printers....
 uses its own system FARE (Film Automatic Retouching and Enhancement system). Some independent software developers are designing their own infrared cleaning tools.

Scanner Music

Flatbed scanners are capable of synthesising simple musical scores, due to the variable speed (and tone) of their stepper motor
Stepper motor

A stepper motor is a Brushless DC electric motor, synchronous electric motor that can divide a full rotation into a large number of steps. The motor's position can be controlled precisely, without any feedback mechanism ....
s. This property can be applied for hardware diagnostics: for example the HP Scanjet 5 plays Ode to Joy
Ode to Joy

"To Joy" is an ode written in 1785 in literature by the German poet, playwright and historian Friedrich Schiller. The poem celebrates the ideal of unity and brotherhood of all mankind....
 if powered on with the Scan button held down and the SCSI ID set to zero. Windows- and Linux-based software is available for several brands and types of flatbed scanners to play MIDI files for fun purposes.

Scanner art

Scanner art is art
Art

Art is the process or product of deliberately arranging elements in a way that appeals to the senses or emotions. It encompasses a diverse range of human activities, creations, and modes of expression, including music and literature....
 made by placing objects on a flatbed scanner and scanning them. There has been some debate as to whether scanner art is a form of digital photography
Digital photography

Digital photography is a form of photography that utilizes digital technology to make s of subjects. Until the advent of such technology, photography used photographic film to create images which could be made visible by photographic processing....
. As the scanner has very little depth of field
Depth of field

In optics, particularly as it relates to film and photography, the depth of field is the portion of a scene that appears sharp in the image. Although a lens can precisely focus at only one distance, the decrease in sharpness is gradual on either side of the focused distance, so that within the DOF, the unsharpness is imperceptible under nor...
 and a constant light all over the surface, it is different from photography.

See also

  • Barcode reader
    Barcode reader

    A barcode reader is an electronic device for reading printed barcodes. Like a flatbed scanner, it consists of a light source, a lens and a light sensor translating optical impulses into electrical ones....
  • Film scanner
    Film scanner

    A film scanner is a device made for scanning photographic film directly into a computer without the use of any intermediate printmaking. They provide several benefits over using a flatbed scanner to scan in a print of any size — the photographer has direct control over cropping and aspect ratio from the original unmolested image on fi...
  • Optical character recognition
    Optical character recognition

    Optical character recognition, usually abbreviated to OCR, is the mechanical or Electronics translation of s of handwritten, typewritten or printed text into machine-editable text....


External links

  • from Digital Output by Jim Rich
  • by Bill Glickman
  • USPTO