OPTICAL CHARACTER RECOGNITION

  

Copyright © Philip M. Parker, INSEAD. Terms of Use.

OPTICAL CHARACTER RECOGNITION

Specialty Definition: OPTICAL CHARACTER RECOGNITION

DomainDefinition

Computing

Optical Character Recognition (OCR, sometimes /oh'k*/) Recognition of printed or written characters by computer. Each page of text is converted to a digital using a scanner and OCR is then applied to this image to produce a text file. This involves complex image processing algorithms and rarely achieves 100% accuracy so manual proof reading is recommended. (1999-08-26). Source: The Free On-line Dictionary of Computing.

Census

Technology that uses an optical scanner and computer software to "read" human handwriting and converts it into electronic form. (references)

Source: compiled by the editor from various references; see credits.

Top     

Specialty Definition: Optical character recognition

(From Wikipedia, the free Encyclopedia)

Optical character recognition, usually abbreviated to OCR, involves computer systems designed to translate images of typewritten text (usually captured by a scanner) into machine-editable text--to translate pictures of characters into a standard encoding scheme representing them (ASCII or Unicode). OCR began as a field of research in artificial intelligence and machine vision; though academic research in the field continues, the focus on OCR has shifted to implementation of proven techniques.

Originally, the distinction between optical character recognition (using optical techniques such as mirrors and lenses) and digital character recognition (using scanners and computer algorithms) were considered separate fields. Since very few applications survive that use true optical techniques the OCR term has now been broadened to cover digital character recognition as well.

Early systems required "training" (essentially, the provision of known samples of each character) to read a specific font. Currently, though, "intelligent" systems that can recognize most fonts with a high degree of accuracy are now common. Some systems are even capable of reproducing formatted output that closely approximates the original scanned page including images, columns and other non-textual components.

The United States Postal Service has been using OCR machines to sort mail since 1965. Mail sorting plays a small role in OCR research; OCR systems need only read the postal code on each envelope. After the postal code has been read, a bar code with the same information can be printed on the envelope. To avoid interference with the human-readable address field which can be located anywhere on the letter, special ink is used that is clearly visible under UV light. This ink looks orange in normal lighting conditions. Envelopes marked with the machine readable bar code may then be processed; machine readable codes can be decoded more quickly than human readable letters and numbers.

While the accurate recognition of Latin-script typewritten text is now considered largely a solved problem, recognition of hand printing and handwriting in general, and printed versions of some other scripts--particularly those with a very large number of characters--are still the subject of research.

See also

Source: adapted by the editor from Wikipedia, the free encyclopedia under a copyleft GNU Free Documentation License (GFDL) from the article "Optical character recognition."

Top     


Crosswords: OPTICAL CHARACTER RECOGNITION

Specialty definitions using "OPTICAL CHARACTER RECOGNITION": Key From Image, Key From PaperOCR, OCR inkSUPERVISOR, MICROFILM DUPLICATING UNIT. (references)

Source: compiled by the editor from various references; see credits.

Top     

Commercial Usage: OPTICAL CHARACTER RECOGNITION

DomainTitle

Books

  • Auerbach on optical character recognition (reference)

  • Optical Character Recognition (Wiley Series in Microwave and Optical Engineering) (reference)

  • The Froehlich/Kent Encyclopedia of Telecommunications: Microwave Communications Systems and Devices to Modern Optical Character Recognition (Vol 11) (reference)

    (more book examples)

Source: compiled by the editor from various references; see credits.

Top     

Frequency of Internet Keywords: OPTICAL CHARACTER RECOGNITION

The following statistics estimate the number of searches per day across the major English-language search engines as identified by various trade publications. Hyperlinks lead to commercial use of the expression at Amazon.com.
 
ExpressionFrequency
per Day

optical character recognition

95

optical character recognition software

14

optical character recognition ocr

4
Source: compiled by the editor from various references; see credits.

Top     

Modern Translation: OPTICAL CHARACTER RECOGNITION

Language Translations for "OPTICAL CHARACTER RECOGNITION"; alternative meanings/domain in parentheses.

Danish

  

optisk skriftlæsning, optisk skrift, OCR (optical character reader). (various references)

   

Dutch

  

optische schriftherkenning, optisch schrift lezen, optisch herkennen van tekens, machinale schriftherkenning. (various references)

   

Finnish

  

optinen luku. (various references)

   

French

  

ROC, reconnaissance optique des caractères, Reconnaissance optique de caractères, reconnaissance de caractères-machine. (various references)

   

German

  

optische Zeichenerkennung, optische Zeichenanerkennung, OCR (OCR), maschinelle Zeichenerkennung. (various references)

   

Greek 

  

αναγνώριση οπτικών χαρακτήρων, οπτική αναγνώριση των χαρακτήρων μηχανής. (various references)

   

Italian

  

riconoscimento ottico di caratteri, riconoscimento dei caratteri-macchina, lettura ottica di caratteri. (various references)

   

Pig Latin

  

opticalay aracterchay ecognitionray

   

Portuguese

  

reconhecimento visual, reconhecimento dos caracteres-máquina, reconhecimento óptico de caracteres. (various references)

   

Spanish

  

reconocimiento óptico de caracteres (Optical character recognition (OCR)). (various references)

   

Swedish

  

optisk teckenigenkänning. (various references)

Source: compiled by the editor from various translation references.

Top     



INDEX

1. Crosswords
2. Usage: Commercial
3. Expressions: Internet
4. Translations: Modern
5. Bibliography


  

Copyright © Philip M. Parker, INSEAD. Terms of Use.