Optical Character Recognition (OCR)
Quoting the corresponding Wikipedia article:
optical character recognition or optical character reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo) or from subtitle text superimposed on an image (for example: from a television broadcast).
Widely used as a form of data entry from printed paper data records – whether passport documents, invoices, bank statements, computerized receipts, business cards, mail, printouts of static-data, or any suitable documentation – it is a common method of digitizing printed texts so that they can be electronically edited, searched, stored more compactly, displayed on-line, and used in machine processes such as cognitive computing, machine translation, (extracted) text-to-speech, key data and text mining. OCR is a field of research in pattern recognition, artificial intelligence and computer vision.
Early versions needed to be trained with images of each character, and worked on one font at a time. Advanced systems capable of producing a high degree of recognition accuracy for most fonts are now common, and with support for a variety of digital image file format inputs.[2] Some systems are capable of reproducing formatted output that closely approximates the original page including images, columns, and other non-textual components.
OCR provides an invaluable support when it comes to making accessible documents that were not accessible before: it can be applied to convert image-PDF documents, as well as to digitize paper documents that could not be made accessible otherwise. It can be leveraged by blind and visually impaired people, as well as by institutions and content authors as a starting point to make their documents more accessible.
IMPORTANT: Please note that, as accurate as it can be, due to its nature OCR is not perfect. Artifacts, mistakes and extraneous characters can be introduced in the resulting document by the process. In addition to this, the quality of the end result is influenced by a number of factors, including the quality of the source image (e.g. resolution and brightness), its characteristics (e.g. text layout, color contrast), and the tools involved in the process.