What could manage character recognition from binary files? For example: Text in a .gif .jpg .png etc. Optics are more analog > digital, no?
@mollydotcom really needs to be something like a variant of the PDF OCR engine. If Adobe can do OCR from image why not a cloud OCR tool?
Replying to @absalomedia
@absalomedia PDF OCR? Then I turn to @SimonSapin - what can do this Simon?

Apr 19, 2013 · 6:57 AM UTC

Replying to @mholzschlag
@mollydotcom @absalomedia Just like HTML, PDF can contain text, vector images and pixel images. And images could contain text, e.g. scans.
Replying to @mholzschlag
@mollydotcom @absalomedia OCR on an image extracted from PDF is the same as OCR on any image. That is: doable, but not always reliable.
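To make the "doable, but not always reliable" point concrete, here is a minimal toy sketch in Python. It is not how any real OCR engine (Adobe's or otherwise) works: the 3x5 glyph templates, the fixed grid layout, and the names `TEMPLATES`, `render`, and `recognize` are all assumptions made up for illustration. It only shows the general idea that OCR takes a pixel grid, wherever that grid came from, and guesses the closest-looking character.

```python
# Toy 3x5 glyph templates (hypothetical, for illustration only).
TEMPLATES = {
    "O": ["###", "#.#", "#.#", "#.#", "###"],
    "C": ["###", "#..", "#..", "#..", "###"],
    "R": ["##.", "#.#", "##.", "#.#", "#.#"],
}
GLYPH_W, GLYPH_H = 3, 5


def hamming(a, b):
    """Count differing pixels between two equally sized glyphs."""
    return sum(pa != pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def render(text):
    """Draw text with the toy templates, i.e. a perfectly clean 'scan'."""
    return [".".join(TEMPLATES[ch][y] for ch in text) for y in range(GLYPH_H)]


def recognize(bitmap):
    """Return (text, mismatched_pixels) for a one-line bitmap.

    Assumes glyphs sit on a fixed grid: GLYPH_W columns each,
    separated by exactly one blank column.
    """
    text, errors = "", 0
    for x in range(0, len(bitmap[0]), GLYPH_W + 1):
        cell = [row[x:x + GLYPH_W] for row in bitmap]
        # Pick the template with the fewest differing pixels.
        best = min(TEMPLATES, key=lambda ch: hamming(cell, TEMPLATES[ch]))
        text += best
        errors += hamming(cell, TEMPLATES[best])
    return text, errors


clean = render("OCR")
print(recognize(clean))      # ('OCR', 0)

# Smudge two pixels on the right edge of the "C".
rows = [list(r) for r in clean]
rows[1][6] = "#"
rows[2][6] = "#"
smudged = ["".join(r) for r in rows]
print(recognize(smudged))    # ('OOR', 1): the C is now closer to an O
```

Two stray pixels are enough to turn the "C" into an "O" here, which is the unreliability in miniature. Real engines use far more robust methods, but noisy scans cause the same kind of confusion.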