[sane-devel] scanimage / tesseract interoperability

Jeff Breidenbach jeff at jab.org
Sat May 10 03:56:07 UTC 2014


Tesseract is an open source OCR program. It can already
produce searchable PDF and will soon support streaming.
It would be fun to support something like this:

   scanimage --batch | tesseract - - pdf > searchable.pdf

To make this work nicely, scanimage would need to
print the name of each file to stdout after it is written.

Thoughts?

Jeff
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.alioth.debian.org/pipermail/sane-devel/attachments/20140509/7f20972f/attachment.html>


More information about the sane-devel mailing list