- Name: OCRmyPDF
- Homepage: https://github.com/jbarlow83/OCRmyPDF
- Why should this be included in the repository? We already have tesseract, but it doesn't handle PDF files directly: you must first run imagemagick to convert the pages to images, and so on. OCRmyPDF automates this and adds some extra features. Please read the project README for details.
- Is it Open Source: yes
- Who and how many users do you anticipate will use this software? I don't know how many users, but students and collectors of old books (which have no digital copies) will certainly benefit from it.
- Link to source tarball/zip file: https://github.com/jbarlow83/OCRmyPDF/archive/v7.0.5.tar.gz