Convert PDF to Searchable PDF

pdf_to_searchable_pdf.py

1. Convert pdf pages to image pages (of Image type) in memory using pdf2image module 2. OCR each image page using pytesseract module and tesseract OCR and create a searchable pdf file for each page 3.

GitHub

rpolana/pdf_to_searchable_pdf

A cross-platform python command-line utility that converts any PDF file containing images or unsearcheable fonts to a searcheable text PDF file using tesseract OCR (optical character recognition) and ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

pdf_to_searchable_pdf.py

rpolana/pdf_to_searchable_pdf

Trending now