gscan2pdf
gscan2pdf can scan, clean the scan and do OCR on the scan or imported images (incl. existing PDFs, DjVus or other file types), and make PDF and DjVu-files with embedded OCR-text. It works together with tesseract, ocropus, cuneiform an gocr-OCR-engines