Workflow to run OCR on a pdf

KoljaE · December 6, 2023, 8:21pm

Hello there,
I am new to Wappler and am starting my journey at the moment and one of my project ideas is to run a postal scanner which delivers pdf’s to a folder. The pdf shall then be processed for OCR first and an AI second to create metadata. is there any framework or extension that provides OCR? Or shall I look for that externally? hints greatly appreciated

thank you in advance,
Kolja

ben · December 7, 2023, 12:55am

No, there is OCR extension available in Wappler - at this moment.

TMR · December 7, 2023, 8:39am

You would have to use a third party API for this, I think there will be a lot of them out there, here’s an example:

Wappler is great at plugging in to third party APIs. I’ve a feeling you would be able to do what you suggested above using one API call and not two (for the AI part).

HeikoK · December 7, 2023, 9:28am

Not build in, but you could use Google Vision to extract the text from a PDF or OCR (visual recognition) of a scanned document and then feed the document into any AI (OpenAI, Google PaLM…) for examination / summary / meta data.

(all this options of course come with a cost)

JonL · December 7, 2023, 4:12pm

You also have:

Which is just a web assembly wrapper for the tesseract ocr engine.

updates · December 7, 2023, 8:04pm

Tesseract is used also by chatGPT