Get accurate data from an image, PDF

godwinalugbin004 · July 28, 2023, 9:58am

Has anybody been able to extract data from an image or PDF?
If yes, how do you go about it?

You can as well provide tutorial links. Thank you!

georgecollier · July 28, 2023, 10:51am

Well it depends what data you’re trying to extract.

If you just want to convert a PDF to text or extract text from an image (OCR), there are many APIs that will do this for you.

If you’re trying to extract structured data (e.g upload a specifically formatted PDF invoice and extract the data) you can convert the PDF to text then use Regex to extract the necessary data, or if you want users to be able to upload any invoice then use GPT-3.5 and function calling to extract and output a structured response that you can save to the DB.

godwinalugbin004 · July 28, 2023, 12:25pm

I want to extract specific details from different documents as uploaded by the user

system · October 6, 2023, 9:58am

This topic was automatically closed after 70 days. New replies are no longer allowed.

Topic		Replies	Views
Saving PDF data as a database thing Need help	4	561	April 14, 2021
PDF data extractor Plugins	2	855	September 28, 2019
[New Plugin] - Free OCR - Extract text from your PDF's, JPG's or PNG's Implemented	28	13115	January 26, 2022
Is there a way to extract data from a standardized pdf? APIs	3	818	October 14, 2021
📝 OCR - Convert Images & PDF to text - New Plugin from Zeroqode Showcase	10	2322	January 13, 2020

Get accurate data from an image, PDF

Related topics