Has anybody been able to extract data from an image or PDF?
If yes, how do you go about it?
You can as well provide tutorial links. Thank you!
Has anybody been able to extract data from an image or PDF?
If yes, how do you go about it?
You can as well provide tutorial links. Thank you!
Well it depends what data you’re trying to extract.
If you just want to convert a PDF to text or extract text from an image (OCR), there are many APIs that will do this for you.
If you’re trying to extract structured data (e.g upload a specifically formatted PDF invoice and extract the data) you can convert the PDF to text then use Regex to extract the necessary data, or if you want users to be able to upload any invoice then use GPT-3.5 and function calling to extract and output a structured response that you can save to the DB.
I want to extract specific details from different documents as uploaded by the user
This topic was automatically closed after 70 days. New replies are no longer allowed.