Hi all,
I wanted to see if anyone has run into this problem before. I have a set of semi-structured PDFs (attached), and I'm hoping to extract data from the 'pesticide production information' section. The issue is that each PDF can have a varying number of pages and a varying number of sections per page (a new section starts when "42." is the first cell in the table). Is there any way to build an AI model that will extract the pesticide production information regardless of the PDF layout, or will I have to train the model on a bunch of examples of each variation? Is it possible to use Power Automate to do this? Thanks in advance!
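
P.S. To make the sectioning rule concrete, here is a rough Python sketch of what I mean. It assumes the table rows have already been pulled out of the PDF (by whatever tool) as lists of cell strings, page by page. The function name `split_sections`, the row layout, and the choice to match on a cell that begins with "42." are my own assumptions for illustration, not something taken from the attached files.

```python
from typing import List

Row = List[str]

# From the description above: a new section starts when "42." is the first cell.
SECTION_MARKER = "42."


def split_sections(pages: List[List[Row]]) -> List[List[Row]]:
    """Group rows into sections, regardless of how many pages there are.

    Pages are flattened first, so a section that runs over a page break
    stays in one piece. Rows before the first marker are ignored.
    """
    sections: List[List[Row]] = []
    current = None
    for page in pages:
        for row in page:
            first_cell = row[0].strip() if row else ""
            # Assumption: the marker cell begins with "42." (it may carry more text).
            if first_cell.startswith(SECTION_MARKER):
                current = []
                sections.append(current)
            if current is not None:
                current.append(row)
    return sections
```

With rows grouped like this, the number of pages and the number of sections per page should stop mattering; the open question is how to get the rows out of the PDFs reliably in the first place.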