Hi, We have a use case where we need extract information from documents that are mainly power point files converted to PDFs. Text around images and embedded within images. Is ADE the right tool set for this use case ?
1 Like
Thank you @jmadduri1 for your question! We reached out to the team at LandingAI for their guidance on PPT use-cases.
βyes, it would be particularly useful to use ADE in this case because the layout detection powered by DPT (Document Pre-trained Tranformer) is able to tell the figure and text apart.
Attached is an example of the parsed layout of a slide deck. You can then use extract to retrieve the specific information. Best way to figure out how it works is to test your pptx converted pdf in the playground va.landing.aiβ
cc: @lesly.zerna
