Detecting bounding box of white space in forms (think PDF, WEB FORM, ETC)

brendan_murphy · February 27, 2025, 9:06pm

I have been trying to find a direction that will work for a fairly generalizable solution/model that can detect bounding boxes for white space in a document (image) think of a pdf that needs to be filled in, where the pdf form can come in many different layouts, though there are some patterns (name form fields with letter indicators, white space (entry field for form) that is bordered by a black line, white space that doesnt have a specific border or hard-codeable way to relate the white space to fields label… that is each white space that can have text added to it is labelled in some way but not consistent, but should be intuituve enough for properlly trained vision model to detect. I have tried Landing AI vision agent, and the popular LLMs (claude/chatGPT) i suspect i may need a well annotated dataset to then finetune a YOLO or other such model, but maybe a few examples and few shot it with Gemini etc. Any good advice will be appreciated.

Topic		Replies	Views
Enhancing Document Layout Analysis by Adding Positional and Character Information to CNN Inputs AI Discussions ai-discussions , introductions , project	0	19	July 12, 2024
Help with pdf data AI Discussions project	7	153	September 12, 2024
W3 - Bounding boxes & semantic segmentation: true labels generation? Convolutional Neural Networks	1	498	July 8, 2022
Information retrieval model NLP with Probabilistic Models week-1 , introductions , project	1	11	August 15, 2024
Need help in creating a model which will be trained to Identify UI elements on web applications AI Discussions computer-vision , project	1	196	February 21, 2024

Detecting bounding box of white space in forms (think PDF, WEB FORM, ETC)

Related topics