Hello everybody!
I need help in extracting data from semi-structured documents.
What models are there that are multimodal?
I know LayoutLMv3, but I would like someone to show me a list of models with the same tasks?
On the other hand, if there is any link or help to be able to perform this task but locally, as with RAG, with vector databases, without having to perform fine-tuning.
Sorry,
I’m not sure which category I should put my query in.
It is a generic query, that is why I have put it in General discussion
You can help? Thank you