RAG - Parsing and Chunking the text

satyanarayana1 May 17, 2024, 11:21am 1

Hi,

I am looking at the strategy to be used to read the pdf and word documents and store it vector database and retrieve the same through LLM.

Any view on the procedure to be used that is libraries the pdf and word document has tables and images.

Topic		Replies	Views
LLMs chat with PDFs AI Discussions llm	2	365	January 21, 2024
How to extract arbitrary data and store them into a vector database, and a LLM can answer any questions based on the vector database AI Discussions ai-discussions	0	148	July 18, 2024
Text embedding and importing data from pdf files ChatGPT Prompt Engineering for Developers	1	90	May 25, 2023
Seeking Advice: Integrating LLM with Large Local Document Databases AI Discussions ai-discussions	8	6661	January 28, 2025
docAnalyzer - chat with large PDF dataset AI Discussions ai-discussions , careers , project	4	277	February 9, 2024