RAG pipeline for 10K reports

Hi,
I am working on a project to build a RAG application to chat with multiple 10K reports of different companies ranging from 50 to 100 documents, can you suggest me the best possible way to build it from data pre-processing, converting to embeddings, storing, and retrieving.

3 Likes