I’m trying to build a RAG using llamaindex. I created 2 indices (one for the JSON files another for HTMLs and PDFs) when I create 2 query engines and pass it to an agent as tools. The model is not able to query the JSON and provide response. Most of the time the model uses implicit information to give a response and it is always wrong and sometimes it says the documents shared doesnt have the required data.
I am planning to pass the JSONs to the LLM without creating a vector index. Will it work?
Any suggestions on how the model can understand the query and retrieve the data from the JSON will be helpful.
Hi.
I did work with vectorized JSON files and LLMs, and it works fine. I don’t know how you are vectorizing the JSONs, but what I do is vectorize the content of the JSON, not the JSON directly: I extract the info I want to vectorize from the JSON, then I vectorize that info.
Hi @haroldc , I passed the json files to simpledirectoryreader and converted the docs to vector index. And I am using an open source LLM zephyr-7b-beta.
Instead of vectorizing, I tried using the JSONalyze query engine and it is better. But still the usage of prior knowledge of the LLM is still there even after prompting it NOT to use it.
I am actually trying to implement a chatbot in GCP using VertexAI Agent builder. Even I have got 1000 json files each has a different structure. I have converted the json files into pdf and passed them to text-bison@002 model. Iam not getting appropriate results. Inaccurate results. Could you please guide me ways on how to make the model understand the json data effectively if we have 1000 json files.