Storing question forms in vector databases for better similarity scores

Nobu_Taka · February 6, 2024, 12:46pm

So I’m reading how each split segment is vectorized for storage in vector databases so that queries on those databases can be scored for similarity and then segments that are k most similar can be looked at but has anyone tried converting each of those segments into question form first using an LLM? Or using the LLM to come up with a question for that segment of text and then storing that alongside the original segment? Then similarity searches on those vector databases might come up with better matches?
Is this not worth doing because of the time and labor cost or there is not enough of a performance boost to warrant the work involved?

Topic		Replies	Views
Document splitting: Chunksize LangChain for LLM Application Development	0	97	July 6, 2023
Question Answering Stuff Documents LangChain for LLM Application Development	2	120	July 17, 2023
Embeddings, Vector DB, FAQs earch and ranking AI Discussions vector-database	4	134	June 7, 2024
How to extract arbitrary data and store them into a vector database, and a LLM can answer any questions based on the vector database AI Discussions ai-discussions	0	73	July 18, 2024
Similarity search fails to capture product numbers LangChain for LLM Application Development	0	98	July 16, 2023

Storing question forms in vector databases for better similarity scores

Related topics