Try filtering complex metadata from the document using langchain_community.vectorstores.utils.filter_complex_metadata

mrqasimasif · May 16, 2024, 4:40am

Hi I am running this code from the provided notebook.

documents =
for element in elements:
metadata = element.metadata.to_dict()
del metadata[“languages”]
metadata[“source”] = metadata[“filename”]
documents.append(Document(page_content=element.text, metadata=metadata))

embeddings = OpenAIEmbeddings()
vectorstore = Chroma.from_documents(documents, embeddings)

Running the vectorstore cell gives out the following error,
I am unsure what will be the accurate solution to it.

Error:
ValueError: Expected metadata value to be a str, int, float or bool, got [{‘x’: 0, ‘y’: 0, ‘w’: 1, ‘h’: 1, ‘content’: ‘NAVER CLOVA’}, {‘x’: 1, ‘y’: 0, ‘w’: 1, ‘h’: 1, ‘content’: ‘2NAVER Search’}, {‘x’: 2, ‘y’: 0, ‘w’: 1, ‘h’: 1, ‘content’: ‘3SNAVER AI Lal’}] which is a <class ‘list’>

Try filtering complex metadata from the document using langchain_community.vectorstores.utils.filter_complex_metadata.

Deepti_Prasad · May 17, 2024, 1:39am

Are you really using all the necessary files for your metadata yo be as provided by the course as the course mentions issue being on how you recalled your metadata.

pkoloveas · May 21, 2024, 9:02am

I ran into the same issue, which I solved like this:

First, import the function mentioned in the error:

from langchain_community.vectorstores.utils import filter_complex_metadata

Then change this line:

vectorstore = Chroma.from_documents(documents, embeddings)

to this:

vectorstore = Chroma.from_documents(filter_complex_metadata(documents), embeddings)

pbhadani · May 22, 2024, 9:05pm

thanks for sharing on how to wrap this up with “filter_complex_metadata”.
But now how to use this as filter option in vector_store.as_retriever(‘filter’: {‘source_name’:ABC}}).
Source_name is a list here.

rdyson · May 28, 2024, 1:12pm

Thanks! This worked for me using the hosted notebook.

jie2 · December 3, 2024, 8:03am

oh! its so good! thanks your solution!

Topic		Replies	Views
Error in Second Last Lecture [ Build your own RAG Bot] Preprocessing Unstructured Data 4 LLM Applications	0	11	March 6, 2025
? on using Metadata? LangChain for LLM Application Development	0	96	August 11, 2023
Error running L4-QnA cell number 5 LangChain for LLM Application Development	8	213	November 23, 2023
ValueError Self query retriever with Vector Store type <class 'langchain_community.vectorstores.chroma.Chroma'> not supported LangChain: Chat with Your Data	3	723	May 26, 2024
ChromaDB issue in Vectorstores and Embedding LangChain: Chat with Your Data	7	1081	October 24, 2023

Try filtering complex metadata from the document using langchain_community.vectorstores.utils.filter_complex_metadata

Related topics