After watching these short courses on extracting information from text using LLMs and RAG, I had an idea for my company: build a database to store all our important documents, then build a chatbot that can answer any questions my colleagues ask about them. I want to know whether this is advisable, since these documents are confidential and we don't want the LLM provider using them as training data and potentially leaking them to the public.
Hey Nana. It's a good idea; I also want to implement something like this. Did you get an answer to this question? I am also curious…
Hello @A_Nandhini, you basically replace ChatOpenAI with a local LLM, which you can download using Ollama.
For embeddings, I used "nomic-embed-text":

from langchain_community.embeddings import OllamaEmbeddings

def get_embedding_function():
    embeddings = OllamaEmbeddings(model="nomic-embed-text", show_progress=True)
    return embeddings
For queries, I used phi3:

from langchain_community.llms import Ollama

model = Ollama(model="phi3")
Both are local models you can download with Ollama, so your documents never leave your machine.
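To show how the two pieces fit together, here is a minimal sketch of the answer step. It assumes Ollama is running locally with the "phi3" and "nomic-embed-text" models pulled, and that you have already indexed your documents into a Chroma store at a path like "chroma" (the path, the `k=4` retrieval count, and the helper names are my own choices, not from the course):

```python
def build_prompt(context_chunks, question):
    # Pure helper: join the retrieved chunks into one grounded prompt.
    context = "\n\n---\n\n".join(context_chunks)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )

def answer_question(question, db_path="chroma"):
    # Imports kept inside the function so build_prompt has no dependencies.
    from langchain_community.embeddings import OllamaEmbeddings
    from langchain_community.llms import Ollama
    from langchain_community.vectorstores import Chroma

    embeddings = OllamaEmbeddings(model="nomic-embed-text")
    db = Chroma(persist_directory=db_path, embedding_function=embeddings)

    # Retrieve the most relevant chunks, then ask the local model.
    docs = db.similarity_search(question, k=4)
    prompt = build_prompt([d.page_content for d in docs], question)
    model = Ollama(model="phi3")  # runs locally; nothing is sent to a cloud API
    return model.invoke(prompt)
```

Since both the embedding model and the chat model run on your own hardware, the confidentiality concern in the original question goes away: no document text is sent to an external provider.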