LLM Security/Risk Concerns

mansourshams · July 8, 2023, 8:03pm

Hello,

Thank you for this great community. There are some security/risk concerns with respect to the LLM deployments as the information provided by the user leaves the boundaries of the company. This can hinder/stop the deployment of LLM systems. Is there a solution to this? Is there a localized version that can reside solely within the boundaries of the companies and that can have a good performance as ChatGPT?

Thank you.

Juan_Olano · July 8, 2023, 10:30pm

Hi @mansourshams , your concerns are valid: the data leaves your premises and at that point you lose control.

There are alternative.

There are multiple models (Bloom, Falcon, Llama, to name a few) that can be hosted inside your premises. These are smaller models or base models, so don’t expect the same performance that you get from the get go with ChatGPT or the GPT family of products.

These models need to be fine-tune to the tasks you need them for. As you may have seen or will see in the new LLM course, this can take different levels of resources and expertise.

Can you get a model to perform as well as ChatGPT? In my opinion, very hard. The good news is that OpenAI has announced a “personal assistant” coming up in the next few months and, as I remember reading, this product may tackle your privacy concerns. So lets wait and see.

mansourshams · July 10, 2023, 1:49am

Hello Juan, Thank you for your reply. I have a large number/amount of technical documents to train the model with, however, I would like the model to have a good amount of literary knowledge (say at the level of a college graduate) in order to understand/discover in-between-lines. Any suggestions for that? I have seen some Tensorflow implementations that download large set-up files (up to 42GB) and I guess they are trying to mimic ChatGpt. Also, which one of these localized models have a Python interface?

Thank you

Juan_Olano · July 10, 2023, 2:16pm

Have you tried looking for trainable models in Huggingface? I think that this could be the best source of information for your project.

In Huggingface you’ll find a good number of models that can be downloaded, along with the libraries to interact with them in python.

My experience, besides GPT and Anthropic, is limited to Bloom and Falcon, and now to the model introduced in this course: FLAN. But there are many more. One that has been highlighted is FALCON. Another set of models are those by MOSAIC.

Topic		Replies	Views
Map a Problem to an LLM Model? Generative AI with Large Language Models week-module-1	4	625	July 2, 2023
It will be very helpful if you use open source LLM like falcon-40b-instruct along side OpenAI LangChain for LLM Application Development	0	96	June 7, 2023
Week3 - I have just completed the course, excited to put my knowledge into practice! Generative AI with Large Language Models week-module-1	2	43	October 15, 2024
How do i create a custom LLM based on a pre-existing framework like GPT, Llama, Cohere? AI Discussions ai-discussions	4	446	August 13, 2024
Can I replace the GPT with a none OpenAI something open source Building and Evaluating Advanced RAG Applications	4	330	January 8, 2024

LLM Security/Risk Concerns

Related topics