Running Open Source LLMs Like Llama 2 on GCP

Hi, does anybody have experience running any of these open source models on the major cloud providers such as Azure, GCP, or AWS?

I know they provide some of these models in a catalog, but I'm wondering what the general workflow would be to run, for example, Llama 2 on Google Cloud.

I think there is a big gap in complexity between running one of these models locally or in Google Colab and loading a model onto a VM and connecting to it.

I have found several guides using, for example, runpod.io, but not many for GCP. I would really appreciate some guidance on this topic.
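To make the question concrete, here is the rough VM-based workflow I have in mind. This is only a sketch: the machine type, GPU, zone, and image family are illustrative choices (I have not verified them against my project's quotas), and the serving step would still need a script or server of some kind on the VM.

```shell
# Provision a GPU VM from a Deep Learning VM image
# (machine type, accelerator, zone, and image family are illustrative)
gcloud compute instances create llama2-vm \
  --zone=us-central1-a \
  --machine-type=n1-standard-8 \
  --accelerator=type=nvidia-tesla-t4,count=1 \
  --image-family=pytorch-latest-gpu \
  --image-project=deeplearning-platform-release \
  --boot-disk-size=200GB \
  --maintenance-policy=TERMINATE

# SSH into the VM to download weights and start an inference server
# (e.g. transformers + some HTTP wrapper, assumed to listen on port 8080)
gcloud compute ssh llama2-vm --zone=us-central1-a

# From my laptop: tunnel the server port instead of exposing it publicly
gcloud compute ssh llama2-vm --zone=us-central1-a -- -L 8080:localhost:8080
```

Is this roughly the right shape, or is there a more idiomatic GCP path (Vertex AI, GKE, etc.) that people actually use in practice?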