Llama 3.2 from Hugging Face in Google Colab

Hi,

I am trying to run the Llama 3.2 11B Vision model from Hugging Face in Google Colab. However, while downloading the checkpoints, Colab runs out of memory. I have also tried the same thing on Kaggle, but the issue persists. Is there a way (hopefully free :face_with_open_eyes_and_hand_over_mouth:) to achieve this?

Regards.


@abrar39 I haven't tried what you are attempting myself, but in Sharon's excellent short course she discusses at one point 'streaming' the model rather than trying to load it all at once.

That might work.
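
If it helps, here is a rough sketch of what that low-memory loading path looks like with Transformers. The model ID, the Mllama class, and the 4-bit settings are my assumptions, not something from the course:

```python
# Sketch: load Llama 3.2 11B Vision without materializing full-precision
# weights in RAM. device_map="auto" places layers on the GPU and offloads
# the rest to CPU/disk, and 4-bit quantization cuts the weight footprint
# roughly 4x versus fp16.
# (The meta-llama repo is gated; accept the license and log in with an HF token.)
import torch
from transformers import (
    AutoProcessor,
    BitsAndBytesConfig,
    MllamaForConditionalGeneration,
)

model_id = "meta-llama/Llama-3.2-11B-Vision-Instruct"  # assumed model ID

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = MllamaForConditionalGeneration.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",       # shard across GPU/CPU instead of one big load
    low_cpu_mem_usage=True,  # stream weights in rather than all at once
)
processor = AutoProcessor.from_pretrained(model_id)
```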

@abrar39

If you are using Kaggle, I suggest adding the model to your notebook directly on Kaggle instead of downloading it from Hugging Face.

Check out this notebook:
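
As a rough sketch of that route (the kagglehub handle below is a guess on my part; copy the exact one from the model's Kaggle page):

```python
# Sketch: pull the weights from Kaggle's model hub instead of Hugging Face.
# The handle string is an assumption; the real one is shown on the model page.
import kagglehub

path = kagglehub.model_download(
    "metaresearch/llama-3.2-vision/transformers/11b-vision-instruct"
)
print(path)  # local directory containing the checkpoint files

# Alternatively, if you attach the model through the notebook's "Add Input"
# panel, it appears read-only under /kaggle/input/ with no download at all.
```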

In addition, if you are trying to fine-tune, Unsloth is a good option. They have sample Colab notebooks you can use, though I haven't seen one for the 11B model. The 1B model has a Colab notebook that runs fast and is memory- and beginner-friendly.
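
For reference, the Unsloth path for the 1B model looks roughly like this (the repo name and LoRA settings here are illustrative, not copied from their notebook):

```python
# Sketch: memory-friendly fine-tuning setup for the 1B model with Unsloth.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Llama-3.2-1B-Instruct",  # assumed repo name
    max_seq_length=2048,
    load_in_4bit=True,  # 4-bit weights keep this well within a free T4
)

# Attach LoRA adapters so only a small fraction of weights is trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    lora_alpha=16,
)
```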


Thank you for the response. I have used the 1B version in Colab and it runs without problems. However, the generative ability of the 1B version is not very "interesting", to say the least. The Kaggle option seems more viable, so I shall use it.


Thank you. I shall definitely follow the guidelines in the course.

Try with an A100 GPU; it does not work with the default GPU.
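
The rough arithmetic backs this up: 11B parameters at 2 bytes each (fp16/bf16) is about 22 GB of weights alone, which already exceeds the 16 GB of the default T4, while an A100 (40 GB) fits them with room left for activations. With 4-bit quantization the weights drop to roughly 5.5 GB, which is why the quantized route can work on free tiers.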


This works for me. Thank you very much. :fist_left:
