Seeking advice on lightweight Language Models for offline application development

Hey community,

I’m working on a project to build a language-model-based app that can be installed on both Windows and macOS. The goal is for it to run completely offline, without needing an internet connection.

Ideally, after installation, the app should run without requiring any extra dependencies. It’s meant to work on regular laptops from the past few years, so think 9th Gen Intel® CPUs or newer, 8GB of RAM or more, and no need for a dedicated GPU.

The app should be able to read a few files (up to 5) in formats like txt, pdf, ppt, and xlsx. It doesn’t need to hold long conversations with users, generate images, or write code, but it should understand the content of the files it reads.

Given these requirements, I’m wondering if using smaller language models (with fewer parameters) could be a good option. If yes, what could be the best choices?

I’d really appreciate any insights you can share. Thanks in advance!

Hello @zhucebox,

take a look at models in the GGUF format; they run on plain CPUs (via llama.cpp and its bindings) and the quantized files typically take around 5-10 GB of storage. I’m not entirely sure about the RAM requirements, but 8 GB might be a bit tight for the larger variants. GGUF builds are available for the entire LLaMA family, and I assume for other models as well. There is a trade-off: quantization costs some accuracy. You can also fine-tune your own LLM and then export it to GGUF.
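To sanity-check whether 8 GB of RAM is enough, you can estimate the weight footprint from the parameter count and the quantization level (this is a rough back-of-the-envelope formula I'm assuming, counting weights only; the KV cache and runtime overhead come on top):

```python
def gguf_size_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB.

    n_params: number of model parameters (e.g. 7e9 for a 7B model)
    bits_per_weight: effective bits per weight of the quantization scheme
    Weights only -- excludes KV cache and inference overhead.
    """
    return n_params * bits_per_weight / 8 / 2**30

# A 7B model at ~4.5 effective bits/weight (typical of 4-bit GGUF quants):
print(round(gguf_size_gib(7e9, 4.5), 1))  # roughly 3.7 GiB
```

By this estimate a quantized 7B model leaves a few GB of headroom on an 8 GB laptop, while 13B-class models would be uncomfortably tight once the OS and KV cache are accounted for, so 7B or smaller seems like the realistic ceiling for your target hardware.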

I have not used the smaller models from other companies, so unfortunately I cannot suggest anything else.

Best