Hey Guys,
I wanted to know more about when I should prefer an LLM API over a custom model, which could either be trained from scratch or be a pre-trained model that I fine-tune. For instance, I can imagine several reasons why I might prefer a custom model:
- One of the reasons has already been highlighted in this thread. It would be great if you could share your opinions on it as well.
- Cost of inference: if I use an LLM API, I have to pay for every call, whereas a custom model would be much smaller and cheaper to run in terms of compute. Additionally, since my org would own the custom model, there would be no middleman such as OpenAI (see the rough break-even sketch after this list).
- Data security: using an LLM API forces me to expose my data to the owner of the LLM, and data breaches are not exactly rare these days.
- Inference time: a large model like ChatGPT will undoubtedly take longer per request than a custom model trained and optimised for a single application.
- I am not sure about this one, but I believe a single-purpose custom model can also outperform a general-purpose LLM on its specific task, if the aim is to squeeze out every last drop of performance.
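To make the cost point concrete, here is a minimal back-of-the-envelope sketch comparing a metered API against self-hosting. All numbers (requests per month, tokens per request, per-token price, GPU-hour price, overhead) are hypothetical placeholders, not real pricing:

```python
# Hypothetical break-even calculation: pay-per-token LLM API vs self-hosted custom model.
# Substitute your own pricing and traffic estimates; these figures are placeholders.

def monthly_api_cost(requests_per_month: float,
                     tokens_per_request: float,
                     price_per_1k_tokens: float) -> float:
    """Cost of serving all traffic through a metered LLM API."""
    return requests_per_month * tokens_per_request / 1000 * price_per_1k_tokens

def monthly_hosting_cost(gpu_hours_per_month: float,
                         price_per_gpu_hour: float,
                         fixed_overhead: float = 0.0) -> float:
    """Cost of serving the same traffic on your own (smaller) fine-tuned model."""
    return gpu_hours_per_month * price_per_gpu_hour + fixed_overhead

if __name__ == "__main__":
    # Example with made-up numbers: 1M requests/month, ~500 tokens each.
    api = monthly_api_cost(1_000_000, 500, price_per_1k_tokens=0.002)
    own = monthly_hosting_cost(gpu_hours_per_month=720, price_per_gpu_hour=1.50,
                               fixed_overhead=500.0)
    print(f"API: ${api:,.0f}/month  vs  self-hosted: ${own:,.0f}/month")
```

Depending on traffic volume and model size, either side of this comparison can win, which is exactly why I think the answer is application-specific.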
Cheers,
Elemento
Interesting question. I think it depends on the distinction between a custom model (an LLM trained on a more specific dataset?) and a general model such as GPT. Your question would also extend to comparing GPT prompting versus fine-tuning. It would be interesting to compare the performance of the different approaches on labelled Q&A pairs with some metric (such as BLEU score).
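For example, a minimal sketch of that comparison, assuming sacrebleu is installed and using placeholder functions for the two systems being compared:

```python
# Score two systems (an LLM API vs a fine-tuned model) on the same labelled
# Q&A pairs with corpus BLEU. The two `answer_with_*` functions are
# placeholders for your actual API call and local model.
import sacrebleu  # pip install sacrebleu

def answer_with_api(question: str) -> str:
    raise NotImplementedError("call your hosted LLM API here")

def answer_with_custom_model(question: str) -> str:
    raise NotImplementedError("run your fine-tuned model here")

def bleu_for(system, qa_pairs):
    """qa_pairs: list of (question, reference_answer) tuples."""
    hypotheses = [system(q) for q, _ in qa_pairs]
    references = [[ref for _, ref in qa_pairs]]  # sacrebleu expects a list of reference streams
    return sacrebleu.corpus_bleu(hypotheses, references).score

# Usage, with your own labelled data:
# qa_pairs = [("What is the capital of France?", "Paris is the capital of France."), ...]
# print("API BLEU:", bleu_for(answer_with_api, qa_pairs))
# print("Custom model BLEU:", bleu_for(answer_with_custom_model, qa_pairs))
```

BLEU is only a rough proxy for answer quality, but the same harness works with any metric you prefer.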
Hey @takashisendo,
Thanks for the quick response.
The thing to note here is that it's not just about performance. Let's say I obtain a 2-3% boost in performance by using an LLM API instead of a fine-tuned, pre-trained model. Will that 2-3% boost make up for the other cons I mentioned above? I can see a lot of applications being built on the ChatGPT API nowadays, but does that count towards Responsible AI?
Cheers,
Elemento
You are right. Speed is easier to measure, but performance in the sense of answer quality, and the associated cost, are said to be very different between the GPT-3.5 API and the GPT-4 API (according to The Batch, May 10, 2023). Application developers should think very carefully; there is no simple answer.
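Since speed is the easy part, a small sketch of how one might time the two options on the same prompts; the `generate` callables are placeholders for your API client and your local model:

```python
# Minimal latency comparison for any `generate(prompt) -> str` callable.
import time
import statistics

def measure_latency(generate, prompts, warmup=1):
    """Return per-request latencies in seconds for a list of prompts."""
    for p in prompts[:warmup]:          # warm-up calls are not timed
        generate(p)
    timings = []
    for p in prompts:
        start = time.perf_counter()
        generate(p)
        timings.append(time.perf_counter() - start)
    return timings

# Usage:
# api_times = measure_latency(call_llm_api, test_prompts)
# local_times = measure_latency(run_custom_model, test_prompts)
# print("API median:", statistics.median(api_times), "s")
# print("Local median:", statistics.median(local_times), "s")
```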
Hey @takashisendo,
Thanks a lot for your inputs. I agree with what you said: there is no simple answer to this question, and I guess the answer varies a lot from one application to another.
Cheers,
Elemento