Can someone elaborate on the difference between a token, a weight, and a parameter in an LLM?
Tokens: An LLM receives its input as text. This text is passed through a tokenizer, which converts it into tokens. Every tokenizer is different, but in general a token is a word or a piece of a word (a subword).
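A quick sketch, assuming you have the Hugging Face transformers library installed, using GPT-2's tokenizer as an example:

```python
from transformers import AutoTokenizer

# GPT-2's tokenizer as an example; every model ships with its own.
tokenizer = AutoTokenizer.from_pretrained("gpt2")

# Common words tend to stay whole, rarer words get split into subwords.
print(tokenizer.tokenize("Tokenization splits rare words into subwords."))
# e.g. ['Token', 'ization', 'Ġsplits', ...]  (Ġ marks a leading space;
# the exact splits depend on the tokenizer)
```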
Weights: An LLM, like any other ML model, is essentially a set of matrices, and the values in the cells of those matrices are called 'weights'. Together, the weights encode a statistical representation of the language the LLM was trained on.
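A toy illustration of what 'a set of matrices' means in practice, with made-up sizes (NumPy):

```python
import numpy as np

# One layer of a neural network: a weight matrix plus a bias vector.
W = np.random.randn(4, 3)   # 4x3 matrix -> 12 weights
b = np.random.randn(4)      # 4 bias values
x = np.random.randn(3)      # an input vector (e.g. a token embedding)

y = W @ x + b               # the layer's output; training adjusts W and b
```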
Parameters: The LLM's behaviour can also be steered at inference time by 'generation parameters'. Temperature is one such parameter: at a low temperature the model is very 'precise' (it sticks to the most likely tokens), while a high temperature produces more random (creative) responses. Concretely, the temperature divides the logits before softmax is applied, as in the sketch below.
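A minimal sketch of how temperature reshapes the output distribution, with made-up logits (NumPy):

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())   # subtract max for numerical stability
    return e / e.sum()

logits = np.array([2.0, 1.0, 0.5])   # made-up scores for three candidate tokens

print(softmax(logits / 0.2))   # low temperature -> almost all mass on the top token
print(softmax(logits / 1.5))   # high temperature -> flatter, more random sampling
```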
Hope this sheds some light!
Parameters depend on the model's architecture. Strictly speaking, the learned parameters are the weights and biases; things like the choice of activation function and the learning rate are hyperparameters, fixed before training rather than learned from data.
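For instance, in PyTorch the learned parameters are exactly what `model.parameters()` returns. A minimal sketch with a made-up two-layer network:

```python
import torch.nn as nn

# A tiny two-layer network; its learned parameters are the weights and biases.
model = nn.Sequential(nn.Linear(10, 20), nn.ReLU(), nn.Linear(20, 5))

n_params = sum(p.numel() for p in model.parameters())
print(n_params)   # 10*20 + 20 + 20*5 + 5 = 325 (ReLU has no parameters)
```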
The text/images sent as input to the model are broken down into smaller pre-defined units called tokens through tokenisation. Tokenisation lets the model learn the inherent patterns and relationships in the data, which improves its accuracy and precision.
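A small sketch (again assuming the Hugging Face transformers library) showing those pre-defined units and the integer IDs the model actually consumes:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")

# The model never sees raw text, only integer IDs from a fixed vocabulary.
ids = tokenizer.encode("Hello world")
print(ids)                                    # e.g. [15496, 995]
print(tokenizer.convert_ids_to_tokens(ids))   # e.g. ['Hello', 'Ġworld']
```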