How can we change the vocab_size in bert model arguments?

Arjun_Reddy · May 14, 2023, 12:08am

I have seen the following BERT configuration used for fine-tuning on a specific task with the Hugging Face library:

BertConfig {
 "attention_probs_dropout_prob": 0.1,
 "hidden_act": "gelu",
 "hidden_dropout_prob": 0.1,
 "hidden_size": 768,
 "initializer_range": 0.02,
 "intermediate_size": 3072,
 "layer_norm_eps": 1e-12,
 "max_position_embeddings": 512,
 "model_type": "bert",
 "num_attention_heads": 12,
 "num_hidden_layers": 12,
 "pad_token_id": 0,
 "type_vocab_size": 2,
 "vocab_size": 30522
}

So my question is: if the BERT model was pre-trained with a specific vocab_size, how can one change it? Changing vocab_size affects the embeddings (and therefore the meaning) of words that are not in the pre-trained vocabulary, right? How would the transformer model be able to capture this difference?

TMosh · May 14, 2023, 12:59am

A quick internet search yielded this discussion:

Arjun_Reddy · May 14, 2023, 1:43pm

Thanks for providing the information, but I am not able to draw a proper conclusion from it. It would be very helpful if we could establish a clear and crisp answer.
Thank you
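
To make the question concrete, here is a minimal sketch of what changing vocab_size means for the embedding table. It assumes the usual approach of keeping the pre-trained rows unchanged and appending freshly initialized rows for the new tokens, which then have to be learned during fine-tuning. The toy sizes and the helper names (make_embedding_table, resize_embedding_table) are made up for illustration; only the 0.02 standard deviation is taken from "initializer_range" in the config above. In the Hugging Face library the corresponding step is, as far as I know, tokenizer.add_tokens(...) followed by model.resize_token_embeddings(len(tokenizer)), but this sketch does not call the library.

```python
import random


def make_embedding_table(vocab_size, hidden_size, seed=0):
    """Stand-in for a pre-trained table: one row of hidden_size floats per token id."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 0.02) for _ in range(hidden_size)]
            for _ in range(vocab_size)]


def resize_embedding_table(table, new_vocab_size, seed=1):
    """Return a copy with new_vocab_size rows.

    Existing rows are copied unchanged, so known tokens keep their
    pre-trained vectors. Extra rows are randomly initialized (assumption:
    normal with std 0.02) and carry no meaning until they are trained.
    If new_vocab_size is smaller, the table is truncated.
    """
    hidden_size = len(table[0])
    rng = random.Random(seed)
    resized = [row[:] for row in table[:new_vocab_size]]
    while len(resized) < new_vocab_size:
        resized.append([rng.gauss(0.0, 0.02) for _ in range(hidden_size)])
    return resized


# Toy sizes for illustration; the config above uses 30522 and 768.
old_vocab_size, hidden_size = 10, 4
pretrained = make_embedding_table(old_vocab_size, hidden_size)
extended = resize_embedding_table(pretrained, old_vocab_size + 3)

print(len(extended), len(extended[0]))         # new table shape
print(extended[:old_vocab_size] == pretrained)  # old rows are preserved
```

The point of the sketch is that only the first layer's lookup table grows. The rest of the transformer is untouched, so words already in the vocabulary behave as before, while the new rows only become useful after fine-tuning on data that contains the new tokens.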
