How CodeLLMs are trained

Hello

  1. How do we train a code-generating model? Is it trained purely autoregressively, or are there specific techniques for code? Suppose I have an LLM: how do I fine-tune it to generate code? Do I still use next-token prediction, or something else?

  2. How does the model generalize? It generates code based on the question we ask, but it is a model that picks the next token by probability, so there is a chance the syntax becomes slightly incorrect by selecting a wrong token. How does the model overcome syntax, runtime, and other errors?
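On question 1, the standard approach is indeed plain next-token prediction (a causal language-modeling loss) applied to code. A minimal pure-Python sketch of how one code sample becomes many training targets — the token list is illustrative; a real pipeline uses a subword tokenizer and a Transformer (e.g. via HuggingFace's `Trainer`):

```python
# Sketch: framing code fine-tuning as next-token prediction.
# One code sample yields many (context -> next token) training examples;
# the loss is ordinary cross-entropy over these targets, the same
# objective as plain language-model pretraining.

def make_next_token_pairs(tokens):
    """Each prefix of the sequence predicts the token that follows it."""
    return [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]

# Illustrative hand-split tokens; a real model uses a subword tokenizer.
code_tokens = ["def", "add", "(", "a", ",", "b", ")", ":", "return", "a", "+", "b"]
pairs = make_next_token_pairs(code_tokens)

print(pairs[0])   # (['def'], 'add')
print(pairs[7])   # (['def', 'add', '(', 'a', ',', 'b', ')', ':'], 'return')
```

Instruction-style fine-tuning for code works the same way: the prompt ("write a function that adds two numbers") is concatenated with the solution, and the loss is usually computed only on the solution tokens.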
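On question 2, part of the answer is that a model trained on enormous amounts of well-formed code rarely breaks syntax; on top of that, a common mitigation is to sample several candidates and filter them with a parser, compiler, or test suite. A minimal sketch using Python's `ast` module — the candidate strings here are stand-ins for model outputs:

```python
# Sketch: reject-sampling with a cheap syntax check. Generate several
# candidate completions, keep the first one that actually parses.
import ast

def first_syntactically_valid(candidates):
    """Return the first candidate that Python's parser accepts, else None."""
    for src in candidates:
        try:
            ast.parse(src)  # raises SyntaxError on malformed code
            return src
        except SyntaxError:
            continue
    return None

samples = [
    "def f(x) return x + 1",        # malformed: missing colon
    "def f(x):\n    return x + 1",  # well-formed
]
print(first_syntactically_valid(samples))  # the second, well-formed candidate
```

Runtime and logic errors need stronger signals than parsing — e.g. executing the code against unit tests and keeping only passing samples — which is how several published code-generation systems rank their candidates.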

Have you searched for academic papers on the topic? This is state-of-the-art work, and there is a lot of commercial activity (OpenAI, Meta, Google, etc.).

I'm not sure to what extent their methods are proprietary or public domain.

I suspect a lot of the keys to this technology are not widely discussed in public.
