Can LLM like Code Llama learn project codebase?

abitrolly · November 22, 2023, 6:46am

GitLab had built merge request summary system based on AI. That system seems to take only diff and comments when producing output. The rest is devised from internal AI weights. In other words - NN doesn’t read the whole source code of the project to get deeper understanding of changes made to it.

So the question is, is it possible for make LLM read and learn the whole project codebase, and give exact 0 temperature answers about what new changes to the code are doing?

Is it possible to update LLM knowledge of the codebase when new commits are merged?

Is it possible for LLM to warn about potential bad practices and problems with the diff? Like when the code changed behavior in parts of the codebase that are invisible in the diff.

balaji.ambresh · November 22, 2023, 9:09am

Wouldn’t this require knowledge of external project dependencies as well?

abitrolly · November 23, 2023, 8:37pm

Dependencies usually have well documented API, and that should be enough to understand what is the expected behavior from calling them.

balaji.ambresh · November 24, 2023, 7:11am

What are the results with your suggested approach?

abitrolly · November 24, 2023, 9:41am

I don’t have sufficient understanding of LLM to get to the results.

balaji.ambresh · November 24, 2023, 1:00pm

How about mentioning your idea on the gitlab thread?

abitrolly · November 25, 2023, 1:25pm

The thread mentions that GitLab uses PaLM 2 (text-bison) model from Google’s Vertex AI. VertexAI docs say that models can be “tuned” for specific use cases using input-output examples. But I don’t understand how to represent 1Gb of source code as the input-output dataset.

balaji.ambresh · November 25, 2023, 4:29pm

Please see this short course.

abitrolly · November 26, 2023, 8:02am

Thanks. I actually started it two months ago, but because the learning platform videos don’t play in the browser I couldn’t progress far.

balaji.ambresh · November 26, 2023, 8:22am

Does this help?

abitrolly · November 26, 2023, 11:00am

I download videos and then open them with vlc, but then I lose the ability to watch subtitles.

balaji.ambresh · November 26, 2023, 11:17am

Sorry I don’t know how to download subtitles for the video. Can’t you save the text from the transcript section to your machine as a text file and use it?

abitrolly · November 26, 2023, 11:27am

I can right click, inspect element, expand <video> tag, then click the link to .vtt file to download it next to video file. Then vlc opens video with subtitles. But that’s an underworld of user experience. If DLAI platform was open source, I would already fix it.

balaji.ambresh · November 26, 2023, 3:10pm

I’ve notified the staff regarding your ask to provide a link / button to download subtitles. Let’s wait and see.

abitrolly · November 27, 2023, 5:55am

The better fix is to just encode the content as AV1 videos. Less bandwidth, and improved UX. I’ve sent my resume to Careers - DeepLearning.AI to fastlane the fix.

Topic		Replies	Views
Some questions about LLM Training AI Discussions ai-discussions	3	173	March 6, 2024
Seeking advice on open-source llm selection AI Discussions ai-discussions , llm , project	1	210	April 17, 2024
Is there any way to feed my c# code base in to the llaama model AI Discussions ai-discussions , langchain	0	109	February 2, 2024
Programming Assignment: Solving Versioning and Dependency conflicts with an LLM Team Software Engineering with AI week-3 , coursera-platform	63	617	November 18, 2024
Llama 3.2 finetuning and evaluations? Introducing Multimodal Llama 3.2	6	103	October 18, 2024

Can LLM like Code Llama learn project codebase?

Related topics