I enrolled in this course because, like many, I want to make our ISO documentation a little more accessible. I’ve finished the course but haven’t yet set up a system to run against my own data due to the time investment required to make an actual MVP (rather than a POC in a notebook).
While reflecting on what I learned in the course and planning the MVP, it occurred to me that there are two places where the magic happens, and the main one is not the interaction with the LLM. The main magic is the vector store / search: a system that successfully associates text by semantic similarity. The second bit of magic is indeed the LLM’s ability to present the search results in a more human-like manner.
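To make concrete what I mean by the vector store being the main magic, here’s a toy sketch of semantic search. The vectors are made-up stand-ins for real embeddings (a real system would get them from an embedding model), but the ranking step is the same idea:

```python
import numpy as np

# Toy example: pretend these are embedding vectors for three document
# snippets and one query, as produced by some embedding model.
docs = {
    "access control policy": np.array([0.9, 0.1, 0.0]),
    "password rotation work instruction": np.array([0.5, 0.5, 0.2]),
    "office seating plan": np.array([0.0, 0.2, 0.9]),
}
query = np.array([0.85, 0.2, 0.05])  # e.g. "who may access the server room?"

def cosine(a, b):
    # Cosine similarity: how closely two vectors point the same way.
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Rank documents by similarity to the query vector.
ranked = sorted(docs, key=lambda name: cosine(docs[name], query), reverse=True)
print(ranked[0])  # → "access control policy"
```

All the “understanding” here happens in the embedding space; the LLM never sees anything except whatever ends up at the top of this ranking.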
As I continued to weigh the pros and cons of different approaches to the MVP, it also occurred to me that there are certain types of questions the system won’t be much help with. For example, with our ISO 27001 documentation as the data source, suppose I ask: “Provide examples where the ISO 27001 policies contradict the work instructions, or vice versa.” It seems unlikely I’ll get the response I’m looking for, because I’m requesting an analysis of all the data, whereas this system first finds documents similar to the request (which could be few or none) and then has the LLM present those results.
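The flow I’m describing could be sketched like this (the retrieval step is faked with naive keyword overlap rather than embeddings, and the function names are my own placeholders, not any real library). The comment in `build_prompt` is the crux of my worry:

```python
def retrieve(query: str, corpus: list[str], k: int = 2) -> list[str]:
    # Stand-in for vector search: crude keyword overlap instead of embeddings.
    def overlap(doc: str) -> int:
        return len(set(query.lower().split()) & set(doc.lower().split()))
    return sorted(corpus, key=overlap, reverse=True)[:k]

def build_prompt(query: str, passages: list[str]) -> str:
    # Only the k retrieved passages ever reach the LLM, which is why a
    # question that requires comparing *all* documents against each other
    # can't really be answered by this architecture.
    context = "\n".join(f"- {p}" for p in passages)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

corpus = [
    "Policy: passwords must be rotated every 90 days.",
    "Work instruction: set passwords to never expire for service accounts.",
    "Policy: visitors must sign in at reception.",
]
question = "How often must passwords be rotated?"
prompt = build_prompt(question, retrieve(question, corpus))
print(prompt)
```

For a pointed question like the one above this works, because the relevant passage is similar to the query. A “find all contradictions” request isn’t similar to any single passage, so retrieval has nothing good to hand the LLM.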
This makes me wonder whether there is any way to “extend” the LLM to include my own data, rather than doing a search and passing the results in as context. Could I instead train my own tiny LM, run the query above against that model, and submit its output in the context to GPT for a more accurate response? Or is that essentially what a vector store is already doing?
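To be clear about the pipeline I’m imagining, here’s a skeleton with both model calls stubbed out. `tiny_lm` and `gpt` are pure placeholders for a hypothetical fine-tuned local model and the hosted LLM; neither is a real API:

```python
def tiny_lm(query: str) -> str:
    # Imagined: a small model trained on our ISO 27001 corpus, able to
    # "reason over" all of it rather than over a few retrieved chunks.
    # Stubbed with a canned draft for illustration.
    return "Draft finding: policy A conflicts with work instruction B."

def gpt(prompt: str) -> str:
    # Imagined: the hosted LLM, used only to polish the local draft.
    # Also stubbed.
    return f"Polished answer based on: {prompt}"

def answer(query: str) -> str:
    # The pipeline I'm asking about: local model first, GPT second.
    draft = tiny_lm(query)
    return gpt(f"{query}\n\nDraft from local model:\n{draft}")

print(answer("Where do the policies contradict the work instructions?"))
```

Whether the `tiny_lm` step is actually achievable with fine-tuning, or whether it just collapses back into something a vector store already does, is exactly my question.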
I would really love to hear what you all think.