LLM to Classify User Utterance to Intent and Evaluate

Ganesh_Babu · July 26, 2024, 4:18pm

Enhancing User Utterance Classification with LLMs

I am currently working on a project that aims to leverage large language models (LLMs) to classify user utterances into corresponding intents. The dataset I am working with consists of user utterances related to car insurance. The main objective is to group and classify these utterances into specific clusters and predict the intent behind each cluster.

For example, in the context of car insurance, user intents might include:

Adding an additional driver
Requesting an evaluation
Updating an evaluation
Filing a claim
Inquiring about policy details

Due to client data privacy concerns, I am unable to use external LLM APIs. Therefore, I am running everything locally using Ollama.

Questions to the Community:

Project Execution After Preprocessing:

What is the best approach for clustering after preprocessing the data?
Which clustering mechanisms and embedding techniques would you recommend?
Which LLMs would be best suited for this task?

Performance Evaluation of the Classification Model:

How can I effectively evaluate the performance of the classification model?

Evaluating Intent Prediction from LLM:

What methods or metrics can I use to assess the accuracy of intent prediction by the LLM?

Identifying the Number of Clusters:

Since I do not know the number of clusters beforehand, what is the best mechanism to identify the optimal number of clusters?

Local Machine and Cloud Options:

I am currently using Anaconda Jupyter Lab on a local machine with 16GB RAM and a default Windows GPU. Is this setup sufficient, or should I consider using other machines or cloud options like SageMaker or Vertex AI?

Topic		Replies	Views
Week 1: Pretraining Large Language Models Generative AI with Large Language Models ai-discussions , large-language-model , llm	1	40	November 17, 2024
How to evaluate LLMS for labeling use case AI Discussions ai-discussions , langchain	1	27	July 17, 2024
Benchmarking accuracy of various large language models AI Discussions ai-discussions	2	55	August 7, 2023
Can Large Language Models Replace Data Analysts? AI Discussions ai-discussions	2	214	April 18, 2025
Seeking Collaborators for Innovative GenAI Projects AI Discussions ai-discussions , project	4	140	July 30, 2024

LLM to Classify User Utterance to Intent and Evaluate

Related topics