LLMs don't seem great with tabular data when it comes to RAG. How can I fix this, folks?

Silverback · January 30, 2024, 11:19am

How do I work with tabular data when it comes to RAG? The traditional method has failed to give me correct answers from a very basic table.

marconi · January 30, 2024, 11:33am

Courtesy of CrewAI Assistant, GPT-4:

To address your query about working with tabular data in the context of Retrieval-Augmented Generation (RAG), it’s important to acknowledge that RAG, primarily used for text data, might struggle with tabular data due to its format and structure. Here’s a plan to tackle this challenge effectively:

1. Agent Setup: We’ll define agents specialized in handling and interpreting tabular data. For example, a ‘Data Analysis Expert’ agent can be tasked with understanding and processing table structures.
2. Task Definition: We’ll create tasks specifically for handling tabular data. These tasks will instruct the agent to interpret the table, extract relevant information, and reformat it if necessary for better understanding.
3. Tool Integration: If necessary, we can integrate tools that are specialized in tabular data handling and analysis. These tools can assist the agents in interpreting and extracting meaningful information from the tables.
4. Custom Code or Tool Development: In case existing tools are insufficient, we can consider developing custom code or a tool specifically for handling your type of tabular data. This could involve creating scripts that reformat the table or extract key data points for the agent to process.
5. Testing and Iteration: Once the agents and tasks are set up, we’ll need to test them with your specific data and iteratively refine their capabilities to ensure they can handle the data correctly and provide accurate answers.
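
As a rough illustration of the "scripts that reformat the table" idea, here is a minimal sketch that turns each table row into a self-contained line of text, so a retrieved chunk still carries its column names. It assumes the table is available as CSV; the function name `table_to_row_chunks` and the sample data are made up for this example and are not part of CrewAI or any RAG library.

```python
import csv
import io


def table_to_row_chunks(csv_text: str) -> list[str]:
    """Turn each CSV row into one text chunk that repeats the column names."""
    reader = csv.DictReader(io.StringIO(csv_text))
    chunks = []
    for i, row in enumerate(reader, start=1):
        # Pair every value with its header so the chunk makes sense on its own.
        fields = "; ".join(
            f"{col.strip()} = {(val or '').strip()}" for col, val in row.items()
        )
        chunks.append(f"Row {i}: {fields}")
    return chunks


# Hypothetical sample table, used only for illustration.
sample = "product,region,units_sold\nWidget,EU,120\nGadget,US,75\n"
chunks = table_to_row_chunks(sample)
for chunk in chunks:
    print(chunk)
```

Each resulting string could then be embedded and indexed as its own chunk instead of splitting the raw table at arbitrary character boundaries. Whether this helps will depend on the table and the questions being asked.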

Hope this helps.

L_D1 · June 9, 2024, 11:10pm

Hey @marconi, thanks for your answer. In my tests it seems that LLMs really do struggle with tabular data, as you said. Even with the Agent Setup technique you suggested, it is difficult to make an agent understand or interpret tabular data.

By the way, was your message generated by CrewAI?

TMosh · June 10, 2024, 12:31am

Any time you see a message that has five numbered paragraphs with bold titles, it’s an indicator it came from a chat tool.

Nevermnd · June 10, 2024, 2:56am

I haven’t worked with RAG, but I wonder if one might get better results if the data were translated to an explicitly structured/typed format (e.g. JSON) before being fed in.
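
A minimal sketch of that idea, assuming the table starts out as CSV: each row becomes one JSON object with numeric values converted to numbers. The helper names (`coerce`, `table_to_json_records`) and the sample data are hypothetical, and this has not been validated against any particular RAG pipeline.

```python
import csv
import io
import json


def coerce(value: str):
    """Best-effort typing: try int, then float, otherwise keep the string."""
    text = value.strip()
    for cast in (int, float):
        try:
            return cast(text)
        except ValueError:
            continue
    return text


def table_to_json_records(csv_text: str) -> list[str]:
    """Serialize each CSV row as a JSON object keyed by column name."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        json.dumps({col: coerce(val) for col, val in row.items()})
        for row in reader
    ]


# Hypothetical sample table, used only for illustration.
sample = (
    "product,region,units_sold,unit_price\n"
    "Widget,EU,120,9.5\n"
    "Gadget,US,75,20\n"
)
records = table_to_json_records(sample)
for record in records:
    print(record)
```

One JSON object per row keeps every value attached to its column name, which is the "explicitly structured" property suggested above; the type coercion is a simple heuristic and would need adjusting for dates, currencies, or missing cells.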
