Hello,
I am working on a RAG system. The system will uses excel files to do questionnaire. I want to work with excel files.
A .xlsx file contains:
- multiple sheets
- each sheet contains multiple tables
- tables can be nested (multi-cell column names)
I have tried working with pandas, openpyxl, but couldn’t fetch tables properly.
What can I do to parse such files?