Hello,
Can anyone recommend the best way to chunk HTML?.
I think that maybe without losing metadata (like titles), as happens when converting to plain text. Could a method similar to using Markdown tags, as mentioned in the “LangChain: Chat with Your Data” course, be an effective solution?
Any recommendations would be greatly appreciated.
Thank you!