Collecting Custom dataset for fine tuning an open source LLM

Hello,

I’m interested in fine-tuning an open-source model like Falcon with a custom dataset. Are you aware of any tools that I could use to create my own dataset for this purpose?

Thanks,
Saeed