I want to make image to text my own model

I want to train my model to reach Gemini-level performance. Once I receive a response from the model, I will generate an image using Gemini, or create a new text-to-image model with NanoBanana-level efficiency.

1 Like

Do you have the resources and data to achieve a Gemini level performance? Don’t forget that there is a team of computer scientists doing that at google on full time jobs…

1 Like

Yes, I know there are jobs available, but I want to develop my own AI SaaS product. That’s why I want to build this—because I can’t afford the high prices of Gemini, Gemini Nano, BANA, and OpenAI. If you have any ideas related to this, please let me know. It would be a big help, and if it works, I’m open to sharing equity with you.

1 Like