Instead of a large model trained on massive amounts of data, I wanted a small, proprietary model (less than ten billion) trained only on the rules and paradigms of a programming language that already had a web site with instructions and code files for related paradigms. Through this learning task, I hope to learn how to help companies train their own specialized models with vertical professional depth.
Is there anything wrong with my idea? I hope to get your guidance.